MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance Paper โข 2510.00499 โข Published Oct 1, 2025 โข 19
MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization Paper โข 2601.01554 โข Published 5 days ago โข 50
Running Featured 34 MOSS Transcribe Diarize ๐ข 34 Transcribe audio/video files with speaker identification
Running Featured 34 MOSS Transcribe Diarize ๐ข 34 Transcribe audio/video files with speaker identification
MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization Paper โข 2601.01554 โข Published 5 days ago โข 50
ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL Paper โข 2505.24875 โข Published May 30, 2025 โข 10
World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning Paper โข 2503.10480 โข Published Mar 13, 2025 โข 55
Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models Paper โข 2410.02416 โข Published Oct 3, 2024 โข 34
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper โข 2502.14786 โข Published Feb 20, 2025 โข 157