Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs Paper • 2603.16932 • Published 11 days ago • 56
CARES: Context-Aware Resolution Selector for VLMs Paper • 2510.19496 • Published Oct 22, 2025 • 9
NLE: Non-autoregressive LLM-based ASR by Transcript Editing Paper • 2603.08397 • Published 15 days ago • 21
Article Granite 4.0 1B Speech: Compact, Multilingual, and Built for the Edge 15 days ago • 12
Granite 3.1 Language Models Collection A series of language models with 128K context length, trained by IBM and released under the Apache 2.0 license. • 9 items • Updated 8 days ago • 69
Charting and Navigating Hugging Face's Model Atlas Paper • 2503.10633 • Published Mar 13, 2025 • 93
Advancing Speech Understanding in Speech-Aware Language Models with GRPO Paper • 2509.16990 • Published Sep 21, 2025 • 22
Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers Sep 11, 2025 • 185
Continuous Speech Synthesis using per-token Latent Diffusion Paper • 2410.16048 • Published Oct 21, 2024 • 30
Article Saving Memory Using Padding-Free Transformer Layers during Finetuning Jun 11, 2024 • 21