Jina-VLM: Small Multilingual Vision Language Model Paper • 2512.04032 • Published 7 days ago • 12 • 4
MemMamba: Rethinking Memory Patterns in State Space Model Paper • 2510.03279 • Published Sep 28 • 72 • 3
Dynamic Chunking for End-to-End Hierarchical Sequence Modeling Paper • 2507.07955 • Published Jul 10 • 25 • 4
Energy-Based Transformers are Scalable Learners and Thinkers Paper • 2507.02092 • Published Jul 2 • 69 • 26