lmms-lab/RefCOCO
Viewer • Updated • 17.6k • 8.92k • 35
Feeling and building the multimodal intelligence.
ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning
Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling