ObjEmbed: Towards Universal Multimodal Object Embeddings Paper • 2602.01753 • Published 3 days ago • 4
ObjEmbed: Towards Universal Multimodal Object Embeddings Paper • 2602.01753 • Published 3 days ago • 4
WeDetect: Fast Open-Vocabulary Object Detection as Retrieval Paper • 2512.12309 • Published Dec 13, 2025 • 3
IRG-MotionLLM: Interleaving Motion Generation, Assessment and Refinement for Text-to-Motion Generation Paper • 2512.10730 • Published Dec 11, 2025 • 3
IRG-MotionLLM: Interleaving Motion Generation, Assessment and Refinement for Text-to-Motion Generation Paper • 2512.10730 • Published Dec 11, 2025 • 3
ViSpeak Collection ViSpeak: Visual Instruction Feedback in Streaming Videos • 5 items • Updated Oct 29, 2025