Visual In-Context Learning for Large Vision-Language Models Paper • 2402.11574 • Published Feb 18, 2024
Accelerating Vision Diffusion Transformers with Skip Branches Paper • 2411.17616 • Published Nov 26, 2024
SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information Paper • 2409.14083 • Published Sep 21, 2024
EmoBench-M: Benchmarking Emotional Intelligence for Multimodal Large Language Models Paper • 2502.04424 • Published Feb 6 • 1
MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution Paper • 2403.17927 • Published Mar 26, 2024 • 1
KADEL: Knowledge-Aware Denoising Learning for Commit Message Generation Paper • 2401.08376 • Published Jan 16, 2024
Self-Rewarding Large Vision-Language Models for Optimizing Prompts in Text-to-Image Generation Paper • 2505.16763 • Published May 22
Beyond Empathy: Integrating Diagnostic and Therapeutic Reasoning with Large Language Models for Mental Health Counseling Paper • 2505.15715 • Published May 21 • 1
Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation Paper • 2505.24787 • Published May 30 • 1
ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies Paper • 2506.12830 • Published Jun 15
Speed Always Wins: A Survey on Efficient Architectures for Large Language Models Paper • 2508.09834 • Published Aug 13 • 53
TheraMind: A Strategic and Adaptive Agent for Longitudinal Psychological Counseling Paper • 2510.25758 • Published Oct 29 • 1
HiCoGen: Hierarchical Compositional Text-to-Image Generation in Diffusion Models via Reinforcement Learning Paper • 2511.19965 • Published 15 days ago • 1