Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments Paper • 2605.30280 • Published 26 days ago • 146
Claw-Anything: Benchmarking Always-On Personal Assistants with Broader Access to User's Digital World Paper • 2605.26086 • Published 29 days ago • 24
MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research Paper • 2605.26114 • Published 29 days ago • 65
MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research Paper • 2605.26114 • Published 29 days ago • 65
Claw-Anything: Benchmarking Always-On Personal Assistants with Broader Access to User's Digital World Paper • 2605.26086 • Published 29 days ago • 24
Claw-Anything: Benchmarking Always-On Personal Assistants with Broader Access to User's Digital World Paper • 2605.26086 • Published 29 days ago • 24
DexJoCo: A Benchmark and Toolkit for Task-Oriented Dexterous Manipulation on MuJoCo Paper • 2605.16257 • Published May 15 • 55
CLI-Gym: Scalable CLI Task Generation via Agentic Environment Inversion Paper • 2602.10999 • Published Feb 11 • 11
FeatureBench: Benchmarking Agentic Coding for Complex Feature Development Paper • 2602.10975 • Published Feb 11 • 18
CLI-Gym: Scalable CLI Task Generation via Agentic Environment Inversion Paper • 2602.10999 • Published Feb 11 • 11
RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System Paper • 2602.02488 • Published Feb 2 • 36
FeatureBench: Benchmarking Agentic Coding for Complex Feature Development Paper • 2602.10975 • Published Feb 11 • 18