AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration Paper • 2605.20025 • Published 5 days ago • 154
GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment Paper • 2605.19577 • Published 5 days ago • 56
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published 11 days ago • 263
ESARBench: A Benchmark for Agentic UAV Embodied Search and Rescue Paper • 2605.01371 • Published 22 days ago • 6
DCAgent2/swebench_verified_random_100_folders_g1_top8_100k_32b_step2100_20260501_070332 Viewer • Updated 23 days ago • 300 • 128 • 1
Back to Repair: A Minimal Denoising Network\ for Time Series Anomaly Detection Paper • 2604.17388 • Published Apr 19 • 3
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 503
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published Apr 8 • 326