SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization Paper • 2604.02268 • Published 5 days ago • 86
GEMS: Agent-Native Multimodal Generation with Memory and Skills Paper • 2603.28088 • Published 8 days ago • 83
LongCat-Next: Lexicalizing Modalities as Discrete Tokens Paper • 2603.27538 • Published 9 days ago • 134
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published 17 days ago • 319
Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills Paper • 2603.25158 • Published 12 days ago • 48
SkillOrchestra: Learning to Route Agents via Skill Transfer Paper • 2602.19672 • Published Feb 23 • 57
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning Paper • 2603.21065 • Published 16 days ago • 77
TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas Paper • 2603.16448 • Published 20 days ago • 58
Online Experiential Learning for Language Models Paper • 2603.16856 • Published 20 days ago • 57
Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model Paper • 2603.05438 • Published Mar 5 • 40
Remember Me, Refine Me: A Dynamic Procedural Memory Framework for Experience-Driven Agent Evolution Paper • 2512.10696 • Published Dec 11, 2025 • 3
AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning Paper • 2505.24298 • Published May 30, 2025 • 34
CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation Paper • 2602.24286 • Published Feb 27 • 97
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens Paper • 2603.02138 • Published Mar 2 • 150
Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems Paper • 2602.08847 • Published Feb 9 • 29