Yaorui SHI

yrshi

syr-cn

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

upvoted a paper 4 days ago

GEMS: Agent-Native Multimodal Generation with Memory and Skills

upvoted a paper 4 days ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

View all activity

Organizations

upvoted 3 papers 4 days ago

upvoted a paper 5 days ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 17 days ago • 319

upvoted a paper 8 days ago

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

Paper • 2603.25158 • Published 12 days ago • 48

upvoted 2 papers 10 days ago

SkillOrchestra: Learning to Route Agents via Skill Transfer

Paper • 2602.19672 • Published Feb 23 • 57

LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning

Paper • 2603.21065 • Published 16 days ago • 77

upvoted 3 papers 20 days ago

TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas

Paper • 2603.16448 • Published 20 days ago • 58

Online Experiential Learning for Language Models

Paper • 2603.16856 • Published 20 days ago • 57

Attention Residuals

Paper • 2603.15031 • Published 21 days ago • 175

upvoted a paper 27 days ago

Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model

Paper • 2603.05438 • Published Mar 5 • 40

liked a dataset 29 days ago

OldKingMeister/lmsys-arena-processed-data

Preview • Updated about 1 month ago • 30 • 1

upvoted 4 papers about 1 month ago

Remember Me, Refine Me: A Dynamic Procedural Memory Framework for Experience-Driven Agent Evolution

Paper • 2512.10696 • Published Dec 11, 2025 • 3

AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning

Paper • 2505.24298 • Published May 30, 2025 • 34

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Paper • 2602.24286 • Published Feb 27 • 97

OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens

Paper • 2603.02138 • Published Mar 2 • 150

upvoted a collection about 1 month ago

Qwen3.5

Collection

21 items • Updated 28 days ago • 1.43k

liked a model about 1 month ago

meituan/MemOCR-7B

Visual Question Answering • 8B • Updated Mar 3 • 22 • 7

updated a model about 1 month ago

meituan/MemOCR-7B

Visual Question Answering • 8B • Updated Mar 3 • 22 • 7

upvoted a paper about 1 month ago

Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems

Paper • 2602.08847 • Published Feb 9 • 29

Yaorui SHI

AI & ML interests

Recent Activity

Organizations

yrshi's activity