2 18 5

Zijie Chen

Zijie-chen

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

upvoted a paper 10 days ago

Attention Residuals

upvoted a paper 15 days ago

HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing

View all activity

Organizations

None yet

upvoted a paper 7 days ago

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

Paper • 2603.15726 • Published 11 days ago • 180

upvoted a paper 10 days ago

Attention Residuals

Paper • 2603.15031 • Published 11 days ago • 162

upvoted a paper 15 days ago

HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing

Paper • 2601.21459 • Published Jan 29 • 10

liked a dataset 23 days ago

MiniByte-666/Dr.SCI

Viewer • Updated 23 days ago • 891k • 78 • 5

authored 3 papers 28 days ago

Breaking the Data Barrier -- Building GUI Agents Through Task Generalization

Paper • 2504.10127 • Published Apr 14, 2025 • 17

A2Eval: Agentic and Automated Evaluation for Embodied Brain

Paper • 2602.01640 • Published Feb 2 • 8

Improving Data and Reward Design for Scientific Reasoning in Large Language Models

Paper • 2602.08321 • Published Feb 9 • 42

upvoted an article about 1 month ago

Article

DenseR: Dense Rewards For Free in LLM Reasoning

Feb 18

•

upvoted 3 papers about 1 month ago

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Paper • 2602.09443 • Published Feb 10 • 59

Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability

Paper • 2602.02477 • Published Feb 2 • 11

Improving Data and Reward Design for Scientific Reasoning in Large Language Models

Paper • 2602.08321 • Published Feb 9 • 42

submitted a paper to Daily Papers about 1 month ago

Improving Data and Reward Design for Scientific Reasoning in Large Language Models

Paper • 2602.08321 • Published Feb 9 • 42

upvoted 3 papers about 2 months ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11, 2024 • 94

MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

Paper • 2602.01734 • Published Feb 2 • 32

A2Eval: Agentic and Automated Evaluation for Embodied Brain

Paper • 2602.01640 • Published Feb 2 • 8

submitted a paper to Daily Papers about 2 months ago

A2Eval: Agentic and Automated Evaluation for Embodied Brain

Paper • 2602.01640 • Published Feb 2 • 8

upvoted 2 papers 4 months ago

Every Token Counts: Generalizing 16M Ultra-Long Context in Large Language Models

Paper • 2511.23319 • Published Nov 28, 2025 • 24

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 134

upvoted a paper 5 months ago

Knocking-Heads Attention

Paper • 2510.23052 • Published Oct 27, 2025 • 30

upvoted a paper 10 months ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21, 2025 • 88

Zijie Chen

AI & ML interests

Recent Activity

Organizations

Zijie-chen's activity

DenseR: Dense Rewards For Free in LLM Reasoning