Yichen You's picture

Yichen You

youyc22

·

youyc22

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

updated a dataset 14 days ago

youyc22/amteam-8b-121k-top16

published a dataset 14 days ago

youyc22/amteam-8b-121k-top16

View all activity

Organizations

upvoted a paper 10 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 12 days ago • 85

updated a dataset 14 days ago

youyc22/amteam-8b-121k-top16

Viewer • Updated 14 days ago • 83.9k • 23

published a dataset 14 days ago

youyc22/amteam-8b-121k-top16

Viewer • Updated 14 days ago • 83.9k • 23

updated a collection 14 days ago

TaH

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models • 9 items • Updated 14 days ago • 2

published a dataset 15 days ago

youyc22/amteam-121k-8k

Updated 15 days ago • 93

updated a dataset 15 days ago

youyc22/amteam-121k-8k

Updated 15 days ago • 93

updated a collection 27 days ago

TaH

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models • 9 items • Updated 14 days ago • 2

updated a collection 28 days ago

TaH

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models • 9 items • Updated 14 days ago • 2

updated a collection about 1 month ago

TaH

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models • 9 items • Updated 14 days ago • 2

upvoted a paper about 2 months ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 59

liked a dataset 3 months ago

Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b

Viewer • Updated Jan 31 • 306k • 2.95k • 348

published a model 3 months ago

nics-efc/Standard-1.7B

Text Generation • 2B • Updated Jan 12 • 8

updated a model 3 months ago

nics-efc/Standard-1.7B

Text Generation • 2B • Updated Jan 12 • 8

liked 2 models 4 months ago

Nanbeige/Nanbeige4-3B-Thinking-2511

Text Generation • 4B • Updated Dec 17, 2025 • 1.31k • 205

openai/circuit-sparsity

Text Generation • 0.4B • Updated Dec 12, 2025 • 766 • 205

upvoted an article 5 months ago

Article

Continuous batching from first principles

+1

Nov 25, 2025

•

365

published a model 5 months ago

nics-efc/TaH-plus-1.7B

Text Generation • 2B • Updated Nov 24, 2025 • 12 • 1

updated a model 5 months ago

nics-efc/TaH-plus-1.7B

Text Generation • 2B • Updated Nov 24, 2025 • 12 • 1