PRM and fine-tuned LLM used in our PURE github repo: https://github.com/CJReinforce/PURE
Jie Cheng
jinachris
AI & ML interests
Reinforcement learning, LLM
Recent Activity
upvoted
a
paper
1 day ago
PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning
upvoted
a
collection
30 days ago
Nemotron-Post-Training-v3
upvoted
a
paper
4 months ago
VGGT-X: When VGGT Meets Dense Novel View Synthesis
Organizations
None yet