Collection of models and datasets for Beyond Binary Rewards: Training LMs to Reason about their Uncertainty
Mehul Damani PRO
mehuldamani
AI & ML interests
Reinforcement Learning, Large Language Models
Recent Activity
published
a model
about 16 hours ago
mehuldamani/regularBrier_mixedNumCandidates_rlcr_multi_from_rlvr_chkpt360
published
a model
2 days ago
mehuldamani/strongBrier_newPrompt_rlcr_multi_from_rlvr_chkpt360
published
a model
3 days ago
mehuldamani/rlcr_single_from_rlvr_chkpt360
Organizations
None yet