
Jin Zhu

mamba413
https://mamba413.github.io/
  • Mamba413

AI & ML interests

reinforcement learning

Recent Activity

authored a paper 6 days ago
Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning
upvoted a paper 6 days ago
Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning
liked a dataset about 1 month ago
bookcorpus/bookcorpus

Organizations

Stats-powered AI

mamba413's datasets (8)

mamba413/GenerateText_Qwen2.5-1.5B-Instruct_GRPO_HH_Seed1

Viewer • Updated Jun 10 • 7.06k • 9

mamba413/GenerateText_HH_Seed1

Viewer • Updated Mar 25 • 11.8k • 27

mamba413/GenerateText_HH_Seed1_new

Viewer • Updated Mar 24 • 640 • 37

mamba413/RewardModel-BENCH-HH-Seed1

Viewer • Updated Mar 23 • 64 • 9

mamba413/RewardModel-DR-HH-Seed1

Viewer • Updated Mar 23 • 64 • 11

mamba413/train_data_imdb_simu_valid

Viewer • Updated Mar 16 • 48.1k • 38

mamba413/train_data_imdb_simu

Viewer • Updated Mar 15 • 48.1k • 32

mamba413/train_data_imdb

Viewer • Updated Mar 3 • 2 • 10