Kideng

luo-li-ba-suo

AI & ML interests

Reinforcement Learning, Language Model

Organizations

None yet

Recent Activity

upvoted an article about 1 month ago

Article

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

Aug 9, 2025

liked a dataset 4 months ago

hfl/ruozhiba_gpt4

Viewer • Updated May 18, 2024 • 4.9k • 201 • 92

liked a model 12 months ago

ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-F16

Reinforcement Learning • 8B • Updated Mar 25, 2025 • 455 • 91

liked a model about 1 year ago

unsloth/DeepSeek-R1-BF16

Text Generation • Updated Apr 19, 2025 • 322 • 24

liked a model over 1 year ago

MiniMaxAI/MiniMax-Text-01

Text Generation • Updated Jul 3, 2025 • 12.9k • 652

upvoted a collection over 1 year ago

Skywork-Reward-Data-Collection

Collection

Open-source preference datasets used to train the Skywork reward model series • 16 items • Updated Mar 2 • 21

liked a dataset over 1 year ago

NCSOFT/offsetbias

Viewer • Updated Jul 12, 2024 • 8.5k • 75 • 25

liked a Space over 1 year ago

Open LLM Leaderboard

🏆

14k

Track, rank and evaluate open LLMs and chatbots

liked a model almost 2 years ago

OpenRLHF/Llama-3-8b-rlhf-100k

Text Generation • 8B • Updated Jun 24, 2024 • 82 • 4

liked a dataset almost 2 years ago

lmarena-ai/arena-human-preference-55k

Viewer • Updated May 17, 2024 • 57.5k • 778 • 159

New activity in allenai/preference-test-sets about 2 years ago

Doubts about “LMSYS Human MT Bench Pairs” split

#2 opened about 2 years ago by

Kideng

liked a dataset about 2 years ago

argilla/dpo-mix-7k

Viewer • Updated Jul 16, 2024 • 7.5k • 552 • 171

liked a model about 2 years ago

jondurbin/bagel-34b-v0.4

Text Generation • 34B • Updated Feb 21, 2024 • 8 • 10

liked a dataset about 2 years ago

berkeley-nest/Nectar

Viewer • Updated Mar 20, 2024 • 183k • 1.17k • 295

upvoted a paper over 2 years ago

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 153

liked a Space over 2 years ago

Arena Leaderboard

🏆

4.86k

View the LMArena model leaderboard

liked a model over 2 years ago

LingxinAI/CharacterGLM-6b

Updated Feb 2, 2024 • 55

liked a dataset over 2 years ago

openbmb/UltraFeedback

Viewer • Updated Dec 29, 2023 • 64k • 6.18k • 415

liked a model over 2 years ago

Qwen/Qwen-72B

Text Generation • 72B • Updated Oct 9, 2024 • 2.97k • 362

liked a Space over 2 years ago

Yi-34B-Chat

🔥

342
