ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-F16 Reinforcement Learning • 8B • Updated Mar 25, 2025 • 455 • 91
Skywork-Reward-Data-Collection Collection Open-source preference datasets used to train the Skywork reward model series • 16 items • Updated Mar 2 • 21
Running on CPU Upgrade 14k Open LLM Leaderboard 🏆 14k Track, rank and evaluate open LLMs and chatbots