UGI Leaderboard
Uncensored General Intelligence Leaderboard
Uncensored General Intelligence Leaderboard
Embedding Leaderboard
Track, rank and evaluate open LLMs and chatbots
View LMArena model leaderboard
VLMEvalKit Evaluation Results Collection
View model performance leaderboard in a web app
Explore and compare speechβrecognition model benchmarks
TabArena
View and filter LLM hallucination leaderboard
Compare and rank visual document retrieval models across different benchmarks
Ranking of LLMs for agentic tasks
Redirect to leaderboard page
Track, rank and evaluate open LLMs in Portuguese
Explore and discover all leaderboards from the HF community
Leaderboard of LLMs based on detailed human feedback
Which Video and Image Generation Model is better?
Display and search reinforcement learning leaderboard data
Explore LLM performance across hardware configurations
Explore and filter language model benchmark results
Upload video model evaluation data to update the VBench leaderboard
Show OCRBench leaderboard rankings for OCR models
View leaderboard results for Q-Bench
Compare and rank large language models side-by-side
View the Berkeley Function-Calling Leaderboard