Spaces

·

The AI App Directory

New Space Get PRO Learn more

UGI Leaderboard

Uncensored General Intelligence Leaderboard

MTEB Leaderboard

Embedding Leaderboard

Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots

LMArena Leaderboard

View LMArena model leaderboard

Open VLM Leaderboard

VLMEvalKit Evaluation Results Collection

DeepResearch Bench

View model performance leaderboard in a web app

Open ASR Leaderboard

Explore and compare speech‑recognition model benchmarks

TabArena

TabArena

LLM Hallucination Leaderboard

View and filter LLM hallucination leaderboard

Vidore Leaderboard

Compare and rank visual document retrieval models across different benchmarks

Agent Leaderboard

Ranking of LLMs for agentic tasks

Hallucination Evaluation Leaderboard

Redirect to leaderboard page

Open Portuguese LLM Leaderboard

Track, rank and evaluate open LLMs in Portuguese

Find a leaderboard

Explore and discover all leaderboards from the HF community

HUMAINE Leaderboard

Leaderboard of LLMs based on detailed human feedback

VBench-IBench-Leaderboard

Which Video and Image Generation Model is better?

Deep Reinforcement Learning Leaderboard

Display and search reinforcement learning leaderboard data

LLM-Perf Leaderboard

Explore LLM performance across hardware configurations

Open Ko-LLM Leaderboard

Explore and filter language model benchmark results

VBench Leaderboard

Upload video model evaluation data to update the VBench leaderboard

Ocrbench Leaderboard

Show OCRBench leaderboard rankings for OCR models

Q-Bench+ Leaderboard

View leaderboard results for Q-Bench

Open PL LLM Leaderboard

Compare and rank large language models side-by-side

Berkeley Function Calling Leaderboard

View the Berkeley Function-Calling Leaderboard