Nathan Habib PRO
AI & ML interests
Evals
Recent Activity
new activity about 8 hours ago
google/gemma-4-E2B-it:Add MMMU-Pro evaluation result new activity about 8 hours ago
Qwen/Qwen2.5-VL-3B-Instruct:Add MMMU-Pro evaluation result new activity about 8 hours ago
Qwen/Qwen2.5-VL-7B-Instruct:Add MMMU-Pro evaluation resultOrganizations
benchmarks
RULER Datasets Falcon-H1-3B-Base
RULER Datasets
RULER Datasets Lamma3-Instruct
RULER Datasets
RULER Datasets Qwen2.5-Instruct
RULER Datasets
RULER Datasets Qwen-3-Instruct
RULER Datasets
RULER Datasets Qwen-3
RULER Datasets
agents
Agents ressources
All the ressources I found / used when getting up to speed with agents.