Models and datasets for Elastic Reset (NeurIPS 2023), code at https://github.com/mnoukhov/elastic-reset
Michael N
mnoukhov
AI & ML interests
Representation learning for functional language
Recent Activity
updated a model 1 day ago
mnoukhov/nuevamol-80m-reinvent-sft published a model 1 day ago
mnoukhov/nuevamol-80m-reinvent-sft updated a model 1 day ago
mnoukhov/nuevamol-80m-baseOrganizations
models 48
mnoukhov/nuevamol-80m-reinvent-sft
Text Generation • 78.1M • Updated • 95
mnoukhov/nuevamol-80m-base
Text Generation • 78.1M • Updated • 19
mnoukhov/nuevamol-220m-reinvent-sft
Text Generation • 0.2B • Updated • 201
mnoukhov/nuevamol-80m-init
Text Generation • 0.1B • Updated • 25
mnoukhov/nuevamol-135m-reinvent-sft
Text Generation • 0.1B • Updated • 400
mnoukhov/nuevamol-46m-reinvent-sft
Text Generation • 46.2M • Updated • 314
mnoukhov/nuevamol-220m-base
Text Generation • 0.2B • Updated • 31
mnoukhov/nuevamol-135m-base
Text Generation • 0.1B • Updated • 25
mnoukhov/nuevamol-46m-base
Text Generation • 46.2M • Updated • 31
mnoukhov/test
Updated
datasets 102
mnoukhov/chembl_filtered
Viewer • Updated • 1.18M • 22
mnoukhov/brumo-2025-openinstruct-qwen3-4b-base-32samples-quartiles
Viewer • Updated • 60 • 60
mnoukhov/brumo-2025-openinstruct-qwen3-4b-base-32samples
Viewer • Updated • 30 • 65
mnoukhov/aime-2025-openinstruct-qwen3-4b-base-32samples-quartiles
Viewer • Updated • 60 • 150
mnoukhov/aime-2025-openinstruct-qwen3-4b-base-32samples
Viewer • Updated • 30 • 65
mnoukhov/dapo-math-17k-processed-filtered-qwen3-4b-base-32samples-quartiles
Viewer • Updated • 25.3k • 82
mnoukhov/dapo-math-17k-processed-filtered-qwen3-4b-base-32samples
Viewer • Updated • 12.6k • 171
mnoukhov/gsm8k-train-harder-quartiles
Viewer • Updated • 11.2k • 256
mnoukhov/manufactoria-qwen3-4b-instruct-warmup650-pass128
Viewer • Updated • 874 • 112
mnoukhov/manufactoria-qwen3-4b-instruct-warmup650-pass128-completions
Viewer • Updated • 874 • 180