-
Iterative Layer Pruning for Efficient Translation Inference
Paper • 2510.22763 • Published -
ymoslem/wmt25-ces-deu-24layers-2e-5lr-news-commentary
Text Generation • 6B • Updated • 3 -
ymoslem/wmt25-ces-deu-20layers-2e-5lr-news-commentary
Text Generation • 5B • Updated • 4 -
ymoslem/wmt25-ces-deu-16layers-2e-5lr-news-commentary
Text Generation • 5B • Updated • 4
Yasmin Moslem PRO
ymoslem
AI & ML interests
Machine Translation, Speech Translation, Large Language Models, Natural Language Processing
Recent Activity
liked a model about 19 hours ago
intfloat/multilingual-e5-small liked a model about 20 hours ago
openbmb/VoxCPM1.5 liked a Space 15 days ago
chuuhtetnaing/background-removerOrganizations
MT Quality Estimation
Models for reference-free quality estimation of machine translation
-
ymoslem/ModernBERT-base-long-context-qe-v1
Text Classification • 0.1B • Updated • 6 • 5 -
ymoslem/ModernBERT-large-qe-v1
Text Classification • 0.4B • Updated • 2 • 2 -
ymoslem/xlm-roberta-large-qe-v1
Text Classification • 0.6B • Updated • 3 • 1 -
ymoslem/ModernBERT-large-qe-maxlen512-v1
Text Classification • 0.4B • Updated • 8 • 1
WMT-Model-Compression
-
Iterative Layer Pruning for Efficient Translation Inference
Paper • 2510.22763 • Published -
ymoslem/wmt25-ces-deu-24layers-2e-5lr-news-commentary
Text Generation • 6B • Updated • 3 -
ymoslem/wmt25-ces-deu-20layers-2e-5lr-news-commentary
Text Generation • 5B • Updated • 4 -
ymoslem/wmt25-ces-deu-16layers-2e-5lr-news-commentary
Text Generation • 5B • Updated • 4
MT Quality Estimation
Models for reference-free quality estimation of machine translation
-
ymoslem/ModernBERT-base-long-context-qe-v1
Text Classification • 0.1B • Updated • 6 • 5 -
ymoslem/ModernBERT-large-qe-v1
Text Classification • 0.4B • Updated • 2 • 2 -
ymoslem/xlm-roberta-large-qe-v1
Text Classification • 0.6B • Updated • 3 • 1 -
ymoslem/ModernBERT-large-qe-maxlen512-v1
Text Classification • 0.4B • Updated • 8 • 1
models 69
ymoslem/wmt25-eng-arz-16layers-2e-5lr-news-commentary
Text Generation • 5B • Updated • 1
ymoslem/wmt25-eng-arz-20layers-2e-5lr-news-commentary
Text Generation • 5B • Updated • 3
ymoslem/wmt25-eng-arz-24layers-2e-5lr-news-commentary
Text Generation • 6B • Updated • 1
ymoslem/aya-expanse-8b-eng-arz-16layers
Text Generation • 5B • Updated • 3
ymoslem/aya-expanse-8b-eng-arz-20layers
Text Generation • 5B • Updated • 5
ymoslem/aya-expanse-8b-eng-arz-24layers
Text Generation • 6B • Updated • 1
ymoslem/aya-expanse-8b-20layers-cs-de-iter
Text Generation • 5B • Updated • 1
ymoslem/wmt25-ces-deu-16layers-2e-5lr-news-commentary
Text Generation • 5B • Updated • 4
ymoslem/wmt25-ces-deu-20layers-2e-5lr-news-commentary
Text Generation • 5B • Updated • 4
ymoslem/wmt25-ces-deu-24layers-2e-5lr-news-commentary
Text Generation • 6B • Updated • 3
datasets 41
ymoslem/AIME-clustered
Viewer • Updated • 951 • 18
ymoslem/TeleQnA-clustered-2
Viewer • Updated • 10k • 15
ymoslem/news-commentary-eng-arz
Viewer • Updated • 83.7k • 24
ymoslem/flores-test-pruning
Viewer • Updated • 1.1k • 3
ymoslem/TeleQnA-processed
Viewer • Updated • 10k • 29
ymoslem/Anhui-Telecom-QA
Viewer • Updated • 157k • 30 • 2
ymoslem/TeleQnA-clustered-3
Viewer • Updated • 10k • 13
ymoslem/Law-StackExchange
Viewer • Updated • 24.4k • 308 • 32
ymoslem/IWSLT2025-Test
Viewer • Updated • 772 • 23
ymoslem/news-commentary-en-ar
Viewer • Updated • 84.3k • 8 • 1