-
Ejafa/qwen2-0.5b-instruct-simpo-lr-5e-07-gamma-1.5
Text Generation • 0.5B • Updated • 50 • -
Ejafa/qwen2-1.5b-instruct-simpo-lr-5e-07-gamma-1.5
Text Generation • 2B • Updated • 4 -
Ejafa/phi-3-mini-128k-instruct-simpo-lr-5e-07-gamma-1.5
Text Generation • 4B • Updated • 4 -
Ejafa/phi-3-mini-128k-instruct-dpo-lr-5e-07
Text Generation • 4B • Updated • 6
Ejafa Bassam
Ejafa
AI & ML interests
LLM, Diffusion Model, FL, Decentralized AI
Organizations
None yet
SFT
policy optimization
-
Ejafa/qwen2-0.5b-instruct-simpo-lr-5e-07-gamma-1.5
Text Generation • 0.5B • Updated • 50 • -
Ejafa/qwen2-1.5b-instruct-simpo-lr-5e-07-gamma-1.5
Text Generation • 2B • Updated • 4 -
Ejafa/phi-3-mini-128k-instruct-simpo-lr-5e-07-gamma-1.5
Text Generation • 4B • Updated • 4 -
Ejafa/phi-3-mini-128k-instruct-dpo-lr-5e-07
Text Generation • 4B • Updated • 6
vicuna&koala
SFT