Alexander Gurung PRO

agurung

alex-gurung

Recent Activity

updated a model 5 days ago

agurung/flawed-fictions-qwen3-4b-litereason

Reinforcement Learning • 4B • Updated 5 days ago • 28

published a model 5 days ago

agurung/flawed-fictions-qwen3-4b-litereason

Reinforcement Learning • 4B • Updated 5 days ago • 28

updated a model 6 days ago

agurung/flawed-fictions-qwen3-4b

Reinforcement Learning • 4B • Updated 6 days ago • 32

updated a model 12 days ago

agurung/colar-qwen25-7b-ff-post-sft

8B • Updated 12 days ago • 17

published a model 12 days ago

agurung/colar-qwen25-7b-ff-post-sft

8B • Updated 12 days ago • 17

updated a model 12 days ago

agurung/qwen-coconut-ff-v2

8B • Updated 12 days ago • 15

updated a model 13 days ago

agurung/colar-qwen3-4b-ff-rl

Reinforcement Learning • 4B • Updated 13 days ago • 32

updated a model 15 days ago

agurung/ncp-qwen25-7b-lengthpenalty

Reinforcement Learning • 8B • Updated 15 days ago • 245

updated a model 16 days ago

agurung/flawed-fictions-qwen3-4b-lengthpenalty-litereason

Reinforcement Learning • 4B • Updated 16 days ago • 84

published 2 models 17 days ago

agurung/colar-qwen3-4b-ff-sft

4B • Updated 17 days ago • 30

agurung/colar-qwen3-4b-ff-rl

Reinforcement Learning • 4B • Updated 13 days ago • 32

updated a model 17 days ago

agurung/colar-qwen3-4b-ff-sft

4B • Updated 17 days ago • 30

published a model 28 days ago

agurung/ncp-qwen25-7b-lengthpenalty

Reinforcement Learning • 8B • Updated 15 days ago • 245

updated 2 models 29 days ago

agurung/flawed-fictions-gemma-3-4b

Reinforcement Learning • 4B • Updated Feb 15 • 91

agurung/flawed-fictions-gemma-3-4b-lengthpenalty

Reinforcement Learning • 4B • Updated 29 days ago • 63

published a model 29 days ago

agurung/flawed-fictions-gemma-3-4b-lengthpenalty

Reinforcement Learning • 4B • Updated 29 days ago • 63

published a model about 1 month ago

agurung/flawed-fictions-qwen3-4b-lengthpenalty-litereason

Reinforcement Learning • 4B • Updated 16 days ago • 84

updated a model about 1 month ago

agurung/flawed-fictions-qwen3-4b-lengthpenalty

Reinforcement Learning • 4B • Updated about 1 month ago • 12

published a model about 1 month ago

agurung/flawed-fictions-qwen3-4b-lengthpenalty

Reinforcement Learning • 4B • Updated about 1 month ago • 12

updated a model about 1 month ago

agurung/qwen3-4b-ff-grpo-lengthpenalty

4B • Updated Feb 24 • 4
