shuoxing/llama3-8b-full-pretrain-mix-low-tweet-1m-en Text Generation • 266k • Updated about 6 hours ago • 3
shuoxing/llama3-8b-full-pretrain-mix-low-tweet-1m-en-sft Text Generation • 266k • Updated about 6 hours ago • 4
shuoxing/llama3-8b-full-pretrain-mix-mid-tweet-1m-en Text Generation • 266k • Updated about 6 hours ago • 1
shuoxing/llama3-8b-full-pretrain-mix-mid-tweet-1m-en-sft Text Generation • 266k • Updated about 6 hours ago • 4
shuoxing/llama3-8b-full-pretrain-mix-high-tweet-1m-en Text Generation • 266k • Updated about 6 hours ago • 1
shuoxing/llama3-8b-full-pretrain-mix-high-tweet-1m-en-sft Text Generation • 266k • Updated about 6 hours ago • 2
shuoxing/llama3-8b-full-pretrain-control-tweet-1m-en-sft Text Generation • 266k • Updated about 6 hours ago • 2
shuoxing/qwen2.5-0.5b-instruct-full-pretrain-mix-low-tweet-1m-en-sft Text Generation • 0.5B • Updated about 6 hours ago • 2
shuoxing/qwen2.5-0.5b-instruct-full-pretrain-mix-mid-tweet-1m-en-sft Text Generation • 0.5B • Updated about 6 hours ago • 1
shuoxing/qwen2.5-0.5b-instruct-full-pretrain-mix-high-tweet-1m-en-sft Text Generation • 0.5B • Updated about 6 hours ago • 4
shuoxing/qwen2.5-0.5b-instruct-full-pretrain-control-tweet-1m-en-sft Text Generation • 0.5B • Updated about 6 hours ago • 4