NamrataThakur/Small_Language_Model_GQA_48M_Pretrained Text Generation • Updated 12 days ago • 467 • 1
NamrataThakur/Small_Language_Model_MOE_127M_Pretrained Text Generation • Updated 12 days ago • 384 • 1