SURESHBEEKHANI/llama_3.2_vision_amazon_product_description Image-Text-to-Text • Updated Feb 5, 2025 • 5
SURESHBEEKHANI/Gemma_2B_Medical_ORPO_RLHF_Fine_Tuning Question Answering • 3B • Updated Feb 3, 2025 • 33 • 1
SURESHBEEKHANI/Deep-seek-R1-Medical-reasoning-SFT Text Generation • 8B • Updated Jan 30, 2025 • 96 • 1
SURESHBEEKHANI/llama_3_2_3B-dpo-rlhf-fine-tuning Question Answering • 3B • Updated Jan 25, 2025 • 16 • 1