·
AI & ML interests
None yet
Organizations
deqing/llama-3.2-350M-fourier_otf_multiplication_unk
Text Generation
•
0.4B
•
Updated
•
10
deqing/llama-3.2-350M-fourier_otf_multiplication_llama
Text Generation
•
0.3B
•
Updated
•
17
deqing/nim_llm_pretrained_pretrained
68.3M
•
Updated
•
1
deqing/nim_llm_pretrained_pretraining_10piles
68.3M
•
Updated
•
1
deqing/nim_llm_pretrained_pretraining_9piles
68.3M
•
Updated
•
2
deqing/nim_llm_pretrained_pretraining_8piles
68.3M
•
Updated
•
1
deqing/nim_llm_pretrained_pretraining_7piles
68.3M
•
Updated
•
1
deqing/nim_llm_pretrained_pretraining_6piles
68.3M
•
Updated
•
1
deqing/nim_llm_pretrained_pretraining_5piles
68.3M
•
Updated
•
1
deqing/nim_llm_pretrained_pretraining_4piles
68.3M
•
Updated
•
2
deqing/nim_llm_pretrained_pretraining_3piles
68.3M
•
Updated
•
1
deqing/nim_llm_sft_pretrained
68.3M
•
Updated
•
1
deqing/llama-3.2-350M-fourier_otf_multiplication_old_fashioned
Updated
deqing/fourier_causal_lm_Llama_3.2_1B_pretrain_megamath_2025_05_29
Updated
deqing/llama_3.2_1b_megamath_fourier_full_2025_05_10
Text Generation
•
1B
•
Updated
•
1
deqing/llama_3.2_1b_megamath_ablation_full_2025_05_06
Text Generation
•
1B
•
Updated
•
1
deqing/llama_3.2_1b_megamath_fourier_full_2025_05_02
Text Generation
•
1B
•
Updated
•
2
deqing/llama_3.2_1b_megamath_full_2025_05_01
Updated
deqing/llama_3.2_1b_megamath_2025_04_25
Updated
deqing/llama_3.2_1b_megamath_2025_04_25_converted
1B
•
Updated
deqing/llama_3.2_1b_openwebtext_2025_03_02_converted_fne_gsm8k_2025_03_11
Text Generation
•
1B
•
Updated
•
4
•
deqing/llama_3.2_1b_openwebtext_2025_03_02_converted
1B
•
Updated
•
2
deqing/llama_3.2_1b_openwebtext_2025_03_02
1B
•
Updated
•
2
deqing/llama-3.2-350M-fourier_multiplication_dataset_pert
Text Generation
•
0.3B
•
Updated
•
1
deqing/llama-3.2-350M-fourier_multiplication_dataset
Text Generation
•
0.3B
•
Updated
deqing/llama-3.2-350M-fourier
Text Generation
•
0.3B
•
Updated
•
3
deqing/llama-3.2-350M-fourier_arithmetic_dataset
Text Generation
•
0.3B
•
Updated
•
2
deqing/llama-3.2-350M-fourier_addition_dataset
0.3B
•
Updated
•
1
deqing/llama-3.2-350M-fourier_openwebtext_filtered
0.3B
•
Updated
•
3
deqing/llama-3.2-350M-fourier-arithmetic-only
Updated