Massive Javanese language pre-training corpus
Afrizal Hasbi Azizy
afrizalha
AI & ML interests
LLMs for the low-resource languages of Indonesia.
Organizations
models 11
afrizalha/Sasando-1-7M
Text Generation • 7.34M • Updated
• 4 • 1
afrizalha/Sasando-1-25M
Text Generation • 25.2M • Updated
• 3 • 1
afrizalha/yourtts-ckpt
Updated
afrizalha/musetalk-models
Updated
afrizalha/Bakpia-V1-9B-Javanese-4bit
Text Generation • 9B • Updated
• 1
afrizalha/Bakpia-V1-0.5B-Javanese
Text Generation • 0.5B • Updated
• 1
afrizalha/Bakpia-V1-9B-Javanese-fp16
Text Generation • 9B • Updated
afrizalha/Bakpia-V1-1.5B-Javanese
Text Generation • 2B • Updated
• 2
afrizalha/Kancil-V1-llama3-4bit
Text Generation • 8B • Updated
• 3 • 1
afrizalha/Kancil-V0-llama3
Text Generation • 5B • Updated
• 2 • 3
datasets 11
afrizalha/Centhini-1-Javanese
Viewer
• Updated
• 530k • 11 • 2
afrizalha/Gatra-1-Javanese
Viewer
• Updated
• 560 • 15 • 3
afrizalha/Tumpeng-1-Indonesian
Viewer
• Updated
• 48.7k • 15 • 5
afrizalha/Gatra-2-Javanese
Viewer
• Updated
• 36.9k • 19 • 3
afrizalha/wikipedia-Only-Indo-Articles-About-Indo
Viewer
• Updated
• 220k • 13 • 1
afrizalha/Indo4B-8M-Uncontrolled
Viewer
• Updated
• 8.05M • 2
afrizalha/Indo4B-Combined-T18000
Viewer
• Updated
• 8.05M • 11
afrizalha/javanese-collection
Viewer
• Updated
• 128k • 10
afrizalha/TumpengQA
Viewer
• Updated
• 28.2k • 36 • 1
afrizalha/KamusOne-28M-Indonesian
Viewer
• Updated
• 575k • 24 • 2