Melvin Vivas PRO
melvindave
AI & ML interests
Small Language Models, Vision, TTS, STT, Image Gen
Recent Activity
updated a collection about 6 hours ago: Vision
updated a collection about 6 hours ago: Notable Spaces
liked a Space about 6 hours ago: sergiopaniego/vlm_object_understanding
Organizations
Datasets
Evaluation
Text to Speech
- VibeVoice-Realtime-0.5B • Running on Zero • Featured • 🐨 87 • Generate speech from text
- microsoft/VibeVoice-1.5B • Text-to-Speech • 3B • Updated • 390k • 2.07k
- Qwen3 TTS Demo • Running • Featured • 🚀 269 • Generate speech from text with voice options
- mradermacher/Qwen3-1.7B-Multilingual-TTS-GGUF • 2B • Updated • 626
Image Generation
Coding
Customer Conversations Datasets
Vision
- Open VLM Leaderboard • Running on CPU Upgrade • 🌎 944 • VLMEvalKit Evaluation Results Collection
- DeepSeek OCR Demo • Running on Zero • Featured • 🚀 278 • Try out DeepSeek-OCR on your PDFs or images
- Multimodal OCR3 • Running on Zero • MCP • 🔥 48 • nanonets2-ocr / chandra-ocr / dots.ocr / olm-ocr2
- Qwen/Qwen3-VL-30B-A3B-Instruct • Image-Text-to-Text • 31B • Updated • 1.25M • 434
Papers
Language Models (Reasoning)
Audio Transcription
Fine-tuning Models
OCR Datasets
Notable Spaces