Vision To VibeVoice En
Image-Text to Voice (en)
Improve and restore old photos with faces
Convert spoken words into text
Submit code models for evaluation and view leaderboard
VLMEvalKit Evaluation Results Collection
Create quantized models from Hugging Face repos
Generate high-quality text data for LLMs using FineWeb
270+ Impressive LoRAs for Flux.1
Transcribe audio to text
Generate edited images using scribble inputs
Generate captions for images in various styles
Ultralytics YOLO11 Gradio Application for Testing
Upscale images using custom-trained models
Remove or change the background of a video
Generate images with SD3.5
Wan: Open and Advanced Large-Scale Video Generative Models
VGGT (CVPR 2025)
Remove Silence From Audio
ComfyUI Reactor Fast Face Swap HYPERSWAP 256!
ServiceNow-AI model chat
Expressive Zero-Shot TTS
Extract entities, classify text, and extract JSON from input text
Simulated Pre-visit Intake Demo built using MedGemma
Generate a multi-speaker podcast from a script