Vision - a syddharth Collection

syddharth 's Collections

LLM

Audio

Video

Image

Vision

Vision

updated Sep 23, 2024

01-ai/Yi-VL-34B

Image-Text-to-Text • Updated Jun 26, 2024 • 160 • 264
01-ai/Yi-VL-6B

Image-Text-to-Text • Updated Jun 26, 2024 • 164 • 124
NousResearch/Nous-Hermes-2-Vision-Alpha

Text Generation • Updated Dec 3, 2023 • 119 • 304
liuhaotian/llava-v1.5-13b

Image-Text-to-Text • Updated May 9, 2024 • 19.5k • 521
fancyfeast/joytag

Image Classification • Updated Mar 9, 2024 • 1.49k • 109
internlm/internlm-xcomposer2-7b

Text Generation • Updated Feb 27, 2024 • 3.83k • 31
internlm/internlm-xcomposer2-4khd-7b

Visual Question Answering • Updated Apr 18, 2024 • 1.97k • 73
SmilingWolf/wd-vit-large-tagger-v3

Updated Jul 26, 2024 • 33.1k • 90
Aryn/deformable-detr-DocLayNet

Object Detection • 41.1M • Updated Aug 8, 2025 • 55k • 49
abetlen/Phi-3.5-vision-instruct-gguf

4B • Updated Oct 1, 2024 • 659 • 30
MiaoshouAI/Florence-2-base-PromptGen-v1.5

0.3B • Updated Oct 9, 2024 • 1.63k • 96
stepfun-ai/GOT-OCR2_0

Image-Text-to-Text • 0.7B • Updated Feb 4, 2025 • 18k • 1.53k