Inference Providers
Active filters: instruct
ai-sage/GigaChat3.1-10B-A1.8B-GGUF
Text Generation
• 11B • Updated • 4.49k
• 34
ai-sage/GigaChat3.1-702B-A36B
Text Generation
• 715B • Updated • 417
• 20
ai-sage/GigaChat3.1-10B-A1.8B
Text Generation
• 11B • Updated • 939
• 19
ai-sage/GigaChat3.1-702B-A36B-GGUF
Text Generation
• 702B • Updated • 479
• 13
ai-sage/GigaChat3.1-10B-A1.8B-bf16
Text Generation
• 11B • Updated • 1.09k
• 9
mradermacher/Llama3.3-8B-Instruct-Thinking-Heretic-Uncensored-Claude-4.5-Opus-High-Reasoning-i1-GGUF
8B • Updated • 124k
• 30
ai-sage/GigaChat3.1-702B-A36B-bf16
Text Generation
• 715B • Updated • 524
• 5
teknium/OpenHermes-2.5-Mistral-7B
Text Generation
• Updated • 150k
• 892
aaditya/Llama3-OpenBioLLM-70B
Text Generation
• Updated • 3.19k
• 503
NousResearch/Hermes-3-Llama-3.1-8B
Text Generation
• 8B • Updated • 252k
• • 399
NousResearch/Hermes-3-Llama-3.1-8B-GGUF
8B • Updated • 7.9k
• 140
NousResearch/Hermes-3-Llama-3.1-405B
Text Generation
• Updated • 161
• 265
DavidAU/Mistral-Small-3.1-24B-Instruct-2503-MAX-NEO-Imatrix-GGUF
Text Generation
• 24B • Updated • 885
• 38
NousResearch/Hermes-4-14B
Text Generation
• 425k • Updated • 4.21k
• 125
YCWTG/Qwen3.5-35B-A3B-Instruct-int4-mixed-AutoRound
Text Generation
• 7B • Updated • 323
• 2
teknium/Mistral-Trismegistus-7B
Text Generation
• Updated • 154
• 236
NousResearch/Nous-Hermes-2-Yi-34B
Text Generation
• Updated • 8.16k
• 256
NousResearch/Nous-Hermes-2-Yi-34B-GGUF
34B • Updated • 814
• 51
NousResearch/Hermes-2-Pro-Mistral-7B-GGUF
7B • Updated • 4.3k
• 246
LiteLLMs/Llama3-OpenBioLLM-70B-GGUF
71B • Updated • 200
• 8
typealias/Hermes-2-Theta-Llama-3-8B-mlx-4bit
1B • Updated • 10
• 1
unsloth/mistral-7b-instruct-v0.3-bnb-4bit
Text Generation
• 7B • Updated • 49.3k
• 35
mlx-community/Hermes-3-Llama-3.1-8B-4bit
1B • Updated • 439
• 5
bartowski/Pantheon-RP-Pure-1.6.2-22b-Small-GGUF
Text Generation
• 22B • Updated • 191
• 19
NousResearch/Hermes-3-Llama-3.2-3B
Text Generation
• 3B • Updated • 14.8k
• 176
bartowski/Hermes-3-Llama-3.2-3B-GGUF
Text Generation
• Updated • 8.99k
• 13
mlx-community/Hermes-3-Llama-3.2-3B-4bit
Text Generation
• 0.5B • Updated • 89
• 1
CuckmeisterFuller/Hermes-3-Llama-3.2-3B-Q4-mlx
Text Generation
• 0.5B • Updated • 10
• 1
roleplaiapp/Qwen2.5-7B-Instruct-Uncensored-Q5_K_M-GGUF
Text Generation
• 8B • Updated • 171
• 1
DavidAU/Reka-Flash-3-21B-Reasoning-Uncensored-MAX-NEO-Imatrix-GGUF
Text Generation
• 21B • Updated • 1.18k
• 56