Recipes
Browse all recipes
Filter 143 recipes by task, architecture, size, precision, and hardware.
All recipes
arcee-ai/Trinity-Large-Thinking
baidu/ERNIE-4.5-21B-A3B-PT
baidu/ERNIE-4.5-VL-28B-A3B-PT
baidu/Unlimited-OCR
bosonai/higgs-audio-v3-tts-4b
ByteDance-Seed/Seed-OSS-36B-Instruct
deepseek-ai/DeepSeek-OCR
deepseek-ai/DeepSeek-OCR-2
deepseek-ai/DeepSeek-R1
deepseek-ai/DeepSeek-V3
deepseek-ai/DeepSeek-V3.1
deepseek-ai/DeepSeek-V3.2
deepseek-ai/DeepSeek-V3.2-Exp
deepseek-ai/DeepSeek-V4-Flash
deepseek-ai/DeepSeek-V4-Pro
fishaudio/s2-pro
Google/diffusiongemma-26B-A4B-it
Google/gemma-4-12B-it
Google/gemma-4-26B-A4B-it
Google/gemma-4-31B-it
Google/gemma-4-E2B-it
Google/gemma-4-E4B-it
Google/translategemma-27b-it
inclusionAI/Ling-2.6-1T
inclusionAI/Ling-2.6-flash
inclusionAI/Ming-omni-tts-0.5B
inclusionAI/Ring-1T-FP8
inclusionAI/Ring-2.6-1T
internlm/Intern-S1
internlm/Intern-S2-Preview
JetBrains/Mellum2-12B-A2.5B-Instruct
JetBrains/Mellum2-12B-A2.5B-Thinking
jinaai/jina-embeddings-v5-text-small
jinaai/jina-reranker-m0
LiquidAI/LFM2.5-1.2B-Base
LiquidAI/LFM2.5-1.2B-Instruct
LiquidAI/LFM2.5-1.2B-JP
LiquidAI/LFM2.5-1.2B-JP-202606
LiquidAI/LFM2.5-1.2B-Thinking
LiquidAI/LFM2.5-230M
LiquidAI/LFM2.5-350M
LiquidAI/LFM2.5-8B-A1B
LiquidAI/LFM2.5-VL-1.6B
LiquidAI/LFM2.5-VL-450M
meituan-longcat/LongCat-Image-Edit
meta-llama/Llama-3.1-8B-Instruct
meta-llama/Llama-3.3-70B-Instruct
meta-llama/Llama-4-Scout-17B-16E-Instruct
microsoft/Phi-4-mini-instruct
MiniMaxAI/MiniMax-M2
MiniMaxAI/MiniMax-M2.1
MiniMaxAI/MiniMax-M2.5
MiniMaxAI/MiniMax-M2.7
MiniMaxAI/MiniMax-M3
mistralai/Ministral-3-14B-Instruct-2512
mistralai/Ministral-3-8B-Reasoning-2512
mistralai/Mistral-Large-3-675B-Instruct-2512
mistralai/Mistral-Medium-3.5-128B
mistralai/Mistral-Small-4-119B-2603
mistralai/Voxtral-4B-TTS-2603
mistralai/Voxtral-Mini-4B-Realtime-2602
moonshotai/Kimi-K2-Instruct
moonshotai/Kimi-K2-Thinking
moonshotai/Kimi-K2.5
moonshotai/Kimi-K2.6
moonshotai/Kimi-K2.7-Code
moonshotai/Kimi-Linear-48B-A3B-Instruct
nvidia/Cosmos3-Nano
nvidia/Cosmos3-Super
nvidia/Cosmos3-Super-Image2Video
nvidia/Cosmos3-Super-Text2Image
nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16
nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16
nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-BF16
nvidia/NVIDIA-Nemotron-Nano-9B-v2
openai/gpt-oss-120b
openai/gpt-oss-20b
openbmb/MiniCPM-V-4.6
openbmb/MiniCPM5-1B
openbmb/VoxCPM2
OpenGVLab/InternVL3_5-8B
OpenMOSS-Team/MOSS-SoundEffect
OpenMOSS-Team/MOSS-TTS
OpenMOSS-Team/MOSS-TTS-Realtime
OpenMOSS-Team/MOSS-TTSD-v1.0
OpenMOSS-Team/MOSS-VoiceGenerator
PaddlePaddle/PaddleOCR-VL
PaddlePaddle/PaddleOCR-VL-1.5
pfnet/plamo-2-translate
pfnet/plamo-3-nict-31b-base
poolside/Laguna-M.1
poolside/Laguna-XS.2
Qwen/Qwen-Image
Qwen/Qwen2.5-32B
Qwen/Qwen2.5-VL-72B-Instruct
Qwen/Qwen2.5-VL-7B-Instruct
Qwen/Qwen3-235B-A22B-Instruct-2507
Qwen/Qwen3-32B
Qwen/Qwen3-4B
Qwen/Qwen3-ASR-1.7B
Qwen/Qwen3-Coder-480B-A35B-Instruct
Qwen/Qwen3-Next-80B-A3B-Instruct
Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice
Qwen/Qwen3-VL-235B-A22B-Instruct
Qwen/Qwen3.5-0.8B
Qwen/Qwen3.5-122B-A10B
Qwen/Qwen3.5-27B
Qwen/Qwen3.5-2B
Qwen/Qwen3.5-35B-A3B
Qwen/Qwen3.5-397B-A17B
Qwen/Qwen3.5-4B
Qwen/Qwen3.5-9B
Qwen/Qwen3.6-27B
Qwen/Qwen3.6-35B-A3B
Qwen/Qwen3Guard-Gen-8B
stabilityai/stable-audio-open-1.0
stabilityai/stable-diffusion-3.5-medium
stepfun-ai/Step-3.5-Flash
stepfun-ai/Step-3.7-Flash
tencent/Hunyuan-A13B-Instruct
tencent/HunyuanOCR
tencent/Hy3-preview
Wan-AI/Wan2.2-T2V-A14B-Diffusers
XiaomiMiMo/MiMo-V2-Flash
XiaomiMiMo/MiMo-V2.5
XiaomiMiMo/MiMo-V2.5-Pro
zai-org/GLM-4.5
zai-org/GLM-4.5V
zai-org/GLM-4.6
zai-org/GLM-4.6V
zai-org/GLM-4.7
zai-org/GLM-5
zai-org/GLM-5.1
zai-org/GLM-5.2
zai-org/GLM-ASR-Nano-2512
zai-org/GLM-GA
zai-org/GLM-Image
zai-org/GLM-OCR
zai-org/GLM-TTS
zai-org/Glyph