-
-
-
-
-
-
Inference Providers
Active filters:
fp8
Text Generation
•
229B
•
Updated
•
155
•
9
unsloth/DeepSeek-V3-0324-GGUF
Text Generation
•
671B
•
Updated
•
3.08k
•
195
Text Generation
•
0.8B
•
Updated
•
58.4k
•
55
Text Generation
•
8B
•
Updated
•
2.92k
•
1
Text Generation
•
31B
•
Updated
•
41.2k
•
79
stabilityai/stable-diffusion-3.5-large-tensorrt
Text-to-Image
•
Updated
•
1.24k
•
50
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation
•
235B
•
Updated
•
29.2k
•
77
Qwen/Qwen3-4B-Thinking-2507-FP8
Text Generation
•
4B
•
Updated
•
164k
•
46
Qwen/Qwen3-4B-Instruct-2507-FP8
Text Generation
•
4B
•
Updated
•
95.8k
•
60
stabilityai/stable-diffusion-3.5-controlnets-tensorrt
Text-to-Image
•
Updated
•
99
•
5
RedHatAI/gpt-oss-120b-FP8-dynamic
Text Generation
•
117B
•
Updated
•
3.04k
•
10
deepseek-ai/DeepSeek-V3.1
Text Generation
•
685B
•
Updated
•
50.8k
•
•
811
brandonbeiler/InternVL3_5-GPT-OSS-20B-A4B-Preview-FP8-Dynamic
Image-Text-to-Text
•
21B
•
Updated
•
100
•
2
Qwen/Qwen3-Next-80B-A3B-Thinking-FP8
Text Generation
•
81B
•
Updated
•
293k
•
45
RedHatAI/NVIDIA-Nemotron-Nano-9B-v2-FP8-dynamic
Text Generation
•
9B
•
Updated
•
1.46k
•
3
Qwen/Qwen3-VL-235B-A22B-Instruct-FP8
Image-Text-to-Text
•
236B
•
Updated
•
118k
•
34
cerebras/MiniMax-M2-REAP-172B-A10B
Text Generation
•
173B
•
Updated
•
1.03k
•
17
ai-sage/GigaChat3-10B-A1.8B
Text Generation
•
11B
•
Updated
•
6.19k
•
57
deepseek-ai/DeepSeek-Math-V2
Text Generation
•
685B
•
Updated
•
2.22k
•
677
jiangchengchengNLP/qwen3-4b-fp8-scaled
Updated
•
43
•
21
Aratako/Ministral-3-14B-Instruct-2512-TextOnly
14B
•
Updated
•
719
•
4
mlx-community/Ministral-3-8B-Instruct-2512
Text Generation
•
Updated
•
588
•
2
cerebras/DeepSeek-V3.2-REAP-345B-A37B
Text Generation
•
345B
•
Updated
•
1.93k
•
29
XiaomiMiMo/MiMo-V2-Flash-Base
Text Generation
•
310B
•
Updated
•
650
•
37
MedAIBase/AntAngelMed-FP8
103B
•
Updated
•
58
•
2
openaudio/qwen3_omni_fp8_dynamic
32B
•
Updated
•
579
•
2
0xSero/MiniMax-M2.1-REAP-40-REPAIR-IN-PROGRESS
Text Generation
•
139B
•
Updated
•
42
•
1
FriendliAI/Meta-Llama-3-8B-Instruct-fp8
Text Generation
•
8B
•
Updated
•
27
•
2
RedHatAI/Meta-Llama-3-8B-Instruct-FP8
Text Generation
•
8B
•
Updated
•
2.28k
•
•
24
RedHatAI/Mixtral-8x7B-Instruct-v0.1-AutoFP8
Text Generation
•
47B
•
Updated
•
6
•
3