RedHatAI/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8
Text Generation
•
32B
•
Updated
•
129
•
2
RedHatAI/Apertus-8B-Instruct-2509-FP8-dynamic
Text Generation
•
8B
•
Updated
•
173
•
2
RedHatAI/Ministral-3-14B-Instruct-2512
14B
•
Updated
•
75
RedHatAI/Mistral-Large-3-675B-Instruct-2512-NVFP4
Updated
•
43
RedHatAI/Mistral-Large-3-675B-Instruct-2512
RedHatAI/Qwen3-Next-80B-A3B-Instruct-FP8
Text Generation
•
81B
•
Updated
•
371
RedHatAI/starcoder2-15b-quantized.w8a16
Text Generation
•
4B
•
Updated
•
24
RedHatAI/Phi-3-mini-128k-instruct-quantized.w8a8
Text Generation
•
4B
•
Updated
•
30
RedHatAI/Qwen3-8B-FP8-block
Text Generation
•
8B
•
Updated
•
46
Text Generation
•
358B
•
Updated
•
16
RedHatAI/Qwen3-Next-80B-A3B-Instruct-FP8-dynamic
Text Generation
•
80B
•
Updated
•
69
RedHatAI/Kimi-K2-Thinking
Text Generation
•
Updated
•
16
RedHatAI/Llama-3.1-8B-Instruct-speculator.eagle3
Text Generation
•
1.0B
•
Updated
•
5.95k
•
1
RedHatAI/Qwen3-30B-A3B-Instruct-2507.w4a16
Text Generation
•
5B
•
Updated
•
52
RedHatAI/Qwen3-30B-A3B-Instruct-2507-speculator.eagle3
Text Generation
•
0.5B
•
Updated
•
168
•
1
Text Generation
•
120B
•
Updated
•
104
•
4
Text Generation
•
22B
•
Updated
•
10.1k
•
5
RedHatAI/Llama-3.3-70B-Instruct-FP8-dynamic
Text Generation
•
71B
•
Updated
•
63k
•
13
RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic
Text Generation
•
8B
•
Updated
•
33.4k
•
9
RedHatAI/Qwen3-Next-80B-A3B-Instruct-quantized.w4a16
Updated
•
19
RedHatAI/Qwen3-Next-80B-A3B-Instruct-NVFP4
Text Generation
•
Updated
•
907
•
4
RedHatAI/Qwen3-Next-80B-A3B-Instruct-FP8-block
Text Generation
•
80B
•
Updated
•
9
RedHatAI/Qwen3-VL-32B-Instruct-NVFP4
Text Generation
•
20B
•
Updated
•
1.48k
RedHatAI/Qwen3-VL-32B-Instruct-FP8-dynamic
Text Generation
•
33B
•
Updated
•
268
•
1
RedHatAI/Qwen3-VL-32B-Instruct-FP8-block
Text Generation
•
33B
•
Updated
•
8
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-NVFP4
Text Generation
•
229B
•
Updated
•
497
•
2
RedHatAI/Qwen3-30B-A3B-NVFP4
Text Generation
•
17B
•
Updated
•
3.27k
•
2
RedHatAI/Qwen3-235B-A22B-NVFP4
Text Generation
•
136B
•
Updated
•
81
RedHatAI/Qwen3-VL-235B-A22B-Instruct-NVFP4
Text Generation
•
133B
•
Updated
•
5.55k
•
6
RedHatAI/Qwen3-235B-A22B-Instruct-2507-NVFP4
Text Generation
•
136B
•
Updated
•
316
•
4