-
-
-
-
-
-
Inference Providers
Active filters:
modelopt
Text Generation
•
Updated
•
30.7k
•
35
nvidia/Qwen3-Next-80B-A3B-Thinking-NVFP4
Text Generation
•
Updated
•
58.3k
•
40
nvidia/Kimi-K2-Thinking-NVFP4
Text Generation
•
Updated
•
112k
•
23
lukealonso/MiniMax-M2.1-NVFP4
115B
•
Updated
•
50.5k
•
24
nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4
Text Generation
•
Updated
•
21.2k
•
23
nvidia/Qwen3-235B-A22B-Thinking-2507-NVFP4
Text Generation
•
120B
•
Updated
•
162
•
3
vincentzed-hf/Qwen3-Coder-Next-NVFP4
Text Generation
•
Updated
•
1.39k
•
3
nvidia/Llama-4-Scout-17B-16E-Instruct-NVFP4
56B
•
Updated
•
24.5k
•
20
nvidia/Phi-4-multimodal-instruct-NVFP4
4B
•
Updated
•
3.23k
•
7
Text Generation
•
15B
•
Updated
•
3.23k
•
3
shanjiaz/gpt-oss-120b-nvfp4-modelopt
59B
•
Updated
•
9.12k
•
2
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1-FP4-QAD
Image-Text-to-Text
•
6B
•
Updated
•
169
•
12
Ex0bit/Qwen3-VLTO-32B-Instruct-NVFP4
Text Generation
•
17B
•
Updated
•
222
•
1
Ex0bit/Qwen3-VLTO-32B-Instruct-NVFP4-256K
Text Generation
•
17B
•
Updated
•
148
•
1
Ex0bit/OLMo-3-7B-Instruct-NVFP4-1M
Text Generation
•
4B
•
Updated
•
26
•
2
Text Generation
•
177B
•
Updated
•
4.62k
•
15
nvidia/Qwen3-Coder-480B-A35B-Instruct-NVFP4
Text Generation
•
241B
•
Updated
•
121
•
1
Geodd/GLM-4.7-Flash-W8A16
Text Generation
•
0.7B
•
Updated
•
103
•
1
vincentzed-hf/Kimi-K2.5-MXFP8
Image-Text-to-Text
•
1T
•
Updated
•
17
•
1
nvidia/Llama-4-Maverick-17B-128E-Instruct-FP8
402B
•
Updated
•
492
•
12
nvidia/Llama-4-Scout-17B-16E-Instruct-FP8
109B
•
Updated
•
109k
•
11
ishan24/test_modelopt_quant
nvidia/Llama-4-Maverick-17B-128E-Eagle3
Updated
•
16
•
9
nvidia/Qwen3-30B-A3B-NVFP4
Text Generation
•
16B
•
Updated
•
32.2k
•
23
jiangchengchengNLP/L3.3-MS-Nevoria-70b-FP8
Text Generation
•
71B
•
Updated
•
6
NVFP4/Qwen3-30B-A3B-Instruct-2507-FP4
Text Generation
•
16B
•
Updated
•
1.33k
•
11
NVFP4/Qwen3-Coder-30B-A3B-Instruct-FP4
Text Generation
•
16B
•
Updated
•
22.6k
•
7
gesong2077/Qwen3-32B-NVFP4
19B
•
Updated
•
1
54B
•
Updated
nvidia/Phi-4-multimodal-instruct-FP8
6B
•
Updated
•
35.4k
•
4