Stable Diffusion 3

Fireworks has partnered with Stability to provide blazing-fast image generation using Stable Diffusion 3 (SD3), Stability's latest and most advanced generative image model.

Featured Models

These models are deployed for industry-leading speed on production workloads.

Stable Diffusion 3

The most capable text-to-image model produced by stability.ai, with greatly improved performanc...

~1.25 sec for a 1024 x 1024 30-step image

Mistral Small 24B Instruct 2501

Mistral Small 3 (2501) sets a new benchmark in the "small" Large Language Models category below 70...

up to 200 tokens/sec

Stable Diffusion XL

Image generation model, produced by stability.ai.

~1.25 sec for a 1024 x 1024 30-step image

FLUX.1 [schnell] FP8

FLUX.1 [schnell] is a 12 billion parameter rectified flow transformer capable of generating images f...

up to 200 tokens/sec

Qwen2.5-Coder 32B Instruct

Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as Co...

up to 200 tokens/sec

Qwen2.5-Coder 32B Instruct 128K

Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as Co...

up to 200 tokens/sec

Qwen2.5-Coder 32B Instruct 32K RoPE

Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as Co...

up to 200 tokens/sec

Qwen2.5-Coder 32B Instruct 64k

Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as Co...

up to 200 tokens/sec

DeepSeek R1 Distill Llama 70B

Llama 70B distilled with reasoning from DeepSeek R1

up to 200 tokens/sec

DeepSeek R1 Distill Llama 8B

Llama 8B distilled with reasoning from DeepSeek R1

up to 200 tokens/sec

DeepSeek R1 Distill Qwen 14B

Qwen 14B distilled with reasoning from DeepSeek R1

up to 200 tokens/sec

DeepSeek R1 Distill Qwen 1.5B

Qwen 1.5B distilled with reasoning from DeepSeek R1

up to 200 tokens/sec

DeepSeek R1 Distill Qwen 7B

Qwen 7B distilled with reasoning from DeepSeek R1

up to 200 tokens/sec

FLUX.1 [dev] FP8

FLUX.1 [dev] is a 12 billion parameter rectified flow transformer capable of generating images from ...

up to 200 tokens/sec

Llama 3.2 3B Instruct

The Llama 3.2 collection of multilingual large language models (LLMs) is a collection of pretrained ...

up to 200 tokens/sec

Llama 3.3 70B Instruct

Llama 3.3 70B Instruct is the December update of Llama 3.1 70B. The model improves upon Llama 3.1 70...

up to 200 tokens/sec

Deepseek V3 03-24

A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for e...

up to 200 tokens/sec

Llama 3.1 8B Instruct

The Meta Llama 3.1 collection of multilingual large language models (LLMs) is a collection of pretra...

up to 200 tokens/sec

Qwen2.5-VL 7B Instruct

Qwen2.5-VL is a multimodal large language model series developed by Qwen team, Alibaba Cloud, availa...

up to 200 tokens/sec

DeepSeek V3

A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for...

up to 200 tokens/sec

Qwen2.5-VL 32B Instruct

Qwen2.5-VL is a multimodal large language model series developed by Qwen team, Alibaba Cloud, availa...

up to 200 tokens/sec

DeepSeek R1 (Fast)

DeepSeek R1 (Fast) is the speed-optimized serverless deployment of DeepSeek-R1. Compared to the Deep...

up to 200 tokens/sec

Llama 3.1 405B Instruct

The Meta Llama 3.1 collection of multilingual large language models (LLMs) is a collection of pretra...

up to 200 tokens/sec

Llama 4 Scout Instruct (Basic)

The Llama 4 collection of models are natively multimodal AI models that enable text and multimodal e...

up to 200 tokens/sec

Llama 4 Maverick Instruct (Basic)

The Llama 4 collection of models are natively multimodal AI models that enable text and multimodal e...

up to 200 tokens/sec

Deepseek R1 05/28

05/28 updated checkpoint of DeepSeek R1. Its overall performance is now approaching that of leading ...

up to 200 tokens/sec

Qwen3 235B A22B

Latest Qwen3 state-of-the-art model, with 235B total parameters and 22B active parameters.

up to 200 tokens/sec

Qwen3 30B-A3B

Latest Qwen3 state-of-the-art model, with 30B total parameters and 3B active parameters.

up to 200 tokens/sec

Kimi K2 Instruct

Kimi K2 is a state-of-the-art mixture-of-experts (MoE) language model with 32 billion activated para...

up to 200 tokens/sec

Qwen3 235B A22B Instruct 2507

Updated FP8 version of Qwen3-235B-A22B non-thinking mode, with better tool use, coding, instruction ...

up to 200 tokens/sec

Qwen3 235B A22B Thinking 2507

Latest Qwen3 thinking model, competitive with the best closed-source models as of July 2025.

up to 200 tokens/sec

Qwen3 Coder 480B A35B Instruct

Qwen3's most agentic code model to date

up to 200 tokens/sec

Image Models

All currently deployed image models.

Stable Diffusion 3 | Serverless

The most capable text-to-image model produced by stability.ai, with greatly improved performance in multi-subject prompts, image quality, and spelling abilities. The Stable Diffusion 3 API is provided by Stability and the model is powered by Fireworks. Unlike other models on the Fireworks playground, you'll need a Stability API key to use this model. To use the API directly, visit https://platform.stability.ai/docs/api-reference#tag/Generate/paths/~1v2beta~1stable-image~1generate~1sd3/post

Model ID: accounts/stability/models/sd3
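The documentation link for Stable Diffusion 3 encodes the endpoint in its fragment: `~1` is the JSON Pointer escape for `/`, and the final segment is the HTTP method. The sketch below decodes that fragment and assembles a request description without sending anything. The base URL `https://api.stability.ai`, the `Bearer` authorization header, and the `prompt` field name are assumptions not stated on this page, and `build_request` is a hypothetical helper; check the Stability API reference before relying on any of them.

```python
from urllib.parse import urlsplit

# Documentation link quoted in the Stable Diffusion 3 entry.
DOCS_URL = (
    "https://platform.stability.ai/docs/api-reference"
    "#tag/Generate/paths/~1v2beta~1stable-image~1generate~1sd3/post"
)


def endpoint_from_docs_url(docs_url):
    """Return (HTTP method, API path) encoded in the docs URL fragment."""
    fragment = urlsplit(docs_url).fragment
    # The fragment has the shape: tag/<Tag>/paths/<escaped path>/<method>
    _, _, rest = fragment.partition("/paths/")
    escaped_path, _, method = rest.rpartition("/")
    # JSON Pointer escapes: "~1" stands for "/", "~0" stands for "~".
    path = escaped_path.replace("~1", "/").replace("~0", "~")
    return method.upper(), path


def build_request(api_key, prompt, base_url="https://api.stability.ai"):
    """Hypothetical helper: describe the request without sending it.

    base_url, the Bearer header, and the "prompt" field are assumptions.
    """
    method, path = endpoint_from_docs_url(DOCS_URL)
    return {
        "method": method,
        "url": base_url + path,
        "headers": {"Authorization": f"Bearer {api_key}"},
        "fields": {"prompt": prompt},
    }


if __name__ == "__main__":
    print(build_request("YOUR_STABILITY_API_KEY", "a lighthouse at dusk"))
```

Keeping the request as a plain dictionary makes it easy to inspect before handing it to whichever HTTP client you prefer.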

Stable Diffusion XL | Serverless

Image generation model, produced by stability.ai.

Model ID: accounts/fireworks/models/stable-diffusion-xl-1024-v1-0

Playground v2 1024 | Serverless

Playground v2 is a diffusion-based text-to-image generative model. The model was trained from scratch by the research team at playground.com.

Model ID: accounts/fireworks/models/playground-v2-1024px-aesthetic

Playground v2.5 1024 | Serverless

Playground v2.5 is a diffusion-based text-to-image generative model, and a successor to Playground v2.

Model ID: accounts/fireworks/models/playground-v2-5-1024px-aesthetic

Segmind Stable Diffusion 1B (SSD-1B) | Serverless

Image generation model. Distilled from Stable Diffusion XL 1.0 and 50% smaller.

Model ID: accounts/fireworks/models/SSD-1B

Japanese Stable Diffusion XL | Serverless

Japanese Stable Diffusion XL (JSDXL) is a Japanese-specific SDXL model that accepts prompts in Japanese and generates Japanese-style images.

Model ID: accounts/fireworks/models/japanese-stable-diffusion-xl

Stable Diffusion 3 Turbo | Serverless

Distilled, few-step version of Stable Diffusion 3, the newest image generation model from Stability AI, which equals or outperforms state-of-the-art text-to-image generation systems such as DALL-E 3 and Midjourney v6 in typography and prompt adherence, based on human preference evaluations. Stability AI has partnered with Fireworks AI, the fastest and most reliable API platform in the market, to deliver Stable Diffusion 3 and Stable Diffusion 3 Turbo. To use the API directly, visit https://platform.stability.ai/docs/api-reference#tag/Generate/paths/~1v2beta~1stable-image~1generate~1sd3/post

Model ID: accounts/stability/models/sd3-turbo

Language Models

Serverless models are hosted by Fireworks, so there is no need to configure hardware or deploy models. Usage is billed per token.