FLUX.2 [klein]

Generate and edit in less than a second with state-of-the-art quality. Available now via our API, or fine-tune and run it locally.

Blazing fast.

Sub-second inference*, making it 30%+ faster than any competing model.

* Inference time varies by model and hardware. See model comparison for more details.

Beautiful.

Production-quality visuals as fast as you can prompt.

Seeing is believing.

We built a free demo so you don't have to take our word for it.
No signup, no credit card.

Production ready.

Access [klein] through our platform in the way that works best for you.

API Access

Integrate FLUX.2 [klein] directly into your applications with our production-ready API.

Playground

Try FLUX.2 [klein] instantly in our interactive playground. No setup required.

Run it locally.
Built to fine-tune.

FLUX.2 [klein] runs on your hardware.
Choose from four variants optimized for different use cases.

ModelDescriptionLicenseInference Time
(GB200)
In seconds
Inference Time
(RTX5090)
In seconds
VRAM
FLUX.2 [klein] 9BOur distilled model. Outstanding quality at sub-second speed. Great for real-time generation while retaining quality. Marketing launch will focus on this model.FLUX Non-Commercial License~0.5~219.6 GB
FLUX.2 [klein] 9B BaseOur undistilled foundation model. Maximum flexibility and control. Great for fine-tuning.FLUX Non-Commercial License~6~3521.7 GB
FLUX.2 [klein] 4BThe fastest variant in the Klein family. Built for interactive applications, real-time previews, and latency-critical production use cases.Apache 2.0~0.3~1.28.4 GB
FLUX.2 [klein] 4B BaseA smaller foundation model with exceptional quality-to-size ratio. Ideal for local deployment, fine-tuning on limited hardware, and efficient generation and editing workflows.Apache 2.0~3~179.2 GB

Need help with enterprise deployment or custom fine-tuning?