Supported Matrix
Currently, cache-dit library supports almost Any Diffusion Transformers (with Transformer Blocks that match the specific Input and Output patterns ). Please check πExamples for more details. Here are just some of the tested models listed.
One Model Series may contain many pipelines. cache-dit applies optimizations at the Transformer level; thus,any pipelines that include the supported transformer are already supported by cache-dit. β
: supported now; βοΈ: not supported now; π€Q : nunchaku w/ SVDQ W4A4;
πModels: π€70+
Hybrid Cache
Context Parallel
Tensor Parallel
FLUX.2-Klein-9b-kv
βοΈ
βοΈ
β
FLUX.2-Klein-4B
β
β
β
FLUX.2-Klein-base-4B
β
β
β
FLUX.2-Klein-9B
β
β
β
FLUX.2-Klein-base-9B
β
β
β
Helios-Base
β
β
β
Helios-Mid
β
β
β
Helios-Distilled
β
β
β
FireRed-Image-Edit-1.0
β
β
β
FireRed-Image-Edit-1.1
β
β
β
GLM-Image-T2I
β
βοΈ
β
GLM-Image-I2I
β
βοΈ
β
Z-Image
β
β
β
LTX-2-I2V
β
β
β
LTX-2-T2V
β
β
β
Qwen-Image-2512
β
β
β
Z-Image-Turbo π€Q
β
β
βοΈ
Qwen-Image-Layered
β
β
β
Qwen-Image-Edit-2511-Lightning
β
β
β
Qwen-Image-Edit-2511
β
β
β
LongCat-Image
β
β
β
LongCat-Image-Edit
β
β
β
Z-Image-Turbo
β
β
β
Z-Image-Turbo-Fun-ControlNet-2.0
β
β
β
Z-Image-Turbo-Fun-ControlNet-2.1
β
β
β
Ovis-Image
β
β
β
FLUX.2-dev
β
β
β
FLUX.1-dev
β
β
β
FLUX.1-Fill-dev
β
β
β
FLUX.1-Kontext-dev
β
β
β
Qwen-Image
β
β
β
Qwen-Image-Edit
β
β
β
Qwen-Image-Edit-2509
β
β
β
Qwen-Image-ControlNet
β
β
β
Qwen-Image-ControlNet-Inpainting
β
β
β
Qwen-Image-Lightning
β
β
β
Qwen-Image-Edit-Lightning
β
β
β
Qwen-Image-Edit-2509-Lightning
β
β
β
Wan-2.2-T2V
β
β
β
Wan-2.2-I2V
β
β
β
Wan-2.2-VACE-Fun
β
β
β
Wan-2.1-T2V
β
β
β
Wan-2.1-I2V
β
β
β
Wan-2.1-FLF2V
β
β
β
Wan-2.1-VACE
β
β
β
HunyuanImage-2.1
β
β
β
HunyuanVideo-1.5
β
βοΈ
βοΈ
HunyuanVideo
β
β
β
FLUX.1-dev π€Q
β
β
βοΈ
FLUX.1-Fill-dev π€Q
β
β
βοΈ
FLUX.1-Kontext-dev π€Q
β
β
βοΈ
Qwen-Image π€Q
β
β
βοΈ
Qwen-Image-Edit π€Q
β
β
βοΈ
Qwen-Image-Edit-2509 π€Q
β
β
βοΈ
Qwen-Image-Lightning π€Q
β
β
βοΈ
Qwen-Image-Edit-Lightning π€Q
β
β
βοΈ
Qwen-Image-Edit-2509-Lightning π€Q
β
β
βοΈ
SkyReels-V2-T2V
β
β
β
LongCat-Video
β
βοΈ
βοΈ
ChronoEdit-14B
β
β
β
Kandinsky-5.0-T2V-Lite
β
β
οΈ
β
οΈ
PRX-512-t2i-sft
β
βοΈ
βοΈ
LTX-Video-v0.9.8
β
β
β
LTX-Video-v0.9.7
β
β
β
CogVideoX
β
β
β
CogVideoX-1.5
β
β
β
CogView-4
β
β
β
CogView-3-Plus
β
β
β
Chroma1-HD
β
β
β
PixArt-Sigma-XL-2-1024-MS
β
β
β
PixArt-XL-2-1024-MS
β
β
β
VisualCloze-512
β
β
β
ConsisID-preview
β
β
β
mochi-1-preview
β
βοΈ
β
Lumina-Image-2.0
β
βοΈ
β
HiDream-I1-Full
β
βοΈ
βοΈ
HunyuanDiT
β
βοΈ
β
Sana-1600M-1024px
β
βοΈ
βοΈ
DiT-XL-2-256
β
β
βοΈ
Allegro-T2V
β
βοΈ
βοΈ
OmniGen-2
β
βοΈ
βοΈ
stable-diffusion-3.5-large
β
βοΈ
β
Amused-512
β
βοΈ
βοΈ
AuraFlow
β
βοΈ
βοΈ
Text Encoder & VAE Optimization
πModels: π€70+
Text Encoder Parallel
AutoEncoder(VAE) Parallel
FLUX.2-Klein-9b-kv
β
β
FLUX.2-Klein-4B
β
β
FLUX.2-Klein-base-4B
β
β
FLUX.2-Klein-9B
β
β
FLUX.2-Klein-base-9B
β
β
FLUX.2-dev
β
β
Helios-Base
β
β
Helios-Mid
β
β
Helios-Distilled
β
β
FireRed-Image-Edit-1.0
β
β
FireRed-Image-Edit-1.1
β
β
GLM-Image-T2I
βοΈ
β
GLM-Image-I2I
βοΈ
β
Z-Image
β
β
LTX-2-I2V
β
β
LTX-2-T2V
β
β
Qwen-Image-2512
β
β
Z-Image-Turbo π€Q
β
β
Qwen-Image-Layered
β
β
Qwen-Image-Edit-2511-Lightning
β
β
Qwen-Image-Edit-2511
β
β
LongCat-Image
β
β
LongCat-Image-Edit
β
β
Z-Image-Turbo
β
β
Z-Image-Turbo-Fun-ControlNet-2.0
β
β
Z-Image-Turbo-Fun-ControlNet-2.1
β
β
Ovis-Image
β
β
FLUX.1-dev
β
β
FLUX.1-Fill-dev
β
β
FLUX.1-Kontext-dev
β
β
Qwen-Image
β
β
Qwen-Image-Edit
β
β
Qwen-Image-Edit-2509
β
β
Qwen-Image-ControlNet
β
β
Qwen-Image-ControlNet-Inpainting
β
β
Qwen-Image-Lightning
β
β
Qwen-Image-Edit-Lightning
β
β
Qwen-Image-Edit-2509-Lightning
β
β
Wan-2.2-T2V
β
β
Wan-2.2-I2V
β
β
Wan-2.2-VACE-Fun
β
β
Wan-2.1-T2V
β
β
Wan-2.1-I2V
β
β
Wan-2.1-FLF2V
β
β
Wan-2.1-VACE
β
β
HunyuanImage-2.1
β
βοΈ
HunyuanVideo-1.5
β
βοΈ
HunyuanVideo
β
β
FLUX.1-dev π€Q
β
β
FLUX.1-Fill-dev π€Q
β
β
FLUX.1-Kontext-dev π€Q
β
β
Qwen-Image π€Q
β
β
Qwen-Image-Edit π€Q
β
β
Qwen-Image-Edit-2509 π€Q
β
β
Qwen-Image-Lightning π€Q
β
β
Qwen-Image-Edit-Lightning π€Q
β
β
Qwen-Image-Edit-2509-Lightning π€Q
β
β
SkyReels-V2-T2V
β
β
ChronoEdit-14B
β
β
Kandinsky-5.0-T2V-Lite
β
β
PRX-512-t2i-sft
β
βοΈ
LTX-Video-v0.9.8
β
βοΈ
LTX-Video-v0.9.7
β
βοΈ
CogVideoX
β
βοΈ
CogVideoX-1.5
β
βοΈ
CogView-4
β
β
CogView-3-Plus
β
β
Chroma1-HD
β
β
PixArt-Sigma-XL-2-1024-MS
β
β
PixArt-XL-2-1024-MS
β
β
VisualCloze-512
β
β
ConsisID-preview
β
βοΈ
mochi-1-preview
β
βοΈ
Lumina-Image-2.0
β
β
HiDream-I1-Full
β
β
HunyuanDiT
β
β
Sana-1600M-1024px
β
βοΈ
DiT-XL-2-256
β
β
Allegro-T2V
β
βοΈ
OmniGen-2
β
β
stable-diffusion-3.5-large
βοΈ
β
Amused-512
β
βοΈ
AuraFlow
β
β
ControlNet Optimization
Models
ControlNet Parallel
Z-Image-Turbo-Fun-ControlNet-2.0
β
Z-Image-Turbo-Fun-ControlNet-2.1
β
Qwen-Image-ControlNet
TODO
Qwen-Image-ControlNet-Inpainting
TODO