Back to List
Z-Image Turbo Fun ControlNet Union
Multi-functional ControlNet released by Alibaba PAI, supporting Canny, HED, Depth, Pose and MLSD controls
ControlNet
Image Control
Multi-functional
PAI
Overview
This is a ControlNet model with 6 blocks added, trained from scratch on 1 million high-quality image dataset for 10,000 steps, supporting multiple control conditions.
Features
- Supports Canny, HED, Depth, Pose and MLSD controls
- 1328 resolution training
- BFloat16 precision
- control_context_scale adjustable control strength (0.65-0.80)
- Detailed prompts improve stability
Images
Pose control input example
Generation result based on pose control
Canny edge detection input
Canny edge control generation result
Installation
# Clone VideoX-Fun repository git clone https://github.com/aigc-apps/VideoX-Fun.git cd VideoX-Fun # Create model directories mkdir -p models/Diffusion_Transformer mkdir -p models/Personalized_Model # Download weights to corresponding directories
Usage
Run examples/z_image_fun/predict_t2i_control.py
Requirements
- Z-Image Turbo
- VideoX-Fun environment
- Additional preprocessing dependencies
- GPU memory support