DoodleAssist: Progressive Interactive Line Art Generation with Latent Distribution Alignment - TVCG 2025

[Paper] | [Paper (IEEE)] | [Project Page]

DoodleAssist is an interactive and progressive line art generation system controlled by sketches and prompts, which helps both experts and novices concretize their design intentions or explore possibilities.

Setup

1. Install Environment via Anaconda (Recommended)

conda create -n doodleassist python==3.10.15
conda activate doodleassist
pip install -r requirements.txt
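
After installation, a quick sanity check can confirm that the environment is usable. The following is a minimal sketch: the package names in `ASSUMED_PACKAGES` are an assumption about what requirements.txt installs, so adjust them to match the actual file.

```python
import importlib.util
import sys

# Assumed package list (not taken from requirements.txt); edit as needed.
ASSUMED_PACKAGES = ("torch", "diffusers", "gradio")


def check_environment(packages=ASSUMED_PACKAGES):
    """Return the running Python version and which packages are importable."""
    return {
        "python": "{}.{}.{}".format(*sys.version_info[:3]),
        "packages": {
            name: importlib.util.find_spec(name) is not None for name in packages
        },
    }


if __name__ == "__main__":
    report = check_environment()
    print("Python:", report["python"])
    for name, found in report["packages"].items():
        print(f"{name}: {'found' if found else 'MISSING'}")
```

Run it inside the activated `doodleassist` environment; any package reported as missing suggests the `pip install` step did not complete.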

2. Model Preparation

Checkpoint
- Download the checkpoint controlnext-48000.bin (13MB) here, and place it at ./checkpoint/controlnext-48000.bin.
Base model
- We use a Stable Diffusion 1.5 model from civitai.com that was fine-tuned on line art data. Download it (foolkatGODOF_v3.safetensors) here.
- Convert the safetensors file to the diffusers format using the following commands. The converted model is written to ./backbone/foolkatGODOF_v3/:

git clone https://github.com/huggingface/diffusers.git
cd diffusers
python scripts/convert_original_stable_diffusion_to_diffusers.py \
  --checkpoint_path your/path/to/foolkatGODOF_v3.safetensors \
  --dump_path your/path/to/DoodleAssist/backbone/foolkatGODOF_v3/ \
  --from_safetensors
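
Before launching either interface, it may help to confirm that both model artifacts ended up where the steps above place them. This is a small sketch, not part of the repository. It assumes it is run from the DoodleAssist root and relies on the fact that a converted diffusers pipeline directory contains a top-level `model_index.json`. The helper name `missing_model_files` is hypothetical.

```python
from pathlib import Path

# Paths named in the steps above, relative to the repository root.
CHECKPOINT = Path("checkpoint/controlnext-48000.bin")
BACKBONE_DIR = Path("backbone/foolkatGODOF_v3")


def missing_model_files(repo_root="."):
    """Return the expected model paths that do not exist under repo_root."""
    root = Path(repo_root)
    expected = [
        root / CHECKPOINT,
        # model_index.json is the top-level index of a diffusers pipeline directory.
        root / BACKBONE_DIR / "model_index.json",
    ]
    return [str(path) for path in expected if not path.exists()]


if __name__ == "__main__":
    missing = missing_model_files()
    if missing:
        print("Missing:", *missing, sep="\n  ")
    else:
        print("All expected model files are in place.")
```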

Gradio UI (Single-GPU)

We provide a Gradio demo so that the interface can be deployed without installing a web development environment. It integrates an SVG editor (SVG-edit) with our processing interface.

It requires ~13GB of GPU memory and can be deployed on a single NVIDIA 4090 GPU.
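
To see whether a machine meets the ~13GB figure, the total memory of each GPU can be read from `nvidia-smi`. The sketch below is an illustration only and its helper names are hypothetical. It compares total rather than currently free memory, so a GPU that is already in use may still fall short.

```python
import subprocess

REQUIRED_MIB = 13 * 1024  # ~13GB, the figure quoted above


def parse_memory_mib(nvidia_smi_output):
    """Parse one total-memory value in MiB per line of nvidia-smi CSV output."""
    return [
        int(line.strip()) for line in nvidia_smi_output.splitlines() if line.strip()
    ]


def gpus_with_enough_memory(totals_mib, required_mib=REQUIRED_MIB):
    """Return the indices of GPUs whose total memory meets the requirement."""
    return [index for index, total in enumerate(totals_mib) if total >= required_mib]


def query_total_memory_mib():
    """Ask nvidia-smi for each GPU's total memory (requires the NVIDIA driver)."""
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=memory.total", "--format=csv,noheader,nounits"],
        capture_output=True, text=True, check=True,
    )
    return parse_memory_mib(result.stdout)


if __name__ == "__main__":
    try:
        totals = query_total_memory_mib()
    except (FileNotFoundError, subprocess.CalledProcessError):
        print("nvidia-smi is unavailable; cannot check GPU memory.")
    else:
        print("GPUs with enough total memory:", gpus_with_enough_memory(totals))
```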

Linux Users

Use the following command:

python gradio_app.py

Then, open gradio_interface/app.html in a browser. Please use Google Chrome.

Refer to the tutorial here for instructions on using the interface.

Windows Users

First, choose a directory for the outputs. Then, use the following command:

python gradio_app.py --data_base your/selected/directory

Afterwards, open gradio_interface/app.html in a browser. Remember to save the SVG as untitled.svg in the selected directory.

Refer to the tutorial here for instructions on using the interface.

Web UI (Multi-GPU)

We also provide a web interface, which we deploy on four NVIDIA 4090 GPUs.

Run the following commands one by one to start a backend server on each GPU:

CUDA_VISIBLE_DEVICES=0 python web_app_server.py --port=9000
CUDA_VISIBLE_DEVICES=1 python web_app_server.py --port=9001
CUDA_VISIBLE_DEVICES=2 python web_app_server.py --port=9002
CUDA_VISIBLE_DEVICES=3 python web_app_server.py --port=9003
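
As an alternative to typing the four commands by hand, they could be generated and started from a small launcher script. The following is a sketch, not part of the repository. The helper names are hypothetical, and it assumes the script is run from the repository root so that `web_app_server.py` resolves.

```python
import os
import subprocess
import sys

BASE_PORT = 9000  # first port used in the commands above


def build_server_jobs(num_gpus=4, base_port=BASE_PORT):
    """Return one (command, extra_env) pair per GPU, mirroring the manual commands."""
    jobs = []
    for gpu in range(num_gpus):
        command = [sys.executable, "web_app_server.py", f"--port={base_port + gpu}"]
        jobs.append((command, {"CUDA_VISIBLE_DEVICES": str(gpu)}))
    return jobs


def launch_all(jobs):
    """Start every server as a child process and return the Popen handles."""
    return [
        subprocess.Popen(command, env={**os.environ, **extra_env})
        for command, extra_env in jobs
    ]


if __name__ == "__main__":
    # Dry run: print what would be started. Call launch_all(build_server_jobs())
    # from the repository root to actually start the servers.
    for command, extra_env in build_server_jobs():
        print(extra_env, " ".join(command))
```

Each job pins one process to one GPU through `CUDA_VISIBLE_DEVICES`, which is the same mechanism the manual commands use.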

Make sure Node.js and npm are installed first (ask AI 😄). Then, set up the frontend web UI (Vue 2) using the following commands:

cd web_interface/
npm install
npm run serve

Afterwards, open http://localhost:8080/ in any browser.
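
If the page does not load or does not respond, checking whether each service is accepting connections can narrow down the cause. This is a minimal sketch with a hypothetical helper, `port_is_open`. It assumes that the backends and the frontend dev server all listen on localhost at the ports shown above.

```python
import socket


def port_is_open(host, port, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    # Ports taken from the commands above: four backends plus the frontend server.
    for port in (9000, 9001, 9002, 9003, 8080):
        state = "listening" if port_is_open("localhost", port) else "not reachable"
        print(f"localhost:{port} {state}")
```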

Refer to the video for instructions on using the interface.

Citation

If you use the code and models, please cite:

@article{mo2025doodleassist,
  title={DoodleAssist: Progressive Interactive Line Art Generation with Latent Distribution Alignment},
  author={Mo, Haoran and Shen, Yulin and Simo-Serra, Edgar and Wang, Zeyu},
  journal={IEEE Transactions on Visualization and Computer Graphics},
  year={2025},
  publisher={IEEE}
}

Acknowledgements

This work builds on ControlNeXt and the SketchMan dataset. We would like to thank their authors.
