ACL 2025 Oral Presentation
Bocheng Li*, Zhujin Gao*, and Linli Xu
* equal contribution
tl;dr We introduce NeoDiff, a continuous text diffusion model that integrates the per-token flexibility of discrete diffusion with the precise control of continuous diffusion using a non-simultaneous Poisson process and a semantically-aware time predictor with optimized noise schedule.
-
Clone the repository:
git clone https://github.com/bc-li/neodiff.git cd neodiff -
Set up the environment and install dependencies:
# Get fairseq-0.12.2 wget https://github.com/facebookresearch/fairseq/archive/refs/tags/v0.12.2.zip unzip v0.12.2.zip # Create a new conda environment conda create -n neodiff python=3.8 # Activate the environment conda activate neodiff # Install PyTorch 1.11 conda install pytorch=1.11.0 -c pytorch # Downgrade MKL to resolve compatibility issues pip install mkl==2024.0.0 # Install other dependencies pip install -r requirements.txt
-
Prepare the IWSLT14 DE-EN dataset by following the instructions in fairseq.
Note: Knowledge distillation is not required; use the default settings. -
Place the preprocessed data in the
data-binfolder.
Train the NeoDiff model on the IWSLT14 DE-EN dataset using a single GPU:
bash scripts/iwslt14_de_en/train_neodiff.shOptimize the time schedule for better performance using Bayesian optimization:
bash scripts/iwslt14_de_en/optimize_neodiff.shEvaluate the trained model on IWSLT14 DE-EN dataset using default setting:
bash scripts/iwslt14_de_en/evaluate_neodiff.shEvaluate the trained model on IWSLT14 DE-EN dataset using LLM-based evaluation:
bash scripts/iwslt14_de_en/llm_eval.py <path_to_generated_file.txt>If you find our work useful, please consider citing:
@inproceedings{li-etal-2025-unifying,
title = "Unifying Continuous and Discrete Text Diffusion with Non-simultaneous Diffusion Processes",
author = "Li, Bocheng and
Gao, Zhujin and
Xu, Linli",
editor = "Che, Wanxiang and
Nabende, Joyce and
Shutova, Ekaterina and
Pilehvar, Mohammad Taher",
booktitle = "Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
month = jul,
year = "2025",
address = "Vienna, Austria",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.acl-long.565/",
doi = "10.18653/v1/2025.acl-long.565",
pages = "11530--11551",
}