Back Home

Publications

Selected and preprint work spanning diffusion model quantization, efficient reasoning, LLM ternarization, and sampling for masked diffusion models. * indicates equal contribution.

ICML 2026 · 2026

RobuQ: Pushing DiTs to W1.58A2 via Robust Activation Quantization

Kaicheng Yang*, Xun Zhang*, Haotong Qin, Yucheng Lin, Kaisen Yang, Yulun Zhang

ICML 2026 · 2026

Q-DiT4SR: Exploration of Detail-Preserving Diffusion Transformer Quantization for Real-World Image Super-Resolution

Xun Zhang*, Kaicheng Yang*, Hongliang Lu, Haotong Qin, Yong Guo, Yulun Zhang

ICML 2026 · 2026

Improving Sampling for Masked Diffusion Models via Information Gain

Kaisen Yang, Jayden Teoh, Kaicheng Yang, Yitong Zhang, Alex Lamb

ICLR 2026 · 2026

PT^2-LLM: Post-Training Ternarization for Large Language Models

Xianglong Yan*, Chengzhu Bao*, Zhiteng Li, Tianao Zhang, Kaicheng Yang, Haotong Qin, Ruobing Xie, Xingwu Sun, Yulun Zhang

ICML 2025 · 2025

BiMaCoSR: Binary One-Step Diffusion Model Leveraging Flexible Matrix Compression for Real Super-Resolution

Kai Liu*, Kaicheng Yang*, Zheng Chen, Zhiteng Li, Yong Guo, Wenbo Li, Linghe Kong, Yulun Zhang

arXiv 2026 · 2026

AdaTSQ: Pushing the Pareto Frontier of Diffusion Transformers via Temporal-Sensitivity Quantization

Shaoqiu Zhang*, Zizhong Ding*, Kaicheng Yang, Junyi Wu, Xianglong Yan, Xi Li, Bingnan Duan, Jianping Fang, Yulun Zhang

arXiv 2025 · 2025

TreeQ: Pushing the Quantization Boundary of Diffusion Transformer via Tree-Structured Mixed-Precision Search

Kaicheng Yang, Kaisen Yang, Baiting Wu, Xun Zhang, Qianrui Yang, Haotong Qin, He Zhang, Yulun Zhang

arXiv 2025 · 2025

QArtSR: Quantization via Reverse-Module and Timestep-Retraining in One-Step Diffusion based Image Super-Resolution

Libo Zhu, Haotong Qin, Kaicheng Yang, Wenbo Li, Yong Guo, Yulun Zhang, Susanto Rahardj, Xiaokang Yang