Back HomeResearch
My work has two long-term threads: using Efficient ML to identify the necessary structures that intelligence truly depends on, and using AI4Human to understand the human brain and expand the interaction bandwidth between carbon-based and silicon-based intelligence. I am also interested in other generative models that feel intellectually alive.
View Publications
Efficient ML
I treat compression as a scientific probe. Quantization, sparsity, and low-rank structure are not only engineering tools; they reveal which parts of a model remain essential after redundancy is stripped away. Those surviving structures are clues for next-generation AI architectures.
AI4Human
I want to use AI to understand the human brain, explore next-generation human-computer interaction, and increase the communication bandwidth between carbon-based and silicon-based intelligence.
Others
I am interested in other compelling generative models as well, especially when they reveal new ways to sample, reason, or organize uncertainty.
Representative Efficient ML Papers
ICML 2026 · 2026
RobuQ: Pushing DiTs to W1.58A2 via Robust Activation Quantization
Kaicheng Yang*, Xun Zhang*, Haotong Qin, Yucheng Lin, Kaisen Yang, Yulun Zhang
ICML 2026 · 2026
Q-DiT4SR: Exploration of Detail-Preserving Diffusion Transformer Quantization for Real-World Image Super-Resolution
Xun Zhang*, Kaicheng Yang*, Hongliang Lu, Haotong Qin, Yong Guo, Yulun Zhang
ICML 2025 · 2025
BiMaCoSR: Binary One-Step Diffusion Model Leveraging Flexible Matrix Compression for Real Super-Resolution
Kai Liu*, Kaicheng Yang*, Zheng Chen, Zhiteng Li, Yong Guo, Wenbo Li, Linghe Kong, Yulun Zhang
arXiv 2025 · 2025
TreeQ: Pushing the Quantization Boundary of Diffusion Transformer via Tree-Structured Mixed-Precision Search
Kaicheng Yang, Kaisen Yang, Baiting Wu, Xun Zhang, Qianrui Yang, Haotong Qin, He Zhang, Yulun Zhang
AI4Human
I am working on it.
Others
ICML 2026 · 2026
Improving Sampling for Masked Diffusion Models via Information Gain
Kaisen Yang, Jayden Teoh, Kaicheng Yang, Yitong Zhang, Alex Lamb