RobuQ: Pushing DiTs to W1.58A2 via Robust Activation Quantization
Kaicheng Yang*, Xun Zhang*, Haotong Qin, Yucheng Lin, Kaisen Yang, Yulun Zhang
Selected publications and preprints spanning diffusion model quantization, efficient reasoning, LLM ternarization, and sampling for masked diffusion models. * indicates equal contribution.
Kaicheng Yang*, Xun Zhang*, Haotong Qin, Yucheng Lin, Kaisen Yang, Yulun Zhang
Xun Zhang*, Kaicheng Yang*, Hongliang Lu, Haotong Qin, Yong Guo, Yulun Zhang
Kaisen Yang, Jayden Teoh, Kaicheng Yang, Yitong Zhang, Alex Lamb
Xianglong Yan*, Chengzhu Bao*, Zhiteng Li, Tianao Zhang, Kaicheng Yang, Haotong Qin, Ruobing Xie, Xingwu Sun, Yulun Zhang
Kai Liu*, Kaicheng Yang*, Zheng Chen, Zhiteng Li, Yong Guo, Wenbo Li, Linghe Kong, Yulun Zhang
Shaoqiu Zhang*, Zizhong Ding*, Kaicheng Yang, Junyi Wu, Xianglong Yan, Xi Li, Bingnan Duan, Jianping Fang, Yulun Zhang
Kaicheng Yang, Kaisen Yang, Baiting Wu, Xun Zhang, Qianrui Yang, Haotong Qin, He Zhang, Yulun Zhang
Libo Zhu, Haotong Qin, Kaicheng Yang, Wenbo Li, Yong Guo, Yulun Zhang, Susanto Rahardja, Xiaokang Yang