Hello! I am a senior researcher in the Natural Language Computing group at Microsoft Research Asia, working with Dr. Shujie Liu, Dr. Furu Wei, and Dr. Ming Zhou.
Before joining MSRA, I received my Ph.D. in 2020 from NLPR of CASIA under the supervision of Prof. Chengqing Zong, working with Prof. Jiajun Zhang.
My research interests include natural/spoken/programming language processing and multi-modal large language models.
News
Selected Publications
Spoken Language Processing
- SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing. Junyi Ao, Rui Wang, Long Zhou, Chengyi Wang, Shuo Ren, Yu Wu, Shujie Liu, Tom Ko, Qing Li, Yu Zhang, Zhihua Wei, Yao Qian, Jinyu Li, Furu Wei. ACL 2022.
- WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing. Sanyuan Chen, Chengyi Wang, Zhengyang Chen, Yu Wu, Shujie Liu, Zhuo Chen, Jinyu Li, Naoyuki Kanda, Takuya Yoshioka, Xiong Xiao, Jian Wu, Long Zhou, Shuo Ren, Yanmin Qian, Yao Qian, Jian Wu, Michael Zeng, Xiangzhan Yu, Furu Wei. IEEE Journal of Selected Topics in Signal Processing, 2022.
- SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training. Ziqiang Zhang, Long Zhou, Junyi Ao, Shujie Liu, Lirong Dai, Jinyu Li, Furu Wei. EMNLP 2022.
- SpeechLM: Enhanced Speech Pre-Training with Unpaired Textual Data. Ziqiang Zhang, Sanyuan Chen, Long Zhou, Yu Wu, Shuo Ren, Shujie Liu, Zhuoyuan Yao, Xun Gong, Lirong Dai, Jinyu Li, Furu Wei. IEEE/ACM Transactions on Audio, Speech, and Language Processing.
- VATLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning. Qiushi Zhu, Long Zhou, Ziqiang Zhang, Shujie Liu, Binxing Jiao, Jie Zhang, Lirong Dai, Daxin Jiang, Jinyu Li, Furu Wei. IEEE Transactions on Multimedia.
- A Configurable Multilingual Model is All You Need to Recognize All Languages. Long Zhou, Jinyu Li, Eric Sun, Shujie Liu. ICASSP 2022.
- Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling. Ziqiang Zhang, Long Zhou, Chengyi Wang, Sanyuan Chen, Yu Wu, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei. arXiv 2023.
Programming Language Processing
- GraphCodeBERT: Pre-training Code Representations with Data Flow. Daya Guo, Shuo Ren, Shuai Lu, Zhangyin Feng, Duyu Tang, Shujie Liu, Long Zhou, Nan Duan, Alexey Svyatkovskiy, Shengyu Fu, Michele Tufano, Shao Kun Deng, Colin Clement, Dawn Drain, Neel Sundaresan, Jian Yin, Daxin Jiang, Ming Zhou. ICLR 2021.
- CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation. Shuai Lu, Daya Guo, Shuo Ren, Junjie Huang, Alexey Svyatkovskiy, Ambrosio Blanco, Colin Clement, Dawn Drain, Daxin Jiang, Duyu Tang, Ge Li, Lidong Zhou, Linjun Shou, Long Zhou, Michele Tufano, Ming Gong, Ming Zhou, Nan Duan, Neel Sundaresan, Shao Kun Deng, Shengyu Fu, Shujie Liu. NeurIPS 2021 Datasets and Benchmarks Track.
- CodeBLEU: A Method for Automatic Evaluation of Code Synthesis. Shuo Ren, Daya Guo, Shuai Lu, Long Zhou, Shujie Liu, Duyu Tang, Neel Sundaresan, Ming Zhou, Ambrosio Blanco, Shuai Ma. arXiv 2020.
- Jointly Learning to Repair Code and Generate Commit Message. Jiaqi Bai, Long Zhou, Ambrosio Blanco, Shujie Liu, Furu Wei, Ming Zhou, Zhoujun Li. EMNLP 2021.
- Grammar-Based Patches Generation for Automated Program Repair. Yu Tang, Long Zhou, Ambrosio Blanco, Shujie Liu, Furu Wei, Ming Zhou, Muyun Yang. Findings of ACL 2021.
Natural Language Processing
- Neural System Combination for Machine Translation. Long Zhou, Wenpeng Hu, Jiajun Zhang, Chengqing Zong. ACL 2017.
- Look-ahead Attention for Generation in Neural Machine Translation. Long Zhou, Jiajun Zhang, Chengqing Zong. NLPCC 2017 (Best Paper Award).
- Synchronous Bidirectional Neural Machine Translation. Long Zhou, Jiajun Zhang, Chengqing Zong. TACL 2019.
- Sequence Generation: From Both Sides to the Middle. Long Zhou, Jiajun Zhang, Chengqing Zong, Heng Yu. IJCAI 2019.
- A Compact and Language-Sensitive Multilingual Translation Method. Yining Wang, Long Zhou, Jiajun Zhang, Feifei Zhai, Jingfang Xu, Chengqing Zong. ACL 2019.
- Synchronous Bidirectional Inference for Neural Sequence Generation. Jiajun Zhang, Long Zhou, Yang Zhao, Chengqing Zong. Journal of Artificial Intelligence, 2020.
- SemFace: Pre-training Encoder and Decoder with a Semantic Interface for Neural Machine Translation. Shuo Ren, Long Zhou, Shujie Liu, Furu Wei, Ming Zhou, Shuai Ma. ACL 2021.
Software
Awards
- UNESCO Netexplo Innovation Award for VALL-E, 2023
- Nomination Award for Excellent Doctoral Thesis of Chinese Information Processing Society, 2020
- CAS Presidential Scholarship, 2020
- NVIDIA Scholarship, 2018
- NLPCC 2017 Best Paper Award, 2017
- Outstanding University Graduate of Sichuan Province, 2015 (Top 1%)
- National Scholarship, 2012/2013/2014 (Top 2%)
Links