I am an Associate Professor in National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Sciences. I obtained my Ph.D. in from NLPR in 2019. My advisor is Prof. Chengqing Zong. Prior to that, I got my Bachelor and Master degree in Beijing Jiaotong University.
My research interests include natural language processing, machine translation and knowledge graph. I have published more than 30 papers at top-tier conferences and journals including TASLP, ACL, IJCAI, EMNLP, COLING etc. I also have served as publication co-chairs for COLING-2020 and PC Members for many AI conferences. We win the Best paper ward for CCMT 2023.
PhD in Pattern Recognition, 2015-2019
Institute of Automation Chinese Academy of Sciences
MEng in Automatics, 2012-2015
Beijing Jiaotong University
BSc in Automatics, 2008-2012
Beijing Jiaotong University
Our Chinese book 《自然语言处理基础与大模型-案例与实践》(The Fundamentals of Natural Language Processing and Larg Models: Cases and Practices) has been published.
I have one paper accepted by NAACL 2024.
I have one paper accepted by COLING 2024.
I have one paper accepted by ICASSP 2024.
I have three papers accepted by EMNLP findings 2023.
Our paper wins the Best paper award of CCMT 2023.
I have one paper accepted by ACL 2023.
Cong Ma, Yaping Zhang, Yang Zhao, Yu Zhou, and Chengqing Zong. 2023. Vector Quantization Knowledge Transfer For End-to-end text image machine translation. Accepted by 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024).
Yupu Liang, Yaping Zhang, Cong Ma, Zhiyang Zhang, Yang Zhao, Lu Xiang, Chengqing Zong, Yu Zhou. Document Image Machine Translation with Dynamic Multi-pre-trained Models Assembling. Accepted by The 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024). Mexico City, Mexico. June 16-21, 2024.
Cong Ma, Yaping Zhang, Zhiyang Zhang, Yupu Liang, Yang Zhao, Yu Zhou, Chengqing Zong. Born a BabNet with Hierarchical Parental Supervision for End-to-End Text Image Machine Translation. Accepted by The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024). Torino, Italia. May 20-25, 2024.
Yang Zhao, Jiajun Zhang, Chengqing Zong. 2023. Transformer: A General Framework from Machine Translation to Others. Machine Intelligence Research 20, 514–538 (2023). https://doi.org/10.1007/s11633-022-1393-5.
Lu Xiang, Yang Zhao, Junnan Zhu, Yu Zhou, Chengqing Zong. 2023. Zero-shot language extension for dialogue state tracking via pre-trained models and multi-auxiliary-tasks fine-tuning. Knowledge-Based Systems, Volume 259, 2023, 110015.
Cong Ma, Xu Han, Linghui Wu, Yaping Zhang, Yang Zhao, Yu Zhou, and Chengqing Zong. 2023. Modal Contrastive Learning based End-to-End Text Image Machine Translation. IEEE/ACM Transactions on Audio, Speech, and Language Processing, doi: 10.1109/TASLP.2023.3324540.
Cong Ma, Yaping Zhang, Mei Tu, Yang Zhao, Yu Zhou, and Chengqing Zong. 2023. CCIM: Cross-modal Cross-lingual Interactive Image Translation. In Findings of the Association for Computational Linguistics: EMNLP 2023 (EMNLP Findings 2023), pages 4959–4965.
Zixuan Ren, Yang Zhao, and Chengqing Zong. 2023. Towards Informative Open-ended Text Generation with Dynamic Knowledge Triples. In Findings of the Association for Computational Linguistics: EMNLP 2023 (EMNLP Findings 2023), pages 3189–3203.
Zhiyang Zhang, Yaping Zhang, Yupu Liang, Lu Xiang, Yang Zhao, Yu Zhou, and Chengqing Zong. 2023. LayoutDIT: Layout-Aware End-to-End Document Image Translation with Multi-Step Conductive Decoder. In Findings of the Association for Computational Linguistics: EMNLP 2023 (EMNLP Findings 2023), pages 10043–10053.
Rongchuan Tang, Yang Zhao, Chengqing Zong, and Yu Zhou. 2023. Multilingual Knowledge Graph Completion with Language-Sensitive Multi-Graph Attention. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), pages 10508–10519.
Cong Ma, Yaping Zhang, Mei Tu, Yang Zhao, Yu Zhou, Chengqing Zong (2023). E2TIMT: Efficient and Effective Modal Adapter for Text Image Machine Translation. In Proceedings of the 17th International Conference on Document Analysis and Recognition (ICDAR 2023).
Cong Ma, Yaping Zhang, Mei Tu, Yang Zhao, Yu Zhou, Chengqing Zong (2023). Multi-Teacher Knowledge Distillation For Text Image Machine Translation. In Proceedings of the 17th International Conference on Document Analysis and Recognition (ICDAR 2023).
Zhiyang Zhang, Yaping Zhang, Lu Xiang, Yang Zhao, Yu Zhou. A Novel Dataset and Benchmark Analysis on Document Image Translation. : In Proceedings of the 19th China Conference on Machine Translation (CCMT 2023) (Best paper award).
Yang Zhao, Junnan Zhu, Lu Xiang, Jiajun Zhang, Yu Zhou, Feifei Zhai, Chengqing Zong. 2022. Life-long Learning for Multilingual Neural Machine Translation with Knowledge Distillation. arXiv preprint arXiv:2212.02800.
Xiaomian Kang, Yang Zhao, Jiajun Zhang, Chengqing Zong (2022). Enhancing Lexical Translation Consistency for Document-Level Neural Machine Translation. ACM Transactions on Asian and Low-Resource Language Information Processing, 21, 3, Article 59 (May 2022), 21 pages. https://doi.org/10.1145/3485469.
Cong Ma, Yaping Zhang, Mei Tu, Xu Han, Linghui Wu, Yang Zhao, Yu Zhou. 2022. Improving End-to-End Text Image Translation From the Auxiliary Text Translation Task. In Proceedings of the 26TH International Conference on Pattern Recognition (ICPR 2022).
Mei Li, Lu Xiang, Xiaomian Kang, Yang Zhao, Yu Zhou, Chengqing Zong (2021). Medical Term and Status Generation From Chinese Clinical Dialogue With Multi-Granularity Transformer. IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 29, pp. 3362-3374, 2021, doi: 10.1109/TASLP.2021.3122301.
Lu Xiang, Junnan Zhu, Yang Zhao, Yu Zhou, Chengqing Zong (2021). Robust cross-lingual task-oriented dialogue. ACM Transactions on Asian and Low-Resource Language Information Processing, 20, 6, Article 93 (November 2021), 24 pages. https://doi.org/10.1145/3457571
Hao He, Qian Wang, Zhipeng Yu, Yang Zhao, Jiajun Zhang, Chengqing Zong (2021). Synchronous interactive decoding for multilingual neural machine translation. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAA 2021), 35(14), 12981-12988. https://doi.org/10.1609/aaai.v35i14.17535
Lu Xiang, Yang Zhao, Junnan Zhu, Yu Zhou, Chengqing Zong (2021). Zero-Shot Deployment for Cross-Lingual Dialogue System. In CCF International Conference on Natural Language Processing and Chinese Computing (NLPCC 2021) . vol 13029. Springer. https://doi.org/10.1007/978-3-030-88483-3_15.
Jiajun Zhang, Long Zhou, Yang Zhao, Chengqing Zong. 2020. Synchronous bidirectional inference for neural sequence generation. Artificial Intelligence, Volume 281, 2020, 103234, ISSN 0004-3702, https://doi.org/10.1016/j.artint.2020.103234.
Feng Wang, Juan Du, Yang Zhao, Tao Tang, Jianjun Shi. 2020. A deep learning based data fusion method for degradation modeling and prognostics. IEEE Transactions on Reliability. vol. 70, no. 2, pp. 775-789, June 2021, doi: 10.1109/TR.2020.3011500.
Yang Zhao, Jiajun Zhang, Yu Zhou, Chengqing Zong. 2020. Knowledge graphs enhanced neural machine translation. In Proceedings of the Twenty-Ninth International Conference on International Joint Conferences on Artificial Intelligence (IJCAI 2020). Article 559, 4039–4045.
Yang Zhao, Lu Xiang, Junnan Zhu, Jiajun Zhang, Yu Zhou, Chengqing Zong. 2020. Knowledge graph enhanced neural machine translation via multi-task learning on sub-entity granularity. In Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), pages 4495–4505, Barcelona, Spain (Online). International Committee on Computational Linguistics.
Xiaomian Kang, Yang Zhao, Jiajun Zhang, Chengqing Zong. 2020. Dynamic context selection for document-level neural machine translation via reinforcement learning. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), pages 2242–2254, Online. Association for Computational Linguistics.
Long Zhou, Jiajun Zhang, Yang Zhao, Chengqing Zong. 2020. Non-autoregressive neural machine translation with distortion model. In CCF International Conference on Natural Language Processing and Chinese Computing (NLPCC 2020). vol 12430. Springer. https://doi.org/10.1007/978-3-030-60450-9_32.
Qian Wang, Yuchen Liu, Cong Ma, Yu Lu, Yining Wang, Long Zhou, Yang Zhao, Jiajun Zhang, Chengqing Zong. 2020. CASIA’s System for IWSLT 2020 Open Domain Translation. In Proceedings of the 17th International Conference on Spoken Language Translation, pages 130–139, Online. Association for Computational Linguistics.
Yang Zhao, Jiajun Zhang, Chengqing Zong, Zhongjun He, Hua Wu. 2019. Addressing the under-translation problem from the entropy perspective. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2019), 33(01), 451-458. https://doi.org/10.1609/aaai.v33i01.3301451
Jiajun Zhang, Yang Zhao, Haoran Li, Chengqing Zong. 2018. Attention with sparsity regularization for neural machine translation and summarization. IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 27, no. 3, pp. 507-518, March 2019, doi: 10.1109/TASLP.2018.2883740.
Yang Zhao, Jiajun Zhang, Zhongjun He, Chengqing Zong, Hua Wu. 2018. Addressing troublesome words in neural machine translation. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP 2018), pages 391–400, Brussels, Belgium. Association for Computational Linguistics.
Yang Zhao, Yining Wang, Jiajun Zhang, Chengqing Zong. 2018. Phrase table as recommendation memory for neural machine translation. In Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI 2018), AAAI Press, 4609–4615.
Yang Zhao, Jiajun Zhang and Chengqing Zong. 2018. Exploiting pre-ordering for neural machine translation. In Proceedings of the eleventh international conference on language resources and evaluation (lrec 2018), pages 893–899.
Yuchen Liu, Long Zhou, Yining Wang, Yang Zhao, Jiajun Zhang, and Chengqing Zong. 2018. A comparable study on model averaging, ensembling and reranking in nmt. In CCF International Conference on Natural Language Processing and Chinese Computing (NLPCC 2018). vol 11109. Springer. https://doi.org/10.1007/978-3-319-99501-4_26.
Yang Zhao, Yining Wang, Jiajun Zhang, and Chengqing Zong. 2017. Cost-aware learning rate for neural machine translation. Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data (CCL 2017), vol 10565. Springer. https://doi.org/10.1007/978-3-319-69005-6_8
Yining Wang, Yang Zhao, Jiajun Zhang, Chengqing Zong, and Zhengshan Xue. 2017. Towards Neural Machine Translation with Partially Aligned Corpora. In Proceedings of the Eighth International Joint Conference on Natural Language Processing (IJCNLP 2017), pages 384–393, Taipei, Taiwan. Asian Federation of Natural Language Processing.
Yang Zhao, Tian-hua Xu, Hai-feng Wang. 2015. Text mining based fault diagnosis of vehicle on-board equipment for high speed railway. In Proceedings of the 17th International IEEE Conference on Intelligent Transportation Systems (ITSC 2015).
Publication Co-Chairs: COLING 2020
Program Committee: ACL (2019-2021), EMNLP (2019-2022), COLING (2022), IJCAI (2020-2021), AAAI(2020-2021)
Journal Reviewer: IEEE/ACM TASLP, MIR.