a photo    

王 瑞 (Wang, Rui)

上海交通大学计算机系 副教授 & 博士生导师
Associate Professor & Ph.D. Advisor
Department of Computer Science and Engineering
Shanghai Jiao Tong University

Email: wangrui12 (as you know) sjtu.edu.cn
@EMNLP/WMT-2018

Biography

Dr. Rui Wang is a computational linguist working as an associate professor and he leads the machine translation lab at Shanghai Jiao Tong University since 2021. Before that, he was a researcher (tenured in 2020) at Japan National Institute of Information and Communications Technology (NICT) from 2016 to 2020. His research interest is NLP, especially machine translation (MT). He has published more than 40 papers in top-tier NLP/ML/AI conferences and journals, such as ACL, EMNLP, ICLR, AAAI, IJCAI, TPAMI, TASLP, etc. He has also won several first places in top-tier MT/NLP shared tasks, such as WMT-2018, WMT-2019, WMT-2020, CoNLL-2019, etc. He served as the area chairs of ICLR-2021/2022, NAACL-2021, and CoNLL-2021. He gave cutting-edge tutorials at EACL-2021 and EMNLP-2021.


Machine Translation Lab @SJTU 上海交通大学机器翻译研究室

I am always fortunate to work with these brilliant researchers and Ph.D./master/undergraduate students. Please send your CV and research proposal (optional) to me if you want to join us.
非常有幸能和这样一群优秀的年轻人共事,希望我们能从彼此身上学习到有意义和有意思的东西,共同进步!
对于希望保研/直博/考研加入实验室攻读学位的同学,请在取得交大录取资格后通过邮件与我联系,邮件内容包括但不限于CV,成绩单和research proposal。
对于希望进入实验室从事机器翻译和NLP研究的本科生同学,欢迎与我邮件联系(最好有学习/旁听过我讲授的NLP/数据挖掘等课程基础),邮件内容包括但不限于CV,成绩单和能够专注科研的时间段。

@SJTU

Students:

Zhiwei He (Ph.D. Student, 2021-)
Pingchuan Ma (Master Student, 2021-)
Ruize Gao (Master Student, 2021-)
Hongkun Hao (Undergraduate Student, 2021; Master Student, 2022-)
Xiaoyi Bao (Undergraduate Student, 2021)

@NICT

Collaborator:

Kehai Chen (Researcher, 2018-2020)

Interns:

Shintaro Harada (Matser Student, NAIST, Japan, 2020)
Chaoqun Duan (Ph.D. Student, HIT, China, 2019-2020)
Fengshun Xiao (Matser Student, SJTU, China, 2019-2020)
Zhuosheng Zhang (Matser Student, SJTU, China, 2019-2020)
Zuchao Li (Ph.D. Student, SJTU, China, 2019)
Mingming Yang (Ph.D. Student, Soochow University, China, 2018-2019)
Shu Jiang (Ph.D. Student, SJTU, China, 2018)
Haipeng Sun (Ph.D. Student, HIT, China, 2018-2020)
Zhisong Zhang (Matser Student, SJTU, China, 2017-2018)
Kehai Chen (Ph.D. Student, HIT, China, 2017-2018)

Teaching

Lecture

CS3602: Natural Language Processing for the AI major, 2021 Fall

CS247: Data Mining for the CS major, 2021 Fall

CS438: Information Extraction for the CS major, 2021 Fall

Tutorial

Advances and Challenges in Unsupervised Neural Machine Translation.
    Rui Wang and Hai Zhao
    16th conference of the European Chapter of the Association for Computational Linguistics (EACL-Tutorial), 2021

Syntax in End-to-End Natural Language Processing
    Hai Zhao, Rui Wang, and Kehai Chen
    The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP-Tutorial), 2021

Domain Adaptation for Neural Machine Translation
    Chenhui Chu and Rui Wang
    The 16th China Conference on Machine Translation (CCMT-Tutorial), 2019
    --You also refer to our survey paper in [COLING-2018]


Academic Services

Area Chairs: ICLR-2021/2022, NAACL-2021, CoNLL-2021, CCL-2019, and CCL-2018
Organization Chairs: PACLIC-29 and YCCL-2012
PC Members: ACL, EMNLP, NAACL, ICLR, AAAI, IJCAI, etc.
Reviewers: CL, TACL, IEEE TASLP, etc.

Shared Tasks

WMT-2020: 1st in three tasks (supervised English->Chinese, supervised Polish->English, and unsupervised/low-resource German-Upper Sorbian) [Results][Paper]
CoNLL-2019: 1st in the DM sub-task and the 2nd overall [Results][Paper]
WMT-2019: 1st in the only unsupervised MT task (German-Czech) [Results] [Paper]
WAT-2018: 1st places in Myanmar (Burmese) <- English [Results][Paper]
WMT-2018: 1st places in four tasks (English<->Estonian and English<->Finnish) [Results][Paper]


Fundings

2022-2025: PI of NSFC General Program: "Research on Multilingual Unsupervised Machine Translation" (6217020129)
2021-2023: PI of Shanghai Pujiang Program:"Research on the Key Problem of Unsupervised Machine Translation" (21PJ1406800)
2021-2022: PI of CCF-Tencent Rhino-Bird Young Faculty Open Research Fund: "Research on Low-resource Machine Translation" (RAGR20210119)
2019-2020: PI of Japan national fund (JSPS) for early-career scientists: "Unsupervised Neural Machine Translation in Universal Scenarios" (19K20354)

Selected Publication [Google Scholar] [DBLP]

Note: If you have a technical question about a research paper, it is best to try to get in contact with all of the authors, so the most appropriate person can respond as quickly as possible.

2021

Advances and Challenges in Unsupervised Neural Machine Translation
    Rui Wang and Hai Zhao
    16th conference of the European Chapter of the Association for Computational Linguistics (EACL-Tutorial), 2021

Syntax in End-to-End Natural Language Processing
    Hai Zhao, Rui Wang, and Kehai Chen
    The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP-Tutorial), 2021

Self-Training for Unsupervised Neural Machine Translation in Unbalanced Training Data Scenarios
    Haipeng Sun, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita and Tiejun Zhao
    The 2021 Conference of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL-HLT), 2021

Text Compression-aided Transformer Encoding
    Zuchao Li, Zhuosheng Zhang, Hai Zhao*, Rui Wang*, Kehai Chen, Masao Utiyama, and Eiichiro Sumita
    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
    [Codes]

SG-Net: Syntax Guided Transformer for Language Representation
    Zhuosheng Zhang, Yuwei Wu, Junru Zhou, Sufeng Duan, Hai Zhao*, and Rui Wang
    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
    [Codes]

Detecting Generalization Barriers for Understanding Neural Machine Translation
    Guanlin Li, Conghui Zhu*, Rui Wang, and Tiejun Zhao
    IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2021

Modeling Future Cost for Neural Machine Translation
    Chaoqun Duan, Kehai Chen, Rui Wang, Masao Utiyama, Eiichiro Sumita, Conghui Zhu*, and Tiejun Zhao
    IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2021

Unsupervised Neural Machine Translation for Similar and Distant Language Pairs: An Empirical Study
    Haipeng Sun, Rui Wang, Masao Utiyama, Benjamin Marie, Kehai Chen, Eiichiro Sumita, and Tiejun Zhao*
    ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 2021

Context-Aware Positional Representation for Self-Attention Networks
    Kehai Chen, Rui Wang*, Masao Utiyama, and Eiichiro Sumita
    Neurocomputing, 2021

Tri-training for Dependency Parsing Domain Adaptation
    Shu Jiang, Zuchao Li, Hai Zhao*, Bao-Liang Lu, Rui Wang
    ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 2021

2020

Data-dependent Gaussian Prior Objective for Language Generation
    Zuchao Li, Rui Wang*, Kehai Chen, Masao Utiyama, Eiichiro Sumita, Zhuosheng Zhang, and Hai Zhao*
    International Conference on Learning Representations (ICLR-2020), Addis Ababa, Ethiopia
    [Codes] Note: this is a full-score paper and a long-time talk presentation

Neural Machine Translation with Universal Visual Representation
    Zhuosheng Zhang, Kehai Chen, Rui Wang*, Masao Utiyama, Eiichiro Sumita, Zuchao Li, and Hai Zhao*
    International Conference on Learning Representations (ICLR-2020), Addis Ababa, Ethiopia
    [Codes]

Knowledge Distillation for Multilingual Unsupervised Neural Machine Translation
    Haipeng Sun, Rui Wang*, Kehai Chen, Masao Utiyama, Eiichiro Sumita, and Tiejun Zhao*
    The 58th Annual Meeting of the Association for Computational Linguistics (ACL-2020), Seattle, USA

Content Word Aware Neural Machine Translation
    Kehai Chen, Rui Wang*, Masao Utiyama, and Eiichiro Sumita
    The 58th Annual Meeting of the Association for Computational Linguistics (ACL-2020), Seattle, USA

Regularized Context Gates on Transformer for Machine Translation
    Xintong Li, Lemao Liu, Rui Wang, Guoping Huang, and Max Meng
    The 58th Annual Meeting of the Association for Computational Linguistics (ACL-2020), Seattle, USA

High-order Semantic Role Labeling
    Zuchao Li, Hai Zhao*, Rui Wang and Kevin Parnow
    The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP-2020-Findings), Punta Cana, Dominican Republic
    [Codes]

Reference Language based Unsupervised Neural Machine Translation
    Zuchao Li, Hai Zhao*, Rui Wang*, Masao Utiyama and Eiichiro Sumita
    The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP-2020-Findings), Punta Cana, Dominican Republic
    [Codes]

Robust Unsupervised Neural Machine Translation with Adversarial Denoising Training
    Haipeng Sun, Rui Wang, Kehai Chen, Xugang Lu, Masao Utiyama, Eiichiro Sumita, and Tiejun Zhao*
    The 28th International Conference on Computational Linguistics (COLING-2020), Barcelona, Spain

Explicit Sentence Compression for Neural Machine Translation
    Zuchao Li, Rui Wang*, Kehai Chen, Masao Utiyama, Eiichiro Sumita, Zhuosheng Zhang, and Hai Zhao*
    Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-2020), New York, USA
    [Codes]

SG-Net: Syntax-Guided Machine Reading Comprehension
    Zhuosheng Zhang, Yuwei Wu, Junru Zhou, Sufeng Duan, Hai Zhao*, and Rui Wang*
    Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-2020), New York, USA
    [Codes]

Memory Network for Linguistic Structure Parsing
    Zuchao Li, Chaoyu Guan, Hai Zhao, Rui Wang, Kevin Parnow, and Zhuosheng Zhang
    IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2020

A Novel Sentence-Level Agreement Architecture for Neural Machine Translation
    Mingming Yang, Rui Wang, Kehai Chen, Xing Wang, Tiejun Zhao, and Min Zhang
    IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2020

Towards More Diverse Input Representation for Neural Machine Translation
    Kehai Chen, Rui Wang*, Masao Utiyama, Eiichiro Sumita, Tiejun Zhao, Munyun Yang, and Hai Zhao
    IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2020

Unsupervised Neural Machine Translation with Cross-lingual Language Representation Agreement
    Haipeng Sun, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita, and Tiejun Zhao
    IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2020

2019

Unsupervised Bilingual Word Embedding Agreement for Unsupervised Neural Machine Translation
    Haipeng Sun, Rui Wang*, Kehai Chen, Masao Utiyama, Eiichiro Sumita, and Tiejun Zhao*
    The 57th Annual Meeting of the Association for Computational Linguistics (ACL-2019), Florence, Italy

Neural Machine Translation with Reordering Embeddings
    Kehai Chen, Rui Wang*, Masao Utiyama, and Eiichiro Sumita
    The 57th Annual Meeting of the Association for Computational Linguistics (ACL-2019), Florence, Italy

Sentence-Level Agreement for Neural Machine Translation
     Mingming Yang, Rui Wang*, Kehai Chen, Masao Utiyama, Eiichiro Sumita, Min Zhang*, and Tiejun Zhao
    The 57th Annual Meeting of the Association for Computational Linguistics (ACL-2019), Florence, Italy

Lattice-Based Transformer Encoder for Neural Machine Translation
     Fengshun Xiao, Jiangtong Li, Hai Zhao*, Rui Wang, and Kehai Chen
    The 57th Annual Meeting of the Association for Computational Linguistics (ACL-2019), Florence, Italy

Recurrent Positional Embedding for Neural Machine Translation
    Kehai Chen, Rui Wang*, Masao Utiyama, and Eiichiro Sumita
    2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP2019), Hong Kong, China

Neural Machine Translation with Sentence-level Topic Context
    Kehai Chen, Rui Wang, Masao Utiyama, Eiichiro Sumita, and Tiejun Zhao
    IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2019

2018

Dynamic Sentence Sampling for Efficient Training of Neural Machine Translation
    Rui Wang, Masao Utiyama, and Eiichiro Sumita
    The 56th Annual Meeting of the Association for Computational Linguistics (ACL-2018), Melbourne, Australia

Exploring Recombination for Efficient Decoding of Neural Machine Translation
    Zhisong Zhang, Rui Wang*, Masao Utiyama, Eiichiro Sumita, and Hai Zhao*
    2018 Conference on Empirical Methods in Natural Language Processing (EMNLP-2018), Brussels, Belgium
    [Codes]

A Survey of Domain Adaptation for Neural Machine Translation
    Chenhui Chu and Rui Wang
    The 27th International Conference on Computational Linguistics (COLING-2018), Santa Fe, USA

Syntax-Directed Attention for Neural Machine Translation
    Kehai Chen, Rui Wang*, Masao Utiyama, Eiichiro Sumita, and Tiejun Zhao
    The Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-2018), New Orleans, Lousiana, USA

Sentence Selection and Weighting for Neural Machine Translation Domain Adaptation
    Rui Wang, Masao Utiyama, Andrew Finch, Lemao Liu, Kehai Chen, and Eiichiro Sumita
    IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2018

A Neural Approach to Source Dependency-Based Context Model for Statistical Machine Translation
    Kehai Chen, Tiejun Zhao, Muyun Yang, Lemao Liu*, Akihiro Tamura, Rui Wang, Masao Utiyama, and Eiichiro Sumita
    IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2018

Graph-based Bilingual Word Embedding for Statistical Machine Translation
    Rui Wang, Hai Zhao*, Sabine Ploux*, Bao-Liang Lu, Masao Utiyama, and Eiichiro Sumita
    ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 2018
    [Codes]

2017

Sentence Embedding for Neural Machine Translation Domain Adaptation
    Rui Wang, Andrew Finch, Masao Utiyama, and Eiichro Sumita
    The 55th annual meeting of the Association for Computational Linguistics (ACL-2017), Vancouver, Canada

Instance Weighting for Neural Machine Translation Domain Adaptation
    Rui Wang, Masao Utiyama, Lemao Liu, Kehai Chen, and Eiichro Sumita
    Conference on Empirical Methods in Natural Language Processing (EMNLP-2017), Copenhagen, Denmark

Neural Machine Translation with Source Dependency Representation
    Kehai Chen, Rui Wang*, Masao Utiyama, Lemao Liu, Akihiro Tamura, Eiichiro Sumita, and Tiejun Zhao
    Conference on Empirical Methods in Natural Language Processing (EMNLP-2017), Copenhagen, Denmark

Context-Aware Smoothing for Neural Machine Translation
    Kehai Chen, Rui Wang*, Masao Utiyama, Eiichiro Sumita, and Tiejun Zhao
    The 8th International Joint Conference on Natural Language Processing (IJCNLP 2017), Taipei, China

2016 and Before

A Bilingual Graph-based Semantic Model for Statistical Machine Translation
    Rui Wang, Hai Zhao*, Sabine Ploux*, Bao-Liang Lu, and Masao Utiyama
    25th International Joint Conference on Artificial Intelligence (IJCAI-16), New York, USA
    [Codes]

Converting Continuous-Space Language Models into N-gram Language Models with Efficient Bilingual Pruning for Statistical Machine Translation
    Rui Wang, Masao Utiyama*, Isao Goto, Eiichiro Sumita, Hai Zhao*, and Bao-Liang Lu
    ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 2016

Connecting Phrase based Statistical Machine Translation Adaptation
    Rui Wang, Hai Zhao*, Bao-Liang Lu, Masao Utiyama*, and Eiichro Sumita
    The 26th International Conference on Computational Linguistics (COLING-2016), Osaka, Japan

Bilingual Continuous-Space Language Model Growing for Statistical Machine Translation
    Rui Wang, Hai Zhao*, Bao-Liang Lu, Masao Utiyama, and Eiichiro Sumita
    IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2015

Neural Network Based Bilingual Language Model Growing for Statistical Machine Translation
    Rui Wang, Hai Zhao*, Bao-Liang Lu, Masao Utiyama, and Eiichro Sumita
    Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP-2014), Doha, Qatar

Converting Continuous-Space Language Models into N-gram Language Models for Statistical Machine Translation
    Rui Wang, Masao Utiyama, Isao Goto, Eiichro Sumita, Hai Zhao, and Bao-Liang Lu
    Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (EMNLP-2013), Seattle, USA