HOME PAGE Wei Peng

Wei Peng (Albert)

Phone: 010 612 16123

Birth: 1996.02

E-mail: pengwei@iie.ac.cn

English: CET-6

Other pages

Educational background

Institute of Information Engineering, Chinese Academy of Sciences | PhD
Chang'an University | Bachelor's degree (Project 211 university), ranked first in major

Programming Languages and Frameworks

Python, Linux, C++, PyTorch

Publications

Conferences and journals

31 papers accepted (SIGIR, IJCAI, TIFS, EMNLP, AAAI, KBS, etc.), including 13 as first author and 3 as corresponding author. SCI District 1, Chinese Academy of Sciences: 3 papers; SCI District 2, Chinese Academy of Sciences: 2 papers; CCF-A: 4 papers; CCF-B: 6 papers; CCF-C: 1 paper.

|Paper Link| Wei Peng, Lei Cui, Wei Cai, Wei Wang, Zhiyu Hao et al. Bottom Aggregating, Top Separating: An Aggregator and Separator Network for Encrypted Traffic Understanding. TIFS, 2025. CCF-A.

|Paper Link| Junmei Ding, Wei Peng* et al. RETO: Reinforcement Learning Enhanced Terminology Optimization for Cyber Threat Intelligence Summarization. Neurocomputing, 2025. SCI District 2, Chinese Academy of Sciences.

|Paper Link| Wei Peng, Lei Cui, Wei Cai, Wei Wang, Zhiyu Hao et al. Efficiently and Effectively: A Two-stage Approach to Balance Plaintext and Encrypted Text for Traffic Classification. 2024.

|Paper Link| Wei Peng, Junmei Ding, Wei Wang, Lei Cui, Zhiyu Hao et al. CTISum: A New Benchmark Dataset For Cyber Threat Intelligence Summarization. 2024.

|Paper Link| Qijun Xie, Wei Peng*. MAGO: Multi-Knowledge Aware and Global Strategy Sequence Optimizing Network for Emotional Support Conversation. Neurocomputing, 2024. SCI District 2, Chinese Academy of Sciences.

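|Paper Link| Wei Peng, Yue Hu, Yunpeng Li et al. 基于阅读技巧识别和双通道融合机制的机器阅读理解方法 (A Machine Reading Comprehension Method Based on Reading Skill Recognition and a Dual-Channel Fusion Mechanism). Acta Automatica Sinica, 2024. CCF-A.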

|Paper Link| Wei Peng, Wanshui Li, Yue Hu et al. Leader-Generator Net: Dividing Skill and Implicitness for Conquering FairytaleQA. SIGIR, 2023. CCF-A.

|Paper Link| Wei Peng, Ziyuan Qin, Yue Hu, Yuqiang Xie, Yunpeng Li. FADO: Feedback-Aware Double COntrolling Network for Emotional Support Conversation. Knowl. Based Syst, 2023. SCI District 1, Chinese Academy of Sciences.

|Paper Link| Wei Peng, Yue Hu, Luxi Xing, Yuqiang Xie, Yajing Sun et al. Control Globally, Understand Locally: A Global-to-Local Hierarchical Graph Network for Emotional Support Conversation. IJCAI, 2022. CCF-A.

|Paper Link| Wei Peng, Yue Hu, Jing Yu et al. APER: AdaPtive Evidence-driven Reasoning Network for Machine Reading Comprehension. Knowl. Based Syst, 2021. SCI District 1, Chinese Academy of Sciences, Impact Factor 8.038.

|Paper Link| Wei Peng, Yue Hu, Yajing Sun et al. Modeling Intention, Emotion and External World in Dialogue Systems. ICASSP, 2022. CCF-B.

|Paper Link| Jingcheng Deng, Wei Peng*. IRRGN: An Implicit Relational Reasoning Graph Network for Multi-turn Response Selection. EMNLP, 2022. CCF-B.

|Paper Link| Yunpeng Li, Wei Peng et al. Learning to Know Myself: A Coarse-to-Fine Persona-aware Training Framework for Personalized Dialogue Generation. AAAI, 2023. CCF-A.

|Paper Link| Zekai Li, Wei Peng. Self-Adaptive Reasoning on Sub-Questions for Multi-Hop Question Answering. ICASSP, 2023. CCF-B.

|Paper Link| Wei Peng, Yue Hu, Luxi Xing et al. Do You Know My Emotion? Emotion-Aware Strategy Recognition towards a Persuasive Dialogue System. ECML-PKDD, 2022. CCF-B.

|Paper Link| Wei Peng, Yue Hu, Luxi Xing, Yuqiang Xie, Jing Yu, Yajing Sun, Xiangpeng Wei. Bi-directional Cognitive Thinking Network for Machine Reading Comprehension. COLING, 2020. CCF-B.

|Paper Link| Wei Peng, Yue Hu, Jing Yu, Yajing Sun et al. MCR-NET: A Multi-Step Co-Interactive Relation Network for Unanswerable Questions on Machine Reading Comprehension. ICASSP, 2021. CCF-B.

|Paper Link| Wei Peng, Yue Hu, Luxi Xing, Yuqiang Xie, Yajing Sun. CogIntAc: Modeling the Relationships between Intention, Emotion and Action in Interactive Process from Cognitive Perspective. IJCNN, 2022. CCF-C.

|Paper Link| Luxi Xing, Yuqiang Xie, Wei Peng. IIE-NLP-NUT at SemEval-2020 Task 4: Guiding PLM with Prompt Template Reconstruction Strategy for ComVE. SemEval, 2020.

|Paper Link| Yuqiang Xie, Yue Hu, Luxi Xing, Wei Peng. CLSEG: Contrastive Learning of Story Ending Generation. ICASSP, 2022. CCF-B.

|Paper Link| Luxi Xing, Yue Hu, Yuqiang Xie, Wei Peng. Coarse-to-Careful: Seeking Semantic-Related Knowledge for Open-domain Commonsense Question Answering. ICASSP, 2021. CCF-B.

|Paper Link| Yuqiang Xie, Wei Peng, Yue Hu et al. IIE-NLP-Eyas at SemEval-2021 Task 4: Enhancing PLM for ReCAM with Special Tokens, Re-Ranking, Siamese Encoders, Back Translation. SemEval, 2021.

Posting

Award

Competition

2022 Natural Language Generation and Intelligent Writing Conference, small-model track (avg 4.8% = 60M/1233M): Top 1

2020 SemEval Task 4 (Commonsense Validation and Explanation): Top 3

2020 Language and Intelligence Technology Competition, Machine Reading Comprehension task: top 2%

2021 SemEval Task 4 (Machine Reading Comprehension): Top 8

2017 National Electronic Design Competition, Shaanxi Province Undergraduate Group: Third Prize

2017 11th "Challenge Cup" Extracurricular Academic Works Competition of Shaanxi Province: Second Prize

2016 National College Students Mathematical Contest in Modeling: Second Prize (Shaanxi Province)

2016 4th Program Design Competition for College Students in Shaanxi Province: Third Prize

Scholarship

2020-2021 Institute of Information Engineering, Chinese Academy of Sciences, Institute-level Excellence Scholarship

2019-2020 Institute of Information Engineering, Chinese Academy of Sciences, First Class Scholarship

2018-2019 Outstanding Merit Student of Chinese Academy of Sciences, Second Class Scholarship

2016-2017 National Inspirational Scholarship

2015-2016 National Scholarship

2014-2015 National Scholarship

Patents, software copyrights, and others

Two invention patents

Three utility model patents

One software copyright: a mobile application system for the Internet of Vehicles. It combines Wi-Fi indoor positioning, navigation, location fingerprint matching, Bluetooth communication, and other technologies so that users can track parking space availability in real time through a mobile app, which makes travel easier and improves the utilization rate of indoor parking lots.

More than 16 awards for cultural and sports activities and 8 awards for organizational work

Internship

2021.07-2021.09

MSRA | NLC Group | Algorithmic intern

Commonsense knowledge generation: tried generating commonsense knowledge on the CommonGen dataset with a non-autoregressive method (BANG). The concept-related images retrieved during this work also provide a basis for subsequent research.

Investigated edit-based generation methods and ran experiments on sentence decomposition, sentence fusion, dialogue summarization, and grammar correction tasks.

2020.07-2020.09

Tencent | KeenLab AI Lab | Algorithmic intern

Researched code similarity detection for programming languages; the main work involved model improvement and experimental iteration.

Applied CNN, DPCNN, Transformer, pre-trained language models, and other techniques.

On the task of measuring code version similarity across different libraries (JPEG, Zlib, etc.), improved the model's running efficiency through code optimization.

By improving the scoring function and adding a penalty factor (which accounts for the strings matched between versions), raised top-3 identification accuracy on the FreeType library from 79.4% to 98.5%, laying groundwork for subsequent product launches.

2019.02-2019.06

Didi Chuxing | Intelligent Customer Service AI Lab | Algorithmic intern

Researched automatic conversation summarization.

To handle the redundancy of conversations, filtered sentence-level utterances with a gate mechanism, improving ROUGE-L by 1.21%.

Designed an automatic label construction method that uses the ROUGE metric from MRC to identify the most relevant sub-summaries.

Proposed a multi-task learning method that jointly learns multiple tasks by modeling the relationship between sub-summaries and sentences.

Wrote a patent: A summary generation method based on dialog tag information.

Projects

Multi-choice Machine Reading Comprehension

2021.03 - 2021.04

Cast the multi-choice machine reading comprehension task as a multi-class classification problem, and improved model accuracy with pre-trained language models, data augmentation, back translation, adversarial training, and model ensembling.

GIT

Bi-directional Cognitive Thinking Network

2020.02 - 2020.06

Inspired by the complementary learning systems theory in cognitive science, proposed a reverse thinking network that models the inverse relationship between passages and answers, so that the machine learns to actively ask questions, and decouples the bidirectional knowledge to assist answer generation.

GIT

MRC method based on Multi-step Reasoning

2019.07 - 2020.01

Designed an evidence extraction module for unanswerable questions in DuReader and SQuAD 2.0. Key evidence is extracted through multi-step reasoning, which adaptively determines whether a question is answerable, improving ROUGE-L by 2.2% and F1 by 1.8% on the two datasets, respectively.

GIT

Classification based on hierarchical Attention

2018.10 - 2019.01

Combined a GRU neural network with the attention mechanism. Considering the hierarchical structure among words, sentences, and documents, introduced hierarchical attention to capture word order and semantic information to a certain extent. Trained on a subset of the Tsinghua THUCNews dataset, the accuracy increased from 91.95% to 93.75%.

Named Entity Recognition (NER) based on CRF+LSTM

2017.12 - 2018.04

When an LSTM network performs NER on its own, it only considers local optima and performs poorly, while CRF requires manual feature engineering. This project combined automatic feature extraction by the neural network with the global normalization of the conditional random field. On the Chinese NER task, the model improved over CRF by 4.65%, 5.34%, and 5.78% on person, location, and organization entities, respectively.

Society

Live platform

2021.4 Bilibili: live broadcast of the Haihua Research Institute Machine Reading Comprehension Competition, 800+ views.

2021.3 Bilibili: live paper talk on emotion and behavior recognition in conversations, 500+ views.

2020.7 Bilibili: live paper talk on "Select, Answer and Explain: Interpretable Multi-hop Reading Comprehension over Multiple Documents", 1000+ views.

2019.3 Post-Baoyan HPY: lecture on "The Difference Between Graduate and Undergraduate", 2300+ views.

Academic impact

CSDN blog: 180,000+ cumulative views, ranked 10,047 out of 480,000.

2021.12 Invited by the Student Union to share research and academic experience at the "Peer Piloting Exchange Meeting" of the Institute of Information Engineering.

2021.1 The COLING 2020 machine reading comprehension model based on Bi-directional Cognitive Thinking was covered by media outlets including Zhuanzhi, Sohu, and PaperWeekly.

2020.10 The SemEval-2020 work was covered by AI Technology, Tencent, and other media.

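Wei Peng (Albert)

Phone: 010 612 16123

Birth: 1996.02

E-mail: pengwei@iie.ac.cn

English: CET-6

Political status: Member of the Communist Party of China

Other pages

Educational background

Institute of Information Engineering, University of Chinese Academy of Sciences | PhD
Chang'an University | Bachelor's degree (Project 211 university), ranked first in major

Proficient languages & frameworks

Python, Linux, C++, PyTorch

Publications

Conferences and journals

31 papers published or accepted (SIGIR, IJCAI, TIFS, EMNLP, AAAI, KBS, etc.), including 13 as first author and 3 as corresponding author; 3 in District 1 journals (impact factor 8.032), 2 in District 2 journals, 4 CCF-A, 6 CCF-B, and 1 CCF-C.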
