About Me
I’m a second-year doctoral student at The Ohio State University, advised by Prof. Yu Su. My research explores retrieval-augmented generation (RAG) and advanced language systems. Specifically, I focus on: 1) developing efficient information retrieval approaches to enhance reasoning and long-term memory capabilities, and 2) integrating these models with complex external environments such as the Web, knowledge graphs, and databases.
I received my Master degree from Nanjing University in 2023 and Bachelor degree from Northeastern University (CN) in 2020. I was research intern at MSRA in 2022.
Feel free to reach out to me if you’re interested in my research. I’m looking for summer internship opportunities.
Preprints
- [arXiv] Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents
Boyu Gou, Ruohan Wang, Boyuan Zheng, Yanan Xie, Cheng Chang, Yiheng Shu, Huan Sun, Yu Su
[paper] [code] [home]
Publications
- [NeurIPS’24] HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models
Bernal Jiménez Gutiérrez, Yiheng Shu, Yu Gu, Michihiro Yasunaga, Yu Su
[paper] [code (>1.4k stars)] - [EMNLP’24] Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments
Yu Gu, Yiheng Shu, Hao Yu, Xiao Liu, Yuxiao Dong, Jie Tang, Jayanth Srinivasa, Hugo Latapie, Yu Su
[paper] [code] [benchmark] [proceedings] [BibTeX] - [EACL’24 SRW] Distribution Shifts Are Bottlenecks: Extensive Evaluation for Grounding Language Models to Knowledge Bases
Yiheng Shu, Zhiwei Yu
[paper] [code] [data] [poster] [video] [proceedings] [BibTeX] - [AAAI’23] Question Decomposition Tree for Answering Complex Questions over Knowledge Bases
Xiang Huang, Sitao Cheng, Yiheng Shu, Yuheng Bao, Yuzhong Qu
[paper] [code] [proceedings] [BibTeX] - [EMNLP’22] TIARA: Multi-grained Retrieval for Robust Question Answering over Large Knowledge Base
Yiheng Shu, Zhiwei Yu, Yuhan Li, Börje F. Karlsson, Tingting Ma, Yuzhong Qu, Chin-Yew Lin
[paper] [code] [data] [poster] [slides] [video] [proceedings] [BibTeX] - [COLING’22] Logical Form Generation via Multi-task Learning for Complex Question Answering over Knowledge Bases
Xixin Hu, Xuan Wu, Yiheng Shu, Yuzhong Qu
[paper] [code] [slides] [video] [proceedings] [BibTeX] - [ISWC’21] EDG-based Question Decomposition for Complex Question Answering over Knowledge Bases
Xixin Hu, Yiheng Shu, Xiang Huang, Yuzhong Qu
[paper] [code] [home] [video] [proceedings] [BibTeX] - [TOIS’20] Deep Learning for Sequential Recommendation: Algorithms, Influential Factors, and Evaluations
Hui Fang, Danning Zhang, Yiheng Shu, Guibing Guo
[paper] [code] [ACM Library] [BibTeX] - [ICWE’19 tutorial] Deep Learning-based Sequential Recommender Systems: Concepts, Algorithms, and Evaluations
Hui Fang, Danning Zhang, Guibing Guo, Yiheng Shu
[slides] [intro] [Springer Link] [BibTeX]
Services
- Conference reviewer: ARR 2024 (ACL 2024, EMNLP 2024, NAACL 2025), WiNLP 2024
- Journal reviewer: IEEE Trans. Big Data
Update: 12/03/2024