<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"> <html xmlns="http://www.w3.org/1999/xhtml"> <head> <meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> <link rel="stylesheet" type="text/css" href="style.css" /> <title>Yu Su's Homepage - The Ohio State University</title> <script async> (function(i,s,o,g,r,a,m){i['GoogleAnalyticsObject']=r;i[r]=i[r]||function(){ (i[r].q=i[r].q||[]).push(arguments)},i[r].l=1*new Date();a=s.createElement(o), m=s.getElementsByTagName(o)[0];a.async=1;a.src=g;m.parentNode.insertBefore(a,m) })(window,document,'script','//www.google-analytics.com/analytics.js','ga'); ga('create', 'UA-71295941-1', 'auto'); ga('send', 'pageview'); </script> </head> <body> <div id="main"> <h1>Yu Su (苏煜)</h1> <div id="header"> <a href="#">Home</a> <a href="resources/CV_YuSu.pdf">CV</a>   <a href="#awards">Awards</a> <a href="#student">Students</a> <a href="#publications">Publications</a> <a href="#teaching">Teaching</a> <a href="https://github.com/OSU-NLP-Group">Software</a>   <a href="#contact">Contact</a>  </div> <div id="contentColumn"> <div class="contentBlock"> <h2>About Me</h2> <div class="divider"></div> <img src="images/YuSu.jpeg" alt="Yu Su" style="width:245px;height:245px;float:left;margin-right:20px;margin-bottom:10px;margin-top:5px;" /> I'm a Distinguished Assistant Professor of Engineering at the <a href="https://cse.osu.edu/">Department of Computer Science and Engineering</a>, The Ohio State University, where I co-direct the <a href="https://twitter.com/osunlp">OSU NLP group</a>, co-lead the Foundational AI team in the <a href="https://icicle.osu.edu/">ICICLE AI Institute</a>, and lead the Machine Learning Foundations team in the <a href="https://imageomics.osu.edu/">Imageomics Institute</a>. 
I got my PhD from <a href="https://cs.ucsb.edu/">University of California, Santa Barbara</a> and my bachelor's degree from <a href="https://www.cs.tsinghua.edu.cn/csen/">Tsinghua University</a>, both in Computer Science. I'm a 2025 Sloan Research Fellow and received several best/outstanding paper awards from CVPR and ACL. I'm broadly interested in artificial intelligence, with a primary interest in the role of language as a vehicle for reasoning and communication. These days, I spend much of my time thinking about language agents [<a href="https://yusu.substack.com/p/language-agents">blog</a>, <a href="https://language-agent-tutorial.github.io/">tutorial</a>], an emerging class of AI agents characterized by their language understanding and production capabilities. I'm fascinated by biological intelligence and the power of natural selection, so one may find many references or direct inspirations from biological intelligence in my work. Meanwhile, biological systems also have their constraints and limitations, and I hope to develop advanced artificial intelligence to augment human intelligence. 
Some facets and applications of intelligence I'm currently interested in: <ul> <li>Reasoning and grounding, especially in multimodal contexts [<a href="https://mmmu-benchmark.github.io/">MMMU</a>, <a href="https://osu-nlp-group.github.io/SeeAct/">SeeAct</a>, <a href="https://arxiv.org/abs/2405.15071">Grokked Transformers</a>, <a href="https://aclanthology.org/2023.acl-long.270.pdf">Pangu</a>, <a href="https://osu-nlp-group.github.io/UGround/">UGround</a>]</li> <li>Planning and world models [<a href="https://dki-lab.github.io/LLM-Planner/">LLM-Planner</a>, <a href="https://arxiv.org/abs/2411.06559">WebDreamer</a>]</li> <li>Memory and non-parametric continual learning [<a href="https://github.com/OSU-NLP-Group/HippoRAG">HippoRAG</a>]</li> <li>Benchmarking and evaluation [<a href="https://osu-nlp-group.github.io/Mind2Web/">Mind2Web</a>, <a href="https://osu-nlp-group.github.io/TravelPlanner/">TravelPlanner</a>]</li> <li>AI for sciences [<a href="https://imageomics.github.io/bioclip/">BioCLIP</a>, <a href="https://osu-nlp-group.github.io/ScienceAgentBench/">ScienceAgentBench</a>]</li> </ul> <div>   </div> </div> <div class="contentBlock"> <a name="news"></a> <h2>What's New</h2> <div class="divider"></div> <div class="news"> <ul>   <li>02/2025: Honored to receive the <a href="https://sloan.org/fellowships/2025-Fellows">Sloan Research Fellowship</a>!</li> <li>02/2025: Guest lecture on memory, reasoning, and planning of language agents at the <a href="https://llmagents-learning.org/sp25">Berkeley Advanced LLM Agents MOOC</a>.</li> <li>01/2025: 4 papers accepted to ICLR'25: <a href="https://osu-nlp-group.github.io/UGround/">UGround</a> (Oral), <a href="https://osu-nlp-group.github.io/ScienceAgentBench/">ScienceAgentBench</a>, <a href="https://arxiv.org/abs/2410.02642">In-context Reranking</a>, <a href="https://arxiv.org/abs/2408.06327">VisualAgentBench</a></li> <li>12/2024: Talk at NeurIPS workshop about <a href="https://tinyurl.com/language-agent-safety-2024">agent 
safety</a>.</li> <li>11/2024: Check out our tutorial "<a href="https://language-agent-tutorial.github.io/">Language Agents: Foundations, Prospects, and Risks</a>" at EMNLP'24!</li> <li>11/2024: Talk at University of Michigan, Princeton University, LMU Munich/TU Darmstadt on language agents.</li> <li>09/2024: 4 papers accepted to NeurIPS'24 and 1 paper to EMNLP'24: <a href="https://arxiv.org/abs/2405.14831">HippoRAG</a>, <a href="https://arxiv.org/abs/2405.15071">Grokked Transformers</a>, Calibrated Fine-tuning, <a href="https://arxiv.org/abs/2408.16176">VLM4Bio</a>, and <a href="https://arxiv.org/abs/2402.14672">Middleware for LLMs</a>.</li> <li>09/2024: Recent services: Senior Area Chair for ICLR'25, Area Chair for ACL'24, EMNLP'24, COLM'24.</li> <li>06/2024: Honored to receive Best Student Paper Award (BioCLIP) and Best Paper Finalist (MMMU) at CVPR'24!</li> <li>05/2024: Thrilled to release <a href="https://arxiv.org/abs/2405.14831">HippoRAG</a> and bring my passion for biological intelligence into AI!</li> <li>05/2024: 2 papers accepted to ACL'24: <a href="https://arxiv.org/abs/2403.04746">tool learning through simulated trial and error</a> and <a href="https://arxiv.org/abs/2402.10890">planning with LLMs</a> (or why it's hard).</li> <li>05/2024: New talk on a holistic and critical look at language agents at the <a href="https://cmu-agent-workshop.github.io/">CMU Agent Workshop</a>.</li> <li>05/2024: 3 papers accepted to ICML'24: <a href="https://osu-nlp-group.github.io/SeeAct/">SeeAct</a>, <a href="https://osu-nlp-group.github.io/TravelPlanner/">TravelPlanner</a> (Spotlight), <a href="https://open-vision-language.github.io/MagicLens/">MagicLens</a> (Oral).</li> <li>04/2024: Honored that both <a href="https://imageomics.github.io/bioclip/">BioCLIP</a> and <a href="https://mmmu-benchmark.github.io/">MMMU</a> are selected for oral presentation at CVPR'24 (90/11,532, 0.8%)! </li> <li>02/2024: 3 papers accepted to CVPR'24: <a 
href="https://imageomics.github.io/bioclip/">BioCLIP</a> (Best Student Paper), <a href="https://mmmu-benchmark.github.io/">MMMU</a> (Best Paper Finalist), <a href="https://arxiv.org/abs/2402.04476">multimodal web agents</a>.</li> <li>02/2024: Excited to release <a href="https://osu-nlp-group.github.io/TravelPlanner/">TravelPlanner</a>, a real-world planning benchmark for language agents.</li> <li>01/2024: 5 papers accepted to ICLR'24 on <a href="https://arxiv.org/abs/2305.13300">knowledge conflicts in LLMs</a> (Spotlight), <a href="https://arxiv.org/abs/2309.05653">MAmmoTH</a> (Spotlight), <a href="https://arxiv.org/abs/2308.03688">AgentBench</a>, <a href="https://arxiv.org/abs/2311.04157">Interpretable Transformer</a>, <a href="https://arxiv.org/abs/2312.02436">MUFFIN</a>.</li> <li>01/2024: Thrilled to release <a href="https://osu-nlp-group.github.io/SeeAct/">SeeAct</a>, enabling everyone to use GPT-4V-based web agents with one click.</li> <li>12/2023: Thrilled to release <a href="https://imageomics.github.io/bioclip/">BioCLIP</a>, a vision foundation model for the tree of life.</li> <li>11/2023: Excited to release <a href="https://mmmu-benchmark.github.io/">MMMU</a>, a new multimodal benchmark for Expert AGI.</li> <li>10/2023: Invited talks at IJCAI, UCSD, Tsinghua, Fudan, University of Hong Kong, Pinterest, and Salesforce AI Research on language agents (<a href="resources/language_agents_YuSu_2023.pdf">slides</a>).</li> <li>10/2023: 3 papers accepted to EMNLP'23 on <a href="https://arxiv.org/abs/2305.06311">attribution evaluation of LLMs</a>, <a href="https://arxiv.org/abs/2305.13683">text-to-SQL error detection</a>, and biomedical NLP.</li> <li>09/2023: 3 papers accepted to NeurIPS'23: <a href="https://arxiv.org/abs/2306.06070">Mind2Web</a> (Spotlight), <a href="https://arxiv.org/abs/2306.10012">MagicBrush</a>, and Holistic Transfer.</li> <li>07/2023: Glad that <a href="https://dki-lab.github.io/LLM-Planner/">LLM-Planner</a> that uses LLMs for embodied agent 
planning got accepted to ICCV'23!</li> <li>07/2023: Honored that our paper <a href="https://arxiv.org/abs/2212.09736">Pangu</a> received the Outstanding Paper Award from ACL 2023!</li> <li>06/2023: Invited talks at Army Research Lab and OSU RISK Institute.</li> <li>05/2023: 6 papers accepted to ACL'23. Congrats to all the students and collaborators!</li> <li>05/2023: Grateful to receive support from NIH R01 for our research on AI and biomedicine!</li> <li>04/2023: Honored to receive support from President Johnson for our research on large language models!</li> <li>03/2023: Invited talks at University of Tokyo and Amazon on <a href="resources/grounding_LMs.pdf">grounding language models to real-world environments</a>.</li> <li>03/2023: Honored to receive the <a href="https://awards.webapps.engineering.osu.edu/awards/13">College of Engineering Lumley Research Award</a>!</li> <li>01/2023: Check out <a href="https://twitter.com/osunlp/status/1609642737409855490">a summary of the major achievements</a> by the OSU NLP group in 2022!</li> <li>12/2022: Serve as Area Chair for ACL'23.</li> <li>11/2022: Serve as Workflow Co-Chair for SIGKDD'23.</li> <li>10/2022: Excited that our <a href="https://arxiv.org/abs/2204.08109">ArcaneQA</a> paper won the Outstanding Paper Award at COLING'22 (top 15 out of 2253 submissions)!</li> <li>10/2022: Papers on broad-coverage conversational AI and GPT-3 for biomedical information extraction accepted to EMNLP'22!</li> <li>09/2022: New grant from ARL for knowledge-based embodied AI. </li> <li>08/2022: Honored to receive the Distinguished Assistant Professorship of Engineering Inclusive Excellence from OSU for research and contributions towards democratizing AI!</li> <li>08/2022: Paper on question answering over large knowledge graphs accepted to COLING'22. </li> <li>07/2022: Serve as Senior PC member for AAAI'23. 
</li> <li>07/2022: Invited talk at the <a href="https://dlg4nlp-workshop.github.io/dlg4nlp-naacl22/schedule">DLG4NLP workshop</a> at NAACL'22: <a href="resources/Graphs_and_Conversational_AI.pdf">Will Graphs Lead to the Next Breakthrough of Conversational AI?</a></li> <li>06/2022: Our OSU team won 3rd place in the inaugural <a href="https://www.amazon.science/alexa-prize/three-top-performers-emerge-in-inaugural-alexa-prize-taskbot-challenge">Amazon Alexa Prize TaskBot Challenge</a>! Check out our <a href="https://sunlab-osu.github.io/tacobot/">website</a>.</li> <li>05/2022: Thank you, Walmart and Cisco, for supporting our research!</li> <li>04/2022: Talk at Nanjing University and JD.com on emerging frontiers of conversational AI.</li> <li>02/2022: Paper on long-horizon vision-and-language navigation accepted to CVPR 2022.</li> <li>02/2022: Paper on text-to-SQL generalization accepted to ACL 2022.</li> <li>12/2021: Check out <a href="https://twitter.com/osunlp/status/1477501035954454528?s=20">a summary of the major achievements</a> by the OSU NLP group in 2021!</li> <li>11/2021: Our team is selected to participate in the <a href="https://developer.amazon.com/alexaprize/challenges/current-challenge/simbot">Alexa Prize SimBot challenge</a>!</li> <li>09/2021: Excited to be a part of the <a href="https://engineering.osu.edu/news/2021/09/new-15-million-nsf-grant-launches-ohio-state-imageomics-institute">Imageomics Institute</a> -- a new NSF HDR Institute dedicated to knowledge-guided machine learning for biology. I will lead the Machine Learning Foundations team.</li> <li>08/2021: Paper on pre-trained language models with better reasoning capabilities accepted to EMNLP 2021.</li> <li>07/2021: Excited to be a part of <a href="https://icicle.osu.edu/">ICICLE</a> -- a new NSF AI Institute dedicated to democratizing AI through AI and cyberinfrastructure innovations. 
I will lead the <a href="https://icicle.osu.edu/who-we-are/management">AI team</a> with Eric Fosler-Lussier. <a href="https://news.osu.edu/nsf-funds-2-ohio-state-based-institutes-to-expand-artificial-intelligence-research/">Read more</a>.</li> <li>07/2021: Talk at USC/ISI and Beijing Academy of Artificial Intelligence on Emerging Frontiers of Conversational AI.</li> <li>05/2021: Long paper on large-scale joint KB and text embedding accepted to ACL 2021. </li> <li>03/2021: Received an Accelerator Grant from OSU TDAI on NLP for Social Media Pharmacovigilance.</li> <li>03/2021: Short paper on compositional generalization for neural semantic parsing accepted to NAACL-HLT 2021. </li> <li>01/2021: Paper on <a href="https://arxiv.org/pdf/2011.07743.pdf">non-i.i.d. generalization of question answering on knowledge bases</a> accepted to TheWebConf 2021 (previously WWW). </li> <li>11/2020: Will co-organize the <a href="https://nlp4prog.github.io/2021/">First Workshop on Natural Language Processing for Programming</a> at ACL-IJCNLP 2021. </li> <li>09/2020: Will serve on the organizing committee of NAACL 2021.</li> <li>09/2020: Super excited to share some of the work I've been doing at <a href="https://www.microsoft.com/en-us/research/project/semantic-machines/">Microsoft Semantic Machines</a>: <a href="https://www.microsoft.com/en-us/research/blog/dialogue-as-dataflow-a-new-approach-to-conversational-ai/?OCID=msr_blog_semantic_tw">Task-Oriented Dialogue as Dataflow Synthesis</a> (TACL'20)</li> <li>09/2020: Two long papers (learning language interfaces from use and data-to-text generation) accepted to EMNLP'20. One short paper on document classification for COVID-19 literature accepted to Findings of EMNLP.</li> <li>05/2020: Serve as Area Chair (Conversational Bot/QA) at NLPCC'20. 
Serve in the Program Committee of ACL'20, KDD'20 (chair of Trustworthy Data Mining session), EMNLP'20, AAAI'21, AKBC'20, IntEx-SemPar'20.</li> <li>04/2020: Long paper on logical natural language generation accepted to ACL 2020</li> <li>03/2020: Thank you, Fujitsu Laboratories of America, for supporting our research!</li> <li>01/2020: Started as Assistant Professor of Computer Science and Engineering at the Ohio State University</li> <li>08/2019: Long paper on model-based interactive semantic parsing got accepted to EMNLP 2019</li> <li>08/2019: Long paper on taxonomic categorization of documents got accepted to ICDM 2019</li> <li>05/2019: Short paper on general-purpose textual relation embedding got accepted to ACL 2019</li> <li>05/2019: Received Outstanding Dissertation Award of Computer Science from UCSB. Thank you UCSB!</li> <li>05/2019: Check out <a href="https://www.computerworld.com/article/3394041/new-cortana-and-google-assistant-updates-highlight-workplace-potential.html">what we are doing</a> at Microsoft Semantic Machines (highlighted in Microsoft Build 2019)! </li> <li>02/2019: Full paper on vocabulary selection got accepted to NAACL 2019</li> <li>02/2019: Talk at Stanford NLP Seminar on democratizing data science with knowledge engines</li> <li>11/2018: Full paper on zero-shot video captioning got accepted to AAAI 2019</li> <li>10/2018: Started as researcher at <a href="http://www.semanticmachines.com/">Microsoft Semantic Machines</a> in Berkeley working on conversational AI.</li> <li>08/2018: Full paper on concept mining from text got accepted to ICDM 2018.</li> <li>08/2018: Two long papers on dialog/semantic parsing got accepted to EMNLP 2018.</li> <li>07/2018: Our work on <a href="https://www.microsoft.com/en-us/research/blog/democratizing-apis-with-natural-language-interfaces/">natural language interfaces to APIs</a> highlighted in Microsoft Research Blog!</li> <li>06/2018: Serve as PC member for ACL'18, EMNLP'18, CoNLL'18, NLPCC'18, and AAAI'19. 
</li>
<li>04/2018: Paper "DialSQL: Dialogue Based Structured Query Generation" accepted to ACL'18 as long paper: Improve semantic parsing with dialog.</li> <li>04/2018: Paper "Natural Language Interfaces with Fine-Grained User Interaction: A Case Study on Web APIs" accepted to SIGIR'18 as long paper.</li> <li>03/2018: Awarded the Best Distinguished Graduate Student Lecture of <a href="http://www.ucsb-cs-summit.com/">UCSB CS Summit</a>.</li> <li>02/2018: Paper "Global Relation Embedding for Relation Extraction" accepted to NAACL-HLT'18: Robust relation extraction from text with global statistics.</li> <li>02/2018: Talk about "Bridging the Gap between Human and Data with AI" at the University of Massachusetts, Amherst.</li> <li>02/2018: Successfully organized the first <a href="http://kbcom.org">Workshop on Knowledge Base Construction, Reasoning and Mining</a> in Los Angeles. Check out the great invited talks and accepted papers!</li> <li>01/2018: Talk about "Bridging the Gap between Human and Data with AI" at the Ohio State University.</li> <li>12/2017: I will serve in the Program Committee (Research Track) of KDD'18</li> <li>12/2017: Paper "Unsupervised Neural Categorization for Scientific Publications" accepted to SDM'18.</li> <li>11/2017: Attended CIKM'17 in Singapore and gave a talk on natural language interfaces and a tutorial on construction and querying of large-scale knowledge bases. </li> <li>10/2017: Upcoming visits in China: 10.09-10.15 (Alibaba, Hangzhou), 10.10 (Fudan University, Shanghai), 10.11 (The Computing Conference, Hangzhou), 10.16 (Tsinghua University, Beijing), 10.17 (Toutiao AI Lab, Beijing) </li> <li>09/2017: I'm co-organizing the <a href="http://www.kbcom.org">First Workshop on Knowledge Base Construction, Reasoning and Mining</a> (KBCOM'18) co-located with WSDM'18 on Feb 9, 2018 in Los Angeles. <a href="http://kbcom.org/1-cfp.html">CFP</a> is out!</li> <li>09/2017: Finished summer internship at MSR. 
Flying to Copenhagen for EMNLP.</li> <li>08/2017: I will serve in the Program Committee of WWW'18</li> <li>08/2017: Paper on natural language interface to web API from zero user and data accepted to CIKM'17.</li> <li>07/2017: Tutorial on Construction and Querying of Large-Scale Knowledge Bases accepted to CIKM'17. See you in Singapore!</li> <li>06/2017: Three papers on semantic parsing/QA accepted to EMNLP'17. Thanks to my collaborators!</li> <li>06/2017: Started summer internship in Microsoft Research</li> <li>04/2017: I will serve in the Program Committee of CIKM'17</li> <li>03/2017: Attended a project meeting at UIUC and gave a talk on unsupervised document categorization</li> <li>03/2017: I will serve in the Program Committee of NLPCC'17</li> <li>02/2017: I will serve in the Program Committee of EMNLP'17</li> <li>01/2017: I will serve in the Program Committee of ACL'17</li> <li>11/2016: Attended EMNLP'16 in Austin, US</li> <li>09/2016: Our QA dataset GraphQuestions v1 is released. Check <a href="https://github.com/ysu1989/GraphQuestions">it</a> out!</li> <li>09/2016: Two papers on knowledge base question answering got accepted to EMNLP'16!</li> <li>09/2016: Attended the Bay Area Deep Learning School, Stanford</li> <li>06/2016: Started summer internship in Microsoft Research, Redmond</li> </ul> </div> </div>  <div class="contentBlock"> <a name="talks"></a> <h2>Recent Talks</h2> <div class="divider"></div> <div class="publication"> <ul> <li> Tutorial: Language Agents: Foundations, Prospects, and Risks (<a href="https://tinyurl.com/language-agent-tutorial-2024">slides</a>) (<a href="https://tinyurl.com/agent-tutorial-recording-2024">recording</a>) <a href="https://language-agent-tutorial.github.io/">EMNLP 2024</a> (w/ Diyi Yang, Shunyu Yao, Tao Yu) </li> <li> On Memory, Reasoning, and Planning of Language Agents (<a href="https://www.youtube.com/watch?v=zvI4UN2_i-w&t=3s">recording</a>) <a href="https://llmagents-learning.org/sp25">Berkeley Advanced LLM Agents 
MOOC</a> </li> <li> A Holistic and Critical Look at Language Agents (<a href="resources/language_agents_YuSu_2024.pdf">slides</a>) <a href="https://cmu-agent-workshop.github.io/">CMU Agent Workshop</a>, JPMorgan Chase, Amazon AGI, Apple NLU Workshop, LMU Munich/TU Darmstadt, Stanford, Princeton, NVIDIA </li>  <li> Web Agents: A New Frontier for Embodied Agents (<a href="https://tinyurl.com/web-agents-2024">slides</a>) <a href="https://cse.engin.umich.edu/event/yu-su-seminar-guest-lecture">University of Michigan</a>, <a href="https://splu-robonlp-2024.github.io/">SpLU-RoboNLP Workshop@ACL'24</a>, ServiceNow </li>  </ul> </div> </div> <div class="contentBlock"> <a name="awards"></a> <h2>Awards &amp; Honors</h2> <div class="divider"></div> <ul> <li>Alfred P. Sloan Research Fellowship, 2025</li> <li>Best Student Paper Award, CVPR, 2024</li> <li>Best Paper Finalist, CVPR, 2024</li> <li>Cisco Faculty Award, 2024</li> <li>Outstanding Area Chair, EMNLP, 2024</li> <li>Outstanding Paper Award, ACL, 2023</li> <li>Lumley Research Award, OSU, 2023</li> <li>Outstanding Paper Award, COLING, 2022</li> <li>Distinguished Assistant Professorship of Engineering Inclusive Excellence, OSU, 2022</li> <li>Third-Place Honor, Inaugural Amazon Alexa Prize TaskBot Challenge, 2022</li> <li>Outstanding Dissertation Award of Computer Science, UCSB, 2019</li> <li>Outstanding Freshman/Graduate Awards, Tsinghua University, 2008/2012</li>  </ul> </div> <div class="contentBlock"> <a name="student"></a> <h2>Students</h2> <div class="divider"></div>      Ph.D. 
Students <ul> <li><a href="https://scholar.google.com/citations?user=c5RwjjcAAAAJ">Yu Gu</a> (SP20 &#8211;)</li> <li><a href="https://bernaljg.github.io/">Bernal Jiménez Gutiérrez</a> (SP20 &#8211;)</li> <li><a href="https://vardaan123.github.io/">Vardaan Pahuja</a> (SP20 &#8211;)</li> <li><a href="https://chanh.ee/">Chan Hee (Luke) Song</a> (AU20 &#8211;)</li> <li><a href="https://chensj98.github.io/">Shijie Chen</a> (AU21 &#8211;, co-advised with Huan Sun)</li> <li><a href="https://samuelstevens.me/">Sam Stevens</a> (AU21 &#8211;)</li> <li><a href="https://drogozhang.github.io/">Kai Zhang</a> (AU22 &#8211;)</li> <li><a href="https://scholar.google.com/citations?user=0wfNprQAAAAJ&hl=en">Jiaman (Lisa) Wu</a> (AU22 &#8211;, co-advised with Wei-Lun Chao)</li> <li><a href="https://boyuanzheng010.github.io/">Boyuan Zheng</a> (AU23 &#8211;)</li> <li><a href="https://yihengshu.github.io/">Yiheng Shu</a> (AU23 &#8211;)</li> <li><a href="https://boyugou.github.io/">Boyu Gou</a> (AU23 &#8211;)</li> </ul>   </div> <div class="contentBlock"> <a name="publications"></a> <h2>Publications</h2> <div class="divider"></div> <div class="publication"> <div style="margin-bottom: 10px;font-size: 18px">(<a href="https://scholar.google.com/citations?user=rIh5OqoAAAAJ&hl=en">Google Scholar</a>) (<a href="https://yusu.substack.com/">Blog</a>)</div>       Refereed Publications <ul> <li> Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents Boyu Gou, Ruohan Wang, Boyuan Zheng, Yanan Xie, Cheng Chang, Yiheng Shu, Huan Sun, Yu Su. 
In the International Conference on Learning Representations, 2025 (ICLR'25) [<a href="https://osu-nlp-group.github.io/UGround/">website</a>] [<a href="https://arxiv.org/abs/2410.05243">paper</a>] [<a href="https://github.com/OSU-NLP-Group/UGround">code</a>] Oral (1.8%) </li> <li> ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery Ziru Chen, Shijie Chen, Yuting Ning, Qianheng Zhang, Boshi Wang, Botao Yu, Yifei Li, Zeyi Liao, Chen Wei, Zitong Lu, Vishal Dey, Mingyi Xue, Frazier N. Baker, Benjamin Burns, Daniel Adu-Ampratwum, Xuhui Huang, Xia Ning, Song Gao, Yu Su, Huan Sun. In the International Conference on Learning Representations, 2025 (ICLR'25) [<a href="https://osu-nlp-group.github.io/ScienceAgentBench/">website</a>] [<a href="https://arxiv.org/abs/2410.05080">paper</a>] [<a href="https://github.com/OSU-NLP-Group/ScienceAgentBench">code</a>] </li> <li> Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers Shijie Chen, Bernal Jiménez Gutiérrez, Yu Su. In the International Conference on Learning Representations, 2025 (ICLR'25) [<a href="https://arxiv.org/abs/2410.02642">paper</a>] [<a href="https://github.com/OSU-NLP-Group/In-Context-Reranking">code</a>] </li> <li> VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents Xiao Liu*, Tianjie Zhang*, Yu Gu*, Iat Long Iong, Yifan Xu, Xixuan Song, Shudan Zhang, Hanyu Lai, Xinyi Liu, Hanlin Zhao, Jiadai Sun, Xinyue Yang, Yu Yang, Zehan Qi, Shuntian Yao, Xueqiao Sun, Siyi Cheng, Qinkai Zheng, Hao Yu, Hanchen Zhang, Wenyi Hong, Ming Ding, Lihang Pan, Xiaotao Gu, Aohan Zeng, Zhengxiao Du, Chan Hee Song, Yu Su, Yuxiao Dong, Jie Tang. 
In the International Conference on Learning Representations, 2025 (ICLR'25) [<a href="https://arxiv.org/abs/2408.06327">paper</a>] [<a href="https://github.com/THUDM/VisualAgentBench/">code</a>] (*: Equal Contribution) </li> <li> HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models Bernal Jiménez Gutiérrez, Yiheng Shu, Yu Gu, Michihiro Yasunaga, Yu Su. In the Conference on Neural Information Processing Systems, 2024 (NeurIPS'24) [<a href="https://arxiv.org/abs/2405.14831">paper</a>] [<a href="https://github.com/OSU-NLP-Group/HippoRAG">code</a>] </li> <li> Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization Boshi Wang, Xiang Yue, Yu Su, Huan Sun. In the Conference on Neural Information Processing Systems, 2024 (NeurIPS'24) [<a href="https://arxiv.org/abs/2405.15071">paper</a>] [<a href="https://github.com/OSU-NLP-Group/GrokkedTransformer">code</a>] </li> <li> Fine-Tuning is Fine, if Calibrated Zheda Mai, Arpita Chowdhury, Ping Zhang, Cheng-Hao Tu, Hong-You Chen, Vardaan Pahuja, Tanya Berger-Wolf, Song Gao, Charles Stewart, Yu Su, Wei-Lun Chao. In the Conference on Neural Information Processing Systems, 2024 (NeurIPS'24) [<a href="https://arxiv.org/abs/2409.16223">paper</a>] </li> <li> VLM4Bio: A Benchmark Dataset to Evaluate Pretrained Vision-Language Models for Trait Discovery from Biological Images M Maruf, Arka Daw, Kazi Sajeed Mehrab, Harish Babu Manogaran, Abhilash Neog, Medha Sawhney, Mridul Khurana, James P Balhoff, Yasin Bakis, Bahadir Altintas, Matthew J Thompson, Elizabeth G Campolongo, Josef C Uyeda, Hilmar Lapp, Henry L Bart, Paula M Mabee, Yu Su, Wei-Lun Chao, Charles Stewart, Tanya Berger-Wolf, Wasila Dahdul, Anuj Karpatne. 
In the Conference on Neural Information Processing Systems, 2024 (NeurIPS'24 D&B) [<a href="https://arxiv.org/abs/2408.16176">paper</a>] </li> <li> Middleware for LLMs: Tools are Instrumental for Language Agents in Complex Environments Yu Gu, Yiheng Shu, Hao Yu, Xiao Liu, Yuxiao Dong, Jie Tang, Jayanth Srinivasa, Hugo Latapie, Yu Su. In the Conference on Empirical Methods in Natural Language Processing, 2024 (EMNLP'24) [<a href="https://arxiv.org/abs/2402.14672">paper</a>] </li> <li> Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning Tinghui Zhu*, Kai Zhang*, Jian Xie, Yu Su. In the Conference on Language Modeling, 2024 (COLM'24) [<a href="https://arxiv.org/abs/2401.17686">paper</a>] [<a href="https://github.com/OSU-NLP-Group/Deductive-Beam-Search">code</a>] (*: Equal Contribution) </li> <li> Reviving the Context: Camera Trap Species Classification as Link Prediction on Multimodal Knowledge Graphs Vardaan Pahuja, Weidi Luo, Yu Gu, Cheng-Hao Tu, Hong-You Chen, Tanya Berger-Wolf, Charles Stewart, Song Gao, Wei-Lun Chao, Yu Su. In the ACM International Conference on Information and Knowledge Management (CIKM'24) [<a href="https://arxiv.org/abs/2401.00608">paper</a>] </li> <li> LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error Boshi Wang, Hao Fang, Jason Eisner, Benjamin Van Durme, Yu Su. In the Annual Conference of the Association for Computational Linguistics, 2024 (ACL'24) [<a href="https://arxiv.org/abs/2403.04746">paper</a>][<a href="https://github.com/microsoft/simulated-trial-and-error">code</a>] </li> <li> When is Tree Search Useful for LLM Planning? It Depends on the Discriminator Ziru Chen, Michael White, Raymond Mooney, Ali Payani, Yu Su, Huan Sun. 
In the Annual Conference of the Association for Computational Linguistics, 2024 (ACL'24) [<a href="https://arxiv.org/abs/2402.10890">paper</a>] </li> <li> MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions Kai Zhang, Yi Luan, Hexiang Hu, Kenton Lee, Siyuan Qiao, Wenhu Chen, Yu Su, Ming-Wei Chang. In the International Conference on Machine Learning, 2024 (ICML'24) [<a href="https://open-vision-language.github.io/MagicLens/">project</a>][<a href="https://arxiv.org/abs/2403.19651">paper</a>] Oral (1.5%) </li> <li> GPT-4V(ision) is a Generalist Web Agent, if Grounded Boyuan Zheng, Boyu Gou, Jihyung Kil, Huan Sun, Yu Su. In the International Conference on Machine Learning, 2024 (ICML'24) [<a href="https://osu-nlp-group.github.io/SeeAct/">project website</a>] [<a href="https://arxiv.org/abs/2401.01614">paper</a>] [<a href="https://github.com/OSU-NLP-Group/SeeAct/">code</a>] </li> <li> TravelPlanner: A Benchmark for Real-World Planning with Language Agents Jian Xie*, Kai Zhang*, Jiangjie Chen, Tinghui Zhu, Renze Lou, Yuandong Tian, Yanghua Xiao, Yu Su. In the International Conference on Machine Learning, 2024 (ICML'24) [<a href="https://osu-nlp-group.github.io/TravelPlanner/">project website</a>] [<a href="https://arxiv.org/abs/2402.01622">paper</a>] [<a href="https://github.com/OSU-NLP-Group/TravelPlanner">code</a>] (*: Equal Contribution) Spotlight (3.5%) </li> <li> BioCLIP: A Vision Foundation Model for the Tree of Life Samuel Stevens*, Jiaman Wu*, Matthew J Thompson, Elizabeth G Campolongo, Chan Hee Song, David Edward Carlyn, Li Dong, Wasila M Dahdul, Charles Stewart, Tanya Berger-Wolf, Wei-Lun Chao, Yu Su. 
In the Conference on Computer Vision and Pattern Recognition, 2024 (CVPR'24) [<a href="https://imageomics.github.io/bioclip/">project website</a>] [<a href="https://arxiv.org/abs/2311.18803">paper</a>] [<a href="https://github.com/Imageomics/BioCLIP">code</a>] (*: Equal Contribution) Best Student Paper </li> <li> MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI Xiang Yue*, Yuansheng Ni, Kai Zhang, Tianyu Zheng, Ruoqi Liu, Ge Zhang, Samuel Stevens, Dongfu Jiang, Weiming Ren, Yuxuan Sun, Cong Wei, Botao Yu, Ruibin Yuan, Renliang Sun, Ming Yin, Boyuan Zheng, Zhenzhu Yang, Yibo Liu, Wenhao Huang, Huan Sun, Yu Su*, Wenhu Chen*. In the Conference on Computer Vision and Pattern Recognition, 2024 (CVPR'24) [<a href="https://mmmu-benchmark.github.io/">project website</a>] [<a href="https://arxiv.org/abs/2311.16502">paper</a>] [<a href="https://github.com/MMMU-Benchmark/MMMU">code</a>] [<a href="https://huggingface.co/datasets/MMMU/MMMU">data</a>] (*: corresponding authors) Best Paper Finalist (0.2%); Oral (0.8%) </li> <li> Dual-View Visual Contextualization for Web Navigation Jihyung Kil, Chan Hee Song, Boyuan Zheng, Xiang Deng, Yu Su, Wei-Lun Chao. In the Conference on Computer Vision and Pattern Recognition, 2024 (CVPR'24) [<a href="https://arxiv.org/abs/2402.04476">paper</a>] </li> <li> Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts Jian Xie*, Kai Zhang*, Jiangjie Chen, Renze Lou, Yu Su. In the International Conference on Learning Representations, 2024 (ICLR'24) [<a href="https://arxiv.org/abs/2305.13300">paper</a>] [<a href="https://github.com/OSU-NLP-Group/LLM-Knowledge-Conflict">code</a>] (*: Equal Contribution) Spotlight (5%) </li> <li> MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning Xiang Yue, Xingwei Qu, Ge Zhang, Yao Fu, Wenhao Huang, Huan Sun, Yu Su, Wenhu Chen. 
In the International Conference on Learning Representations, 2024 (ICLR'24) [<a href="https://arxiv.org/abs/2309.05653">paper</a>] [<a href="https://tiger-ai-lab.github.io/MAmmoTH/">project</a>] [<a href="https://github.com/TIGER-AI-Lab/MAmmoTH">code</a>] [<a href="https://huggingface.co/datasets/TIGER-Lab/MathInstruct">data</a>] Spotlight (5%) </li> <li> AgentBench: Evaluating LLMs as Agents Xiao Liu*, Hao Yu*, Hanchen Zhang, Yifan Xu, Xuanyu Lei, Hanyu Lai, Yu Gu, Hangliang Ding, Kaiwen Men, Kejuan Yang, Shudan Zhang, Xiang Deng, Aohan Zeng, Zhengxiao Du, Chenhui Zhang, Sheng Shen, Tianjun Zhang, Yu Su, Huan Sun, Minlie Huang, Yuxiao Dong, Jie Tang. In the International Conference on Learning Representations, 2024 (ICLR'24) [<a href="https://arxiv.org/abs/2308.03688">paper</a>] [<a href="https://github.com/THUDM/AgentBench">code</a>] (*: Equal Contribution) </li> <li> MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following Renze Lou, Kai Zhang, Jian Xie, Yuxuan Sun, Janice Ahn, Hanzi Xu, Yu Su, Wenpeng Yin. In the International Conference on Learning Representations, 2024 (ICLR'24) [<a href="https://renzelou.github.io/Muffin/">project website</a>] [<a href="https://arxiv.org/abs/2312.02436">paper</a>] [<a href="https://github.com/RenzeLou/Muffin">code</a>] </li> <li> A Simple Interpretable Transformer for Fine-Grained Image Classification and Analysis Dipanjyoti Paul, Arpita Chowdhury, Xinqi Xiong, Feng-Ju Chang, David Carlyn, Samuel Stevens, Kaiya Provost, Anuj Karpatne, Bryan Carstens, Daniel Rubenstein, Charles Stewart, Tanya Berger-Wolf, Yu Su, Wei-Lun Chao. In the International Conference on Learning Representations, 2024 (ICLR'24) [<a href="https://arxiv.org/abs/2311.04157">paper</a>] [<a href="https://github.com/Imageomics/INTR">code</a>] </li> <li> Automatic Evaluation of Attribution by Large Language Models Xiang Yue, Boshi Wang, Kai Zhang, Ziru Chen, Yu Su, Huan Sun. 
In the Findings of the Conference on Empirical Methods in Natural Language Processing, 2023 (EMNLP'23: Findings) [<a href="https://arxiv.org/abs/2305.06311">paper</a>] [<a href="https://github.com/OSU-NLP-Group/AttrScore">code</a>] [<a href="https://huggingface.co/datasets/osunlp/AttrScore">data</a>] </li> <li> Error Detection for Text-to-SQL Semantic Parsing Shijie Chen, Ziru Chen, Huan Sun, Yu Su. In the Findings of the Conference on Empirical Methods in Natural Language Processing, 2023 (EMNLP'23: Findings) [<a href="https://arxiv.org/abs/2305.13683">paper</a>] [<a href="https://internlp.github.io/2022/documents/2022/papers/25.pdf">InterNLP workshop at NeurIPS'22</a>] </li> <li> Solving the Right Problem is Key for Translational NLP: A Case Study in UMLS Vocabulary Insertion Bernal Jimenez Gutierrez, Yuqing Mao, Vinh Nguyen, Kin Wah Fung, Yu Su, Olivier Bodenreider. In the Findings of the Conference on Empirical Methods in Natural Language Processing, 2023 (EMNLP'23: Findings) [<a href="https://arxiv.org/abs/2311.15106">paper</a>] </li> <li> Mind2Web: Towards a Generalist Agent for the Web Xiang Deng, Yu Gu, Boyuan Zheng, Shijie Chen, Samuel Stevens, Boshi Wang, Huan Sun, Yu Su. In the Conference on Neural Information Processing Systems, 2023 (NeurIPS'23) [<a href="https://arxiv.org/abs/2306.06070">paper</a>] [<a href="https://osu-nlp-group.github.io/Mind2Web/">project website</a>] [<a href="https://github.com/OSU-NLP-Group/Mind2Web">code</a>] Spotlight (5%) </li> <li> MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing Kai Zhang*, Lingbo Mo*, Wenhu Chen, Huan Sun, Yu Su. 
In the Conference on Neural Information Processing Systems, 2023 (NeurIPS'23) [<a href="https://arxiv.org/abs/2306.10012">paper</a>] [<a href="https://osu-nlp-group.github.io/MagicBrush/">project website</a>] [<a href="https://github.com/OSU-NLP-Group/MagicBrush">code</a>] (*: Equal Contribution) </li> <li> Holistic Transfer: Towards Non-Disruptive Fine-Tuning with Partial Target Data Cheng-Hao Tu, Hong-You Chen, Jike Zhong, Zheda Mai, Vardaan Pahuja, Tanya Berger-Wolf, Song Gao, Charles Stewart, Yu Su, Wei-Lun Chao. In the Conference on Neural Information Processing Systems, 2023 (NeurIPS'23) [<a href="https://arxiv.org/abs/2311.01420">paper</a>] </li> <li> LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models Chan Hee Song, Jiaman Wu, Clayton Washington, Brian M. Sadler, Wei-Lun Chao, Yu Su. In the International Conference on Computer Vision, 2023 (ICCV'23) [<a href="https://arxiv.org/abs/2212.04088">paper</a>] [<a href="https://dki-lab.github.io/LLM-Planner/">project website</a>] [<a href="https://github.com/OSU-NLP-Group/LLM-Planner">code</a>] [<a href="https://embodied-ai.org/papers/2023/7.pdf">Embodied AI workshop at CVPR'23</a>] </li> <li> A Retrieve-and-Read Framework for Knowledge Graph Link Prediction Vardaan Pahuja, Boshi Wang, Hugo Latapie, Jayanth Srinivasa, Yu Su. In the ACM International Conference on Information and Knowledge Management, 2023 (CIKM'23) [<a href="https://arxiv.org/abs/2212.09724">paper</a>] </li>  <li> Biomedical Language Models are Robust to Sub-optimal Tokenization Bernal Jimenez Gutierrez, Huan Sun, Yu Su. In the 22nd BioNLP Workshop at ACL, 2023 (BioNLP'23) [<a href="https://arxiv.org/abs/2306.17649">paper</a>] [<a href="https://github.com/OSU-NLP-Group/Bio-Tokenization">code</a>] </li> <li> Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments Yu Gu, Xiang Deng, Yu Su. 
In the Annual Conference of the Association for Computational Linguistics, 2023 (ACL'23) [<a href="https://arxiv.org/abs/2212.09736">paper</a>] [<a href="https://github.com/dki-lab/Pangu">code</a>] Outstanding Paper Award </li> <li> Few-shot In-context Learning on Knowledge Base Question Answering Tianle Li, Xueguang Ma, Alex Zhuang, Yu Gu, Yu Su, Wenhu Chen. In the Annual Conference of the Association for Computational Linguistics, 2023 (ACL'23) [<a href="https://arxiv.org/abs/2305.01750">paper</a>] [<a href="https://github.com/ltl3A87/KB-BINDER">code</a>] </li> <li> Federated Learning for Semantic Parsing: Task Formulation, Evaluation Setup, New Algorithms Tianshu Zhang, Changchang Liu, Wei-Han Lee, Yu Su, Huan Sun. In the Annual Conference of the Association for Computational Linguistics, 2023 (ACL'23) [<a href="https://arxiv.org/abs/2305.17221">paper</a>] [<a href="https://github.com/OSU-NLP-Group/FL4SemanticParsing">code</a>] </li> <li> Privacy-Preserving Domain Adaptation of Semantic Parsers Fatemehsadat Mireshghallah, Yu Su, Tatsunori Hashimoto, Jason Eisner, Richard Shin. In the Annual Conference of the Association for Computational Linguistics, 2023 (ACL'23) [<a href="https://arxiv.org/abs/2212.10520">paper</a>] </li> <li> Text-to-SQL Error Correction with Language Models of Code Ziru Chen, Shijie Chen, Michael White, Raymond Mooney, Ali Payani, Jayanth Srinivasa, Yu Su, Huan Sun. In the Annual Conference of the Association for Computational Linguistics, 2023, short paper (ACL'23) [<a href="https://arxiv.org/abs/2305.13073">paper</a>] [<a href="https://github.com/OSU-NLP-Group/Auto-SQL-Correction">code</a>] </li> <li> Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors Kai Zhang, Bernal Jimenez Gutierrez, Yu Su. 
Findings of the Annual Conference of the Association for Computational Linguistics, 2023 (Findings of ACL'23) [<a href="https://arxiv.org/abs/2305.11159">paper</a>] [<a href="https://github.com/OSU-NLP-Group/QA4RE">code</a>] </li> <li> When More Data Hurts: A Troubling Quirk in Developing Broad-Coverage Natural Language Understanding Systems Elias Stengel-Eskin, Emmanouil Antonios Platanios, Adam Pauls, Sam Thomson, Hao Fang, Benjamin Van Durme, Jason Eisner, Yu Su. In the Proc. of the Conference on Empirical Methods in Natural Language Processing, 2022 (EMNLP'22) [<a href="https://arxiv.org/abs/2205.12228">paper</a>] [<a href="https://github.com/microsoft/nlu-incremental-symbol-learning">code</a>] </li> <li> Thinking about GPT-3 In-Context Learning for Biomedical IE? Think Again Bernal Jiménez Gutiérrez, Nikolas McNeal, Clay Washington, You Chen, Lang Li, Huan Sun, Yu Su. Findings of the Conference on Empirical Methods in Natural Language Processing, 2022 (EMNLP'22: Findings) [<a href="https://arxiv.org/abs/2203.08410">paper</a>] [<a href="https://github.com/dki-lab/few-shot-bioIE">code</a>] </li> <li> Knowledge Base Question Answering: A Semantic Parsing Perspective Yu Gu, Vardaan Pahuja, Gong Cheng, Yu Su. In the Conference on Automated Knowledge Base Construction, 2022 (AKBC'22) [<a href="https://arxiv.org/abs/2209.04994">paper</a>] </li> <li> ArcaneQA: Dynamic Program Induction and Contextualized Encoding for Knowledge Base Question Answering Yu Gu and Yu Su. In the International Conference on Computational Linguistics, 2022 (COLING'22) [<a href="https://arxiv.org/abs/2204.08109">paper</a>] [<a href="https://github.com/dki-lab/ArcaneQA">code</a>] Outstanding Paper Award </li> <li> Bootstrapping a User-Centered Task-Oriented Dialogue System Shijie Chen, Ziru Chen, Xiang Deng, Ashley Lewis, Lingbo Mo, Samuel Stevens, Zhen Wang, Xiang Yue, Tianshu Zhang, Yu Su, Huan Sun. 
Alexa Prize TaskBot Challenge Proceedings, 2022 [<a href="https://assets.amazon.science/9a/30/5e4931ec41d78abad730707ce95a/bootstrapping-a-user-centered-task-oriented-dialogue-system.pdf">paper</a>] </li> <li> Detecting Drug-Drug Interactions Between COVID-19 Therapies and Concomitant Medications Using the FDA Adverse Event Reporting System Eugene Jeong, Scott D Nelson, Yu Su, Bradley Malin, Lang Li, You Chen. Frontiers in Pharmacology, 2022 [<a href="https://www.frontiersin.org/articles/10.3389/fphar.2022.938552/full">paper</a>] </li> <li> Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansion Chen Zhao, Yu Su, Adam Pauls, Emmanouil Antonios Platanios. In the Annual Conference of the Association for Computational Linguistics, 2022 (ACL'22) [<a href="https://aclanthology.org/2022.acl-long.381/">paper</a>] [<a href="https://github.com/microsoft/text-to-sql-schema-expansion-generalization">code</a>] </li> <li> One Step at a Time: Long-Horizon Vision-and-Language Navigation with Milestones Chan Hee Song, Jihyung Kil, Tai-Yu Pan, Brian M. Sadler, Wei-Lun Chao, Yu Su. In the Conference on Computer Vision and Pattern Recognition, 2022 (CVPR'22) [<a href="https://arxiv.org/abs/2202.07028">paper</a>] [<a href="https://github.com/chanhee-luke/M-Track">code</a>] </li> <li> Random Control Selection for Conducting High-throughput Adverse Drug Events Screening Using Large-scale Longitudinal Health Data Chien-Wei Chiang, Pengyue Zhang, Macarius Donneyong, You Chen, Yu Su, Lang Li. CPT: Pharmacometrics & Systems Pharmacology, 2021 (PSP'21) [<a href="https://pubmed.ncbi.nlm.nih.gov/34313404/">paper</a>] </li> <li> An Investigation of Language Model Interpretability via Sentence Editing Samuel Stevens and Yu Su. In the Proc. 
of the BlackboxNLP Workshop at EMNLP, 2021 (BlackboxNLP'21) [<a href="https://arxiv.org/abs/2011.14039">paper</a>] [<a href="https://github.com/samuelstevens/sentence-editing-interpretability">code and data</a>] </li> <li> ReasonBERT: Pre-trained to Reason with Distant Supervision Xiang Deng, Yu Su, Alyssa Lees, You Wu, Cong Yu and Huan Sun. In the Proc. of the Conference on Empirical Methods in Natural Language Processing, 2021 (EMNLP'21) [<a href="https://arxiv.org/abs/2109.04912">paper</a>] [<a href="https://github.com/sunlab-osu/ReasonBERT">code and pre-trained models</a>] </li> <li> A Systematic Investigation of KB-Text Embedding Alignment at Scale Vardaan Pahuja, Yu Gu, Wenhu Chen, Mehdi Bahrami, Lei Liu, Wei-Peng Chen, and Yu Su. In the Proc. of the Annual Conference of the Association for Computational Linguistics, 2021 (ACL'21) [<a href="https://arxiv.org/abs/2106.01586">paper</a>] [<a href="https://github.com/dki-lab/joint-kb-text-embedding">code</a>] </li> <li> Compositional Generalization for Neural Semantic Parsing via Span-level Supervised Attention Pengcheng Yin, Hao Fang, Graham Neubig, Adam Pauls, Emmanouil Antonios Platanios, Yu Su, Sam Thomson, and Jacob Andreas. In the Proc. of the Annual Conference of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies, 2021, short paper (NAACL-HLT'21) [<a href="https://www.aclweb.org/anthology/2021.naacl-main.225.pdf">paper</a>] </li> <li> Beyond I.I.D.: Three Levels of Generalization for Question Answering on Knowledge Bases Yu Gu, Sue Kase, Michelle Vanni, Brian Sadler, Percy Liang, Xifeng Yan, and Yu Su. In the Proc. 
of the Web Conference (previously WWW), 2021 (TheWebConf'21) [<a href="https://arxiv.org/abs/2011.07743">paper</a>] [<a href="https://dki-lab.github.io/GrailQA/">data and leaderboard</a>] [<a href="https://github.com/dki-lab/GrailQA">code</a>] </li> <li> Task-Oriented Dialogue as Dataflow Synthesis Semantic Machines, Jacob Andreas, John Bufe, David Burkett, Charles Chen, Josh Clausman, Jean Crawford, Kate Crim, Jordan DeLoach, Leah Dorner, Jason Eisner, Hao Fang, Alan Guo, David Hall, Kristin Hayes, Kellie Hill, Diana Ho, Wendy Iwaszuk, Smriti Jha, Dan Klein, Jayant Krishnamurthy, Theo Lanman, Percy Liang, Christopher H. Lin, Ilya Lintsbakh, Andy McGovern, Aleksandr Nisnevich, Adam Pauls, Dmitrij Petters, Brent Read, Dan Roth, Subhro Roy, Jesse Rusak, Beth Short, Div Slomin, Ben Snyder, Stephon Striplin, Yu Su, Zachary Tellman, Sam Thomson, Andrei Vorobev, Izabela Witoszko, Jason Wolfe, Abby Wray, Yuchen Zhang and Alexander Zotov. Transactions of the Association for Computational Linguistics, 2020 (TACL’20) [<a href="https://www.microsoft.com/en-us/research/blog/dialogue-as-dataflow-a-new-approach-to-conversational-ai/?OCID=msr_blog_semantic_tw">blog</a>][<a href="https://www.mitpressjournals.org/doi/10.1162/tacl_a_00333">paper</a>] [<a href="https://github.com/microsoft/task_oriented_dialogue_as_dataflow_synthesis">code</a>] [<a href="https://microsoft.github.io/task_oriented_dialogue_as_dataflow_synthesis/">data and leaderboard</a>] [<a href="https://youtu.be/ZbpjLchrYgw?t=5908">Satya Nadella's Presentation at Microsoft Build 2019</a>] [<a href="https://www.youtube.com/watch?v=jFCiYlK6Rb8">talk at EMNLP'20</a>]  Deployed as Conversational Interface of Microsoft Outlook </li> <li> An Imitation Game for Learning Semantic Parsers from User Interaction Ziyu Yao, Yiqi Tang, Wen-tau Yih, Huan Sun, Yu Su. In Proc. 
of the Conference on Empirical Methods in Natural Language Processing, 2020 (EMNLP’20) [<a href="https://arxiv.org/abs/2005.00689">paper</a>] [<a href="https://github.com/sunlab-osu/MISP">code</a>] </li> <li> KGPT: Knowledge-Grounded Pre-Training for Data-to-Text Generation Wenhu Chen, Yu Su, Xifeng Yan, William Yang Wang. In Proc. of the Conference on Empirical Methods in Natural Language Processing, 2020 (EMNLP’20) [<a href="https://arxiv.org/abs/2010.02307">paper</a>] [<a href="https://github.com/wenhuchen/KGPT">code</a>] </li> <li> Document Classification for COVID-19 Literature Bernal Jiménez Gutiérrez, Juncheng Zeng, Dongdong Zhang, Ping Zhang, Yu Su. Findings of EMNLP'20, short paper. Presented at <a href="https://www.nlpcovid19workshop.org/">NLP-COVID</a> workshop at ACL'20 [<a href="https://arxiv.org/abs/2006.13816">paper</a>] </li> <li> Logical Natural Language Generation from Open-Domain Tables Wenhu Chen, Jianshu Chen, Yu Su, Zhiyu Chen, William Yang Wang. In Proc. of the Annual Conference of the Association for Computational Linguistics, 2020 (ACL’20) [<a href="https://arxiv.org/abs/2004.10404">paper</a>][<a href="https://github.com/wenhuchen/LogicNLG">code and data</a>] </li> <li> Model-based Interactive Semantic Parsing: A Unified Formulation and A Text-to-SQL Case Study Ziyu Yao, Yu Su, Huan Sun, Scott Wen-tau Yih. In Proc. of the Conference on Empirical Methods in Natural Language Processing, 2019 (EMNLP’19) [<a href="https://arxiv.org/abs/1910.05389">paper</a>][<a href="https://github.com/sunlab-osu/MISP">code</a>] </li> <li> HierCon: Hierarchical Organization of Technical Documents based on Concepts Keqian Li, Shiyang Li, Semih Yavuz, Hanwen Zha, Yu Su, and Xifeng Yan. In Proc. 
of the IEEE International Conference on Data Mining, 2019 (ICDM’19) [<a href="papers/icdm19_hiercon.pdf">paper</a>] Best of ICDM 2019 Selection </li> <li> Global Textual Relation Embedding for Relational Understanding Zhiyu Chen, Hanwen Zha, Honglei Liu, Wenhu Chen, Xifeng Yan and Yu Su. In Proc. of the Annual Conference of the Association for Computational Linguistics, 2019, short paper (ACL’19) [<a href="https://arxiv.org/abs/1906.00550">paper</a>] [<a href="https://github.com/czyssrs/GloREPlus">code and data</a>] </li> <li> How Large A Vocabulary Does Text Classification Need? A Variational Approach on Vocabulary Selection Wenhu Chen, Yu Su, Yilin Shen, Zhiyu Chen, Xifeng Yan and William Yang Wang. In Proc. of the Annual Conference of the North American Chapter of the Association for Computational Linguistics, 2019 (NAACL-HLT’19) [<a href="https://128.84.21.199/abs/1902.10339">paper</a>] </li> <li> Learning to Compose Topic-Aware Mixture of Experts for Zero-Shot Video Captioning Xin Wang, Jiawei Wu, Da Zhang, Yu Su, William Yang Wang. In Proc. of the AAAI Conference on Artificial Intelligence, 2019 (AAAI’19) [<a href="https://arxiv.org/abs/1811.02765">paper</a>] </li> <li> Concept Mining via Embedding Keqian Li, Hanwen Zha, Yu Su, Xifeng Yan. In Proc. of the IEEE International Conference on Data Mining, 2018 (ICDM’18) [<a href="papers/icdm18_concept.pdf">paper</a>] </li> <li> XL-NBT: A Cross-lingual Neural Belief Tracking Framework Wenhu Chen, Jianshu Chen, Yu Su, Xin Wang, Dong Yu, Xifeng Yan and William Yang Wang. In Proc. of the Conference on Empirical Methods in Natural Language Processing, 2018 (EMNLP’18) [<a href="https://arxiv.org/abs/1808.06244">paper</a>] </li> <li> What It Takes to Achieve 100% Condition Accuracy on WikiSQL Semih Yavuz, Izzeddin Gur, Yu Su and Xifeng Yan. In Proc. 
of the Conference on Empirical Methods in Natural Language Processing, 2018 (EMNLP’18) [<a href="http://aclweb.org/anthology/D18-1197">paper</a>] </li> <li> DialSQL: Dialogue Based Structured Query Generation Izzeddin Gur, Semih Yavuz, Yu Su, Xifeng Yan. In Proc. of the Annual Meeting of the Association for Computational Linguistics, 2018, oral (ACL’18) [<a href="papers/acl18_dialsql.pdf">paper</a>] </li> <li> Natural Language Interfaces with Fine-Grained User Interaction: A Case Study on Web APIs Yu Su, Ahmed Hassan Awadallah, Miaosen Wang, Ryen White. In Proc. of the International ACM SIGIR Conference on Research and Development in Information Retrieval, 2018, oral (SIGIR’18) [<a href="papers/sigir18_interactive_nli.pdf">paper</a>] [<a href="https://www.microsoft.com/en-us/research/blog/democratizing-apis-with-natural-language-interfaces/">Microsoft Research Blog</a>] </li> <li> Global Relation Embedding for Relation Extraction Yu Su*, Honglei Liu*, Semih Yavuz, Izzeddin Gur, Huan Sun, Xifeng Yan. In Proc. of the Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018 (NAACL-HLT’18) [<a href="https://arxiv.org/abs/1704.05958">paper</a>] [<a href="https://github.com/ppuliu/GloRE">code</a>] (*: Equal Contribution) </li> <li> Unsupervised Neural Categorization for Scientific Publications Keqian Li, Hanwen Zha, Yu Su, Xifeng Yan. In Proc. of the SIAM International Conference on Data Mining, 2018, oral (SDM’18) [<a href="papers/sdm18_categorization.pdf">paper</a>] </li> <li> Building Natural Language Interfaces to Web APIs Yu Su, Ahmed Hassan Awadallah, Madian Khabsa, Patrick Pantel, Michael Gamon, Mark Encarnacion. In Proc. 
of the ACM International Conference on Information and Knowledge Management, 2017, oral (CIKM’17) [<a href="papers/cikm17_nl2api.pdf">paper</a>] [<a href="https://www.microsoft.com/en-us/download/details.aspx?id=58222">data</a>] </li> <li> Cross-domain Semantic Parsing via Paraphrasing Yu Su, Xifeng Yan. In Proc. of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP’17) [<a href="https://arxiv.org/abs/1704.05974">paper</a>] [<a href="https://github.com/ysu1989/CrossSemparse">code</a>] </li> <li> An End-to-End Deep Framework for Answer Triggering with a Novel Group-Level Objective Jie Zhao, Yu Su, Ziyu Guan, Huan Sun. In Proc. of the 2017 Conference on Empirical Methods in Natural Language Processing, short paper (EMNLP’17) [<a href="papers/emnlp17_answer_triggering.pdf">paper</a>] </li> <li> Recovering Question Answering Errors via Query Revision Semih Yavuz, Izzeddin Gur, Yu Su, Xifeng Yan. In Proc. of the 2017 Conference on Empirical Methods in Natural Language Processing, short paper (EMNLP’17) [<a href="papers/emnlp17_question_revision.pdf">paper</a>] </li> <li> On Generating Characteristic-rich Question Sets for QA Evaluation Yu Su, Huan Sun, Brian Sadler, Mudhakar Srivatsa, Izzeddin Gur, Zenghui Yan, Xifeng Yan. In Proc. of the 2016 Conference on Empirical Methods in Natural Language Processing (EMNLP’16) [<a href="papers/emnlp16_graphquestions.pdf">paper</a>] [<a href="papers/emnlp16_graphquestions_appendix.pdf">appendix</a>] [<a href="https://github.com/ysu1989/GraphQuestions">data</a>] </li> <li> Improving Semantic Parsing via Answer Type Inference Semih Yavuz, Izzeddin Gur, Yu Su, Mudhakar Srivatsa, Xifeng Yan. In Proc. of the 2016 Conference on Empirical Methods in Natural Language Processing, oral (EMNLP’16) [<a href="papers/emnlp16_type.pdf">paper</a>] </li> <li> A Fast Kernel for Attributed Graphs Yu Su, Fangqiu Han, Richard E. Harang, Xifeng Yan. In Proc. 
of the SIAM International Conference on Data Mining, 2016, oral (SDM’16) [<a href="papers/sdm16_kernel.pdf">paper</a>] [<a href="papers/sdm16_kernel_appendix.pdf">appendix</a>] [<a href="papers/sdm16_kernel_slides.pptx">slides</a>] [<a href="papers/sdm16_kernel_poster.pdf">poster</a>] </li> <li> Table Cell Search for Question Answering Huan Sun, Hao Ma, Xiaodong He, Wen-Tau Yih, Yu Su, Xifeng Yan. In Proc. of the International World Wide Web Conference, 2016, oral (WWW’16) [<a href="papers/www16_table.pdf">paper</a>] </li> <li> Visual Graph Query Formulation and Exploration: A New Perspective on Information Retrieval at the Edge Sue Kase, Michelle Vanni, Joanne Knight, Yu Su, Xifeng Yan. In Proc. of SPIE 9851, Next-Generation Analyst IV, 2016 (SPIE Defense+Security’16) </li> <li> Exploiting Relevance Feedback in Knowledge Graph Search Yu Su, Shengqi Yang, Huan Sun, Mudhakar Srivatsa, Sue Kase, Michelle Vanni, Xifeng Yan. In Proc. of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015, oral (KDD’15) [<a href="papers/kdd15_feedback.pdf">paper</a>] [<a href="papers/kdd15_feedback_slides.pptx">slides</a>] [<a href="papers/kdd15_feedback_poster.pdf">poster</a>] [<a href="resources/grf_dataset.zip">data</a>] </li> <li> On the Validity of Geosocial Mobility Traces Zengbin Zhang, Lin Zhou, Xiaohan Zhao, Gang Wang, Yu Su, Miriam Metzger, Haitao Zheng, and Ben Y. Zhao. In Proc. 
of the ACM Workshop on Hot Topics in Networks, 2013 (HotNets’13) [<a href="papers/hotnets13_foursquare.pdf">paper</a>] </li> </ul>   </div>  </div>   <div class="contentBlock"> <a name="teaching"></a> <h2>Teaching</h2> <div class="divider"></div> <ul> <li>AU2022-2024: CSE6521 - Introduction to Artificial Intelligence (Graduate) </li> <li>SP2022: CSE5525 - Foundations of Speech and Language Processing (Undergrad & Graduate) </li> <li>AU2021: CSE6521 - Introduction to Artificial Intelligence (Graduate) </li> <li>AU2020: <a href="courses/au20/cse5539/">CSE5539 - Cutting-Edge Topics in Natural Language Processing</a> (Undergrad & Graduate)</li> <li>SP2020: <a href="courses/sp20/cse5243/">CSE5243 - Introduction to Data Mining</a> (Undergrad & Graduate)</li> </ul> </div> <div class="contentBlock"> <a name="sponsors"></a> <h2>Sponsors</h2> <div class="divider"></div> <ul> <li>We are grateful to NSF (awards <a href="https://www.nsf.gov/awardsearch/showAward?AWD_ID=2118240&HistoricalAwards=false">2118240</a>, <a href="https://www.nsf.gov/awardsearch/showAward?AWD_ID=2112606&HistoricalAwards=false">2112606</a>, <a href="https://www.nsf.gov/awardsearch/showAward?AWD_ID=2137806&HistoricalAwards=false">2137806</a>), ARL, NIH, NAIRR, Amazon, Orby AI, Walmart, Cisco, Fujitsu, and OSU TDAI for supporting our research. </li> </ul> </div> <div class="contentBlock"> <a name="contact"></a> <h2>Contact</h2> <div class="divider"></div> <ul> <li>Email: %s@osu.edu % 'su.809' </li> </ul> </div> </div> </body> </html>
