<!doctype html><html lang=en-us><head><meta name=generator content="Hugo 0.114.0"><meta charset=utf-8><meta name=viewport content="width=device-width,initial-scale=1"><script src=https://code.jquery.com/jquery-3.5.1.slim.min.js integrity="sha256-4+XzXVhsDmqanXGHaHvgh1gMQKX40OUvDEBTu8JcmNs=" crossorigin=anonymous></script> <link rel=stylesheet href=https://cdn.jsdelivr.net/npm/bootstrap@4.5.3/dist/css/bootstrap.min.css integrity=sha384-TX8t27EcRE3e/ihU7zmQxVncDAy5uIKz4rEkgIXeMed4M0jlfIDPvg6uqKI2xXr2 crossorigin=anonymous><link rel=stylesheet href=https://cdn.jsdelivr.net/gh/jpswalsh/academicons@1/css/academicons.min.css><link rel=stylesheet href="https://fonts.googleapis.com/css2?family=Nanum+Myeongjo&family=Noto+Serif+JP&family=Cormorant+Garamond&family=Libre+Baskerville&family=Source+Serif+Pro&family=Crimson+Text&family=Inter&family=Crimson+Pro&family=Literata&family=Ubuntu+Mono&family=Inter&family=Roboto"><link rel=stylesheet type=text/css href=/css/style.css><title>Effy Xue Li</title><script type=application/javascript>var doNotTrack=!1;doNotTrack||(function(e,t,n,s,o,i,a){e.GoogleAnalyticsObject=o,e[o]=e[o]||function(){(e[o].q=e[o].q||[]).push(arguments)},e[o].l=1*new Date,i=t.createElement(n),a=t.getElementsByTagName(n)[0],i.async=1,i.src=s,a.parentNode.insertBefore(i,a)}(window,document,"script","https://www.google-analytics.com/analytics.js","ga"),ga("create","UA-51640218-1","auto"),ga("send","pageview"))</script></head><body class="container d-flex flex-column min-vh-100"><div class="row w-100"><div class="col w-100"><div class="about row w-100"><div class="row w-100 justify-content-center"><img class=rounded-circle src=images/profile.png alt=profile_picture id=profile_picture></div><div class="row w-100 justify-content-center main_color" id=full_name>Effy Xue Li</div><div class="affiliations row w-100 justify-content-center"><div class="row w-100 justify-content-center"><div class="row w-100 justify-content-center" id=title-name><span class=main_color 
id=title>Last-year Ph.D. candidate,</span> <span id=name class=text-muted>University of Amsterdam</span></div><div class="row w-100 justify-content-center text-muted" id=email></div></div></div><div class="socials row w-100 justify-content-center"><div class="row w-100 justify-content-center"><a class=main_color rel=me href='https://scholar.google.com/citations?hl=en&user=5OKcoiEAAAAJ' target=_blank><i class="ai ai-2x ai-google-scholar academic_icons_customize"></i></a> <a class=main_color rel=me href=https://github.com/effyli target=_blank><i data-feather=github></i></a> <a class=main_color rel=me href=https://twitter.com/effyli4 target=_blank><i data-feather=twitter></i></a> <a class=main_color rel=me href=https://linkedin.com/in/effy-li target=_blank><i data-feather=linkedin></i></a> <a class=main_color href=cv.pdf target=_blank><i class="ai ai-2x ai-cv academic_icons_customize"></i></a> <a id=blog_logo class=main_color href=/blog target=_blank><i class="ai ai-2x ai-blog academic_icons_customize">Blog</i></a></div></div><div class="introduction row w-100 text-justify"><div class="col w-100"><p>Hi! I’m Effy; you can also call me 李雪 [lǐ xuě]. I am currently a PhD student at the University of Amsterdam in the <a href=https://indelab.org/>INDE lab</a>, supervised by <a href=pgroth.com>Prof. Paul Groth</a> and <a href="https://scholar.google.de/citations?user=2EE-YUsAAAAJ&hl=de">Dr. Jan-Christoph Kalo</a>. I recently did an internship at <a href=https://motherduck.com/>MotherDuck</a> on LLMs for Data Management. Previously, I was an AI resident at Microsoft Research Cambridge, UK. Before that, I did my Master’s at the University of Edinburgh.</p><p>My PhD topic is Knowledge Graph Construction from Conversational Data. Specifically, I look into how we can better utilize (large) language models to extract information such as entities and relations and, furthermore, make (large) language models more data-efficient, robust and adaptable. 
I am also interested in efficiently using LLMs for Data Management. My PhD research is funded by and situated in the NWO project <a href=https://in-sight.it/><em>IN-SIGHT.it</em></a>.</p><p>I am originally from China and have lived in the UK and the Netherlands. When I am not in front of my laptop, you can often find me bouldering (at a complete beginner level), cycling or drinking coffee (like everyone else :D). If you are keen to collaborate or exchange ideas over a coffee, you can reach out to me at x DOT li3 AT uva DOT nl.</p></div></div></div><div class="about row w-100"><div class="interests col"><h4 class=main_color>Interests</h4><ul><li>Knowledge graph construction from conversations</li><li>LLMs for Information Extraction</li><li>LLMs for Data Management</li><li>Domain adaptation for NLP</li></ul></div><div class="academia col"><h4 class=main_color>Academia</h4><div class=row><div class="course col"><div class="institution_dates row"><div class="institution col" id=institution_title>University of Amsterdam</div><div class="main_color dates col-md-auto justify-content-end">2020-10 - present</div></div><div class="degree_major row"><div class="text-muted col">Ph.D. Knowledge Graph Construction</div></div></div></div><div class=row><div class="course col"><div class="institution_dates row"><div class="institution col" id=institution_title>University of Edinburgh</div><div class="main_color dates col-md-auto justify-content-end">2017 - 2018</div></div><div class="degree_major row"><div class="text-muted col">M.Sc. Artificial Intelligence</div></div><div class="other_info row text-muted"><div class=col>supervised by Prof. Shay Cohen.</div></div></div></div><div class=row><div class="course col"><div class="institution_dates row"><div class="institution col" id=institution_title>Guangzhou University</div><div class="main_color dates col-md-auto justify-content-end">2012 - 2016</div></div><div class="degree_major row"><div class="text-muted col">B.Eng. 
Electronic Information Engineering</div></div><div class="other_info row text-muted"><div class=col>graduated with a top 1% GPA and a National Scholarship</div></div></div></div></div></div></div></div><div class="row w-100"><div class="col w-100"><div class="row w-100"><div class="col w-100"><h3 class="row w-100 section-title main_color">News</h3></div></div><div class="news row w-100"><div class="col w-100"><ul class=news_list><li><i data-feather=file-text></i> I was at VLDB'24 in Guangzhou, China. Here is my <a href=blog/blog_vldb>trip report</a> <span class="secondary_font text-muted">, August 2024.</span></li><li><i data-feather=file-text></i> Our paper titled <em>‘How different is different? Systematically identifying distribution shifts and their impacts in NER datasets’</em> got accepted to <a href=https://link.springer.com/article/10.1007/s10579-024-09754-8>Language Resources and Evaluation (LREC)</a>! <span class="secondary_font text-muted">, July 2024.</span></li><li><i data-feather=file-text></i> Our paper titled <em>‘Towards Efficient Data Wrangling with LLMs using Code Generation’</em> got accepted to <a href=https://deem-workshop.github.io/>DEEM@SIGMOD'24</a>! I will be in Chile to present our work. <span class="secondary_font text-muted">, May 2024.</span></li><li><i data-feather=file-text></i> Our paper titled <em>‘Do Instruction-tuned Large Language Models Help with Relation Extraction?’</em> got accepted to the <a href=https://lm-kbc.github.io/workshop2023/>ISWC LM-KBC workshop</a>! <span class="secondary_font text-muted">, August 2023.</span></li><li><i data-feather=file-text></i> Our team placed 2nd in the <a href=https://lm-kbc.github.io/challenge2023/>LM-KBC challenge</a> on Track 2! 
<span class="secondary_font text-muted">, August 2023.</span></li></ul></div></div></div></div><div class="row w-100"><div class="col w-100"><div class="row w-100"><div class="col w-100"><h3 class="row w-100 section-title main_color">Recent Publications</h3></div></div><div class="publications row w-100"><div class="col w-100"><div class="publication row w-100"><div class="section-1 w-100"><span>How different is different? Systematically identifying distribution shifts and their impacts in NER datasets,</span> <span class="text-muted publication-date">2024,</span> <span class="text-muted publication-name">Language Resources and Evaluation (LREC)</span></div><div class="section-2 text-muted w-100"><span>Xue Li</span> <span class=separator>,</span> <span>Paul Groth</span></div><div class="section-3 w-100"><a class="main_color text-decoration-none rounded" href=https://link.springer.com/article/10.1007/s10579-024-09754-8 target=_blank>paper</a></div></div><div class="publication row w-100"><div class="section-1 w-100"><span>Towards Efficient Data Wrangling with LLMs using Code Generation,</span> <span class="text-muted publication-date">2024,</span> <span class="text-muted publication-name">DEEM@SIGMOD'24</span></div><div class="section-2 text-muted w-100"><span>Xue Li</span> <span class=separator>,</span> <span>Till Döhmen</span></div><div class="section-3 w-100"><a class="main_color text-decoration-none rounded" href=https://github.com/effyli/efficient_llm_data_wrangling target=_blank>code</a> <a class="main_color text-decoration-none rounded" href=waddle_code.pdf target=_blank>pdf</a> <a class="main_color text-decoration-none rounded" href=waddle_code_poster.pdf target=_blank>poster</a></div></div><div class="publication row w-100"><div class="section-1 w-100"><span>Do Instruction-tuned Large Language Models Help with Relation Extraction?,</span> <span class="text-muted publication-date">2023,</span> <span class="text-muted publication-name">ISWC 2023 LM-KBC 
workshop</span></div><div class="section-2 text-muted w-100"><span>Xue Li</span> <span class=separator>,</span> <span>Fina Polat</span> <span class=separator>,</span> <span>Paul Groth</span></div><div class="section-3 w-100"><a class="main_color text-decoration-none rounded" href=llmre.pdf target=_blank>pdf</a></div></div><div class="publication row w-100"><div class="section-1 w-100"><span>Knowledge-centric Prompt Composition for Knowledge Base Construction from Pre-trained Language Models,</span> <span class="text-muted publication-date">2023,</span> <span class="text-muted publication-name">ISWC 2023 LM-KBC workshop</span></div><div class="section-2 text-muted w-100"><span>Xue Li</span> <span class=separator>,</span> <span>Author Name</span></div><div class="section-3 w-100"><a class="main_color text-decoration-none rounded" href=https://github.com/effyli/lm-kbc target=_blank>code</a> <a class="main_color text-decoration-none rounded" href=challenge.pdf target=_blank>pdf</a> <a class="main_color text-decoration-none rounded" href=lm-kbc-poster.pdf target=_blank>poster</a></div></div><div class="publication row w-100"><div class="section-1 w-100"><span>The Challenges of Cross-Document Coreference Resolution for Email,</span> <span class="text-muted publication-date">2021,</span> <span class="text-muted publication-name">Proceedings of the 11th Knowledge Capture Conference</span></div><div class="section-2 text-muted w-100"><span>Xue Li</span> <span class=separator>,</span> <span>Sara Magliacane</span> <span class=separator>,</span> <span>Paul Groth</span></div><div class="section-3 w-100"><a class="main_color text-decoration-none rounded" href=https://dl.acm.org/doi/abs/10.1145/3460210.3493573 target=_blank>pdf</a></div></div></div></div></div></div><div class="row w-100"><div class="col w-100"><div class="row w-100"><div class="col w-100"><h3 class="row w-100 section-title main_color">Teaching and Supervision</h3></div></div><div class="projects row w-100"><div 
class="col w-100"><div class="project row w-100"><div class="section-1 w-100"><span class=project-title>2023 October - December: Teaching Assistant NLP1 UvA</span></div></div><div class="project row w-100"><div class="section-1 w-100"><span class=project-title>2023 January - July: Master thesis supervision. Student: Casper Smit, title: Graphical information for cross-document coreference resolution in emails.</span></div><div class="section-2 w-100"><a class="main_color text-decoration-none rounded" href=https://www.linkedin.com/in/casper-smit-12661b13b/ target=_blank>student website</a></div></div><div class="project row w-100"><div class="section-1 w-100"><span class=project-title>2023 April - May: Teaching Assistant Data and Knowledge UvA</span></div></div><div class="project row w-100"><div class="section-1 w-100"><span class=project-title>2022 January - June: Master thesis supervision. Student: Timothy Dorr; Title: Not All Names Are WEIRD-Identifying Name Origin Bias in Named Entity Recognition Tasks on Email Data</span></div><div class="section-2 w-100"><a class="main_color text-decoration-none rounded" href=https://www.linkedin.com/in/timothy-dennys-doerr/ target=_blank>student website</a> <a class="main_color text-decoration-none rounded" href="https://www.linkedin.com/in/timothy-dennys-doerr/overlay/1635497433007/single-media-viewerprofileId=ACoAACeZJUABTniKbIVg_qI1fsq2RHPT625GCHs%29" target=_blank>thesis</a></div></div><div class="project row w-100"><div class="section-1 w-100"><span class=project-title>2022 February - April: Teaching Assistant Causal Data Science UvA</span></div></div></div></div></div></div><footer class="mt-auto d-flex justify-content-center text-muted small secondary_font"><span class=text-muted>Copyright (c) 2024, Effy Xue Li, <a class=text-muted href=https://github.com/hadisinaee/avicenna target=_blank>created by Avicenna (MIT)</a></span></footer><script src=https://cdn.jsdelivr.net/npm/bootstrap@4.5.3/dist/js/bootstrap.bundle.min.js 
integrity=sha384-ho+j7jyWK8fNQe+A12Hb8AhRq26LrZ/JpcUGGOn+Y7RsweNrtN/tE3MoK7ZeZDyx crossorigin=anonymous></script> <script src=https://cdnjs.cloudflare.com/ajax/libs/feather-icons/4.28.0/feather.min.js></script> <script>feather.replace()</script></body></html>