CINXE.COM
Sihao Chen
<!DOCTYPE html> <html lang="en"> <head> <meta charset="UTF-8"> <meta name="viewport" content="width=device-width, initial-scale=1"> <title>Sihao Chen</title> <meta name="description" content="Ph.D student at University of Pennsylvania."> <script src="https://code.jquery.com/jquery-3.3.1.min.js" integrity="sha256-FgpCb/KJQlLNfOu91ta32o/NMZxltwRo8QtmkMRdAu8=" crossorigin="anonymous"></script> <script src="https://cdnjs.cloudflare.com/ajax/libs/popper.js/1.12.3/umd/popper.min.js" integrity="sha384-vFJXuSJphROIrBnz7yo7oB41mKfc8JzQZiCq4NCceLEaO4IHwicKwpJf9c9IpFgh" crossorigin="anonymous"></script> <script src="https://maxcdn.bootstrapcdn.com/bootstrap/4.0.0-beta.2/js/bootstrap.min.js" integrity="sha384-alpBpkh1PFOepccYVYDB4do5UnbKysX5WZXm3XxPqe5iKTfUKjNkCk9SaVuEZflJ" crossorigin="anonymous"></script> <link rel="stylesheet" href="https://maxcdn.bootstrapcdn.com/bootstrap/4.0.0-beta.2/css/bootstrap.min.css" integrity="sha384-PsH8R72JQ3SOdhVi3uxftmaW6Vc51MKb0q5P2rRUpPvrszuE4W1povHYgTpBfshb" crossorigin="anonymous"> <link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/flat-ui/2.3.0/css/flat-ui.min.css" integrity="sha256-7bYJaNviFZlH+bKqZlshmYKeyvkp+fXBQuerWp2AXlA=" crossorigin="anonymous"/> <link rel="stylesheet" href="https://use.fontawesome.com/releases/v5.8.1/css/all.css" integrity="sha384-50oBUHEmvpQ+1lW4y57PTFmhCaXp0ML5d60M1M7uH2+nqUivzIebhndOJK28anvf" crossorigin="anonymous"> <link rel="stylesheet" href="css/style.css"> </head> <body> <div class="container"> <!-- <header id="header"> <div class="row"> <div class="col-md-6 col-sm-12"> <img src="static/img/sihao-square.jpg" width="260px" height="auto"> </div> <div class="col-md-6 col-sm-12"> <h4>Sihao Chen</h4> <ul class="list-identity"> <li> Applied Scientist</li> <li> Microsoft</li> </ul> <ul class="list-contact"> <li>sihaochen [at] microsoft [dot] com</li> <li></li> </ul> <div class="btn-group"> <a class="btn btn-lg btn-primary" href="https://github.com/schen149" target="_blank"><i class="fab fa-lg fa-github"></i></a> <a class="btn btn-lg btn-primary" href="static/pdf/cv_sihaochen.pdf" target="_blank">CV</a> <a class="btn btn-lg btn-primary" href="https://scholar.google.com/citations?user=PQ9dRCgAAAAJ&sortby=pubdate" target="_blank"><i class="fab fa-lg fa-google"></i></a> </div> </div> </div> <hr> </header> --> <section id="about"> <h6>⚠️</h6> <p>I recently graduated from Penn and have joined Microsoft. My website has moved to <a href="https://sihaoc.github.io">sihaoc.github.io</a>.</p> <hr> </section> <!-- <section id="publications"> <h6>Publications</h6> <a href="https://arxiv.org/abs/2312.06648" target="_blank">Dense X Retrieval: What Retrieval Granularity Should We Use?</a> <br> Tong Chen, Hongwei Wang, Sihao Chen, Wenhao Yu, Kaixin Ma, Xinran Zhao, Hongming Zhang, Dong Yu <br> <i>EMNLP, 2024</i> <br> <a href="https://huggingface.co/datasets/chentong00/factoid-wiki" target="_blank">[dataset]</a> <a href="https://github.com/chentong0/factoid-wiki" target="_blank">[code]</a> <br> <br> <a href="https://arxiv.org/abs/2407.10691" target="_blank">MixGR :Enhancing Retriever Generalization for Scientific Domain through Complementary Granularity</a> <br> Fengyu Cai, Xinran Zhao, Tong Chen, Sihao Chen, Hongming Zhang, Iryna Gurevych, Heinz Koeppl <br> <i>EMNLP, 2024</i> <br> <a href="https://github.com/TRUMANCFY/MixGR" target="_blank">[code]</a> <br> <br> <a href="https://arxiv.org/pdf/2405.02714" target="_blank">Beyond Relevance: Evaluate and Improve Retrievers on Perspective Awareness</a> <br> Xinran Zhao, Tong Chen, Sihao Chen, Hongming Zhang, Tongshuang Wu <br> <i>COLM, 2024</i> <br> <a href="https://github.com/colinzhaoust/pir" target="_blank">[code]</a> <br> <br> <a href="https://arxiv.org/pdf/2401.13136" target="_blank">The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts</a> <br> Lingfeng Shen, Weiting Tan, Sihao Chen, Yunmo Chen, Jingyu Zhang, Haoran Xu, Boyuan Zheng, Philipp Koehn, Daniel Khashabi <br> <i>ACL (Findings), 2024</i> <br> <br> <br> <a href="https://arxiv.org/pdf/2311.04335" target="_blank">Sub-sentence Encoder: Contrastive Learning of Propositional Semantic Representations</a> <br> Sihao Chen, Hongming Zhang, Tong Chen, Ben Zhou, Wenhao Yu, Dian Yu, Baolin Peng, Hongwei Wang, Dan Roth, Dong Yu <br> <i>NAACL, 2024</i> <br> <a href="https://github.com/schen149/sub-sentence-encoder" target="_blank">[code]</a> <br> <br> <a href="https://arxiv.org/pdf/2309.07852" target="_blank">ExpertQA: Expert-Curated Questions and Attributed Answers</a> <br> Chaitanya Malaviya, Subin Lee, Sihao Chen, Elizabeth Sieber, Mark Yatskar, Dan Roth <br> <i>NAACL, 2024</i> <br> <a href="https://github.com/chaitanyamalaviya/expertqa" target="_blank">[code]</a> <br> <br> <a href="https://arxiv.org/abs/2309.16155" target="_blank">The Trickle-down Impact of Reward (In-)consistency on RLHF</a> <br> Lingfeng Shen and Sihao Chen and Linfeng Song and Lifeng Jin and Baolin Peng and Haitao Mi and Daniel Khashabi and Dong Yu <br> <i>ICLR, 2024</i> <br> <a href="https://github.com/shadowkiller33/Contrast-Instruction" target="_blank">[code]</a> <br> <br> <a href="https://aclanthology.org/2023.findings-emnlp.274.pdf" target="_blank">Using LLM for Improving Key Event Discovery: Temporal-Guided News Stream Clustering with Event Summaries</a> <br> Nishanth Nakshatri, Siyi Liu, Sihao Chen, Daniel J Hopkins, Dan Roth, Dan Goldwasser <br> <i>EMNLP (Findings), 2023</i> <br> <br> <br> <a href="https://arxiv.org/pdf/2212.10750" target="_blank">PropSegmEnt: A Large-Scale Corpus for Proposition-Level Segmentation and Entailment Recognition</a> <br> Sihao Chen, Senaka Buthpitiya, Alex Fabrikant, Dan Roth, Tal Schuster <br> <i>ACL (Findings), 2023</i> <br> <a href="https://huggingface.co/datasets/sihaochen/propsegment" target="_blank">[dataset]</a> <a href="https://github.com/google-research-datasets/PropSegmEnt" target="_blank">[code]</a> <br> <br> <a href="https://arxiv.org/pdf/2204.07447" target="_blank">Stretching Sentence-Pair NLI datasets to Reason Over Long Document And Clusters</a> <br> Tal Schuster, Sihao Chen, Senaka Buthpitiya, Alex Fabrikant, Donald Metzler <br> <i>EMNLP (Findings), 2022</i> <br> <br> <br> <a href="https://aclanthology.org/2022.findings-naacl.22/" target="_blank">Design Challenge for a Multi-Perspective Search Engine</a> <br> Sihao Chen*, Siyi Liu*, Xander Uyttendaele, William Bruno, Yi Zhang and Dan Roth <br> <i>NAACL (Findings), 2022</i> <br> <a href="https://github.com/cogcomp/multi-persp-search-engine" target="_blank">[code]</a> <br> <br> <a href="static/pdf/CZSR21.pdf" target="_blank">Improving Faithfulness in Abstractive Summarization with Contrast Candidate Generation and Selection</a> <br> Sihao Chen, Fan Zhang, Kazoo Sone and Dan Roth <br> <i>NAACL, 2021</i> <br> <a href="https://github.com/CogComp/faithful_summarization" target="_blank">[code]</a> <br> <br> <a href="https://arxiv.org/pdf/2106.02725" target="_blank">MULTIOPED:A Corpus of Multi-Perspective News Editorials</a> <br> Siyi Liu, Sihao Chen, Xander Uyttendaele and Dan Roth <br> <i>NAACL, 2021</i> <br> <a href="https://github.com/CogComp/MultiOpEd" target="_blank">[dataset]</a> <a href="https://github.com/CogComp/MultiOpEd" target="_blank">[code]</a> <br> <br> <a href="https://www.aclweb.org/anthology/2020.findings-emnlp.117.pdf" target="_blank">Evaluating Models’ Local Decision Boundaries via Contrast Sets</a> <br> Matt Gardner, Yoav Artzi, Victoria Basmova, Jonathan Berant, Ben Bogin, Sihao Chen, Pradeep Dasigi, Dheeru Dua, Yanai Elazar, Ananth Gottumukkala, Nitish Gupta, Hanna Hajishirzi, Gabriel Ilharco, Daniel Khashabi, Kevin Lin, Jiangming Liu, Nelson F. Liu, Phoebe Mulcaire, Qiang Ning, Sameer Singh, Noah A. Smith, Sanjay Subramanian, Reut Tsarfaty, Eric Wallace, Ally Zhang, Ben Zhou <br> <i>EMNLP Findings, 2020</i> <br> <a href="https://allennlp.org/contrast-sets" target="_blank">[dataset]</a> <a href="https://github.com/allenai/contrast-sets" target="_blank">[code]</a> <br> <br> <a href="https://cogcomp.seas.upenn.edu/page/publication_view/925" target="_blank">Do VQA Models Know What to Look At?</a> <br> Weiyu Du, Sihao Chen, Yijie Zhao, Jianbo Shi, Dan Roth <br> <i>Women in CV (WiCV) Workshop at ECCV, 2020</i> <br> <br> <br> <a href="http://cogcomp.org/page/publication_view/874" target="_blank">PerspectroScope: A Window to the World of Diverse Perspectives </a> <br> Sihao Chen, Daniel Khashabi, Chris Callison-Burch and Dan Roth <br> <i>ACL - Demos, 2019</i> <br> <a href="https://github.com/CogComp/perspectroscope" target="_blank">[code]</a> <br> <br> <a href="http://cogcomp.org/page/publication_view/870" target="_blank">Seeing Things from a Different Angle: Discovering Diverse Perspectives about Claims</a> <br> Sihao Chen, Daniel Khashabi, Wenpeng Yin, Chris Callison-Burch and Dan Roth <br> <i>NAACL, 2019</i> <br> <a href="http://cogcomp.org/perspectrum/" target="_blank">[dataset]</a> <a href="https://github.com/CogComp/perspectrum" target="_blank">[code]</a> <br> <br> <h6>Preprints</h6> <a href="https://arxiv.org/pdf/2404.00205" target="_blank">Conceptual and Unbiased Reasoning in Language Models</a> <br> Ben Zhou, Hongming Zhang, Sihao Chen, Dian Yu, Hongwei Wang, Baolin Peng, Dan Roth, Dong Yu <br> <i>2024</i> <br> <br> <br> <a href="https://arxiv.org/pdf/2305.14882" target="_blank">Interpretable by Design Visual Question Answering</a> <br> Xingyu Fu, Ben Zhou, Sihao Chen, Mark Yatskar, Dan Roth <br> <i>2023</i> <br> <br> <br> <a href="https://arxiv.org/pdf/2304.03414" target="_blank">Towards Corpus-Scale Discovery of Selection Biases in News Coverage: Comparing What Sources Say About Entities as a Start</a> <br> Sihao Chen, William Bruno, Dan Roth <br> <i>2023</i> <br> <br> <br> <h6>Book Chapters</h6> <a href="https://books.google.com/books?hl=en&lr=&id=g7BtEAAAQBAJ&oi=fnd&pg=PA193&dq=info:n91V4Wf8jwYJ:scholar.google.com&ots=-vZS2XCboY&sig=5zYXo3EDbvKEvVb9oo76NWCdkM8#v=onepage&q&f=false" target="_blank">Toward Automatic Discovery of Diverse Perspectives</a> <br> Sihao Chen, Daniel Khashabi, Dan Roth <br> <i>In Creating a More Transparent Internet: The Perspective Web</i> <br> <i>Cambridge University Press, 2022</i> <br> <br> <br> <hr> </section> --> <!--<section id="service">--> <!-- <h6>Service</h6>--> <!-- --> <!-- <ul> <li>Program Committee: AAAI (2020, 2021), EMNLP (2019-2023), ACL (2021-2023, 2019), ACL Student Research Workshop (2019), IJCAI (2020), NAACL (2019, secondary), Computational Linguistics (2019, secondary)</li> </ul> --> <!-- <hr>--> <!--</section>--> </div> </body> </html>