Andrea Wen-Yi Wang
Andrea standing in front of a row of trees with red autumn leaves.
andreawwenyi [at] infosci.cornell.edu
CV Google Scholar GitHub Twitter
I'm a third-year PhD student at Cornell Information Science, advised by David Mimno.
My research interest is in developing NLP tools that meaningfully support social science researchers, who build insights through reading large collections of documents.

I'm looking for an internship for Summer 2025. If you think I could be a good fit, please drop me an email!

Prior to joining Cornell, I was a data scientist at New York University's Public Safety Lab, where I worked on the Jail Data Initiative with Orion Taylor and Anna Harvey. I was also a contributor to g0v ("gov-zero"), a grassroots civic-tech community in Taiwan, where I worked on the 0archive project.

For prospective PhD applicants: I’m happy to chat about PhD applications and my PhD experiences. Feel free to email me.
Publications
Automate or Assist? The Role of Computational Models in Identifying Gendered Discourse in US Capital Trial Transcripts
Andrea W Wen-Yi, Kathryn Adamson, Nathalie Greenfield, Rachel Goldberg, Sandra Babcock, David Mimno, Allison Koenecke
AIES '24: AAAI/ACM Conference on AI, Ethics, and Society
Best Student Paper
Paper  Code  Slide
Hyperpolyglot LLMs: Cross-Lingual Interpretability in Token Embeddings
Andrea W Wen-Yi, David Mimno
EMNLP 2023
Paper  Code  Poster

The Evolution of Rumors on a Closed Social Networking Platform During COVID-19: Algorithm Development and Content Study
Andrea W Wang, Jo-Yu Lan, Ming-Hung Wang, Chihhao Yu
JMIR Med Inform 2021
doi:10.2196/30467
Paper
Working Papers
How Chinese are Chinese Language Models? The Puzzling Lack of Language Policy in China’s LLMs
Andrea W Wen-Yi*, Unso Eun Seo Jo*, Lu Jia Lin, David Mimno
Under Review
Paper
This website is developed by gyauney. Thanks Greg!