Andrea Wen-Yi Wang

I'm a Ph.D. candidate at Cornell Information Science, at the Culture and Computation Lab. I am advised by David Mimno.

I love studying how automating a judgment reveals the nature of that judgment itself. The most representative works are "Inside Baseball" and "Automate and Assist."

I carry out my studies using a mix of quantitative (comptutational text analysis, data science, finetuning LLMs) and qualitative methods (semi-structure interviews and field study).

Prior to joining Cornell, I was a data scientist at New York University's Public Safety Lab. I worked on the Jail Data Initiative with Orion Taylor and Anna Harvey. I was also a contributor in g0v ("gov-zero"), a grassroot civic-tech community in Taiwan, where I worked on the 0archive project. I held a Bachelor's degree in Finance from National Taiwan University, and a MS Data Science from NYU. I thank my teachers in both institutions for cultivating my appreciation of data and mathematics.

For potential PhD applicant: I’m happy to chat about PhD applications and my PhD experiences. Feel free to email me.

Publications

* denotes equal contribution.

Inside Baseball: The Automated Ball-Strike System as an Object Lesson in Technological Rule Enforcement

AWW, Waki Kamino, David Mimno, Karen Levy, Malte F. Jung

FAccT 2026

Paper

Do Chinese Models Speak Chinese Languages?

AWW*, Unso Eun Seo Jo*, David Mimno

FAccT 2026

Paper

What Is a Robot? Understanding Baseball's "Robot Umpire" through the Lens of Fluid Technology

Waki Kamino, AWW, Guy Hoffman, Selma Šabanović, Malte F. Jung

HRI '26: ACM/IEEE International Conference on Human-Robot Interaction

Honourable Mention

Paper

Automate or Assist? The Role of Computational Models in Identifying Gendered Discourse in US Capital Trial Transcripts

AWW, Kathryn Adamson, Nathalie Greenfield, Rachel Goldberg, Sandra Babcock, David Mimno, Allison Koenecke

AIES '24: AAAI/ACM Conference on AI, Ethics, and Society

Best Student Paper

Paper Code Slide

Hyperpolyglot LLMs: Cross-Lingual Interpretability in Token Embeddings

AWW, David Mimno

EMNLP'23: Conference on Empirical Methods in Natural Language Processing

Paper Code Poster

The Evolution of Rumors on a Closed Social Networking Platform During COVID-19: Algorithm Development and Content Study

AWW, Jo-Yu Lan, Ming-Hung Wang, Chihhao Yu

JMIR Med Inform 2021

doi:10.2196/30467

Paper

Workshop Papers and Extended Abstracts

Million Eyes on the "Robot Umps": The Case for Studying Sports in HRI Through Baseball

Waki Kamino*, AWW*, Dhruv Agarwal, Sil Hamilton, Eun Jeong Kang, Jieun Kim, Keigo Kusumegi, Pegah Moradi, Daniel Mwesigwa, Yan Tao, I-Ting Tsai, Ethan Yang, Shengqi Zhu, Shu-Jung Han, Chi-Jung Lee, Michael Joseph Sack, Tianhong Catherine Yu, Weslie Khoo, Andy Elliot Ricci, Yoyo Tsung-Yu Hou, Boyoung Kim, Selma šabanović, David J. Crandall, Karen Levy, Malte F. Jung

2025 ACM/IEEE International Conference on Human-Robot Interaction (HRI)

Paper

Resources for PhDs

Here are some resources that have greatly helped me grow as a PhD in Information Science.

Short Pieces on Reviewing by David Mimno
ACL Rolling Review Reviewer Guidelines
Heilmeier Catechism
Writing a Good Introduction by Jim Kurose
EDITOR’S COMMENTS: POSITIONING A THEORY PAPER FOR PUBLICATION by The Academy of Management Review
How to Get the Most Out of Attending a Conference by Gilly Leshed

This website is developed by gyauney. Thanks Greg!