
Hi! I am a postdoc at the Data Science Institute at the University of Chicago, working with Moon Duchin at the Data and Democracy Lab. I recently received my PhD from the Statistics Department at Columbia University, where I was advised by Michael Sobel. My first name is pronounced [Hayn], and my pronouns are they/them.
My research focuses on the intersection of statistics and political science, motivated by substantive questions from American politics–especially in public opinion polarization and redistricting. To operationalize and measure political phenomena, I apply diverse statistical methods: optimal transport, latent factor models, network theory, optimization, causal inference, and more. The topics I have been working on include developing new measures of ideological and affective polarization using the Wasserstein distance and measuring congressional social ties from roll call votes. I am also interested in studying race, ethnicity, and politics and statistical methods related to race from a critical theory perspective.
Before Columbia, I was at MIT Media Lab’s Opera of the Future group for an MS where I designed interactive AR musical experiences and helped produce hybrid acoustic+digital musical performances. I also received a BS from MIT in Electrical Engineering with a minor in Music.
Research
Forthcoming in the Annals of Applied Statistics. Hane Lee and Michael Sobel (2026). “Measuring Public Opinion: “The Wasserstein Bipolarization Index”, with Application to Cross-National Attitudes Toward Mandatory Vaccination for COVID-19.”
Abstract
Although the topic of opinion polarization receives much attention from the media, public opinion researchers and political scientists, the phenomenon itself has not been adequately characterized in either the lay or academic literature. To study opinion polarization among the public, researchers compare the distributions of respondents to survey questions or track the distribution of responses to a question over time using ad-hoc methods and measures such as visual comparisons, variances, and bimodality coefficients. To remedy this situation, we build on the axiomatic approach in the economics literature on income bipolarization, specifying key properties a measure of bipolarization should satisfy: in particular, it should increase as the distribution spreads away from a center toward the poles and/or as clustering below or above this center increases. We then show that measures of bipolarization used in public opinion research fail to satisfy one or more of these axioms. Next, we propose a p-Wasserstein polarization index that satisfies the axioms we set forth. Our index measures the dissimilarity between an observed distribution and a distribution with all the mass clustered on the lower and upper endpoints of the scale. We use our index to examine bipolarization in attitudes toward governmental COVID-19 vaccine mandates across 11 countries, finding the U.S and U.K are most polarized, China, France and India the least polarized, while the others (Brazil, Australia, Columbia, Canada, Italy, Spain) occupy an intermediate position.
Working paper. Hane Lee, Andrew Davison, and Zhiliang Ying. “Measuring Social Ties from Roll Call Votes: A Fused Latent Factor and Social Network Approach”.
Abstract
Congressional social ties influence legislative processes and outcomes, but measuring these ties presents a significant challenge. Recent research has adopted social network models to assess congressional social, but existing applications are limited in that they rely on indirect measurements using proxy relations, such as cosponsorship, and fail to account for party influence or ideological preferences of legislators, which may be more decisive factors. In this paper, we aim to directly measure social ties from roll call votes while taking account of partisan-ideological preferences. We combine the partisan-ideological and social approaches to roll call analysis through a fused latent factor and social network model. This model decomposes the variation in votes explained by the partisan-ideological factors from that of the social network, while prioritizing the former. Applying the model to the Senate, we find that the fitted latent factors capture known partisan and ideological patterns from previous literature, while social networks reflect notable friendships and geographical proximity.
Working paper. Yuki Atsusaka, Diana Da In Lee, and Hane Lee. “Quantifying Group-Based Electoral Competitiveness”.
Abstract
How can we measure electoral competitiveness among multiple candidates with varying group affiliations? We propose the concept of group-level electoral competitiveness, which generalizes the conventional margin of victory to any number of candidates and group affiliations while avoiding strong assumptions about perfect coordination among in-group candidates. We demonstrate its applicability in three domains of American politics: racial competitiveness in congressional elections, partisan competitiveness in local elections, and competitiveness across candidates' occupational backgrounds. Our findings reveal patterns of group advantage and disadvantage that are obscured by traditional measures. By extending electoral margins to group-based competition, this paper provides a unified and flexible methodological framework for evaluating long-standing questions in the study of electoral democracy. The proposed measure is implemented via an open-source software R package gmv.
Working paper. Hane Lee. “The Polarizing Middle Ground: Independents in Public Opinion Polarization”.
Abstract
Coming soon!
Chris Andrade, Jonathan Auerbach, Icaro Bacelar, Hane Lee, Angela Tan, Mariana Vazquez, and Owen Ward (2023). “Does it pay to park in front of a fire hydrant?”. Significance 20(1), pp. 28–30.
Teaching
Instructor (at Columbia University)
- Calculus-based Introduction to Statistics (Summer 2024) [Syllabus]
Teaching Assistant (at Columbia University)
Graduate
- Probability Theory (Fall 2021, Spring 2021, Fall 2020)
- Statistical Inference (Fall 2023)
- Accelerated Probability Theory/Statistical Inference (Fall 2022, Spring 2023, Fall 2024)
- Statistical Machine Learning (Spring 2022)
- Linear Regression Models (Spring 2024)
- Bayesian Statistics (Summer 2022)
Undergraduate
- Introduction to Statistics (Spring 2020)
- Introduction to Statistical Reasoning (Fall 2019)
Tutor (at MIT HKN and Math Learning Center)
Undergraduate
- Signals and Systems (6.003)
- Probability and Random Variables (18.600)
- Linear Algebra (18.06)
- Differential Equations (18.03)