Jae Yeon Kim

Jae Yeon Kim

Assistant Research Scholar

SNF Agora Institute and P3 Lab, Johns Hopkins University

I am an Assistant Research Scholar at the SNF Agora Institute at Johns Hopkins University. I study computational social science, racial and ethnic politics, political behavior, and public policy and organization. In 2022, I will join the KDI School of Public Policy and Management as an Assistant Professor of Data Science. I received my Ph.D. in political science from UC Berkeley, where I was a Senior Data Science Fellow at D-Lab.

My research agenda is interdisciplinary and lies at the intersection of social sciences and data science. My research centers around two themes: (1) advancing social science theory and methods on power, inequality, and resistance and (2) making social science research more efficient and scalable.

My research has been published or forthcoming in Perspectives on Politics, Political Research Quarterly, Studies in American Political Development, and among others. My research has also appeared in popular outlets such as the Washington Post’s Monkey Cage. I am the recipient of the Western Political Science Association’s 2020 Don T. Nakanishi Award for distinguished scholarship in Asian Pacific American Politics.

I have co-developed an R package that helps researchers and practitioners collect the most comprehensive data on nonprofit organizations in the U.S. I have taught computational social science at both graduate and undergraduate levels in semester-long courses and short workshops. I am currently working on an open textbook project titled “Computational Thinking for Social Scientists."

To get in touch with me, please send me an email at jkim638@jhu.edu.


  • Computational social science
  • Racial and ethnic politics
  • Political behavior
  • Public policy and organization


  • PhD in Political Science, 2021

    UC Berkeley

Peer-reviewed articles

(2020). Intersectional Bias in Hate Speech and Abusive Language Datasets. Proceedings of the Fourteenth International Conference on Web and Social Media (ICWSM), Data Challenge Workshop.

PDF Code Slides

Research briefs

Nathan Chan, Jae Yeon Kim, and Vivien Leung. Thanks to Trump’s Rhetoric, Asian Americans Are Moving Toward the Democratic Party. Washington Post’s Monkey Cage. March 30, 2021

Jae Yeon Kim. The Three Tales of Chinatown: Why Racism Is Not Enough to Create a Race-based Coalition among Marginalized Groups. UC Berkeley Canadian Studies Program. March 29, 2021

Other publications

Kim, Jae Yeon. Why Teaching Social Scientists How To Code Like A Professional Is Important. UC Berkeley D-Lab. September 23, 2020.

Haber, Jaren, Jae Yeon Kim, and Nick Camp. BAY-SICSS: Bridging Computational Social Scientists and Practitioners for Social Good. Berkeley Institute of Data Science. September 15, 2020.

Kim, Jae Yeon. Five Principles to Get Undergraduates Involved in Real-world Data Science Projects. SAGE Ocean. June 24, 2020.

Kim, Jae Yeon. How I Accidentally Became Interested in Data Science. UC Berkeley D-Lab. February 24, 2020.

Software & Data

Here is the list of software and data, which I have created for other researchers and practitioners.

Open-source Software

For research

  1. Stable
  • (a) MapAgora: R package for getting tax reports, websites, and social media handles related to nonprofit organizations in the United States (with Milan de Vries)
  • (b) tidytweetjson: R package for turning Tweet JSON files into a cleaned and wrangled dataset.
  • (c) tidyethnicnews: R package for turning search results from one of the largest databases on ethnic newspapers and magazines published in the United States into a cleaned and wrangled dataset.
  • (d) rnytapi: R interface for the New York Times API
  1. Still contains known or unknown bugs
  • (a) autotextclassifier: R package for automatically classifying texts based on tidymodels (with Milan de Vries)

For productivity

  • (a) tidybibtex: R package for tidying BibTeX files

For internal use (labs)

  • (a) p3themes: R package for applying p3 lab themes to ggplot2 objects (with Grace Park and Liz McKenna)

Public Data


Teaching Awards and Training

Graduate Seminars:

Undergraduate Lectures:

  • Graduate student instructor for Laura Stoker, Introduction to Empirical Analysis and Quantitative Methods, Department of Political Science, UC Berkeley, Fall 2016


Summer Institutes:


Please view my CV.


I believe that collaboration makes me a better researcher and research more fun.

I have co-founded three interdisciplinary working groups and co-organized the Summer Institute in Computational Social Science in the San Francisco Bay Area, hosted by UC Berkeley and Stanford, all while in graduate school.

I have conducted research with 22 amazing people across social sciences and engineering. The following is the list of their names, fields, and affiliations (ordered alphabetically). Please check out their websites to learn more about their research.


I was born and raised in South Korea, but I had also lived in Hong Kong and Taiwan by the time I finished college. I moved to the San Francisco Bay Area in 2014 as a graduate student in political science at UC Berkeley. Before my graduate studies, I worked in the tech industry in South Korea. I was a strategy manager at a software startup and served on the advisory board of Naver, “The Google of South Korea.”

I am a first-generation college student and grew up in a working-class family. My late father was a factory worker of 32 years at a paper mill, and my mother was a housekeeper. Yet, despite their humble educational backgrounds, they loved and valued learning. My supportive parents and numerous mentors put me on the path to be where I am today. To pay forward the generosity I have received, I am committed to increasing the pipeline of diverse talent in academia.

When I do not write or code, I enjoy cooking, distance running (both road and trail running) and drawing (people and landscape). I am also an avid reader and music enthusiast (blues, jazz, classical, and hip-hop).