Yekyung Kim

Hi! I am a third-year PhD student in the CLIP Lab at the University of Maryland, advised by Mohit Iyyer. My research is in natural language processing, at the intersection of evaluation and alignment in long-context scenarios. I started my PhD at UMass NLP and later transferred to UMD along with my advisor.

Before starting my PhD, I worked as a research engineer at Hyundai Motor Group and LG Electronics. I was selected as a specialist in AI and conducted research at CMU LTI as a visiting scientist mentored by Jaime Carbonell.

Email / Google Scholar / GitHub

Research

Evaluating faithfulness and factuality
in long-context settings (FABLES, ONERULER) and long-form generation (VERISCORE)
Post-training with synthetic datasets
for instruction following (BLEUBERI) and compositional reasoning (ongoing work)
Agents for long-horizon tasks (ongoing work on complex claim verification)

Publications

	BLEUBERI: BLEU is a surprisingly effective reward for instruction following. Yapei Chang, Yekyung Kim, Michael Krumdick, Amir Zadeh, Chuan Li, Chris Tanner, Mohit Iyyer. NeurIPS 2025. Code
	One ruler to measure them all: Benchmarking multilingual long-context language models. Yekyung Kim, Jenna Russell, Marzena Karpinska, Mohit Iyyer. COLM 2025. Code
	VERISCORE: Evaluating the Factuality of Verifiable Claims in Long-form Text Generation. Yixiao Song, Yekyung Kim, Mohit Iyyer. EMNLP Findings 2024. Code
	FABLES: Evaluating Faithfulness and Content Selection in Book-length Summarization. Yekyung Kim, Yapei Chang, Marzena Karpinska, Aparna Garimella, Varun Manjunatha, Kyle Lo, Tanya Goyal, Mohit Iyyer. COLM 2024. Dataset + Code
	Is it safe to cross? Interpretable Risk Assessment with GPT-4V for Safety-Aware Street Crossing. Hochul Hwang, Sunjae Kwon, Yekyung Kim, Donghyun Kim. 21st International Conference on Ubiquitous Robots.
	LINDA: Unsupervised Learning to Interpolate in Natural Language Processing. Yekyung Kim, Seohyeong Jeong, Kyunghyun Cho. arXiv.
	A Universal Framework for Dataset Characterization with Multidimensional Meta-information. Jaehyung Kim, Yekyung Kim, Karin Johanna Denton de Langis, Jinwoo Shin, Dongyeop Kang. ACL 2023. Code
	Meta-Crafting: Improved Detection of Out-of-distributed Texts via Crafting Metadata Space. Ryan Koo, Yekyung Kim, Dongyeop Kang, Jaehyung Kim. AAAI 2024 Student Abstract and Poster Program.
	Deep Active Learning for Sequence Labeling Based on Diversity and Uncertainty in Gradient. Yekyung Kim. Workshop on Life-long Learning for Spoken Language Systems at AACL, 2021.
	Learning Sub-Character level representation for Korean Named Entity Recognition. Yejin Kim, Yekyung Kim (equal contributions). The International FLAIRS Conference Proceedings, 2020.
	#Nowplaying the Future Billboard: Mining Music Listening Behaviors of Twitter Users for Hit Song Prediction. Yekyung Kim, Bongwon Suh, Kyogu Lee. Workshop on Social Media Retrieval and Analysis (SoMeRA) at SIGIR, 2014.
	A Visual Analytics Approach to Summarizing Tweets. Ramik Sadana, Yekyung Kim, Bongwon Suh, Eunyee Koh. Industry day at SIGIR, 2014.

Industry Projects

	Airstar - Incheon Airport Robot, LG Electronics
	AI assistant for cars, Hyundai
	Chatbot for home appliances, LG Electronics