Yekyung Kim

Hi! I am a third-year PhD student at the University of Maryland, CLIP Lab, advised by Mohit Iyyer, with a research focus on natural language processing. My work lies at the intersection of evaluation and alignment in long-context scenarios. I initially started my PhD at UMass NLP and later transferred to UMD along with my advisor.

Before starting my PhD, I worked at Hyundai Motors Group and LG Electronics as a research engineer. I was selected as a specialist in AI and conducted research at CMU LTI as a visiting scientist mentored by Jaime Carbonell.

Email  /  Google Scholar  /  Github

profile photo
Research
  • Evaluating faithfulness and factuality
    on long-context (FABLES, ONERULER) and long-form generation (VERISCORE)
  • Post-training with synthetic dataset
    for instruction following (BLEUBERI) and compositional reasoning (ongoing work)
  • Agent for long-horizon task (ongoing work on complex claim verification)
Publications
BLEUBERI thumbnail BLEUBERI: BLEU is a surprisingly effective reward for instruction following
Yapei Chang, Yekyung Kim, Michael Krumdick, Amir Zadeh, Chuan Li, Chris Tanner, Mohit Iyyer
NeurIPS 2025
Code
OneRULER image One ruler to measure them all: Benchmarking multilingual long-context language models
Yekyung Kim, Jenna Russell, Marzena Karpinska, Mohit Iyyer
COLM 2025
Code
VERISCORE image VERISCORE: Evaluating the Factuality of Verifiable Claims in Long-form Text Generation
Yixiao Song, Yekyung Kim, Mohit Iyyer
EMNLP Findings 2024
Code
FABLES icon FABLES: Evaluating Faithfulness and Content Selection in Book-length Summarization
Yekyung Kim, Yapei Chang, Marzena Karpinska, Aparna Garimella, Varun Manjunatha, Kyle Lo, Tanya Goyal, Mohit Iyyer
COLM 2024
Dataset + Code
Street crossing thumbnail Is it safe to cross? Interpretable Risk Assessment with GPT-4V for Safety-Aware Street Crossing
Hochul Hwang, Sunjae Kwon, Yekyung Kim, Donghyun Kim
21st International Conference on Ubiquitous Robots
LINDA image LINDA: Unsupervised Learning to Interpolate in Natural Language Processing
Yekyung Kim, Seohyeong Jeong, Kyunghyun Cho
arXiv
InfoVerse image A Universal Framework for Dataset Characterization with Multidimensional Meta-information
Jaehyung Kim, Yekyung Kim, Karin Johanna Denton de Langis, Jinwoo Shin, Dongyeop Kang
ACL 2023
Code
Meta-Crafting image Meta-Crafting: Improved Detection of Out-of-distributed Texts via Crafting Metadata Space
Ryan Koo, Yekyung Kim, Dongyeop Kang, Jaehyung Kim
AAAI 2024 Student Abstract and Poster Program
Active learning image Deep Active Learning for Sequence Labeling Based on Diversity and Uncertainty in Gradient
Yekyung Kim
Workshop on Life-long Learning for Spoken Language Systems at AACL, 2021
Korean NER image Learning Sub-Character level representation for Korean Named Entity Recognition
Yejin Kim, Yekyung Kim (equal contributions)
The International FLAIRS Conference Proceedings, 2020
Music Twitter image #Nowplaying the Future Billboard: Mining Music Listening Behaviors of Twitter Users for Hit Song Prediction
Yekyung Kim, Bongwon Suh, Kyogu Lee
Workshop on Social Media Retrieval and Analysis (SoMeRA) at SIGIR, 2014
Adobe Twitter image A Visual Analytics Approach to Summarizing Tweets
Ramik Sadana, Yekyung Kim, Bongwon Suh, Eunyee Koh
Industry day at SIGIR, 2014
Industry Project
Incheon Airport robot Airstar - Incheon Airport Robot, LG Electronics
Hyundai chatbot AI assistant for car, Hyundai
LG ThinQ chatbot Chatbot for home-appliances, LG Electronics