Danny To Eun Kim
PhD student @CMU LTI
NLP | IR | Evaluation | RAG
https://kimdanny.github.io
- Reposted by Danny To Eun Kim: 🎭 How do LLMs (mis)represent culture? 🧮 How often? 🧠 Misrepresentations = missing knowledge? spoiler: NO! At #CHI2026 we are bringing ✨TALES✨ a participatory evaluation of cultural (mis)reps & knowledge in multilingual LLM-stories for India 📜 arxiv.org/abs/2511.21322 1/10
- #ChatGPT has begun putting ads in its responses. Check out our paper on “how fair ranking can positively impact the LLM response and content/ad exposure”. dl.acm.org/doi/10.1145/...
- as AI increasingly supports shopping and ads, it’s worth remembering that retrieval often shapes who gets exposure in final generated output. in a recent paper, @teknology.bsky.social uses methods from fair ranking to assess and address exposure bias in downstream generation. 841.io/doc/fairrag....
- #ChatGPT has begun putting ads in its responses. Check out our paper on “Ads detection and integration in the era of LLMs”. ceur-ws.org/Vol-4038/pap...
- Reposted by Danny To Eun Kim: Some exciting news! 🤗 After 3 amazing years at TREC, the Tip-of-the-Tongue (ToT) shared task will be a core task at NTCIR-19 in 2026. The new track will focus on tip-of-the-tongue information needs in English and East Asian languages. More details coming soon. See you all in Tokyo next year!

- Reposted by Danny To Eun Kim: Gentle reminder 📢 All run submissions for the Tip-of-the-Tongue (ToT) Track are due next week Wednesday (Aug 27). More info: trec-tot.github.io/guidelines #TREC2025 #TRECToT #TREC2025ToT
- Important announcement: All run submissions for TREC'25 Tip-of-the-Tongue (TREC-ToT) Track are due by **August 27th**. The run submission form is now open. Please submit your runs before the deadline. More information: trec-tot.github.io/guidelines #TREC2025 #TRECToT #TREC2025ToT Spread the word!
- This year's TREC Tip-of-the-Tongue (ToT) track will be amazing! Based on our rigorous experiments on synthetic ToT query generation presented at #SIGIR2025, we extended the track to open-domain ToT queries. We provide code for baseline systems, and submissions are due by August 27th!
- Reposted by Danny To Eun Kim: To Eun Kim just presented the work on "Tip of the Tongue Query Elicitation for Simulated Evaluation" at #SIGIR2025. The approach will be used in the #TREC2025 Tip-of-the-Tongue track, and we had some sweets at the poster :) The paper is available online: dl.acm.org/doi/10.1145/...
- Reposted by Danny To Eun Kim: Hello TREC-ToTers! We have released the test queries for the TREC 2025 Tip-of-the-Tongue (TREC-ToT) Track. Please see the guidelines for more information: trec-tot.github.io/guidelines. Run submission deadline will tentatively be in August. #TREC2025 #TRECToT #TREC2025ToT Please spread the word!
- Hello TREC-ToTers! 👋🏽 Excited to announce the release of TREC 2025 Tip-of-the-Tongue (TREC-ToT) Track guidelines: trec-tot.github.io/guidelines. We will release test queries in July and run submission deadline will be in August. #TREC2025 #TRECToT #TREC2025ToT Please register to participate:
- ❓How do LLMs respond to fair ranking in RAG? 🤩 See how fair ranking boosts downstream utility while promoting fairer attribution of cited sources. Catch our oral presentation at #ICTIR2025! #SIGIR2025 @841io.bsky.social
- Heading to #NeurIPS2024 to present our ‘Fair RAG’ paper at the #AFME2024 workshop! Let's talk about RAG, Information Retrieval, and Fairness. Honored that our paper was selected as one of the Top 5 Spotlight Papers! 🎉 Let’s connect and chat! Paper: arxiv.org/abs/2409.11598
- Reposted by Danny To Eun Kim: Do not forget to participate in the #TREC2025 Tip-of-the-Tongue (ToT) Track :) The corpus and baselines (with run files) are now available and easily accessible via the ir_datasets API and the HuggingFace Datasets API. More details are available at: trec-tot.github.io/guidelines
- Reposted by Danny To Eun Kim: 🖋️ Curious how writing differs across (research) cultures? 🚩 Tired of “cultural” evals that don't consult people? We engaged with interdisciplinary researchers to identify & measure ✨cultural norms✨ in scientific writing, and show that ❗LLMs flatten them❗ 📜 arxiv.org/abs/2506.00784 [1/11]
- Reposted by Danny To Eun Kim: Ever trusted a metric that works great on average, only for it to fail in your specific use case? In our #NAACL2025 paper (w/ @841io.bsky.social), we show why global evaluations are not enough and why context matters more than you think. 📄 aclanthology.org/2025.finding... #NLP #Evaluation (🧵1/9)
- Reposted by Danny To Eun Kim: If you're interested in OpenAI including shopping results, you might also be interested in @teknology.bsky.social's paper relating retrieval diversity/fairness and generation by downstream RAG models. This has implications for individuals selling products online. arxiv.org/abs/2409.11598
- Reposted by Danny To Eun Kim: If you're working on a recall-oriented task or with ranking systems evaluated across varied users, content, or intents, check it out. 5/5 dl.acm.org/doi/10.1145/...
- Reposted by Danny To Eun Kim: 📢 New Paper: "Recall, Robustness, and Lexicographic Evaluation" (ACM TORS) F Diaz, M Ekstrand (@md.ekstrandom.net), B Mitra (@bmitra.bsky.social) For IR, NLP, and ML researchers working on ranking systems evaluated for recall and robustness. 🧵 1/5 dl.acm.org/doi/10.1145/...
- 🚨New Breakthrough in Tip-of-the-Tongue (TOT) Retrieval Research! We address data limitations and offer a fresh evaluation method for these complex queries. Curious how TREC TOT track test queries are created? Check out this thread 🧵 and our paper 📄: arxiv.org/abs/2502.17776
- 👅Tip-of-the-Tongue (TOT) search is a complex form of known-item search, shaped by the expression of partial recall, personal context, and uncertain memories. However, TOT research has long been hindered by the scarcity of high-quality TOT queries.
- 🤔Why the Problem? TOT query data collection relies heavily on community question answering websites (e.g., Reddit). This causes data availability issues and domain bias (most TOT queries end up being about movies or books).
- Here's an overview of TREC 2024 TOT track runs with the test queries: trec.nist.gov/pubs/trec33/...
- Reposted by Danny To Eun Kim: Did you know? Gestures used to express universal concepts, like wishing for luck, vary DRAMATICALLY across cultures? 🤞 means luck in US but deeply offensive in Vietnam 🚨 📣 We introduce MC-SIGNS, a test bed to evaluate how LLMs/VLMs/T2I handle such nonverbal behavior! 📜: arxiv.org/abs/2502.17710
- Those who are attending #SIGIRAP2024, come by and learn how retrieval can enhance ML models!
- Today we'll be presenting the Tutorial on Retrieval-Enhanced Machine Learning (REML). Come by to learn about the emerging design patterns in this space and see how to use retrieval beyond RAG. In collaboration w/ the amazing @841io.bsky.social @teknology.bsky.social Alireza Salemi and Hamed Zamani.
- Link to the website: retrieval-enhanced-ml.github.io/sigir-ap2024...
- Reposted by Danny To Eun Kim: Slides are up! I presented on "Presentation & Consumption in the context of REML". The full deck is here. There's a lot of gems if you're interested in this space! retrieval-enhanced-ml.github.io/sigir-ap2024...
- Reposted by Danny To Eun Kim: Creating a 🦋 starter pack for people working in IR/RAG: go.bsky.app/88ULgwY I can’t seem to find everyone though, help definitely appreciated to fill this out (DM or comment)!
- Reposted by Danny To Eun Kim: Mat is not on 🦋, so posting on his behalf! It's time to revisit common assumptions in IR! Embeddings have improved drastically, but mainstream IR evals have stagnated since MSMARCO + BEIR. We ask: on private or tricky IR tasks, are rerankers better? Surely, reranking many docs is best?
- Reposted by Danny To Eun Kim: Time for a starter pack on information retrieval: go.bsky.app/MXPJoTn
- Reposted by Danny To Eun Kim: Hey all! I started a second starter pack with people who didn't make the first one, please let me know if you'd like to be added: go.bsky.app/JgneRQk
- Reposted by Danny To Eun Kim: I'm keeping track of people at the CMU Language Technologies Institute here: go.bsky.app/NhTwCVb. Follow along!