See full post

Danny To Eun Kim

teknology.bsky.social

Followers · Following

PhD student @CMU LTI NLP | IR | Evaluation | RAG https://kimdanny.github.io

Joined November 2024

Posts Replies Media Original posts Likes

Reposted by Danny To Eun Kim
Shaily shaily99.bsky.social · Feb 2
🎭 How do LLMs (mis)represent culture? 🧮 How often? 🧠 Misrepresentations = missing knowledge? spoiler: NO! At #CHI2026 we are bringing ✨TALES✨ a participatory evaluation of cultural (mis)reps & knowledge in multilingual LLM-stories for India 📜 arxiv.org/abs/2511.21322 1/10

View on Bluesky Download image Show all post labels

Danny To Eun Kim teknology.bsky.social · Jan 17
#ChatGPT began to put ads in their response. Check our paper on “how fair ranking can positively impact the LLM response and content/ad exposure”. dl.acm.org/doi/10.1145/...
Towards Fair RAG: On the Impact of Fair Ranking in Retrieval-Augmented Generation | Proceedings of the 2025 International ACM SIGIR Conference on Innovative Concepts and Theories in Information Retrie...

dl.acm.org
- Fernando Diaz handle.invalid · Dec 31, 2025
  as AI increasingly supports shopping and ads, it’s worth remembering that retrieval often shapes who gets exposure in final generated output. in a recent paper, @teknology.bsky.social uses methods from fair ranking to assess and address exposure bias in downstream generation. 841.io/doc/fairrag....
View on Bluesky Show all post labels

Danny To Eun Kim teknology.bsky.social · Jan 17
#chatGPT began to put ads in their response. Check out our paper on “Ads detection and integration in the era of LLMs”. ceur-ws.org/Vol-4038/pap...
https://ceur-ws.org/Vol-4038/paper_385.pdf

ceur-ws.org
- Danny To Eun Kim teknology.bsky.social · Sep 9, 2025
  Excited to present at #CLEF2025 #Touché Lab (Session 2) shared task "Advertisement in RAG"🇪🇸! @webis.de 🗓️Sept 9 (Tue) ⏲️5:20PM (CEST) / 11:20AM (EST) 📍Florentino Sanz Room 🧠https://arxiv.org/abs/2507.00509 Join us for insights on #RAG + advertising!
View on Bluesky Show all post labels

Reposted by Danny To Eun Kim
Fernando Diaz handle.invalid · Dec 31, 2025
as AI increasingly supports shopping and ads, it’s worth remembering that retrieval often shapes who gets exposure in final generated output. in a recent paper, @teknology.bsky.social uses methods from fair ranking to assess and address exposure bias in downstream generation. 841.io/doc/fairrag....

View on Bluesky Show all post labels

Danny To Eun Kim teknology.bsky.social · Sep 9, 2025
Excited to present at #CLEF2025 #Touché Lab (Session 2) shared task "Advertisement in RAG"🇪🇸! @webis.de 🗓️Sept 9 (Tue) ⏲️5:20PM (CEST) / 11:20AM (EST) 📍Florentino Sanz Room 🧠https://arxiv.org/abs/2507.00509 Join us for insights on #RAG + advertising!

View on Bluesky Download image Show all post labels

Reposted by Danny To Eun Kim
Bhaskar Mitra | ভাস্কর মিত্র handle.invalid · Sep 1, 2025
Some exciting news! 🤗 After 3 amazing years at TREC, the Tip-of-the-Tongue (ToT) shared task will be a core task at NTCIR-19 in 2026. The new track will focus on tip-of-the-tongue information needs in English and East Asian languages. More details coming soon. See you all in Tokyo next year!

View on Bluesky Show all post labels

Reposted by Danny To Eun Kim
Bhaskar Mitra | ভাস্কর মিত্র handle.invalid · Aug 19, 2025
Gentle reminder 📢 All run submissions for the Tip-of-the-Tongue (ToT) Track are due next week Wednesday (Aug 27). More info: trec-tot.github.io/guidelines #TREC2025 #TRECToT #TREC2025ToT
TREC 2025 Tip-of-the-Tongue (ToT) Track

Tip of the tongue: The phenomenon of failing to retrieve something from memory, combined with partial recall and the feeling that retrieval is imminent.

trec-tot.github.io
- Bhaskar Mitra | ভাস্কর মিত্র handle.invalid · Aug 4, 2025
  Important announcement: All run submissions for TREC'25 Tip-of-the-Tongue (TREC-ToT) Track are due by **August 27th**. The run submission form is now open. Please submit your runs before the deadline. More information: trec-tot.github.io/guidelines #TREC2025 #TRECToT #TREC2025ToT Spread the word!
  TREC 2025 Tip-of-the-Tongue (ToT) Track
  
  Tip of the tongue: The phenomenon of failing to retrieve something from memory, combined with partial recall and the feeling that retrieval is imminent.
  
  trec-tot.github.io
View on Bluesky Show all post labels

Danny To Eun Kim teknology.bsky.social · Aug 4, 2025
This year's TREC Tip of the Tongue (ToT) track will be amazing! Based on our rigorous experiments on synthetic ToT query generation presented at #SIGIR2025, we extended the track to open domain ToT queries. We provide codes for baseline systems, and submissions are due by August 27th!
- Bhaskar Mitra | ভাস্কর মিত্র handle.invalid · Aug 4, 2025
  Important announcement: All run submissions for TREC'25 Tip-of-the-Tongue (TREC-ToT) Track are due by **August 27th**. The run submission form is now open. Please submit your runs before the deadline. More information: trec-tot.github.io/guidelines #TREC2025 #TRECToT #TREC2025ToT Spread the word!
  TREC 2025 Tip-of-the-Tongue (ToT) Track
  
  Tip of the tongue: The phenomenon of failing to retrieve something from memory, combined with partial recall and the feeling that retrieval is imminent.
  
  trec-tot.github.io
View on Bluesky Show all post labels

Reposted by Danny To Eun Kim
Maik Fröbe handle.invalid · Jul 15, 2025
To Eun Kim just presented the work on "Tip of the Tongue Query Elicitation for Simulated Evaluation" at #SIGIR2025. The approach will be used in the #TREC2025 Tip-of-the-Tongue track, and we had some sweets at the poster :) The paper is available online: dl.acm.org/doi/10.1145/...

View on Bluesky Download image (1)Download image (2)Download image (3)Show all post labels

Reposted by Danny To Eun Kim
Bhaskar Mitra | ভাস্কর মিত্র handle.invalid · Jul 13, 2025
Hello TREC-ToTers! We have released the test queries for the TREC 2025 Tip-of-the-Tongue (TREC-ToT) Track. Please see the guidelines for more information: trec-tot.github.io/guidelines. Run submission deadline will tentatively be in August. #TREC2025 #TRECToT #TREC2025ToT Please spread the word!
TREC 2025 Tip-of-the-Tongue (ToT) Track

Tip of the tongue: The phenomenon of failing to retrieve something from memory, combined with partial recall and the feeling that retrieval is imminent.

trec-tot.github.io
- Bhaskar Mitra | ভাস্কর মিত্র handle.invalid · May 9, 2025
  Hello TREC-ToTers! 👋🏽 Excited to announce the release of TREC 2025 Tip-of-the-Tongue (TREC-ToT) Track guidelines: trec-tot.github.io/guidelines. We will release test queries in July and run submission deadline will be in August. #TREC2025 #TRECToT #TREC2025ToT Please register to participate:
  TREC 2025 Tip-of-the-Tongue (ToT) Track
  
  Tip of the tongue: The phenomenon of failing to retrieve something from memory, combined with partial recall and the feeling that retrieval is imminent.
  
  trec-tot.github.io
View on Bluesky Show all post labels

Danny To Eun Kim teknology.bsky.social · Jul 12, 2025
❓How do LLMs respond to fair ranking in RAG? 🤩 See how fair ranking boosts downstream utility while promoting fairer attribution of cited sources. Catch our oral presentation at #ICTIR2025! #SIGIR2025 @841io.bsky.social
- Danny To Eun Kim teknology.bsky.social · Dec 9, 2024
  Heading to #NeurIPS2024 to present our ‘Fair RAG’ paper at the #AFME2024 workshop! Let's talk about RAG, Information Retrieval, and Fairness. Honored that our paper was selected as one of the Top 5 Spotlight Papers! 🎉 Let’s connect and chat! Paper: arxiv.org/abs/2409.11598
  Towards Fair RAG: On the Impact of Fair Ranking in Retrieval-Augmented Generation
  
  Many language models now enhance their responses with retrieval capabilities, leading to the widespread adoption of retrieval-augmented generation (RAG) systems. However, despite retrieval being a cor...
  
  arxiv.org
View on Bluesky Show all post labels

Reposted by Danny To Eun Kim
Maik Fröbe handle.invalid · Jun 27, 2025
Do not forget to participate in the #TREC2025 Tip-of-the-Tongue (ToT) Track :) The corpus and baselines (with run files) are now available and easily accessible via the ir_datasets API and the HuggingFace Datasets API. More details are available at: trec-tot.github.io/guidelines

View on Bluesky Download image Show all post labels

Reposted by Danny To Eun Kim
Shaily shaily99.bsky.social · Jun 9, 2025
🖋️ Curious how writing differs across (research) cultures? 🚩 Tired of “cultural” evals that don't consult people? We engaged with interdisciplinary researchers to identify & measure ✨cultural norms✨in scientific writing, and show that❗LLMs flatten them❗ 📜 arxiv.org/abs/2506.00784 [1/11]

View on Bluesky Download image Show all post labels

Reposted by Danny To Eun Kim
Bhaskar Mitra | ভাস্কর মিত্র handle.invalid · May 9, 2025
Hello TREC-ToTers! 👋🏽 Excited to announce the release of TREC 2025 Tip-of-the-Tongue (TREC-ToT) Track guidelines: trec-tot.github.io/guidelines. We will release test queries in July and run submission deadline will be in August. #TREC2025 #TRECToT #TREC2025ToT Please register to participate:
TREC 2025 Tip-of-the-Tongue (ToT) Track

Tip of the tongue: The phenomenon of failing to retrieve something from memory, combined with partial recall and the feeling that retrieval is imminent.

trec-tot.github.io

View on Bluesky Show all post labels

Reposted by Danny To Eun Kim
Athiya Deviyani athiya.bsky.social · Apr 29, 2025
Ever trusted a metric that works great on average, only for it to fail in your specific use case? In our #NAACL2025 paper (w/ @841io.bsky.social), we show why global evaluations are not enough and why context matters more than you think. 📄 aclanthology.org/2025.finding... #NLP #Evaluation (🧵1/9)

View on Bluesky Download image Show all post labels

Reposted by Danny To Eun Kim
Fernando Diaz handle.invalid · Apr 28, 2025
If you're interested in OpenAI including shopping results, you might also be interested in @teknology.bsky.social's paper relating retrieval diversity/fairness and generation by downstream RAG models. This has implications for individuals selling products online. arxiv.org/abs/2409.11598
Towards Fair RAG: On the Impact of Fair Ranking in Retrieval-Augmented Generation

Modern language models frequently include retrieval components to improve their outputs, giving rise to a growing number of retrieval-augmented generation (RAG) systems. Yet, most existing work in RAG...

arxiv.org

View on Bluesky Show all post labels

Reposted by Danny To Eun Kim
Fernando Diaz handle.invalid · Apr 7, 2025
Replying to Fernando Diaz
If you're working on a recall-oriented task or with ranking systems evaluated across varied users, content, or intents, check it out. 5/5 dl.acm.org/doi/10.1145/...

View on Bluesky Show all post labels

Reposted by Danny To Eun Kim
Fernando Diaz handle.invalid · Apr 7, 2025
📢 New Paper: "Recall, Robustness, and Lexicographic Evaluation" (ACM TORS) F Diaz, M Ekstrand (@md.ekstrandom.net), B Mitra (@bmitra.bsky.social) For IR, NLP, and ML researchers working on ranking systems evaluated for recall and robustness. 🧵 1/5 dl.acm.org/doi/10.1145/...

View on Bluesky Download image Show all post labels

Danny To Eun Kim teknology.bsky.social · Mar 5, 2025
🚨New Breakthrough in Tip-of-the-Tongue (TOT) Retrieval Research! We address data limitations and offer a fresh evaluation method for these complex queries. Curious how TREC TOT track test queries are created? Check out this thread 🧵 and our paper 📄: arxiv.org/abs/2502.17776
Tip of the Tongue Query Elicitation for Simulated Evaluation

Tip-of-the-tongue (TOT) search occurs when a user struggles to recall a specific identifier, such as a document title. While common, existing search systems often fail to effectively support TOT scena...

arxiv.org

View on Bluesky Show all post labels
Danny To Eun Kim teknology.bsky.social · Mar 5, 2025
👅Tip-of-the-Tongue (TOT) search is a complex form of known-item search, shaped by the expression of partial recall, personal context, and uncertain memories. However, TOT research has long been hindered by the scarcity of high-quality TOT queries.

View on Bluesky Show all post labels
Danny To Eun Kim teknology.bsky.social · Mar 5, 2025
🤔Why the Problem? TOT query data collection relies heavily on community question answering websites (e.g., Reddit). This causes data availability issues and domain bias (most TOT queries end up being about movies or books).

View on Bluesky Show all post labels
View full thread
Danny To Eun Kim teknology.bsky.social · Mar 7, 2025
Here's an overview of TREC 2024 TOT track runs with the test queries: trec.nist.gov/pubs/trec33/...
https://trec.nist.gov/pubs/trec33/papers/Overview_tot.pdf

trec.nist.gov

View on Bluesky Show all post labels

Reposted by Danny To Eun Kim
Akhila Yerukola akhilayerukola.bsky.social · Feb 26, 2025
Did you know? Gestures used to express universal concepts—like wishing for luck—vary DRAMATICALLY across cultures? 🤞means luck in US but deeply offensive in Vietnam 🚨 📣 We introduce MC-SIGNS, a test bed to evaluate how LLMs/VLMs/T2I handle such nonverbal behavior! 📜: arxiv.org/abs/2502.17710

View on Bluesky Download image Show all post labels

Danny To Eun Kim teknology.bsky.social · Dec 9, 2024
Heading to #NeurIPS2024 to present our ‘Fair RAG’ paper at the #AFME2024 workshop! Let's talk about RAG, Information Retrieval, and Fairness. Honored that our paper was selected as one of the Top 5 Spotlight Papers! 🎉 Let’s connect and chat! Paper: arxiv.org/abs/2409.11598
Towards Fair RAG: On the Impact of Fair Ranking in Retrieval-Augmented Generation

Many language models now enhance their responses with retrieval capabilities, leading to the widespread adoption of retrieval-augmented generation (RAG) systems. However, despite retrieval being a cor...

arxiv.org

View on Bluesky Show all post labels

Danny To Eun Kim teknology.bsky.social · Dec 9, 2024
Those who are attending #SIGIRAP2024, come by and learn how retrieval can enhance ML models!
- Andrew Drozdov mrdrozdov.com · Dec 9, 2024
  Today we'll be presenting the Tutorial on Retrieval-Enhanced Machine Learning (REML). Come by to learn about the emerging design patterns in this space and see how to use retrieval beyond RAG. In collaboration w/ the amazing @841io.bsky.social @teknology.bsky.social Alireza Salemi and Hamed Zamani.
View on Bluesky Show all post labels
Danny To Eun Kim teknology.bsky.social · Dec 9, 2024
Link to the website: retrieval-enhanced-ml.github.io/sigir-ap2024...
SIGIR-AP 2024 Tutorial: Retrieval-Enhanced Machine Learning: Synthesis and Opportunities

SIGIR-AP 2024 Tutorial: Retrieval-Enhanced Machine Learning: Synthesis and Opportunities

retrieval-enhanced-ml.github.io

View on Bluesky Show all post labels

Reposted by Danny To Eun Kim
Andrew Drozdov mrdrozdov.com · Dec 9, 2024
Slides are up! I presented on "Presentation & Consumption in the context of REML" The full deck is here. There's a lot of gems if you're interested in this space! retrieval-enhanced-ml.github.io/sigir-ap2024...
- Andrew Drozdov mrdrozdov.com · Dec 9, 2024
  Today we'll be presenting the Tutorial on Retrieval-Enhanced Machine Learning (REML). Come by to learn about the emerging design patterns in this space and see how to use retrieval beyond RAG. In collaboration w/ the amazing @841io.bsky.social @teknology.bsky.social Alireza Salemi and Hamed Zamani.
View on Bluesky Download image Show all post labels

Reposted by Danny To Eun Kim
Orion Weller orionweller.bsky.social · Nov 23, 2024
Creating a 🦋 starter pack for people working in IR/RAG: go.bsky.app/88ULgwY I can’t seem to find everyone though, help definitely appreciated to fill this out (DM or comment)!
at://did:plc:vm5ic6kt3kljuttejxqf3dz3/app.bsky.graph.starterpack/3lbnevuobkq2z

View on Bluesky Show all post labels

Reposted by Danny To Eun Kim
Andrew Drozdov mrdrozdov.com · Nov 20, 2024
Mat is not on 🦋—posting on his behalf! It's time to revisit common assumptions in IR! Embeddings have improved drastically, but mainstream IR evals have stagnated since MSMARCO + BEIR. We ask: on private or tricky IR tasks, are rerankers better? Surely, reranking many docs is best?

View on Bluesky Download image Show all post labels

Reposted by Danny To Eun Kim
Martin Potthast martin-potthast.com · Nov 14, 2024
Time for a starter pack on information retrieval: go.bsky.app/MXPJoTn
at://did:plc:fr4mrqeybprbevl5eenagk5f/app.bsky.graph.starterpack/3lawqgkwp2z25

View on Bluesky Show all post labels

Reposted by Danny To Eun Kim
michael ginn handle.invalid · Nov 13, 2024
Replying to Maria Antoniak
Hey all! I started a second starter pack with people who didn't make the first one, please let me know if you'd like to be added: go.bsky.app/JgneRQk
at://did:plc:tj7jc54fic4zlahv4qfmg7mq/app.bsky.graph.starterpack/3las2wmbj2e2s

View on Bluesky Show all post labels

Reposted by Danny To Eun Kim
Sireesh Gururaja siree.sh · Nov 12, 2024
I'm keeping track of people at the CMU Language Technologies Institute here: go.bsky.app/NhTwCVb. Follow along!
at://did:plc:rfkbaph36it2i66g6a7uzcht/app.bsky.graph.starterpack/3laepe7jj7x2j

View on Bluesky Show all post labels

An unhandled error has occurred. Reload 🗙