Martin Potthast

martin-potthast.com

Professor at the University of Kassel, https://hessian.AI, and https://ScaDS.AI. Member of @webis.de Research in information retrieval #IR, natural language processing #NLP, and artificial intelligence.

Joined July 2023

Martin Potthast martin-potthast.com · Jan 2
No cheating, repost the most recent picture of your pet(s)
- Yuval Pinter uvp.bsky.social · Jan 2
  i've had better

Reposted by Martin Potthast
Leonie Weissweiler weissweiler.bsky.social · Dec 11, 2025
🧑‍🔬I’m recruiting PhD students in Natural Language Processing @unileipzig.bsky.social Computer Science, together with @scadsai.bsky.social! Topics include, but aren’t limited to: 🔎Linguistic Interpretability 🌍Multilingual Evaluation 📖Computational Typology Please share! #NLProc #NLP

Reposted by Martin Potthast
Webis Group webis.de · Oct 27, 2025
We just released "German Commons", the largest openly-licensed German text dataset for LLM training: 154B tokens with clear usage rights for research and commercial use. huggingface.co/datasets/coral-nlp/german-commons

Reposted by Martin Potthast
Johanne Trippas jtrippas.bsky.social · Sep 2, 2025
🌟Really excited to share the fourth Strategic Workshop on Information Retrieval (SWIRL) report published in SIGIR Forum! Paper 👉🏻 www.johannetrippas.com/papers/tripp... More info 👉🏻 sites.google.com/view/swirl20... #SWIRL2025 #SIGIR2026 #IR #GenAI #Research #CHIIR2026

Reposted by Martin Potthast
Webis Group webis.de · Jul 18, 2025
Thrilled to announce that Matti Wiegmann has successfully defended his PhD! 🎉🧑‍🎓 Huge congratulations on this incredible achievement! #PhDDefense #AcademicMilestone

Reposted by Martin Potthast
Webis Group webis.de · Jul 18, 2025
Replying to Webis Group
Honored to win the ICTIR Best Paper Honorable Mention Award for "Axioms for Retrieval-Augmented Generation"! Our new axioms are integrated with ir_axioms: github.com/webis-de/ir_... Nice to see axiomatic IR gaining momentum.

Reposted by Martin Potthast
Webis Group webis.de · Jul 18, 2025
We presented two papers at ICTIR 2025 today:
- Axioms for Retrieval-Augmented Generation webis.de/publications...
- Learning Effective Representations for Retrieval Using Self-Distillation with Adaptive Relevance Margins webis.de/publications...

Reposted by Martin Potthast
Ferdinand Schlatt fschlatt.bsky.social · Jul 16, 2025
Want to know how to make bi-encoders more than 3x faster with a new backbone encoder model? Check out our talk on the Token-Independent Text Encoder (TITE) at #SIGIR2025 in the efficiency track. It pools vectors within the model to improve efficiency: dl.acm.org/doi/10.1145/...

Reposted by Martin Potthast
Maik Fröbe maik-froebe.bsky.social · Jul 16, 2025
Now @fschlatt.bsky.social presents "TITE: Token-Independent Text Encoder for Information Retrieval" at #SIGIR2025 Paper: webis.de/publications...

Reposted by Martin Potthast
Maik Fröbe maik-froebe.bsky.social · Jul 18, 2025
Here are some impressions from our ReNeuIR workshop on "Reaching Efficiency in Neural IR" that we had yesterday at #SIGIR2025.

Reposted by Martin Potthast
Webis Group webis.de · Jul 16, 2025
Happy to share that our paper "The Viability of Crowdsourcing for RAG Evaluation" received the Best Paper Honourable Mention at #SIGIR2025! Very grateful to the community for recognizing our work on improving RAG evaluation. 📄 webis.de/publications...

Reposted by Martin Potthast
Maik Fröbe maik-froebe.bsky.social · Jul 15, 2025
Lukas Gienapp presents "The Viability of Crowdsourcing for RAG Evaluation" at #SIGIR2025 The paper is available at: webis.de/publications...

Reposted by Martin Potthast
Ferdinand Schlatt fschlatt.bsky.social · Jul 14, 2025
@mrparryparry.bsky.social presenting our work on reproducing TREC DL 2019 judgements and the implications for evaluating modern ranking models on modern collections. Paper: arxiv.org/abs/2502.20937
Variations in Relevance Judgments and the Shelf Life of Test Collections

The fundamental property of Cranfield-style evaluations, that system rankings are stable even when assessors disagree on individual relevance decisions, was validated on traditional test collections. ...

arxiv.org

Reposted by Martin Potthast
Ferdinand Schlatt fschlatt.bsky.social · Jul 13, 2025
Thank you Carlos for the shout-out of Lightning IR in the LSR tutorial at #SIGIR2025! If you want to fine-tune your own LSR models, check out our framework at github.com/webis-de/lig...

Reposted by Martin Potthast
ScaDS.AI Dresden/Leipzig scadsai.bsky.social · Jul 10, 2025
From July 13-17, 2025, @scadsai.bsky.social will join the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval in Padua, Italy. Our researchers have made the following contributions. Learn more about #SIGIR2025: 👉 sigir2025.dei.unipd.it

Reposted by Martin Potthast
Maik Fröbe maik-froebe.bsky.social · Jun 27, 2025
Do not forget to participate in the #TREC2025 Tip-of-the-Tongue (ToT) Track :) The corpus and baselines (with run files) are now available and easily accessible via the ir_datasets API and the HuggingFace Datasets API. More details are available at: trec-tot.github.io/guidelines

Reposted by Martin Potthast
Webis Group webis.de · Jun 22, 2025
Our paper on self-distillation for training bi-encoders got accepted at #ICTIR2025! By exploiting pretrained encoder capabilities, our approach eliminates expensive teacher models and batch sampling while maintaining the same effectiveness.

Reposted by Martin Potthast
Lighthouse Reports lighthousereports.com · Jun 11, 2025
Most reporting on AI examines worst-case systems deployed under the guise of efficiency. But what would a good faith effort at Ethical AI look like? For two years, we’ve been looking over the shoulder of a city trying to do things differently.

Reposted by Martin Potthast
Jonathan Aldrich jonathanaldrich.bsky.social · May 19, 2025
All @acm.org publications will be 100% Open Access as of January 2026. When we announced this at POPL and CHI this year, conference participants spontaneously erupted in applause. The CS community is excited about ACM's move to OA!

Reposted by Martin Potthast
Maik Fröbe maik-froebe.bsky.social · May 21, 2025
The deadline for submissions to the ReNeuIR workshop at #SIGIR2025 is extended to June 10 😸 Details: reneuir.org #ReNeuIr2025 #SIGIR25
ReNeuIR’25

Workshop on Reaching Efficiency in Neural Information Retrieval

reneuir.org

Reposted by Martin Potthast
Webis Group webis.de · Mar 5, 2025
PAN 2025 Call for Participation: Shared Tasks on Authorship Analysis, Computational Ethics, and Originality We'd like to invite you to participate in the following shared tasks at PAN 2025 held in conjunction with the CLEF conference in Madrid, Spain. Find out more at pan.webis.de/clef25/pan25...
https://pan.webis.de/clef25/pan25-web

pan.webis.de

Reposted by Martin Potthast
Sebastian Heineking sheineking.bsky.social · Apr 30, 2025
Replying to Mark Riedl
We share your concern that LLMs could be prompted to generate responses that are biased in favor of certain products. That is why we are currently organizing a shared task on detecting advertisements in the responses of RAG-based search engines: bsky.app/profile/webi...
- Webis Group webis.de · Apr 30, 2025
  Can LLM-generated ads be blocked? With OpenAI adding shopping options to ChatGPT, this question gains further importance. If you are interested in contributing to the research on LLM-based advertising, please check out our shared task: touche.webis.de/clef25/touch... More details below.

Martin Potthast martin-potthast.com · Apr 30, 2025
The fourth edition of ReNeuIR @ #SIGIR2025 is back!! Check reneuir.org to see what we have in mind this year! Paper submission deadline: May 20, 2025.

Reposted by Martin Potthast
Webis Group webis.de · Apr 30, 2025
Can LLM-generated ads be blocked? With OpenAI adding shopping options to ChatGPT, this question gains further importance. If you are interested in contributing to the research on LLM-based advertising, please check out our shared task: touche.webis.de/clef25/touch... More details below.

Reposted by Martin Potthast
Simon Willison simonwillison.net · Apr 26, 2025
New AI ethics scandal brewing... turns out a team at University of Zurich had dozens of undisclosed AI bot accounts debating with people on /r/ChangeMyView from November 2024 to March 2025 simonwillison.net/2025/Apr/26/...
META: Unauthorized Experiment on CMV Involving AI-generated Comments

[r/changemyview](https://www.reddit.com/r/changemyview/) is a popular (top 1%) well moderated subreddit with an extremely well developed [set of rules](https://www.reddit.com/r/changemyview/wiki/rules...

simonwillison.net

Reposted by Martin Potthast
Internet Archive archive.org · Apr 17, 2025
📢 The Internet Archive needs your help. At a time when information is being rewritten or erased online, a $700 million lawsuit from major record labels threatens to destroy the Wayback Machine. Tell the labels to drop the 78s lawsuit. 👉 Sign our open letter: www.change.org/p/defend-the... 🧵⬇️

Reposted by Martin Potthast
Maik Fröbe maik-froebe.bsky.social · Apr 10, 2025
The Workshop on Open Web Search at #ECIR2025 just starts with a keynote by @claclarke.bsky.social on Annotative Indexing. #WOWS25 #WOWS2025 #ECIR25

Reposted by Martin Potthast
Maik Fröbe maik-froebe.bsky.social · Apr 10, 2025
The Workshop on Open Web Search just finished #WOWS2025 #ECIR2025. It was a very cool experience with many interesting talks. Let's hope we can do it again next year at #ECIR2026 in Delft :)

Reposted by Martin Potthast
Ferdinand Schlatt fschlatt.bsky.social · Apr 9, 2025
Honored to receive the best short paper award and best paper honourable mention award at #ECIR2025. Thank you to all co-authors @maik-froebe.bsky.social, @hscells.bsky.social, Shengyao Zhuang, @bevankoopman.bsky.social, Guido Zuccon, Benno Stein, @martin-potthast.com, @matthias-hagen.bsky.social 🥳

Reposted by Martin Potthast
Webis Group webis.de · Apr 7, 2025
📢 Our paper "The Viability of Crowdsourcing for RAG Evaluation" has been accepted to #SIGIR2025 ! We compared how good humans and LLMs are at writing and judging RAG responses, assembling 1800+ responses across 3 styles, and 47K+ pairwise judgments in 7 quality dimensions. 🧵➡️

Reposted by Martin Potthast
Carl T. Bergstrom carlbergstrom.com · Mar 19, 2025
1. For the past thirty years I've had the best job in the world.  I've had the opportunity to follow my curiosity; explore the workings of nature and society; mentor students and junior colleagues in the same process; and teach generations of students about it all.

Reposted by Martin Potthast
Webis Group webis.de · Mar 5, 2025
Replying to Webis Group
Important Dates
now: Training Data Released
May 23, 2025: Software submission
May 30, 2025: Participant paper submission
June 27, 2025: Peer review notification
July 07, 2025: Camera-ready participant papers submission
Sep 09-12, 2025: Conference

Reposted by Martin Potthast
Webis Group webis.de · Mar 5, 2025
Replying to Webis Group
4. Generative Plagiarism Detection. Given a pair of documents, your task is to identify all contiguous maximal-length passages of reused text between them. pan.webis.de/clef25/pan25...
PAN at CLEF 2025 - Generated Plagiarism Detection

pan.webis.de

Reposted by Martin Potthast
Webis Group webis.de · Mar 5, 2025
Replying to Webis Group
3. Multi-Author Writing Style Analysis. Given a document, determine at which positions the author changes. pan.webis.de/clef25/pan25...
PAN at CLEF 2025 - Multi-Author Writing Style Analysis

pan.webis.de

Reposted by Martin Potthast
Webis Group webis.de · Mar 5, 2025
Replying to Webis Group
2. Multilingual Text Detoxification. Given a toxic piece of text, re-write it in a non-toxic way while saving the main content as much as possible. pan.webis.de/clef25/pan25...
PAN at CLEF 2025 - Multilingual Text Detoxification

pan.webis.de

Reposted by Martin Potthast
Webis Group webis.de · Mar 5, 2025
Replying to Webis Group
1. Voight-Kampff Generative AI Detection. Subtask 1: Given a (potentially obfuscated) text, decide whether it was written by a human or an AI. Subtask 2: Given a document collaboratively authored by human and AI, classify the extent to which the model assisted. pan.webis.de/clef25/pan25...
PAN at CLEF 2025 - Voight-Kampff Generative AI Detection

PAN at CLEF 2025 - Generative AI Detection

pan.webis.de

Reposted by Martin Potthast
Webis Group webis.de · Feb 17, 2025
Interested in joining our research group or do you know someone who might be interested? We have a new vacancy: Research position at the Webis group on Watermarking for Large Language Models. More information: webis.de/for-students...

Reposted by Martin Potthast
Brian Bilston handle.invalid · Jan 24, 2025
Nearly there now - just a few hundred more days to go.

Reposted by Martin Potthast
Webis Group webis.de · Jan 8, 2025
2nd International Workshop on Open Web Search: CfP We invite you to the #ECIR2025 Workshop on Open Web Search #wows2025. Please consider submitting to the scientific track or the WOWS-Eval shared task to enrich the Open Web Index with relevance judgments. Details: opensearchfoundation.org/wows2025
1st International Workshop on Open Web Search #wows2024 - 28 March 2024

Discuss ideas and approaches to open up the web search ecosystem!

opensearchfoundation.org

Reposted by Martin Potthast
danah boyd zephoria.bsky.social · Jan 4, 2025
My advisor warned me that academics trend towards bitterness. He encouraged me to intentionally resist this, remember where I came from, and never forget the privilege of getting to spend a life working with knowledge and ideas. He too said that bitterness and resentment are easy.
- Julia Angwin juliaangwin.com · Jan 4, 2025
  [Not loaded yet]

Martin Potthast martin-potthast.com · Dec 25, 2024
Analyzing game boards ain't ChatGPT's thing, yet. (German conversation, alt texts in English) The game is "Mensch ärgere dich nicht", a dice game in which the aim is to move your own four pieces around the board without being thrown back to square 1 by the pieces of the other players ... 1/4

Martin Potthast martin-potthast.com · Dec 25, 2024
... who move to your own position(s). Despite being almost entirely a game of luck, it's surprisingly funny, known to be just as upsetting on occasion, and reasonably well balanced until the end. Here, yellow is in the lead, but GPT recognizes it only partially. 2/4

Martin Potthast martin-potthast.com · Dec 25, 2024
Yellow won. GPT does not see it. Potentially, the perspective plays a role. 3/4

Martin Potthast martin-potthast.com · Dec 25, 2024
Apparently, it can be made to second-guess itself quite easily. 4/4

Reposted by Martin Potthast
Marius Sältzer handle.invalid · Dec 6, 2024
Before you all delete your accounts on X, you should consider deleting content but "donating" them to science. Many institutions, such as @gesis-dataservices.bsky.social might use them to scrape more effectively than via burner accounts.

Reposted by Martin Potthast
Christopher Schröder cschroeder.bsky.social · Nov 24, 2024
🐣 New release: small-text v2.0.0.dev1 With Small Language Models on the rise, the new version of small-text has been long overdue! Despite the generative AI hype, many real-world tasks still rely on supervised learning—which is reliant on labeled data. #activelearning #nlproc #nlp #llms

Reposted by Martin Potthast
Maik Fröbe maik-froebe.bsky.social · Nov 18, 2024
The #TREC2024 conference just started. Turns out that BM25 is turning 30 🥳 #TREC #TREC24

Martin Potthast martin-potthast.com · Nov 14, 2024
Time for a starter pack on information retrieval: go.bsky.app/MXPJoTn

Reposted by Martin Potthast
ACL handle.invalid · Nov 14, 2024
Hello, Computational linguistics/NLP world in Bluesky! We're creating the same accounts on other social media platforms in Bluesky! #NLProc

Reposted by Martin Potthast
Webis Group webis.de · Nov 13, 2024
Today we will present our poster on Query Variation Robustness of Transformer Models at #EMNLP2024. You can find us at the Information Retrieval and Text Mining 3 poster session at #EMNLP2024.

Reposted by Martin Potthast
botornot botornot.bsky.social · Nov 6, 2024
@bsky.app is there a way to follow all the people someone is following with a click, or make a starter pack from them? Would be a very fast way to create big networks when onboarding.

Martin Potthast martin-potthast.com · Oct 21, 2024
Tomorrow's introductory lecture on IR will be fun: We'll discuss examples of situations where retrieval systems succeed and fail. Here's a nice little example of news retrieval and how RAG systems fail at it. More research is needed if they are going to be used for any type of question.

Reposted by Martin Potthast
Bhaskar Mitra | ভাস্কর মিত্র bmitra.bsky.social · Apr 9, 2024
🔵News: ReNeuIR Workshop is back at #SIGIR2024! » Call for papers: reneuir.org/cfp.html » Shared task on efficient neural IR: reneuir.org/shared_task.... Come participate/present/network with a growing IR research sub-community excited about efficient neural retrieval.
Shared Task

Workshop on Reaching Efficiency in Neural Information Retrieval

reneuir.org

Martin Potthast martin-potthast.com · Mar 6, 2024
How will conversational search AI pay for itself? It may be native ads or product placement in generated answers. At #CHIIR2024 next week, we'll present a user study showing that many people don't recognize ads inserted by LLMs in generated search results: webis.de/publications... #mlsky

Reposted by Martin Potthast
Arno Simons arnosimons.bsky.social · Mar 1, 2024
How does Wikipedia decide whether a scientist should be mentioned in an article that is not about them? 🤷 We call this the problem of "micro-notability", and we've studied how Wikipedia editors deal with it in two articles on CRISPR/Cas9: journals.sagepub.com/doi/10.1177/...