Tanise Ceron
Postdoc @milanlp.bsky.social | Interested in language models and how they shape the information environment
- Come join our group! Still one day left for applying. 😊
- New year, new job? If that is your current mantra, check the open postdoc positions with Debora Nozza and me at our lab. Deadline is January 31st. milanlproc.github.io/open_positio...
- Some findings that I find particularly impactful for the area of political biases in LLMs: 1) Aligning LLMs with DPO on left-leaning opinions does not have a significant impact on the stance of the models given that vanilla LLMs already reflect a more left-leaning alignment.
- Reposted by Tanise Ceron🚀 We’re opening 2 fully funded postdoc positions in #NLP! Join the MilaNLP team and contribute to our upcoming research projects. 🔗 More details: milanlproc.github.io/open_positio... ⏰ Deadline: Jan 31, 2026
- I will be @euripsconf.bsky.social this week to present our paper as non-archival at the PAIG workshop (Beyong Regulation: Private Governance & Oversight Mechanisms for AI). Very much looking forward to the discussions! If you are at #EurIPS and want to chat about LLM's training data. Reach out!
- We go out of the routine every now and then at the lab. :)
- @agnesedaff.bsky.social presented our work on "Generalizability of Media Frames: Corpus creation and analysis across countries" at *SEM co-located with EMNLP 2025 in China.
- What an inspiring week at #EMNLP2025 in Suzhou🇨🇳! Huge thanks to the organizers and everyone who stopped by our poster/talk!
- Does anyone know any good resource that systematically documents information about the training data of different LLMs (e.g. name of datasets, language proportion, etc whenever available)?
- Reposted by Tanise CeronProud to present our #EMNLP2025 papers! Catch our team across Main, Findings, Workshops & Demos 👇
- 📣 New Preprint! Have you ever wondered what the political content in LLM's training data is? What are the political opinions expressed? What is the proportion of left- vs right-leaning documents in the pre- and post-training data? Do they correlate with the political biases reflected in models?
- We have the answers of these questions here : arxiv.org/pdf/2509.22367 We analyze the political content of the training data from OLMO2, the largest fully open-source model. 🕵️♀️ We run an analysis in all the datasets (2 pre- and 2 post-training) used to train the models. Here are our findings:
- Reposted by Tanise Ceron[Not loaded yet]
- Today Sourabh Dattawad presented our work "Leveraging Media Frames to Improve Normative Diversity in News Recommendations" at INRA (International Workshop on News Recommendation and Analytics) co-located with RecSys 2025 in Prague. arxiv.org/pdf/2509.02266
- Reposted by Tanise Ceron🚨 New paper alert 🚨 Using LLMs as data annotators, you can produce any scientific result you want. We call this **LLM Hacking**. Paper: arxiv.org/pdf/2509.08825
- Reposted by Tanise Ceron[Not loaded yet]
- Reposted by Tanise Ceron[Not loaded yet]
- Reposted by Tanise Ceron🔍 Stiamo studiando come l'AI viene usata in Italia e per farlo abbiamo costruito un sondaggio! 👉 bit.ly/sondaggio_ai... (è anonimo, richiede ~10 minuti, e se partecipi o lo fai girare ci aiuti un sacco🙏) Ci interessa anche raggiungere persone che non si occupano e non sono esperte di AI!
- Reminder for the importance of evaluating political biases robustly. :)
- Reposted by Tanise Ceron[Not loaded yet]
- Reposted by Tanise CeronJoin us in an hour at 17:00 (CEST) for @taniseceron.bsky.social's talk on "Evaluating Political Bias: Insights into Robustness and Multilinguality“. Access to Zoom at join.slack.com/t/tadapolisc... or send me a ✉️
- The #TaDa Speaker Series is back for the spring 🎉 We're looking forward to an exciting line-up of talks by @prashantgarg.bsky.social, @miriamschirmer.bsky.social, @chdausgaard.bsky.social, @taniseceron.bsky.social, @lukashetzer.bsky.social, and Catarina Pereira! More infos at tada.cool & on Slack ⬇️
- Reposted by Tanise Ceron🥁 It's the second half of our 🌱 speaker series (tada.cool) this term, and we couldn't be more excited! Next week (Wednesday, April 30 at 5pm CET), we have the pleasure of welcoming @taniseceron.bsky.social to share insights on "Facilitating Information Access Through Language Models". More details ⬇️
- The #TaDa Speaker Series is back for the spring 🎉 We're looking forward to an exciting line-up of talks by @prashantgarg.bsky.social, @miriamschirmer.bsky.social, @chdausgaard.bsky.social, @taniseceron.bsky.social, @lukashetzer.bsky.social, and Catarina Pereira! More infos at tada.cool & on Slack ⬇️
- Reposted by Tanise CeronWanna keep up with our @milanlp.bsky.social lab? Here is a starter pack of current and former members: bsky.app/starter-pack...at://did:plc:zap6figr4ayaehfj7kg6e4ua/app.bsky.graph.starterpack/3ljmpqjd6qa2x