MT Group at FBK
- Last week at the @fbk-mt.bsky.social seminars, we hosted Elizabeth Salesky from Google DeepMind, presenting her work on "Translation and Language Modeling with Pixels" #NLProc #tokenization #MT
- Our pick of the week by @linaconti.bsky.social: "Voice gender diversity: expression, perception and acoustics" by @victor-rosi.bsky.social and Carolyn McGettigan, Royal Society Open Science (2025)
- Our pick of the week by @apierg.bsky.social: "Glitter: A Multi-Sentence, Multi-Reference Benchmark for Gender-Fair German Machine Translation" by Praval, @jhackenbuchner.bsky.social, @gattanasio.cc, and @a-lauscher.bsky.social (EMNLP 2025 Findings)
- Impressive work by the Glitter team: a new human-made benchmark for German gender-inclusive MT with long passages and multiple inclusive approaches + experiments showing that MT systems and LLMs still fall short in generating inclusive outputs. aclanthology.org/2025.finding... ✨
- Our pick of the week by @uzumakidhairya.bsky.social: "How Does Quantization Affect Multilingual LLMs?" by Kelly Marchisio, Saurabh Dash, @hongyuchen.bsky.social, Dennis Aumiller, Ahmet Üstün, @sarahooker.bsky.social, @sebruder.bsky.social (EMNLP 2024 Findings) aclanthology.org/2024.finding...
- It was great having Hosein Mohebbi (@hmohebbi.bsky.social) speak about interpretability for speech Transformers at our #MTSeminars! Thanks for the insights 🎤 #NLP #XAI
- 🚀 JOB ALERT 3: The FBK's MT Unit is hiring! Join us as a Researcher in Responsible & Trustworthy NLP and advance ethical, fair, and transparent language technologies. If you care about building safe and accountable AI systems, you can apply here: 👉 jobs.fbk.eu/Annunci/Offe...
- Happy to welcome our new PhD student Dhairya Suman, who will be researching model efficiency in machine translation! #nlproc @uzumakidhairya.bsky.social
- 🚀 JOB ALERT2: The FBK's MT Unit is hiring! Join us as a Junior Research Engineer in Multimodal LLMs and help build next-gen AI across text, speech, and video. If you’re passionate about deep learning and multimodal NLP, we want to hear from you! Apply at jobs.fbk.eu/Annunci/Jobs...
- 🚀 JOB ALERT: The FBK's MT Unit is hiring! Join us as a Researcher in Multimodal LLMs and work at the forefront of AI, combining text, speech, and video. If you’re passionate about deep learning and next-gen models, we want to hear from you! Apply 👉 jobs.fbk.eu/Annunci/Jobs...
- Reposted by MT Group at FBK[Not loaded yet]
- Heading home tired but very happy after a fantastic #EMNLP2025 (and some well-deserved vacation 😎). We were super busy presenting 5 papers! It was fantastic catching up with colleagues, exchanging ideas, and seeing all the amazing work in the #NLProc community! (1/5)
- Our pick of the week by @zhihangxie.bsky.social: "#Speech Discrete Tokens or Continuous Features? A Comparative Analysis for Spoken Language Understanding in #SpeechLLMs" by Dingdong Wang, Junan Li, Mingyu Cui, et al. (#EMNLP2025) aclanthology.org/2025.emnlp-m... #SLU #SpeechTech
- 🚀 Exciting news from the @fbk-mt.bsky.social group! @bsavoldi.bsky.social , @linaconti.bsky.social, @matteo-negri.bsky.social & @luisabentivogli.bsky.social are attending #EMNLP2025 in Suzhou 🇨🇳! Come to our sessions & let's connect: 🔗 mt.fbk.eu/fbk-mt-at-em... We’re also hiring postdocs!⚡
- 🎉🎓Congratulations to our PhD student @dennisfucci.bsky.social on a very successful thesis defense! 👏 Many thanks to the evaluation committee members @deboranozza.bsky.social, Mirco Ravanelli, and Leonardo Badino for their insightful feedback and appreciation of his work! #nlproc
- Reposted by MT Group at FBK[Not loaded yet]
- Our #PickOfTheWeek by @beomseok-lee.bsky.social: "Can Speech LLMs Think while Listening?" by Yi-Jen Shih, @rdesh26.bsky.social, Chunyang Wu, Wei Zhou, SK Bong, Yashesh Gaur, Jay Mahadeokar, Ozlem Kalinli, Mike Seltzer (2025). #Speech #SpeechLLM #LLM #SpeechTech #AI
- Our next presentation is by @sarapapi.bsky.social: "How real is your real-time simultaneous speech-to-text translation system?" Look for the answer in her TACL paper: direct.mit.edu/tacl/article... #lt2025fbk
- Our Marco Gaido presenting FAMA, the first family of large-scale open-science speech foundation models for English and Italian. Joint work with the @speechtekfbk.bsky.social group. Data, code, models publicly available, check all info in the paper: clic2025.unica.it/wp-content/u... #lt2025fbk
- @bsavoldi.bsky.social presenting our new multilingual benchmark for evaluating LLMs on gender-neutral translation. Catch our paper at #EMNLP2025 ℹ️ arxiv.org/pdf/2501.09409 #lt2025fbk
- Now it's the turn of our @dennisfucci.bsky.social presenting the #ACL2025NLP paper on explaining gender bias in speech translation 📖 aclanthology.org/2025.acl-sho... #lt2025fbk
- The Language Technology at FBK workshop has just started with a truly insightful talk by @deboranozza.bsky.social: "A Roadmap for the Everyday Use of LLMs: Emerging Risks and Research Directions" #LT2025FBK
- Our pick of the week by @linaconti.bsky.social: "Lost in Transcription, Found in Distribution Shift: Demystifying Hallucination in Speech Foundation Models" by Hanin Atwany, @abdulwaheed.bsky.social, Rita Singh, Monojit Choudhury, and Bhiksha Raj (ACL Findings 2025) aclanthology.org/2025.finding...
- 🚀 Join us for the LT@FBK day 2025! Discover cutting-edge research and highlights in speech and language technologies from Fondazione Bruno Kessler (FBK) 📅 October 28, 2025 📍FBK, Trento ℹ️ lt-highlights.fbk.eu
- Our pick of the week by @bsavoldi.bsky.social: "Acoustic-based Gender Differentiation in Speech-aware Language Models" by Junhyuk Choi, Jihwan Seol, Nayeon Kim, Chanhee Cho, EunBin Cho, Bugeun Kim. arxiv.org/abs/2509.21125 #Gender #SpeechLLM #Speech
- Reposted by MT Group at FBK[Not loaded yet]
- Marco Gaido introducing SimulStream, an #OpenSource Tool for Simultaneous #Speech #Translation 🗣️🖥️📝 at the DI Center Demo Day at FBK! The tool is going to be released soon. Stay tuned! 👀
- Marco Gaido and Roldano Cattoni presenting our SimulStream Demo at the DI Center Demo Day at FBK! The open-source tool, which is going to be released soon, natively supports any speech-to-text #HuggingFace models! 🤖 #SpeechTech #Translation
- Reposted by MT Group at FBK[Not loaded yet]
- Our very own @sarapapi.bsky.social presenting FAMA at #clicit2025: 📗Paper: clic2025.unica.it/wp-content/u... 🔗 Models: hf.co/collections/... 📊 Data: hf.co/datasets/FBK... 💻 Code: github.com/hlt-mt/FBK-f... Joint work with @speechtekfbk.bsky.social
- Reposted by MT Group at FBK[Not loaded yet]
- Reposted by MT Group at FBK[Not loaded yet]
- Our pick of the week by @sarapapi.bsky.social: "Retrieval-Augmented Generation for AI-Generated Content: A Survey" by Penghao Zhao, Hailin Zhang, Qinhan Yu, Zhengren Wang, Yunteng Geng, Fangcheng Fu, Ling Yang, Wentao Zhang, Jie Jiang, Bin Cui. arxiv.org/pdf/2402.19473 #RAG #survey
- Our pick of the week by Marco Gaido: "Context-Driven Dynamic #Pruning for Large #Speech #Foundation Models" by Masao Someki, Shikhar Bharadwaj, Atharva Anand Joshi, Chyi-Jiunn Lin, Jinchuan Tian, Jee-weon Jung, @shinjiw.bsky.social, et al. #INTERSPEECH2025. arxiv.org/abs/2505.18860
- Our pick of the week by @zhihangxie.bsky.social: "SimulMEGA: MoE Routers are Advanced Policy Makers for Simultaneous Speech Translation" by Chenyang Le, Bing Han, Jinshun Li, Songyong Chen, and Yanmin Qian (2025) #Speech #Simultaneous #Translation #MOE #SpeechTech
- Our pick of the week by @beomseok-lee.bsky.social: "Speech Discrete Tokens or Continuous Features? A Comparative Analysis for Spoken Language Understanding in SpeechLLMs" by Dingdong Wang, Junan Li, Mingyu Cui, Dongchao Yang, Xueyuan Chen, and Helen Meng (EMNLP 2025)
- Our pick of the week by @linaconti.bsky.social: "I Have No Mouth, and I Must Rhyme: Uncovering Internal Phonetic Representations in LLaMA 3.2" @jackmerullo.bsky.social, Arjun Khurana, Oliver McLaughlin (ICML 2025 Workshop on Assessing World Models) arxiv.org/abs/2508.02527 #XAI #LLM
- Heading home after an exciting and intense @aclmeeting.bsky.social in Vienna! We had a great time presenting our work and connecting with the community. Thanks to everyone who came by! #acl2025 #nlproc (1/6)
- Before presenting our speech model compression task at IWSLT, our pick of the week by Marco Gaido: WhisperKit arxiv.org/abs/2507.10860 by Atila Orhon, Arda Okan, Berkin Durmus, Zach Nagengast and @eduardo-pacheco.bsky.social (ICML 2025)—an early attempt to bring large-scale models to edge devices
- Our pick of the week by @zhihangxie.bsky.social: "Adversarial Speech-Text Pre-Training for Speech Translation" by Chenxuan Liu, Liping Chen, Weitai Zhang, Xiaoxi Li, Peiwang Tang, Mingjia Yu, Sreyan Ghosh, and Zhongyi Ye (ICASSP 2025)
- Our pick of the week by @zhihangxie.bsky.social 🔎: "PHRASED: Phrase Dictionary Biasing for Speech Translation" by Peidong Wang, Jian Xue, Rui Zhao, Junkun Chen, Aswin Shanmugam Subramanian, Jinyu Li arxiv.org/abs/2506.09175 #speech #AI #ST #NLP
- Reposted by MT Group at FBK[Not loaded yet]
- Reposted by MT Group at FBK[Not loaded yet]
- Our pick of the week by @dennisfucci.bsky.social: "Speech Representation Analysis Based on Inter- and Intra-Model Similarities" by @yelkheir.bsky.social, @ratedali.bsky.social, and Shammur Absar Chowdhury (ICASSP Workshops 2024)
- Findings from ieeexplore.ieee.org/document/106... show that speech SSL models converge on similar embedding spaces, but via different routes. While overall representations align, individual neurons learn distinct localized concepts. Interesting read! @fbk-mt.bsky.social
- Our pick of the week by @beomseok-lee.bsky.social: "ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs" by Pooneh Mousavi, Yingzhi Wang, Mirco Ravanelli, and Cem Subakan (2025) arxiv.org/abs/2505.19937 #SLU #speech #multimodal #LLM
- Our pick of the week by @apierg.bsky.social: "Agree to Disagree? A Meta-Evaluation of LLM Misgendering" by Arjun Subramonian, @dippedrusk.com, Preethi Seshadri, Dietrich Klakow, Kai-Wei Chang, and Yizhou Sun #LLM #NLProc #fairness
- Reposted by MT Group at FBK🔍 Stiamo studiando come l'AI viene usata in Italia e per farlo abbiamo costruito un sondaggio! 👉 bit.ly/sondaggio_ai... (è anonimo, richiede ~10 minuti, e se partecipi o lo fai girare ci aiuti un sacco🙏) Ci interessa anche raggiungere persone che non si occupano e non sono esperte di AI!
- Reposted by MT Group at FBK🚀 New tech report out! Meet FAMA, our open-science speech foundation model family for both ASR and ST in 🇬🇧 English and 🇮🇹 Italian. The models are live and ready to try on @hf.co: 🔗 huggingface.co/collections/... 📄 Preprint: arxiv.org/abs/2505.22759 #ASR #ST #OpenScience #MultilingualAI
- Our pick of the week by @linaconti.bsky.social : "Languages in Multilingual Speech Foundation Models Align Both Phonetically and Semantically", by @soheunshim.bsky.social, Domenico De Cristofaro, Chengzhi Martin Hu, @alessandrovietti.bsky.social, @barbaraplank.bsky.social #AI #XAI #speech #nlproc
- Our pick of the week by @bsavoldi.bsky.social: "Lost in Translation: Artificial Intelligence and the Demand for Foreign Language Skills" by @pmllanos.bsky.social and @carlbfrey.bsky.social (2025) oxfordmartin.ox.ac.uk/publications... #AI #translation #MT
- Reposted by MT Group at FBKSo happy our paper “Different Speech Translation Models Encode and Translate Speaker Gender Differently” was accepted at #ACL2025 (main)! 🎉 The preprint will be out soon! #SpeechTranslation #GenderBias #Interpretability @aclmeeting.bsky.social
- 🎉 Excited to share that our @sarapapi.bsky.social has won the 2024 Best PhD Award from the Information and Engineering Doctoral School for her thesis “Direct Speech Translation in Constrained Contexts: The Simultaneous and Subtitling Scenarios.” #nlproc #speech #speechprocessing #speechtranslation