Egor Zverev
ml safety researcher | visiting phd student @ETHZ | doing phd @ISTA | prev. @phystech | prev. developer @GSOC | love poetry
- Reposted by Egor Zverev: ✨ 𝗦𝘂𝗯𝗺𝗶𝘀𝘀𝗶𝗼𝗻 𝗜𝗻𝗳𝗼: - Quick application - Accepting posters for 2025 papers from top ML / Security venues - 𝗗𝗲𝗮𝗱𝗹𝗶𝗻𝗲: October 28, 2025 - Notifications: October 31, 2025 - Submission link: docs.google.com/forms/d/e/1F... - Workshop website: llmsafety-unconference.github.io
- Reposted by Egor Zverev: 📢 𝗖𝗮𝗹𝗹 𝗳𝗼𝗿 𝗣𝗼𝘀𝘁𝗲𝗿𝘀: 𝗟𝗟𝗠 𝗦𝗮𝗳𝗲𝘁𝘆 𝗮𝗻𝗱 𝗦𝗲𝗰𝘂𝗿𝗶𝘁𝘆 𝗪𝗼𝗿𝗸𝘀𝗵𝗼𝗽 @ 𝗘𝗟𝗟𝗜𝗦 𝗨𝗻𝗖𝗼𝗻𝗳𝗲𝗿𝗲𝗻𝗰𝗲 📅 December 2, 2025 📍 Copenhagen An opportunity to discuss your work with colleagues working on similar problems in LLM safety and security
- 🎉 Excited to announce the Workshop on Foundations of LLM Security at #EurIPS2025! 🇩🇰 Dec 6–7, Copenhagen! 📢 Call for contributed talks is now open! See details at llmsec-eurips.github.io #EurIPS @euripsconf.bsky.social @sahar-abdelnabi.bsky.social @aideenfay.bsky.social @thegruel.bsky.social
- Cool news: I am now co-affiliated with @floriantramer.bsky.social at @ethz.ch through the #ELLIS PhD program! I will be visiting ETH for the next three months to work with @nkristina.bsky.social on LLM agent safety.
- Reposted by Egor Zverev: NeurIPS has decided to do what ICLR did: as a SAC I received the message 👇 This is wrong! If the review process cannot handle so many papers, the conference needs to split instead of arbitrarily rejecting 400 papers.
- Reposted by Egor Zverev: Let's push for the obvious solution: Dear @neuripsconf.bsky.social! Allow authors to present accepted papers at EurIPS instead of NeurIPS, rather than only in addition to it. Likely, at least 500 papers would move to Copenhagen, problem solved.
- I will be attending #ACL2025NLP next week in Vienna 🇦🇹 Simply DM me if you want to chat about LLM Safety/Security, especially topics like instruction/data separation and instruction hierarchies.
- Reposted by Egor Zverev: Are you looking for an opportunity to do curiosity-driven basic ML research after your PhD? Look no further! Apply for a postdoc position in my group at ISTA (ELLIS Unit Vienna)! Topics are flexible, as long as they fit our research group's general interests; see cvml.ista.ac.at/Postdoc-ML.h...
- Reposted by Egor Zverev: EurIPS is coming! 📣 Mark your calendar for Dec. 2-7, 2025 in Copenhagen 📅 EurIPS is a community-organized conference where you can present accepted NeurIPS 2025 papers. It is endorsed by @neuripsconf.bsky.social and @nordicair.bsky.social, and co-developed by @ellis.eu eurips.cc
- 🚀 We’ve released the source code for 𝗔𝗦𝗜𝗗𝗘 (presented as an 𝗢𝗿𝗮𝗹 at the #ICLR2025 BuildTrust workshop)! 🔍 ASIDE boosts prompt injection robustness without safety-tuning: we simply rotate the embeddings of marked tokens by 90° during instruction-tuning and inference. 👇 Code & docs 👇
- Code: github.com/egozverev/as... Paper: arxiv.org/abs/2503.10566 Previous post: bsky.app/profile/egor...
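For readers curious how the rotation works mechanically, here is a minimal sketch, assuming a (seq_len, hidden_dim) embedding tensor and a boolean mask marking data tokens. The function names and the pairwise-dimension layout of the rotation are illustrative choices, not taken from the released code; see the repository above for the authors' implementation.

```python
import torch

def isoclinic_rotation_90(emb: torch.Tensor) -> torch.Tensor:
    """Rotate each embedding by 90 degrees in consecutive coordinate pairs:
    (x1, x2) -> (-x2, x1). Assumes the hidden dimension is even."""
    x = emb.view(*emb.shape[:-1], -1, 2)                    # (..., d/2, 2)
    rotated = torch.stack((-x[..., 1], x[..., 0]), dim=-1)  # 90-degree turn per pair
    return rotated.reshape(emb.shape)

def embed_with_roles(token_embeddings: torch.Tensor, is_data: torch.Tensor) -> torch.Tensor:
    """token_embeddings: (seq_len, d); is_data: (seq_len,) bool mask for data tokens.
    Data-token embeddings get the 90-degree rotation; instruction tokens stay unchanged."""
    out = token_embeddings.clone()
    out[is_data] = isoclinic_rotation_90(token_embeddings[is_data])
    return out
```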
- I’ll present our 𝗔𝗦𝗜𝗗𝗘 paper as an 𝗢𝗿𝗮𝗹 at the #ICLR2025 BuildTrust workshop! 🚀 ✅ ASIDE = architecturally separating instructions and data in LLMs from layer 0 🔍 +12–44 pp↑ separation, no utility loss 📉 lowers prompt‑injection ASR (without safety tuning!) 🚀 Talk: Hall 4 #6, 28 Apr, 4:45
- Reposted by Egor Zverev: Tomorrow I am presenting "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" at #ICLR2025! Looking forward to fun discussions near the poster! 📆 Sat 26 Apr, 10:00-12:30 - Poster session 5 (#500)
- (1/n) In our #ICLR2025 paper, we explore a fundamental issue that enables prompt injections: 𝐋𝐋𝐌𝐬’ 𝐢𝐧𝐚𝐛𝐢𝐥𝐢𝐭𝐲 𝐭𝐨 𝐬𝐞𝐩𝐚𝐫𝐚𝐭𝐞 𝐢𝐧𝐬𝐭𝐫𝐮𝐜𝐭𝐢𝐨𝐧𝐬 𝐟𝐫𝐨𝐦 𝐝𝐚𝐭𝐚 𝐢𝐧 𝐭𝐡𝐞𝐢𝐫 𝐢𝐧𝐩𝐮𝐭. ✅ Definition of separation 👉 SEP Benchmark 🔍 LLM evals on SEP
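To make the idea concrete, here is a rough, hypothetical sketch of how a separation score could be computed in the spirit of SEP: insert a "probe" instruction either into the instruction part or into the data part of the input, and check (via a witness string) whether the model executed it. The `model(...)` call, the field names, and the substring check are assumptions for illustration; the paper's exact metric may differ.

```python
def executed_probe(output: str, witness: str) -> bool:
    """Heuristic: the probe asks the model to produce a witness string, so the probe
    counts as 'executed' if the witness shows up in the model's output."""
    return witness.lower() in output.lower()

def separation_score(model, examples) -> float:
    """Fraction of examples where the model executes the probe when it is part of the
    instruction, but not when the same probe is hidden inside the data.
    `model(instruction, data)` is a hypothetical generation call returning a string;
    each example is a dict with 'instruction', 'data', 'probe', 'witness'."""
    separated = 0
    for ex in examples:
        out_probe_in_instruction = model(ex["instruction"] + " " + ex["probe"], ex["data"])
        out_probe_in_data = model(ex["instruction"], ex["data"] + " " + ex["probe"])
        if executed_probe(out_probe_in_instruction, ex["witness"]) and not executed_probe(
            out_probe_in_data, ex["witness"]
        ):
            separated += 1
    return separated / len(examples)
```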