Aran Nayebi: Feel free to check out my new LessWrong post for a high-level summary of our two AAAI papers! "From Barriers to Alignment to the First Formal Corrigibility Guarantees" www.lesswrong.com/posts/M5owRc...

See full post

Aran Nayebi anayebi.bsky.social
Feel free to check out my new LessWrong post for a high-level summary of our two AAAI papers! "From Barriers to Alignment to the First Formal Corrigibility Guarantees" www.lesswrong.com/posts/M5owRc...
From Barriers to Alignment to the First Formal Corrigibility Guarantees — LessWrong

This post summarizes two related papers that will appear at AAAI 2026 in January: …

lesswrong.com
- Aran Nayebi anayebi.bsky.social · Nov 21, 2025
  We have 2 papers accepted to #AAAI2026 this year! The first paper 👇 on intrinsic barriers to alignment (establishing no free lunch theorems of encoding "all human values" & the inevitability of reward hacking) will appear as an *oral* presentation at the Special Track on AI Alignment.
Dec 8, 2025 13:09
0 reposts 0 quotes 0 likes

View on Bluesky Show all post labels

An unhandled error has occurred. Reload 🗙