- Feel free to check out my new LessWrong post, "From Barriers to Alignment to the First Formal Corrigibility Guarantees", for a high-level summary of our two AAAI papers! www.lesswrong.com/posts/M5owRc...
- We have 2 papers accepted to #AAAI2026 this year! The first paper 👇, on intrinsic barriers to alignment (establishing no-free-lunch theorems for encoding "all human values" & the inevitability of reward hacking), will appear as an *oral* presentation in the Special Track on AI Alignment.