François Fleuret
francois.fleuret.org
This is very great.
Sam Bowman
handle.invalid
·
Dec 18, 2024
Replying to
Sam Bowman
Alongside our paper, we also recorded a roundtable video featuring four of the paper’s authors discussing the results and their implications in detail:
Alignment faking in large language models
YouTube video by Anthropic
youtube.com
Dec 19, 2024 08:29
0 reposts, 0 quotes, 0 likes