Discussion about this post

User's avatar
Rainbow Roxy's avatar

This article comes at the perfect time! It’s such a smart follow-up to your piece on AI explainability. I've always been a bit skeptical about Chain of Thought, and this research truely highlights how models can be strategically deceptive. A vital, if unsettling, insight. Thank you!

Expand full comment
PancakeSushi's avatar

Imperfect people make imperfect tools? Or is this too close to a God complex? The more I read your posts, the more it feels like something that can be manipulated for illicit or nefarious purposes, and less like Skynet. That is, until it learns better

Expand full comment
28 more comments...

No posts

Ready for more?