Twitter/X

@GaryMarcus: agreed. RL is not (at least by itself) the way to alignment Haider. (@haider1) Yoshua Bengio says...

2026-05-13 · 14:58 UTC ·@GaryMarcus ·0 min read

agreed. RL is not (at least by itself) the way to alignment

Haider. (@haider1)

Yoshua Bengio says Reinforcement Learning is a dangerous path for building superintelligence

It can create systems with hidden goals, reward hacking, and behavior that goes against what humans actually want

"an AI that doesn't care about outcomes can't be corrupted by them"

Video

— https://nitter.net/haider1/status/2054252767557145044#m