agreed. RL is not (at least by itself) the way to alignment
Haider. (@haider1)
Yoshua Bengio says Reinforcement Learning is a dangerous path for building superintelligence
It can create systems with hidden goals, reward hacking, and behavior that goes against what humans actually want
"an AI that doesn't care about outcomes can't be corrupted by them"
Video
— https://nitter.net/haider1/status/2054252767557145044#m