December 14, 2023

Weak-to-strong generalization

We present a new research direction for superalignment, together with promising initial results: can we leverage the generalization properties of deep learning to control strong models with weak supervisors?

Share This Post

Leave a Reply

Your email address will not be published. Required fields are marked *


en_USEnglish