Our best intervention was a dataset where the user is in an ethically difficult situation and the assistant gives a high quality, principled response. (opens in new tab)
Our best intervention was a dataset where the user is in an ethically difficult situation and the assistant gives a high quality, principled response. This had the biggest effect despite being quite different from the evaluation set.
Read the original article