-
Mar 24, 2023
Goal-misgeneralization is ELK-hard
can goal-misgeneralization be formulated as an instance of ELK? -
Mar 24, 2023
Hutter-Prize for Prompts
An alternate hutter-prize -
Oct 16, 2021
The AGI needs to be honest
building truthful-ai is hard