Description: This blog is intended to be a place to share ideas and results that are too weird, incomplete, or off-topic to turn into an academic paper, but that I think may be important. Let me know what you think! Contact links to the left.
About Posts 2023-03-09 The hot mess theory of AI misalignment: More intelligent agents behave less coherently 2022-11-06 Too much efficiency makes everything worse: overfitting and the strong version of Goodhart's law subscribe via RSS
This blog is intended to be a place to share ideas and results that are too weird, incomplete, or off-topic to turn into an academic paper, but that I think may be important. Let me know what you think! Contact links to the left.
About Posts 2023-09-10 Brain dump on the diversity of AI risk 2023-03-09 The hot mess theory of AI misalignment: More intelligent agents behave less coherently 2022-11-06 Too much efficiency makes everything worse: overfitting and the strong version of Goodhart's law subscribe via RSS