Blog

Author: Matthew Gray

New paper: “Functional Decision Theory”

MIRI senior researcher Eliezer Yudkowsky and executive director Nate Soares have a new introductory paper out on decision theory: “Functional decision theory: A new theory of instrumental rationality.” Abstract: This paper describes and motivates a new decision theory known as...

New paper: “Incorrigibility in the CIRL Framework”

MIRI assistant research fellow Ryan Carey has a new paper out discussing situations where good performance in Cooperative Inverse Reinforcement Learning (CIRL) tasks fails to imply that software agents will assist or cooperate with programmers. The paper, titled “Incorrigibility in...

Response to Cegłowski on superintelligence

Web developer Maciej Cegłowski recently gave a talk on AI safety (video, text) arguing that we should be skeptical of the standard assumptions that go into working on this problem, and doubly skeptical of the extreme-sounding claims, attitudes, and policies...

Browse

Blog

Author: Matthew Gray

New paper: “Functional Decision Theory”

New paper: “Incorrigibility in the CIRL Framework”

Response to Cegłowski on superintelligence

Categories