Blog

Category: Papers

New report: “Computable probability distributions which converge…”

Back in July 2013, Will Sawin (Princeton) and Abram Demski (USC) wrote a technical report describing a result from that month’s MIRI research workshop. We are finally releasing that report today. It is titled “Computable probability distributions which converge on...

New report: “Toward Idealized Decision Theory”

Today we release a new technical report by Nate Soares and Benja Fallenstein, “Toward idealized decision theory.” If you’d like to discuss the paper, please do so here. Abstract: This paper motivates the study of decision theory as necessary for...

New report: “Tiling agents in causal graphs”

Today we release a new technical report by Nate Soares, “Tiling agents in causal graphs.” The report begins: Fallenstein and Soares [2014] demonstrates that it’s possible for certain types of proof-based agents to “tile” (license the construction of successor agents...

New paper: “Concept learning for safe autonomous AI”

MIRI research associate Kaj Sotala has released a new paper, accepted to the AI & Ethics workshop at AAAI-2015, titled “Concept learning for safe autonomous AI.” The abstract reads: Sophisticated autonomous AI may need to base its behavior on fuzzy...

New report: “UDT with known search order”

Today we release a new technical report from MIRI research associate Tsvi Benson-Tilsen: “UDT with known search order.” Abstract: We consider logical agents in a predictable universe running a variant of updateless decision theory. We give an algorithm to predict...

New paper: “Corrigibility”

Today we release a paper describing a new problem area in Friendly AI research we call corrigibility. The report (PDF) is co-authored by MIRI’s Friendly AI research team (Eliezer Yudkowsky, Benja Fallenstein, Nate Soares) and also Stuart Armstrong from the...