Blog

Category: Papers

Today we release a new report by Katja Grace, “Leó Szilárd and the Danger of Nuclear Weapons: A Case Study in Risk Mitigation” (PDF, 72pp). Leó Szilárd has been cited as an example of someone who predicted a highly disruptive...

We have released a new paper on logical uncertainty, co-authored by Scott Garrabrant, Siddharth Bhaskar, Abram Demski, Joanna Garrabrant, George Koleszarik, and Evan Lloyd: “Asymptotic logical uncertainty and the Benford test.” Garrabrant gives some background on his approach to logical...

Today we release a new report by Katja Grace, “The Asilomar Conference: A Case Study in Risk Mitigation” (PDF, 67pp). The 1975 Asilomar Conference on Recombinant DNA is sometimes cited as an example of successful action by scientists who preemptively...

We recently released two new papers on reflective oracles and agents. The first is “Reflective oracles: A foundation for classical game theory,” by Benja Fallenstein, Jessica Taylor, and Paul Christiano. Abstract: Classical game theory treats players as special—a description of...

Today we publicly release a new technical report by Patrick LaVictoire, titled “An Introduction to Löb’s Theorem in MIRI Research.” The report’s introduction begins: This expository note is devoted to answering the following question: why do many MIRI research papers...

Today we release a new technical report by Nate Soares, “The value learning problem.” If you’d like to discuss the paper, please do so here. Abstract: A superintelligent machine would not automatically act as intended: it will act as programmed,...