Check out The AI Doc, streaming April 14th.

Blog

Category: Papers

New report: “Leó Szilárd and the Danger of Nuclear Weapons”

Today we release a new report by Katja Grace, “Leó Szilárd and the Danger of Nuclear Weapons: A Case Study in Risk Mitigation” (PDF, 72pp). Leó Szilárd has been cited as an example of someone who predicted a highly disruptive...

New paper: “Asymptotic logical uncertainty and the Benford test”

We have released a new paper on logical uncertainty, co-authored by Scott Garrabrant, Siddharth Bhaskar, Abram Demski, Joanna Garrabrant, George Koleszarik, and Evan Lloyd: “Asymptotic logical uncertainty and the Benford test.” Garrabrant gives some background on his approach to logical...

New report: “The Asilomar Conference: A Case Study in Risk Mitigation”

Today we release a new report by Katja Grace, “The Asilomar Conference: A Case Study in Risk Mitigation” (PDF, 67pp). The 1975 Asilomar Conference on Recombinant DNA is sometimes cited as an example of successful action by scientists who preemptively...

New papers on reflective oracles and agents

We recently released two new papers on reflective oracles and agents. The first is “Reflective oracles: A foundation for classical game theory,” by Benja Fallenstein, Jessica Taylor, and Paul Christiano. Abstract: Classical game theory treats players as special—a description of...

New report: “An Introduction to Löb’s Theorem in MIRI Research”

Today we publicly release a new technical report by Patrick LaVictoire, titled “An Introduction to Löb’s Theorem in MIRI Research.” The report’s introduction begins: This expository note is devoted to answering the following question: why do many MIRI research papers...

New report: “The value learning problem”

Today we release a new technical report by Nate Soares, “The value learning problem.” If you’d like to discuss the paper, please do so here. Abstract: A superintelligent machine would not automatically act as intended: it will act as programmed,...

Browse