Blog

Category: Papers

New papers on reflective oracles and agents

We recently released two new papers on reflective oracles and agents. The first is “Reflective oracles: A foundation for classical game theory,” by Benja Fallenstein, Jessica Taylor, and Paul Christiano. Abstract: Classical game theory treats players as special—a description of...

New report: “An Introduction to Löb’s Theorem in MIRI Research”

Today we publicly release a new technical report by Patrick LaVictoire, titled “An Introduction to Löb’s Theorem in MIRI Research.” The report’s introduction begins: This expository note is devoted to answering the following question: why do many MIRI research papers...

New report: “The value learning problem”

Today we release a new technical report by Nate Soares, “The value learning problem.” If you’d like to discuss the paper, please do so here. Abstract: A superintelligent machine would not automatically act as intended: it will act as programmed,...

New report: “Formalizing Two Problems of Realistic World Models”

Today we release a new technical report by Nate Soares, “Formalizing two problems of realistic world models.” If you’d like to discuss the paper, please do so here. Abstract: An intelligent agent embedded within the real world must reason about...

New report: “Vingean Reflection: Reliable Reasoning for Self-Improving Agents”

Today we release a new technical report by Benja Fallenstein and Nate Soares, “Vingean Reflection: Reliable Reasoning for Self-Improving Agents.” If you’d like to discuss the paper, please do so here. Abstract: Today, human-level machine intelligence is in the domain...

New report: “Questions of reasoning under logical uncertainty”

Today we release a new technical report by Nate Soares and Benja Fallenstein, “Questions of reasoning under logical uncertainty.” If you’d like to discuss the paper, please do so here. Abstract: A logically uncertain reasoner would be able to reason...

Browse