We recently released two new papers on reflective oracles and agents. The first is “Reflective oracles: A foundation for classical game theory,” by Benja Fallenstein, Jessica Taylor, and Paul Christiano. Abstract: Classical game theory treats players as special—a description of...
Today we publicly release a new technical report by Patrick LaVictoire, titled “An Introduction to Löb’s Theorem in MIRI Research.” The report’s introduction begins: This expository note is devoted to answering the following question: why do many MIRI research papers...
Today we release a new technical report by Nate Soares, “The value learning problem.” If you’d like to discuss the paper, please do so here. Abstract: A superintelligent machine would not automatically act as intended: it will act as programmed,...
Today we release a new technical report by Nate Soares, “Formalizing two problems of realistic world models.” If you’d like to discuss the paper, please do so here. Abstract: An intelligent agent embedded within the real world must reason about...
Today we release a new technical report by Benja Fallenstein and Nate Soares, “Vingean Reflection: Reliable Reasoning for Self-Improving Agents.” If you’d like to discuss the paper, please do so here. Abstract: Today, human-level machine intelligence is in the domain...
Today we release a new technical report by Nate Soares and Benja Fallenstein, “Questions of reasoning under logical uncertainty.” If you’d like to discuss the paper, please do so here. Abstract: A logically uncertain reasoner would be able to reason...