Machine Intelligence Research Institute

Machine Intelligence Research Institute

Formation	2000 (2000)
Type	Nonprofit research institute
Legal status	501(c)(3) tax exempt charity
Purpose	Research into friendly artificial intelligence
Location	Berkeley, California
Chair of the board	Edwin Evans
Executive director	Nate Soares
Key people	Eliezer Yudkowsky
Revenue	$1.7 million (2013)^[1]
Staff	14^[2]
Website	intelligence.org
Formerly called	Singularity Institute, Singularity Institute for Artificial Intelligence

The Machine Intelligence Research Institute (MIRI), formerly the Singularity Institute for Artificial Intelligence (SIAI), is a non-profit organization founded in 2000 to research safety issues related to the development of Strong AI. Nate Soares is the executive director, having taken over from Luke Muehlhauser in May 2015.^[3]

MIRI's technical agenda states that new formal tools are needed in order to ensure the safe operation of future generations of AI software (friendly artificial intelligence).^[4] The organization hosts regular research workshops to develop mathematical foundations for this project,^[5] and has been cited as one of several academic and nonprofit groups studying long-term AI outcomes.^[6]^[7]^[8]

History

In 2000, AI theorist Eliezer Yudkowsky and Internet entrepreneurs Brian and Sabine Atkins founded the Singularity Institute for Artificial Intelligence to "help humanity prepare for the moment when machine intelligence exceeded human intelligence".^[9]^[10]^[11] In early 2005, SIAI relocated from Atlanta, Georgia to Silicon Valley. From 2006 to 2012, the Institute collaborated with Singularity University to produce the Singularity Summit, a science and technology conference. Speakers included Steven Pinker, Peter Norvig, Stephen Wolfram, John Tooby, James Randi, and Douglas Hofstadter.^[12]^[13]^[14]

In mid-2012, the Institute spun off a new organization called the Center for Applied Rationality, whose focus is on using ideas from cognitive science to improve people's effectiveness in their daily lives.^[15]^[16]^[17] Having previously shortened its name to "Singularity Institute", in January 2013 SIAI changed its name to the "Machine Intelligence Research Institute" in order to avoid confusion with Singularity University. MIRI gave control of the Singularity Summit to Singularity University and shifted its focus toward research in mathematics and theoretical computer science.^[18]

In mid-2014, Nick Bostrom's book Superintelligence: Paths, Dangers, Strategies helped spark public discussion about AI's long-run social impact, receiving endorsements from Bill Gates and Elon Musk.^[19]^[20]^[21]^[22] Stephen Hawking and AI pioneer Stuart Russell co-authored a Huffington Post article citing the work of MIRI and other organizations in the area:

Whereas the short-term impact of AI depends on who controls it, the long-term impact depends on whether it can be controlled at all. [...] Although we are facing potentially the best or worst thing ever to happen to humanity, little serious research is devoted to these issues outside small non-profit institutes such as the Cambridge Center for Existential Risk, the Future of Humanity Institute, the Machine Intelligence Research Institute, and the Future of Life Institute.^[7]

In early 2015, MIRI's research was cited in a research priorities document accompanying an open letter on AI that called for "expanded research aimed at ensuring that increasingly capable AI systems are robust and beneficial".^[23] Musk responded by funding a large AI safety grant program, with grant recipients including Bostrom, Russell, Bart Selman, Francesca Rossi, Thomas Dietterich, Manuela M. Veloso, and researchers at MIRI.^[8]^[24]

Research

Forecasting

In addition to mathematical research, MIRI studies strategic questions related to AI, such as: What can (and can't) we predict about future AI technology? How can we improve our forecasting ability? Which interventions available today appear to be the most beneficial, given what little we do know?^[25]

Beginning in 2014, MIRI has funded forecasting work through the independent AI Impacts project. AI Impacts studies historical instances of discontinuous technological change, and has developed new measures of the relative computational power of humans and computer hardware.^[26]^[27]

MIRI researchers' interest in discontinuous AI progress stems from I. J. Good's argument that sufficiently advanced AI systems will eventually outperform humans in software engineering tasks, leading to a feedback loop of increasingly capable AI systems:^[4]^[28]^[23]^[22]

Let an ultraintelligent machine be defined as a machine that can far surpass all the intellectual activities of any man however clever. Since the design of machines is one of these intellectual activities, an ultraintelligent machine could design even better machines; there would then unquestionably be an 'intelligence explosion,' and the intelligence of man would be left far behind. Thus the first ultraintelligent machine is the last invention that man need ever make, provided that the machine is docile enough to tell us how to keep it under control.^[29]

Writers like Bostrom use the term superintelligence in place of Good's ultraintelligence.^[19] Following Vernor Vinge, Good's idea of intelligence explosion has come to be associated with the idea of a "technological singularity".^[30]^[31]^[32] Bostrom and researchers at MIRI have expressed skepticism about the views of singularity advocates like Ray Kurzweil that superintelligence is "just around the corner". MIRI researchers have advocated early safety work as a precautionary measure, while arguing that past predictions of AI progress have not been reliable.^[33]^[22]^[19]

Eliezer Yudkowsky, MIRI's co-founder and senior researcher, is frequently cited for his writing on the long-term social impact of progress in AI. Russell and Norvig's Artificial Intelligence: A Modern Approach, the standard textbook in the field of AI, summarizes Yudkowsky's thesis:

If ultraintelligent machines are a possibility, we humans would do well to make sure that we design their predecessors in such a way that they design themselves to treat us well. [...] Yudkowsky (2008)^[28] goes into more detail about how to design a Friendly AI. He asserts that friendliness (a desire not to harm humans) should be designed in from the start, but that the designers should recognize both that their own designs may be flawed, and that the robot will learn and evolve over time. Thus the problem is one of mechanism design—to define a mechanism for evolving AI systems under a system of checks and balances, and to give the systems utility functions that will remain friendly in the face of such changes.^[31]

Yudkowsky writes on the importance of friendly artificial intelligence in smarter-than-human systems.^[34] This informal goal is reflected in MIRI's recent publications as the requirement that AI systems be "aligned with human interests".^[4] Following Bostrom and Steve Omohundro, MIRI researchers believe that autonomous generally intelligent AI systems will have default incentives to treat human operators as competitors, obstacles, or threats if they are not specifically designed to promote their operators' goals.^[35]^[36]^[19]^[8]

High reliability and error tolerance in AI

The Future of Life Institute (FLI) research priorities document states:

Mathematical tools such as formal logic, probability, and decision theory have yielded significant insight into the foundations of reasoning and decision-making. However, there are still many open problems in the foundations of reasoning and decision. Solutions to these problems may make the behavior of very capable systems much more reliable and predictable. Example research topics in this area include reasoning and decision under bounded computational resources à la Horvitz and Russell, how to take into account correlations between AI systems’ behaviors and those of their environments or of other agents, how agents that are embedded in their environments should reason, and how to reason about uncertainty over logical consequences of beliefs or other deterministic computations. These topics may benefit from being considered together, since they appear deeply linked.^[23]

The priorities document cites MIRI publications in the relevant areas: formalizing cooperation in the prisoner's dilemma between "superrational" software agents;^[37] defining alternatives to causal decision theory and evidential decision theory in Newcomb's problem;^[38] and developing alternatives to Solomonoff's theory of inductive inference for agents embedded in physical environments^[39] and agents reasoning without logical omniscience.^[40]^[22]

Standard decision procedures are not well-specified enough (e.g., with regard to counterfactuals) to be instantiated as algorithms. MIRI researcher Benja Fallenstein and then-researcher Nate Soares write that causal decision theory is "unstable under reflection" in the sense that a rational agent following causal decision theory "correctly identifies that the agent should modify itself to stop using CDT [causal decision theory] to make decisions". MIRI researchers identify "logical decision theories" as alternatives that perform better in general decision-making tasks.^[38]

MIRI also studies self-monitoring and self-verifying software. The FLI research priorities document notes that "a formal system that is sufficiently powerful cannot use formal methods in the obvious way to gain assurance about the accuracy of functionally similar formal systems, on pain of inconsistency via Gödel's incompleteness theorems".^[23] MIRI's publications on Vingean reflection attempt to model the Gödelian limits on self-referential reasoning and identify practically useful exceptions.^[41]

Soares and Fallenstein classify the above research programs as aimed at high reliability and transparency in agent behavior. They separately recommend research into "error-tolerant" software systems, citing human error and default incentives as sources of serious risk.^[35]^[8] The FLI research priorities document adds:

If an AI system is selecting the actions that best allow it to complete a given task, then avoiding conditions that prevent the system from continuing to pursue the task is a natural subgoal (and conversely, seeking unconstrained situations is sometimes a useful heuristic). This could become problematic, however, if we wish to repurpose the system, to deactivate it, or to significantly alter its decision-making process; such a system would rationally avoid these changes. Systems that do not exhibit these behaviors have been termed corrigible systems, and both theoretical and practical work in this area appears tractable and useful.

MIRI's priorities in these areas are summarized in their 2015 technical agenda.^[4]

Value specification

In defining correct goals for autonomous systems, Soares and Fallenstein write, "the 'intentions' of the operators are a complex, vague, fuzzy, context-dependent notion (Yudkowsky 2011).^[42] Concretely writing out the full intentions of the operators in a machine-readable format is implausible if not impossible, even for simple tasks." Soares and Fallenstein propose that autonomous AI systems instead be designed to inductively learn the values of humans from observational data.^[4]

Soares discusses several technical obstacles to value learning in AI: changes in the agent's beliefs may result in a mismatch between the agent's values and its ontology; agents that are well-behaved in training data may induct incorrect values in new domains; and human operators' moral uncertainty may make it difficult to identify or anticipate incorrect inductions.^[22]^[43] Bostrom's Superintelligence discusses the philosophical problems raised by value learning at greater length.^[19]

References

↑ "IRS Form 990" (PDF). Machine Intelligence Research Institute. 2013. Retrieved 12 October 2015.
↑ "Team". Machine Intelligence Research Institute. 2016. Retrieved 4 October 2016.
↑ Muehlhauser, Luke (2015). "A fond farewell and a new Executive Director". MIRI Blog. Retrieved 12 October 2015.
1 2 3 4 5 Soares, Nate; Fallenstein, Benja (2015). "Aligning Superintelligence with Human Interests: A Technical Research Agenda" (PDF). In Miller, James; Yampolskiy, Roman; Armstrong, Stuart; et al. The Technological Singularity: Managing the Journey. Springer.
↑ "Research Workshops". Machine Intelligence Research Institute. 2013. Retrieved 11 October 2015.
↑ GiveWell (2015). Potential risks from advanced artificial intelligence (Report). Retrieved 11 October 2015.
1 2 Hawking, Stephen; Tegmark, Max; Russell, Stuart; Wilczek, Frank (2014). "Transcending Complacency on Superintelligent Machines". The Huffington Post. Retrieved 11 October 2015.
1 2 3 4 Basulto, Dominic (2015). "The very best ideas for preventing artificial intelligence from wrecking the planet". The Washington Post. Retrieved 11 October 2015.
↑ Ackerman, Elise (2008). "Annual A.I. conference to be held this Saturday in San Jose". San Jose Mercury News. Retrieved 11 October 2015.
↑ "Singularity Institute Strategic Plan" (PDF). Machine Intelligence Research Institute. 2011. Retrieved 12 October 2015.
↑ "Scientists Fear Day Computers Become Smarter Than Humans". Fox News Channel. Associated Press. 2007. Retrieved 12 October 2015.
↑ Abate, Tom (2006). "Smarter than thou?". San Francisco Chronicle. Retrieved 12 October 2015.
↑ Abate, Tom (2007). "Public meeting will re-examine future of artificial intelligence". San Francisco Chronicle. Retrieved 12 October 2015.
↑ "Singularity Summit: An Annual Conference on Science, Technology, and the Future". Machine Intelligence Research Institute. 2012. Retrieved 12 October 2015.
↑ Muehlhauser, Luke (2012). "July 2012 Newsletter". MIRI Blog. Retrieved 12 October 2015.
↑ Stiefel, Todd; Metskas, Amanda K. (22 May 2013). "Julia Galef". The Humanist Hour. Episode 083. The Humanist. Retrieved 3 March 2015.
↑ Chen, Angela (2014). "More Rational Resolutions". The Wall Street Journal. Retrieved 5 March 2015.
↑ Muehlhauser, Luke (2013). "We are now the "Machine Intelligence Research Institute" (MIRI)". MIRI Blog. Retrieved 12 October 2015.
1 2 3 4 5 Bostrom, Nick (2014). Superintelligence: Paths, Dangers, Strategies (First edition. ed.). ISBN 0199678111.
↑ Muehlhauser, Luke (2015). "Musk and Gates on superintelligence and fast takeoff". Luke Muehlhauser Blog. Retrieved 12 October 2015.
↑ D'Orazio, Dante (2014). "Elon Musk says artificial intelligence is 'potentially more dangerous than nukes'". The Verge. Retrieved 5 October 2015.
1 2 3 4 5 LaFrance, Adrienne (2015). "Building Robots With Better Morals Than Humans". The Atlantic. Retrieved 12 October 2015.
1 2 3 4 Future of Life Institute (2015). Research priorities for robust and beneficial artificial intelligence (PDF) (Report). Retrieved 4 October 2015.
↑ "2015 Awardees". Future of Life Institute. 2015. Retrieved 5 October 2015.
↑ Bostrom, Nick; Yudkowsky, Eliezer (2014). "The Ethics of Artificial Intelligence" (PDF). In Frankish, Keith; Ramsey, William. The Cambridge Handbook of Artificial Intelligence. New York: Cambridge University Press. ISBN 978-0-521-87142-6.
↑ Hsu, Jeremy (2015). "Making Sure AI's Rapid Rise Is No Surprise". Discover. Retrieved 12 October 2015.
↑ De Looper, Christian (2015). "Research Suggests Human Brain Is 30 Times As Powerful As The Best Supercomputers". Tech Times. Retrieved 12 October 2015.
1 2 Yudkowsky, Eliezer (2008). "Artificial Intelligence as a Positive and Negative Factor in Global Risk" (PDF). In Bostrom, Nick; Ćirković, Milan. Global Catastrophic Risks. Oxford University Press. ISBN 978-0199606504.
↑ Good, Irving (1965). "Speculations Concerning the First Ultraintelligent Machine" (PDF). Advances in Computers. 6. Retrieved 4 October 2015.
↑ Vinge, Vernor (1993). "The Coming Technological Singularity: How to Survive in the Post-Human Era". Whole Earth Review. Retrieved 12 October 2015.
1 2 Russell, Stuart; Norvig, Peter (2009). Artificial Intelligence: A Modern Approach. Prentice Hall. ISBN 978-0-13-604259-4.
↑ Yudkowsky, Eliezer (2007). "Three Major Singularity Schools". MIRI Blog. Retrieved 11 October 2015.
↑ Bensinger, Rob (2015). "Brooks and Searle on AGI volition and timelines". MIRI Blog. Retrieved 12 October 2015.
↑ Tegmark, Max (2014). "Life, Our Universe and Everything". Our Mathematical Universe: My Quest for the Ultimate Nature of Reality (First edition. ed.). ISBN 9780307744258.
1 2 Soares, Nate; Fallenstein, Benja; Yudkowsky, Eliezer; Armstrong, Stuart (2015). "Corrigibility". AAAI Workshops: Workshops at the Twenty-Ninth AAAI Conference on Artificial Intelligence, Austin, TX, January 25–26, 2015. AAAI Publications.
↑ Omohundro, Steve (2008). "The Basic AI Drives" (PDF). Artificial General Intelligence 2008: Proceedings of the First AGI Conference. Amsterdam: IOS.
↑ LaVictoire, Patrick; Fallenstein, Benja; Yudkowsky, Eliezer; Bárász, Mihály; Christiano, Paul; Herreshoff, Marcello (2014). "Program Equilibrium in the Prisoner's Dilemma via Löb's Theorem". Multiagent Interaction without Prior Coordination: Papers from the AAAI-14 Workshop. AAAI Publications.
1 2 Soares, Nate; Fallenstein, Benja (2015). "Toward Idealized Decision Theory". arXiv:1507.01986 [cs.AI].
↑ Soares, Nate (2015). Formalizing Two Problems of Realistic World-Models (PDF) (Technical report). Machine Intelligence Research Institute. 2015-3.
↑ Soares, Nate; Fallenstein, Benja (2015). Questions of Reasoning under Logical Uncertainty (PDF) (Technical report). Machine Intelligence Research Institute. 2015-1.
↑ Fallenstein, Benja; Soares, Nate (2015). Vingean Reflection: Reliable Reasoning for Self-Improving Agents (PDF) (Technical report). Machine Intelligence Research Institute. 2015-2.
↑ Yudkowsky, Eliezer (2011). "Complex Value Systems in Friendly AI" (PDF). Artificial General Intelligence: 4th International Conference, AGI 2011, Mountain View, CA, USA, August 3–6, 2011. Berlin: Springer.
↑ Soares, Nate (2015). The Value Learning Problem (PDF) (Technical report). Machine Intelligence Research Institute. 2015-4.

External links

Official website

LessWrong

People	Julia Galef Robin Hanson Eliezer Yudkowsky

Organizations	Center for Applied Rationality Future of Humanity Institute Machine Intelligence Research Institute MetaMed

Works	Harry Potter and the Methods of Rationality Rationality: From AI to Zombies

Related concepts	Behavioral economics Bias blind spot Bounded rationality Cognitive bias Effective altruism Existential risk from artificial general intelligence Instrumental rationality Psychology of reasoning Scientific skepticism

Existential risk from artificial general intelligence

Concepts	AI box AI takeover Control problem Friendly artificial intelligence Instrumental convergence Intelligence explosion Machine ethics Superintelligence Technological singularity

Organizations	Center for Applied Rationality Centre for the Study of Existential Risk Future of Humanity Institute Future of Life Institute Leverhulme Centre for the Future of Intelligence Machine Intelligence Research Institute OpenAI

People	Nick Bostrom Stephen Hawking Bill Hibbard Bill Joy Elon Musk Steve Omohundro Huw Price Martin Rees Stuart J. Russell Jaan Tallinn Max Tegmark Frank Wilczek Roman Yampolskiy Eliezer Yudkowsky

Other	Open Letter on Artificial Intelligence, Ethics of artificial intelligence, Controversies and dangers of artificial general intelligence, Artificial intelligence as a global catastrophic risk, Superintelligence: Paths, Dangers, Strategies, Our Final Invention

This article is issued from Wikipedia - version of the 10/5/2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.