Death and Suicide in Universal Artificial Intelligence
Abstract. Reinforcement learning (RL) is a general paradigm for studying intelligent behaviour, with applications ranging from artificial intelligence to psychology and economics. AIXI is a universal solution to the RL problem; it can learn any computable environment. A technical subtlety of AIXI is that it is defined using a mixture over semimeasures that need not sum to 1, rather than over proper probability measures. In this work we argue that the shortfall of a semimeasure can naturally be interpreted as the agent’s estimate of the probability of its death. We formally define death for generally intelligent agents like AIXI, and prove a number of related theorems about their behaviour. Notable discoveries include that agent behaviour can change radically under positive linear transformations of the reward signal (from suicidal to dogmatically self-preserving), and that the agent’s posterior belief that it will survive increases over time.
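The abstract's claim that behaviour can flip from suicidal to self-preserving under a positive linear transformation of the reward signal can be illustrated with a toy calculation. The sketch below is not the paper's formal model: it assumes, purely for illustration, that death is an absorbing state yielding reward 0 forever, so an agent "prefers death" exactly when the discounted value of living is negative.

```python
def value_of_living(rewards, discount=0.9):
    """Discounted value of surviving and receiving the given reward sequence."""
    return sum(r * discount**t for t, r in enumerate(rewards))

def prefers_death(rewards, discount=0.9):
    """Under the toy assumption that death yields reward 0 forever,
    the agent prefers death iff the value of living is negative."""
    return value_of_living(rewards, discount) < 0.0

# Life is a burden: every reward is negative, so death (value 0) looks better.
rewards = [-1.0, -1.0, -1.0]
assert prefers_death(rewards)

# A positive linear transformation r -> 0.5*r + 2 makes every reward positive,
# so the same agent now prefers to stay alive.
shifted = [0.5 * r + 2.0 for r in rewards]
assert not prefers_death(shifted)
```

The flip occurs because a positive linear transformation of rewards does not, in general, apply the same transformation to the implicit value of death, which stays fixed at 0 in this toy model.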
“That Suicide may often be consistent with interest and with our duty to ourselves, no one can question, who allows, that age, sickness, or misfortune may render life a burthen, and make it worse even than annihilation.” — Hume, Of Suicide (1777)
1 Introduction
Reinforcement Learning (RL) has proven to be a fruitful theoretical framework for reasoning about the properties of generally intelligent agents [3]. A good theoretical understanding of these agents is valuable for several reasons. Firstly, it can guide principled attempts to construct such agents [10]. Secondly, once such agents are constructed, it may serve to make their reasoning and behaviour more transparent and intelligible to humans. Thirdly, it may assist in the development of strategies for controlling these agents. The latter challenge has recently received considerable attention in the context of the potential risks posed by these agents to human safety [2]. It has even been argued that control strategies should be devised before generally intelligent agents are first built [8]. In this context, where we must reason about the behaviour of agents in the absence of a full specification of their implementation, a theoretical understanding of their general properties seems indispensable.

© Springer International Publishing Switzerland 2016
B. Steunebrink et al. (Eds.): AGI 2016, LNAI 9782, pp. 23–32, 2016. DOI: 10.1007/978-3-319-41649-6_3
J. Martin et al.
The universally intelligent agent AIXI constitutes a formal mathematical theory of artificial general intelligence [3]. AIXI models its environment using a universal mixture ξ over the class of all lower semi-computable semimeasures, and thus is able to learn any computable environment. Semimeasures are defective probability measures which may sum to less than 1. Originally devised for Solomonoff induction, they are necessary for universal artificial intelligence because the halting problem prevents the existence of a (lower semi-)computable universal measure for the class of (computable) measures [5]. Recent work has shown that their use in RL has techni
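To make the role of the semimeasure deficit concrete, the following sketch restates these definitions in symbols. The notation (environments ν, mixture ξ, prior weights w_ν over a class M) follows standard usage for universal mixtures and is an assumption on our part, since this passage does not fix notation:

```latex
% A semimeasure \nu over finite strings satisfies
%   \nu(\epsilon) \le 1, \qquad \nu(x) \ge \sum_{a} \nu(xa),
% i.e. probability mass may "leak" at each step.
\[
  \xi(x) \;=\; \sum_{\nu \in \mathcal{M}} w_\nu \, \nu(x),
  \qquad \sum_{\nu \in \mathcal{M}} w_\nu \le 1 .
\]
% The per-step shortfall of a semimeasure is nonnegative:
\[
  \nu(x) \;-\; \sum_{a} \nu(xa) \;\ge\; 0 ,
\]
% and the interpretation argued for in the abstract is that this
% shortfall is the agent's estimate of the probability that the
% interaction ends (death) after history x.
```

This is only a schematic restatement; the paper's formal definitions of death and the agent's posterior are developed in the sections that follow.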