Intelligence as inference or forcing Occam on the world

Sunehag, Peter; Hutter, Marcus

Intelligence as inference or forcing Occam on the world

Date

2014-10

Authors

Sunehag, Peter

Hutter, Marcus

Publisher

Springer Verlag

Abstract

We propose to perform the optimization task of Universal Artificial Intelligence (UAI) through learning a reference machine on which good programs are short. Further, we also acknowledge that the choice of reference machine that the UAI objective is based on is arbitrary and, therefore, we learn a suitable machine for the environment we are in. This is based on viewing Occam’s razor as an imperative instead of as a proposition about the world. Since this principle cannot be true for all reference machines, we need to find a machine that makes the principle true. We both want good policies and the environment to have short implementations on the machine. Such a machine is learnt iteratively through a procedure that generalizes the principle underlying the Expectation-Maximization algorithm.