On the computability of Solomonoff induction and AIXI

dc.contributor.authorLeike, Jan
dc.contributor.authorHutter, Marcus
dc.date.accessioned2021-05-12T01:39:28Z
dc.date.issued2018-03-15
dc.date.updated2020-11-23T11:50:09Z
dc.description.abstractHow could we solve the machine learning and the artificial intelligence problem if we had infinite computation? Solomonoff induction and the reinforcement learning agent AIXI are proposed answers to this question. Both are known to be incomputable. We quantify this using the arithmetical hierarchy, and prove upper and in most cases corresponding lower bounds for incomputability. Moreover, we show that AIXI is not limit computable, thus it cannot be approximated using finite computation. However there are limit computable ε-optimal approximations to AIXI. We also derive computability bounds for knowledge-seeking agents, and give a limit computable weakly asymptotically optimal reinforcement learning agent.en_AU
dc.description.sponsorshipThis work was supported by ARC grant DP150104590.en_AU
dc.format.mimetypeapplication/pdfen_AU
dc.identifier.issn0304-3975en_AU
dc.identifier.urihttp://hdl.handle.net/1885/232674
dc.language.isoen_AUen_AU
dc.provenancehttps://v2.sherpa.ac.uk/id/publication/12520..."Author accepted manuscript can be made open access on institutional repository with CC BY-NC-ND license after 24 month embargo" from SHERPA/RoMEO site (as at 28.5.2021)
dc.publisherElsevieren_AU
dc.relationhttp://purl.org/au-research/grants/arc/DP150104590en_AU
dc.rights© 2017 Crown Copyrighten_AU
dc.rights.licenseCreative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.sourceTheoretical Computer Scienceen_AU
dc.subjectSolomonoff inductionen_AU
dc.subjectAIXIen_AU
dc.subjectGeneral reinforcement learningen_AU
dc.subjectKnowledge-seeking agentsen_AU
dc.subjectComputabilityen_AU
dc.subjectArithmetical hierarchyen_AU
dc.titleOn the computability of Solomonoff induction and AIXIen_AU
dc.typeJournal articleen_AU
dcterms.accessRightsOpen Access
local.bibliographicCitation.lastpage49en_AU
local.bibliographicCitation.startpage28en_AU
local.contributor.affiliationLeike, Jan, College of Engineering and Computer Science, ANUen_AU
local.contributor.affiliationHutter, Marcus, College of Engineering and Computer Science, ANUen_AU
local.contributor.authoruidLeike, Jan, u5485774en_AU
local.contributor.authoruidHutter, Marcus, u4350841en_AU
local.description.notesImported from ARIESen_AU
local.identifier.absfor080199 - Artificial Intelligence and Image Processing not elsewhere classifieden_AU
local.identifier.absfor019999 - Mathematical Sciences not elsewhere classifieden_AU
local.identifier.ariespublicationa383154xPUB8838en_AU
local.identifier.citationvolume716en_AU
local.identifier.doi10.1016/j.tcs.2017.11.020en_AU
local.identifier.scopusID2-s2.0-85036534919
local.publisher.urlhttps://www.sciencedirect.com/en_AU
local.type.statusAccepted Versionen_AU

Downloads

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
aixicplexx.pdf
Size:
394.21 KB
Format:
Adobe Portable Document Format
Description:
Author Accepted Manuscript