Compress and control

Veness, Joel; Bellemare, Marc G.; Hutter, Marcus; Chua, Alvin; Desjardins, Guillaume

Compress and control

dc.contributor.author	Veness, Joel	en
dc.contributor.author	Bellemare, Marc G.	en
dc.contributor.author	Hutter, Marcus	en
dc.contributor.author	Chua, Alvin	en
dc.contributor.author	Desjardins, Guillaume	en
dc.date.accessioned	2025-12-17T13:41:02Z
dc.date.available	2025-12-17T13:41:02Z
dc.date.issued	2015-06-01	en
dc.description.abstract	This paper describes a new information-theoretic policy evaluation technique for reinforcement learning. This technique converts any compression or density model into a corresponding estimate of value. Under appropriate stationarity and ergodicity conditions, we show that the use of a sufficiently powerful model gives rise to a consistent value function estimator. We also study the behavior of this technique when applied to various Atari 2600 video games, where the use of suboptimal modeling techniques is unavoidable. We consider three fundamentally different models, all too limited to perfectly model the dynamics of the system. Remarkably, we find that our technique provides sufficiently accurate value estimates for effective on-policy control. We conclude with a suggestive study highlighting the potential of our technique to scale to large problems.	en
dc.description.status	Peer-reviewed	en
dc.format.extent	8	en
dc.identifier.isbn	9781577357025	en
dc.identifier.scopus	84960105649	en
dc.identifier.uri	https://hdl.handle.net/1885/733795938
dc.language.iso	en	en
dc.publisher	AI Access Foundation	en
dc.relation.ispartof	Proceedings of the 29th AAAI Conference on Artificial Intelligence, AAAI 2015 and the 27th Innovative Applications of Artificial Intelligence Conference, IAAI 2015	en
dc.relation.ispartofseries	29th AAAI Conference on Artificial Intelligence, AAAI 2015 and the 27th Innovative Applications of Artificial Intelligence Conference, IAAI 2015	en
dc.relation.ispartofseries	Proceedings of the National Conference on Artificial Intelligence	en
dc.rights	Publisher Copyright: Copyright © 2015, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.	en
dc.title	Compress and control	en
dc.type	Conference paper	en
dspace.entity.type	Publication	en
local.bibliographicCitation.lastpage	3023	en
local.bibliographicCitation.startpage	3016	en
local.contributor.affiliation	Veness, Joel; Australian National University	en
local.contributor.affiliation	Bellemare, Marc G.; Australian National University	en
local.contributor.affiliation	Hutter, Marcus; School of Computing, ANU College of Systems and Society, The Australian National University	en
local.contributor.affiliation	Chua, Alvin; Australian National University	en
local.contributor.affiliation	Desjardins, Guillaume; Australian National University	en
local.identifier.ariespublication	u4056230xPUB438	en
local.identifier.pure	c7d1e79b-8509-4597-a801-2b189c89e238	en
local.identifier.url	https://www.scopus.com/pages/publications/84960105649	en
local.type.status	Published	en

Collections

ANU Research Publications

Compress and control

Downloads

Collections