Identifying Reusable Early-Life Options

dc.contributor.authorWeber, Alineen
dc.contributor.authorMartin, Charles P.en
dc.contributor.authorTorresen, Jimen
dc.contributor.authorDa Silva, Bruno C.en
dc.date.accessioned2026-01-01T08:42:21Z
dc.date.available2026-01-01T08:42:21Z
dc.date.issued2019-09-30en
dc.description.abstractWe introduce a method for identifying short-duration reusable motor behaviors, which we call early-life options, that allow robots to perform well even in the very early stages of their lives. This is important when agents need to operate in environments where the use of poor-performing policies (such as the random policies with which they are typically initialized) may be catastrophic. Our method augments the original action set of the agent with specially-constructed behaviors that maximize performance over a possibly infinite family of related motor tasks. These are akin to primitive reflexes in infant mammals - agents born with our early-life options, even if acting randomly, are capable of producing rudimentary behaviors comparable to those acquired by agents that actively optimize a policy for hundreds of thousands of steps. We also introduce three metrics for identifying useful early-life options and show that they result in behaviors that maximize both the option's expected return while minimizing the risk that executing the option will result in extremely poor performance. We evaluate our technique on three simulated robots tasked with learning to walk under different battery consumption constraints and show that even random policies over early-life options are already sufficient to allow for the agent to perform similarly to agents trained for hundreds of thousands of steps.en
dc.description.sponsorshipThis work was partially supported by the Research Council of Norway (RCN); by the Norwegian Centre for International Cooperation in Education (SIU), part of the project Collaboration on Intelligent Machines (COINMAC), grant no. 261645; and by FAPERGS, grant no. 17/2551-000.en
dc.description.statusPeer-revieweden
dc.format.extent6en
dc.identifier.isbn978-1-5386-8129-9en
dc.identifier.isbn978-1-5386-8128-2en
dc.identifier.issn2161-9484en
dc.identifier.otherORCID:/0000-0001-5683-7529/work/169192990en
dc.identifier.scopus85073674250en
dc.identifier.urihttps://hdl.handle.net/1885/733799264
dc.language.isoenen
dc.publisherInstitute of Electrical and Electronics Engineers Inc.en
dc.relation.ispartof2019 Joint IEEE 9th International Conference on Development and Learning and Epigenetic Robotics, ICDL-EpiRob 2019en
dc.relation.ispartofseries9th Joint IEEE International Conference on Development and Learning and Epigenetic Robotics, ICDL-EpiRob 2019en
dc.relation.ispartofseriesIEEE International Conference on Development and Learning, ICDLen
dc.rightsPublisher Copyright: © 2019 IEEE.en
dc.subjectDevelopment of skills in biological systems and robotsen
dc.subjectMachine Learning methods for robot developmenten
dc.titleIdentifying Reusable Early-Life Optionsen
dc.typeConference paperen
dspace.entity.typePublicationen
local.bibliographicCitation.lastpage340en
local.bibliographicCitation.startpage335en
local.contributor.affiliationWeber, Aline; Universidade Federal do Rio Grande do Sulen
local.contributor.affiliationMartin, Charles P.; University of Osloen
local.contributor.affiliationTorresen, Jim; University of Osloen
local.contributor.affiliationDa Silva, Bruno C.; Universidade Federal do Rio Grande do Sulen
local.identifier.ariespublicationu4421513xPUB985en
local.identifier.doi10.1109/DEVLRN.2019.8850725en
local.identifier.essn2161-9484en
local.identifier.purea994986d-566a-4cf3-8286-086dbf602661en
local.identifier.urlhttps://www.scopus.com/pages/publications/85073674250en
local.type.statusPublisheden

Downloads