Peripheral-foveal vision for real-time object recognition and tracking in video
| dc.contributor.author | Gould, Stephen | en |
| dc.contributor.author | Arfvidsson, Joakim | en |
| dc.contributor.author | Kaehler, Adrian | en |
| dc.contributor.author | Sapp, Benjamin | en |
| dc.contributor.author | Messner, Marius | en |
| dc.contributor.author | Bradski, Gary | en |
| dc.contributor.author | Baumstarck, Paul | en |
| dc.contributor.author | Sukwon, Chung | en |
| dc.contributor.author | Ng, Andrew Y. | en |
| dc.date.accessioned | 2025-12-17T19:41:11Z | |
| dc.date.available | 2025-12-17T19:41:11Z | |
| dc.date.issued | 2007 | en |
| dc.description.abstract | Human object recognition in a physical 3-d environment is still far superior to that of any robotic vision system. We believe that one reason (out of many) for this - one that has not heretofore been significantly exploited in the artificial vision literature - is that humans use a fovea to fixate on, or near an object, thus obtaining a very high resolution image of the object and rendering it easy to recognize. In this paper, we present a novel method for identifying and tracking objects in multi-resolution digital video of partially cluttered environments. Our method is motivated by biological vision systems and uses a learned "attentive" interest map on a low resolution data stream to direct a high resolution "fovea." Objects that are recognized in the fovea can then be tracked using peripheral vision. Because object recognition is run only on a small foveal image, our system achieves performance in real-time object recognition and tracking that is well beyond simpler systems. | en |
| dc.description.status | Peer-reviewed | en |
| dc.format.extent | 7 | en |
| dc.identifier.issn | 1045-0823 | en |
| dc.identifier.scopus | 84880887685 | en |
| dc.identifier.uri | https://hdl.handle.net/1885/733796280 | |
| dc.language.iso | en | en |
| dc.relation.ispartofseries | 20th International Joint Conference on Artificial Intelligence, IJCAI 2007 | en |
| dc.source | IJCAI International Joint Conference on Artificial Intelligence | en |
| dc.title | Peripheral-foveal vision for real-time object recognition and tracking in video | en |
| dc.type | Conference paper | en |
| dspace.entity.type | Publication | en |
| local.bibliographicCitation.lastpage | 2121 | en |
| local.bibliographicCitation.startpage | 2115 | en |
| local.contributor.affiliation | Gould, Stephen; Stanford University | en |
| local.contributor.affiliation | Arfvidsson, Joakim; Stanford University | en |
| local.contributor.affiliation | Kaehler, Adrian; Stanford University | en |
| local.contributor.affiliation | Sapp, Benjamin; Stanford University | en |
| local.contributor.affiliation | Messner, Marius; Stanford University | en |
| local.contributor.affiliation | Bradski, Gary; Stanford University | en |
| local.contributor.affiliation | Baumstarck, Paul; Stanford University | en |
| local.contributor.affiliation | Sukwon, Chung; Stanford University | en |
| local.contributor.affiliation | Ng, Andrew Y.; Stanford University | en |
| local.identifier.ariespublication | U3594520xPUB435 | en |
| local.identifier.pure | 18619197-a330-4167-8869-93f06abb46ec | en |
| local.identifier.url | https://www.scopus.com/pages/publications/84880887685 | en |
| local.type.status | Published | en |