Portable performance on asymmetric multicore processors

Jibaja, Ivan; Cao, Ting; Blackburn, Stephen; McKinley, Kathryn

Portable performance on asymmetric multicore processors

dc.contributor.author	Jibaja, Ivan
dc.contributor.author	Cao, Ting
dc.contributor.author	Blackburn, Stephen
dc.contributor.author	McKinley, Kathryn
dc.date.accessioned	2017-01-24T03:25:15Z
dc.date.available	2017-01-24T03:25:15Z
dc.date.issued	2016
dc.description.abstract	Static and dynamic power constraints are steering chip manufacturers to build single-ISA Asymmetric Multicore Processors (AMPs) with big and small cores. To deliver on their energy efficiency potential, schedulers must consider core sensitivity, load balance, and the critical path. Applying these criteria effectively is challenging especially for complex and non-scalable multithreaded applications. We demonstrate that runtimes for managed languages, which are now ubiquitous, provide a unique opportunity to abstract over AMP complexity and inform scheduling with rich semantics such as thread priorities, locks, and parallelism— information not directly available to the hardware, OS, or application. We present the WASH AMP scheduler, which (1) automatically identifies and accelerates critical threads in concurrent, but non-scalable applications; (2) respects thread priorities; (3) considers core availability and thread sensitivity; and (4) proportionally schedules threads on big and small cores to optimize performance and energy. We introduce new dynamic analyses that identify critical threads and classify applications as sequential, scalable, or non-scalable. Compared to prior work, WASH improves performance by 20% and energy by 9% or more on frequency-scaled AMP hardware (not simulation). Performance advantages grow to 27% when asymmetry increases. Performance advantages are robust to a complex multithreaded adversary independently scheduled by the OS. WASH effectively identifies and optimizes a wider class of workloads than prior work.	en_AU
dc.description.sponsorship	This research is funded by the China Postdoctoral Science Foundation (No. 2015T80139), the National Natural Science Foundation of China (No. 61432018, 61272136, 61133005, 61221062), the National High Technology Research and Development Program of China (No. 2015AA01A303, 2015AA 011505), and the National Science Foundation of the United States (SHF-0910818).	en_AU
dc.format.mimetype	application/pdf	en_AU
dc.identifier.isbn	9781450337786	en_AU
dc.identifier.uri	http://hdl.handle.net/1885/112023
dc.publisher	Association for Computing Machinery	en_AU
dc.relation.ispartof	CGO '16 Proceedings of the 2016 International Symposium on Code Generation and Optimization, Barcelona, Spain - March 12 - 18, 2016	en_AU
dc.rights	© 2016 ACM	en_AU
dc.title	Portable performance on asymmetric multicore processors	en_AU
dc.type	Conference paper	en_AU
dcterms.accessRights	Open Access	en_AU
local.bibliographicCitation.lastpage	35	en_AU
local.bibliographicCitation.startpage	24	en_AU
local.contributor.affiliation	Cao, T., The Australian National University	en_AU
local.contributor.affiliation	Blackburn, S. M., The Australian National University	en_AU
local.contributor.authoremail	ting.cao@anu.edu.au	en_AU
local.contributor.authoruid	u4639340	en_AU
local.identifier.doi	10.1145/2854038.2854047	en_AU
local.identifier.uidSubmittedBy	u1005913	en_AU
local.type.status	Published Version	en_AU

Downloads

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 884 B
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

ANU Research Publications