Cultural advice

The Australian National University acknowledges, celebrates and pays our respects to the Ngunnawal and Ngambri people of the Canberra region and to all First Nations Australians on whose traditional lands we meet and work, and whose cultures are among the oldest continuing cultures in human history.

Aboriginal and Torres Strait Islander peoples are advised that ANU Library collections may include images, names, voices, and other representations of deceased persons.

Material in the collection may contain terms, language or views that reflect the period in which the item was created and may be considered inappropriate today.

Portable performance on asymmetric multicore processors

dc.contributor.authorJibaja, Ivan
dc.contributor.authorCao, Ting
dc.contributor.authorBlackburn, Stephen
dc.contributor.authorMcKinley, Kathryn
dc.date.accessioned2017-01-24T03:25:15Z
dc.date.available2017-01-24T03:25:15Z
dc.date.issued2016
dc.description.abstractStatic and dynamic power constraints are steering chip manufacturers to build single-ISA Asymmetric Multicore Processors (AMPs) with big and small cores. To deliver on their energy efficiency potential, schedulers must consider core sensitivity, load balance, and the critical path. Applying these criteria effectively is challenging especially for complex and non-scalable multithreaded applications. We demonstrate that runtimes for managed languages, which are now ubiquitous, provide a unique opportunity to abstract over AMP complexity and inform scheduling with rich semantics such as thread priorities, locks, and parallelism— information not directly available to the hardware, OS, or application. We present the WASH AMP scheduler, which (1) automatically identifies and accelerates critical threads in concurrent, but non-scalable applications; (2) respects thread priorities; (3) considers core availability and thread sensitivity; and (4) proportionally schedules threads on big and small cores to optimize performance and energy. We introduce new dynamic analyses that identify critical threads and classify applications as sequential, scalable, or non-scalable. Compared to prior work, WASH improves performance by 20% and energy by 9% or more on frequency-scaled AMP hardware (not simulation). Performance advantages grow to 27% when asymmetry increases. Performance advantages are robust to a complex multithreaded adversary independently scheduled by the OS. WASH effectively identifies and optimizes a wider class of workloads than prior work.en_AU
dc.description.sponsorshipThis research is funded by the China Postdoctoral Science Foundation (No. 2015T80139), the National Natural Science Foundation of China (No. 61432018, 61272136, 61133005, 61221062), the National High Technology Research and Development Program of China (No. 2015AA01A303, 2015AA 011505), and the National Science Foundation of the United States (SHF-0910818).en_AU
dc.format.mimetypeapplication/pdfen_AU
dc.identifier.isbn9781450337786en_AU
dc.identifier.urihttp://hdl.handle.net/1885/112023
dc.publisherAssociation for Computing Machineryen_AU
dc.relation.ispartofCGO '16 Proceedings of the 2016 International Symposium on Code Generation and Optimization, Barcelona, Spain - March 12 - 18, 2016en_AU
dc.rights© 2016 ACMen_AU
dc.titlePortable performance on asymmetric multicore processorsen_AU
dc.typeConference paperen_AU
dcterms.accessRightsOpen Accessen_AU
local.bibliographicCitation.lastpage35en_AU
local.bibliographicCitation.startpage24en_AU
local.contributor.affiliationCao, T., The Australian National Universityen_AU
local.contributor.affiliationBlackburn, S. M., The Australian National Universityen_AU
local.contributor.authoruidu4639340en_AU
local.identifier.doi10.1145/2854038.2854047en_AU
local.type.statusPublished Versionen_AU

Downloads

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
884 B
Format:
Item-specific license agreed upon to submission
Description:
abcd