Cultural advice

The Australian National University acknowledges, celebrates and pays our respects to the Ngunnawal and Ngambri people of the Canberra region and to all First Nations Australians on whose traditional lands we meet and work, and whose cultures are among the oldest continuing cultures in human history.

Aboriginal and Torres Strait Islander peoples are advised that ANU Library collections may include images, names, voices, and other representations of deceased persons.

Material in the collection may contain terms, language or views that reflect the period in which the item was created and may be considered inappropriate today.

Comparative analysis of feature representations and machine learning methods in Android family classification

dc.contributor.authorBai, Yude
dc.contributor.authorXing, Zhenchang
dc.contributor.authorMa, Duoyuan
dc.contributor.authorLi, Xiaohong
dc.contributor.authorFeng, Zhiyong
dc.date.accessioned2024-01-10T00:50:36Z
dc.date.issued2020
dc.date.updated2022-09-25T08:16:39Z
dc.description.abstractIn order to overcome the lasting increase of Android malware, malware family classification, which clusters malware with the same features into one family, has been proposed as an efficient way for malware analysis. Several machine learning based approaches have been proposed for such task of malware family classification. However, due to the adoption of very different features and learning methods in different approaches, it is still an open question to explore: which approach works better for malware family classification? In this paper, we conduct extensive experiments to answer this question. For three widely known Android malware datasets, we design five multi-classification methods for predicting Android malware family. Based on the survey of Android malware analysis literatures and the observation of a large number of Android malware, we construct a set of 250 common features shared by Android malware. And we also collect 16873 documentary features from Android Developer as a comparison. Furthermore, we investigate the effects of transfer learning for adapting the model on three malware datasets on different scales. Our empirical results show that (i) the classification methods perform very closely, with neural network model having marginally better performance (1% to 3% in F1-score), (ii) features contribute most for classification, especially to enhance API features on larger datasets, and (iii) it is model transferable across different malware datasets based on various transfer learning tasks.en_AU
dc.description.sponsorshipThis work has partially been sponsored by the National Natural Science Foundation of China (No.61872262, 61572349).en_AU
dc.format.mimetypeapplication/pdfen_AU
dc.identifier.issn1389-1286en_AU
dc.identifier.urihttp://hdl.handle.net/1885/311314
dc.language.isoen_AUen_AU
dc.publisherElsevieren_AU
dc.rights© 2020 Elsevier B.V.en_AU
dc.sourceComputer Networksen_AU
dc.titleComparative analysis of feature representations and machine learning methods in Android family classificationen_AU
dc.typeJournal articleen_AU
local.bibliographicCitation.lastpage11en_AU
local.bibliographicCitation.startpage1en_AU
local.contributor.affiliationBai, Yude, Tianjin Universityen_AU
local.contributor.affiliationXing, Zhenchang, College of Engineering and Computer Science, ANUen_AU
local.contributor.affiliationMa, Duoyuan, Tianjin Universityen_AU
local.contributor.affiliationLi, Xiaohong, Tianjin Universityen_AU
local.contributor.affiliationFeng, Zhiyong, Tianjin Universityen_AU
local.contributor.authoruidXing, Zhenchang, u1023389en_AU
local.description.embargo2099-12-31
local.description.notesImported from ARIESen_AU
local.identifier.absfor460300 - Computer vision and multimedia computationen_AU
local.identifier.ariespublicationa383154xPUB15108en_AU
local.identifier.citationvolume184en_AU
local.identifier.doi10.1016/j.comnet.2020.107639en_AU
local.identifier.scopusID2-s2.0-85094888514
local.identifier.thomsonIDWOS:000608115400006
local.publisher.urlhttps://www.elsevier.com/en-auen_AU
local.type.statusPublished Versionen_AU

Downloads

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Comparative analysis of feature representations.pdf
Size:
1.47 MB
Format:
Adobe Portable Document Format
Description:
abcd