Dynamic ensemble of classifiers and security relevant methods of android’s API : an empirical study
| Ano de defesa: | 2022 |
|---|---|
| Autor(a) principal: | |
| Orientador(a): | |
| Banca de defesa: | |
| Tipo de documento: | Dissertação |
| Tipo de acesso: | Acesso embargado |
| Idioma: | eng |
| Instituição de defesa: |
Universidade Federal de Pernambuco
|
| Programa de Pós-Graduação: |
Programa de Pos Graduacao em Ciencia da Computacao
|
| Departamento: |
Não Informado pela instituição
|
| País: |
Brasil
|
| Palavras-chave em Português: | |
| Link de acesso: | https://repositorio.ufpe.br/handle/123456789/45398 |
Resumo: | The Android operating system provides functions and methods to handle sensitive data to se- cure users’ data. Sensitive data is every data that can identify the user, such as GPS location, biometric data, and banking data. The Android security literature proposes extracting binary features from a method and classifying the method into one of the Security Relevant Method’s classes, adding information about how the method handles sensitive data. However, there is a gap in the literature where Dynamic Ensemble algorithms are not evaluated. Dynamic En- semble techniques are state of the art on Multiple Classifiers Systems, which do not explicitly address the problem of a dataset of binary features. Thus, this work tackles the gap related to Dynamic Ensemble applied to Security Relevant Methods classification. Our analyzes show that, unlikely initially stated in the literature, SVM is not the best classifier for this problem, being MLP, Random Forest, Gradient Boosted Decision Trees, and META-DES using Random Forest as pool generation gives the best results. We also find that, in general, Dynamic En- semble algorithms have a disadvantage compared to monolithic classifiers. Furthermore, this disadvantage is exacerbated in algorithms that use distance-based classifiers, such as OLP. When using the Triplet Loss embedding algorithm, we observed an increase in performance for kNN and OLP, but not for other Dynamic Ensemble techniques, showing that a set of binary features has a more significant impact on these algorithms. |
| id |
UFPE_79ff559ca35d171f998388c02a646fd0 |
|---|---|
| oai_identifier_str |
oai:repositorio.ufpe.br:123456789/45398 |
| network_acronym_str |
UFPE |
| network_name_str |
Repositório Institucional da UFPE |
| repository_id_str |
|
| spelling |
RODRIGUES, Walber de Macedohttp://lattes.cnpq.br/8700122611473574http://lattes.cnpq.br/8577312109146354CAVALCANTI, George Darmiton da Cunha2022-08-03T15:29:25Z2022-08-03T15:29:25Z2022-02-10RODRIGUES, Walber de Macedo. Dynamic ensemble of classifiers and security relevant methods of android’s API: an empirical study. 2022. Dissertação (Mestrado em Ciência da Computação) - Universidade Federal de Pernambuco, Recife, 2022.https://repositorio.ufpe.br/handle/123456789/45398The Android operating system provides functions and methods to handle sensitive data to se- cure users’ data. Sensitive data is every data that can identify the user, such as GPS location, biometric data, and banking data. The Android security literature proposes extracting binary features from a method and classifying the method into one of the Security Relevant Method’s classes, adding information about how the method handles sensitive data. However, there is a gap in the literature where Dynamic Ensemble algorithms are not evaluated. Dynamic En- semble techniques are state of the art on Multiple Classifiers Systems, which do not explicitly address the problem of a dataset of binary features. Thus, this work tackles the gap related to Dynamic Ensemble applied to Security Relevant Methods classification. Our analyzes show that, unlikely initially stated in the literature, SVM is not the best classifier for this problem, being MLP, Random Forest, Gradient Boosted Decision Trees, and META-DES using Random Forest as pool generation gives the best results. We also find that, in general, Dynamic En- semble algorithms have a disadvantage compared to monolithic classifiers. Furthermore, this disadvantage is exacerbated in algorithms that use distance-based classifiers, such as OLP. When using the Triplet Loss embedding algorithm, we observed an increase in performance for kNN and OLP, but not for other Dynamic Ensemble techniques, showing that a set of binary features has a more significant impact on these algorithms.CNPqO sistema operacional Android disponibiliza funções e métodos de manuseio de dados sensíveis para proteger os dados dos usuários. Dados sensíveis são todo tipo de dados que podem identificar o usuário, como localização de GPS, dados biométricos e informações bancárias. A literatura de segurança Android propõe extrair features binárias de um método classificar-lo em uma das classes de Security Relevant Methods, agregando informação de o método manuseia dados sensíveis. Entretanto, existe uma lacuna na literatura onde não são avaliados algoritmos de Ensemble Dinâmico. Os algoritmos de Ensemble Dinâmico são estado da arte para Sistemas de Múltiplos classificadores, que por sua vez, não atacam objetivamente o tipo específico de features binárias. Assim sendo, este trabalho endereça a lacuna em relação a algoritmos de Ensemble Dinâmicos aplicados ao problema de classificação de Security Relevant Methods. Nossas análises motram que, ao contrário do que é inicialmente posto pela literatura, SVM não é o melhor classificador para esse problema, sendo MLP, Random Forest, Gradient Boosted Decision Trees e META-DES usando Random Forest como geração do pool os melhores resultados. Também constatamos que, em geral, algoritmos de Ensemble Dinâmico possuem uma desvantagem em relação aos classificadores monolíticos. Ademais, essa desvantagem é exarcebada em algoritmos que utilizam classificadores baseados em distância, como o OLP. Quando utlizamos o algoritmo de embedding Triplet Loss, observamos um aumento de performance para o kNN e OLP, mas não de outras técnicas de Ensemble Dinâmico, mostrando que um conjunto de features binárias tem impacto mais significativo sobre esses algoritmos.engUniversidade Federal de PernambucoPrograma de Pos Graduacao em Ciencia da ComputacaoUFPEBrasilhttp://creativecommons.org/licenses/by-nc-nd/3.0/br/info:eu-repo/semantics/embargoedAccessSecurity relevant methodsMétodos de ensembleSistema de múltiplos classificadoresEnsenmble dinâmicoDynamic ensemble of classifiers and security relevant methods of android’s API : an empirical studyinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesismestradoreponame:Repositório Institucional da UFPEinstname:Universidade Federal de Pernambuco (UFPE)instacron:UFPEORIGINALDISSERTAÇÃO Walber de Macedo Rodrigues.pdfDISSERTAÇÃO Walber de Macedo Rodrigues.pdfapplication/pdf1807417https://repositorio.ufpe.br/bitstream/123456789/45398/1/DISSERTA%c3%87%c3%83O%20Walber%20de%20Macedo%20Rodrigues.pdf75f8d3f6ca94c181ca5ea9c41155a7f4MD51CC-LICENSElicense_rdflicense_rdfapplication/rdf+xml; charset=utf-8811https://repositorio.ufpe.br/bitstream/123456789/45398/2/license_rdfe39d27027a6cc9cb039ad269a5db8e34MD52LICENSElicense.txtlicense.txttext/plain; charset=utf-82142https://repositorio.ufpe.br/bitstream/123456789/45398/3/license.txt6928b9260b07fb2755249a5ca9903395MD53TEXTDISSERTAÇÃO Walber de Macedo Rodrigues.pdf.txtDISSERTAÇÃO Walber de Macedo Rodrigues.pdf.txtExtracted texttext/plain168426https://repositorio.ufpe.br/bitstream/123456789/45398/4/DISSERTA%c3%87%c3%83O%20Walber%20de%20Macedo%20Rodrigues.pdf.txtd53244ca1317cc1328f603f8d078674eMD54THUMBNAILDISSERTAÇÃO Walber de Macedo Rodrigues.pdf.jpgDISSERTAÇÃO Walber de Macedo Rodrigues.pdf.jpgGenerated Thumbnailimage/jpeg1153https://repositorio.ufpe.br/bitstream/123456789/45398/5/DISSERTA%c3%87%c3%83O%20Walber%20de%20Macedo%20Rodrigues.pdf.jpge141ed21abb3275f356fa68b79e09be0MD55123456789/453982022-08-04 02:59:51.732oai:repositorio.ufpe.br:123456789/45398VGVybW8gZGUgRGVww7NzaXRvIExlZ2FsIGUgQXV0b3JpemHDp8OjbyBwYXJhIFB1YmxpY2HDp8OjbyBkZSBEb2N1bWVudG9zIG5vIFJlcG9zaXTDs3JpbyBEaWdpdGFsIGRhIFVGUEUKIAoKRGVjbGFybyBlc3RhciBjaWVudGUgZGUgcXVlIGVzdGUgVGVybW8gZGUgRGVww7NzaXRvIExlZ2FsIGUgQXV0b3JpemHDp8OjbyB0ZW0gbyBvYmpldGl2byBkZSBkaXZ1bGdhw6fDo28gZG9zIGRvY3VtZW50b3MgZGVwb3NpdGFkb3Mgbm8gUmVwb3NpdMOzcmlvIERpZ2l0YWwgZGEgVUZQRSBlIGRlY2xhcm8gcXVlOgoKSSAtICBvIGNvbnRlw7pkbyBkaXNwb25pYmlsaXphZG8gw6kgZGUgcmVzcG9uc2FiaWxpZGFkZSBkZSBzdWEgYXV0b3JpYTsKCklJIC0gbyBjb250ZcO6ZG8gw6kgb3JpZ2luYWwsIGUgc2UgbyB0cmFiYWxobyBlL291IHBhbGF2cmFzIGRlIG91dHJhcyBwZXNzb2FzIGZvcmFtIHV0aWxpemFkb3MsIGVzdGFzIGZvcmFtIGRldmlkYW1lbnRlIHJlY29uaGVjaWRhczsKCklJSSAtIHF1YW5kbyB0cmF0YXItc2UgZGUgVHJhYmFsaG8gZGUgQ29uY2x1c8OjbyBkZSBDdXJzbywgRGlzc2VydGHDp8OjbyBvdSBUZXNlOiBvIGFycXVpdm8gZGVwb3NpdGFkbyBjb3JyZXNwb25kZSDDoCB2ZXJzw6NvIGZpbmFsIGRvIHRyYWJhbGhvOwoKSVYgLSBxdWFuZG8gdHJhdGFyLXNlIGRlIFRyYWJhbGhvIGRlIENvbmNsdXPDo28gZGUgQ3Vyc28sIERpc3NlcnRhw6fDo28gb3UgVGVzZTogZXN0b3UgY2llbnRlIGRlIHF1ZSBhIGFsdGVyYcOnw6NvIGRhIG1vZGFsaWRhZGUgZGUgYWNlc3NvIGFvIGRvY3VtZW50byBhcMOzcyBvIGRlcMOzc2l0byBlIGFudGVzIGRlIGZpbmRhciBvIHBlcsOtb2RvIGRlIGVtYmFyZ28sIHF1YW5kbyBmb3IgZXNjb2xoaWRvIGFjZXNzbyByZXN0cml0bywgc2Vyw6EgcGVybWl0aWRhIG1lZGlhbnRlIHNvbGljaXRhw6fDo28gZG8gKGEpIGF1dG9yIChhKSBhbyBTaXN0ZW1hIEludGVncmFkbyBkZSBCaWJsaW90ZWNhcyBkYSBVRlBFIChTSUIvVUZQRSkuCgogClBhcmEgdHJhYmFsaG9zIGVtIEFjZXNzbyBBYmVydG86CgpOYSBxdWFsaWRhZGUgZGUgdGl0dWxhciBkb3MgZGlyZWl0b3MgYXV0b3JhaXMgZGUgYXV0b3IgcXVlIHJlY2FlbSBzb2JyZSBlc3RlIGRvY3VtZW50bywgZnVuZGFtZW50YWRvIG5hIExlaSBkZSBEaXJlaXRvIEF1dG9yYWwgbm8gOS42MTAsIGRlIDE5IGRlIGZldmVyZWlybyBkZSAxOTk4LCBhcnQuIDI5LCBpbmNpc28gSUlJLCBhdXRvcml6byBhIFVuaXZlcnNpZGFkZSBGZWRlcmFsIGRlIFBlcm5hbWJ1Y28gYSBkaXNwb25pYmlsaXphciBncmF0dWl0YW1lbnRlLCBzZW0gcmVzc2FyY2ltZW50byBkb3MgZGlyZWl0b3MgYXV0b3JhaXMsIHBhcmEgZmlucyBkZSBsZWl0dXJhLCBpbXByZXNzw6NvIGUvb3UgZG93bmxvYWQgKGFxdWlzacOnw6NvKSBhdHJhdsOpcyBkbyBzaXRlIGRvIFJlcG9zaXTDs3JpbyBEaWdpdGFsIGRhIFVGUEUgbm8gZW5kZXJlw6dvIGh0dHA6Ly93d3cucmVwb3NpdG9yaW8udWZwZS5iciwgYSBwYXJ0aXIgZGEgZGF0YSBkZSBkZXDDs3NpdG8uCgogClBhcmEgdHJhYmFsaG9zIGVtIEFjZXNzbyBSZXN0cml0bzoKCk5hIHF1YWxpZGFkZSBkZSB0aXR1bGFyIGRvcyBkaXJlaXRvcyBhdXRvcmFpcyBkZSBhdXRvciBxdWUgcmVjYWVtIHNvYnJlIGVzdGUgZG9jdW1lbnRvLCBmdW5kYW1lbnRhZG8gbmEgTGVpIGRlIERpcmVpdG8gQXV0b3JhbCBubyA5LjYxMCBkZSAxOSBkZSBmZXZlcmVpcm8gZGUgMTk5OCwgYXJ0LiAyOSwgaW5jaXNvIElJSSwgYXV0b3Jpem8gYSBVbml2ZXJzaWRhZGUgRmVkZXJhbCBkZSBQZXJuYW1idWNvIGEgZGlzcG9uaWJpbGl6YXIgZ3JhdHVpdGFtZW50ZSwgc2VtIHJlc3NhcmNpbWVudG8gZG9zIGRpcmVpdG9zIGF1dG9yYWlzLCBwYXJhIGZpbnMgZGUgbGVpdHVyYSwgaW1wcmVzc8OjbyBlL291IGRvd25sb2FkIChhcXVpc2nDp8OjbykgYXRyYXbDqXMgZG8gc2l0ZSBkbyBSZXBvc2l0w7NyaW8gRGlnaXRhbCBkYSBVRlBFIG5vIGVuZGVyZcOnbyBodHRwOi8vd3d3LnJlcG9zaXRvcmlvLnVmcGUuYnIsIHF1YW5kbyBmaW5kYXIgbyBwZXLDrW9kbyBkZSBlbWJhcmdvIGNvbmRpemVudGUgYW8gdGlwbyBkZSBkb2N1bWVudG8sIGNvbmZvcm1lIGluZGljYWRvIG5vIGNhbXBvIERhdGEgZGUgRW1iYXJnby4KRepositório InstitucionalPUBhttps://repositorio.ufpe.br/oai/requestattena@ufpe.bropendoar:22212022-08-04T05:59:51Repositório Institucional da UFPE - Universidade Federal de Pernambuco (UFPE)false |
| dc.title.pt_BR.fl_str_mv |
Dynamic ensemble of classifiers and security relevant methods of android’s API : an empirical study |
| title |
Dynamic ensemble of classifiers and security relevant methods of android’s API : an empirical study |
| spellingShingle |
Dynamic ensemble of classifiers and security relevant methods of android’s API : an empirical study RODRIGUES, Walber de Macedo Security relevant methods Métodos de ensemble Sistema de múltiplos classificadores Ensenmble dinâmico |
| title_short |
Dynamic ensemble of classifiers and security relevant methods of android’s API : an empirical study |
| title_full |
Dynamic ensemble of classifiers and security relevant methods of android’s API : an empirical study |
| title_fullStr |
Dynamic ensemble of classifiers and security relevant methods of android’s API : an empirical study |
| title_full_unstemmed |
Dynamic ensemble of classifiers and security relevant methods of android’s API : an empirical study |
| title_sort |
Dynamic ensemble of classifiers and security relevant methods of android’s API : an empirical study |
| author |
RODRIGUES, Walber de Macedo |
| author_facet |
RODRIGUES, Walber de Macedo |
| author_role |
author |
| dc.contributor.authorLattes.pt_BR.fl_str_mv |
http://lattes.cnpq.br/8700122611473574 |
| dc.contributor.advisorLattes.pt_BR.fl_str_mv |
http://lattes.cnpq.br/8577312109146354 |
| dc.contributor.author.fl_str_mv |
RODRIGUES, Walber de Macedo |
| dc.contributor.advisor1.fl_str_mv |
CAVALCANTI, George Darmiton da Cunha |
| contributor_str_mv |
CAVALCANTI, George Darmiton da Cunha |
| dc.subject.por.fl_str_mv |
Security relevant methods Métodos de ensemble Sistema de múltiplos classificadores Ensenmble dinâmico |
| topic |
Security relevant methods Métodos de ensemble Sistema de múltiplos classificadores Ensenmble dinâmico |
| description |
The Android operating system provides functions and methods to handle sensitive data to se- cure users’ data. Sensitive data is every data that can identify the user, such as GPS location, biometric data, and banking data. The Android security literature proposes extracting binary features from a method and classifying the method into one of the Security Relevant Method’s classes, adding information about how the method handles sensitive data. However, there is a gap in the literature where Dynamic Ensemble algorithms are not evaluated. Dynamic En- semble techniques are state of the art on Multiple Classifiers Systems, which do not explicitly address the problem of a dataset of binary features. Thus, this work tackles the gap related to Dynamic Ensemble applied to Security Relevant Methods classification. Our analyzes show that, unlikely initially stated in the literature, SVM is not the best classifier for this problem, being MLP, Random Forest, Gradient Boosted Decision Trees, and META-DES using Random Forest as pool generation gives the best results. We also find that, in general, Dynamic En- semble algorithms have a disadvantage compared to monolithic classifiers. Furthermore, this disadvantage is exacerbated in algorithms that use distance-based classifiers, such as OLP. When using the Triplet Loss embedding algorithm, we observed an increase in performance for kNN and OLP, but not for other Dynamic Ensemble techniques, showing that a set of binary features has a more significant impact on these algorithms. |
| publishDate |
2022 |
| dc.date.accessioned.fl_str_mv |
2022-08-03T15:29:25Z |
| dc.date.available.fl_str_mv |
2022-08-03T15:29:25Z |
| dc.date.issued.fl_str_mv |
2022-02-10 |
| dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
| dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
| format |
masterThesis |
| status_str |
publishedVersion |
| dc.identifier.citation.fl_str_mv |
RODRIGUES, Walber de Macedo. Dynamic ensemble of classifiers and security relevant methods of android’s API: an empirical study. 2022. Dissertação (Mestrado em Ciência da Computação) - Universidade Federal de Pernambuco, Recife, 2022. |
| dc.identifier.uri.fl_str_mv |
https://repositorio.ufpe.br/handle/123456789/45398 |
| identifier_str_mv |
RODRIGUES, Walber de Macedo. Dynamic ensemble of classifiers and security relevant methods of android’s API: an empirical study. 2022. Dissertação (Mestrado em Ciência da Computação) - Universidade Federal de Pernambuco, Recife, 2022. |
| url |
https://repositorio.ufpe.br/handle/123456789/45398 |
| dc.language.iso.fl_str_mv |
eng |
| language |
eng |
| dc.rights.driver.fl_str_mv |
http://creativecommons.org/licenses/by-nc-nd/3.0/br/ info:eu-repo/semantics/embargoedAccess |
| rights_invalid_str_mv |
http://creativecommons.org/licenses/by-nc-nd/3.0/br/ |
| eu_rights_str_mv |
embargoedAccess |
| dc.publisher.none.fl_str_mv |
Universidade Federal de Pernambuco |
| dc.publisher.program.fl_str_mv |
Programa de Pos Graduacao em Ciencia da Computacao |
| dc.publisher.initials.fl_str_mv |
UFPE |
| dc.publisher.country.fl_str_mv |
Brasil |
| publisher.none.fl_str_mv |
Universidade Federal de Pernambuco |
| dc.source.none.fl_str_mv |
reponame:Repositório Institucional da UFPE instname:Universidade Federal de Pernambuco (UFPE) instacron:UFPE |
| instname_str |
Universidade Federal de Pernambuco (UFPE) |
| instacron_str |
UFPE |
| institution |
UFPE |
| reponame_str |
Repositório Institucional da UFPE |
| collection |
Repositório Institucional da UFPE |
| bitstream.url.fl_str_mv |
https://repositorio.ufpe.br/bitstream/123456789/45398/1/DISSERTA%c3%87%c3%83O%20Walber%20de%20Macedo%20Rodrigues.pdf https://repositorio.ufpe.br/bitstream/123456789/45398/2/license_rdf https://repositorio.ufpe.br/bitstream/123456789/45398/3/license.txt https://repositorio.ufpe.br/bitstream/123456789/45398/4/DISSERTA%c3%87%c3%83O%20Walber%20de%20Macedo%20Rodrigues.pdf.txt https://repositorio.ufpe.br/bitstream/123456789/45398/5/DISSERTA%c3%87%c3%83O%20Walber%20de%20Macedo%20Rodrigues.pdf.jpg |
| bitstream.checksum.fl_str_mv |
75f8d3f6ca94c181ca5ea9c41155a7f4 e39d27027a6cc9cb039ad269a5db8e34 6928b9260b07fb2755249a5ca9903395 d53244ca1317cc1328f603f8d078674e e141ed21abb3275f356fa68b79e09be0 |
| bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 MD5 MD5 MD5 |
| repository.name.fl_str_mv |
Repositório Institucional da UFPE - Universidade Federal de Pernambuco (UFPE) |
| repository.mail.fl_str_mv |
attena@ufpe.br |
| _version_ |
1862741597137928192 |