Avaliação da recuperação arquitetural de visões modulares de software a partir de técnicas de agrupamento

Detalhes bibliográficos
Ano de defesa: 2019
Autor(a) principal: Silva, Douglas Eder Uno lattes
Orientador(a): Bittencourt, Roberto Almeida lattes
Banca de defesa: Não Informado pela instituição
Tipo de documento: Dissertação
Tipo de acesso: Acesso aberto
Idioma: por
Instituição de defesa: Universidade Estadual de Feira de Santana
Programa de Pós-Graduação: Mestrado em Computação Aplicada
Departamento: DEPARTAMENTO DE TECNOLOGIA
País: Brasil
Palavras-chave em Português:
Palavras-chave em Inglês:
Área do conhecimento CNPq:
Link de acesso: http://tede2.uefs.br:8080/handle/tede/774
Resumo: Architecture module views of software are made up of modules with distinct functional responsibilities but with dependencies between them. Previous work has evaluated architecture recovery techniques of module views in order to better understand their strengths and weaknesses. In this context, different similarity metrics are used to evaluate such techniques, especially those based on clustering algorithms. However, few studies try to evaluate whether such metrics accurately capture the similarities between two clusters. Among the similarity metrics in the literature, we can cite examples from both the field of software engineering and from other fields (e.g., classification). This work evaluates six cluster similarity metrics through intrinsic quality and stability metrics and the use of software architecture models proposed by developers. To do so, we used the dimensions of stability and authoritativeness, in accordance with what has been discussed in the literature. For authoritativeness, the concentration statistics of the MeCl metric were higher, in comparison with the other similarity metrics. However, in the absence of architectural models, the Purity metric shows better results. As architecture models are very relevant to software engineers, we understand that the MeCl metric is the most appropriate. For stability, all metrics have values close to unity, despite the presence of outliers. Here as well, the MeCl metric was considered the best because of its superiority in this item. Being better in both dimensions, especially in authoritativeness, we decided to use the MeCl metric as the basis for comparison of clustering algorithms. We compared, using the MeCl metric, four agglomerative clustering algorithms in the context of four software systems. For both authoritativeness and stability, the SL90 algorithm produced higher values in two of the four systems studied by comparing the data series generated by all algorithms. In this case, the SL90 agglomerative algorithm was the best. In conclusion, we empirically realized that the MeCl metric is the best metric to measure group similarity; regarding the clustering algorithms, no algorithm exceeds the others in all comparisons, although SL90 presented better results in two of the four systems we analyzed
id UEFS_992d3af8899fc3a18e26ba1187a6fd9b
oai_identifier_str oai:tede2.uefs.br:8080:tede/774
network_acronym_str UEFS
network_name_str Biblioteca Digital de Teses e Dissertações da UEFS
repository_id_str
spelling Bittencourt, Roberto Almeida58176098515http://lattes.cnpq.br/614854666614255104156106560http://lattes.cnpq.br/6566668318845305Silva, Douglas Eder Uno2019-05-27T21:56:17Z2019-02-14SILVA, Douglas Eder Uno. Avaliação da recuperação arquitetural de visões modulares de software a partir de técnicas de agrupamento. 2019. 93f. Dissertação (Mestrado em Computação Aplicada) - Universidade Estadual de Feira de Santana, Feira de Santana, 2019.http://tede2.uefs.br:8080/handle/tede/774Architecture module views of software are made up of modules with distinct functional responsibilities but with dependencies between them. Previous work has evaluated architecture recovery techniques of module views in order to better understand their strengths and weaknesses. In this context, different similarity metrics are used to evaluate such techniques, especially those based on clustering algorithms. However, few studies try to evaluate whether such metrics accurately capture the similarities between two clusters. Among the similarity metrics in the literature, we can cite examples from both the field of software engineering and from other fields (e.g., classification). This work evaluates six cluster similarity metrics through intrinsic quality and stability metrics and the use of software architecture models proposed by developers. To do so, we used the dimensions of stability and authoritativeness, in accordance with what has been discussed in the literature. For authoritativeness, the concentration statistics of the MeCl metric were higher, in comparison with the other similarity metrics. However, in the absence of architectural models, the Purity metric shows better results. As architecture models are very relevant to software engineers, we understand that the MeCl metric is the most appropriate. For stability, all metrics have values close to unity, despite the presence of outliers. Here as well, the MeCl metric was considered the best because of its superiority in this item. Being better in both dimensions, especially in authoritativeness, we decided to use the MeCl metric as the basis for comparison of clustering algorithms. We compared, using the MeCl metric, four agglomerative clustering algorithms in the context of four software systems. For both authoritativeness and stability, the SL90 algorithm produced higher values in two of the four systems studied by comparing the data series generated by all algorithms. In this case, the SL90 agglomerative algorithm was the best. In conclusion, we empirically realized that the MeCl metric is the best metric to measure group similarity; regarding the clustering algorithms, no algorithm exceeds the others in all comparisons, although SL90 presented better results in two of the four systems we analyzedVisões arquiteturais modulares de software são formadas por módulos com responsabilidades distintas mas com dependências entre si. Diversos trabalhos avaliam as técnicas de recuperação arquitetural de visões modulares para entender melhor seus pontos fortes e fracos. Neste contexto, diferentes métricas de similaridade são utilizadas para avaliar tais técnicas, especialmente as que usam algoritmos de agrupamento. Contudo, poucos trabalhos avaliam se tais métricas realmente capturam de maneira fidedigna as similaridades entre dois agrupamentos. Dentre as métricas de similaridade existentes na literatura, pode-se citar métricas tanto da área da engenharia de software quanto de outras áreas (e.g., classificação). Este trabalho avalia seis métricas de similaridade de agrupamentos através de medidas intrínsecas de qualidade e estabilidade e da utilização de modelos arquiteturais propostos por desenvolvedores. Para tanto, usamos as dimensões de estabilidade e autoridade, em conformidade com a literatura. Para a autoridade, as estatísticas de concentração da métrica MeCl foram maiores, em comparação com as demais métricas de similaridade. Contudo, na ausência de modelos arquiteturais, a métrica Pureza apresenta melhores resultados. Como os modelos arquiteturais são muito relevantes para os engenheiros de software, entendemos que a métrica MeCl é a mais adequada. Para a estabilidade, todas as métricas apresentam valores próximos da unidade, apesar da presença de \textit{outliers}. Aqui também, a métrica MeCl foi considerada a melhor devido à sua superioridade neste item. Sendo melhor nas duas dimensões, especialmente em autoridade, usamos a métrica MeCl como base para comparação de algoritmos de agrupamento. Comparamos, usando a métrica MeCl, quatro algoritmos de agrupamento aglomerativos no contexto de quatro sistemas de software. Tanto para a autoridade quanto a estabilidade, o algoritmo SL90 gerou valores mais altos em dois dos quatro sistemas estudados ao comparar as séries de dados geradas por todos os algoritmos. Neste caso, o algoritmo aglomerativo SL90 foi o melhor. Em conclusão, percebemos empiricamente que a métrica MeCl é a melhor métrica para medir similaridade de agrupamentos; já em relação aos algoritmos de agrupamento, nenhum algoritmo supera os demais em todas as comparações, apesar de o SL90 ter apresentado melhores resultados em dois dos quatro sistemas analisadosSubmitted by Verena Pereira (verenagoncalves@uefs.br) on 2019-05-27T21:56:17Z No. of bitstreams: 1 Dissertacao_Mestrado_Douglas_Silva_Versao_Final_Completa.pdf: 3141068 bytes, checksum: 78658f8d2933aba5b1fddaa41e057ae0 (MD5)Made available in DSpace on 2019-05-27T21:56:17Z (GMT). No. of bitstreams: 1 Dissertacao_Mestrado_Douglas_Silva_Versao_Final_Completa.pdf: 3141068 bytes, checksum: 78658f8d2933aba5b1fddaa41e057ae0 (MD5) Previous issue date: 2019-02-14application/pdfporUniversidade Estadual de Feira de SantanaMestrado em Computação AplicadaUEFSBrasilDEPARTAMENTO DE TECNOLOGIAevolução de softwarearquitetura de softwarevisão modularrecuperação arquiteturalavaliação experimentalmétricasevolution of softwaresoftware architecturemodular visioncumulative architectureexperimental evaluationmetricsCIENCIA DA COMPUTACAO::TEORIA DA COMPUTACAOAvaliação da recuperação arquitetural de visões modulares de software a partir de técnicas de agrupamentoinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesis3033172823111442046006006004335108523020347051-862078257083325301info:eu-repo/semantics/openAccessreponame:Biblioteca Digital de Teses e Dissertações da UEFSinstname:Universidade Estadual de Feira de Santana (UEFS)instacron:UEFSORIGINALDissertacao_Mestrado_Douglas_Silva_Versao_Final_Completa.pdfDissertacao_Mestrado_Douglas_Silva_Versao_Final_Completa.pdfapplication/pdf3141068http://tede2.uefs.br:8080/bitstream/tede/774/2/Dissertacao_Mestrado_Douglas_Silva_Versao_Final_Completa.pdf78658f8d2933aba5b1fddaa41e057ae0MD52LICENSElicense.txtlicense.txttext/plain; charset=utf-82089http://tede2.uefs.br:8080/bitstream/tede/774/1/license.txt7b5ba3d2445355f386edab96125d42b7MD51tede/7742019-05-27 18:56:17.316oai:tede2.uefs.br:8080:tede/774Tk9UQTogQ09MT1FVRSBBUVVJIEEgU1VBIFBSP1BSSUEgTElDRU4/QQpFc3RhIGxpY2VuP2EgZGUgZXhlbXBsbyA/IGZvcm5lY2lkYSBhcGVuYXMgcGFyYSBmaW5zIGluZm9ybWF0aXZvcy4KCkxJQ0VOP0EgREUgRElTVFJJQlVJPz9PIE4/Ty1FWENMVVNJVkEKCkNvbSBhIGFwcmVzZW50YT8/byBkZXN0YSBsaWNlbj9hLCB2b2M/IChvIGF1dG9yIChlcykgb3UgbyB0aXR1bGFyIGRvcyBkaXJlaXRvcyBkZSBhdXRvcikgY29uY2VkZSA/IFVuaXZlcnNpZGFkZSAKWFhYIChTaWdsYSBkYSBVbml2ZXJzaWRhZGUpIG8gZGlyZWl0byBuP28tZXhjbHVzaXZvIGRlIHJlcHJvZHV6aXIsICB0cmFkdXppciAoY29uZm9ybWUgZGVmaW5pZG8gYWJhaXhvKSwgZS9vdSAKZGlzdHJpYnVpciBhIHN1YSB0ZXNlIG91IGRpc3NlcnRhPz9vIChpbmNsdWluZG8gbyByZXN1bW8pIHBvciB0b2RvIG8gbXVuZG8gbm8gZm9ybWF0byBpbXByZXNzbyBlIGVsZXRyP25pY28gZSAKZW0gcXVhbHF1ZXIgbWVpbywgaW5jbHVpbmRvIG9zIGZvcm1hdG9zID91ZGlvIG91IHY/ZGVvLgoKVm9jPyBjb25jb3JkYSBxdWUgYSBTaWdsYSBkZSBVbml2ZXJzaWRhZGUgcG9kZSwgc2VtIGFsdGVyYXIgbyBjb250ZT9kbywgdHJhbnNwb3IgYSBzdWEgdGVzZSBvdSBkaXNzZXJ0YT8/byAKcGFyYSBxdWFscXVlciBtZWlvIG91IGZvcm1hdG8gcGFyYSBmaW5zIGRlIHByZXNlcnZhPz9vLgoKVm9jPyB0YW1iP20gY29uY29yZGEgcXVlIGEgU2lnbGEgZGUgVW5pdmVyc2lkYWRlIHBvZGUgbWFudGVyIG1haXMgZGUgdW1hIGM/cGlhIGEgc3VhIHRlc2Ugb3UgCmRpc3NlcnRhPz9vIHBhcmEgZmlucyBkZSBzZWd1cmFuP2EsIGJhY2stdXAgZSBwcmVzZXJ2YT8/by4KClZvYz8gZGVjbGFyYSBxdWUgYSBzdWEgdGVzZSBvdSBkaXNzZXJ0YT8/byA/IG9yaWdpbmFsIGUgcXVlIHZvYz8gdGVtIG8gcG9kZXIgZGUgY29uY2VkZXIgb3MgZGlyZWl0b3MgY29udGlkb3MgCm5lc3RhIGxpY2VuP2EuIFZvYz8gdGFtYj9tIGRlY2xhcmEgcXVlIG8gZGVwP3NpdG8gZGEgc3VhIHRlc2Ugb3UgZGlzc2VydGE/P28gbj9vLCBxdWUgc2VqYSBkZSBzZXUgCmNvbmhlY2ltZW50bywgaW5mcmluZ2UgZGlyZWl0b3MgYXV0b3JhaXMgZGUgbmluZ3U/bS4KCkNhc28gYSBzdWEgdGVzZSBvdSBkaXNzZXJ0YT8/byBjb250ZW5oYSBtYXRlcmlhbCBxdWUgdm9jPyBuP28gcG9zc3VpIGEgdGl0dWxhcmlkYWRlIGRvcyBkaXJlaXRvcyBhdXRvcmFpcywgdm9jPyAKZGVjbGFyYSBxdWUgb2J0ZXZlIGEgcGVybWlzcz9vIGlycmVzdHJpdGEgZG8gZGV0ZW50b3IgZG9zIGRpcmVpdG9zIGF1dG9yYWlzIHBhcmEgY29uY2VkZXIgPyBTaWdsYSBkZSBVbml2ZXJzaWRhZGUgCm9zIGRpcmVpdG9zIGFwcmVzZW50YWRvcyBuZXN0YSBsaWNlbj9hLCBlIHF1ZSBlc3NlIG1hdGVyaWFsIGRlIHByb3ByaWVkYWRlIGRlIHRlcmNlaXJvcyBlc3Q/IGNsYXJhbWVudGUgCmlkZW50aWZpY2FkbyBlIHJlY29uaGVjaWRvIG5vIHRleHRvIG91IG5vIGNvbnRlP2RvIGRhIHRlc2Ugb3UgZGlzc2VydGE/P28gb3JhIGRlcG9zaXRhZGEuCgpDQVNPIEEgVEVTRSBPVSBESVNTRVJUQT8/TyBPUkEgREVQT1NJVEFEQSBURU5IQSBTSURPIFJFU1VMVEFETyBERSBVTSBQQVRST0M/TklPIE9VIApBUE9JTyBERSBVTUEgQUc/TkNJQSBERSBGT01FTlRPIE9VIE9VVFJPIE9SR0FOSVNNTyBRVUUgTj9PIFNFSkEgQSBTSUdMQSBERSAKVU5JVkVSU0lEQURFLCBWT0M/IERFQ0xBUkEgUVVFIFJFU1BFSVRPVSBUT0RPUyBFIFFVQUlTUVVFUiBESVJFSVRPUyBERSBSRVZJUz9PIENPTU8gClRBTUI/TSBBUyBERU1BSVMgT0JSSUdBPz9FUyBFWElHSURBUyBQT1IgQ09OVFJBVE8gT1UgQUNPUkRPLgoKQSBTaWdsYSBkZSBVbml2ZXJzaWRhZGUgc2UgY29tcHJvbWV0ZSBhIGlkZW50aWZpY2FyIGNsYXJhbWVudGUgbyBzZXUgbm9tZSAocykgb3UgbyhzKSBub21lKHMpIGRvKHMpIApkZXRlbnRvcihlcykgZG9zIGRpcmVpdG9zIGF1dG9yYWlzIGRhIHRlc2Ugb3UgZGlzc2VydGE/P28sIGUgbj9vIGZhcj8gcXVhbHF1ZXIgYWx0ZXJhPz9vLCBhbD9tIGRhcXVlbGFzIApjb25jZWRpZGFzIHBvciBlc3RhIGxpY2VuP2EuCg==Biblioteca Digital de Teses e Dissertaçõeshttp://tede2.uefs.br:8080/PUBhttp://tede2.uefs.br:8080/oai/requestbcuefs@uefs.br|| bcref@uefs.br||bcuefs@uefs.bropendoar:2019-05-27T21:56:17Biblioteca Digital de Teses e Dissertações da UEFS - Universidade Estadual de Feira de Santana (UEFS)false
dc.title.por.fl_str_mv Avaliação da recuperação arquitetural de visões modulares de software a partir de técnicas de agrupamento
title Avaliação da recuperação arquitetural de visões modulares de software a partir de técnicas de agrupamento
spellingShingle Avaliação da recuperação arquitetural de visões modulares de software a partir de técnicas de agrupamento
Silva, Douglas Eder Uno
evolução de software
arquitetura de software
visão modular
recuperação arquitetural
avaliação experimental
métricas
evolution of software
software architecture
modular vision
cumulative architecture
experimental evaluation
metrics
CIENCIA DA COMPUTACAO::TEORIA DA COMPUTACAO
title_short Avaliação da recuperação arquitetural de visões modulares de software a partir de técnicas de agrupamento
title_full Avaliação da recuperação arquitetural de visões modulares de software a partir de técnicas de agrupamento
title_fullStr Avaliação da recuperação arquitetural de visões modulares de software a partir de técnicas de agrupamento
title_full_unstemmed Avaliação da recuperação arquitetural de visões modulares de software a partir de técnicas de agrupamento
title_sort Avaliação da recuperação arquitetural de visões modulares de software a partir de técnicas de agrupamento
author Silva, Douglas Eder Uno
author_facet Silva, Douglas Eder Uno
author_role author
dc.contributor.advisor1.fl_str_mv Bittencourt, Roberto Almeida
dc.contributor.advisor1ID.fl_str_mv 58176098515
dc.contributor.advisor1Lattes.fl_str_mv http://lattes.cnpq.br/6148546666142551
dc.contributor.authorID.fl_str_mv 04156106560
dc.contributor.authorLattes.fl_str_mv http://lattes.cnpq.br/6566668318845305
dc.contributor.author.fl_str_mv Silva, Douglas Eder Uno
contributor_str_mv Bittencourt, Roberto Almeida
dc.subject.por.fl_str_mv evolução de software
arquitetura de software
visão modular
recuperação arquitetural
avaliação experimental
métricas
topic evolução de software
arquitetura de software
visão modular
recuperação arquitetural
avaliação experimental
métricas
evolution of software
software architecture
modular vision
cumulative architecture
experimental evaluation
metrics
CIENCIA DA COMPUTACAO::TEORIA DA COMPUTACAO
dc.subject.eng.fl_str_mv evolution of software
software architecture
modular vision
cumulative architecture
experimental evaluation
metrics
dc.subject.cnpq.fl_str_mv CIENCIA DA COMPUTACAO::TEORIA DA COMPUTACAO
description Architecture module views of software are made up of modules with distinct functional responsibilities but with dependencies between them. Previous work has evaluated architecture recovery techniques of module views in order to better understand their strengths and weaknesses. In this context, different similarity metrics are used to evaluate such techniques, especially those based on clustering algorithms. However, few studies try to evaluate whether such metrics accurately capture the similarities between two clusters. Among the similarity metrics in the literature, we can cite examples from both the field of software engineering and from other fields (e.g., classification). This work evaluates six cluster similarity metrics through intrinsic quality and stability metrics and the use of software architecture models proposed by developers. To do so, we used the dimensions of stability and authoritativeness, in accordance with what has been discussed in the literature. For authoritativeness, the concentration statistics of the MeCl metric were higher, in comparison with the other similarity metrics. However, in the absence of architectural models, the Purity metric shows better results. As architecture models are very relevant to software engineers, we understand that the MeCl metric is the most appropriate. For stability, all metrics have values close to unity, despite the presence of outliers. Here as well, the MeCl metric was considered the best because of its superiority in this item. Being better in both dimensions, especially in authoritativeness, we decided to use the MeCl metric as the basis for comparison of clustering algorithms. We compared, using the MeCl metric, four agglomerative clustering algorithms in the context of four software systems. For both authoritativeness and stability, the SL90 algorithm produced higher values in two of the four systems studied by comparing the data series generated by all algorithms. In this case, the SL90 agglomerative algorithm was the best. In conclusion, we empirically realized that the MeCl metric is the best metric to measure group similarity; regarding the clustering algorithms, no algorithm exceeds the others in all comparisons, although SL90 presented better results in two of the four systems we analyzed
publishDate 2019
dc.date.accessioned.fl_str_mv 2019-05-27T21:56:17Z
dc.date.issued.fl_str_mv 2019-02-14
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.citation.fl_str_mv SILVA, Douglas Eder Uno. Avaliação da recuperação arquitetural de visões modulares de software a partir de técnicas de agrupamento. 2019. 93f. Dissertação (Mestrado em Computação Aplicada) - Universidade Estadual de Feira de Santana, Feira de Santana, 2019.
dc.identifier.uri.fl_str_mv http://tede2.uefs.br:8080/handle/tede/774
identifier_str_mv SILVA, Douglas Eder Uno. Avaliação da recuperação arquitetural de visões modulares de software a partir de técnicas de agrupamento. 2019. 93f. Dissertação (Mestrado em Computação Aplicada) - Universidade Estadual de Feira de Santana, Feira de Santana, 2019.
url http://tede2.uefs.br:8080/handle/tede/774
dc.language.iso.fl_str_mv por
language por
dc.relation.program.fl_str_mv 303317282311144204
dc.relation.confidence.fl_str_mv 600
600
600
dc.relation.department.fl_str_mv 4335108523020347051
dc.relation.cnpq.fl_str_mv -862078257083325301
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade Estadual de Feira de Santana
dc.publisher.program.fl_str_mv Mestrado em Computação Aplicada
dc.publisher.initials.fl_str_mv UEFS
dc.publisher.country.fl_str_mv Brasil
dc.publisher.department.fl_str_mv DEPARTAMENTO DE TECNOLOGIA
publisher.none.fl_str_mv Universidade Estadual de Feira de Santana
dc.source.none.fl_str_mv reponame:Biblioteca Digital de Teses e Dissertações da UEFS
instname:Universidade Estadual de Feira de Santana (UEFS)
instacron:UEFS
instname_str Universidade Estadual de Feira de Santana (UEFS)
instacron_str UEFS
institution UEFS
reponame_str Biblioteca Digital de Teses e Dissertações da UEFS
collection Biblioteca Digital de Teses e Dissertações da UEFS
bitstream.url.fl_str_mv http://tede2.uefs.br:8080/bitstream/tede/774/2/Dissertacao_Mestrado_Douglas_Silva_Versao_Final_Completa.pdf
http://tede2.uefs.br:8080/bitstream/tede/774/1/license.txt
bitstream.checksum.fl_str_mv 78658f8d2933aba5b1fddaa41e057ae0
7b5ba3d2445355f386edab96125d42b7
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
repository.name.fl_str_mv Biblioteca Digital de Teses e Dissertações da UEFS - Universidade Estadual de Feira de Santana (UEFS)
repository.mail.fl_str_mv bcuefs@uefs.br|| bcref@uefs.br||bcuefs@uefs.br
_version_ 1800214654390108160