An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques
| Ano de defesa: | 2021 |
|---|---|
| Autor(a) principal: | |
| Orientador(a): | |
| Banca de defesa: | |
| Tipo de documento: | Dissertação |
| Tipo de acesso: | Acesso aberto |
| Idioma: | por |
| Instituição de defesa: |
Universidade Federal do Rio Grande do Norte
Brasil UFRN PROGRAMA DE PÓS-GRADUAÇÃO EM SISTEMAS E COMPUTAÇÃO |
| Programa de Pós-Graduação: |
Não Informado pela instituição
|
| Departamento: |
Não Informado pela instituição
|
| País: |
Não Informado pela instituição
|
| Palavras-chave em Português: | |
| Link de acesso: | https://repositorio.ufrn.br/handle/123456789/33299 |
Resumo: | Brazilian Courts have been working in virtualisation of judicial processes since this century’s rise and, since then, a massive volume of data has been produced. Computational techniques have been an intimate ally to face the increasing amount of accumulated and new lawsuits in the system. However, although there is a misunderstanding that automation solutions are always ’intelligent’, which in most cases, it is not valid, there has never been any discussion about the use of intelligent solutions for this end as well as any issues related to automatic predicting and decision making using historical data in context. One of the problems that have already come to light is the bias in judicial data sets worldwide. This work aims to analyse a judicial dataset looking for decision bias and intelligent algorithms suitability. Taking motivation from the social impact of bias in the decision-making process, we have selected gender and social condition of indicted as classes for investigation. We have used a dataset of judicial sentences (built by Além da Pena research group), identified data structure and distribution, created supervised and unsupervised machine learning models applied to the dataset and analysed the occurrence of obvious and non-obvious bias related to judicial decisions. To investigate obvious bias, classification techniques based on k-Nearest Neighbours, Naive Bayes and Decision Trees algorithms, and to non-obvious bias, the unsupervised algorithms like k-Means and Hierarchical Clustering. Our experiments have been conducted to results that do not achieve a conclusive detection of bias but suggest a trend that would confirm its occurrence in the dataset, and therefore, the need for deeper analysis and improvements of techniques. |
| id |
UFRN_879bb67d7fd725055589e07d666a8fc1 |
|---|---|
| oai_identifier_str |
oai:repositorio.ufrn.br:123456789/33299 |
| network_acronym_str |
UFRN |
| network_name_str |
Repositório Institucional da UFRN |
| repository_id_str |
|
| spelling |
An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniquesJudicial dataMachine learningSupervised algorithmsUnsupervised algorithmsData analyticsData miningBrazilian Courts have been working in virtualisation of judicial processes since this century’s rise and, since then, a massive volume of data has been produced. Computational techniques have been an intimate ally to face the increasing amount of accumulated and new lawsuits in the system. However, although there is a misunderstanding that automation solutions are always ’intelligent’, which in most cases, it is not valid, there has never been any discussion about the use of intelligent solutions for this end as well as any issues related to automatic predicting and decision making using historical data in context. One of the problems that have already come to light is the bias in judicial data sets worldwide. This work aims to analyse a judicial dataset looking for decision bias and intelligent algorithms suitability. Taking motivation from the social impact of bias in the decision-making process, we have selected gender and social condition of indicted as classes for investigation. We have used a dataset of judicial sentences (built by Além da Pena research group), identified data structure and distribution, created supervised and unsupervised machine learning models applied to the dataset and analysed the occurrence of obvious and non-obvious bias related to judicial decisions. To investigate obvious bias, classification techniques based on k-Nearest Neighbours, Naive Bayes and Decision Trees algorithms, and to non-obvious bias, the unsupervised algorithms like k-Means and Hierarchical Clustering. Our experiments have been conducted to results that do not achieve a conclusive detection of bias but suggest a trend that would confirm its occurrence in the dataset, and therefore, the need for deeper analysis and improvements of techniques.Universidade Federal do Rio Grande do NorteBrasilUFRNPROGRAMA DE PÓS-GRADUAÇÃO EM SISTEMAS E COMPUTAÇÃOAbreu, Marjory Cristiany da Costahttp://lattes.cnpq.br/9229268386945230http://lattes.cnpq.br/2234040548103596Cavalcante, Everton Ranielly de Sousahttp://lattes.cnpq.br/5065548216266121Oliveira, Laura Emmanuella Alves dos Santos Santana dehttp://lattes.cnpq.br/8996581733787436Souza Neto, Plácido Antônio dehttp://lattes.cnpq.br/3641504724164977Silva, Bruno dos Santos Fernandes da2021-09-08T16:36:35Z2021-09-08T16:36:35Z2021-07-05info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfSILVA, Bruno dos Santos Fernandes da. An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques. 2021. 82f. Dissertação (Mestrado em Sistemas e Computação) - Centro de Ciências Exatas e da Terra, Universidade Federal do Rio Grande do Norte, Natal, 2021.https://repositorio.ufrn.br/handle/123456789/33299info:eu-repo/semantics/openAccessporreponame:Repositório Institucional da UFRNinstname:Universidade Federal do Rio Grande do Norte (UFRN)instacron:UFRN2022-05-02T16:01:57Zoai:repositorio.ufrn.br:123456789/33299Repositório InstitucionalPUBhttp://repositorio.ufrn.br/oai/repositorio@bczm.ufrn.bropendoar:2022-05-02T16:01:57Repositório Institucional da UFRN - Universidade Federal do Rio Grande do Norte (UFRN)false |
| dc.title.none.fl_str_mv |
An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques |
| title |
An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques |
| spellingShingle |
An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques Silva, Bruno dos Santos Fernandes da Judicial data Machine learning Supervised algorithms Unsupervised algorithms Data analytics Data mining |
| title_short |
An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques |
| title_full |
An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques |
| title_fullStr |
An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques |
| title_full_unstemmed |
An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques |
| title_sort |
An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques |
| author |
Silva, Bruno dos Santos Fernandes da |
| author_facet |
Silva, Bruno dos Santos Fernandes da |
| author_role |
author |
| dc.contributor.none.fl_str_mv |
Abreu, Marjory Cristiany da Costa http://lattes.cnpq.br/9229268386945230 http://lattes.cnpq.br/2234040548103596 Cavalcante, Everton Ranielly de Sousa http://lattes.cnpq.br/5065548216266121 Oliveira, Laura Emmanuella Alves dos Santos Santana de http://lattes.cnpq.br/8996581733787436 Souza Neto, Plácido Antônio de http://lattes.cnpq.br/3641504724164977 |
| dc.contributor.author.fl_str_mv |
Silva, Bruno dos Santos Fernandes da |
| dc.subject.por.fl_str_mv |
Judicial data Machine learning Supervised algorithms Unsupervised algorithms Data analytics Data mining |
| topic |
Judicial data Machine learning Supervised algorithms Unsupervised algorithms Data analytics Data mining |
| description |
Brazilian Courts have been working in virtualisation of judicial processes since this century’s rise and, since then, a massive volume of data has been produced. Computational techniques have been an intimate ally to face the increasing amount of accumulated and new lawsuits in the system. However, although there is a misunderstanding that automation solutions are always ’intelligent’, which in most cases, it is not valid, there has never been any discussion about the use of intelligent solutions for this end as well as any issues related to automatic predicting and decision making using historical data in context. One of the problems that have already come to light is the bias in judicial data sets worldwide. This work aims to analyse a judicial dataset looking for decision bias and intelligent algorithms suitability. Taking motivation from the social impact of bias in the decision-making process, we have selected gender and social condition of indicted as classes for investigation. We have used a dataset of judicial sentences (built by Além da Pena research group), identified data structure and distribution, created supervised and unsupervised machine learning models applied to the dataset and analysed the occurrence of obvious and non-obvious bias related to judicial decisions. To investigate obvious bias, classification techniques based on k-Nearest Neighbours, Naive Bayes and Decision Trees algorithms, and to non-obvious bias, the unsupervised algorithms like k-Means and Hierarchical Clustering. Our experiments have been conducted to results that do not achieve a conclusive detection of bias but suggest a trend that would confirm its occurrence in the dataset, and therefore, the need for deeper analysis and improvements of techniques. |
| publishDate |
2021 |
| dc.date.none.fl_str_mv |
2021-09-08T16:36:35Z 2021-09-08T16:36:35Z 2021-07-05 |
| dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
| dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
| format |
masterThesis |
| status_str |
publishedVersion |
| dc.identifier.uri.fl_str_mv |
SILVA, Bruno dos Santos Fernandes da. An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques. 2021. 82f. Dissertação (Mestrado em Sistemas e Computação) - Centro de Ciências Exatas e da Terra, Universidade Federal do Rio Grande do Norte, Natal, 2021. https://repositorio.ufrn.br/handle/123456789/33299 |
| identifier_str_mv |
SILVA, Bruno dos Santos Fernandes da. An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques. 2021. 82f. Dissertação (Mestrado em Sistemas e Computação) - Centro de Ciências Exatas e da Terra, Universidade Federal do Rio Grande do Norte, Natal, 2021. |
| url |
https://repositorio.ufrn.br/handle/123456789/33299 |
| dc.language.iso.fl_str_mv |
por |
| language |
por |
| dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
| eu_rights_str_mv |
openAccess |
| dc.format.none.fl_str_mv |
application/pdf |
| dc.publisher.none.fl_str_mv |
Universidade Federal do Rio Grande do Norte Brasil UFRN PROGRAMA DE PÓS-GRADUAÇÃO EM SISTEMAS E COMPUTAÇÃO |
| publisher.none.fl_str_mv |
Universidade Federal do Rio Grande do Norte Brasil UFRN PROGRAMA DE PÓS-GRADUAÇÃO EM SISTEMAS E COMPUTAÇÃO |
| dc.source.none.fl_str_mv |
reponame:Repositório Institucional da UFRN instname:Universidade Federal do Rio Grande do Norte (UFRN) instacron:UFRN |
| instname_str |
Universidade Federal do Rio Grande do Norte (UFRN) |
| instacron_str |
UFRN |
| institution |
UFRN |
| reponame_str |
Repositório Institucional da UFRN |
| collection |
Repositório Institucional da UFRN |
| repository.name.fl_str_mv |
Repositório Institucional da UFRN - Universidade Federal do Rio Grande do Norte (UFRN) |
| repository.mail.fl_str_mv |
repositorio@bczm.ufrn.br |
| _version_ |
1855758751866290176 |