An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques

Detalhes bibliográficos
Ano de defesa: 2021
Autor(a) principal: Silva, Bruno dos Santos Fernandes da
Orientador(a): Não Informado pela instituição
Banca de defesa: Não Informado pela instituição
Tipo de documento: Dissertação
Tipo de acesso: Acesso aberto
Idioma: por
Instituição de defesa: Universidade Federal do Rio Grande do Norte
Brasil
UFRN
PROGRAMA DE PÓS-GRADUAÇÃO EM SISTEMAS E COMPUTAÇÃO
Programa de Pós-Graduação: Não Informado pela instituição
Departamento: Não Informado pela instituição
País: Não Informado pela instituição
Palavras-chave em Português:
Link de acesso: https://repositorio.ufrn.br/handle/123456789/33299
Resumo: Brazilian Courts have been working in virtualisation of judicial processes since this century’s rise and, since then, a massive volume of data has been produced. Computational techniques have been an intimate ally to face the increasing amount of accumulated and new lawsuits in the system. However, although there is a misunderstanding that automation solutions are always ’intelligent’, which in most cases, it is not valid, there has never been any discussion about the use of intelligent solutions for this end as well as any issues related to automatic predicting and decision making using historical data in context. One of the problems that have already come to light is the bias in judicial data sets worldwide. This work aims to analyse a judicial dataset looking for decision bias and intelligent algorithms suitability. Taking motivation from the social impact of bias in the decision-making process, we have selected gender and social condition of indicted as classes for investigation. We have used a dataset of judicial sentences (built by Além da Pena research group), identified data structure and distribution, created supervised and unsupervised machine learning models applied to the dataset and analysed the occurrence of obvious and non-obvious bias related to judicial decisions. To investigate obvious bias, classification techniques based on k-Nearest Neighbours, Naive Bayes and Decision Trees algorithms, and to non-obvious bias, the unsupervised algorithms like k-Means and Hierarchical Clustering. Our experiments have been conducted to results that do not achieve a conclusive detection of bias but suggest a trend that would confirm its occurrence in the dataset, and therefore, the need for deeper analysis and improvements of techniques.
id UFRN_879bb67d7fd725055589e07d666a8fc1
oai_identifier_str oai:repositorio.ufrn.br:123456789/33299
network_acronym_str UFRN
network_name_str Repositório Institucional da UFRN
repository_id_str
spelling An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniquesJudicial dataMachine learningSupervised algorithmsUnsupervised algorithmsData analyticsData miningBrazilian Courts have been working in virtualisation of judicial processes since this century’s rise and, since then, a massive volume of data has been produced. Computational techniques have been an intimate ally to face the increasing amount of accumulated and new lawsuits in the system. However, although there is a misunderstanding that automation solutions are always ’intelligent’, which in most cases, it is not valid, there has never been any discussion about the use of intelligent solutions for this end as well as any issues related to automatic predicting and decision making using historical data in context. One of the problems that have already come to light is the bias in judicial data sets worldwide. This work aims to analyse a judicial dataset looking for decision bias and intelligent algorithms suitability. Taking motivation from the social impact of bias in the decision-making process, we have selected gender and social condition of indicted as classes for investigation. We have used a dataset of judicial sentences (built by Além da Pena research group), identified data structure and distribution, created supervised and unsupervised machine learning models applied to the dataset and analysed the occurrence of obvious and non-obvious bias related to judicial decisions. To investigate obvious bias, classification techniques based on k-Nearest Neighbours, Naive Bayes and Decision Trees algorithms, and to non-obvious bias, the unsupervised algorithms like k-Means and Hierarchical Clustering. Our experiments have been conducted to results that do not achieve a conclusive detection of bias but suggest a trend that would confirm its occurrence in the dataset, and therefore, the need for deeper analysis and improvements of techniques.Universidade Federal do Rio Grande do NorteBrasilUFRNPROGRAMA DE PÓS-GRADUAÇÃO EM SISTEMAS E COMPUTAÇÃOAbreu, Marjory Cristiany da Costahttp://lattes.cnpq.br/9229268386945230http://lattes.cnpq.br/2234040548103596Cavalcante, Everton Ranielly de Sousahttp://lattes.cnpq.br/5065548216266121Oliveira, Laura Emmanuella Alves dos Santos Santana dehttp://lattes.cnpq.br/8996581733787436Souza Neto, Plácido Antônio dehttp://lattes.cnpq.br/3641504724164977Silva, Bruno dos Santos Fernandes da2021-09-08T16:36:35Z2021-09-08T16:36:35Z2021-07-05info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfSILVA, Bruno dos Santos Fernandes da. An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques. 2021. 82f. Dissertação (Mestrado em Sistemas e Computação) - Centro de Ciências Exatas e da Terra, Universidade Federal do Rio Grande do Norte, Natal, 2021.https://repositorio.ufrn.br/handle/123456789/33299info:eu-repo/semantics/openAccessporreponame:Repositório Institucional da UFRNinstname:Universidade Federal do Rio Grande do Norte (UFRN)instacron:UFRN2022-05-02T16:01:57Zoai:repositorio.ufrn.br:123456789/33299Repositório InstitucionalPUBhttp://repositorio.ufrn.br/oai/repositorio@bczm.ufrn.bropendoar:2022-05-02T16:01:57Repositório Institucional da UFRN - Universidade Federal do Rio Grande do Norte (UFRN)false
dc.title.none.fl_str_mv An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques
title An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques
spellingShingle An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques
Silva, Bruno dos Santos Fernandes da
Judicial data
Machine learning
Supervised algorithms
Unsupervised algorithms
Data analytics
Data mining
title_short An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques
title_full An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques
title_fullStr An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques
title_full_unstemmed An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques
title_sort An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques
author Silva, Bruno dos Santos Fernandes da
author_facet Silva, Bruno dos Santos Fernandes da
author_role author
dc.contributor.none.fl_str_mv Abreu, Marjory Cristiany da Costa
http://lattes.cnpq.br/9229268386945230
http://lattes.cnpq.br/2234040548103596
Cavalcante, Everton Ranielly de Sousa
http://lattes.cnpq.br/5065548216266121
Oliveira, Laura Emmanuella Alves dos Santos Santana de
http://lattes.cnpq.br/8996581733787436
Souza Neto, Plácido Antônio de
http://lattes.cnpq.br/3641504724164977
dc.contributor.author.fl_str_mv Silva, Bruno dos Santos Fernandes da
dc.subject.por.fl_str_mv Judicial data
Machine learning
Supervised algorithms
Unsupervised algorithms
Data analytics
Data mining
topic Judicial data
Machine learning
Supervised algorithms
Unsupervised algorithms
Data analytics
Data mining
description Brazilian Courts have been working in virtualisation of judicial processes since this century’s rise and, since then, a massive volume of data has been produced. Computational techniques have been an intimate ally to face the increasing amount of accumulated and new lawsuits in the system. However, although there is a misunderstanding that automation solutions are always ’intelligent’, which in most cases, it is not valid, there has never been any discussion about the use of intelligent solutions for this end as well as any issues related to automatic predicting and decision making using historical data in context. One of the problems that have already come to light is the bias in judicial data sets worldwide. This work aims to analyse a judicial dataset looking for decision bias and intelligent algorithms suitability. Taking motivation from the social impact of bias in the decision-making process, we have selected gender and social condition of indicted as classes for investigation. We have used a dataset of judicial sentences (built by Além da Pena research group), identified data structure and distribution, created supervised and unsupervised machine learning models applied to the dataset and analysed the occurrence of obvious and non-obvious bias related to judicial decisions. To investigate obvious bias, classification techniques based on k-Nearest Neighbours, Naive Bayes and Decision Trees algorithms, and to non-obvious bias, the unsupervised algorithms like k-Means and Hierarchical Clustering. Our experiments have been conducted to results that do not achieve a conclusive detection of bias but suggest a trend that would confirm its occurrence in the dataset, and therefore, the need for deeper analysis and improvements of techniques.
publishDate 2021
dc.date.none.fl_str_mv 2021-09-08T16:36:35Z
2021-09-08T16:36:35Z
2021-07-05
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv SILVA, Bruno dos Santos Fernandes da. An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques. 2021. 82f. Dissertação (Mestrado em Sistemas e Computação) - Centro de Ciências Exatas e da Terra, Universidade Federal do Rio Grande do Norte, Natal, 2021.
https://repositorio.ufrn.br/handle/123456789/33299
identifier_str_mv SILVA, Bruno dos Santos Fernandes da. An investigative analysis of obvious and non-obvious Bias in judicial data using supervised and unsupervised machine learning techniques. 2021. 82f. Dissertação (Mestrado em Sistemas e Computação) - Centro de Ciências Exatas e da Terra, Universidade Federal do Rio Grande do Norte, Natal, 2021.
url https://repositorio.ufrn.br/handle/123456789/33299
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade Federal do Rio Grande do Norte
Brasil
UFRN
PROGRAMA DE PÓS-GRADUAÇÃO EM SISTEMAS E COMPUTAÇÃO
publisher.none.fl_str_mv Universidade Federal do Rio Grande do Norte
Brasil
UFRN
PROGRAMA DE PÓS-GRADUAÇÃO EM SISTEMAS E COMPUTAÇÃO
dc.source.none.fl_str_mv reponame:Repositório Institucional da UFRN
instname:Universidade Federal do Rio Grande do Norte (UFRN)
instacron:UFRN
instname_str Universidade Federal do Rio Grande do Norte (UFRN)
instacron_str UFRN
institution UFRN
reponame_str Repositório Institucional da UFRN
collection Repositório Institucional da UFRN
repository.name.fl_str_mv Repositório Institucional da UFRN - Universidade Federal do Rio Grande do Norte (UFRN)
repository.mail.fl_str_mv repositorio@bczm.ufrn.br
_version_ 1855758751866290176