Lingüística de corpus na análise do internetês

Detalhes bibliográficos
Ano de defesa: 2007
Autor(a) principal: Gonzalez, Zeli Miranda Gutierrez lattes
Orientador(a): Sardinha, Antonio Paulo Berber
Banca de defesa: Não Informado pela instituição
Tipo de documento: Dissertação
Tipo de acesso: Acesso aberto
Idioma: por
Instituição de defesa: Pontifícia Universidade Católica de São Paulo
Programa de Pós-Graduação: Programa de Estudos Pós-Graduados em Linguística Aplicada e Estudos da Linguagem
Departamento: Lingüística
País: BR
Palavras-chave em Português:
Palavras-chave em Inglês:
Área do conhecimento CNPq:
Link de acesso: https://tede2.pucsp.br/handle/handle/13928
Resumo: The study presented was motivated by the needs of comprehend the changes in the ortography of the Internet language, such as identify those changes frequency. The main aim of this study was to focus on the usage of a Corpus Linguistics approach for identification of frequent words most used in the studies corpus, such as frequences of changes in the ortography and the lexican gramathical standards of the internet language. There is a great range of studies on the internet language; however, very few of them has demontrated empirically how frequent changes are. Therefore, this study has tried to fill this gap by being able to show empirically the changes. The main theoretical underpinning for the research is provided by Corpus Linguistics, assuming the main notions presented by Biber (1998), Berber Sardinha (2004, 2006), Sinclair (1991,1996). For focusing the use frequency of lexican items it was considered, more specificly, the studies of Berber Sardinha (2000a, 2000b, 2004), Halliday (1991, 1992, 1993). Besides the Corpus Linguistics, the project also mentioned in questions such as: linguisctics diversity, genre, registry and internet language ortography along the perspective of Possenti (2006), Mollica (2007), Thurlow and Brown (2007), Crystal (2001), Othero, (2004). The corpus employed in the study was collected of young people s blogs that use internet for comunication. This corpus contains 135.021 tokes and 15.552 types. For the development of this research and of the analysis of the lexican items it was considered all the 500 most used words in the corpus studies. The frequences were used as base for decription of changes happened in the variant linguistics ortography the internet language. Among the most frequent items in the corpus was selected the td item with the sense of all, every, everything ( tudo, todo, toda, todas e todos in portuguese), with the objective of verify the standards lexican-gramathical, contributed for the respective senses. To sum up, this study hopes it has contributed to the study of the internet language, since there are few studies that have demosntrated empirically how these changes occur. This work also presentes the research limitations and its possible applications in the future
id PUC_SP-1_c5928ed363933c3fad1d4b061c8b51fe
oai_identifier_str oai:repositorio.pucsp.br:handle/13928
network_acronym_str PUC_SP-1
network_name_str Repositório Institucional da PUC_SP
repository_id_str
spelling Sardinha, Antonio Paulo Berberhttp://buscatextual.cnpq.br/buscatextual/visualizacv.do?id=K4564693Y0Gonzalez, Zeli Miranda Gutierrez2016-04-28T18:23:36Z2008-01-112007-11-05Gonzalez, Zeli Miranda Gutierrez. Lingüística de corpus na análise do internetês. 2007. 123 f. Dissertação (Mestrado em Lingüística) - Pontifícia Universidade Católica de São Paulo, São Paulo, 2007.https://tede2.pucsp.br/handle/handle/13928The study presented was motivated by the needs of comprehend the changes in the ortography of the Internet language, such as identify those changes frequency. The main aim of this study was to focus on the usage of a Corpus Linguistics approach for identification of frequent words most used in the studies corpus, such as frequences of changes in the ortography and the lexican gramathical standards of the internet language. There is a great range of studies on the internet language; however, very few of them has demontrated empirically how frequent changes are. Therefore, this study has tried to fill this gap by being able to show empirically the changes. The main theoretical underpinning for the research is provided by Corpus Linguistics, assuming the main notions presented by Biber (1998), Berber Sardinha (2004, 2006), Sinclair (1991,1996). For focusing the use frequency of lexican items it was considered, more specificly, the studies of Berber Sardinha (2000a, 2000b, 2004), Halliday (1991, 1992, 1993). Besides the Corpus Linguistics, the project also mentioned in questions such as: linguisctics diversity, genre, registry and internet language ortography along the perspective of Possenti (2006), Mollica (2007), Thurlow and Brown (2007), Crystal (2001), Othero, (2004). The corpus employed in the study was collected of young people s blogs that use internet for comunication. This corpus contains 135.021 tokes and 15.552 types. For the development of this research and of the analysis of the lexican items it was considered all the 500 most used words in the corpus studies. The frequences were used as base for decription of changes happened in the variant linguistics ortography the internet language. Among the most frequent items in the corpus was selected the td item with the sense of all, every, everything ( tudo, todo, toda, todas e todos in portuguese), with the objective of verify the standards lexican-gramathical, contributed for the respective senses. To sum up, this study hopes it has contributed to the study of the internet language, since there are few studies that have demosntrated empirically how these changes occur. This work also presentes the research limitations and its possible applications in the futureO trabalho que ora se apresenta foi motivado pela necessidade de compreender as modificações na grafia do internetês, bem como identificar a freqüência dessas modificações. Esse trabalho teve como objetivo principal utilizar uma abordagem baseada em Lingüística de Corpus na identificação das palavras mais freqüentes do internetês, das freqüências de modificações na grafia e os padrões léxico gramaticais. Há vários trabalhos que lidam com a questão do internetês; entretanto, nenhum deles demonstrou empiricamente quão freqüente as modificações ocorrem. Sendo assim, esse trabalho buscou preencher essa lacuna, sendo, portanto, capaz de demonstrar empiricamente a extensão dessas modificações. Para tanto, encontrou suporte teórico na Lingüística de Corpus, adotando as principais noções apresentadas por Biber (1998), Berber Sardinha (2004, 2006), Sinclair (1991,1996). Por enfocar as freqüências de uso de itens lexicais consideraram-se, mais especificamente, os trabalhos de Berber Sardinha (2000a, 2000b, 2004), Halliday (1991, 1992, 1993). Além da Lingüística de Corpus, o projeto também tocou em questões como: variedades lingüísticas, gênero, registro e grafia internáutica sob a perspectiva de Possenti (2006), Mollica (2007), Thurlow and Brown (2007), Crystal (2001), Othero (2004). O corpus empregado na pesquisa foi coletado em blogs de jovens que utilizam a internet para comunicação. O corpus contém 135.021palavras e 15.552 formas. Para as análises dos itens lexicais consideraram-se as 500 palavras mais freqüentes do corpus de estudo. As freqüências detectadas serviram como base para a descrição das modificações ocorridas na grafia da variante lingüística o internetês. Entre os itens mais freqüentes do corpus, selecionou-se o item td com sentido de tudo, toda, todo, todos, todas, com a finalidade de verificar se os padrões léxicogramaticais contribuíam para os respectivos sentidos. Por conseguinte, a pesquisa pretende ter contribuído para o estudo do internetês, uma vez que há poucos trabalhos que demonstrem, de maneira empírica, essas modificações. O trabalho ainda apresenta as limitações da pesquisa e aponta sugestões para futuros estudosapplication/pdfhttp://tede2.pucsp.br/tede/retrieve/30762/ZELI%20MIRANDA%20GUTIERREZ%20GONZALEZ.pdf.jpgporPontifícia Universidade Católica de São PauloPrograma de Estudos Pós-Graduados em Linguística Aplicada e Estudos da LinguagemPUC-SPBRLingüísticaGrafia do internetêsLinguistica -- Processamento de dadosInternetLinguagem e a internetInternet languageCNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADALingüística de corpus na análise do internetêsinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisinfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da PUC_SPinstname:Pontifícia Universidade Católica de São Paulo (PUC-SP)instacron:PUC_SPTEXTZELI MIRANDA GUTIERREZ GONZALEZ.pdf.txtZELI MIRANDA GUTIERREZ GONZALEZ.pdf.txtExtracted texttext/plain201372https://repositorio.pucsp.br/xmlui/bitstream/handle/13928/3/ZELI%20MIRANDA%20GUTIERREZ%20GONZALEZ.pdf.txt0c8579dd2e831b630c132cb3f1650d33MD53ORIGINALZELI MIRANDA GUTIERREZ GONZALEZ.pdfapplication/pdf1268917https://repositorio.pucsp.br/xmlui/bitstream/handle/13928/1/ZELI%20MIRANDA%20GUTIERREZ%20GONZALEZ.pdf3a704528461b06f74cb2b2e71d8fdcf1MD51THUMBNAILZELI MIRANDA GUTIERREZ GONZALEZ.pdf.jpgZELI MIRANDA GUTIERREZ GONZALEZ.pdf.jpgGenerated Thumbnailimage/jpeg3270https://repositorio.pucsp.br/xmlui/bitstream/handle/13928/2/ZELI%20MIRANDA%20GUTIERREZ%20GONZALEZ.pdf.jpg731c58f7b4666536216d7fb13b84ce3dMD52handle/139282022-04-27 22:58:10.115oai:repositorio.pucsp.br:handle/13928Repositório Institucionalhttps://sapientia.pucsp.br/https://sapientia.pucsp.br/oai/requestbngkatende@pucsp.br||rapassi@pucsp.bropendoar:2022-04-28T01:58:10Repositório Institucional da PUC_SP - Pontifícia Universidade Católica de São Paulo (PUC-SP)false
dc.title.por.fl_str_mv Lingüística de corpus na análise do internetês
title Lingüística de corpus na análise do internetês
spellingShingle Lingüística de corpus na análise do internetês
Gonzalez, Zeli Miranda Gutierrez
Grafia do internetês
Linguistica -- Processamento de dados
Internet
Linguagem e a internet
Internet language
CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA
title_short Lingüística de corpus na análise do internetês
title_full Lingüística de corpus na análise do internetês
title_fullStr Lingüística de corpus na análise do internetês
title_full_unstemmed Lingüística de corpus na análise do internetês
title_sort Lingüística de corpus na análise do internetês
author Gonzalez, Zeli Miranda Gutierrez
author_facet Gonzalez, Zeli Miranda Gutierrez
author_role author
dc.contributor.advisor1.fl_str_mv Sardinha, Antonio Paulo Berber
dc.contributor.authorLattes.fl_str_mv http://buscatextual.cnpq.br/buscatextual/visualizacv.do?id=K4564693Y0
dc.contributor.author.fl_str_mv Gonzalez, Zeli Miranda Gutierrez
contributor_str_mv Sardinha, Antonio Paulo Berber
dc.subject.por.fl_str_mv Grafia do internetês
Linguistica -- Processamento de dados
Internet
Linguagem e a internet
topic Grafia do internetês
Linguistica -- Processamento de dados
Internet
Linguagem e a internet
Internet language
CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA
dc.subject.eng.fl_str_mv Internet language
dc.subject.cnpq.fl_str_mv CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA
description The study presented was motivated by the needs of comprehend the changes in the ortography of the Internet language, such as identify those changes frequency. The main aim of this study was to focus on the usage of a Corpus Linguistics approach for identification of frequent words most used in the studies corpus, such as frequences of changes in the ortography and the lexican gramathical standards of the internet language. There is a great range of studies on the internet language; however, very few of them has demontrated empirically how frequent changes are. Therefore, this study has tried to fill this gap by being able to show empirically the changes. The main theoretical underpinning for the research is provided by Corpus Linguistics, assuming the main notions presented by Biber (1998), Berber Sardinha (2004, 2006), Sinclair (1991,1996). For focusing the use frequency of lexican items it was considered, more specificly, the studies of Berber Sardinha (2000a, 2000b, 2004), Halliday (1991, 1992, 1993). Besides the Corpus Linguistics, the project also mentioned in questions such as: linguisctics diversity, genre, registry and internet language ortography along the perspective of Possenti (2006), Mollica (2007), Thurlow and Brown (2007), Crystal (2001), Othero, (2004). The corpus employed in the study was collected of young people s blogs that use internet for comunication. This corpus contains 135.021 tokes and 15.552 types. For the development of this research and of the analysis of the lexican items it was considered all the 500 most used words in the corpus studies. The frequences were used as base for decription of changes happened in the variant linguistics ortography the internet language. Among the most frequent items in the corpus was selected the td item with the sense of all, every, everything ( tudo, todo, toda, todas e todos in portuguese), with the objective of verify the standards lexican-gramathical, contributed for the respective senses. To sum up, this study hopes it has contributed to the study of the internet language, since there are few studies that have demosntrated empirically how these changes occur. This work also presentes the research limitations and its possible applications in the future
publishDate 2007
dc.date.issued.fl_str_mv 2007-11-05
dc.date.available.fl_str_mv 2008-01-11
dc.date.accessioned.fl_str_mv 2016-04-28T18:23:36Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.citation.fl_str_mv Gonzalez, Zeli Miranda Gutierrez. Lingüística de corpus na análise do internetês. 2007. 123 f. Dissertação (Mestrado em Lingüística) - Pontifícia Universidade Católica de São Paulo, São Paulo, 2007.
dc.identifier.uri.fl_str_mv https://tede2.pucsp.br/handle/handle/13928
identifier_str_mv Gonzalez, Zeli Miranda Gutierrez. Lingüística de corpus na análise do internetês. 2007. 123 f. Dissertação (Mestrado em Lingüística) - Pontifícia Universidade Católica de São Paulo, São Paulo, 2007.
url https://tede2.pucsp.br/handle/handle/13928
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Pontifícia Universidade Católica de São Paulo
dc.publisher.program.fl_str_mv Programa de Estudos Pós-Graduados em Linguística Aplicada e Estudos da Linguagem
dc.publisher.initials.fl_str_mv PUC-SP
dc.publisher.country.fl_str_mv BR
dc.publisher.department.fl_str_mv Lingüística
publisher.none.fl_str_mv Pontifícia Universidade Católica de São Paulo
dc.source.none.fl_str_mv reponame:Repositório Institucional da PUC_SP
instname:Pontifícia Universidade Católica de São Paulo (PUC-SP)
instacron:PUC_SP
instname_str Pontifícia Universidade Católica de São Paulo (PUC-SP)
instacron_str PUC_SP
institution PUC_SP
reponame_str Repositório Institucional da PUC_SP
collection Repositório Institucional da PUC_SP
bitstream.url.fl_str_mv https://repositorio.pucsp.br/xmlui/bitstream/handle/13928/3/ZELI%20MIRANDA%20GUTIERREZ%20GONZALEZ.pdf.txt
https://repositorio.pucsp.br/xmlui/bitstream/handle/13928/1/ZELI%20MIRANDA%20GUTIERREZ%20GONZALEZ.pdf
https://repositorio.pucsp.br/xmlui/bitstream/handle/13928/2/ZELI%20MIRANDA%20GUTIERREZ%20GONZALEZ.pdf.jpg
bitstream.checksum.fl_str_mv 0c8579dd2e831b630c132cb3f1650d33
3a704528461b06f74cb2b2e71d8fdcf1
731c58f7b4666536216d7fb13b84ce3d
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
repository.name.fl_str_mv Repositório Institucional da PUC_SP - Pontifícia Universidade Católica de São Paulo (PUC-SP)
repository.mail.fl_str_mv bngkatende@pucsp.br||rapassi@pucsp.br
_version_ 1840370399130419200