Lingüística de corpus na análise do internetês
| Year of defense: | 2007 |
|---|---|
| Main author: | Gonzalez, Zeli Miranda Gutierrez |
| Advisor: | Sardinha, Antonio Paulo Berber |
| Examination committee: | |
| Document type: | Dissertação (master's thesis) |
| Access type: | Open access |
| Language: | por |
| Defending institution: | Pontifícia Universidade Católica de São Paulo |
| Graduate program: | Programa de Estudos Pós-Graduados em Linguística Aplicada e Estudos da Linguagem |
| Department: | Lingüística |
| Country: | BR |
| Keywords in Portuguese: | Grafia do internetês; Linguistica -- Processamento de dados; Internet; Linguagem e a internet |
| Keywords in English: | Internet language |
| CNPq knowledge area: | CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA |
| Access link: | https://tede2.pucsp.br/handle/handle/13928 |
Abstract: | This study was motivated by the need to understand the changes in the orthography of internet language and to identify how frequent those changes are. Its main aim was to use a Corpus Linguistics approach to identify the most frequent words in the study corpus, the frequencies of orthographic changes, and the lexicogrammatical patterns of internet language. There is a wide range of studies on internet language; however, very few of them have demonstrated empirically how frequent the changes are. This study therefore sought to fill that gap by showing the changes empirically. The main theoretical underpinning is provided by Corpus Linguistics, adopting the main notions presented by Biber (1998), Berber Sardinha (2004, 2006), and Sinclair (1991, 1996). To focus on the frequency of use of lexical items, the study drew more specifically on Berber Sardinha (2000a, 2000b, 2004) and Halliday (1991, 1992, 1993). Besides Corpus Linguistics, the project also touched on questions such as linguistic varieties, genre, register, and internet orthography from the perspective of Possenti (2006), Mollica (2007), Thurlow and Brown (2007), Crystal (2001), and Othero (2004). The corpus was collected from the blogs of young people who use the internet for communication. It contains 135,021 tokens and 15,552 types. The analysis of lexical items considered the 500 most frequent words in the study corpus. The frequencies served as the basis for describing the changes in the orthography of this linguistic variety, internet language. Among the most frequent items in the corpus, the item td, meaning all, every, or everything (tudo, todo, toda, todos, and todas in Portuguese), was selected in order to verify whether lexicogrammatical patterns contributed to the respective senses.
To sum up, this study hopes to have contributed to the study of internet language, since few studies have demonstrated empirically how these changes occur. It also presents the limitations of the research and its possible future applications. |
| id |
PUC_SP-1_c5928ed363933c3fad1d4b061c8b51fe |
|---|---|
| oai_identifier_str |
oai:repositorio.pucsp.br:handle/13928 |
| network_acronym_str |
PUC_SP-1 |
| network_name_str |
Repositório Institucional da PUC_SP |
| repository_id_str |
|
| dc.title.por.fl_str_mv |
Lingüística de corpus na análise do internetês |
| title |
Lingüística de corpus na análise do internetês |
| spellingShingle |
Lingüística de corpus na análise do internetês Gonzalez, Zeli Miranda Gutierrez Grafia do internetês Linguistica -- Processamento de dados Internet Linguagem e a internet Internet language CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA |
| title_short |
Lingüística de corpus na análise do internetês |
| title_full |
Lingüística de corpus na análise do internetês |
| title_fullStr |
Lingüística de corpus na análise do internetês |
| title_full_unstemmed |
Lingüística de corpus na análise do internetês |
| title_sort |
Lingüística de corpus na análise do internetês |
| author |
Gonzalez, Zeli Miranda Gutierrez |
| author_facet |
Gonzalez, Zeli Miranda Gutierrez |
| author_role |
author |
| dc.contributor.advisor1.fl_str_mv |
Sardinha, Antonio Paulo Berber |
| dc.contributor.authorLattes.fl_str_mv |
http://buscatextual.cnpq.br/buscatextual/visualizacv.do?id=K4564693Y0 |
| dc.contributor.author.fl_str_mv |
Gonzalez, Zeli Miranda Gutierrez |
| contributor_str_mv |
Sardinha, Antonio Paulo Berber |
| dc.subject.por.fl_str_mv |
Grafia do internetês Linguistica -- Processamento de dados Internet Linguagem e a internet |
| topic |
Grafia do internetês Linguistica -- Processamento de dados Internet Linguagem e a internet Internet language CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA |
| dc.subject.eng.fl_str_mv |
Internet language |
| dc.subject.cnpq.fl_str_mv |
CNPQ::LINGUISTICA, LETRAS E ARTES::LINGUISTICA::LINGUISTICA APLICADA |
| description |
This study was motivated by the need to understand the changes in the orthography of internet language and to identify how frequent those changes are. Its main aim was to use a Corpus Linguistics approach to identify the most frequent words in the study corpus, the frequencies of orthographic changes, and the lexicogrammatical patterns of internet language. There is a wide range of studies on internet language; however, very few of them have demonstrated empirically how frequent the changes are. This study therefore sought to fill that gap by showing the changes empirically. The main theoretical underpinning is provided by Corpus Linguistics, adopting the main notions presented by Biber (1998), Berber Sardinha (2004, 2006), and Sinclair (1991, 1996). To focus on the frequency of use of lexical items, the study drew more specifically on Berber Sardinha (2000a, 2000b, 2004) and Halliday (1991, 1992, 1993). Besides Corpus Linguistics, the project also touched on questions such as linguistic varieties, genre, register, and internet orthography from the perspective of Possenti (2006), Mollica (2007), Thurlow and Brown (2007), Crystal (2001), and Othero (2004). The corpus was collected from the blogs of young people who use the internet for communication. It contains 135,021 tokens and 15,552 types. The analysis of lexical items considered the 500 most frequent words in the study corpus. The frequencies served as the basis for describing the changes in the orthography of this linguistic variety, internet language. Among the most frequent items in the corpus, the item td, meaning all, every, or everything (tudo, todo, toda, todos, and todas in Portuguese), was selected in order to verify whether lexicogrammatical patterns contributed to the respective senses.
To sum up, this study hopes to have contributed to the study of internet language, since few studies have demonstrated empirically how these changes occur. It also presents the limitations of the research and its possible future applications. |
| publishDate |
2007 |
| dc.date.issued.fl_str_mv |
2007-11-05 |
| dc.date.available.fl_str_mv |
2008-01-11 |
| dc.date.accessioned.fl_str_mv |
2016-04-28T18:23:36Z |
| dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
| dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
| format |
masterThesis |
| status_str |
publishedVersion |
| dc.identifier.citation.fl_str_mv |
Gonzalez, Zeli Miranda Gutierrez. Lingüística de corpus na análise do internetês. 2007. 123 f. Dissertação (Mestrado em Lingüística) - Pontifícia Universidade Católica de São Paulo, São Paulo, 2007. |
| dc.identifier.uri.fl_str_mv |
https://tede2.pucsp.br/handle/handle/13928 |
| identifier_str_mv |
Gonzalez, Zeli Miranda Gutierrez. Lingüística de corpus na análise do internetês. 2007. 123 f. Dissertação (Mestrado em Lingüística) - Pontifícia Universidade Católica de São Paulo, São Paulo, 2007. |
| url |
https://tede2.pucsp.br/handle/handle/13928 |
| dc.language.iso.fl_str_mv |
por |
| language |
por |
| dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
| eu_rights_str_mv |
openAccess |
| dc.format.none.fl_str_mv |
application/pdf |
| dc.publisher.none.fl_str_mv |
Pontifícia Universidade Católica de São Paulo |
| dc.publisher.program.fl_str_mv |
Programa de Estudos Pós-Graduados em Linguística Aplicada e Estudos da Linguagem |
| dc.publisher.initials.fl_str_mv |
PUC-SP |
| dc.publisher.country.fl_str_mv |
BR |
| dc.publisher.department.fl_str_mv |
Lingüística |
| publisher.none.fl_str_mv |
Pontifícia Universidade Católica de São Paulo |
| dc.source.none.fl_str_mv |
reponame:Repositório Institucional da PUC_SP instname:Pontifícia Universidade Católica de São Paulo (PUC-SP) instacron:PUC_SP |
| instname_str |
Pontifícia Universidade Católica de São Paulo (PUC-SP) |
| instacron_str |
PUC_SP |
| institution |
PUC_SP |
| reponame_str |
Repositório Institucional da PUC_SP |
| collection |
Repositório Institucional da PUC_SP |
| bitstream.url.fl_str_mv |
https://repositorio.pucsp.br/xmlui/bitstream/handle/13928/3/ZELI%20MIRANDA%20GUTIERREZ%20GONZALEZ.pdf.txt https://repositorio.pucsp.br/xmlui/bitstream/handle/13928/1/ZELI%20MIRANDA%20GUTIERREZ%20GONZALEZ.pdf https://repositorio.pucsp.br/xmlui/bitstream/handle/13928/2/ZELI%20MIRANDA%20GUTIERREZ%20GONZALEZ.pdf.jpg |
| bitstream.checksum.fl_str_mv |
0c8579dd2e831b630c132cb3f1650d33 3a704528461b06f74cb2b2e71d8fdcf1 731c58f7b4666536216d7fb13b84ce3d |
| bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 MD5 |
| repository.name.fl_str_mv |
Repositório Institucional da PUC_SP - Pontifícia Universidade Católica de São Paulo (PUC-SP) |
| repository.mail.fl_str_mv |
bngkatende@pucsp.br||rapassi@pucsp.br |
| _version_ |
1840370399130419200 |