Lexical bundles across sections of applied linguistics research articles

Detalhes bibliográficos
Ano de defesa: 2018
Autor(a) principal: Luciana Dias de Macedo
Orientador(a): Não Informado pela instituição
Banca de defesa: Não Informado pela instituição
Tipo de documento: Dissertação
Tipo de acesso: Acesso aberto
Idioma: eng
Instituição de defesa: Universidade Federal de Minas Gerais
Programa de Pós-Graduação: Não Informado pela instituição
Departamento: Não Informado pela instituição
País: Não Informado pela instituição
Palavras-chave em Português:
Link de acesso: https://hdl.handle.net/1843/LETR-AXAJP5
Resumo: The increasing use of computerheld text corpora for the analysis of lexicogrammatical patterns, once unavailable to observers, has allowed researchers to get a better and more precise understanding of language. Corpus Linguistics has also shown to be capable of revealing valuable and detailed features of expressions that play a crucial role in discoursewith linguistic investigations that comprehend not only single words but also idiomatic or nonidiomatic expressions in a given context. The aim of the current research is to investigate 47 word lexical bundles in sections of Applied Linguistics Research Articles, published in English. Lexical bundles, fundamentally defined by frequency (BIBER et al., 1999; CORTES, 2013), are generated with no predefined linguistic categories. Despite the frequency with which they occur, lexical bundles are not idiomatic in meaning and not perceptually salient (BIBER; BARBIERI, 2007, p. 269). Their use, however, has been thoroughly investigateddue to the role those devices play as building blocks of discourse. In order to offer details of intext linguistic variation, a corpus was compiled from 180 articles and split into four subcorpora: Introduction, Methods, Results, and Discussion (IMRD), with more than 1 million words in total. The lexical bundles were individually classified into structural andfunctional categories, including their subtypes, according to previous and established taxonomies (BIBER et al., 1999, 2004; CORTES, 2013; SIMPSONVLACH; ELLIS, 2010). Two different statistical treatments, confidence intervals and the nullhypothesis significance ztest, were adopted as complementary in order to check whether differences acrosssubcorpora were significant. Results show that sections of Applied Linguistics research articles should be treated as separate texts for they display strong distinctions, and grammatical structures may play singular functional roles. The main distinctions between sections entail frequency of devices and a preference for particular lexicogrammatical elements in relation to the pragmatic nature of the section, e.g. fragments with exclusive roles can be related to passive and nonpassive voice forms containing specific verbs. Additionally, the proportion of some structures and functional categories that emerged in the current research challenge generalizations made in previous studies about academic register. Finally, we argue for more research on subregisters with representative and specialized corpora so as to give more insight into academic writing of research articles sections.
id UFMG_ec1ed273ab4dbc8fb4776d0d62357cc1
oai_identifier_str oai:repositorio.ufmg.br:1843/LETR-AXAJP5
network_acronym_str UFMG
network_name_str Repositório Institucional da UFMG
repository_id_str
spelling 2019-08-14T00:53:58Z2025-09-09T00:52:49Z2019-08-14T00:53:58Z2018-02-23https://hdl.handle.net/1843/LETR-AXAJP5The increasing use of computerheld text corpora for the analysis of lexicogrammatical patterns, once unavailable to observers, has allowed researchers to get a better and more precise understanding of language. Corpus Linguistics has also shown to be capable of revealing valuable and detailed features of expressions that play a crucial role in discoursewith linguistic investigations that comprehend not only single words but also idiomatic or nonidiomatic expressions in a given context. The aim of the current research is to investigate 47 word lexical bundles in sections of Applied Linguistics Research Articles, published in English. Lexical bundles, fundamentally defined by frequency (BIBER et al., 1999; CORTES, 2013), are generated with no predefined linguistic categories. Despite the frequency with which they occur, lexical bundles are not idiomatic in meaning and not perceptually salient (BIBER; BARBIERI, 2007, p. 269). Their use, however, has been thoroughly investigateddue to the role those devices play as building blocks of discourse. In order to offer details of intext linguistic variation, a corpus was compiled from 180 articles and split into four subcorpora: Introduction, Methods, Results, and Discussion (IMRD), with more than 1 million words in total. The lexical bundles were individually classified into structural andfunctional categories, including their subtypes, according to previous and established taxonomies (BIBER et al., 1999, 2004; CORTES, 2013; SIMPSONVLACH; ELLIS, 2010). Two different statistical treatments, confidence intervals and the nullhypothesis significance ztest, were adopted as complementary in order to check whether differences acrosssubcorpora were significant. Results show that sections of Applied Linguistics research articles should be treated as separate texts for they display strong distinctions, and grammatical structures may play singular functional roles. The main distinctions between sections entail frequency of devices and a preference for particular lexicogrammatical elements in relation to the pragmatic nature of the section, e.g. fragments with exclusive roles can be related to passive and nonpassive voice forms containing specific verbs. Additionally, the proportion of some structures and functional categories that emerged in the current research challenge generalizations made in previous studies about academic register. Finally, we argue for more research on subregisters with representative and specialized corpora so as to give more insight into academic writing of research articles sections.Universidade Federal de Minas Geraislinguística de corpuspacotes lexicaisescrita acadêmicaseções de artigos acadêmicosLíngua inglesa Estudo e ensino Falantes estrangeirosLíngua inglesa LexicologiaRedação acadêmicaAquisição da segunda linguagemLinguística de corpusLexical bundles across sections of applied linguistics research articlesinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisLuciana Dias de Macedoinfo:eu-repo/semantics/openAccessengreponame:Repositório Institucional da UFMGinstname:Universidade Federal de Minas Gerais (UFMG)instacron:UFMGDeise Prina DutraPatricia Pereira BertoliLeonardo Pereira NunesO crescente uso de corpora textuais em computadores para a análise de padrões léxicogramaticais, antes indisponível para observadores, tem permitido aos pesquisadores uma compreensão melhor e mais precisa sobre aspectos linguísticos. A Linguística de Corpus temdemonstrado capacidade de revelar características valiosas e detalhadas de expressões que desempenham papel crucial no discurso, por meio de pesquisas linguísticas que abrangemdesde palavras soltas a expressões idiomáticas e nãoidiomáticasem um dado contexto. O objetivo da atual pesquisa é investigar pacotes lexicais com sequências de 47 palavras em seções de artigos acadêmicos pertencentes à Linguística Aplicada, publicados em inglês, e entender seu papel pragmático dentro de cada seção. Pacotes lexicais, fundamentalmente definidos pela frequência (BIBER et al., 1999; CORTES, 2013) são gerados sem quaisquer categorias linguísticas prédefinidas. A despeito de sua frequência, os pacotes lexicais não são considerados idiomáticos em definição ou perceptualmente salientes (BIBER; BARBIERI, 2007, p. 269). Seu uso, no entanto, tem sido amplamente investigado devido ao papel queesses dispositivos desempenham como blocos construtores de discurso. Com o intuito de oferecer mais detalhes da variação linguística interna de artigos, um corpus foi compilado de 180 artigos e dividido em quatro subcorpora: Introdução, Métodos, Resultados e Discussão (IMRD), com mais de 1 milhão de palavras no total. Os pacotes lexicais foram classificados individualmente em categorias estruturais e funcionais, incluindo seus subtipos, de acordo com taxonomias existentes e estabelecidas na literatura (BIBER et al., 1999, 2004; CORTES, 2013; SIMPSONVLACH; ELLIS, 2010). Dois tratamentos estatísticos, intervalos de confiança e zteste de hipótese nula, foram empregados como complementares para checar se as diferenças entre os subcorpora são relevantes. Os resultados mostram que seções de artigos da Linguística Aplicada deveriam ser abordados como textos distintos dadas as grandes diferenças identificadas, assim como estruturas gramaticais podem desempenhar papéis funcionais singulares. As principais distinções entre as seções envolvem a frequência de pacotes lexicais e uma preferência por elementos lexicogramaticais particulares em relação à natureza pragmática da seção, por exemplo, fragmentos com papéis exclusivos podem serrelacionados às formas em voz passiva ou nãopassiva contendo verbos específicos. Além disso, a proporção de algumas estruturas e categorias funcionais que emergiram na atual pesquisa desafiam generalizações feitas em estudos anteriores sobre o registro acadêmico. Finalmente, sugerimos mais pesquisas de subregistroscom corpora representativos e especializados para que haja mais esclarecimentos em relação à escrita acadêmica de seções de artigos científicos.UFMGORIGINALdissertacao_mestrado_luciana_dias_de_macedo_2018__1_.pdfapplication/pdf2208773https://repositorio.ufmg.br//bitstreams/e3904d9e-38c8-4f49-9907-24f0063d11d6/download6d5a2161cf3bd895bc7195ff7894fdefMD51trueAnonymousREADTEXTdissertacao_mestrado_luciana_dias_de_macedo_2018__1_.pdf.txttext/plain197523https://repositorio.ufmg.br//bitstreams/78439e91-a29f-4678-99f0-f17ccbaf7ec2/download4fefb23c9900330e989c2c7486acac00MD52falseAnonymousREAD1843/LETR-AXAJP52025-09-08 21:52:49.066open.accessoai:repositorio.ufmg.br:1843/LETR-AXAJP5https://repositorio.ufmg.br/Repositório InstitucionalPUBhttps://repositorio.ufmg.br/oairepositorio@ufmg.bropendoar:2025-09-09T00:52:49Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)false
dc.title.none.fl_str_mv Lexical bundles across sections of applied linguistics research articles
title Lexical bundles across sections of applied linguistics research articles
spellingShingle Lexical bundles across sections of applied linguistics research articles
Luciana Dias de Macedo
Língua inglesa Estudo e ensino Falantes estrangeiros
Língua inglesa Lexicologia
Redação acadêmica
Aquisição da segunda linguagem
Linguística de corpus
linguística de corpus
pacotes lexicais
escrita acadêmica
seções de artigos acadêmicos
title_short Lexical bundles across sections of applied linguistics research articles
title_full Lexical bundles across sections of applied linguistics research articles
title_fullStr Lexical bundles across sections of applied linguistics research articles
title_full_unstemmed Lexical bundles across sections of applied linguistics research articles
title_sort Lexical bundles across sections of applied linguistics research articles
author Luciana Dias de Macedo
author_facet Luciana Dias de Macedo
author_role author
dc.contributor.author.fl_str_mv Luciana Dias de Macedo
dc.subject.por.fl_str_mv Língua inglesa Estudo e ensino Falantes estrangeiros
Língua inglesa Lexicologia
Redação acadêmica
Aquisição da segunda linguagem
Linguística de corpus
topic Língua inglesa Estudo e ensino Falantes estrangeiros
Língua inglesa Lexicologia
Redação acadêmica
Aquisição da segunda linguagem
Linguística de corpus
linguística de corpus
pacotes lexicais
escrita acadêmica
seções de artigos acadêmicos
dc.subject.other.none.fl_str_mv linguística de corpus
pacotes lexicais
escrita acadêmica
seções de artigos acadêmicos
description The increasing use of computerheld text corpora for the analysis of lexicogrammatical patterns, once unavailable to observers, has allowed researchers to get a better and more precise understanding of language. Corpus Linguistics has also shown to be capable of revealing valuable and detailed features of expressions that play a crucial role in discoursewith linguistic investigations that comprehend not only single words but also idiomatic or nonidiomatic expressions in a given context. The aim of the current research is to investigate 47 word lexical bundles in sections of Applied Linguistics Research Articles, published in English. Lexical bundles, fundamentally defined by frequency (BIBER et al., 1999; CORTES, 2013), are generated with no predefined linguistic categories. Despite the frequency with which they occur, lexical bundles are not idiomatic in meaning and not perceptually salient (BIBER; BARBIERI, 2007, p. 269). Their use, however, has been thoroughly investigateddue to the role those devices play as building blocks of discourse. In order to offer details of intext linguistic variation, a corpus was compiled from 180 articles and split into four subcorpora: Introduction, Methods, Results, and Discussion (IMRD), with more than 1 million words in total. The lexical bundles were individually classified into structural andfunctional categories, including their subtypes, according to previous and established taxonomies (BIBER et al., 1999, 2004; CORTES, 2013; SIMPSONVLACH; ELLIS, 2010). Two different statistical treatments, confidence intervals and the nullhypothesis significance ztest, were adopted as complementary in order to check whether differences acrosssubcorpora were significant. Results show that sections of Applied Linguistics research articles should be treated as separate texts for they display strong distinctions, and grammatical structures may play singular functional roles. The main distinctions between sections entail frequency of devices and a preference for particular lexicogrammatical elements in relation to the pragmatic nature of the section, e.g. fragments with exclusive roles can be related to passive and nonpassive voice forms containing specific verbs. Additionally, the proportion of some structures and functional categories that emerged in the current research challenge generalizations made in previous studies about academic register. Finally, we argue for more research on subregisters with representative and specialized corpora so as to give more insight into academic writing of research articles sections.
publishDate 2018
dc.date.issued.fl_str_mv 2018-02-23
dc.date.accessioned.fl_str_mv 2019-08-14T00:53:58Z
2025-09-09T00:52:49Z
dc.date.available.fl_str_mv 2019-08-14T00:53:58Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://hdl.handle.net/1843/LETR-AXAJP5
url https://hdl.handle.net/1843/LETR-AXAJP5
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.publisher.none.fl_str_mv Universidade Federal de Minas Gerais
publisher.none.fl_str_mv Universidade Federal de Minas Gerais
dc.source.none.fl_str_mv reponame:Repositório Institucional da UFMG
instname:Universidade Federal de Minas Gerais (UFMG)
instacron:UFMG
instname_str Universidade Federal de Minas Gerais (UFMG)
instacron_str UFMG
institution UFMG
reponame_str Repositório Institucional da UFMG
collection Repositório Institucional da UFMG
bitstream.url.fl_str_mv https://repositorio.ufmg.br//bitstreams/e3904d9e-38c8-4f49-9907-24f0063d11d6/download
https://repositorio.ufmg.br//bitstreams/78439e91-a29f-4678-99f0-f17ccbaf7ec2/download
bitstream.checksum.fl_str_mv 6d5a2161cf3bd895bc7195ff7894fdef
4fefb23c9900330e989c2c7486acac00
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
repository.name.fl_str_mv Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)
repository.mail.fl_str_mv repositorio@ufmg.br
_version_ 1862105759150505984