Magni calculi ratiocinatoris: a theoretical universal logical and axiological calculation framework for generative large language models
| Ano de defesa: | 2023 |
|---|---|
| Autor(a) principal: | |
| Orientador(a): | |
| Banca de defesa: | |
| Tipo de documento: | Tese |
| Tipo de acesso: | Acesso aberto |
| Idioma: | por |
| Instituição de defesa: |
Universidade Federal do Espírito Santo
BR Doutorado em Ciência da Computação Centro Tecnológico UFES Programa de Pós-Graduação em Informática |
| Programa de Pós-Graduação: |
Não Informado pela instituição
|
| Departamento: |
Não Informado pela instituição
|
| País: |
Não Informado pela instituição
|
| Palavras-chave em Português: | |
| Link de acesso: | http://repositorio.ufes.br/handle/10/12435 |
Resumo: | This dissertation explores the idea of developing a theoretical universal framework as a calculation of thought, specifically focusing on integrating value alignment in logical reasoning upon generative large language models. The research delves into the historical search for a "calculus ratiocinator," a universal logical calculus, underlying the "Characteristica Universalis," a universal language, and positions large language models as a contemporary manifestation of the latter. The study includes a conceptual review of the foundational models’ learning strategies, including pre-training transformer-based LLMs, transfer learning, and in-context learning methodologies such as zero-shot and few-shot learning, chain-of-thoughts, tree-of-thoughts, self-consistency, and automatic prompt engineer. Following the theoretical framework, the work comprises a thorough literature review, examining the logical reasoning abilities of large language models (LLMs) and evaluating their strengths and weaknesses, scalability, and quality of generated texts. The research then focuses on fine-tuning a pre-trained LLM for value-aligned logical reasoning. It explores methods such as Logical and Axiological Weights (LAW), a rank-based theory for weight adaptation that introduces value-aligned logical reasoning weights in subnetworks of pre-trained models, utilizing the parameter-efficient fine-tuning LoRA/QLoRA. The results from these methods are presented and discussed in detail. Additionally, the work explores prompting methodologies for value-aligned logical reasoning. This includes techniques such as Logic-of-Reasoning (LoR), an in-context learning approach that incorporates the decomposition of input for logical subproofs, tree-based and forest-based sequent calculi, and natural deduction prompts. It also includes aligning the generated text with human value through the Value-Aligned Logic-of-Reasoning (VALoR) prompt. The dissertation concludes by presenting experimental results that validate the proposed methodologies. This research contributes to the field by offering a holistic approach to improve logical reasoning aligned with human values on generative large language models. It provides a foundation for future work in developing more human-aligned AI systems that can reason logically while upholding human values. |
| id |
UFES_6eedcc2d3186b3a1ec6af56ecbfcc5c7 |
|---|---|
| oai_identifier_str |
oai:repositorio.ufes.br:10/12435 |
| network_acronym_str |
UFES |
| network_name_str |
Repositório Institucional da Universidade Federal do Espírito Santo (riUfes) |
| repository_id_str |
|
| spelling |
Magni calculi ratiocinatoris: a theoretical universal logical and axiological calculation framework for generative large language modelsGrandes Modelos de Linguagem GenerativosRaciocínio lógicoAlinhamento axiológicoAjuste-fino de modelos pré-treinadosAprendizagem em contextoCiência da ComputaçãoThis dissertation explores the idea of developing a theoretical universal framework as a calculation of thought, specifically focusing on integrating value alignment in logical reasoning upon generative large language models. The research delves into the historical search for a "calculus ratiocinator," a universal logical calculus, underlying the "Characteristica Universalis," a universal language, and positions large language models as a contemporary manifestation of the latter. The study includes a conceptual review of the foundational models’ learning strategies, including pre-training transformer-based LLMs, transfer learning, and in-context learning methodologies such as zero-shot and few-shot learning, chain-of-thoughts, tree-of-thoughts, self-consistency, and automatic prompt engineer. Following the theoretical framework, the work comprises a thorough literature review, examining the logical reasoning abilities of large language models (LLMs) and evaluating their strengths and weaknesses, scalability, and quality of generated texts. The research then focuses on fine-tuning a pre-trained LLM for value-aligned logical reasoning. It explores methods such as Logical and Axiological Weights (LAW), a rank-based theory for weight adaptation that introduces value-aligned logical reasoning weights in subnetworks of pre-trained models, utilizing the parameter-efficient fine-tuning LoRA/QLoRA. The results from these methods are presented and discussed in detail. Additionally, the work explores prompting methodologies for value-aligned logical reasoning. This includes techniques such as Logic-of-Reasoning (LoR), an in-context learning approach that incorporates the decomposition of input for logical subproofs, tree-based and forest-based sequent calculi, and natural deduction prompts. It also includes aligning the generated text with human value through the Value-Aligned Logic-of-Reasoning (VALoR) prompt. The dissertation concludes by presenting experimental results that validate the proposed methodologies. This research contributes to the field by offering a holistic approach to improve logical reasoning aligned with human values on generative large language models. It provides a foundation for future work in developing more human-aligned AI systems that can reason logically while upholding human values.A presente tese investiga o desenvolvimento de uma estrutura teórica universal para o "cálculo de raciocínio", com foco principal no alinhamento de valores e no raciocínio lógico dos grandes modelos de linguagem generativos (Generative LLMs). A pesquisa inicia com a referência histórica ao "Fundamenta Calculi Ratiocinatoris"de Leibniz, que se trata de um cálculo lógico integrado a uma linguagem universal ("Lingua Characteristica"), e posiciona os modelos de linguagem como uma manifestação contemporânea desse conceito. O estudo inclui uma revisão conceitual das estratégias de aprendizado dos modelos de fundamentais, incluindo LLMs pré-treinados baseados em transformers, metodologias de transferência do aprendizado e aprendizado em contexto, como aprendizado "zero-shot"e "few-shot", cadeia-de-pensamentos, árvore-de-pensamentos, auto-consistência e engenharia de entrada automática. Seguindo o marco teórico, o trabalho elabora uma abrangente revisão da literatura, examinando as habilidades de raciocínio lógico dos grandes modelos de linguagem (LLMs), e avalia seus pontos fortes e fracos, escalabilidade e qualidade dos textos gerados. A pesquisa então se concentra no ajuste fino de LLMs pré-treinados para se obter um raciocínio lógico alinhado a valores. Apresenta métodos como teoria de rankings para adaptação de peso, cálculo lógico alinhado a valores e ajuste fino de sub-redes. Os resultados desses métodos são apresentados e discutidos em detalhes. Além disso, o trabalho explora metodologias de prompt em linguagem natural para obter raciocínio lógico alinhado a valores. Isso inclui técnicas como Sequent Calculi e Dedução Natural, lógica de raciocínio alinhada a valores, decomposição para subprovas lógicas e prompts de cálculos sequenciais baseados em árvores de decisão, juntamente com raciocínio baseado em preferências para obter o alinhamento de valores. A dissertação conclui com a apresentação de resultados experimentais para validar as metodologias propostas. Espera-se que a pesquisa possa contribuir para a área, ao propor uma abordagem holística para melhorar o raciocínio lógico alinhado aos valores humanos em modelos de linguagem generativa. E que a pesquisa possa fornecer uma base para trabalhos futuros no desenvolvimento de sistemas de inteligência artificial mais alinhados aos valores humanos ou ao menos estimular pesquisas nesse sentido.Universidade Federal do Espírito SantoBRDoutorado em Ciência da ComputaçãoCentro TecnológicoUFESPrograma de Pós-Graduação em InformáticaVarejão, Flavio Miguelhttp://lattes.cnpq.br/6501574961643171https://orcid.org/0000000264784743http://lattes.cnpq.br/1600831611942868Rezende, Solange OliveiraBoldt, Francisco de AssisSantos, Thiago Oliveira doshttp://lattes.cnpq.br/5117339495064254Rauber, Thomas Walterhttps://orcid.org/0000000263806584http://lattes.cnpq.br/0462549482032704Brasil Junior, Samuel Meira2024-05-29T20:55:16Z2024-05-29T20:55:16Z2023-12-13info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesisTextapplication/pdfhttp://repositorio.ufes.br/handle/10/12435porinfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da Universidade Federal do Espírito Santo (riUfes)instname:Universidade Federal do Espírito Santo (UFES)instacron:UFES2024-10-22T08:14:11Zoai:repositorio.ufes.br:10/12435Repositório InstitucionalPUBhttp://repositorio.ufes.br/oai/requestriufes@ufes.bropendoar:21082024-10-22T08:14:11Repositório Institucional da Universidade Federal do Espírito Santo (riUfes) - Universidade Federal do Espírito Santo (UFES)false |
| dc.title.none.fl_str_mv |
Magni calculi ratiocinatoris: a theoretical universal logical and axiological calculation framework for generative large language models |
| title |
Magni calculi ratiocinatoris: a theoretical universal logical and axiological calculation framework for generative large language models |
| spellingShingle |
Magni calculi ratiocinatoris: a theoretical universal logical and axiological calculation framework for generative large language models Brasil Junior, Samuel Meira Grandes Modelos de Linguagem Generativos Raciocínio lógico Alinhamento axiológico Ajuste-fino de modelos pré-treinados Aprendizagem em contexto Ciência da Computação |
| title_short |
Magni calculi ratiocinatoris: a theoretical universal logical and axiological calculation framework for generative large language models |
| title_full |
Magni calculi ratiocinatoris: a theoretical universal logical and axiological calculation framework for generative large language models |
| title_fullStr |
Magni calculi ratiocinatoris: a theoretical universal logical and axiological calculation framework for generative large language models |
| title_full_unstemmed |
Magni calculi ratiocinatoris: a theoretical universal logical and axiological calculation framework for generative large language models |
| title_sort |
Magni calculi ratiocinatoris: a theoretical universal logical and axiological calculation framework for generative large language models |
| author |
Brasil Junior, Samuel Meira |
| author_facet |
Brasil Junior, Samuel Meira |
| author_role |
author |
| dc.contributor.none.fl_str_mv |
Varejão, Flavio Miguel http://lattes.cnpq.br/6501574961643171 https://orcid.org/0000000264784743 http://lattes.cnpq.br/1600831611942868 Rezende, Solange Oliveira Boldt, Francisco de Assis Santos, Thiago Oliveira dos http://lattes.cnpq.br/5117339495064254 Rauber, Thomas Walter https://orcid.org/0000000263806584 http://lattes.cnpq.br/0462549482032704 |
| dc.contributor.author.fl_str_mv |
Brasil Junior, Samuel Meira |
| dc.subject.por.fl_str_mv |
Grandes Modelos de Linguagem Generativos Raciocínio lógico Alinhamento axiológico Ajuste-fino de modelos pré-treinados Aprendizagem em contexto Ciência da Computação |
| topic |
Grandes Modelos de Linguagem Generativos Raciocínio lógico Alinhamento axiológico Ajuste-fino de modelos pré-treinados Aprendizagem em contexto Ciência da Computação |
| description |
This dissertation explores the idea of developing a theoretical universal framework as a calculation of thought, specifically focusing on integrating value alignment in logical reasoning upon generative large language models. The research delves into the historical search for a "calculus ratiocinator," a universal logical calculus, underlying the "Characteristica Universalis," a universal language, and positions large language models as a contemporary manifestation of the latter. The study includes a conceptual review of the foundational models’ learning strategies, including pre-training transformer-based LLMs, transfer learning, and in-context learning methodologies such as zero-shot and few-shot learning, chain-of-thoughts, tree-of-thoughts, self-consistency, and automatic prompt engineer. Following the theoretical framework, the work comprises a thorough literature review, examining the logical reasoning abilities of large language models (LLMs) and evaluating their strengths and weaknesses, scalability, and quality of generated texts. The research then focuses on fine-tuning a pre-trained LLM for value-aligned logical reasoning. It explores methods such as Logical and Axiological Weights (LAW), a rank-based theory for weight adaptation that introduces value-aligned logical reasoning weights in subnetworks of pre-trained models, utilizing the parameter-efficient fine-tuning LoRA/QLoRA. The results from these methods are presented and discussed in detail. Additionally, the work explores prompting methodologies for value-aligned logical reasoning. This includes techniques such as Logic-of-Reasoning (LoR), an in-context learning approach that incorporates the decomposition of input for logical subproofs, tree-based and forest-based sequent calculi, and natural deduction prompts. It also includes aligning the generated text with human value through the Value-Aligned Logic-of-Reasoning (VALoR) prompt. The dissertation concludes by presenting experimental results that validate the proposed methodologies. This research contributes to the field by offering a holistic approach to improve logical reasoning aligned with human values on generative large language models. It provides a foundation for future work in developing more human-aligned AI systems that can reason logically while upholding human values. |
| publishDate |
2023 |
| dc.date.none.fl_str_mv |
2023-12-13 2024-05-29T20:55:16Z 2024-05-29T20:55:16Z |
| dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
| dc.type.driver.fl_str_mv |
info:eu-repo/semantics/doctoralThesis |
| format |
doctoralThesis |
| status_str |
publishedVersion |
| dc.identifier.uri.fl_str_mv |
http://repositorio.ufes.br/handle/10/12435 |
| url |
http://repositorio.ufes.br/handle/10/12435 |
| dc.language.iso.fl_str_mv |
por |
| language |
por |
| dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
| eu_rights_str_mv |
openAccess |
| dc.format.none.fl_str_mv |
Text application/pdf |
| dc.publisher.none.fl_str_mv |
Universidade Federal do Espírito Santo BR Doutorado em Ciência da Computação Centro Tecnológico UFES Programa de Pós-Graduação em Informática |
| publisher.none.fl_str_mv |
Universidade Federal do Espírito Santo BR Doutorado em Ciência da Computação Centro Tecnológico UFES Programa de Pós-Graduação em Informática |
| dc.source.none.fl_str_mv |
reponame:Repositório Institucional da Universidade Federal do Espírito Santo (riUfes) instname:Universidade Federal do Espírito Santo (UFES) instacron:UFES |
| instname_str |
Universidade Federal do Espírito Santo (UFES) |
| instacron_str |
UFES |
| institution |
UFES |
| reponame_str |
Repositório Institucional da Universidade Federal do Espírito Santo (riUfes) |
| collection |
Repositório Institucional da Universidade Federal do Espírito Santo (riUfes) |
| repository.name.fl_str_mv |
Repositório Institucional da Universidade Federal do Espírito Santo (riUfes) - Universidade Federal do Espírito Santo (UFES) |
| repository.mail.fl_str_mv |
riufes@ufes.br |
| _version_ |
1834479091002114048 |