A collaborative approach to fault tolerance in cyber-physical systems

Detalhes bibliográficos
Ano de defesa: 2025
Autor(a) principal: Piardi, Luis Fernando
Orientador(a): Não Informado pela instituição
Banca de defesa: Não Informado pela instituição
Tipo de documento: Tese
Tipo de acesso: Acesso aberto
Idioma: eng
Instituição de defesa: Universidade Tecnológica Federal do Paraná
Curitiba
Brasil
Programa de Pós-Graduação em Engenharia Elétrica e Informática Industrial
UTFPR
Programa de Pós-Graduação: Não Informado pela instituição
Departamento: Não Informado pela instituição
País: Não Informado pela instituição
Palavras-chave em Português:
Link de acesso: http://repositorio.utfpr.edu.br/jspui/handle/1/37787
Resumo: The reliability and robustness of cyber-physical systems are critical aspects of the current industrial landscape. The high level of autonomous and distributed components associated with many devices makes cyber-physical systems prone to faults. With this in mind and contextualized with the current era of Industry 4.0, fault tolerance is essential for maintaining the robustness and resilience of systems facing unforeseen or undesirable disturbances. Despite their importance and benefits, traditional approaches for fault tolerance, such as local or monolithic centralized, do not correspond with the accelerated technological evolution pace over the past two decades. Driven by the advent of digital technologies such as the Internet of Things, cloud and edge computing, and artificial intelligence associated with enhanced computational processing and communication capabilities, the traditional fault tolerance approaches are out of sync with contemporary and future systems. Consequently, these approaches are limited in achieving the maximum benefits enabled by integrating these technologies, such as robustness related to fault tolerance accuracy and performance improvements, maintaining the flexibility and scalability fundamental for cyberphysical systems operations. Accordingly, this thesis proposes a collaborative fault tolerance approach for cyber-physical systems called Collaborative Fault-Star (CF*), a unified term that refers to the collaboration on the detection, diagnosis, and recovery stages of fault tolerance. The proposed approach takes advantage of cyber-physical components’ inherent data analysis and communication capabilities and is based on multi-agent system principles, where components are self-fault tolerant and adopt collaborative and distributed intelligence behavior when necessary to transcend their individual fault tolerance capabilities. This thesis introduces the backbone and infrastructure of CF*, highlighting the components, agent behavior, functionalities, and interaction protocols to explore the benefits of communication and collaboration between agents to enhance fault tolerance. The CF* is a generic approach, and can be instantiated for diverse cyber-physical system domains as well as for different applications. Experiments were conducted on a developed laboratory-scale warehouse, focusing on the fault detection and diagnosis stage for temperature and humidity sensors in the warehouse racks. The experimental results confirmed the accuracy and performance improvements under CF* compared with the local approach and competitiveness compared to a centralized approach.
id UTFPR-12_9ae83dceb03d4437a698a4156c3ebcaa
oai_identifier_str oai:repositorio.utfpr.edu.br:1/37787
network_acronym_str UTFPR-12
network_name_str Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT))
repository_id_str
spelling A collaborative approach to fault tolerance in cyber-physical systemsUma abordagem colaborativa para tolerância a falhas em sistemas ciber-físicosObjetos cooperativos (sistemas de computador)Tolerância a falha (Computadores)Sistemas multiagentesIndústria 4.0Confiabilidade (Engenharia)Cooperating objects (Computer systems)Fault-tolerant computingMultiagent systemsIndustry 4.0Reliability (Engineering)CNPQ::ENGENHARIAS::ENGENHARIA ELETRICAEngenharia ElétricaThe reliability and robustness of cyber-physical systems are critical aspects of the current industrial landscape. The high level of autonomous and distributed components associated with many devices makes cyber-physical systems prone to faults. With this in mind and contextualized with the current era of Industry 4.0, fault tolerance is essential for maintaining the robustness and resilience of systems facing unforeseen or undesirable disturbances. Despite their importance and benefits, traditional approaches for fault tolerance, such as local or monolithic centralized, do not correspond with the accelerated technological evolution pace over the past two decades. Driven by the advent of digital technologies such as the Internet of Things, cloud and edge computing, and artificial intelligence associated with enhanced computational processing and communication capabilities, the traditional fault tolerance approaches are out of sync with contemporary and future systems. Consequently, these approaches are limited in achieving the maximum benefits enabled by integrating these technologies, such as robustness related to fault tolerance accuracy and performance improvements, maintaining the flexibility and scalability fundamental for cyberphysical systems operations. Accordingly, this thesis proposes a collaborative fault tolerance approach for cyber-physical systems called Collaborative Fault-Star (CF*), a unified term that refers to the collaboration on the detection, diagnosis, and recovery stages of fault tolerance. The proposed approach takes advantage of cyber-physical components’ inherent data analysis and communication capabilities and is based on multi-agent system principles, where components are self-fault tolerant and adopt collaborative and distributed intelligence behavior when necessary to transcend their individual fault tolerance capabilities. This thesis introduces the backbone and infrastructure of CF*, highlighting the components, agent behavior, functionalities, and interaction protocols to explore the benefits of communication and collaboration between agents to enhance fault tolerance. The CF* is a generic approach, and can be instantiated for diverse cyber-physical system domains as well as for different applications. Experiments were conducted on a developed laboratory-scale warehouse, focusing on the fault detection and diagnosis stage for temperature and humidity sensors in the warehouse racks. The experimental results confirmed the accuracy and performance improvements under CF* compared with the local approach and competitiveness compared to a centralized approach.Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)Universidade Tecnológica Federal do Paraná (UTFPR)A confiabilidade e a robustez dos sistemas ciber-físicos são aspectos críticos no cenário industrial atual. O elevado nível de autonomia e distribuição dos componentes, associado à ampla variedade de dispositivos, torna esses sistemas suscetíveis a falhas. Nesse contexto, e alinhado à era atual da Indústria 4.0, a tolerância a falhas torna-se essencial para manter a robustez e a resiliência dos sistemas diante de perturbações imprevistas ou indesejáveis. Apesar de sua importância e dos benefícios oferecidos, as abordagens tradicionais de tolerância a falhas, como as estratégias locais ou centralizadas monolíticas, não acompanham o ritmo acelerado da evolução tecnológica das últimas duas décadas. Impulsionadas pelo advento de tecnologias digitais, como a Internet das Coisas, computação em nuvem e edge, e inteligência artificial, associadas a capacidades elevadas de processamento computacional e comunicação, as abordagens tradicionais de tolerância a falhas mostram-se defasadas em relação aos sistemas contemporâneos e futuros. Como consequência, tais abordagens apresentam limitações para alcançar os benefícios máximos proporcionados pela integração dessas tecnologias, como maior robustez, precisão na tolerância a falhas e melhorias de desempenho, mantendo a flexibilidade e escalabilidade fundamentais para a operação dos sistemas ciber-físicos. Diante disso, esta tese propõe uma abordagem colaborativa de tolerância a falhas em sistemas ciber-físicos, denominada Collaborative Fault-Star (CF*), um termo unificado que engloba a colaboração nos estágios de detecção, diagnóstico e recuperação de falhas. A abordagem proposta explora as capacidades inerentes de análise de dados e comunicação dos componentes ciber-físicos, fundamentando-se nos princípios de sistemas multiagente, nos quais os componentes são autônomos para tolerar falhas e adotam comportamento colaborativo e inteligência distribuída quando necessário, a fim de superar as limitações de suas capacidades individuais. Esta tese apresenta a infraestrutura e a espinha dorsal do CF*, destacando os componentes, comportamentos dos agentes, funcionalidades e protocolos de interação, de modo a explorar os benefícios da comunicação e colaboração entre agentes para aprimorar a tolerância a falhas. O CF* constitui uma abordagem genérica, passível de instanciação para diferentes domínios e aplicações de sistemas ciber-físicos. Experimentos foram conduzidos num armazém em escala laboratorial desenvolvido, com foco nos estágios de detecção e diagnóstico de falhas em sensores de temperatura e umidade instalados nas estantes do armazém. Os resultados experimentais confirmaram melhorias em termos de precisão e desempenho da abordagem CF* em comparação com a abordagem local e competitividade em relação à abordagem centralizada.Universidade Tecnológica Federal do ParanáCuritibaBrasilPrograma de Pós-Graduação em Engenharia Elétrica e Informática IndustrialUTFPROliveira, Andre Schneider dehttps://orcid.org/0000-0002-8295-366Xhttps://lattes.cnpq.br/4006878042502781Costa, Pedro Luis Cerqueira Gomes daLeitão, Paulo Jorge Pintohttps://orcid.org/0000-0002-0435-8419https://orcid.org/0000-0002-2151-7944http://lattes.cnpq.br/6851970061162215Oliveira, Andre Schneider dehttps://orcid.org/0000-0002-8295-366Xhttps://lattes.cnpq.br/4006878042502781Moreira, Antônio Paulo Gomes Mendeshttps://orcid.org/0000-0001-8573-3147http://lattes.cnpq.br/9452560221551641Teixeira, Marco Antonio Simõeshttps://orcid.org/0000-0002-0372-312Xhttps://lattes.cnpq.br/3482274885890440Wehrmeister, Marco Auréliohttps://orcid.org/0000-0002-1415-5527https://lattes.cnpq.br/5548205054206839Rabelo, Ricardo Joséhttps://orcid.org/0000-0002-5488-3692http://lattes.cnpq.br/9784200268507565Piardi, Luis Fernando2025-08-07T19:41:20Z2025-08-07T19:41:20Z2025-06-26info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesisapplication/pdfPIARDI, Luis Fernando. A collaborative approach to fault tolerance in cyber-physical systems. 2025. Tese (Doutorado em Engenharia Elétrica e Informática Industrial) - Universidade Tecnológica Federal do Paraná, Curitiba, 2025.http://repositorio.utfpr.edu.br/jspui/handle/1/37787enghttp://creativecommons.org/licenses/by/4.0/info:eu-repo/semantics/openAccessreponame:Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT))instname:Universidade Tecnológica Federal do Paraná (UTFPR)instacron:UTFPR2025-08-08T06:13:04Zoai:repositorio.utfpr.edu.br:1/37787Repositório InstitucionalPUBhttp://repositorio.utfpr.edu.br:8080/oai/requestriut@utfpr.edu.br || sibi@utfpr.edu.bropendoar:2025-08-08T06:13:04Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT)) - Universidade Tecnológica Federal do Paraná (UTFPR)false
dc.title.none.fl_str_mv A collaborative approach to fault tolerance in cyber-physical systems
Uma abordagem colaborativa para tolerância a falhas em sistemas ciber-físicos
title A collaborative approach to fault tolerance in cyber-physical systems
spellingShingle A collaborative approach to fault tolerance in cyber-physical systems
Piardi, Luis Fernando
Objetos cooperativos (sistemas de computador)
Tolerância a falha (Computadores)
Sistemas multiagentes
Indústria 4.0
Confiabilidade (Engenharia)
Cooperating objects (Computer systems)
Fault-tolerant computing
Multiagent systems
Industry 4.0
Reliability (Engineering)
CNPQ::ENGENHARIAS::ENGENHARIA ELETRICA
Engenharia Elétrica
title_short A collaborative approach to fault tolerance in cyber-physical systems
title_full A collaborative approach to fault tolerance in cyber-physical systems
title_fullStr A collaborative approach to fault tolerance in cyber-physical systems
title_full_unstemmed A collaborative approach to fault tolerance in cyber-physical systems
title_sort A collaborative approach to fault tolerance in cyber-physical systems
author Piardi, Luis Fernando
author_facet Piardi, Luis Fernando
author_role author
dc.contributor.none.fl_str_mv Oliveira, Andre Schneider de
https://orcid.org/0000-0002-8295-366X
https://lattes.cnpq.br/4006878042502781
Costa, Pedro Luis Cerqueira Gomes da
Leitão, Paulo Jorge Pinto
https://orcid.org/0000-0002-0435-8419
https://orcid.org/0000-0002-2151-7944
http://lattes.cnpq.br/6851970061162215
Oliveira, Andre Schneider de
https://orcid.org/0000-0002-8295-366X
https://lattes.cnpq.br/4006878042502781
Moreira, Antônio Paulo Gomes Mendes
https://orcid.org/0000-0001-8573-3147
http://lattes.cnpq.br/9452560221551641
Teixeira, Marco Antonio Simões
https://orcid.org/0000-0002-0372-312X
https://lattes.cnpq.br/3482274885890440
Wehrmeister, Marco Aurélio
https://orcid.org/0000-0002-1415-5527
https://lattes.cnpq.br/5548205054206839
Rabelo, Ricardo José
https://orcid.org/0000-0002-5488-3692
http://lattes.cnpq.br/9784200268507565
dc.contributor.author.fl_str_mv Piardi, Luis Fernando
dc.subject.por.fl_str_mv Objetos cooperativos (sistemas de computador)
Tolerância a falha (Computadores)
Sistemas multiagentes
Indústria 4.0
Confiabilidade (Engenharia)
Cooperating objects (Computer systems)
Fault-tolerant computing
Multiagent systems
Industry 4.0
Reliability (Engineering)
CNPQ::ENGENHARIAS::ENGENHARIA ELETRICA
Engenharia Elétrica
topic Objetos cooperativos (sistemas de computador)
Tolerância a falha (Computadores)
Sistemas multiagentes
Indústria 4.0
Confiabilidade (Engenharia)
Cooperating objects (Computer systems)
Fault-tolerant computing
Multiagent systems
Industry 4.0
Reliability (Engineering)
CNPQ::ENGENHARIAS::ENGENHARIA ELETRICA
Engenharia Elétrica
description The reliability and robustness of cyber-physical systems are critical aspects of the current industrial landscape. The high level of autonomous and distributed components associated with many devices makes cyber-physical systems prone to faults. With this in mind and contextualized with the current era of Industry 4.0, fault tolerance is essential for maintaining the robustness and resilience of systems facing unforeseen or undesirable disturbances. Despite their importance and benefits, traditional approaches for fault tolerance, such as local or monolithic centralized, do not correspond with the accelerated technological evolution pace over the past two decades. Driven by the advent of digital technologies such as the Internet of Things, cloud and edge computing, and artificial intelligence associated with enhanced computational processing and communication capabilities, the traditional fault tolerance approaches are out of sync with contemporary and future systems. Consequently, these approaches are limited in achieving the maximum benefits enabled by integrating these technologies, such as robustness related to fault tolerance accuracy and performance improvements, maintaining the flexibility and scalability fundamental for cyberphysical systems operations. Accordingly, this thesis proposes a collaborative fault tolerance approach for cyber-physical systems called Collaborative Fault-Star (CF*), a unified term that refers to the collaboration on the detection, diagnosis, and recovery stages of fault tolerance. The proposed approach takes advantage of cyber-physical components’ inherent data analysis and communication capabilities and is based on multi-agent system principles, where components are self-fault tolerant and adopt collaborative and distributed intelligence behavior when necessary to transcend their individual fault tolerance capabilities. This thesis introduces the backbone and infrastructure of CF*, highlighting the components, agent behavior, functionalities, and interaction protocols to explore the benefits of communication and collaboration between agents to enhance fault tolerance. The CF* is a generic approach, and can be instantiated for diverse cyber-physical system domains as well as for different applications. Experiments were conducted on a developed laboratory-scale warehouse, focusing on the fault detection and diagnosis stage for temperature and humidity sensors in the warehouse racks. The experimental results confirmed the accuracy and performance improvements under CF* compared with the local approach and competitiveness compared to a centralized approach.
publishDate 2025
dc.date.none.fl_str_mv 2025-08-07T19:41:20Z
2025-08-07T19:41:20Z
2025-06-26
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/doctoralThesis
format doctoralThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv PIARDI, Luis Fernando. A collaborative approach to fault tolerance in cyber-physical systems. 2025. Tese (Doutorado em Engenharia Elétrica e Informática Industrial) - Universidade Tecnológica Federal do Paraná, Curitiba, 2025.
http://repositorio.utfpr.edu.br/jspui/handle/1/37787
identifier_str_mv PIARDI, Luis Fernando. A collaborative approach to fault tolerance in cyber-physical systems. 2025. Tese (Doutorado em Engenharia Elétrica e Informática Industrial) - Universidade Tecnológica Federal do Paraná, Curitiba, 2025.
url http://repositorio.utfpr.edu.br/jspui/handle/1/37787
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv http://creativecommons.org/licenses/by/4.0/
info:eu-repo/semantics/openAccess
rights_invalid_str_mv http://creativecommons.org/licenses/by/4.0/
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade Tecnológica Federal do Paraná
Curitiba
Brasil
Programa de Pós-Graduação em Engenharia Elétrica e Informática Industrial
UTFPR
publisher.none.fl_str_mv Universidade Tecnológica Federal do Paraná
Curitiba
Brasil
Programa de Pós-Graduação em Engenharia Elétrica e Informática Industrial
UTFPR
dc.source.none.fl_str_mv reponame:Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT))
instname:Universidade Tecnológica Federal do Paraná (UTFPR)
instacron:UTFPR
instname_str Universidade Tecnológica Federal do Paraná (UTFPR)
instacron_str UTFPR
institution UTFPR
reponame_str Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT))
collection Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT))
repository.name.fl_str_mv Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT)) - Universidade Tecnológica Federal do Paraná (UTFPR)
repository.mail.fl_str_mv riut@utfpr.edu.br || sibi@utfpr.edu.br
_version_ 1850498314516037632