A collaborative approach to fault tolerance in cyber-physical systems
| Ano de defesa: | 2025 |
|---|---|
| Autor(a) principal: | |
| Orientador(a): | |
| Banca de defesa: | |
| Tipo de documento: | Tese |
| Tipo de acesso: | Acesso aberto |
| Idioma: | eng |
| Instituição de defesa: |
Universidade Tecnológica Federal do Paraná
Curitiba Brasil Programa de Pós-Graduação em Engenharia Elétrica e Informática Industrial UTFPR |
| Programa de Pós-Graduação: |
Não Informado pela instituição
|
| Departamento: |
Não Informado pela instituição
|
| País: |
Não Informado pela instituição
|
| Palavras-chave em Português: | |
| Link de acesso: | http://repositorio.utfpr.edu.br/jspui/handle/1/37787 |
Resumo: | The reliability and robustness of cyber-physical systems are critical aspects of the current industrial landscape. The high level of autonomous and distributed components associated with many devices makes cyber-physical systems prone to faults. With this in mind and contextualized with the current era of Industry 4.0, fault tolerance is essential for maintaining the robustness and resilience of systems facing unforeseen or undesirable disturbances. Despite their importance and benefits, traditional approaches for fault tolerance, such as local or monolithic centralized, do not correspond with the accelerated technological evolution pace over the past two decades. Driven by the advent of digital technologies such as the Internet of Things, cloud and edge computing, and artificial intelligence associated with enhanced computational processing and communication capabilities, the traditional fault tolerance approaches are out of sync with contemporary and future systems. Consequently, these approaches are limited in achieving the maximum benefits enabled by integrating these technologies, such as robustness related to fault tolerance accuracy and performance improvements, maintaining the flexibility and scalability fundamental for cyberphysical systems operations. Accordingly, this thesis proposes a collaborative fault tolerance approach for cyber-physical systems called Collaborative Fault-Star (CF*), a unified term that refers to the collaboration on the detection, diagnosis, and recovery stages of fault tolerance. The proposed approach takes advantage of cyber-physical components’ inherent data analysis and communication capabilities and is based on multi-agent system principles, where components are self-fault tolerant and adopt collaborative and distributed intelligence behavior when necessary to transcend their individual fault tolerance capabilities. This thesis introduces the backbone and infrastructure of CF*, highlighting the components, agent behavior, functionalities, and interaction protocols to explore the benefits of communication and collaboration between agents to enhance fault tolerance. The CF* is a generic approach, and can be instantiated for diverse cyber-physical system domains as well as for different applications. Experiments were conducted on a developed laboratory-scale warehouse, focusing on the fault detection and diagnosis stage for temperature and humidity sensors in the warehouse racks. The experimental results confirmed the accuracy and performance improvements under CF* compared with the local approach and competitiveness compared to a centralized approach. |
| id |
UTFPR-12_9ae83dceb03d4437a698a4156c3ebcaa |
|---|---|
| oai_identifier_str |
oai:repositorio.utfpr.edu.br:1/37787 |
| network_acronym_str |
UTFPR-12 |
| network_name_str |
Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT)) |
| repository_id_str |
|
| spelling |
A collaborative approach to fault tolerance in cyber-physical systemsUma abordagem colaborativa para tolerância a falhas em sistemas ciber-físicosObjetos cooperativos (sistemas de computador)Tolerância a falha (Computadores)Sistemas multiagentesIndústria 4.0Confiabilidade (Engenharia)Cooperating objects (Computer systems)Fault-tolerant computingMultiagent systemsIndustry 4.0Reliability (Engineering)CNPQ::ENGENHARIAS::ENGENHARIA ELETRICAEngenharia ElétricaThe reliability and robustness of cyber-physical systems are critical aspects of the current industrial landscape. The high level of autonomous and distributed components associated with many devices makes cyber-physical systems prone to faults. With this in mind and contextualized with the current era of Industry 4.0, fault tolerance is essential for maintaining the robustness and resilience of systems facing unforeseen or undesirable disturbances. Despite their importance and benefits, traditional approaches for fault tolerance, such as local or monolithic centralized, do not correspond with the accelerated technological evolution pace over the past two decades. Driven by the advent of digital technologies such as the Internet of Things, cloud and edge computing, and artificial intelligence associated with enhanced computational processing and communication capabilities, the traditional fault tolerance approaches are out of sync with contemporary and future systems. Consequently, these approaches are limited in achieving the maximum benefits enabled by integrating these technologies, such as robustness related to fault tolerance accuracy and performance improvements, maintaining the flexibility and scalability fundamental for cyberphysical systems operations. Accordingly, this thesis proposes a collaborative fault tolerance approach for cyber-physical systems called Collaborative Fault-Star (CF*), a unified term that refers to the collaboration on the detection, diagnosis, and recovery stages of fault tolerance. The proposed approach takes advantage of cyber-physical components’ inherent data analysis and communication capabilities and is based on multi-agent system principles, where components are self-fault tolerant and adopt collaborative and distributed intelligence behavior when necessary to transcend their individual fault tolerance capabilities. This thesis introduces the backbone and infrastructure of CF*, highlighting the components, agent behavior, functionalities, and interaction protocols to explore the benefits of communication and collaboration between agents to enhance fault tolerance. The CF* is a generic approach, and can be instantiated for diverse cyber-physical system domains as well as for different applications. Experiments were conducted on a developed laboratory-scale warehouse, focusing on the fault detection and diagnosis stage for temperature and humidity sensors in the warehouse racks. The experimental results confirmed the accuracy and performance improvements under CF* compared with the local approach and competitiveness compared to a centralized approach.Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)Universidade Tecnológica Federal do Paraná (UTFPR)A confiabilidade e a robustez dos sistemas ciber-físicos são aspectos críticos no cenário industrial atual. O elevado nível de autonomia e distribuição dos componentes, associado à ampla variedade de dispositivos, torna esses sistemas suscetíveis a falhas. Nesse contexto, e alinhado à era atual da Indústria 4.0, a tolerância a falhas torna-se essencial para manter a robustez e a resiliência dos sistemas diante de perturbações imprevistas ou indesejáveis. Apesar de sua importância e dos benefícios oferecidos, as abordagens tradicionais de tolerância a falhas, como as estratégias locais ou centralizadas monolíticas, não acompanham o ritmo acelerado da evolução tecnológica das últimas duas décadas. Impulsionadas pelo advento de tecnologias digitais, como a Internet das Coisas, computação em nuvem e edge, e inteligência artificial, associadas a capacidades elevadas de processamento computacional e comunicação, as abordagens tradicionais de tolerância a falhas mostram-se defasadas em relação aos sistemas contemporâneos e futuros. Como consequência, tais abordagens apresentam limitações para alcançar os benefícios máximos proporcionados pela integração dessas tecnologias, como maior robustez, precisão na tolerância a falhas e melhorias de desempenho, mantendo a flexibilidade e escalabilidade fundamentais para a operação dos sistemas ciber-físicos. Diante disso, esta tese propõe uma abordagem colaborativa de tolerância a falhas em sistemas ciber-físicos, denominada Collaborative Fault-Star (CF*), um termo unificado que engloba a colaboração nos estágios de detecção, diagnóstico e recuperação de falhas. A abordagem proposta explora as capacidades inerentes de análise de dados e comunicação dos componentes ciber-físicos, fundamentando-se nos princípios de sistemas multiagente, nos quais os componentes são autônomos para tolerar falhas e adotam comportamento colaborativo e inteligência distribuída quando necessário, a fim de superar as limitações de suas capacidades individuais. Esta tese apresenta a infraestrutura e a espinha dorsal do CF*, destacando os componentes, comportamentos dos agentes, funcionalidades e protocolos de interação, de modo a explorar os benefícios da comunicação e colaboração entre agentes para aprimorar a tolerância a falhas. O CF* constitui uma abordagem genérica, passível de instanciação para diferentes domínios e aplicações de sistemas ciber-físicos. Experimentos foram conduzidos num armazém em escala laboratorial desenvolvido, com foco nos estágios de detecção e diagnóstico de falhas em sensores de temperatura e umidade instalados nas estantes do armazém. Os resultados experimentais confirmaram melhorias em termos de precisão e desempenho da abordagem CF* em comparação com a abordagem local e competitividade em relação à abordagem centralizada.Universidade Tecnológica Federal do ParanáCuritibaBrasilPrograma de Pós-Graduação em Engenharia Elétrica e Informática IndustrialUTFPROliveira, Andre Schneider dehttps://orcid.org/0000-0002-8295-366Xhttps://lattes.cnpq.br/4006878042502781Costa, Pedro Luis Cerqueira Gomes daLeitão, Paulo Jorge Pintohttps://orcid.org/0000-0002-0435-8419https://orcid.org/0000-0002-2151-7944http://lattes.cnpq.br/6851970061162215Oliveira, Andre Schneider dehttps://orcid.org/0000-0002-8295-366Xhttps://lattes.cnpq.br/4006878042502781Moreira, Antônio Paulo Gomes Mendeshttps://orcid.org/0000-0001-8573-3147http://lattes.cnpq.br/9452560221551641Teixeira, Marco Antonio Simõeshttps://orcid.org/0000-0002-0372-312Xhttps://lattes.cnpq.br/3482274885890440Wehrmeister, Marco Auréliohttps://orcid.org/0000-0002-1415-5527https://lattes.cnpq.br/5548205054206839Rabelo, Ricardo Joséhttps://orcid.org/0000-0002-5488-3692http://lattes.cnpq.br/9784200268507565Piardi, Luis Fernando2025-08-07T19:41:20Z2025-08-07T19:41:20Z2025-06-26info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesisapplication/pdfPIARDI, Luis Fernando. A collaborative approach to fault tolerance in cyber-physical systems. 2025. Tese (Doutorado em Engenharia Elétrica e Informática Industrial) - Universidade Tecnológica Federal do Paraná, Curitiba, 2025.http://repositorio.utfpr.edu.br/jspui/handle/1/37787enghttp://creativecommons.org/licenses/by/4.0/info:eu-repo/semantics/openAccessreponame:Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT))instname:Universidade Tecnológica Federal do Paraná (UTFPR)instacron:UTFPR2025-08-08T06:13:04Zoai:repositorio.utfpr.edu.br:1/37787Repositório InstitucionalPUBhttp://repositorio.utfpr.edu.br:8080/oai/requestriut@utfpr.edu.br || sibi@utfpr.edu.bropendoar:2025-08-08T06:13:04Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT)) - Universidade Tecnológica Federal do Paraná (UTFPR)false |
| dc.title.none.fl_str_mv |
A collaborative approach to fault tolerance in cyber-physical systems Uma abordagem colaborativa para tolerância a falhas em sistemas ciber-físicos |
| title |
A collaborative approach to fault tolerance in cyber-physical systems |
| spellingShingle |
A collaborative approach to fault tolerance in cyber-physical systems Piardi, Luis Fernando Objetos cooperativos (sistemas de computador) Tolerância a falha (Computadores) Sistemas multiagentes Indústria 4.0 Confiabilidade (Engenharia) Cooperating objects (Computer systems) Fault-tolerant computing Multiagent systems Industry 4.0 Reliability (Engineering) CNPQ::ENGENHARIAS::ENGENHARIA ELETRICA Engenharia Elétrica |
| title_short |
A collaborative approach to fault tolerance in cyber-physical systems |
| title_full |
A collaborative approach to fault tolerance in cyber-physical systems |
| title_fullStr |
A collaborative approach to fault tolerance in cyber-physical systems |
| title_full_unstemmed |
A collaborative approach to fault tolerance in cyber-physical systems |
| title_sort |
A collaborative approach to fault tolerance in cyber-physical systems |
| author |
Piardi, Luis Fernando |
| author_facet |
Piardi, Luis Fernando |
| author_role |
author |
| dc.contributor.none.fl_str_mv |
Oliveira, Andre Schneider de https://orcid.org/0000-0002-8295-366X https://lattes.cnpq.br/4006878042502781 Costa, Pedro Luis Cerqueira Gomes da Leitão, Paulo Jorge Pinto https://orcid.org/0000-0002-0435-8419 https://orcid.org/0000-0002-2151-7944 http://lattes.cnpq.br/6851970061162215 Oliveira, Andre Schneider de https://orcid.org/0000-0002-8295-366X https://lattes.cnpq.br/4006878042502781 Moreira, Antônio Paulo Gomes Mendes https://orcid.org/0000-0001-8573-3147 http://lattes.cnpq.br/9452560221551641 Teixeira, Marco Antonio Simões https://orcid.org/0000-0002-0372-312X https://lattes.cnpq.br/3482274885890440 Wehrmeister, Marco Aurélio https://orcid.org/0000-0002-1415-5527 https://lattes.cnpq.br/5548205054206839 Rabelo, Ricardo José https://orcid.org/0000-0002-5488-3692 http://lattes.cnpq.br/9784200268507565 |
| dc.contributor.author.fl_str_mv |
Piardi, Luis Fernando |
| dc.subject.por.fl_str_mv |
Objetos cooperativos (sistemas de computador) Tolerância a falha (Computadores) Sistemas multiagentes Indústria 4.0 Confiabilidade (Engenharia) Cooperating objects (Computer systems) Fault-tolerant computing Multiagent systems Industry 4.0 Reliability (Engineering) CNPQ::ENGENHARIAS::ENGENHARIA ELETRICA Engenharia Elétrica |
| topic |
Objetos cooperativos (sistemas de computador) Tolerância a falha (Computadores) Sistemas multiagentes Indústria 4.0 Confiabilidade (Engenharia) Cooperating objects (Computer systems) Fault-tolerant computing Multiagent systems Industry 4.0 Reliability (Engineering) CNPQ::ENGENHARIAS::ENGENHARIA ELETRICA Engenharia Elétrica |
| description |
The reliability and robustness of cyber-physical systems are critical aspects of the current industrial landscape. The high level of autonomous and distributed components associated with many devices makes cyber-physical systems prone to faults. With this in mind and contextualized with the current era of Industry 4.0, fault tolerance is essential for maintaining the robustness and resilience of systems facing unforeseen or undesirable disturbances. Despite their importance and benefits, traditional approaches for fault tolerance, such as local or monolithic centralized, do not correspond with the accelerated technological evolution pace over the past two decades. Driven by the advent of digital technologies such as the Internet of Things, cloud and edge computing, and artificial intelligence associated with enhanced computational processing and communication capabilities, the traditional fault tolerance approaches are out of sync with contemporary and future systems. Consequently, these approaches are limited in achieving the maximum benefits enabled by integrating these technologies, such as robustness related to fault tolerance accuracy and performance improvements, maintaining the flexibility and scalability fundamental for cyberphysical systems operations. Accordingly, this thesis proposes a collaborative fault tolerance approach for cyber-physical systems called Collaborative Fault-Star (CF*), a unified term that refers to the collaboration on the detection, diagnosis, and recovery stages of fault tolerance. The proposed approach takes advantage of cyber-physical components’ inherent data analysis and communication capabilities and is based on multi-agent system principles, where components are self-fault tolerant and adopt collaborative and distributed intelligence behavior when necessary to transcend their individual fault tolerance capabilities. This thesis introduces the backbone and infrastructure of CF*, highlighting the components, agent behavior, functionalities, and interaction protocols to explore the benefits of communication and collaboration between agents to enhance fault tolerance. The CF* is a generic approach, and can be instantiated for diverse cyber-physical system domains as well as for different applications. Experiments were conducted on a developed laboratory-scale warehouse, focusing on the fault detection and diagnosis stage for temperature and humidity sensors in the warehouse racks. The experimental results confirmed the accuracy and performance improvements under CF* compared with the local approach and competitiveness compared to a centralized approach. |
| publishDate |
2025 |
| dc.date.none.fl_str_mv |
2025-08-07T19:41:20Z 2025-08-07T19:41:20Z 2025-06-26 |
| dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
| dc.type.driver.fl_str_mv |
info:eu-repo/semantics/doctoralThesis |
| format |
doctoralThesis |
| status_str |
publishedVersion |
| dc.identifier.uri.fl_str_mv |
PIARDI, Luis Fernando. A collaborative approach to fault tolerance in cyber-physical systems. 2025. Tese (Doutorado em Engenharia Elétrica e Informática Industrial) - Universidade Tecnológica Federal do Paraná, Curitiba, 2025. http://repositorio.utfpr.edu.br/jspui/handle/1/37787 |
| identifier_str_mv |
PIARDI, Luis Fernando. A collaborative approach to fault tolerance in cyber-physical systems. 2025. Tese (Doutorado em Engenharia Elétrica e Informática Industrial) - Universidade Tecnológica Federal do Paraná, Curitiba, 2025. |
| url |
http://repositorio.utfpr.edu.br/jspui/handle/1/37787 |
| dc.language.iso.fl_str_mv |
eng |
| language |
eng |
| dc.rights.driver.fl_str_mv |
http://creativecommons.org/licenses/by/4.0/ info:eu-repo/semantics/openAccess |
| rights_invalid_str_mv |
http://creativecommons.org/licenses/by/4.0/ |
| eu_rights_str_mv |
openAccess |
| dc.format.none.fl_str_mv |
application/pdf |
| dc.publisher.none.fl_str_mv |
Universidade Tecnológica Federal do Paraná Curitiba Brasil Programa de Pós-Graduação em Engenharia Elétrica e Informática Industrial UTFPR |
| publisher.none.fl_str_mv |
Universidade Tecnológica Federal do Paraná Curitiba Brasil Programa de Pós-Graduação em Engenharia Elétrica e Informática Industrial UTFPR |
| dc.source.none.fl_str_mv |
reponame:Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT)) instname:Universidade Tecnológica Federal do Paraná (UTFPR) instacron:UTFPR |
| instname_str |
Universidade Tecnológica Federal do Paraná (UTFPR) |
| instacron_str |
UTFPR |
| institution |
UTFPR |
| reponame_str |
Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT)) |
| collection |
Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT)) |
| repository.name.fl_str_mv |
Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT)) - Universidade Tecnológica Federal do Paraná (UTFPR) |
| repository.mail.fl_str_mv |
riut@utfpr.edu.br || sibi@utfpr.edu.br |
| _version_ |
1850498314516037632 |