Combinando imagem e som para detecção de transições em vídeos digitais

Fonseca, Marcilene Scantamburlo

Combinando imagem e som para detecção de transições em vídeos digitais

Detalhes bibliográficos
Ano de defesa:	2008
Autor(a) principal:	Fonseca, Marcilene Scantamburlo
Orientador(a):	Não Informado pela instituição
Banca de defesa:	Não Informado pela instituição
Tipo de documento:	Dissertação
Tipo de acesso:	Acesso aberto
Idioma:	por
Instituição de defesa:	Programa de Pós-Graduação em Computação Computação
Programa de Pós-Graduação:	Não Informado pela instituição
Departamento:	Não Informado pela instituição
País:	Não Informado pela instituição
Palavras-chave em Português:	Ciência da computação Computação visual Sistemas nervoso autonômos Vídeos Processamento de imagem Análise de imagens Recuperação de imagens Segmentação de vídeo Teoria Bayesiana Recuperação de áudio Estatística CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::TEORIA DA COMPUTACAO::COMPUTABILIDADE E MODELOS DE COMPUTACAO
Link de acesso:	https://app.uff.br/riuff/handle/1/17887
Resumo:	The large and growing volume of digital applications, such as video indexing, retrieving, analysis, compression, storing and data transmission, have being accomplished based on scene transition detection, sometimes called video segmentation. Since it is a basic step and it is far from a complete solutions, it is also a open area, needing much more for investigations. Although scene classification by human concerns several attributes, the use of color, shape and texture are the more frequently used on computational models. On the other hand, researches on digital sound processing and voice signal have gotten much attention, mainly in multimedia systems. It is well known, however, that each attribution on processing of scene analysis can relies on several features at the same time, as visual as sonorous. This dissertation proposes has as main goal the analysis of the results in automatic detection of scene transition using traditional visual features combined with sounding features to the scene. This detection regards the color aspects (based on color histogram), shape aspects (based on edge map histogram) and sound aspects, which is accomplished with 7 sound characteristics, namely: short time average energy, zero-crossing rate, energy band ratio, delta spectral magnitude, root mean square of square sum of signals, high sounds and low value characteristics ratios. A Bayesian network model is used on this analysis and the results shows that the inclusion of sound in the model combining color and shape can improve the detection performance.

Metadados do item

id	UFF-2_8f946f2d002f15df16905a9da0fa64d9
oai_identifier_str	oai:app.uff.br:1/17887
network_acronym_str	UFF-2
network_name_str	Repositório Institucional da Universidade Federal Fluminense (RIUFF)
repository_id_str
spelling	Combinando imagem e som para detecção de transições em vídeos digitaisCiência da computaçãoComputação visualSistemas nervoso autonômosVídeosProcessamento de imagemAnálise de imagensRecuperação de imagensSegmentação de vídeoTeoria BayesianaRecuperação de áudioEstatísticaCNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::TEORIA DA COMPUTACAO::COMPUTABILIDADE E MODELOS DE COMPUTACAOThe large and growing volume of digital applications, such as video indexing, retrieving, analysis, compression, storing and data transmission, have being accomplished based on scene transition detection, sometimes called video segmentation. Since it is a basic step and it is far from a complete solutions, it is also a open area, needing much more for investigations. Although scene classification by human concerns several attributes, the use of color, shape and texture are the more frequently used on computational models. On the other hand, researches on digital sound processing and voice signal have gotten much attention, mainly in multimedia systems. It is well known, however, that each attribution on processing of scene analysis can relies on several features at the same time, as visual as sonorous. This dissertation proposes has as main goal the analysis of the results in automatic detection of scene transition using traditional visual features combined with sounding features to the scene. This detection regards the color aspects (based on color histogram), shape aspects (based on edge map histogram) and sound aspects, which is accomplished with 7 sound characteristics, namely: short time average energy, zero-crossing rate, energy band ratio, delta spectral magnitude, root mean square of square sum of signals, high sounds and low value characteristics ratios. A Bayesian network model is used on this analysis and the results shows that the inclusion of sound in the model combining color and shape can improve the detection performance.Um grande número de aplicações de vídeo digitais, como indexação, recuperação, análise, compressão, armazenamento e transmissão de dados, é realizado, primeiramente, baseado em detecção de transições de cena, às vezes, chamada de segmentação de vídeo. Por ser uma etapa base e estar longe de resolver todos os seus desafios, ainda é uma área em aberto, necessitando de muita investigação. Embora a classificação de cenas pelo ser humano envolva diversos atributos, a utilização de cor, forma e textura ainda são os mais utilizados por modelos computacionais conhecidos. Por outro lado, pesquisas com processamento de sinais sonoros como voz têm ganhado bastante atenção, principalmente em sistemas multimídia. O processo de análise de cenas reais pode envolver uma ou mais características relacionadas a cada atributo, tanto visuais quanto sonoros. O objetivo principal dessa dissertação é a análise dos resultados obtidos da detecção de transição de cenas, quando usadas características visuais combinadas com características de sinais sonoros associados às cenas. Essa detecção levou em consideração os aspectos de cor (baseado no cálculo do histograma de cores); e de forma (baseado no cálculo e histograma do mapa de bordas da imagem). Por sua vez, a detecção com base no conteúdo do som foi baseada na extração de 7 características, a saber: energia média de curta duração, taxa de cruzamento de zero, razão da energia de banda, magnitude do espectro delta, raiz quadrada média do somatório das intensidades dos sinais ao quadrado, razão dos valores de características altas e razão dos valores de características baixas. Um modelo bayesiano foi utilizado para essa análise e os resultados mostraram que a inclusão do som à combinação de cor e forma pode melhorar o desempenho da detecção de transições.Programa de Pós-Graduação em ComputaçãoComputaçãoConci, AuraCPF:01090879922http://buscatextual.cnpq.br/buscatextual/visualizacv.do?id=K4787952Y2Rodrigues, Paulo Sergio SilvaCPF:40095333222http://lattes.cnpq.br/0302011461580302Garcia, Ana Cristina BicharraCPF:31237899422http://lattes.cnpq.br/4879977915136752Oliveira, Jauvane Cavalcante deCPF:40133455622http://lattes.cnpq.br/4054756781423727Bokehi, José RaphaelCPF:40230090022http://lattes.cnpq.br/7945046379650456Fonseca, Marcilene Scantamburlo2021-03-10T20:43:01Z2008-06-092021-03-10T20:43:01Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://app.uff.br/riuff/handle/1/17887porCC-BY-SAinfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da Universidade Federal Fluminense (RIUFF)instname:Universidade Federal Fluminense (UFF)instacron:UFF2021-03-10T20:43:01Zoai:app.uff.br:1/17887Repositório InstitucionalPUBhttps://app.uff.br/oai/requestriuff@id.uff.bropendoar:21202021-03-10T20:43:01Repositório Institucional da Universidade Federal Fluminense (RIUFF) - Universidade Federal Fluminense (UFF)false
dc.title.none.fl_str_mv	Combinando imagem e som para detecção de transições em vídeos digitais
title	Combinando imagem e som para detecção de transições em vídeos digitais
spellingShingle	Combinando imagem e som para detecção de transições em vídeos digitais Fonseca, Marcilene Scantamburlo Ciência da computação Computação visual Sistemas nervoso autonômos Vídeos Processamento de imagem Análise de imagens Recuperação de imagens Segmentação de vídeo Teoria Bayesiana Recuperação de áudio Estatística CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::TEORIA DA COMPUTACAO::COMPUTABILIDADE E MODELOS DE COMPUTACAO
title_short	Combinando imagem e som para detecção de transições em vídeos digitais
title_full	Combinando imagem e som para detecção de transições em vídeos digitais
title_fullStr	Combinando imagem e som para detecção de transições em vídeos digitais
title_full_unstemmed	Combinando imagem e som para detecção de transições em vídeos digitais
title_sort	Combinando imagem e som para detecção de transições em vídeos digitais
author	Fonseca, Marcilene Scantamburlo
author_facet	Fonseca, Marcilene Scantamburlo
author_role	author
dc.contributor.none.fl_str_mv	Conci, Aura CPF:01090879922 http://buscatextual.cnpq.br/buscatextual/visualizacv.do?id=K4787952Y2 Rodrigues, Paulo Sergio Silva CPF:40095333222 http://lattes.cnpq.br/0302011461580302 Garcia, Ana Cristina Bicharra CPF:31237899422 http://lattes.cnpq.br/4879977915136752 Oliveira, Jauvane Cavalcante de CPF:40133455622 http://lattes.cnpq.br/4054756781423727 Bokehi, José Raphael CPF:40230090022 http://lattes.cnpq.br/7945046379650456
dc.contributor.author.fl_str_mv	Fonseca, Marcilene Scantamburlo
dc.subject.por.fl_str_mv	Ciência da computação Computação visual Sistemas nervoso autonômos Vídeos Processamento de imagem Análise de imagens Recuperação de imagens Segmentação de vídeo Teoria Bayesiana Recuperação de áudio Estatística CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::TEORIA DA COMPUTACAO::COMPUTABILIDADE E MODELOS DE COMPUTACAO
topic	Ciência da computação Computação visual Sistemas nervoso autonômos Vídeos Processamento de imagem Análise de imagens Recuperação de imagens Segmentação de vídeo Teoria Bayesiana Recuperação de áudio Estatística CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::TEORIA DA COMPUTACAO::COMPUTABILIDADE E MODELOS DE COMPUTACAO
description	The large and growing volume of digital applications, such as video indexing, retrieving, analysis, compression, storing and data transmission, have being accomplished based on scene transition detection, sometimes called video segmentation. Since it is a basic step and it is far from a complete solutions, it is also a open area, needing much more for investigations. Although scene classification by human concerns several attributes, the use of color, shape and texture are the more frequently used on computational models. On the other hand, researches on digital sound processing and voice signal have gotten much attention, mainly in multimedia systems. It is well known, however, that each attribution on processing of scene analysis can relies on several features at the same time, as visual as sonorous. This dissertation proposes has as main goal the analysis of the results in automatic detection of scene transition using traditional visual features combined with sounding features to the scene. This detection regards the color aspects (based on color histogram), shape aspects (based on edge map histogram) and sound aspects, which is accomplished with 7 sound characteristics, namely: short time average energy, zero-crossing rate, energy band ratio, delta spectral magnitude, root mean square of square sum of signals, high sounds and low value characteristics ratios. A Bayesian network model is used on this analysis and the results shows that the inclusion of sound in the model combining color and shape can improve the detection performance.
publishDate	2008
dc.date.none.fl_str_mv	2008-06-09 2021-03-10T20:43:01Z 2021-03-10T20:43:01Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	https://app.uff.br/riuff/handle/1/17887
url	https://app.uff.br/riuff/handle/1/17887
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	CC-BY-SA info:eu-repo/semantics/openAccess
rights_invalid_str_mv	CC-BY-SA
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Programa de Pós-Graduação em Computação Computação
publisher.none.fl_str_mv	Programa de Pós-Graduação em Computação Computação
dc.source.none.fl_str_mv	reponame:Repositório Institucional da Universidade Federal Fluminense (RIUFF) instname:Universidade Federal Fluminense (UFF) instacron:UFF
instname_str	Universidade Federal Fluminense (UFF)
instacron_str	UFF
institution	UFF
reponame_str	Repositório Institucional da Universidade Federal Fluminense (RIUFF)
collection	Repositório Institucional da Universidade Federal Fluminense (RIUFF)
repository.name.fl_str_mv	Repositório Institucional da Universidade Federal Fluminense (RIUFF) - Universidade Federal Fluminense (UFF)
repository.mail.fl_str_mv	riuff@id.uff.br
_version_	1802135797395095552

Combinando imagem e som para detecção de transições em vídeos digitais

Registros relacionados