RFlow: uma arquitetura para execução e coleta de proveniência de workflows estatísticos [RFlow: an architecture for executing statistical workflows and collecting their provenance]

Bibliographic details
Year of defense: 2015
Main author: Nascimento, José Antônio Pires do
Advisor: Cruz, Sérgio Manuel Serra da
Examination committee: Cruz, Sérgio Manuel Serra da; Chaer, Guilherme Montandon; Costa, Raimundo José Macário
Document type: Master's dissertation
Access type: Open access
Language: Portuguese (por)
Defending institution: Universidade Federal Rural do Rio de Janeiro
Graduate program: Programa de Pós-Graduação em Modelagem Matemática e Computacional
Department: Instituto de Ciências Exatas
Country: Brazil
Keywords in Portuguese: Workflow Científico; Proveniência; Sistema R; Agropecuária
Keywords in English: Scientific workflow; Provenance; R System; Agriculture
CNPq knowledge area: Matemática (Mathematics)
Access link: https://tede.ufrrj.br/jspui/handle/jspui/4520
Abstract: Agricultural data related to reducing production costs, improving product quality, predicting and controlling pests and epidemics, and high-precision agriculture are produced on a large scale and in a heterogeneous manner. The data are captured via sensors, UAVs, the web, satellites, and mobile devices, among other sources. This growing volume of scientific data, together with the need to manage and share it among geographically dispersed teams, has created a demand for new techniques and computational tools. This study presents the RFlow architecture, a set of integrated tools that manages and shares scientific experiments based on R scripts, collects their provenance, reproduces them, and helps to validate the statistical results. The SisGExp application, one of the architectural components, not only provides online access to the data and to the processes that transformed them but also collects and records prospective and retrospective provenance descriptors for the experiment. In addition, linking the research data to the statistical results broadens experimental reproducibility and lends greater reliability to the scientific results.
id UFRRJ-1_f08a5279ec9c9413fd37e09ce5f5c338
oai_identifier_str oai:localhost:jspui/4520
network_acronym_str UFRRJ-1
network_name_str Biblioteca Digital de Teses e Dissertações da UFRRJ
repository_id_str
spelling Abstract (translated from the Portuguese resumo): Agricultural data related to reducing production costs and raising product quality, forecasting and controlling pests and epidemics, and high-precision agriculture are produced on a large scale, heterogeneously and in a distributed fashion, through sensors, UAVs, the web, satellites, mobile devices, spreadsheets, and other sources. This growing volume of scientific data and the need to manage and share it among geographically dispersed teams have called for new computational techniques and tools. This work presents the RFlow architecture, a set of integrated tools intended to manage, share, and reproduce scientific experiments based on legacy R scripts and also to help validate the statistical results before the scientific community. The SisGExp application, one of the components of the architecture, provides not only online access to the data and the processes that transformed them but also the collection and recording of provenance descriptors about the experiments. In addition, it links the research data to the statistical results, which broadens the reproducibility of the experiment and gives greater reliability to the scientific results.
Submitted by Celso Magalhaes (celsomagalhaes@ufrrj.br) on 2021-04-08T12:25:08Z. No. of bitstreams: 1. 2015 - José Antônio Pires do Nascimento.pdf: 3468908 bytes, checksum: 11b5c882cabbf9492fb67a3f3a211117 (MD5)
Made available in DSpace on 2021-04-08T12:25:08Z (GMT). No. of bitstreams: 1. 2015 - José Antônio Pires do Nascimento.pdf: 3468908 bytes, checksum: 11b5c882cabbf9492fb67a3f3a211117 (MD5). Previous issue date: 2015-10-07
EMBRAPA - Empresa Brasileira de Pesquisa Agropecuária
application/pdf
https://tede.ufrrj.br/retrieve/64580/2015%20-%20Jos%c3%a9%20Ant%c3%b4nio%20Pires%20do%20Nascimento.pdf.jpg
THUMBNAIL: 2015 - José Antônio Pires do Nascimento.pdf.jpg, image/jpeg, 3342 bytes, MD5 d5fc55dfdf92472960e43cbbdd351650
TEXT: 2015 - José Antônio Pires do Nascimento.pdf.txt, text/plain, 182720 bytes, MD5 e97441fe064cc37d9a8c63dbb2db1ddf
ORIGINAL: 2015 - José Antônio Pires do Nascimento.pdf, application/pdf, 3468908 bytes, MD5 11b5c882cabbf9492fb67a3f3a211117
LICENSE: license.txt, text/plain; charset=utf-8, 2089 bytes, MD5 7b5ba3d2445355f386edab96125d42b7
2023-01-17 17:22:04.239 (timestamp recorded with handle jspui/4520)
Biblioteca Digital de Teses e Dissertações da UFRRJ - Universidade Federal Rural do Rio de Janeiro (UFRRJ), https://tede.ufrrj.br/, OAI endpoint https://tede.ufrrj.br/oai/request, contact bibliot@ufrrj.br, opendoar:2023-01-17T19:22:04
dc.title.por.fl_str_mv RFlow: uma arquitetura para execução e coleta de proveniência de workflows estatísticos
dc.title.alternative.eng.fl_str_mv RFlow: uma arquitetura para execução e coleta de proveniência de workflows estatísticos
author Nascimento, José Antônio Pires do
author_role author
dc.contributor.advisor1.fl_str_mv Cruz, Sérgio Manuel Serra da
dc.contributor.advisor1ID.fl_str_mv 848.488.637-91
dc.contributor.advisor1Lattes.fl_str_mv http://lattes.cnpq.br/7618571401128973
dc.contributor.advisor-co1.fl_str_mv Ceddia, Marcos Bacis
dc.contributor.advisor-co1ID.fl_str_mv 141.571.218-21
dc.contributor.advisor-co1Lattes.fl_str_mv http://lattes.cnpq.br/2115137917689655
dc.contributor.referee1.fl_str_mv Cruz, Sérgio Manuel Serra da
dc.contributor.referee2.fl_str_mv Chaer, Guilherme Montandon
dc.contributor.referee3.fl_str_mv Costa, Raimundo José Macário
dc.contributor.authorID.fl_str_mv 690.987.867-15
dc.contributor.authorLattes.fl_str_mv http://lattes.cnpq.br/8626837615588061
dc.contributor.author.fl_str_mv Nascimento, José Antônio Pires do
dc.subject.por.fl_str_mv Workflow Científico
Proveniência
Sistema R
Agropecuária
dc.subject.eng.fl_str_mv Scientific workflow
Provenance
R System
Agriculture
dc.subject.cnpq.fl_str_mv Matemática
publishDate 2015
dc.date.issued.fl_str_mv 2015-10-07
dc.date.accessioned.fl_str_mv 2021-04-08T12:25:08Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.citation.fl_str_mv NASCIMENTO, José Antônio Pires do. RFlow: uma arquitetura para execução e coleta de proveniência de workflows estatísticos. 2015. 83 f. Dissertação (Mestrado em Modelagem Matemática e Computacional) - Instituto de Ciências Exatas, Universidade Federal Rural do Rio de Janeiro, Seropédica - RJ, 2015.
dc.identifier.uri.fl_str_mv https://tede.ufrrj.br/jspui/handle/jspui/4520
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade Federal Rural do Rio de Janeiro
dc.publisher.program.fl_str_mv Programa de Pós-Graduação em Modelagem Matemática e Computacional
dc.publisher.initials.fl_str_mv UFRRJ
dc.publisher.country.fl_str_mv Brasil
dc.publisher.department.fl_str_mv Instituto de Ciências Exatas
publisher.none.fl_str_mv Universidade Federal Rural do Rio de Janeiro
dc.source.none.fl_str_mv reponame:Biblioteca Digital de Teses e Dissertações da UFRRJ
instname:Universidade Federal Rural do Rio de Janeiro (UFRRJ)
instacron:UFRRJ
instname_str Universidade Federal Rural do Rio de Janeiro (UFRRJ)
instacron_str UFRRJ
institution UFRRJ
reponame_str Biblioteca Digital de Teses e Dissertações da UFRRJ
collection Biblioteca Digital de Teses e Dissertações da UFRRJ
bitstream.url.fl_str_mv http://localhost:8080/tede/bitstream/jspui/4520/4/2015+-+Jos%C3%A9+Ant%C3%B4nio+Pires+do+Nascimento.pdf.jpg
http://localhost:8080/tede/bitstream/jspui/4520/3/2015+-+Jos%C3%A9+Ant%C3%B4nio+Pires+do+Nascimento.pdf.txt
http://localhost:8080/tede/bitstream/jspui/4520/2/2015+-+Jos%C3%A9+Ant%C3%B4nio+Pires+do+Nascimento.pdf
http://localhost:8080/tede/bitstream/jspui/4520/1/license.txt
bitstream.checksum.fl_str_mv d5fc55dfdf92472960e43cbbdd351650
e97441fe064cc37d9a8c63dbb2db1ddf
11b5c882cabbf9492fb67a3f3a211117
7b5ba3d2445355f386edab96125d42b7
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
MD5
repository.name.fl_str_mv Biblioteca Digital de Teses e Dissertações da UFRRJ - Universidade Federal Rural do Rio de Janeiro (UFRRJ)
repository.mail.fl_str_mv bibliot@ufrrj.br
_version_ 1797220335708274688