Desarrollo de una herramienta para adquisición de datos en proyectos de datamining para un tren de laminación en caliente [Development of a data acquisition tool for data mining projects for a hot rolling mill]

  1. Joaquín M. Villanueva Balsera
  2. José Valeriano Álvarez Cabal
  3. Luis A. Rodríguez Loredo
  4. Antonio Bello García

Book: X Congreso Internacional de Ingeniería de Proyectos: Valencia, 13-15 Septiembre 2006. Actas

Publisher: edUPV, Editorial Universitat Politècnica de València ; Universitat Politècnica de València

ISBN: 84-9705-987-5

Year of publication: 2006

Pages: 1473-1481

Congress: CIDIP. Congreso Internacional de Ingeniería de Proyectos (10. 2006. Valencia)

Type: Conference paper

Abstract

This paper addresses the problem of acquiring and storing the data needed to develop data mining projects. The project follows CRISP-DM, a methodology for data mining projects in which a project passes through phases such as problem analysis, data analysis, data preprocessing, modeling, evaluation, and model development. Once the problem has been analyzed, the objective is to identify the deviation in the width of the coil. The methodology then proposes the data analysis phase, whose activities and tasks are the subject of this article. The paper describes the data acquisition tasks, such as analyzing the data sources and defining the database model. It also describes the types of information used and the conversions applied for scaling or compression. Further phases are the exploration of the data and the monitoring of its quality; these activities require tools that give the members of the model development team access to the stored data. Both the tools and the database design have to strike a balance between ease of storage and ease of access. The paper emphasizes the importance of using methodologies for both data mining projects and software projects, since they make it possible to set milestones, generalize the development, and facilitate deployment while minimizing failures.