KDD, SEMMA and CRISP-DM: a parallel overview

Bibliographic details
Main author: Azevedo, Ana (author)
Other authors: Santos, Manuel Filipe (author)
Format: article
Language: eng
Published: 2012
Subjects:
Full text: http://hdl.handle.net/10400.22/136
Country: Portugal
OAI: oai:recipp.ipp.pt:10400.22/136
Description
Abstract: In recent years the data mining field has grown and consolidated considerably. Several efforts seek to establish standards in the area, among them SEMMA and CRISP-DM. Both are emerging as industry standards, and each defines a set of sequential steps intended to guide the implementation of data mining applications. This raised the question of whether they differ substantially from the traditional KDD process. This paper draws a parallel between these two methodologies and the KDD process and examines the similarities among them.