Mutual information and sensitivity analysis for feature selection in customer targeting: a comparative study

Feature selection is a highly relevant task in any data-driven knowledge discovery project. The present research focuses on analysing the advantages and disadvantages of using mutual information (MI) and data-based sensitivity analysis (DSA) for feature selection in classification problems, by apply...

ver descrição completa

Detalhes bibliográficos
Autor principal:	Barraza, N. (author)
Outros Autores:	Moro, S. (author), Ferreyra, M. (author), de la Peña, A. (author)
Formato:	article
Idioma:	eng
Publicado em:	2018
Assuntos:	Customer targeting Direct marketing Feature selection Modelling Mutual information Sensitivity analysis
Texto completo:	http://hdl.handle.net/10071/16227
País:	Portugal
Oai:	oai:repositorio.iscte-iul.pt:10071/16227

Descrição
Resumo:	Feature selection is a highly relevant task in any data-driven knowledge discovery project. The present research focuses on analysing the advantages and disadvantages of using mutual information (MI) and data-based sensitivity analysis (DSA) for feature selection in classification problems, by applying both to a bank telemarketing case. A logistic regression model is built on the tuned set of features identified by each of the two techniques as the most influencing set of features on the success of a telemarketing contact, in a total of 13 features for MI and 9 for DSA. The latter performs better for lower values of false positives while the former is slightly better for a higher false-positive ratio. Thus, MI becomes a better choice if the intention is reducing slightly the cost of contacts without risking losing a high number of successes. However, DSA achieved good prediction results with less features.

Mutual information and sensitivity analysis for feature selection in customer targeting: a comparative study

Registos relacionados

Precisa de ajuda?