Utilize este identificador para referenciar este registo: https://hdl.handle.net/1822/38196

TítuloUsing data mining for prediction of hospital length of stay: an application of the CRISP-DM Methodology
Autor(es)Caetano, Nuno
Cortez, Paulo
Laureano, Raul
Palavras-chaveMedical data mining
Hospitalization process
Length of stay
CRISP-DM
Regression
Random forest
DataSet-2015
EditoraSpringer
RevistaLecture Notes in Business Information Processing
CitaçãoCaetano, N., Cortez, P., & Laureano, R. S. (2015). Using Data Mining for Prediction of Hospital Length of Stay: An Application of the CRISP-DM Methodology. In J. Cordeiro, S. Hammoudi, L. Maciaszek, O. Camp & J. Filipe (Eds.), Enterprise Information Systems (Vol. 227, pp. 149-166): Springer International Publishing.
Resumo(s)Hospitals are nowadays collecting vast amounts of data related with patient records. All this data hold valuable knowledge that can be used to improve hospital decision making. Data mining techniques aim precisely at the extraction of useful knowledge from raw data. This work describes an implementation of a medical data mining project approach based on the CRISP-DM methodology. Recent real-world data, from 2000 to 2013, were collected from a Portuguese hospital and related with inpatient hospitalization. The goal was to predict generic hospital Length Of Stay based on indicators that are commonly available at the hospitalization process (e.g., gender, age, episode type, medical specialty). At the data preparation stage, the data were cleaned and variables were selected and transformed, leading to 14 inputs. Next, at the modeling stage, a regression approach was adopted, where six learning methods were compared: Average Prediction, Multiple Regression, Decision Tree, Artificial Neural Network ensemble, Support Vector Machine and Random Forest. The best learning model was obtained by the Random Forest method, which presents a high quality coefficient of determination value (0.81). This model was then opened by using a sensitivity analysis procedure that revealed three influential input attributes: the hospital episode type, the physical service where the patient is hospitalized and the associated medical specialty. Such extracted knowledge confirmed that the obtained predictive model is credible and with potential value for supporting decisions of hospital managers.
TipoArtigo em ata de conferência
URIhttps://hdl.handle.net/1822/38196
ISBN978-3-319-22347-6
978-3-319-22348-3
DOI10.1007/978-3-319-22348-3_9
ISSN1865-1348
Versão da editoraThe original publication is available at : http://link.springer.com/chapter/10.1007%2F978-3-319-22348-3_9#
Arbitragem científicayes
AcessoAcesso aberto
Aparece nas coleções:CAlg - Livros e capítulos de livros/Books and book chapters

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato 
ext151.pdf370 kBAdobe PDFVer/Abrir

Partilhe no FacebookPartilhe no TwitterPartilhe no DeliciousPartilhe no LinkedInPartilhe no DiggAdicionar ao Google BookmarksPartilhe no MySpacePartilhe no Orkut
Exporte no formato BibTex mendeley Exporte no formato Endnote Adicione ao seu ORCID