Neonatal mortality prediction with routinely collected data: a machine learning approach
dc.contributor.author | ANDRE FILIPE DE MORAES BATISTA | |
dc.contributor.author | Diniz, Carmen S. G. | |
dc.contributor.author | Bonilha, Eliana A. | |
dc.contributor.author | Kawachi, Ichiro | |
dc.contributor.author | Chiavegatto Filho, Alexandre D. P. | |
dc.creator | Diniz, Carmen S. G. | |
dc.creator | Bonilha, Eliana A. | |
dc.creator | Kawachi, Ichiro | |
dc.creator | Chiavegatto Filho, Alexandre D. P. | |
dc.date.accessioned | 2024-11-22T16:37:09Z | |
dc.date.available | 2024-11-22T16:37:09Z | |
dc.date.issued | 2021 | |
dc.description.abstract | Background: Recent decreases in neonatal mortality have been slower than expected for most countries. This study aims to predict the risk of neonatal mortality using only data routinely available from birth records in the largest city of the Americas. Methods: A probabilistic linkage of every birth record occurring in the municipality of São Paulo, Brazil, between 2012 and 2017 was performed with the death records from 2012 to 2018 (1,202,843 births and 447,687 deaths), and a total of 7282 neonatal deaths were identified (a neonatal mortality rate of 6.46 per 1000 live births). Births from 2012 to 2016 (N = 941,308; or 83.44% of the total) were used to train five different machine learning algorithms, while births occurring in 2017 (N = 186,854; or 16.56% of the total) were used to test their predictive performance on new unseen data. Results: The best performance was obtained by the extreme gradient boosting trees (XGBoost) algorithm, with a very high AUC of 0.97 and an F1-score of 0.55. The 5% of births with the highest predicted risk of neonatal death included more than 90% of the actual neonatal deaths. On the other hand, there were no deaths among the 5% of births with the lowest predicted risk. There were no significant differences in predictive performance for vulnerable subgroups. Using a smaller number of variables (WHO’s five minimum perinatal indicators) decreased overall performance, but it remained high (AUC of 0.91). With the addition of only three more variables, we achieved the same predictive performance (AUC of 0.97) as with all 23 variables originally available from the Brazilian birth records. Conclusion: Machine learning algorithms were able to identify the neonatal mortality risk of newborns with very high predictive performance using only routinely collected data. | en |
dc.format | Digital | |
dc.format.extent | 6 p. | |
dc.identifier.doi | 10.1186/s12887-021-02788-9 | |
dc.identifier.uri | https://repositorio.insper.edu.br/handle/11224/7236 | |
dc.language.iso | English | |
dc.relation.ispartof | BMC Pediatrics | |
dc.subject | Machine learning | en |
dc.subject | Artificial intelligence | en |
dc.subject | Prediction | en |
dc.subject | Neonatal mortality | en |
dc.subject | Birth records | en |
dc.subject | Brazil | en |
dc.title | Neonatal mortality prediction with routinely collected data: a machine learning approach | |
dc.type | journal article | |
dspace.entity.type | Publication | |
local.identifier.sourceUri | https://bmcpediatr.biomedcentral.com/articles/10.1186/s12887-021-02788-9#citeas | |
local.publisher.country | Not informed | |
local.subject.cnpq | HEALTH SCIENCES::MEDICINE | |
local.subject.cnpq | HEALTH SCIENCES::PUBLIC HEALTH | |
local.subject.cnpq | EXACT AND EARTH SCIENCES::PROBABILITY AND STATISTICS | |
local.subject.cnpq | EXACT AND EARTH SCIENCES::COMPUTER SCIENCE | |
local.subject.cnpq | ENGINEERING::BIOMEDICAL ENGINEERING | |
local.type | Scientific article | |
publicationvolume.volumeNumber | 21 | |
relation.isAuthorOfPublication | b10d272e-98b2-4953-8e51-37aea3fde20c | |
relation.isAuthorOfPublication.latestForDiscovery | b10d272e-98b2-4953-8e51-37aea3fde20c |
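The evaluation design described in the abstract (training on 2012 to 2016 births, testing on 2017 births, and checking how many deaths fall among the 5% of births with the highest predicted risk) can be sketched as follows. This is a minimal illustration, not the authors' code: the helper names `split_shares` and `capture_rate` are hypothetical, and the toy risks and outcomes are invented for the example. Only the birth counts come from the abstract.

```python
# Illustrative sketch only: helper names are hypothetical and the toy
# data below is NOT study data. Only the birth counts are from the abstract.


def split_shares(n_train, n_test):
    """Return the train and test shares (in percent) of a temporal split."""
    total = n_train + n_test
    return 100 * n_train / total, 100 * n_test / total


def capture_rate(risks, outcomes, fraction=0.05):
    """Share of all observed deaths found among the top `fraction` of
    records when ranked by predicted risk (highest first)."""
    if len(risks) != len(outcomes):
        raise ValueError("risks and outcomes must have the same length")
    total_deaths = sum(outcomes)
    if total_deaths == 0:
        raise ValueError("no positive outcomes to capture")
    # Number of records in the highest-risk group (at least one).
    k = max(1, round(len(risks) * fraction))
    ranked = sorted(zip(risks, outcomes), key=lambda pair: pair[0], reverse=True)
    return sum(outcome for _, outcome in ranked[:k]) / total_deaths


# Birth counts quoted in the abstract: 2012-2016 for training, 2017 for testing.
train_share, test_share = split_shares(941_308, 186_854)
print(f"train {train_share:.2f}%, test {test_share:.2f}%")

# Toy example (invented): 20 births, the 2 highest-risk ones are deaths.
toy_risks = [i / 20 for i in range(20)]
toy_outcomes = [0] * 18 + [1, 1]
print(capture_rate(toy_risks, toy_outcomes, fraction=0.05))
```

With real model output, `risks` would be the predicted probabilities for the held-out 2017 births and `outcomes` the linked neonatal death indicator; a value above 0.9 at `fraction=0.05` would correspond to the result reported in the abstract.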
Files
Original bundle
- Name: Primeira_Pagina_Artigo_2021_Neonatal_mortality_prediction_with_routinely_collected_data_a_machine_learning_approach_TC.pdf
- Size: 148.8 KB
- Format: Adobe Portable Document Format
- Name: ACESSO_RESTRITO_Artigo_2021_Neonatal_mortality_prediction_with_routinely_collected_data_a_machine_learning_approach_TC.pdf
- Size: 919.93 KB
- Format: Adobe Portable Document Format
License bundle
- Name: license.txt
- Size: 236 B
- Format: Item-specific license agreed upon to submission