Coleção de Artigos Acadêmicos

URI permanente para esta coleçãohttps://repositorio.insper.edu.br/handle/11224/3227

Navegar

Resultados da Pesquisa

Agora exibindo 1 - 10 de 13

Neonatal mortality prediction with routinely collected data: a machine learning approach
(2021) ANDRE FILIPE DE MORAES BATISTA; Diniz, Carmen S. G.; Bonilha, Eliana A.; Kawachi, Ichiro; Chiavegatto Filho, Alexandre D. P.
Background: Recent decreases in neonatal mortality have been slower than expected for most countries. This study aims to predict the risk of neonatal mortality using only data routinely available from birth records in the largest city of the Americas. Methods: A probabilistic linkage of every birth record occurring in the municipality of São Paulo, Brazil, between 2012 e 2017 was performed with the death records from 2012 to 2018 (1,202,843 births and 447,687 deaths), and a total of 7282 neonatal deaths were identified (a neonatal mortality rate of 6.46 per 1000 live births). Births from 2012 and 2016 (N = 941,308; or 83.44% of the total) were used to train five different machine learning algorithms, while births occurring in 2017 (N = 186,854; or 16.56% of the total) were used to test their predictive performance on new unseen data. Results: The best performance was obtained by the extreme gradient boosting trees (XGBoost) algorithm, with a very high AUC of 0.97 and F1-score of 0.55. The 5% births with the highest predicted risk of neonatal death included more than 90% of the actual neonatal deaths. On the other hand, there were no deaths among the 5% births with the lowest predicted risk. There were no significant differences in predictive performance for vulnerable subgroups. The use of a smaller number of variables (WHO’s five minimum perinatal indicators) decreased overall performance but the results still remained high (AUC of 0.91). With the addition of only three more variables, we achieved the same predictive performance (AUC of 0.97) as using all the 23 variables originally available from the Brazilian birth records. Conclusion: Machine learning algorithms were able to identify with very high predictive performance the neonatal mortality risk of newborns using only routinely collected data.
Cause-specific mortality prediction in older residents of São Paulo, Brazil: a machine learning approach
(2021) Nascimento, Carla Ferreira do; Hellen Geremias dos Santos; ANDRE FILIPE DE MORAES BATISTA; Lay, Alejandra Andrea Roman; Duarte, Yeda Aparecida Oliveira
Background: Populational ageing has been increasing in a remarkable rate in developing countries. In this scenario, preventive strategies could help to decrease the burden of higher demands for healthcare services. Machine learning algorithms have been increasingly applied for identifying priority candidates for preventive actions, presenting a better predictive performance than traditional parsimonious models. Methods: Data were collected from the Health, Well Being and Aging (SABE) Study, a representative sample of older residents of São Paulo, Brazil. Machine learning algorithms were applied to predict death by diseases of respiratory system (DRS), diseases of circulatory system (DCS), neoplasms and other specific causes within 5 years, using socioeconomic, demographic and health features. The algorithms were trained in a random sample of 70% of subjects, and then tested in the other 30% unseen data. Results: The outcome with highest predictive performance was death by DRS (AUC−ROC = 0.89), followed by the other specific causes (AUC−ROC = 0.87), DCS (AUC−ROC = 0.67) and neoplasms (AUC−ROC = 0.52). Among only the 25% of individuals with the highest predicted risk of mortality from DRS were included 100% of the actual cases. The machine learning algorithms with the highest predictive performance were light gradient boosted machine and extreme gradient boosting. Conclusion: The algorithms had a high predictive performance for DRS, but lower for DCS and neoplasms. Mortality prediction with machine learning can improve clinical decisions especially regarding targeted preventive measures for older individuals.
Data Leakage in Health Outcomes Prediction With Machine Learning. Comment on “Prediction of Incident Hypertension Within the Next Year: Prospective Study Using Statewide Electronic Health Records and Machine Learning"
(2021) Chiavegatto Filho, Alexandre; ANDRE FILIPE DE MORAES BATISTA; Santos, Hellen Geremias dos
A multipurpose machine learning approach to predict COVID-19 negative prognosis in São Paulo, Brazil
(2021) Fernandes, Fernando Timoteo; Oliveira, Tiago Almeida de; Teixeira, Cristiane Esteves; ANDRE FILIPE DE MORAES BATISTA; Costa, Gabriel Dalla; Chiavegatto Filho, Alexandre Dias Porto
The new coronavirus disease (COVID-19) is a challenge for clinical decision-making and the effective allocation of healthcare resources. An accurate prognostic assessment is necessary to improve survival of patients, especially in developing countries. This study proposes to predict the risk of developing critical conditions in COVID-19 patients by training multipurpose algorithms. We followed a total of 1040 patients with a positive RT-PCR diagnosis for COVID-19 from a large hospital from São Paulo, Brazil, from March to June 2020, of which 288 (28%) presented a severe prognosis, i.e. Intensive Care Unit (ICU) admission, use of mechanical ventilation or death. We used routinely-collected laboratory, clinical and demographic data to train five machine learning algorithms (artificial neural networks, extra trees, random forests, catboost, and extreme gradient boosting). We used a random sample of 70% of patients to train the algorithms and 30% were left for performance assessment, simulating new unseen data. In order to assess if the algorithms could capture general severe prognostic patterns, each model was trained by combining two out of three outcomes to predict the other. All algorithms presented very high predictive performance (average AUROC of 0.92, sensitivity of 0.92, and specificity of 0.82). The three most important variables for the multipurpose algorithms were ratio of lymphocyte per C-reactive protein, C-reactive protein and Braden Scale. The results highlight the possibility that machine learning algorithms are able to predict unspecific negative COVID-19 outcomes from routinely-collected data.
A Software to Compare Clusters between Groups and Its Application to the Study of Autism Spectrum Disorder
(2017) MACIEL CALEBE VIDAL; Sato, João R.; Balardin, Joana B.; Takahashi, Daniel Y.; Fujita, André
Understanding how brain activities cluster can help in the diagnosis of neuropsychological disorders. Thus, it is important to be able to identify alterations in the clustering structure of functional brain networks. Here, we provide an R implementation of Analysis of Cluster Variability (ANOCVA), which statistically tests (1) whether a set of brain regions of interest (ROI) are equally clustered between two or more populations and (2) whether the contribution of each ROI to the differences in clustering is significant. To illustrate the usefulness of our method and software, we apply the R package in a large functional magnetic resonance imaging (fMRI) dataset composed of 896 individuals (529 controls and 285 diagnosed with ASD—autism spectrum disorder) collected by the ABIDE (The Autism Brain Imaging Data Exchange) Consortium. Our analysis show that the clustering structure of controls and ASD subjects are different (p < 0.001) and that specific brain regions distributed in the frontotemporal, sensorimotor, visual, cerebellar, and brainstem systems significantly contributed (p < 0.05) to this differential clustering. These findings suggest an atypical organization of domain-specific functionbrain modules in ASD.
Identification of alterations associated with age in the clustering structure of functional brain networks
(2018) Guzman, Grover E. C.; Sato, Joao R.; MACIEL CALEBE VIDAL; Fujita, Andre
Initial studies using resting-state functional magnetic resonance imaging on the trajectories of the brain network from childhood to adulthood found evidence of functional integration and segregation over time. The comprehension of how healthy individuals’ functional integration and segregation occur is crucial to enhance our understanding of possible deviations that may lead to brain disorders. Recent approaches have focused on the framework wherein the functional brain network is organized into spatially distributed modules that have been associated with specific cognitive functions. Here, we tested the hypothesis that the clustering structure of brain networks evolves during development. To address this hypothesis, we defined a measure of how well a brain region is clustered (network fitness index), and developed a method to evaluate its association with age. Then, we applied this method to a functional magnetic resonance imaging data set composed of 397 males under 31 years of age collected as part of the Autism Brain Imaging Data Exchange Consortium. As results, we identified two brain regions for which the clustering change over time, namely, the left middle temporal gyrus and the left putamen. Since the network fitness index is associated with both integration and segregation, our finding suggests that the identified brain region plays a role in the development of brain systems.
Estimation of the tissue composition of the tumour mass in neuroblastoma using segmented CT images
(2004) FABIO JOSE AYRES; M. K. Zuffo,; Rangayyan, R. M.; Boag, G. S.; O. Filho, V.; Valente , M.
Neuroblastoma is the most common extra-cranial, solid, malignant tumour in children. Advances in radiology have made possible the detection and staging of the disease. Nevertheless, there is no method available at present that can go beyond detection and qualitative analysis, towards quantitative assessment of the tissues composition of the primary tumour mass in neuroblastoma. Such quantitative analysis could provide important information and serve as a decision-support tool to the radiologist and the oncologist, result in better treatment and follow-up and even lead to the avoidance of delayed surgery. The problem investigated was the improvement of the analysis of the primary tumour mass, in patients with neuroblastoma, using X-ray computed tomography (CT) images. A methodology was proposed for the estimation of the tissue content of the mass: it comprised a Gaussian mixture model for estimation, from segmented CT images, of the tissue composition of the primary tumour. To demonstrate the potential of the method, the results are presented of its application to ten CT examinations of four patients. The method provides quantitative information, and it was observed that the tumour in one of the patients reduced from 523 cm3 to 81 cm3 in volume, with an increase in calcification from about 20% to about 88% of the tumour volume, in response to chemotherapy over a period of five months. Results indicate that the proposed technique may be of considerable value in assessing the response to therapy of patients with neuroblastoma.
Gabor filters and phase portraits for the detection of architectural distortion in mammograms
(2006) Rangayyan, Rangaraj M.; FABIO JOSE AYRES
Segmentation of the tumor in neuroblastoma is complicated by the fact that the mass is almost Always heterogeneous in nature; furthermore, viable Architectural distortion is a subtle abnormality in mammograms, and a source of overlooking errors by radiologists. Computer-aided diagnosis (CAD) techniques can improve the performance of radiologists in detecting masses and calcifications; however, most CAD systems have not been designed to detect architectural distortion. We present a new method to detect and localise architectural distortion by analysing the oriented texture in mammograms. A bank of Gabor filters is used to obtain the orientation field of the given mammogram. The curvilinear structures (CLS) of interest (spicules and fibrous tissue) are separated from confounding structures (pectoral muscle edge, parenchymal tissue edges, breast boundary, and noise). The selected core CLS pixels and the orientation field are filtered and downsampled, to reduce noise and also to reduce the computational effort required by the subsequent methods. The downsampled orientation field is analysed to produce three phase portrait maps: node, saddle, and spiral. The node map is further analysed in order to detect the sites of architectural distortion. The method was tested with 19 mammograms containing architectural distortion. In a preliminary experiment, a sensitivity of 84% was obtained at 7.8 false positives per image.
Three-Dimensional Segmentation of the Tumor in Computed Tomographic Images of Neuroblastoma
(2007) Deglint, Hanford J.; Rangayyan, Rangaraj M.; FABIO JOSE AYRES; Boag, Graham S.; Zuffo, Marcelo K.
Segmentation of the tumor in neuroblastoma is complicated by the fact that the mass is almost Always heterogeneous in nature; furthermore, viable tumor, necrosis, and normal tissue are often intermixed. Tumor definition and diagnosis require the analysis of the spatial distribution and Hounsfield unit (HU) values of voxels in computed tomography (CT) images, coupled with a knowledge of normal anatomy. Segmentation and analysis of the tissue composition of the tumor can assist in quantitative assessment of the response to therapy and in the planning of delayed surgery for resection of the tumor. We propose methods to achieve 3-dimensional segmentation of the neuroblastic tumor. In our scheme, some of the normal structures expected in abdominal CT images are delineated and removed from further consideration; the remaining parts of the image volume are then examined for the tumor mass. Mathematical morphology, fuzzy connectivity, and other image processing tools are deployed for this purpose. Expert knowledge provided by a radiologist in the form of the expected structures and their shapes, HU values, and radiological characteristics are incorporated into the segmentation algorithm. In this preliminary study, the methods were tested with 10 CT exams of four cases from the Alberta Children’s Hospital. False-negative error rates of less than 12% were obtained in eight of the 10 exams; however, seven of the exams had false-positive error rates of more than 20% with respect to manual segmentation of the tumor by a radiologist.
A review of computer-aided diagnosis of breast cancer: Toward the detection of subtle signs
(2007) Rangayyan, Rangaraj M.; FABIO JOSE AYRES; Desautels, J.E. Leo
Mammography is the best available tool for screening for the early detection of breast cancer. Mammographic screening has been shown to be effective in reducing breast cancer mortality rates: screening programs have reduced mortality rates by 30–70%. Mammograms are difficult to interpret, especially in the screening context. The sensitivity of screening mammography is affected by image quality and the radiologist's level of expertise. Computer-aided diagnosis (CAD) technology can improve the performance of radiologists, by increasing sensitivity to rates comparable to those obtained by double reading, in a cost-effective manner. Current research is directed toward the development of digital imaging and image analysis systems that can detect mammographic features, classify them, and provide visual prompts to the radiologist. Radiologists would like the ability to change the contrast of a mammogram, either manually or with pre-selected settings. Computer techniques for detecting, classifying, and annotating diagnostic features on the images would be desirable. This paper presents an overview of digital image processing and pattern analysis techniques to address several areas in CAD of breast cancer, including: contrast enhancement, detection and analysis of calcifications, detection and analysis of masses and tumors, analysis of bilateral asymmetry, and detection of architectural distortion. Although a few commercial CAD systems have been released, the detection of subtle signs of breast cancer such as global bilateral asymmetry and focal architectural distortion remains a difficult problem. We present some of our recent works on the development of image processing and pattern analysis techniques for these applications.

Coleção de Artigos Acadêmicos

Navegar

Filtros

Configurações

Ordenar por

Resultados por página

Resultados da Pesquisa