Improving suicide risk prediction via targeted data fusion: proof of concept using medical claims data.

TitleImproving suicide risk prediction via targeted data fusion: proof of concept using medical claims data.
Publication TypeJournal Article
Year of Publication2022
AuthorsXu W, Su C, Li Y, Rogers S, Wang F, Chen K, Aseltine R
JournalJ Am Med Inform Assoc
Volume29
Issue3
Pagination500-511
Date Published2022 Jan 29
ISSN1527-974X
KeywordsChild, Delivery of Health Care, Humans, Predictive Value of Tests, Risk Factors, Suicidal Ideation, Suicide
Abstract

OBJECTIVE: Reducing suicidal behavior among patients in the healthcare system requires accurate and explainable predictive models of suicide risk across diverse healthcare settings.

MATERIALS AND METHODS: We proposed a general targeted fusion learning framework that can be used to build a tailored risk prediction model for any specific healthcare setting, drawing on information fusion from a separate more comprehensive dataset with indirect sample linkage through patient similarities. As a proof of concept, we predicted suicide-related hospitalizations for pediatric patients in a limited statewide Hospital Inpatient Discharge Dataset (HIDD) fused with a more comprehensive medical All-Payer Claims Database (APCD) from Connecticut.

RESULTS: We built a suicide risk prediction model for the source data (APCD) and calculated patient risk scores. Patient similarity scores between patients in the source and target (HIDD) datasets using their demographic characteristics and diagnosis codes were assessed. A fused risk score was generated for each patient in the target dataset using our proposed targeted fusion framework. With this model, the averaged sensitivities at 90% and 95% specificity improved by 67% and 171%, and the positive predictive values for the combined fusion model improved 64% and 135% compared to the conventional model.

DISCUSSION AND CONCLUSIONS: We proposed a general targeted fusion learning framework that can be used to build a tailored predictive model for any specific healthcare setting. Results from this study suggest we can improve the performance of predictive models in specific target settings without complete integration of the raw records from external data sources.

DOI10.1093/jamia/ocab209
Alternate JournalJ Am Med Inform Assoc
PubMed ID34850890
PubMed Central IDPMC8800522
Grant ListR01 MH112148 / MH / NIMH NIH HHS / United States
R01 MH124740 / MH / NIMH NIH HHS / United States
R01MH124740 / NH / NIH HHS / United States
Division: 
Institute of Artificial Intelligence for Digital Health
Category: 
Faculty Publication