Predicting population health focused outcomes using machine learning

Arnold, David

doi:doi:10.7282/t3-9ck3-9485

RUcore: Rutgers University Community Repository

Search
- All
- Text
- Images
- Audio
- Video
Advanced Search | Help

Search all content in all RUcore collections.
Services
Collections

Help Contact Us My Account

Home

Resource

Predicting population health focused outcomes using machine learning

PDF

PDF format is widely accepted and good for printing.

Plug-in required

PDF-1(2.28 MB)

Citation & Export

View Usage Statistics

Staff View

Citation & Export
Hide

Simple citation

Arnold, David. Predicting population health focused outcomes using machine learning. Retrieved from https://doi.org/doi:10.7282/t3-9ck3-9485

Export

Click here for information about Citation Management Tools at Rutgers.

Statistics
Hide

Description

TitlePredicting population health focused outcomes using machine learning

NameArnold, David (author); Srinivasan, Shankar (chair); Coffman, Fredrick (internal member); Gohel, Suril (internal member); Rutgers University; School of Health Professions

Date Created2019

Other Date2019-05 (degree)

SubjectPopulation health, Biomedical Informatics, Machine learning, Health services administration

Extent1 online resource (xvi, 104 pages) : illustrations

DescriptionCare management activities seek to reduce healthcare cost and improve patient outcomes. Identifying patients who may receive substantial benefit from care management services can be especially challenging when managing large populations across disparate systems. This research tests a novel method for identifying patients for care management using over 30 disparate healthcare data sources and machine learning. Random Forest models were used to predict four binary outcomes; high cost, hospital admission, hospital readmission, and multiple emergency department visits. The models leveraged population health enterprise data warehouse cross-ontology mappings for the following data types; conditions, procedures, medications, results, demographics, and claims-based cost and utilization. Each of the data types were tested independently then combined incrementally. The highest performing models for each outcome of interest resulted with the following ROC AUC; High Cost (0.81), Admission (0.80), Re-admission (0.86), and Multi-ED (0.74). The research shows disparate data sources and machine learning can be used to predict population health focused outcomes. The framework used in this research has the potential to expand and scale to include any number of additional data types and outcomes.

NotePh.D.

NoteIncludes bibliographical references

Genretheses, ETD doctoral

Persistent URLhttps://doi.org/doi:10.7282/t3-9ck3-9485

LanguageEnglish

CollectionSchool of Health Professions ETD Collection

Organization NameRutgers, The State University of New Jersey

RightsThe author owns the copyright to this work.

Version 8.5.5

Citation & ExportHide

Simple citation

Export

StatisticsHide

Description

Citation & Export
Hide

Statistics
Hide