RUcore: Rutgers University Community Repository
Search
All
Text
Images
Audio
Video
Advanced Search
|
Help
Search all content in all RUcore collections.
Services
My Account
Deposit Your Work (SOAR Deposit)
Submit Your Rutgers Thesis or Dissertation (RUetd)
Collections
SOAR (Scholarly Open Access at Rutgers)
RUetd (Rutgers Electronic Theses and Dissertations)
Search Portals to Specific Collections
Digital Exhibitions
New Jersey Digital Highway
New Jersey Environmental Digital Library
Browse Scholarly Materials by Publication Year
Cite
Export
Share
Print
Email
Help
Contact Us
My Account
Home
Resource
Selected item
Development and evaluation of a machine learning algorithm to map medical conditions and procedures from real-world data
PDF
PDF format is widely accepted and good for printing.
Plug-in required
PDF-1
(2.05 MB)
EXCEL
Negative Cont...
(1.00 MB)
Citation & Export
View Usage Statistics
Staff View
Citation & Export
Hide
Simple citation
Makadia, Rupa.
Development and evaluation of a machine learning algorithm to map medical conditions and procedures from real-world data.
Retrieved from
https://doi.org/doi:10.7282/t3-ezab-zd23
Export
Click here for information about Citation Management Tools at Rutgers.
EndNote Desktop Client
EndNoteWeb
RefWorks
RIS File Download
Statistics
Hide
Description
Title
Development and evaluation of a machine learning algorithm to map medical conditions and procedures from real-world data
Name
Makadia, Rupa (author)
;
Srinivasan, Shankar (chair)
;
Rutgers University
;
School of Health Professions
Date Created
2019
Other Date
2019-05 (degree)
Subject
Machine learning
,
Biomedical Informatics
,
Knowledge management
Extent
1 online resource (187 pages) : illustrations
Description
Background: Ontologies characterize complex and detailed information and are extensively used in healthcare research. Medical information (textbooks, expert opinions, clinical evidence) has information on conditions and its corresponding procedures (treatments), but this information is not captured or structured in any ontology. The objective of the research is to create a condition-procedure ontology from real world data to be utilized in observational research or electronic health record (EHR) system.
Methods: Predictive models are developed to learn from five datasets (administrative claims, hospital charge data) to generate two algorithms (diagnostic and therapeutic) to predict condition-procedure relationships in the SNOMED-CT vocabulary. A reference set with 100 positive pairs per algorithm, and 32,132 negative pairs were developed. Predictive models were constructed by designing 51 possible covariates that describe condition-procedure pairs from Optum© De-Identified Clinformatics® Data Mart Database – Socio-Economic Status (Optum) dataset and determining which covariates discriminated between the positive and negative controls, as measured by Area Under Receiver Operator Characteristic Curve (AUC). External validation of the final algorithms was performed on 4 other databases. The final algorithms were applied across the universe of condition and procedure pairs in all five databases to construct the full condition-procedure ontology, and the ontology was evaluated for validity and coverage of condition and procedure concepts from the set of identified condition-procedure pairs. An additional analysis was trained to classify diagnostic vs. therapeutic intervention based on the overlap of pairs within the two algorithms.
Results: Algorithms include the following covariates: condition-procedure occurring together, relative risk, support and sensitivity. Both algorithms had AUCs greater than .90, and external validation also showed similar results. In Optum, 98% of conditions and 63% of procedure codes had at least one relationship identified in the ontology. The intervention type analysis resulted in an AUC of 0.79.
Conclusions: Real-world data can be utilized to construct a medical ontology of condition-procedure relationships with strong performance and good coverage. These results can be utilized to fuel research efforts in healthcare such as cohort generation and computer provider order entry systems by understanding conditions and procedures and their application to diagnose or treat a patient.
Note
Ph.D.
Note
Includes bibliographical references
Genre
theses, ETD doctoral
Persistent URL
https://doi.org/doi:10.7282/t3-ezab-zd23
Language
English
Collection
School of Health Professions ETD Collection
Organization Name
Rutgers, The State University of New Jersey
Rights
The author owns the copyright to this work.
Statistical Profile
Version 8.5.1
Site Search
Privacy Policy
libraries.rutgers.edu
Copyright ©2023
Rutgers, The State University of New Jersey
(Further Copyright Information)
Version 8.5.1
Rutgers University Libraries - Copyright ©2023