Draft

156  Hypercapnia Computable Phenotype Design

156.1 Summary

  • Hypercapnic RF Computable Phenotype Study Design
  • Overall Structure:
    1. TriNetX analysis
  • Principal Component Analysis
  • Vs LASSO or Elastic Net Regression?
  • Do we still want to concern ourselves with cohort enrollment primarily?
    1. UU Data pull: Next steps
  • Rule vs regression or more advanced models.
  • Enriched sample
  • Issues with generating the enriched sample
  • U of U data request

156.2 Slide outline

156.2.1 Slide 1

  • Hypercapnic RF Computable Phenotype Study Design
  • June 27 2022 ### Slide 2
  • Overall Structure:
  • TriNetX: data analysis to assess feasible, data trends, and get preliminary information for power analysis and design (current)
  • Will also run a latent component (or LASSO? Tensor factorialization? Generalized low rankl modeling) analysis to identify predictors of a gold standard definition
  • UU data pull:
  • Enriched cohort of patients (ie. more likely to have hypercapnic respiratory failure, ideally close to balanced)
  • Reference standard determination of 1. Definitive 2. Probable 3. Not Hypercap RF based on information available in the chart. 2 reviewers
  • Systematic Review of inclusion criteria used in studies or predictors of hypercapnic RF -> evaluation of each of these as “features” in the phenotype
  • Evaluate performance of each, and combinations of those criteria
  • Re-extrapolation to TriNetX ### Slide 3
    1. TriNetX analysis
  • The work we’ve been doing:
  • Suggests a problem is present and provides information to get people interested in doing this better (accomplished)
  • Get preliminary data about the structure of data
  • Overlap of methods estimate
  • R^2 of features might be useful for a power calculation (see in a few slides)
  • Last aim: Generate a classifier
  • Reference standard? 1st ABG w/n 24h admission with PaCO2 over 45mmHg (prior to iatrogenic causes, such as sedation/IMV) or diagnostic code of Hypercapnic respiratory failure.
  • Method of optimization? (next slide) ### Slide 4
  • Principal Component Analysis
  • Dimension reduction: reduce the number of variables as much as possible, while containing most of the information in the original dataset
  • Principal components: combinations that explain the maximal variance.
  • Question: how to handle missing data and informed presence bias?
  • Do we need any pre-determination of what values to include? Ie. derivatives such as prior hospitalization or location of care. Or medications in particular. Will this be “sparse” data?
  • Is our current data-set optimal? Regression only valid in the range of covariates represented. Dimensions excluded by the selection introduced by our data-set
  • Output: feature vector that recasts the data into the principal components ### Slide 5
  • Vs LASSO or Elastic Net Regression?
  • Both are targeted toward minimizing multi-collinearity
  • Lasso/Ridge/Elastic net preserve interpretability. Lasso does dimension reduction while the others don’t exactly (coeff near 0)
  • PCA can then be used as the basis of regression.
  • LASSO seems like it would be computationally complex without some dimension reduction first?
  • Bootstrap (internal) validation? ### Slide 6
  • Do we still want to concern ourselves with cohort enrollment primarily?
  • Vs. predicting whether hypercapnia would be present, if checked?
  • This would be implicitly predicting: a.) that a blood gas was checked and b.) that it showed hypercapnia
  • We may not have robust enough data here to do this with the goal of individual patient care influence (e.g. MEWS) – though this could possibly be done with a dataset like MIMIC ### Slide 7
    1. UU Data pull: Next steps
  • For EMR data request:
  • —ram will look into methods we might have to expedite
  • EMR requests based on diagnoses and give us all the data -
  • —test negative controls. Can use
  • Diagnostic codes, lab values, etc.
  • 157 of patients included - no limit. But, start from 2015

  • Do amendment of the IRB with the data //Ensure IRB has been approved under the condition that we are going to look at MRNs.
    • Will submit an amendment, and then submit the request to the EDW ### Slide 8
  • Rule vs regression or more advanced models.
  • Major Criteria (any of these)
  • First ABG PaCO2 > 45 mmHg
  • Diagnostic code for Hypercapnic RF
  • Etc.
  • Minor Criteria (in combination)
  • First VBG with PaCO2 over 50
  • Initiation of NIV
  • Diagnostic code for respiratory failure of any type
  • Predisposing Demographic
  • Age over 65
  • BMI over 35
  • HCO3 27+
  • Exclusions (none of these must be present)
  • Criteria w/n 3h of anesthesia
  • Receiving IMV prior to criteria ### Slide 9
  • Enriched sample
  • Goal: generate a cohort a large portion (the majority) of the patients who wouldn’t conceivably have hypercapnic respiratory failure
  • Would it be possible (or advisable) to over-select non-white participants to more closely approximate non-UT population? Smokers in proportion to US population?
  • E.g.
  • Diagnostic code for respiratory failure
  • Any blood gas obtained during admission
  • Procedure code for starting non-invasive or invasive mechanical ventilation or CPAP
  • BMI over 30 & HCO3 over 25? ### Slide 10
  • Issues with generating the enriched sample
  • Are we going to introduce selection/collider biases?
  • What is an optimal balance of +Hypercap RF to -Hypercap RF?
  • Is the literature on power calculations around predictive modeling the right literature? E.g. https://www.bmj.com/content/368/bmj.m441
  • Computable phenotype simple prediction model (with binary criteria)?
  • Stata, R “pmsamplesize”; https://riskcalc.org/pmsamplesize/
  • In logistic regression: ### Slide 11
  • Issues with generating the enriched sample
  • Can use methodology similar to https://doi.org/10.1371/journal.pone.0235574
  • People that aren’t in the “enrichment cohort” are assumed to be true negatives (ie. nothing going for them having it) – this minimizes chart review work
  • Over-select on certain variables to make more representative of national population? ### Slide 12
  • U of U data request
  • Dates of interest? If started in 2015– 15,000 population
  • 158 of charts? Est 300 UU patients/month in Venn diagram on TriNetX (extrapolation because no blood gas information)

  • – 150-300 charts depending on method of sample calculation
  • Timeline for expediting the data request?
  • Internal Medicine?
  • ASPIRE grant moneys ### Slide 13
  • Study design:
  • Random sample of charts: in each category
  • Review by 2 raters (inter-rater agreement)
  • Reference “Silver Standard” (raters don’t have access to all info needed for reference std)
  • Classification:
  • Definite non-iatrogenic hypercapnic respiratory failure
  • Threshold: you would put this diagnosis in the EMR, we didn’t cause it
  • Probable iatrogenic hypercapnic respiratory failure
  • Probable non-iatrogenic hypercapnic respiratory failure
  • Uncertain hypercapnic respiratory failure
  • Hypercapnic respiratory failure excluded ### Slide 14
  • Issues with Provider Review “reference std”
  • “Conceptual definitions and operational definitions for study design elements”
  • Exactly what we’re trying to identify: hypercapnic respiratory failure (ie. a blood gas would have shown a PaCO2 over 45 mmHg, had it been checked)
  • that occurred at the time of presentation (ie. prior to iatrogenic causes).
  • No criterion that the hypercapnia directly caused the admission (causality is not knowable given current research)
  • time-line? Criteria for when data-elements must be present wrt diagnosis? E.g.iif BMP is available at admission, but ABG only obtained later.
  • Need a specification of the amount of certainty: e.g. “Definite” I would feel comfortable putting this in the chart as a diagnostic label; probable “I would not label the diagnosis, but it would be more likely than not to be present.”
  • Is there a way to get around the incorporation bias that will result from us using the same criteria as the model to make the “reference standard” determination? (hold out ABG information?) ### Slide 15
  • Computable Phenotype “feature” selection
  • Considerations from: https://www.atsjournals.org/doi/10.1513/AnnalsATS.202002-141ED#i8 (ATS guidance)
  • Data source: retrospective, secondary use
  • Participant inclusion: enriched data-set
  • Outcome: outcome definitions reviewer criteria
  • Predictors: which possible selectors have been included? Some brief systematic review described to identify ways of inclusion into papers on hypercapnic respiratory failure, or prediction of hypercapnia
  • —- Ideally, the pre-selection of potential predictors should focus on those variables that, prior to data collection, are known to be related to the prognostic outcome, based on a combination of clinical expertise and evidence from literature, for instance through prognostic factor studies”
  • Model specification/structure: exclusively linear criteria to facilitate interoperability and understandability ### Slide 16
  • Computable Phenotype examples
  • Individually test the performance of:
  • ABG PaCO2 over 45; Diagnostic Code; other strategies used in prior studies
  • Major Criteria (any of these)
  • First ABG PaCO2 > 45 mmHg
  • Diagnostic code for Hypercapnic RF
  • Etc.
  • Minor Criteria
  • First VBG with PaCO2 over 50
  • Initiation of NIV
  • Diagnostic code for respiratory failure of any type
  • Predisposing Demographic
  • Age over 65
  • BMI over 35
  • HCO3 27+
  • Exclusions (none of these must be present)
  • Criteria w/n 3h of anesthesia
  • Receiving IMV prior to criteria ### Slide 17
  • Features: experimental/continuous?
  • Age, BMI? (analogous to age-adjusted Ddimer)
  • Medications? (all data elements need to be reliably obtainable to be useful)
  • E.g. how in COVID (https://github.com/National-COVID-Cohort-Collaborative/PhenotypeDataAcquisition/wiki/Latest-Phenotype)
  • Perhaps in combination with a “Weak positive” criteria:
  • Note: they also have a “NOT” phenotype – would this be useful? ### Slide 18
  • Pooled-analysis of 5 studies (n335 w/ OHS, n1037 eucapnic obese)
  • Serum HCO3 > 27 mEq/L: +LR 3.74; -LR 0.18 ### Slide 19
  • Utility considerations:
  • Presentation of performance of each component individually (with some representation of who is included and excluded) and a model combining the elements is probably more likely to be useful than a normative “final output” – the user is people who design studies.. Anything that makes the output more “usable” will improve the project. ### Slide 20
  • Study design:
  • Calculate Se and Sp for Definite and Probable non-iatrogenic hypercapnic respiratory failure with various definitions
  • In proposing multi-criteria
  • Change in AUROC/Precision-recall curve?
  • Net Reclassification Index?
  • Shapley values? ### Slide 21
  • Validation/Generalizability?
  • Bootstrapped internal validation worthwhile to assess performance? (ie with a prediction model)
  • Generalizability: Regenerate venn-diagram on TriNetX data (nationally representative data)
  • Implication: if patterns of overlap similar, accuracy may be too
  • Also lets us explore the implications some.
  • in particular, trying to guage it’s performance in more varied populations.
  • “Bring your own model”? What sort of outputs could we get from this? ### Slide 22
  • “In acclimatized people, the decrease in [HCO3-] is 1.5 mEq/L per each 1000 m”
  • Change in variance?
  • Local data?
  • Erik Swenson?
  • Information on the elevation of habitation of the world population? ### Slide 23
  • Anticipated Critiques
  • Is “Hypercapnic Respiratory Failure” too heterogenous to study?
  • Studying ’syndromes’ haven’t been particularly fruitful
  • Proposed paradigm -> moving toward precision in defining the heterogeneity: will it be based on physiologic, epidemiologic, microbiologic criteria?
  • https://www.nature.com/articles/s41591-022-01843-x#Fig1
  • — want to have enough information that we can re-draw the lines around disease definitions if they change.
  • Re: COVID models: ““Most of these prognostic COVID-19 models, usually with a mortality or clinical deterioration outcome, were judged to be at high risk of bias, for which the main reasons were the use of inappropriate data sources, inadequate low sample sizes, inappropriate statistical model evaluations and overall poor reporting.”
  • Idiosyncracies in local coding practices?
  • Intended use: enroll in studies
  • Not: Best-practice alert to consider hypercapnia in an individual patient
  • For cohort studies:
  • For case-control studies: cases need a specific definition to avoid diluting the power ### Slide 24
  • Electronics 2019, 8, 1235; doi:10.3390/electronics8111235 ### Slide 25
  • Types of “validation”

158.1 Learning objectives

  • Hypercapnic RF Computable Phenotype Study Design
  • Overall Structure:
    1. TriNetX analysis
  • Principal Component Analysis
  • Vs LASSO or Elastic Net Regression?

158.2 Bottom line / summary

  • Hypercapnic RF Computable Phenotype Study Design
  • Overall Structure:
    1. TriNetX analysis
  • Principal Component Analysis
  • Vs LASSO or Elastic Net Regression?

158.3 Approach

  1. TODO: Outline the initial assessment or decision point.
  2. TODO: Outline the next diagnostic or management step.
  3. TODO: Outline follow-up or escalation criteria.

158.4 Red flags / when to escalate

  • TODO: List red flags that require urgent escalation.

158.5 Common pitfalls

  • TODO: Capture common errors or missed steps.

158.6 References

TODO: Add landmark references or guideline citations.

158.7 Slides and assets

158.8 Source materials