ResearchHub | Open Science Community

Predicting Obesity in Young Adulthood from Childhood and Parental Obesity

Robert Whitaker et al.Sep 25, 1997

Childhood obesity increases the risk of obesity in adulthood, but how parental obesity affects the chances of a child's becoming an obese adult is unknown. We investigated the risk of obesity in young adulthood associated with both obesity in childhood and obesity in one or both parents.

Physiology

Internal Medicine

0

Paper

Save

Time‐Dependent ROC Curves for Censored Survival Data and a Diagnostic Marker

Patrick Heagerty et al.Jun 1, 2000

ROC curves are a popular method for displaying sensitivity and specificity of a continuous diagnostic marker, X, for a binary disease variable, D. However, many disease outcomes are time dependent, D(t), and ROC curves that vary as a function of time may be more appropriate. A common example of a time-dependent variable is vital status, where D(t) = 1 if a patient has died prior to time t and zero otherwise. We propose summarizing the discrimination potential of a marker X, measured at baseline (t = 0), by calculating ROC curves for cumulative disease or death incidence by time t, which we denote as ROC(t). A typical complexity with survival data is that observations may be censored. Two ROC curve estimators are proposed that can accommodate censored data. A simple estimator is based on using the Kaplan-Meier estimator for each possible subset X > c. However, this estimator does not guarantee the necessary condition that sensitivity and specificity are monotone in X. An alternative estimator that does guarantee monotonicity is based on a nearest neighbor estimator for the bivariate distribution function of (X, T), where T represents survival time (Akritas, M. J., 1994, Annals of Statistics 22, 1299-1327). We present an example where ROC(t) is used to compare a standard and a modified flow cytometry measurement for predicting survival after detection of breast cancer and an example where the ROC(t) curve displays the impact of modifying eligibility criteria for sample size and power in HIV prevention trials.

Statistics And Probability

Mathematics

0

Paper

Statistics And Probability

2,484

0

Save

0

The Statistical Evaluation of Medical Tests for Classification and Prediction

Margaret PepeMay 1, 2005

1. Introduction 2. Measures of Accuracy for Binary Tests 3. Comparing Binary Tests and Regression Analysis 4. The Receiver Operating Characteristic Curve 5. Estimating the ROC Curve 6. Covariate Effects on Continuous and Ordinal Tests 7. Incomplete Data and Imperfect Reference Tests 8. Study Design and Hypothesis Testing 9. More Topics and Conclusions References/Bibliography Index

Artificial Intelligence

Statistics And Probability

0

Paper

Artificial Intelligence

1,928

0

Save

0

A cautionary note on inference for marginal regression models with longitudinal data and general correlated response data

Margaret Pepe et al.Jan 1, 1994

Inference for cross-sectional models using longitudinal data, can be accomplished with generalized estimating equations (Zeger and Liang, 1992). We show that either a diagonal working covariance matrix should be used or a key assumption should be verified. The assumption is non-trivial when covariates vary over time. The validity of this assumption is explored for some broad classes of correlation structures. Similar considerations are shown to be relevant for the more general problem of correlated response data and marginal regression analysis with individual level covariates.

Genetics

Artificial Intelligence

0

Paper

Save

Pathologists’ diagnosis of invasive melanoma and melanocytic proliferations: observer accuracy and reproducibility study

Joann Elmore et al.Jun 28, 2017

Objective To quantify the accuracy and reproducibility of pathologists' diagnoses of melanocytic skin lesions.Design Observer accuracy and reproducibility study.Setting 10 US states.Participants Skin biopsy cases (n=240), grouped into sets of 36 or 48. Pathologists from 10 US states were randomized to independently interpret the same set on two occasions (phases 1 and 2), at least eight months apart.Main outcome measures Pathologists' interpretations were condensed into five classes: I (eg, nevus or mild atypia); II (eg, moderate atypia); III (eg, severe atypia or melanoma in situ); IV (eg, pathologic stage T1a (pT1a) early invasive melanoma); and V (eg, ≥pT1b invasive melanoma). Reproducibility was assessed by intraobserver and interobserver concordance rates, and accuracy by concordance with three reference diagnoses.Results In phase 1, 187 pathologists completed 8976 independent case interpretations resulting in an average of 10 (SD 4) different diagnostic terms applied to each case. Among pathologists interpreting the same cases in both phases, when pathologists diagnosed a case as class I or class V during phase 1, they gave the same diagnosis in phase 2 for the majority of cases (class I 76.7%; class V 82.6%). However, the intraobserver reproducibility was lower for cases interpreted as class II (35.2%), class III (59.5%), and class IV (63.2%). Average interobserver concordance rates were lower, but with similar trends. Accuracy using a consensus diagnosis of experienced pathologists as reference varied by class: I, 92% (95% confidence interval 90% to 94%); II, 25% (22% to 28%); III, 40% (37% to 44%); IV, 43% (39% to 46%); and V, 72% (69% to 75%). It is estimated that at a population level, 82.8% (81.0% to 84.5%) of melanocytic skin biopsy diagnoses would have their diagnosis verified if reviewed by a consensus reference panel of experienced pathologists, with 8.0% (6.2% to 9.9%) of cases overinterpreted by the initial pathologist and 9.2% (8.8% to 9.6%) underinterpreted.Conclusion Diagnoses spanning moderately dysplastic nevi to early stage invasive melanoma were neither reproducible nor accurate in this large study of pathologists in the USA. Efforts to improve clinical practice should include using a standardized classification system, acknowledging uncertainty in pathology reports, and developing tools such as molecular markers to support pathologists' visual assessments.

Artificial Intelligence

Biophysics

0

Paper

Artificial Intelligence

360

0

Save

0

Estimation and Comparison of Receiver Operating Characteristic Curves

Margaret Pepe et al.Mar 1, 2009

The receiver operating characteristic (ROC) curve displays the capacity of a marker or diagnostic test to discriminate between two groups of subjects, cases versus controls. We present a comprehensive suite of Stata commands for performing ROC analysis. Nonparametric, semiparametric, and parametric estimators are calculated. Comparisons between curves are based on the area or partial area under the ROC curve. Alternatively, pointwise comparisons between ROC curves or inverse ROC curves can be made. We describe options to adjust these analyses for covariates and to perform ROC regression in a companion article. We use a unified framework by representing the ROC curve as the distribution of the marker in cases where we have standardized it to the control reference distribution.

Physiology

Statistics, Probability And Uncertainty

0

Paper

Save

Testing for improvement in prediction model performance

Margaret Pepe et al.Jan 7, 2013

Authors have proposed new methodology in recent years for evaluating the improvement in prediction performance gained by adding a new predictor, Y , to a risk model containing a set of baseline predictors, X , for a binary outcome D . We prove theoretically that null hypotheses concerning no improvement in performance are equivalent to the simple null hypothesis that Y is not a risk factor when controlling for X , H 0 : P ( D = 1 | X , Y ) = P ( D = 1 | X ). Therefore, testing for improvement in prediction performance is redundant if Y has already been shown to be a risk factor. We also investigate properties of tests through simulation studies, focusing on the change in the area under the ROC curve (AUC). An unexpected finding is that standard testing procedures that do not adjust for variability in estimated regression coefficients are extremely conservative. This may explain why the AUC is widely considered insensitive to improvements in prediction performance and suggests that the problem of insensitivity has to do with use of invalid procedures for inference rather than with the measure itself. To avoid redundant testing and use of potentially problematic methods for inference, we recommend that hypothesis testing for no improvement be limited to evaluation of Y as a risk factor, for which methods are well developed and widely available. Analyses of measures of prediction performance should focus on estimation rather than on testing for no improvement in performance. Copyright © 2013 John Wiley & Sons, Ltd.

Artificial Intelligence

Statistics, Probability And Uncertainty

0

Paper

Artificial Intelligence

214

0

Save