Paper
Document
Submit new version
Download
Flag content
0

Data-Driven Cutoff Selection for the Patient Health Questionnaire-9 Depression Screening Tool

Authors
Brooke Levis,Parash Bhandari
Dipika Neupane,Suiqiong Fan,Ying Sun,Chen He,Yin Wu,Ankur Krishnan,Zelalem Negeri,Mahrukh Imran,Danielle Rice,Kira Riehm,Marleine Azar,Alexander Levis,Jill Boruff,Pim Cuijpers,Simon Gilbody,John Ioannidis,Lorie Kloda,Scott Patten,Roy Ziegelstein,Daphna Harel,Yemisi Takwoingi,Sarah Markham,Sultan Alamri,Dagmar Amtmann,Bruce Arroll,Liat Ayalon,Hamid Baradaran,Anna Beraldi,Charles Bernstein,Arvin Bhana,Charles Bombardier,Ryna Buji,Peter Butterworth,Gregory Carter,Marcos Chagas,Juliana Chan,Lai Chan,Dixon Chibanda,Kerrie Clover,Aaron Conway,Yeates Conwell,Federico Daray,Janneke Ginkel,Jesse Fann,Felix Fischer,Sally Field,Jane Fisher,Daniel Fung,Bizu Gelaye,Leila Gholizadeh,Felicity Goodyear‐Smith,Eric Green,Catherine Greeno,Brian Hall,Liisa Hantsoo,Martin Härter,Leanne Hides,Stevan Hobfoll,Simone Honikman,Thomas Hyphantis,Masatoshi Inagaki,Maria Iglesias‐González,Hong Jeon,Nathalie Jetté,Mohammad Khamseh,Kim Kiely,Brandon Kohrt,Yunxin Kwan,Ma. Lara,Holly Levin‐Aspenson,Shen‐Ing Liu,Manote Lotrakul,Sônia Loureiro,Bernd Löwe,Nagendra Luitel,Crick Lund,Ruth Marrie,Laura Marsh,Brian Marx,Anthony McGuire,Sherina Sidik,Tiago Munhoz,Kumiko Muramatsu,Juliet Nakku,Laura Navarrete,Flávia Osório,B.W. Pence,Philippe Persoons,Inge Petersen,Angelo Picardi,Stephanie Pugh,Terence Quinn,Elmārs Rancāns,Sujit Rathod,Katrin Reuter,Alasdair Rooney,Iná Santos,Miranda Schram,Juwita Shaaban,Eileen Shinn,Abbey Sidebottom,Adam Simning,Lena Spangenberg,Lesley Stafford,Sharon Sung,Keiko Suzuki,Pei Tan,Martin Taylor‐Rowan,Thach Tran,Alyna Turner,Christina Feltz‐Cornelis,Thandi Heyningen,Paul Vöhringer,Lynne Wagner,JianLi Wang,David Watson,Jennifer White,Mary Whooley,Kirsty Winkley,Karen Wynter,Mitsuhiko Yamada,Qing Zeng,Yuying Zhang,Brett Thombs,Andrea Benedetti
+125 authors
,Masumi Inagaki
Published
Nov 22, 2024
Show more
Save
TipTip
Document
Submit new version
Download
Flag content
0
TipTip
Save
Document
Submit new version
Download
Flag content

Abstract

Importance Test accuracy studies often use small datasets to simultaneously select an optimal cutoff score that maximizes test accuracy and generate accuracy estimates. Objective To evaluate the degree to which using data-driven methods to simultaneously select an optimal Patient Health Questionnaire-9 (PHQ-9) cutoff score and estimate accuracy yields (1) optimal cutoff scores that differ from the population-level optimal cutoff score and (2) biased accuracy estimates. Design, Setting, and Participants This study used cross-sectional data from an existing individual participant data meta-analysis (IPDMA) database on PHQ-9 screening accuracy to represent a hypothetical population. Studies in the IPDMA database compared participant PHQ-9 scores with a major depression classification. From the IPDMA population, 1000 studies of 100, 200, 500, and 1000 participants each were resampled. Main Outcomes and Measures For the full IPDMA population and each simulated study, an optimal cutoff score was selected by maximizing the Youden index. Accuracy estimates for optimal cutoff scores in simulated studies were compared with accuracy in the full population. Results The IPDMA database included 100 primary studies with 44 503 participants (4541 [10%] cases of major depression). The population-level optimal cutoff score was 8 or higher. Optimal cutoff scores in simulated studies ranged from 2 or higher to 21 or higher in samples of 100 participants and 5 or higher to 11 or higher in samples of 1000 participants. The percentage of simulated studies that identified the true optimal cutoff score of 8 or higher was 17% for samples of 100 participants and 33% for samples of 1000 participants. Compared with estimates for a cutoff score of 8 or higher in the population, sensitivity was overestimated by 6.4 (95% CI, 5.7-7.1) percentage points in samples of 100 participants, 4.9 (95% CI, 4.3-5.5) percentage points in samples of 200 participants, 2.2 (95% CI, 1.8-2.6) percentage points in samples of 500 participants, and 1.8 (95% CI, 1.5-2.1) percentage points in samples of 1000 participants. Specificity was within 1 percentage point across sample sizes. Conclusions and Relevance This study of cross-sectional data found that optimal cutoff scores and accuracy estimates differed substantially from population values when data-driven methods were used to simultaneously identify an optimal cutoff score and estimate accuracy. Users of diagnostic accuracy evidence should evaluate studies of accuracy with caution and ensure that cutoff score recommendations are based on adequately powered research or well-conducted meta-analyses.

Paper PDF

Empty State
This PDF hasn't been uploaded yet.
Do not upload any copyrighted content to the site, only open-access content.
or