Rheumatoid arthritis (RA) is the most common systemic autoimmune disease, affecting ∼1% of the adult population worldwide, with an estimated heritability of 60%. To identify genes involved in RA susceptibility, we investigated the association between putative functional single-nucleotide polymorphisms (SNPs) and RA among white individuals by use of a case-control study design; a second sample was tested for replication. Here we report the association of RA susceptibility with the minor allele of a missense SNP in PTPN22 (discovery-study allelic P=6.6×10−4; replication-study allelic P=5.6×10−8), which encodes a hematopoietic-specific protein tyrosine phosphatase also known as “Lyp.” We show that the risk allele, which is present in ∼17% of white individuals from the general population and in ∼28% of white individuals with RA, disrupts the P1 proline-rich motif that is important for interaction with Csk, potentially altering these proteins' normal function as negative regulators of T-cell activation. The minor allele of this SNP recently was implicated in type 1 diabetes, suggesting that the variant phosphatase may increase overall reactivity of the immune system and may heighten an individual carrier’s risk for autoimmune disease. Rheumatoid arthritis (RA) is the most common systemic autoimmune disease, affecting ∼1% of the adult population worldwide, with an estimated heritability of 60%. To identify genes involved in RA susceptibility, we investigated the association between putative functional single-nucleotide polymorphisms (SNPs) and RA among white individuals by use of a case-control study design; a second sample was tested for replication. Here we report the association of RA susceptibility with the minor allele of a missense SNP in PTPN22 (discovery-study allelic P=6.6×10−4; replication-study allelic P=5.6×10−8), which encodes a hematopoietic-specific protein tyrosine phosphatase also known as “Lyp.” We show that the risk allele, which is present in ∼17% of white individuals from the general population and in ∼28% of white individuals with RA, disrupts the P1 proline-rich motif that is important for interaction with Csk, potentially altering these proteins' normal function as negative regulators of T-cell activation. The minor allele of this SNP recently was implicated in type 1 diabetes, suggesting that the variant phosphatase may increase overall reactivity of the immune system and may heighten an individual carrier’s risk for autoimmune disease. Rheumatoid arthritis (RA [MIM 180300]) is characterized by immune cell–mediated destruction of the joint architecture and is two to three times more common in women than in men (Firestein Firestein, 2003Firestein GS Evolving concepts of rheumatoid arthritis.Nature. 2003; 423: 356-361Crossref PubMed Scopus (2573) Google Scholar). A strong genetic component is indicated (Seldin et al. Seldin et al., 1999Seldin MF Amos CI Ward R Gregersen PK The genetics revolution and the assault on rheumatoid arthritis.Arthritis Rheum. 1999; 42: 1071-1079Crossref PubMed Scopus (240) Google Scholar; MacGregor et al. MacGregor et al., 2000MacGregor AJ Snieder H Rigby AS Koskenvuo M Kaprio J Aho K Silman AJ Characterizing the quantitative genetic contribution to rheumatoid arthritis using data from twins.Arthritis Rheum. 2000; 43: 30-37Crossref PubMed Scopus (807) Google Scholar), and genome scans have identified multiple regions linked to disease (Cornelis et al. Cornelis et al., 1998Cornelis F Faure S Martinez M Prud’homme JF Fritz P Dib C Alves H et al.New susceptibility locus for rheumatoid arthritis suggested by a genome-wide linkage study.Proc Natl Acad Sci USA. 1998; 95: 10746-10750Crossref PubMed Scopus (436) Google Scholar; Shiozawa et al. Shiozawa et al., 1998Shiozawa S Hayashi S Tsukamoto Y Goko H Kawasaki H Wada T Shimizu K Yasuda N Kamatani N Takasugi K Tanaka Y Shiozawa K Imura S Identification of the gene loci that predispose to rheumatoid arthritis.Int Immunol. 1998; 10: 1891-1895Crossref PubMed Scopus (181) Google Scholar; Jawaheer et al. Jawaheer et al., 2001Jawaheer D Seldin MF Amos CI Chen WV Shigeta R Monteiro J Kern M Criswell LA Albani S Nelson JL Clegg DO Pope R Schroeder Jr, HW Bridges Jr, SL Pisetsky DS Ward R Kastner DL Wilder RL Pincus T Callahan LF Flemming D Wener MH Gregersen PK A genomewide screen in multiplex rheumatoid arthritis families suggests genetic overlap with other autoimmune diseases.Am J Hum Genet. 2001; 68: 927-936Abstract Full Text Full Text PDF PubMed Scopus (308) Google Scholar, Jawaheer et al., 2003Jawaheer D Seldin MF Amos CI Chen WV Shigeta R Etzel C Damle A et al.Screening the genome for rheumatoid arthritis susceptibility genes: a replication study and combined analysis of 512 multicase families.Arthritis Rheum. 2003; 48: 906-916Crossref PubMed Scopus (190) Google Scholar; MacKay et al. MacKay et al., 2002MacKay K Eyre S Myerscough A Milicic A Barton A Laval S Barrett J Lee D White S John S Brown MA Bell J Silman A Ollier W Wordsworth P Worthington J Whole-genome linkage analysis of rheumatoid arthritis susceptibility loci in 252 affected sibling pairs in the United Kingdom.Arthritis Rheum. 2002; 46: 632-639Crossref PubMed Scopus (179) Google Scholar; Fisher et al. Fisher et al., 2003Fisher SA Lanchbury JS Lewis CM Meta-analysis of four rheumatoid arthritis genome-wide linkage studies: confirmation of a susceptibility locus on chromosome 16.Arthritis Rheum. 2003; 48: 1200-1206Crossref PubMed Scopus (60) Google Scholar). Although interesting associations have been reported (Suzuki et al. Suzuki et al., 2003Suzuki A Yamada R Chang X Tokuhiro S Sawada T Suzuki M Nagasaki M Nakayama-Hamada M Kawaida R Ono M Ohtsuki M Furukawa H Yoshino S Yukioka M Tohma S Matsubara T Wakitani S Teshima R Nishioka Y Sekine A Iida A Takahashi A Tsunoda T Nakamura Y Yamamoto K Functional haplotypes of PADI4 encoding citrullinating enzyme peptidylarginine deaminase 4, are associated with rheumatoid arthritis.Nat Genet. 2003; 34: 395-402Crossref PubMed Scopus (864) Google Scholar; Tokuhiro et al. Tokuhiro et al., 2003Tokuhiro S Yamada R Chang X Suzuki A Kochi Y Sawada T Suzuki M Nagasaki M Ohtsuki M Ono M Furukawa H Nagashima M Yoshino S Mabuchi A Sekine A Saito S Takahashi A Tsunoda T Nakamura Y Yamamoto K An intronic SNP in a RUNX1 binding site of SLC22A4 encoding an organic cation transporter is associated with rheumatoid arthritis.Nat Genet. 2003; 35: 341-348Crossref PubMed Scopus (484) Google Scholar), only alleles at the HLA-DRB1 locus have consistently demonstrated both linkage and association (Seldin et al 1999). To identify genes involved in genetic predisposition to RA, we performed a case-control association study (called “discovery study”) with assays for 87 putative functional SNPs (Botstein and Risch Botstein and Risch, 2003Botstein D Risch N Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease.Nat Genet. 2003; : 228-237Crossref PubMed Scopus (1159) Google Scholar) in RA candidate genes and/or linkage regions. The discovery study, consisting of 475 individuals with RA and 475 individually matched controls, was obtained by Genomics Collaborative, Inc. (GCI). Case samples were collected from throughout the United States, and they met the 1987 American College of Rheumatology (ACR) diagnostic criteria for RA. All case samples were from white individuals with an age at onset of RA of between 18 and 68 years and a positive rheumatoid factor of ⩾20 IU. Individuals with psoriasis, systemic lupus erythematosus (SLE), ankylosing spondylitis, or Reiter syndrome were excluded. Control samples were taken from a pool of healthy white individuals from the United States with no medical history of RA or of any of the autoimmune disorders listed above. A single control was matched to each case on the basis of sex, age (±5 years), and ethnicity (grandparental country/region of origin). All protocols and recruitment sites have been approved by national and/or local institutional review boards, and all subjects were enrolled with informed written consent. We found association with the minor allele (T) of a missense SNP (R620W [rs2476601, 1858C→T]) in the protein tyrosine phosphatase non-receptor type 22 gene (PTPN22) (allele frequency 13.8% in cases, 8.8% in controls) (P=6.6×10−4, allelic odds ratio [OR] 1.65, 95% CI 1.23–2.20). Replication in a second study (called the “replication study”) confirmed association. Cases in the replication study were obtained by the North American Rheumatoid Arthritis Consortium (NARAC) and consisted of members of white multiplex families. For this study, DNA was available for 840 individuals with RA from 463 families. These families were recruited from throughout the United States through the 12 participating recruitment centers of NARAC (NARAC Web site). Informed written consent was obtained from every subject, including all participating family members, and the local institutional review board’s approval was secured at each recruitment site. The enrollment criteria for family participation are described in detail elsewhere (Jawaheer et al. Jawaheer et al., 2001Jawaheer D Seldin MF Amos CI Chen WV Shigeta R Monteiro J Kern M Criswell LA Albani S Nelson JL Clegg DO Pope R Schroeder Jr, HW Bridges Jr, SL Pisetsky DS Ward R Kastner DL Wilder RL Pincus T Callahan LF Flemming D Wener MH Gregersen PK A genomewide screen in multiplex rheumatoid arthritis families suggests genetic overlap with other autoimmune diseases.Am J Hum Genet. 2001; 68: 927-936Abstract Full Text Full Text PDF PubMed Scopus (308) Google Scholar). In brief, at least two siblings must satisfy the 1987 ACR criteria for RA, at least one sibling must have documented erosions on hand radiographs, and at least one sibling must have disease onset between the ages of 18 and 60 years. The presence of psoriasis, inflammatory bowel disease, or SLE was an exclusionary criterion for the sib pair. Controls were selected from 20,000 individuals who are part of the New York Cancer Project (NYCP), a population-based prospective study of the genetic and environmental factors that cause disease (New York Cancer Project Web site). Two control individuals were matched to a single randomly chosen affected sib on the basis of sex, age (birth decade), and ethnicity (grandparental country/region of origin). These 463 independent cases (referred to as “single sibs”) and their 926 matched controls were used for all analyses reported in this study, except where noted. The association for single sibs was as follows: allele frequency 15.8% in cases and 8.7% in controls (P=5.6×10−8, allelic OR 1.97, 95% CI 1.55–2.50). An increase of the risk allele frequency was apparent when all affected siblings (n=840) were analyzed (allele frequency 16.8%). Genotypic analyses produced similar results, showing increased frequencies of both TT and TC genotypes in the cases, compared with the controls (table 1).Table 1Frequency of PTPN22 Genotypes in Individuals with RA and Matched Controls in Two Independent Sample SetsFrequency of (No. of Individuals with)PTPN22Genotype inDiscovery StudyReplication StudyGenotypeControls (n=475)Cases (n=475)Controls (n=926)Single SibsaIndependent cases comprising one randomly selected sib per NARAC family. (n=463)All SibsbAll affected individuals in NARAC. (n=840)TT.006 (3).013 (6).010 (9).021 (10).025 (21)TC.164 (78).251 (119).154 (143).273 (126).287 (241)CC.830 (394).737 (350).836 (774).706 (327).688 (578)Note.—Genotypes were generated by use of kinetic, allele-specific PCR (Germer et al. Germer et al., 2000Germer S Holland MJ Higuchi R High-throughput SNP allele-frequency determination in pooled DNA samples by kinetic PCR.Genome Res. 2000; 10: 258-266Crossref PubMed Scopus (361) Google Scholar): 0.3 ng of DNA in a 15−μl reaction containing the allele-specific primers 5′-CCTCCACTTCCTGTAT/C and a common primer, 5′-CCCATCCCACACTTTATTTTATAC. Genotyping accuracy was 100%, as determined by internal comparisons by use of a different assay (Iannone et al. Iannone et al., 2000Iannone MA Taylor JD Chen J Li MS Rivers P Slentz-Kesler KA Weiner MP Multiplexed single nucleotide polymorphism genotyping by oligonucleotide ligation and flow cytometry.Cytometry. 2000; 39: 131-140Crossref PubMed Scopus (187) Google Scholar) for the same marker on ∼6% of the samples. All case and control populations were in Hardy-Weinberg equilibrium.a Independent cases comprising one randomly selected sib per NARAC family.b All affected individuals in NARAC. Open table in a new tab Note.— Genotypes were generated by use of kinetic, allele-specific PCR (Germer et al. Germer et al., 2000Germer S Holland MJ Higuchi R High-throughput SNP allele-frequency determination in pooled DNA samples by kinetic PCR.Genome Res. 2000; 10: 258-266Crossref PubMed Scopus (361) Google Scholar): 0.3 ng of DNA in a 15−μl reaction containing the allele-specific primers 5′-CCTCCACTTCCTGTAT/C and a common primer, 5′-CCCATCCCACACTTTATTTTATAC. Genotyping accuracy was 100%, as determined by internal comparisons by use of a different assay (Iannone et al. Iannone et al., 2000Iannone MA Taylor JD Chen J Li MS Rivers P Slentz-Kesler KA Weiner MP Multiplexed single nucleotide polymorphism genotyping by oligonucleotide ligation and flow cytometry.Cytometry. 2000; 39: 131-140Crossref PubMed Scopus (187) Google Scholar) for the same marker on ∼6% of the samples. All case and control populations were in Hardy-Weinberg equilibrium. We used contingency tables and conditional-logistic regression (CLR) (Breslow and Day Breslow and Day, 1980Breslow NE Day NE Statistical methods in cancer research, volume I: the analysis of case-control studies.IARC Sci Publ. 1980; 32: 5-338PubMed Google Scholar) to assess the association of the PTPN22 R620W genotype with RA (table 2). In the discovery study, both the TC (ORCLR 1.69, P=.0012) and TT (ORCLR 2.26, P value not significant) genotypes conferred an increased risk for disease compared with the CC genotype. Further adjustment for the HLA-DRB1 genotype, the strongest known genetic risk factor (Seldin et al. Seldin et al., 1999Seldin MF Amos CI Ward R Gregersen PK The genetics revolution and the assault on rheumatoid arthritis.Arthritis Rheum. 1999; 42: 1071-1079Crossref PubMed Scopus (240) Google Scholar), had little impact on risk estimates. Similar results were observed in the replication study. The susceptible TT and TC genotypes were strongly associated with rheumatoid factor–positive (RF+) disease, even after adjustment for the HLA genotype (P=.0197 in discovery study, P=.0005 in replication study) (table 3); however, there was no evidence for association with rheumatoid factor–negative (RF−) disease. This interesting observation has been replicated in an additional cohort of patients with recent-onset RA (A. T. Lee and P. K. Gregersen, unpublished data). There was no consistent sex difference in association of the risk allele with RA (data not shown).Table 2Association of PTPN22 with RAResults forCrude AnalysisaValues were calculated by use of standard contingency tables (or two-tailed Fisher’s Exact Test).CLRbCLR was performed as described elsewhere (Breslow and Day 1980) to account for individual matching of controls to cases.CLR, Adjusted for HLAcTo assess whether the observed associations were independent of the HLA-DRB1 genotype, we further adjusted for this known risk factor. HLA-DRB1 genotyping was performed as described elsewhere (Jawaheer et al. 2002), and, for this analysis, samples were binned according to their HLA-DRB1 genotype: high risk, two shared epitopes (HR-2SE); high risk, one shared epitope (HR-1SE; 4K,X; or 4R,X); low risk, one shared epitope (LR-1SE; 1R,X; or 10,X); and low risk, zero shared epitopes (LR-0SE; X,X) (Fries et al. 2002).Data Set and GenotypeNo. of CasesNo. of ControlsORdORs were calculated relative to the major allele homozygote (CC).95% CIPORdORs were calculated relative to the major allele homozygote (CC).95% CIPORdORs were calculated relative to the major allele homozygote (CC).95% CIPDiscovery: TT632.25.56–9.07.32eP value calculated by use of Fisher’s Exact Test.2.26.56–9.14.252.39.35–16.43.38 TC119781.721.25–2.36.0011.691.23–2.32.0011.531.05–2.23.03 TT+TCfBecause of the infrequency of the TT genotype, we combined it with the TC genotype and repeated the analysis.125811.741.27–2.38.00051.711.25–2.34.00081.551.07–2.25.02 CC350394………………………Replication: TT1092.631.06–6.53.032.551.03–6.31.043.261.03–10.28.04 TC1261432.091.59–2.74<.00012.081.58–2.74<.00011.71.19–2.42.004 TT+TCfBecause of the infrequency of the TT genotype, we combined it with the TC genotype and repeated the analysis.1361522.121.62–2.76<.00012.111.62–2.76<.00011.781.26–2.51.001 CC327774………………………a Values were calculated by use of standard contingency tables (or two-tailed Fisher’s Exact Test).b CLR was performed as described elsewhere (Breslow and Day Breslow and Day, 1980Breslow NE Day NE Statistical methods in cancer research, volume I: the analysis of case-control studies.IARC Sci Publ. 1980; 32: 5-338PubMed Google Scholar) to account for individual matching of controls to cases.c To assess whether the observed associations were independent of the HLA-DRB1 genotype, we further adjusted for this known risk factor. HLA-DRB1 genotyping was performed as described elsewhere (Jawaheer et al. Jawaheer et al., 2002Jawaheer D Li W Graham RR Chen W Damle A Xiao X Monteiro J Khalili H Lee A Lundsten R Begovich A Bugawan T Erlich H Elder JT Criswell LA Seldin MF Amos CI Behrens TW Gregersen PK Dissecting the genetic complexity of the association between human leukocyte antigens and rheumatoid arthritis.Am J Hum Genet. 2002; 71: 585-594Abstract Full Text Full Text PDF PubMed Scopus (153) Google Scholar), and, for this analysis, samples were binned according to their HLA-DRB1 genotype: high risk, two shared epitopes (HR-2SE); high risk, one shared epitope (HR-1SE; 4K,X; or 4R,X); low risk, one shared epitope (LR-1SE; 1R,X; or 10,X); and low risk, zero shared epitopes (LR-0SE; X,X) (Fries et al. Fries et al., 2002Fries JF Wolfe F Apple R Erlich H Bugawan T Holmes T Bruce B HLA-DRB1 genotype associations in 793 white patients from a rheumatoid arthritis inception cohort: frequency, severity, and treatment bias.Arthritis Rheum. 2002; 46: 2320-2329Crossref PubMed Scopus (72) Google Scholar).d ORs were calculated relative to the major allele homozygote (CC).e P value calculated by use of Fisher’s Exact Test.f Because of the infrequency of the TT genotype, we combined it with the TC genotype and repeated the analysis. Open table in a new tab Table 3Association of PTPN22 with RF+ and RF− DiseaseResults forCrude AnalysisaValues were calculated by use of standard contingency tables.CLRbCLR was performed as described elsewhere (Breslow and Day 1980) to account for individual matching of controls to cases.CLR, Adjusted for HLAcTo assess whether the observed associations were independent of the HLA-DRB1 genotype, we further adjusted for this known risk factor. For this analysis, samples were binned according to their HLA-DRB1 genotype: high risk, two shared epitopes (HR-2SE); high risk, one shared epitope (HR-1SE; 4K,X; or 4R,X); low risk, one shared epitope (LR-1SE; 1R,X; or 10,X); and low risk, zero shared epitopes (LR-0SE; X,X) (Fries et al. 2002).Data Set and GenotypeRF StatusNo. of CasesNo. of ControlsORdORs were calculated relative to the major allele homozygote (CC).95% CIPORdORs were calculated relative to the major allele homozygote (CC).95% CIPORdORs were calculated relative to the major allele homozygote (CC).95% CIPDiscovery:eAll patients in the discovery cohort are RF+. TT+TCfBecause of the infrequency of the TT genotype, we combined it with the TC genotype for these analyses.RF+125811.741.27–2.38.00051.711.25–2.34.00081.551.07–2.25.02 CCRF+350394………………………Replication: TT+TCfBecause of the infrequency of the TT genotype, we combined it with the TC genotype for these analyses.RF+1191222.381.78–3.18<.00012.361.76–3.16<.000121.35–2.97.0005 CCRF+266648……………………… TT+TCfBecause of the infrequency of the TT genotype, we combined it with the TC genotype for these analyses.RF−17301.17.60–2.29.641.17.60–2.31.641.09.51–2.36.82 CCRF−61126………………………a Values were calculated by use of standard contingency tables.b CLR was performed as described elsewhere (Breslow and Day Breslow and Day, 1980Breslow NE Day NE Statistical methods in cancer research, volume I: the analysis of case-control studies.IARC Sci Publ. 1980; 32: 5-338PubMed Google Scholar) to account for individual matching of controls to cases.c To assess whether the observed associations were independent of the HLA-DRB1 genotype, we further adjusted for this known risk factor. For this analysis, samples were binned according to their HLA-DRB1 genotype: high risk, two shared epitopes (HR-2SE); high risk, one shared epitope (HR-1SE; 4K,X; or 4R,X); low risk, one shared epitope (LR-1SE; 1R,X; or 10,X); and low risk, zero shared epitopes (LR-0SE; X,X) (Fries et al. Fries et al., 2002Fries JF Wolfe F Apple R Erlich H Bugawan T Holmes T Bruce B HLA-DRB1 genotype associations in 793 white patients from a rheumatoid arthritis inception cohort: frequency, severity, and treatment bias.Arthritis Rheum. 2002; 46: 2320-2329Crossref PubMed Scopus (72) Google Scholar).d ORs were calculated relative to the major allele homozygote (CC).e All patients in the discovery cohort are RF+.f Because of the infrequency of the TT genotype, we combined it with the TC genotype for these analyses. Open table in a new tab To evaluate putative modes of inheritance, we combined the two data sets and used the likelihood-ratio test (Breslow and Day Breslow and Day, 1980Breslow NE Day NE Statistical methods in cancer research, volume I: the analysis of case-control studies.IARC Sci Publ. 1980; 32: 5-338PubMed Google Scholar) on the basis of CLR models. Although we could exclude a recessive mode of inheritance (P<.0001), the data are consistent with either additive or dominant modes of inheritance. We also generated estimates of the population-attributable fraction (0.11 for discovery study [95% CI 0.05–0.17]; 0.16 for replication study [95% CI 0.10–0.21]) (Schlesselman Schlesselman, 1982Schlesselman JJ Case-control studies: design, conduct, analysis. Oxford University Press, New York1982Google Scholar; Yang et al. Yang et al., 2003Yang Q Khoury MJ Friedman JM Flanders WD On the use of population attributable fraction to determine sample size for case-control studies of gene-environment interaction.Epidemiology. 2003; 14: 161-167PubMed Google Scholar). However, these estimates should be interpreted with caution, since our cases were selected to have relatively severe disease and thus are not representative of the full clinical spectrum of RA. PTPN22 is located on chromosome 1p13, ∼9 Mb centromeric to a microsatellite marker, D1S1631, that shows linkage to RA in the NARAC sib pairs (SIBPAL P=.0011) (Jawaheer et al. Jawaheer et al., 2003Jawaheer D Seldin MF Amos CI Chen WV Shigeta R Etzel C Damle A et al.Screening the genome for rheumatoid arthritis susceptibility genes: a replication study and combined analysis of 512 multicase families.Arthritis Rheum. 2003; 48: 906-916Crossref PubMed Scopus (190) Google Scholar) from which our replication study was drawn. Linkage analysis using Allegro (Gudbjartsson et al. Gudbjartsson et al., 2000Gudbjartsson DF Jonasson K Frigge ML Kong A Allegro, a new computer program for multipoint linkage analysis.Nat Genet. 2000; 25: 12-13Crossref PubMed Scopus (650) Google Scholar) yielded an insignificant LOD score of 0.15 for PTPN22 R620W in these sib pairs, indicating that R620W is not solely responsible for the linkage seen at D1S1631. We then conducted stratified analyses based on PTPN22 R620W genotypes of the probands. Among 7, 76, and 193 affected sib pairs for which the proband had the TT, TC, or CC genotype, respectively, the respective mean identity-by-descent–sharing values were 0.598, 0.505, and 0.486, and the respective nonparametric linkage (NPL) scores were 0.94, 0.35, and −0.46 (Allegro) (Gudbjartsson et al. Gudbjartsson et al., 2000Gudbjartsson DF Jonasson K Frigge ML Kong A Allegro, a new computer program for multipoint linkage analysis.Nat Genet. 2000; 25: 12-13Crossref PubMed Scopus (650) Google Scholar). These results suggested that R620W has a weak effect on the 1p13 linkage signal. Li et al. (Li et al., 2004Li C Scott LJ Boehnke M Assessing whether an allele can account in part for a linkage signal: the Genotype-IBD Sharing Test (GIST).Am J Hum Genet. 2004; 74: 418-431Abstract Full Text Full Text PDF PubMed Scopus (49) Google Scholar) have recently shown that there is large variability in NPL scores when families are stratified by the genotype of a single, randomly selected sib, and they developed a program (Genotype-IBD Sharing Test [GIST]) in which families are weighted on the basis of the genotypes of all family members. GIST analysis that was conditional on the T allele indicated significant evidence for linkage (P<.0001). The apparent difference between the significance of the stratified analysis and the GIST results may reflect the increased power of the GIST algorithm, or, alternatively, the rarity of the TT genotype may lead to poor convergence of the GIST test statistic to the asymptotic distribution. Taken together, all these data suggest that, whereas the SNP may account for a small part of the linkage signal seen on 1p, it is clearly not responsible for the entire signal. We also genotyped this SNP in several additional control populations (table 4). The observed frequency of the risk allele in 560 additional white subjects (8.4%) was consistent with results in our two RA control populations. When all three control data sets of whites (n=1,961) were combined, the T allele was present in 16.7% of individuals at an allele frequency of 8.7%. The T allele frequency was lower in 99 Mexican Americans (3.5%) and 409 African Americans (2.4%). This allele was not detected in 100 Han Chinese or 21 Africans. A Fisher's Exact Test showed that these differences in allele frequencies were highly significant (P<1×10−10). It will be important to expand these studies to determine whether presence of the T allele in the African American and Mexican American populations is due to admixture with whites.Table 4Frequency of the PTPN22 Risk Allele in Discovery and Replication Control Samples and Other PopulationsPopulationNo. of IndividualsRisk Allele FrequencyDiscovery controls (white)475.088Replication controls (white)926.087Additional whitesaIncluding samples from 360 whites from throughout the United States (obtained by GCI), 147 North American whites (obtained from the National Institute of General Medical Sciences [NIGMS] Human Variation Panels [HVP]), 10 northern Europeans (NIGMS HVP), 9 Italians (NIGMS HVP), 8 Greeks (NIGMS HVP), 20 independent CEPH individuals from Utah (NIGMS), 1 CEPH individual from France (NIGMS), and 5 Adygei (an indigenous Circassian people; NIGMS). GCI obtained informed written consent from all participants and local institutional review board approval at all recruitment sites.560.084African AmericansbIncluding samples from 369 African Americans from throughout the United States obtained by GCI (see footnote a for comments on consent/IRB approval) and 40 African Americans (NIGMS HVP).409.024AfricanscIncluding samples from 9 sub-Saharan Africans (NIGMS HVP), 7 northern Africans (north of the Sahara) (NIGMS HVP), and 5 Pygmies (NIGMS).210Mexican AmericansdIncluding samples from 99 Mexican Americans from Los Angeles (NIGMS HVP).99.035Han ChineseeIncluding samples from 100 Han Chinese from Los Angeles (NIGMS HVP).1000a Including samples from 360 whites from throughout the United States (obtained by GCI), 147 North American whites (obtained from the National Institute of General Medical Sciences [NIGMS] Human Variation Panels [HVP]), 10 northern Europeans (NIGMS HVP), 9 Italians (NIGMS HVP), 8 Greeks (NIGMS HVP), 20 independent CEPH individuals from Utah (NIGMS), 1 CEPH individual from France (NIGMS), and 5 Adygei (an indigenous Circassian people; NIGMS). GCI obtained informed written consent from all participants and local institutional review board approval at all recruitment sites.b Including samples from 369 African Americans from throughout the United States obtained by GCI (see footnote aIncluding samples from 360 whites from throughout the United States (obtained by GCI), 147 North American whites (obtained from the National Institute of General Medical Sciences [NIGMS] Human Variation Panels [HVP]), 10 northern Europeans (NIGMS HVP), 9 Italians (NIGMS HVP), 8 Greeks (NIGMS HVP), 20 independent CEPH individuals from Utah (NIGMS), 1 CEPH individual from France (NIGMS), and 5 Adygei (an indigenous Circassian people; NIGMS). GCI obtained informed written consent from all participants and local institutional review board approval at all recruitment sites. for comments on consent/IRB approval) and 40 African Americans (NIGMS HVP).c Including samples from 9 sub-Saharan Africans (NIGMS HVP), 7 northern Africans (north of the Sahara) (NIGMS HVP), and 5 Pygmies (NIGMS).d Including samples from 99 Mexican Americans from Los Angeles (NIGMS HVP).e Including samples from 100 Han Chinese from Los Angeles (NIGMS HVP). Open table in a new tab PTPN22 encodes a 110-kD cytoplasmic protein tyrosine phosphatase that consists of an N-terminal phosphatase domain and a long C-terminal region containing several proline-rich motifs (Cohen et al. Cohen et al., 1999Cohen S Dadi H Shaoul E Sharfe N Roifman CM Cloning and characterization of a lymphoid-specific, inducible human pro