Behçet's disease is a chronic systemic inflammatory disease that remains incompletely understood. Herein, we perform the first genome-wide association study in Behçet's disease.
Using DNA pooling technology and the Affymetrix 500K arrays, we identified possible candidate gene associations with Behçet's disease in a cohort of 152 Behçet's disease patients and 172 healthy ethnically matched controls. Genetic loci that were identified in the pooling study were genotyped in patients and controls using TaqMan genotyping technology.
We identified genetic associations between Behçet's disease and single-nucleotide polymorphisms (SNPs) in KIAA1529, CPVL, LOC100129342, UBASH3B, and UBAC2 (odds ratio = 2.04, 2.26, 1.84, 1.71, and 1.61, respectively; P value = 4.2 × 10-5, 1.0 × 10-4, 3.0 × 10-4, 1.5 × 10-3, and 5.8 × 10-3, respectively). Among the associated SNPs, the Behçet's disease-risk allele in rs2061634 leads to substitution of serine to cysteine at amino acid position 995 (S995C) in the KIAA1529 protein.
Using an unbiased whole-genome genetic association approach, we identified novel candidate genetic loci that are associated with increased susceptibility for Behçet's disease. These findings will help to better understand the pathogenesis of Behçet's disease and identify novel targets for therapeutic intervention.
Behçet's disease is a chronic relapsing systemic inflammatory disease characterized by the presence of oro-genital ulcers, cutaneous manifestations, and uveitis. The disease can also lead to vascular complications such as arterial and venous thrombosis, central nervous system vasculitis, arthritis, and gastrointestinal involvement . The etiology of Behçet's disease is not fully understood. Therefore, treatment remains insufficient and relies on non-specific immunosuppressive medications, with significant side effects. Evidence for a genetic contribution to the pathogenesis of the disease comes from strong familial aggregation, the strong predominance in patients with Mediterranean or Asian ancestry, and the confirmed association with HLA-B51 in several ethnic groups [2-4].
It is estimated that the association with HLA-B51 in Behçet's disease accounts for only ~20% of the relative risk in siblings of affected individuals . This suggests that other genetic elements outside the HLA region carry risk for developing Behçet's disease. Indeed, a genetic linkage study in a cohort of Behçet's disease multiplex families identified evidence for linkage (P ≤ 0.05) on 16 chromosomal regions: 1p36, 4p15, 5q12, 5q23, 6p22-24, 6q16, 6q25-26, 7p21, 10q24, 12p12-13, 12q13, 16q12, 16q21-23, 17p13, 20q12-13, and Xq26-28 . Genetic association studies performed via genotyping single-nucleotide polymorphisms (SNPs) in candidate genes have been performed. These revealed genetic associations with several genes, including IRF1 (interferon regulatory factor 1) , TNF (tumor necrosis factor) [7,8], and PTPN22 (protein tyrosine phosphatase, non-receptor type 22) .
Despite efforts to discover the genetic contribution to the disease etiology, with the exception of the strong association in the HLA region, other genetic associations are largely unconfirmed. We hypothesize that novel genetic loci for Behçet's disease will be identified using an unbiased genome-wide association approach. Herein, we used a DNA pooling technology to perform the first genome-wide association study in Behçet's disease. We identify five novel genetic loci that are associated with the susceptibility to develop Behçet's disease in a cohort of Turkish Behçet's disease patients and controls.
Materials and methods
Patients and controls
We studied a cohort of Behçet's disease patients and controls recruited at Marmara University Medical School, Istanbul, Turkey. Our cohort consisted of 152 Behçet's disease patients and 172 ethnically matched normal healthy controls. All patients fulfilled the 1990 International Study Group classification criteria for Behçet's disease . DNA was extracted from peripheral blood mononuclear cells using standard techniques. All protocols were approved by the institutional review boards or the research ethics committees at Marmara University, the University of Oklahoma Health Sciences Center, and the Oklahoma Medical Research Foundation. All patients provided written informed consent.
DNA pooling studies
For the initial discovery phase of novel genetic associations, we used a DNA pooling genotyping methodology. DNA pools were generated from cases (five pools) and controls (five pools). DNA samples were first run on a 0.8% agarose gel to examine the quality of DNA (high molecular weight) and absence of RNA contamination. The DNA quality and quantity were then determined using a NanoDrop® spectrophotometer (NanoDrop Technologies, Inc., now part of Thermo Fisher Scientific Inc., Waltham, MA, USA). Three hundred twenty-two out of 324 DNA samples had a 260/280 ratio between 1.65 and 2.0 and a 260/230 ratio between 1.0 and 2.2 and were used for the pooling experiment. Samples were then serially diluted to a concentration of 17 to 23 ng/μL. Following the dilution, each DNA sample was read twice on the NanoDrop and an average was used to determine the concentration. The number of samples in each pool dictated the amount each sample contributes to the pool. The final volume of each pool was 46.5 μL. This represents a pool for each of the restriction enzymes used (15.5 μL each) and a test pool (15.5 μL). The final amount of total genomic DNA in each pool was 250 ng. After the pools were generated, the exact protocol developed by Affymetrix (Santa Clara, CA, USA) for a single DNA sample was followed. This included digestion with the appropriate restriction enzyme, ligation of linkers, amplification of targeted DNA, purification of amplified DNA, labeling, hybridization, and scanning. The Affymetrix GeneChip® Human Mapping 500K Array was used. Scanning and analysis were performed using GTYPE GeneChip Genotyping Analysis Software version 188.8.131.52 and GCOS GeneChip Operating Software version 1.4.0.036. The GTYPE GeneChip Genotyping Analysis software generates the cel file that contains the signal intensity information in which allele frequencies were determined using software developed in-house by personnel at JKA Genomics (Oklahoma City, OK, USA). GeneChip Genotyping 4.0 software output was used for relative quality control assessment of detection rates (>98.5%) and allele distributions (<10% difference between pools).
Candidate genes discovered in our pooling experiment were subsequently genotyped in patients with Behçet's disease and controls using a case-control genetic association approach and the TaqMan SNP Genotyping Assays (Applied Biosystems, Foster City, CA, USA). Allele frequencies and odds ratios (ORs) were determined. The Pearson chi-square values and P values were calculated using Haploview 4.1 . Hardy-Weinberg equilibrium (HWE) P value measures the difference between the observed genotype frequency and the expected genotype frequency based on the observed allele frequency and was determined to examine the randomness of mating and lack of relatedness in our study population. Permutation P values for the allele frequency difference were calculated to correct for multiple testing using Haploview 4.1 .
We studied a cohort of 152 Behçet's disease patients and 172 ethnically matched controls. Our patient groups included 81 (53.3%) men and 71 (46.7%) women. The control group included 79 (45.9%) men and 93 (54.1%) women. The average age of patients at the time of enrollment was 36.4 ± 11.7 years (mean ± standard deviation, SD), and the average age for control participants was 34.2 ± 11.4 years (mean ± SD). There was no significant difference in age between patients and controls (P > 0.05). The clinical characteristics of our Behçet's patients included in this study are summarized in Table 1.
Table 1. Clinical characteristics of the Behçet's disease cohort studied
The genome-wide scanning included pooled DNA from 152 Behçet's disease patients and 170 controls (2 DNA samples from controls did not pass DNA quality measures and were excluded from the pooling experiment). Genes with at least one SNP with an association P value of less than 1 × 10-5 or at least 2 SNPs with an association P value of less than 1 × 10-4 were identified as possible candidate genes for Behçet's disease and were selected for individual case-control genotyping.
We have identified 10 possible candidate genes based on the unbiased genome-wide pooling association study. Individual case-control genotyping using TaqMan SNP Genotyping Assays confirmed the genetic association between SNPs in five genes that we identify as novel genetic susceptibility loci for Behçet's disease. All SNPs had an HWE P value of greater than 0.05 (Table 2). Genotyping success rate was at least 96% in the SNPs genotyped, with the exception of the SNP rs4949332, which had a genotyping success rate of 90%.
Table 2. Genetic association between candidate single-nucleotide polymorphisms identified using DNA pooling in a cohort of Behçet's disease patients and controls
Among the polymorphisms studied, the SNP rs2061634 located within the gene KIAA1529 on chromosome 9q22 showed the strongest association with Behçet's disease in our cohort. Consequently, we propose the name 'Behçet's disease-associated gene 1' (BDAG1) as a synonym for KIAA1529. The frequency of rs2061634 risk allele (G) was 42.7% in Behçet's disease cases compared with 26.7% in controls (OR = 2.04, P = 4.2 × 10-5) (Table 2). The disease-risk allele in rs2061634 leads to a substitution of serine to cysteine at amino acid position 995 (S995C) in the KIAA1529 protein.
The SNP rs317711 located within the gene CPVL (carboxypeptidase, vitellogenic-like) on chromosome 7p15 had a risk allele frequency (C) of 25.5% in cases compared with 13.2% in controls (OR = 2.26, P = 1.0 × 10-4). Allele G in rs11206377 located within the gene LOC100129342 on chromosome 1p34 had a frequency of 66.1% in cases and 51.4% in controls (OR = 1.84, P = 3.0 × 10-4). The SNPs rs9513584 and rs4936742 located within the genes UBAC2 (ubiquitin-associated domain [UBA] containing 2) and UBASH3B (ubiquitin associated and SH3 domain containing, B) were also associated with Behçet's disease with P values of 5.8 × 10-3 and 1.5 × 10-3, respectively (Table 2). However, the association between rs9513584 within UBAC2 and Behçet's disease is not significant after correction for multiple testing.
We next determined the genotype frequency differences between patients with Behçet's disease compared with controls. The presence of the homozygous risk genotype was significantly more frequent in patients than controls in all five of the SNPs that are associated with Behçet's disease in our study (Table 3).
Table 3. Genotype frequencies of the Behçet's disease-associated single-nucleotide polymorphisms identified in this study
A subset analysis was performed to examine the difference in allele frequencies in clinical subsets of Behçet's disease compared with normal controls (Table 4). Interpreting these data is limited by the small number of patients studied. However, the data suggest that the risk allele in rs4936742 is more common in patients with eye involvement and with vascular involvement, suggesting that the disease-associated polymorphism in UBASH3B might predispose to eye and vascular complications in patients with Behçet's disease. Interestingly, the risk allele in rs2061634 (KIAA1529) is more common in patients without eye or vascular involvement (Table 4).
Table 4. Subset analysis showing genetic association between clinical subsets of patients with Behçet's disease compared with normal healthy controls
Behçet's disease is a chronic relapsing inflammatory disease associated with debilitating clinical consequences. The disease is more frequent in regions along the 'Silk Road'; however, cases have been reported in all ethnic groups. The pathogenesis of Behçet's disease remains incompletely understood. We have performed a genome-wide genetic association study using a cohort of Turkish Behçet's disease patients and controls. We identified novel genetic susceptibility loci for Behçet's disease in KIAA1529, CPVL, LOC100129342, and UBASH3B. A probable association with UBAC2 was also identified.
KIAA1529 and LOC100129342 are located on chromosome 9q22 and 1p34, respectively, and have no known function to date. The association between polymorphisms within KIAA1529 and LOC100129342 with Behçet's disease suggests a potential role for these unknown genes in regulating the immune response. LOC100129342 is located on chromosome 1p34 close to 1p36, a susceptibility region previously identified in families with Behçet's disease by genetic linkage studies . It is possible that LOC100129342 explains or contributes to the genetic linkage effect in that region.
The genes UBASH3B and UBAC2 both contain a UBA, suggesting that both gene products are involved in the ubiquitination pathway. Interestingly, a polymorphism in a third ubiquitin-related gene, SUMO4 (small ubiquitin-like modifier 4), was associated with the risk to develop Behçet's disease in Chinese patients . Ubiquitination reactions are involved in diverse biological functions, including regulating receptor tyrosine kinase signaling by modulating endocytosis and degradation of activated receptors . UBASH3B has been shown to negatively regulate T-cell receptor signaling . In mice deficient in both UBASH3B and UBASH3A (both are related proteins with 75% homology), T cells are hyper-responsive to T-cell receptor stimulation and demonstrate a marked increase in cytokine production. Furthermore, double-knockout mice had increased incidence and severity of experimental autoimmune encephalomyelitis (EAE) compared with wild-type mice . These data suggest that an abnormal ubiquitination pathway might be involved in the pathogenesis of Behçet's disease.
CPVL gene encodes for a carboxypeptidase that belongs to a large group of proteases that cleave a single amino acid from the carboxy terminus of proteins or peptides. CPVL is expressed on human macrophages but not in other types of leukocytes . The exact nature and extent of function of this carboxypeptidase are yet to be determined. A polymorphism in this gene would potentially affect the function of any of the peptides cleaved by this enzyme and therefore could interfere with normal macrophage function or alter innate immune responses.
We performed a genome-wide association study and identified novel candidate genetic loci that confer increased susceptibility for Behçet's disease. Further work to replicate these findings in independent cohorts, to identify the disease-causing polymorphisms in these loci, and to determine the functional consequences of these polymorphisms will help us to better understand the pathogenesis of Behçet's disease and identify novel therapeutic targets against this disease.
CI: confidence interval; CPVL: carboxypeptidase, vitellogenic-like; HWE: Hardy-Weinberg equilibrium; OR: odds ratio; SD: standard deviation; SNP: single-nucleotide polymorphism; UBA: ubiquitin-associated domain; UBAC2: ubiquitin-associated domain containing 2; UBASH3A: ubiquitin associated and SH3 domain containing, A; UBASH3B: ubiquitin associated and SH3 domain containing, B.
The authors declare that they have no competing interests.
YF performed the acquisition of data. RW performed the acquisition and analysis of data. BLC performed the analysis of pooling data. HD and GS-D provided DNA samples, made a substantial contribution to the acquisition of data, and critically revised the manuscript. AHS designed the experiments, performed data analysis and interpretation, and wrote the manuscript. All authors read and approved the final manuscript.
This work was made possible by funding from National Institutes of Health (NIH) grant number P20-RR015577 from the National Center for Research Resources, NIH grant number R03AI076729 from the National Institute of Allergy and Infectious Diseases, funding from the University of Oklahoma College of Medicine (to AHS), and support from the Oklahoma City VA and the Oklahoma Medical Research Foundation.
Mizuki N, Ota M, Katsuyama Y, Yabuki K, Ando H, Shiina T, Palimeris GD, Kaklamani E, Ito D, Ohno S, Inoko H: Sequencing-based typing of HLA-B*51 alleles and the significant association of HLA-B*5101 and -B*5108 with Behcet's disease in Greek patients.
Mizuki N, Ota M, Katsuyama Y, Yabuki K, Ando H, Shiina T, Nomura E, Onari K, Ohno S, Inoko H: HLA-B*51 allele analysis by the PCR-SBT method and a strong association of HLA-B*5101 with Japanese patients with Behcet's disease.
Lancet 1990, 335:1078-1080. PubMed Abstract