2,023 research outputs found
The promoter polymorphism -232C/G of the PCK1 gene is associated with type 2 diabetes in a UK-resident South Asian population
Background: The PCK1 gene, encoding cytosolic phosphoenolpyruvate carboxykinase (PEPCK-C), has previously been implicated as a candidate gene for type 2 diabetes (T2D) susceptibility. Rodent models demonstrate that over-expression of Pck1 can result in T2D development and a single nucleotide polymorphism (SNP) in the promoter region of human PCK1 (-232C/G) has exhibited significant association with the disease in several cohorts. Within the UK-resident South Asian population, T2D is 4 to 6 times more common than in indigenous white Caucasians. Despite this, few studies have reported on the genetic susceptibility to T2D in this ethnic group and none of these has investigated the possible effect of PCK1 variants. We therefore aimed to investigate the association between common variants of the PCK1 gene and T2D in a UK-resident South Asian population of Punjabi ancestry, originating predominantly from the Mirpur area of Azad Kashmir, Pakistan. \ud
\ud
Methods: We used TaqMan assays to genotype five tagSNPs covering the PCK1 gene, including the -232C/G variant, in 903 subjects with T2D and 471 normoglycaemic controls. \ud
\ud
Results: Of the variants studied, only the minor allele (G) of the -232C/G SNP demonstrated a significant association with T2D, displaying an OR of 1.21 (95% CI: 1.03 - 1.42, p = 0.019). \ud
\ud
Conclusion: This study is the first to investigate the association between variants of the PCK1 gene and T2D in South Asians. Our results suggest that the -232C/G promoter polymorphism confers susceptibility to T2D in this ethnic group. \ud
\ud
Trial registration: UKADS Trial Registration: ISRCTN38297969
Common variants of the TCF7L2 gene are associated with increased risk of type 2 diabetes mellitus in a UK-resident South Asian population
Background
Recent studies have implicated variants of the transcription factor 7-like 2 (TCF7L2) gene in genetic susceptibility to type 2 diabetes mellitus in several different populations. The aim of this study was to determine whether variants of this gene are also risk factors for type 2 diabetes development in a UK-resident South Asian cohort of Punjabi ancestry.
Methods
We genotyped four single nucleotide polymorphisms (SNPs) of TCF7L2 (rs7901695, rs7903146, rs11196205 and rs12255372) in 831 subjects with diabetes and 437 control subjects.
Results
The minor allele of each variant was significantly associated with type 2 diabetes; the greatest risk of developing the disease was conferred by rs7903146, with an allelic odds ratio (OR) of 1.31 (95% CI: 1.11 – 1.56, p = 1.96 × 10-3). For each variant, disease risk associated with homozygosity for the minor allele was greater than that for heterozygotes, with the exception of rs12255372. To determine the effect on the observed associations of including young control subjects in our data set, we reanalysed the data using subsets of the control group defined by different minimum age thresholds. Increasing the minimum age of our control subjects resulted in a corresponding increase in OR for all variants of the gene (p ≤ 1.04 × 10-7).
Conclusion
Our results support recent findings that TCF7L2 is an important genetic risk factor for the development of type 2 diabetes in multiple ethnic groups
Latent class analysis variable selection
We propose a method for selecting variables in latent class analysis, which is the most common model-based clustering method for discrete data. The method assesses a variable's usefulness for clustering by comparing two models, given the clustering variables already selected. In one model the variable contributes information about cluster allocation beyond that contained in the already selected variables, and in the other model it does not. A headlong search algorithm is used to explore the model space and select clustering variables. In simulated datasets we found that the method selected the correct clustering variables, and also led to improvements in classification performance and in accuracy of the choice of the number of classes. In two real datasets, our method discovered the same group structure with fewer variables. In a dataset from the International HapMap Project consisting of 639 single nucleotide polymorphisms (SNPs) from 210 members of different groups, our method discovered the same group structure with a much smaller number of SNP
GRIMP: A web- and grid-based tool for high-speed analysis of large-scale genome-wide association using imputed data.
The current fast growth of genome-wide association
studies (GWAS) combined with now common computationally
expensive imputation requires the online access of large user groups to high-performance computing resources capable of analyzing rapidly and efficiently millions of genetic markers for ten thousands of individuals. Here, we present a web-based interface—called GRIMP—to run publicly available genetic software for extremely large GWAS on scalable super-computing grid infrastructures. This is of major importance for the enlargement of GWAS with the availability
of whole-genome sequence data from the 1000 Genomes Project
and for future whole-population efforts
Different genes interact with particulate matter and tobacco smoke exposure in affecting lung function decline in the general population
BACKGROUND: Oxidative stress related genes modify the effects of ambient air pollution or tobacco smoking on lung function decline. The impact of interactions might be substantial, but previous studies mostly focused on main effects of single genes. OBJECTIVES: We studied the interaction of both exposures with a broad set of oxidative-stress related candidate genes and pathways on lung function decline and contrasted interactions between exposures. METHODS: For 12679 single nucleotide polymorphisms (SNPs), change in forced expiratory volume in one second (FEV(1)), FEV(1) over forced vital capacity (FEV(1)/FVC), and mean forced expiratory flow between 25 and 75% of the FVC (FEF(25-75)) was regressed on interval exposure to particulate matter >10 microm in diameter (PM10) or packyears smoked (a), additive SNP effects (b), and interaction terms between (a) and (b) in 669 adults with GWAS data. Interaction p-values for 152 genes and 14 pathways were calculated by the adaptive rank truncation product (ARTP) method, and compared between exposures. Interaction effect sizes were contrasted for the strongest SNPs of nominally significant genes (p(interaction)>0.05). Replication was attempted for SNPs with MAF<10% in 3320 SAPALDIA participants without GWAS. RESULTS: On the SNP-level, rs2035268 in gene SNCA accelerated FEV(1)/FVC decline by 3.8% (p(interaction) = 2.5x10(-6)), and rs12190800 in PARK2 attenuated FEV1 decline by 95.1 ml p(interaction) = 9.7x10(-8)) over 11 years, while interacting with PM10. Genes and pathways nominally interacting with PM10 and packyears exposure differed substantially. Gene CRISP2 presented a significant interaction with PM10 (p(interaction) = 3.0x10(-4)) on FEV(1)/FVC decline. Pathway interactions were weak. Replications for the strongest SNPs in PARK2 and CRISP2 were not successful. CONCLUSIONS: Consistent with a stratified response to increasing oxidative stress, different genes and pathways potentially mediate PM10 and tobac smoke effects on lung function decline. Ignoring environmental exposures would miss these patterns, but achieving sufficient sample size and comparability across study samples is challengin
Quantifying single nucleotide variant detection sensitivity in exome sequencing
BACKGROUND: The targeted capture and sequencing of genomic regions has rapidly demonstrated its utility in genetic studies. Inherent in this technology is considerable heterogeneity of target coverage and this is expected to systematically impact our sensitivity to detect genuine polymorphisms. To fully interpret the polymorphisms identified in a genetic study it is often essential to both detect polymorphisms and to understand where and with what probability real polymorphisms may have been missed. RESULTS: Using down-sampling of 30 deeply sequenced exomes and a set of gold-standard single nucleotide variant (SNV) genotype calls for each sample, we developed an empirical model relating the read depth at a polymorphic site to the probability of calling the correct genotype at that site. We find that measured sensitivity in SNV detection is substantially worse than that predicted from the naive expectation of sampling from a binomial. This calibrated model allows us to produce single nucleotide resolution SNV sensitivity estimates which can be merged to give summary sensitivity measures for any arbitrary partition of the target sequences (nucleotide, exon, gene, pathway, exome). These metrics are directly comparable between platforms and can be combined between samples to give “power estimates” for an entire study. We estimate a local read depth of 13X is required to detect the alleles and genotype of a heterozygous SNV 95% of the time, but only 3X for a homozygous SNV. At a mean on-target read depth of 20X, commonly used for rare disease exome sequencing studies, we predict 5–15% of heterozygous and 1–4% of homozygous SNVs in the targeted regions will be missed. CONCLUSIONS: Non-reference alleles in the heterozygote state have a high chance of being missed when commonly applied read coverage thresholds are used despite the widely held assumption that there is good polymorphism detection at these coverage levels. Such alleles are likely to be of functional importance in population based studies of rare diseases, somatic mutations in cancer and explaining the “missing heritability” of quantitative traits
Genome-wide associations of gene expression variation in humans
The exploration of quantitative variation in human populations has become one of the major priorities for medical genetics. The successful identification of variants that contribute to complex traits is highly dependent on reliable assays and genetic maps. We have performed a genome-wide quantitative trait analysis of 630 genes in 60 unrelated Utah residents with ancestry from Northern and Western Europe using the publicly available phase I data of the International HapMap project. The genes are located in regions of the human genome with elevated functional annotation and disease interest including the ENCODE regions spanning 1% of the genome, Chromosome 21 and Chromosome 20q12-13.2. We apply three different methods of multiple test correction, including Bonferroni, false discovery rate, and permutations. For the 374 expressed genes, we find many regions with statistically significant association of single nucleotide polymorphisms (SNPs) with expression variation in lymphoblastoid cell lines after correcting for multiple tests. Based on our analyses, the signal proximal (cis-) to the genes of interest is more abundant and more stable than distal and trans across statistical methodologies. Our results suggest that regulatory polymorphism is widespread in the human genome and show that the 5-kb (phase I) HapMap has sufficient density to enable linkage disequilibrium mapping in humans. Such studies will significantly enhance our ability to annotate the non-coding part of the genome and interpret functional variation. In addition, we demonstrate that the HapMap cell lines themselves may serve as a useful resource for quantitative measurements at the cellular level
Interleukin-6 gene (IL-6): a possible role in brain morphology in the healthy adult brain
Background: Cytokines such as interleukin 6 (IL-6) have been implicated in dual functions in neuropsychiatric disorders. Little is known about the genetic predisposition to neurodegenerative and neuroproliferative properties of cytokine genes. In this study the potential dual role of several IL-6 polymorphisms in brain morphology is investigated. Methodology: In a large sample of healthy individuals (N = 303), associations between genetic variants of IL-6 (rs1800795; rs1800796, rs2069833, rs2069840) and brain volume (gray matter volume) were analyzed using voxel-based morphometry (VBM). Selection of single nucleotide polymorphisms (SNPs) followed a tagging SNP approach (e.g., Stampa algorigthm), yielding a capture 97.08% of the variation in the IL-6 gene using four tagging SNPs. Principal findings/results: In a whole-brain analysis, the polymorphism rs1800795 (−174 C/G) showed a strong main effect of genotype (43 CC vs. 150 CG vs. 100 GG; x = 24, y = −10, z = −15; F(2,286) = 8.54, puncorrected = 0.0002; pAlphaSim-corrected = 0.002; cluster size k = 577) within the right hippocampus head. Homozygous carriers of the G-allele had significantly larger hippocampus gray matter volumes compared to heterozygous subjects. None of the other investigated SNPs showed a significant association with grey matter volume in whole-brain analyses. Conclusions/significance: These findings suggest a possible neuroprotective role of the G-allele of the SNP rs1800795 on hippocampal volumes. Studies on the role of this SNP in psychiatric populations and especially in those with an affected hippocampus (e.g., by maltreatment, stress) are warranted.Bernhard T Baune, Carsten Konrad, Dominik Grotegerd, Thomas Suslow, Eva Birosova, Patricia Ohrmann, Jochen Bauer, Volker Arolt, Walter Heindel, Katharina Domschke, Sonja Schöning, Astrid V Rauch, Christina Uhlmann, Harald Kugel and Udo Dannlowsk
Next generation analytic tools for large scale genetic epidemiology studies of complex diseases
Over the past several years, genome‐wide association studies (GWAS) have succeeded in identifying hundreds of genetic markers associated with common diseases. However, most of these markers confer relatively small increments of risk and explain only a small proportion of familial clustering. To identify obstacles to future progress in genetic epidemiology research and provide recommendations to NIH for overcoming these barriers, the National Cancer Institute sponsored a workshop entitled “Next Generation Analytic Tools for Large‐Scale Genetic Epidemiology Studies of Complex Diseases” on September 15–16, 2010. The goal of the workshop was to facilitate discussions on (1) statistical strategies and methods to efficiently identify genetic and environmental factors contributing to the risk of complex disease; and (2) how to develop, apply, and evaluate these strategies for the design, analysis, and interpretation of large‐scale complex disease association studies in order to guide NIH in setting the future agenda in this area of research. The workshop was organized as a series of short presentations covering scientific (gene‐gene and gene‐environment interaction, complex phenotypes, and rare variants and next generation sequencing) and methodological (simulation modeling and computational resources and data management) topic areas. Specific needs to advance the field were identified during each session and are summarized. Genet. Epidemiol . 36 : 22–35, 2012. © 2011 Wiley Periodicals, Inc.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/93578/1/gepi20652.pd
Data analysis issues for allele-specific expression using Illumina's GoldenGate assay.
BACKGROUND: High-throughput measurement of allele-specific expression (ASE) is a relatively new and exciting application area for array-based technologies. In this paper, we explore several data sets which make use of Illumina's GoldenGate BeadArray technology to measure ASE. This platform exploits coding SNPs to obtain relative expression measurements for alleles at approximately 1500 positions in the genome. RESULTS: We analyze data from a mixture experiment where genomic DNA samples from pairs of individuals of known genotypes are pooled to create allelic imbalances at varying levels for the majority of SNPs on the array. We observe that GoldenGate has less sensitivity at detecting subtle allelic imbalances (around 1.3 fold) compared to extreme imbalances, and note the benefit of applying local background correction to the data. Analysis of data from a dye-swap control experiment allowed us to quantify dye-bias, which can be reduced considerably by careful normalization. The need to filter the data before carrying out further downstream analysis to remove non-responding probes, which show either weak, or non-specific signal for each allele, was also demonstrated. Throughout this paper, we find that a linear model analysis of the data from each SNP is a flexible modelling strategy that allows for testing of allelic imbalances in each sample when replicate hybridizations are available. CONCLUSIONS: Our analysis shows that local background correction carried out by Illumina's software, together with quantile normalization of the red and green channels within each array, provides optimal performance in terms of false positive rates. In addition, we strongly encourage intensity-based filtering to remove SNPs which only measure non-specific signal. We anticipate that a similar analysis strategy will prove useful when quantifying ASE on Illumina's higher density Infinium BeadChips.RIGHTS : This article is licensed under the BioMed Central licence at http://www.biomedcentral.com/about/license which is similar to the 'Creative Commons Attribution Licence'. In brief you may : copy, distribute, and display the work; make derivative works; or make commercial use of the work - under the following conditions: the original author must be given credit; for any reuse or distribution, it must be made clear to others what the license terms of this work are
- …
