294 research outputs found
Rare germline variants in DNA repair genes and the angiogenesis pathway predispose prostate cancer patients to develop metastatic disease
Background
Prostate cancer (PrCa) demonstrates a heterogeneous clinical presentation ranging from largely indolent to lethal. We sought to identify a signature of rare inherited variants that distinguishes between these two extreme phenotypes.
Methods
We sequenced germline whole exomes from 139 aggressive (metastatic, age of diagnosis < 60) and 141 non-aggressive (low clinical grade, age of diagnosis ≥60) PrCa cases. We conducted rare variant association analyses at gene and gene set levels using SKAT and Bayesian risk index techniques. GO term enrichment analysis was performed for genes with the highest differential burden of rare disruptive variants.
Results
Protein truncating variants (PTVs) in specific DNA repair genes were significantly overrepresented among patients with the aggressive phenotype, with BRCA2, ATM and NBN the most frequently mutated genes. Differential burden of rare variants was identified between metastatic and non-aggressive cases for several genes implicated in angiogenesis, conferring both deleterious and protective effects.
Conclusions
Inherited PTVs in several DNA repair genes distinguish aggressive from non-aggressive PrCa cases. Furthermore, inherited variants in genes with roles in angiogenesis may be potential predictors for risk of metastases. If validated in a larger dataset, these findings have potential for future clinical application
Development of SNP markers present in expressed genes of the plant-pathogen interaction: Theobroma cacao - Moniliophtora perniciosa
We report the detection, validation and analysis of SNPs in the plant-pathogen interaction between cacao and Moniliophthora perniciosa ESTs using resequencing. This analysis in 73 EST sequences allowed the identification of 185 SNPs, 57% of them corresponding to transversion, 29% to transition and 14% to indels. The ESTs containing SNPs were classified into 14 main functional categories. After validation, 91 SNPs were confirmed, categorized and the parameters of nucleotide diversity and haplotype were calculated. Haplotype-based gene diversity and polymorphic information content (PIC) ranged from 0.559 to 0.56 and 0.115 to 0.12; respectively. Also, it was the advantage when considering haplotypes structure for each locus in place of single SNPs. Most of the gene fragments had a major haplotype combined to a series of low frequency haplotypes. Thus, the re-sequencing approach proved to be a valuable resource to identify useful SNPs for wide genetic applications. Furthermore, the cacao genome sequence availability allow a positional selection of DNA fragments to be re-sequenced enhancing the usefulness of the discovered SNPs. These results indicate the potential use of SNPs markers to identify allelic status of cacao resistance genes through marker-assisted selection to support the development of promising genotypes with high resistance to witch's broom disease. (Résumé d'auteur
Increasing the use of second-line therapy is a cost-effective approach to prevent the spread of drug-resistant HIV: a mathematical modelling study
METHODS: We develop a deterministic mathematical model representing Kampala, Uganda, to predict the prevalence of TDR over a 10-year period. We then compare the impact on TDR and cost-effectiveness of: (1) introduction of pre-therapy genotyping; (2) doubling use of second-line treatment to 80% (50-90%) of patients with confirmed virological failure on first-line ART; and (3) increasing viral load monitoring from yearly to twice yearly. An intervention can be considered cost-effective if it costs less than three times the gross domestic product per capita per quality adjusted life year (QALY) gained, or less than 1612 to 450-dominated) per QALY gained.CONCLUSIONS: While earlier treatment initiation will result in a predicted increase in the proportion of patients infected with drug-resistant HIV, the absolute numbers of patients infected with drug-resistant HIV is predicted to decrease. Increasing use of second-line treatment to all patients with confirmed failure on first-line therapy is a cost-effective approach to reduce TDR. Improving access to second-line ART is therefore a major priority.INTRODUCTION: Earlier antiretroviral therapy (ART) initiation reduces HIV-1 incidence. This benefit may be offset by increased transmitted drug resistance (TDR), which could limit future HIV treatment options. We analyze the epidemiological impact and cost-effectiveness of strategies to reduce TDR
Genome-wide methylation analysis identifies genes silenced in non-seminoma cell lines
Silencing of genes by DNA methylation is a common phenomenon in many types of cancer. However, the genome wide effect of DNA methylation on gene expression has been analysed in relatively few cancers. Germ cell tumours (GCTs) are a complex group of malignancies. They are unique in developing from a pluripotent progenitor cell. Previous analyses have suggested that non-seminomas exhibit much higher levels of DNA methylation than seminomas. The genomic targets that are methylated, the extent to which this results in gene silencing and the identity of the silenced genes most likely to play a role in the tumours’ biology have not yet been established. In this study, genome-wide methylation and expression analysis of GCT cell lines was combined with gene expression data from primary tumours to address this question. Genome methylation was analysed using the Illumina infinium HumanMethylome450 bead chip system and gene expression was analysed using Affymetrix GeneChip Human Genome U133 Plus 2.0 arrays. Regulation by methylation was confirmed by demethylation using 5-aza-2-deoxycytidine and reverse transcription–quantitative PCR. Large differences in the level of methylation of the CpG islands of individual genes between tumour cell lines correlated well with differential gene expression. Treatment of non-seminoma cells with 5-aza-2-deoxycytidine verified that methylation of all genes tested played a role in their silencing in yolk sac tumour cells and many of these genes were also differentially expressed in primary tumours. Genes silenced by methylation in the various GCT cell lines were identified. Several pluripotency-associated genes were identified as a major functional group of silenced genes
Rare disruptive mutations in ciliary function genes contribute to testicular cancer susceptibility
Testicular germ cell tumour (TGCT) is the most common cancer in young men. Here we sought to identify risk factors for TGCT by performing whole-exome sequencing on 328 TGCT cases from 153 families, 634 sporadic TGCT cases and 1,644 controls. We search for genes that are recurrently affected by rare variants (minor allele frequency <0.01) with potentially damaging effects and evidence of segregation in families. A total of 8.7% of TGCT families carry rare disruptive mutations in the cilia-microtubule genes (CMG) as compared with 0.5% of controls (P=2.1 × 10¯⁸). The most significantly mutated CMG is DNAAF1 with biallelic inactivation and loss of DNAAF1 expression shown in tumours from carriers. DNAAF1 mutation as a cause of TGCT is supported by a dnaaf1hu²⁵⁵h(+/−) zebrafish model, which has a 94% risk of TGCT. Our data implicate cilia-microtubule inactivation as a cause of TGCT and provide evidence for CMGs as cancer susceptibility genes
Meta-analysis of five genome-wide association studies identifies multiple new loci associated with testicular germ cell tumor
The international Testicular Cancer Consortium (TECAC) combined five published genome-wide association studies of testicular germ cell tumor (TGCT; 3,558 cases and 13,970 controls) to identify new susceptibility loci. We conducted a fixed-effects meta-analysis, including, to our knowledge, the first analysis of the X chromosome. Eight new loci mapping to 2q14.2, 3q26.2, 4q35.2, 7q36.3, 10q26.13, 15q21.3, 15q22.31, and Xq28 achieved genome-wide significance (P < 5 × 10−8). Most loci harbor biologically plausible candidate genes. We refined previously reported associations at 9p24.3 and 19p12 by identifying one and three additional independent SNPs, respectively. In aggregate, the 39 independent markers identified to date explain 37% of father-to-son familial risk, 8% of which can be attributed to the 12 new signals reported here. Our findings substantially increase the number of known TGCT susceptibility alleles, move the field closer to a comprehensive understanding of the underlying genetic architecture of TGCT, and provide further clues to the etiology of TGCT
ICR142 Benchmarker: evaluating, optimising and benchmarking variant calling performance using the ICR142 NGS validation series.
Evaluating, optimising and benchmarking of next generation sequencing (NGS) variant calling performance are essential requirements for clinical, commercial and academic NGS pipelines. Such assessments should be performed in a consistent, transparent and reproducible fashion, using independently, orthogonally generated data. Here we present ICR142 Benchmarker, a tool to generate outputs for assessing germline base substitution and indel calling performance using the ICR142 NGS validation series, a dataset of Illumina platform-based exome sequence data from 142 samples together with Sanger sequence data at 704 sites. ICR142 Benchmarker provides summary and detailed information on the sensitivity, specificity and false detection rates of variant callers. ICR142 Benchmarker also automatically generates a single page report highlighting key performance metrics and how performance compares to widely-used open-source tools. We used ICR142 Benchmarker with VCF files outputted by GATK, OpEx and DeepVariant to create a benchmark for variant calling performance. This evaluation revealed pipeline-specific differences and shared challenges in variant calling, for example in detecting indels in short repeating sequence motifs. We next used ICR142 Benchmarker to perform regression testing with DeepVariant versions 0.5.2 and 0.6.1. This showed that v0.6.1 improves variant calling performance, but there was evidence of minor changes in indel calling behaviour that may benefit from attention. The data also allowed us to evaluate filters to optimise DeepVariant calling, and we recommend using 30 as the QUAL threshold for base substitution calls when using DeepVariant v0.6.1. Finally, we used ICR142 Benchmarker with VCF files from two commercial variant calling providers to facilitate optimisation of their in-house pipelines and to provide transparent benchmarking of their performance. ICR142 Benchmarker consistently and transparently analyses variant calling performance based on the ICR142 NGS validation series, using the standard VCF input and outputting informative metrics to enable user understanding of pipeline performance. ICR142 Benchmarker is freely available at https://github.com/RahmanTeamDevelopment/ICR142_Benchmarker/releases.This article is freely available online from the publisher's site via Open Access
OpEx - a validated, automated pipeline optimised for clinical exome sequence analysis.
We present an easy-to-use, open-source Optimised Exome analysis tool, OpEx (http://icr.ac.uk/opex) that accurately detects small-scale variation, including indels, to clinical standards. We evaluated OpEx performance with an experimentally validated dataset (the ICR142 NGS validation series), a large 1000 exome dataset (the ICR1000 UK exome series), and a clinical proband-parent trio dataset. The performance of OpEx for high-quality base substitutions and short indels in both small and large datasets is excellent, with overall sensitivity of 95%, specificity of 97% and low false detection rate (FDR) of 3%. Depending on the individual performance requirements the OpEx output allows one to optimise the inevitable trade-offs between sensitivity and specificity. For example, in the clinical setting one could permit a higher FDR and lower specificity to maximise sensitivity. In contexts where experimental validation is not possible, minimising the FDR and improving specificity may be a preferable trade-off for slightly lower sensitivity. OpEx is simple to install and use; the whole pipeline is run from a single command. OpEx is therefore well suited to the increasing research and clinical laboratories undertaking exome sequencing, particularly those without in-house dedicated bioinformatics expertise
The Quality Sequencing Minimum (QSM): providing comprehensive, consistent, transparent next generation sequencing data quality assurance.
Next generation sequencing (NGS) is routinely used in clinical genetic testing. Quality management of NGS testing is essential to ensure performance is consistently and rigorously evaluated. Three primary metrics are used in NGS quality evaluation: depth of coverage, base quality and mapping quality. To provide consistency and transparency in the utilisation of these metrics we present the Quality Sequencing Minimum (QSM). The QSM defines the minimum quality requirement a laboratory has selected for depth of coverage (C), base quality (B) and mapping quality (M) and can be applied per base, exon, gene or other genomic region, as appropriate. The QSM format is CX_BY(P Y)_MZ(P Z). X is the parameter threshold for C, Y the parameter threshold for B, P Y the percentage of reads that must reach Y, Z the parameter threshold for M, P Z the percentage of reads that must reach Z. The data underlying the QSM is in the BAM file, so a QSM can be easily and automatically calculated in any NGS pipeline. We used the QSM to optimise cancer predisposition gene testing using the TruSight Cancer Panel (TSCP). We set the QSM as C50_B10(85)_M20(95). Test regions falling below the QSM were automatically flagged for review, with 100/1471 test regions QSM-flagged in multiple individuals. Supplementing these regions with 132 additional probes improved performance in 85/100. We also used the QSM to optimise testing of genes with pseudogenes such as PTEN and PMS2. In TSCP data from 960 individuals the median number of regions that passed QSM per sample was 1429 (97%). Importantly, the QSM can be used at an individual report level to provide succinct, comprehensive quality assurance information about individual test performance. We believe many laboratories would find the QSM useful. Furthermore, widespread adoption of the QSM would facilitate consistent, transparent reporting of genetic test performance by different laboratories
- …
