25 research outputs found
Establishing the precise evolutionary history of a gene improves prediction of disease-causing missense mutations
PURPOSE: Predicting the phenotypic effects of mutations has become an important application in clinical genetic diagnostics. Computational tools evaluate the behavior of the variant over evolutionary time and assume that variations seen during the course of evolution are probably benign in humans. However, current tools do not take into account orthologous/paralogous relationships. Paralogs have dramatically different roles in Mendelian diseases. For example, whereas inactivating mutations in the NPC1 gene cause the neurodegenerative disorder Niemann-Pick C, inactivating mutations in its paralog NPC1L1 are not disease-causing and, moreover, are implicated in protection from coronary heart disease. METHODS: We identified major events in NPC1 evolution and revealed and compared orthologs and paralogs of the human NPC1 gene through phylogenetic and protein sequence analyses. We predicted whether an amino acid substitution affects protein function by reducing the organism’s fitness. RESULTS: Removing the paralogs and distant homologs improved the overall performance of categorizing disease-causing and benign amino acid substitutions. CONCLUSION: The results show that a thorough evolutionary analysis followed by identification of orthologs improves the accuracy in predicting disease-causing missense mutations. We anticipate that this approach will be used as a reference in the interpretation of variants in other genetic diseases as well. Genet Med 18 10, 1029–1036
Assessing predictions on fitness effects of missense variants in HMBS in CAGI6
This paper presents an evaluation of predictions submitted for the "HMBS" challenge, a component of the sixth round of the Critical Assessment of Genome Interpretation held in 2021. The challenge required participants to predict the effects of missense variants of the human HMBS gene on yeast growth. The HMBS enzyme, critical for the biosynthesis of heme in eukaryotic cells, is highly conserved among eukaryotes. Despite the application of a variety of algorithms and methods, the performance of predictors was relatively similar, with Kendall's tau correlation coefficients between predictions and experimental scores around 0.3 for a majority of submissions. Notably, the median correlation (>= 0.34) observed among these predictors, especially the top predictions from different groups, was greater than the correlation observed between their predictions and the actual experimental results. Most predictors were moderately successful in distinguishing between deleterious and benign variants, as evidenced by an area under the receiver operating characteristic (ROC) curve (AUC) of approximately 0.7 respectively. Compared with the recent two rounds of CAGI competitions, we noticed more predictors outperformed the baseline predictor, which is solely based on the amino acid frequencies. Nevertheless, the overall accuracy of predictions is still far short of positive control, which is derived from experimental scores, indicating the necessity for considerable improvements in the field. The most inaccurately predicted variants in this round were associated with the insertion loop, which is absent in many orthologs, suggesting the predictors still heavily rely on the information from multiple sequence alignment
SURF1 related Leigh syndrome: Clinical and molecular findings of 16 patients from Turkey
Introduction: Pathogenic variants in SURF1, a nuclear-encoded gene encoding a mitochondrial chaperone involved in COX assembly, are one of the most common causes of Leigh syndrome (LS). Material-methods: Sixteen patients diagnosed to have SURF1-related LS between 2012 and 2020 were included in the study. Their clinical, biochemical and molecular findings were recorded. 10/16 patients were diagnosed using whole-exome sequencing (WES), 4/16 by Sanger sequencing of SURF1, 1/16 via targeted exome sequencing and 1/16 patient with whole-genome sequencing (WGS). The pathogenicity of SURF1 variants was evaluated by phylogenetic studies and modelling on the 3D structure of the SURF1 protein. Results: We identified 16 patients from 14 unrelated families who were either homozygous or compound heterozygous for SURF1 pathogenic variants. Nine different SURF1 variants were detected The c.769G > A was the most common variant with an allelic frequency of 42.8% (12/28), c.870dupT [(p.Lys291*); (8/28 28.5%)], c.169delG [(p.Glu57Lysfs*15), (2/24; 7.1%)], c.532 T > A [(p.Tyr178Asn); (2/28, 7.1%)], c.653_654delCT [(p.Pro218Argfs*29); (4/28, 14.2%)] c.595_597delGGA [(p.Gly199del); (1/28, 3.5%)], c.751 + 1G > A (2/28, 4.1%), c.356C > T [(p.Pro119Leu); (2/28, 3.5%)] were the other detected variants. Two pathogenic variants, C.595_597delGGA and c.356C > T, were detected for the first time. The c.769 G > A variant detected in 6 patients from 5 families was evaluated in terms of phenotype-genotype correlation. There was no definite genotype – phenotype correlation. Conclusions: To date, more than 120 patients of LS with SURF1 pathogenic variants have been reported. We shared the clinical, molecular data and natural course of 16 new SURF1 defect patients from our country. This study is the first comprehensive research from Turkey that provides information about disease-causing variants in the SURF1 gene. The identification of common variants and phenotype of the SURF1 gene is important for understanding SURF1 related LS. Synopsis: SURF1 gene defects are one of the most important causes of LS; patients have a homogeneous clinical and biochemical phenotype. © 2020 The AuthorsG0800674 Biotechnology and Biological Sciences Research Council, BBSRC: L016354 European Molecular Biology Organization, EMBO EMBO Türkiye Bilimsel ve Teknolojik Araştirma Kurumu, TÜBITAK Bilim Akademisi Newcastle upon Tyne Hospitals NHS Foundation Trust NIHR Bristol Biomedical Research Centre Lily Mae Foundation Medical Research Council, MRCRM and RWT are supported by the Wellcome Centre for Mitochondrial Research ( 203105/Z/16/Z ), the Medical Research Council (MRC) International Centre for Genomic Medicine in Neuromuscular Disease, the Mitochondrial Disease Patient Cohort (UK) ( G0800674 ), Newcastle University Centre for Ageing and Vitality (supported by the Biotechnology and Biological Sciences Research Council and Medical Research Council L016354 ), UK NIHR Biomedical Research Centre for Ageing and Age-related disease award to the Newcastle upon Tyne Hospitals NHS Foundation Trust , the Lily Foundation and the UK NHS Specialist Commissioners which funds the “Rare Mitochondrial Disorders of Adults and Children” Diagnostic Service in Newcastle upon Tyne ( http://www.newcastle-mitochondria.com . OA is supported by the European Molecular Biology Organization (EMBO) Installation grant funded by the Scientific and Technological Research Council of Turkey (TUBITAK); Science Academy, Turkey; TUBITAK International Fellowship for Outstanding Researchers Programme
CDvist: a webserver for identification and visualization of conserved domains in protein sequences
Genome-wide mapping of nucleotide excision repair with XR-seq
Nucleotide excision repair is a versatile mechanism to repair a variety of bulky DNA adducts. We developed excision repair sequencing (XR-seq) to study nucleotide excision repair of DNA adducts in humans, mice, Arabidopsis thaliana, yeast and Escherichia coli. In this protocol, the excised oligomers, generated in the nucleotide excision repair reaction, are isolated by cell lysis and fractionation, followed by immunoprecipitation with damage-or repair factor-specific antibodies from the non-chromatin fraction. The single-stranded excised oligomers are ligated to adapters and reimmunoprecipitated with damage-specific antibodies. The DNA damage in the excised oligomers is then reversed by enzymatic or chemical reactions before being converted into a sequencing library by PCR amplification. Alternatively, the excised oligomers containing DNA damage, especially those containing irreversible DNA damage such as benzo[a] pyrene-induced DNA adducts, can be converted to a double-stranded DNA (dsDNA) form by using appropriate translesion DNA synthesis (TLS) polymerases and then can be amplified by PCR. The current genome-wide approaches for studying repair measure the loss of damage signal with time, which limits their resolution. By contrast, an advantage of XR-seq is that the repair signal is directly detected above a background of zero. An XR-seq library using the protocol described here can be obtained in 7-9 d
