513 research outputs found

    Induction, expression and characterisation of laccase genes from the marine-derived fungal strains Nigrospora sp. CBMAI 1328 and Arthopyrenia sp. CBMAI 1330

    Get PDF
    The capability of the fungi Nigrospora sp. CBMAI 1328 and Arthopyrenia sp. CBMAI 1330 isolated from marine sponge to synthesise laccases (Lcc) in the presence of the inducer copper (110 M) was assessed. In a liquid culture medium supplemented with 5 M of copper sulphate after 5 days of incubation, Nigrospora sp. presented the highest Lcc activity (25.2 UL1). The effect of copper on Lcc gene expression was evaluated by reverse transcriptase polymerase chain reaction. Nigrospora sp. showed the highest gene expression of Lcc under the same conditions of Lcc synthesis. The highest Lcc expression by the Arthopyrenia sp. was detected at 96 h of incubation in absence of copper. Molecular approaches allowed the detection of Lcc isozymes and suggest the presence of at least two undescribed putative genes. Additionally, Lcc sequences from the both fungal strains clustered with other Lcc sequences from other fungi that inhabit marine environments.M. Passarini was supported by Ph.D. grant from FAPESP (2008/06720-7), Sao Paulo, Brazil. The authors thank FAPESP for financial support (BIOTA-FAPESP grant 2010/50190-2 and FAPESP grant 2013/19486-0) and Roberto G.S. Berlinck and CEBIMAR for the support related to samples collecting. L.D. Sette thanks CNPq for Productivity Fellowships 304103/2013-6

    Computational prediction and interpretation of druggable proteins using a stacked ensemble-learning framework

    Get PDF
    Discovery of potential drugs requires rapid and precise identification of drug targets. Although traditional experimental methodologies can accurately identify drug targets, they are time-consuming and inappropriate for high-throughput screening. Computational approaches based on machine learning (ML) algorithms can expedite the prediction of druggable proteins; however, the performance of the existing computational methods remains unsatisfactory. This study proposes a computational tool, SPIDER, to enhance the accurate prediction of druggable proteins. SPIDER employs various feature descriptors pertaining to several aspects, including physicochemical properties, compositional information, and composition-transition-distribution information, coupled with well-known ML algorithms to facilitate the construction of the final meta-predictor. The experimental results showed that SPIDER enabled more precise and robust prediction of druggable proteins than the baseline models and current existing methods in terms of the independent test dataset. An online web server was established and made freely available online

    Comparative analysis of machine learning-based approaches for identifying therapeutic peptides targeting SARS-CoV-2

    Get PDF
    Coronavirus disease 2019 (COVID-19) has impacted public health as well as societal and economic well-being. In the last two decades, various prediction algorithms and tools have been developed for predicting antiviral peptides (AVPs). The current COVID-19 pandemic has underscored the need to develop more efficient and accurate machine learning (ML)-based prediction algorithms for the rapid identification of therapeutic peptides against severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2). Several peptide-based ML approaches, including anti-coronavirus peptides (ACVPs), IL-6 inducing epitopes and other epitopes targeting SARS-CoV-2, have been implemented in COVID-19 therapeutics. Owing to the growing interest in the COVID-19 field, it is crucial to systematically compare the existing ML algorithms based on their performances. Accordingly, we comprehensively evaluated the state-of-the-art IL-6 and AVP predictors against coronaviruses in terms of core algorithms, feature encoding schemes, performance evaluation metrics and software usability. A comprehensive performance assessment was then conducted to evaluate the robustness and scalability of the existing predictors using well-constructed independent validation datasets. Additionally, we discussed the advantages and disadvantages of the existing methods, providing useful insights into the development of novel computational tools for characterizing and identifying epitopes or ACVPs. The insights gained from this review are anticipated to provide critical guidance to the scientific community in the rapid design and development of accurate and efficient next-generation in silico tools against SARS-CoV-2

    Extremely-randomized-tree-based Prediction of N(6)-Methyladenosine Sites in Saccharomyces cerevisiae

    Get PDF
    INTRODUCTION: N(6)-methyladenosine (m6A) is one of the most common post-transcriptional modifications in RNA, which has been related to several biological processes. The accurate prediction of m6A sites from RNA sequences is one of the challenging tasks in computational biology. Several computational methods utilizing machine-learning algorithms have been proposed that accelerate in silico screening of m6A sites, thereby drastically reducing the experimental time and labor costs involved. METHODOLOGY: In this study, we proposed a novel computational predictor termed ERT-m6Apred, for the accurate prediction of m6A sites. To identify the feature encodings with more discriminative capability, we applied a two-step feature selection technique on seven different feature encodings and identified the corresponding optimal feature set. RESULTS: Subsequently, performance comparison of the corresponding optimal feature set-based extremely randomized tree model revealed that Pseudo k-tuple composition encoding, which includes 14 physicochemical properties significantly outperformed other encodings. Moreover, ERT-m6Apred achieved an accuracy of 78.84% during cross-validation analysis, which is comparatively better than recently reported predictors. CONCLUSION: In summary, ERT-m6Apred predicts Saccharomyces cerevisiae m6A sites with higher accuracy, thus facilitating biological hypothesis generation and experimental validations

    Multisensory body representation in autoimmune diseases

    Get PDF
    Body representation has been linked to the processing and integration of multisensory signals. An outstanding example of the pivotal role played by multisensory mechanisms in body representation is the Rubber Hand Illusion (RHI). In this paradigm, multisensory stimulation induces a sense of ownership over a fake limb. Previous work has shown high interindividual differences in the susceptibility to the RHI. The origin of this variability remains largely unknown. Given the tight and bidirectional communication between the brain and the immune system, we predicted that the origin of this variability could be traced, in part, to the immune system's functioning, which is altered by several clinical conditions, including Coeliac Disease (CD). Consistent with this prediction, we found that the Rubber Hand Illusion is stronger in CD patients as compared to healthy controls. We propose a biochemical mechanism accounting for the dependency of multisensory body representation upon the Immune system. Our finding has direct implications for a range of neurological, psychiatric and immunological conditions where alterations of multisensory integration, body representation and dysfunction of the immune system co-exist

    STALLION: A stacking-based ensemble learning framework for prokaryotic lysine acetylation site prediction

    Get PDF
    Protein post-translational modification (PTM) is an important regulatory mechanism that plays a key role in both normal and disease states. Acetylation on lysine residues is one of the most potent PTMs owing to its critical role in cellular metabolism and regulatory processes. Identifying protein lysine acetylation (Kace) sites is a challenging task in bioinformatics. To date, several machine learning-based methods for the in silico identification of Kace sites have been developed. Of those, a few are prokaryotic species-specific. Despite their attractive advantages and performances, these methods have certain limitations. Therefore, this study proposes a novel predictor STALLION (STacking-based Predictor for ProkAryotic Lysine AcetyLatION), containing six prokaryotic species-specific models to identify Kace sites accurately. To extract crucial patterns around Kace sites, we employed 11 different encodings representing three different characteristics. Subsequently, a systematic and rigorous feature selection approach was employed to identify the optimal feature set independently for five tree-based ensemble algorithms and built their respective baseline model for each species. Finally, the predicted values from baseline models were utilized and trained with an appropriate classifier using the stacking strategy to develop STALLION. Comparative benchmarking experiments showed that STALLION significantly outperformed existing predictor on independent tests. To expedite direct accessibility to the STALLION models, a user-friendly online predictor was implemented, which is available at: http://thegleelab.org/STALLION

    SCMTHP: A New Approach for Identifying and Characterizing of Tumor-Homing Peptides Using Estimated Propensity Scores of Amino Acids

    Get PDF
    Tumor-homing peptides (THPs) are small peptides that can recognize and bind cancer cells specifically. To gain a better understanding of THPs’ functional mechanisms, the accurate identification and characterization of THPs is required. Although some computational methods for in silico THP identification have been proposed, a major drawback is their lack of model interpretability. In this study, we propose a new, simple and easily interpretable computational approach (called SCMTHP) for identifying and analyzing tumor-homing activities of peptides via the use of a scoring card method (SCM). To improve the predictability and interpretability of our predictor, we generated propensity scores of 20 amino acids as THPs. Finally, informative physicochemical properties were used for providing insights on characteristics giving rise to the bioactivity of THPs via the use of SCMTHP-derived propensity scores. Benchmarking experiments from independent test indicated that SCMTHP could achieve comparable performance to state-of-the-art method with accuracies of 0.827 and 0.798, respectively, when evaluated on two benchmark datasets consisting of Main and Small datasets. Furthermore, SCMTHP was found to outperform several well-known machine learning-based classifiers (e.g., decision tree, k-nearest neighbor, multi-layer perceptron, naive Bayes and partial least squares regression) as indicated by both 10-fold cross-validation and independent tests. Finally, the SCMTHP web server was established and made freely available online. SCMTHP is expected to be a useful tool for rapid and accurate identification of THPs and for providing better understanding on THP biophysical and biochemical properties

    A conserved loop-wedge motif moderates reaction site search and recognition by FEN1

    Get PDF
    DNA replication and repair frequently involve intermediate two-way junction structures with overhangs, or flaps, that must be promptly removed; a task performed by the essential enzyme flap endonuclease 1 (FEN1). We demonstrate a functional relationship between two intrinsically disordered regions of the FEN1 protein, which recognise opposing sides of the junction and order in response to the requisite substrate. Our results inform a model in which short-range translocation of FEN1 on DNA facilitates search for the annealed 3′‑terminus of a primer strand, which is recognised by breaking the terminal base pair to generate a substrate with a single nucleotide 3′‑flap. This recognition event allosterically signals hydrolytic removal of the 5′-flap through reaction in the opposing junction duplex, by controlling access of the scissile phosphate diester to the active site. The recognition process relies on a highly-conserved ‘wedge’ residue located on a mobile loop that orders to bind the newly-unpaired base. The unanticipated ‘loop–wedge’ mechanism exerts control over substrate selection, rate of reaction and reaction site precision, and shares features with other enzymes that recognise irregular DNA structures. These new findings reveal how FEN1 precisely couples 3′-flap verification to function

    SDM6A: A Web-Based Integrative Machine-Learning Framework for Predicting 6mA Sites in the Rice Genome

    Get PDF
    DNA N(6)-adenine methylation (6mA) is an epigenetic modification in prokaryotes and eukaryotes. Identifying 6mA sites in rice genome is important in rice epigenetics and breeding, but non-random distribution and biological functions of these sites remain unclear. Several machine-learning tools can identify 6mA sites but show limited prediction accuracy, which limits their usability in epigenetic research. Here, we developed a novel computational predictor, called the Sequence-based DNA N(6)-methyladenine predictor (SDM6A), which is a two-layer ensemble approach for identifying 6mA sites in the rice genome. Unlike existing methods, which are based on single models with basic features, SDM6A explores various features, and five encoding methods were identified as appropriate for this problem. Subsequently, an optimal feature set was identified from encodings, and corresponding models were developed individually using support vector machine and extremely randomized tree. First, all five single models were integrated via ensemble approach to define the class for each classifier. Second, two classifiers were integrated to generate a final prediction. SDM6A achieved robust performance on cross-validation and independent evaluation, with average accuracy and Matthews correlation coefficient (MCC) of 88.2% and 0.764, respectively. Corresponding metrics were 4.7%-11.0% and 2.3%-5.5% higher than those of existing methods, respectively. A user-friendly, publicly accessible web server (http://thegleelab.org/SDM6A) was implemented to predict novel putative 6mA sites in rice genome

    Computational prediction of species-specific yeast DNA replication origin via iterative feature representation

    Get PDF
    Deoxyribonucleic acid replication is one of the most crucial tasks taking place in the cell, and it has to be precisely regulated. This process is initiated in the replication origins (ORIs), and thus it is essential to identify such sites for a deeper understanding of the cellular processes and functions related to the regulation of gene expression. Considering the important tasks performed by ORIs, several experimental and computational approaches have been developed in the prediction of such sites. However, existing computational predictors for ORIs have certain curbs, such as building only single-feature encoding models, limited systematic feature engineering efforts and failure to validate model robustness. Hence, we developed a novel species-specific yeast predictor called yORIpred that accurately identify ORIs in the yeast genomes. To develop yORIpred, we first constructed optimal 40 baseline models by exploring eight different sequence-based encodings and five different machine learning classifiers. Subsequently, the predicted probability of 40 models was considered as the novel feature vector and carried out iterative feature learning approach independently using five different classifiers. Our systematic analysis revealed that the feature representation learned by the support vector machine algorithm (yORIpred) could well discriminate the distribution characteristics between ORIs and non-ORIs when compared with the other four algorithms. Comprehensive benchmarking experiments showed that yORIpred achieved superior and stable performance when compared with the existing predictors on the same training datasets. Furthermore, independent evaluation showcased the best and accurate performance of yORIpred thus underscoring the significance of iterative feature representation. To facilitate the users in obtaining their desired results without undergoing any mathematical, statistical or computational hassles, we developed a web server for the yORIpred predictor, which is available at: http://thegleelab.org/yORIpred
    corecore