5,573 research outputs found

    Cross-Domain Labeled LDA for Cross-Domain Text Classification

    Full text link
    Cross-domain text classification aims at building a classifier for a target domain which leverages data from both source and target domain. One promising idea is to minimize the feature distribution differences of the two domains. Most existing studies explicitly minimize such differences by an exact alignment mechanism (aligning features by one-to-one feature alignment, projection matrix etc.). Such exact alignment, however, will restrict models' learning ability and will further impair models' performance on classification tasks when the semantic distributions of different domains are very different. To address this problem, we propose a novel group alignment which aligns the semantics at group level. In addition, to help the model learn better semantic groups and semantics within these groups, we also propose a partial supervision for model's learning in source domain. To this end, we embed the group alignment and a partial supervision into a cross-domain topic model, and propose a Cross-Domain Labeled LDA (CDL-LDA). On the standard 20Newsgroup and Reuters dataset, extensive quantitative (classification, perplexity etc.) and qualitative (topic detection) experiments are conducted to show the effectiveness of the proposed group alignment and partial supervision.Comment: ICDM 201

    Tuning the Conductance of Monatomic Carbon Chain

    Full text link
    Ab initio calculations show that the conductance of short monatomic carbon chain can be dramatically modified by adhering a single H, N, or O atom to the chain. For example, the conductance of the pristine chain gets about two orders of magnitude smaller if an H atom is adhered to the chain. By a statistical model, the structure of the carbon chain with the single atom adhered is found to be quite stable at room temperature, indicating that the method can be used to tune the conductance of monatomic carbon chain.Comment: 11pages, 6figure

    L-Theanine Content and Related Gene Expression: Novel Insights into Theanine Biosynthesis and Hydrolysis among Different Tea Plant (Camellia sinensis L.) Tissues and Cultivars

    Get PDF
    L-Theanine content has tissues and cultivars specificity in tea plant (Camellia sinensis L.), the correlations of theanine metabolic related genes expression profiles with theanine contents were explored in this study. L-theanine contents in the bud and 1st leaf, 2nd leaf, 3rd leaf, old leaf, stem, and lateral root were determined by HPLC from three C. sinensis cultivars, namely ‘Huangjinya’, ‘Anjibaicha’, and ‘Yingshuang’, respectively. The theanine contents in leaves and root of ‘Huangjinya’ were the highest, followed by ‘Anjibaicha’, and ‘Yingshuang’. The theanine contents in the leaves reduced as the leaf mature gradually, and in stem were the least. Seventeen genes encoding enzymes involved in theanine metabolism were identified from GenBank and our tea transcriptome database, including CsTS1, CsTS2, CsGS1, CsGS2, CsGOGAT-Fe, CsGOGAT-NAD(P)H, CsGDH1, CsGDH2, CsALT, CsSAMDC, CsADC, CsCuAO, CsPAO, CsNiR, CsNR, CsGGT1, and CsGGT3. The transcript profiles of those seventeen genes in the different tissues of three tea plant cultivars were analyzed comparatively. Among the different cultivars, the transcript levels of most selected genes in ‘Huangjinya’ were significantly higher than that in the ‘Anjibaicha’ and ‘Yingshuang’. Among the different tissues, the transcript levels of CsTS2, CsGS1, and CsGDH2 almost showed positive correlation with the theanine contents, while the other genes showed negative correlation with the theanine contents in most cases. The theanine contents showed correlations with related genes expression levels among cultivars and tissues of tea plant, and were determined by the integrated effect of the metabolic related genes
    corecore