377,336 research outputs found

    A multiple-instance scoring method to predict tissue-specific cis-regulatory motifs and regions

    Get PDF
    Transcription is the central process of gene regulation. In higher eukaryotes, the transcription of a gene is usually regulated by multiple cis-regulatory regions (CRRs). In different tissues, different transcription factors bind to their cis-regulatory motifs in these CRRs to drive tissue-specific expression patterns of their target genes. By combining the genome-wide gene expression data with the genomic sequence data, we proposed multiple-instance scoring (MIS) method to predict the tissue-specific motifs and the corresponding CRRs. The method is mainly based on the assumption that only a subset of CRRs of the expressed gene should function in the studied tissue. By testing on the simulated datasets and the fly muscle dataset, MIS can identify true motifs when noise is high and shows higher specificity for predicting the tissue-specific functions of CRRs

    Zero-error communication over adder MAC

    Full text link
    Adder MAC is a simple noiseless multiple-access channel (MAC), where if users send messages X1,,Xh{0,1}nX_1,\ldots,X_h\in \{0,1\}^n, then the receiver receives Y=X1++XhY = X_1+\cdots+X_h with addition over Z\mathbb{Z}. Communication over the noiseless adder MAC has been studied for more than fifty years. There are two models of particular interest: uniquely decodable code tuples, and BhB_h-codes. In spite of the similarities between these two models, lower bounds and upper bounds of the optimal sum rate of uniquely decodable code tuple asymptotically match as number of users goes to infinity, while there is a gap of factor two between lower bounds and upper bounds of the optimal rate of BhB_h-codes. The best currently known BhB_h-codes for h3h\ge 3 are constructed using random coding. In this work, we study variants of the random coding method and related problems, in hope of achieving BhB_h-codes with better rate. Our contribution include the following. (1) We prove that changing the underlying distribution used in random coding cannot improve the rate. (2) We determine the rate of a list-decoding version of BhB_h-codes achieved by the random coding method. (3) We study several related problems about R\'{e}nyi entropy.Comment: An updated version of author's master thesi

    Spanning rigid subgraph packing and sparse subgraph covering

    Full text link
    Rigidity, arising in discrete geometry, is the property of a structure that does not flex. Laman provides a combinatorial characterization of rigid graphs in the Euclidean plane, and thus rigid graphs in the Euclidean plane have applications in graph theory. We discover a sufficient partition condition of packing spanning rigid subgraphs and spanning trees. As a corollary, we show that a simple graph GG contains a packing of kk spanning rigid subgraphs and ll spanning trees if GG is (4k+2l)(4k+2l)-edge-connected, and GZG-Z is essentially (6k+2l2kZ)(6k+2l - 2k|Z|)-edge-connected for every ZV(G)Z\subset V(G). Thus every (4k+2l)(4k+2l)-connected and essentially (6k+2l)(6k+2l)-connected graph GG contains a packing of kk spanning rigid subgraphs and ll spanning trees. Utilizing this, we show that every 66-connected and essentially 88-connected graph GG contains a spanning tree TT such that GE(T)G-E(T) is 22-connected. These improve some previous results. Sparse subgraph covering problems are also studied.Comment: 12 page

    Computations of Mather Minimal Log Discrepancies

    Full text link
    We compute the Mather minimal log discrepancy via jet schemes and arc spaces for toric varieties and very general hypersurfaces
    corecore