2,556 research outputs found

    Probabilistic Clustering of Time-Evolving Distance Data

    Full text link
    We present a novel probabilistic clustering model for objects that are represented via pairwise distances and observed at different time points. The proposed method utilizes the information given by adjacent time points to find the underlying cluster structure and obtain a smooth cluster evolution. This approach allows the number of objects and clusters to differ at every time point, and no identification on the identities of the objects is needed. Further, the model does not require the number of clusters being specified in advance -- they are instead determined automatically using a Dirichlet process prior. We validate our model on synthetic data showing that the proposed method is more accurate than state-of-the-art clustering methods. Finally, we use our dynamic clustering model to analyze and illustrate the evolution of brain cancer patients over time

    Pathological Features of Breast Cancer seen in Northwestern Tanzania: A Nine Years Retrospective Study.

    Get PDF
    Breast cancer is more common in Western Countries compared to African populations. However in African population, it appears that the disease tends to be more aggressive and occurring at a relatively young age at the time of presentation. The aim of this study was to describe the trend of Breast Cancer in Northwestern Tanzania. This was a retrospective study which involved all cases of breast cancer diagnosed histologically at Bugando Medical Center from 2002 to 2010. Histological results and slides were retrieved from the records in the Pathology department, clinical information and demographic data for patients were retrieved from surgical wards and department of medical records. Histology slides were re-evaluated for the histological type, grade (By modified Bloom-Richardson score), and presence of necrosis and skin involvement. Data was entered and analyzed by SPSS computer software version 15. There were 328 patients histologically confirmed to have breast cancer, the mean age at diagnosis was 48.7 years (+/- 13.1). About half of the patients (52.4%) were below 46 years of age, and this group of patients had significantly higher tendency for lymph node metastasis (p = 0.012). The tumor size ranged from 1 cm to 18 cm in diameter with average (mean) of 5.5 cm (+/- 2.5), and median size of 6 cm. Size of the tumor (above 6 cm in diameter) and presence of necrosis within the tumor was significantly associated with high rate of lymph node metastasis (p = 0.000). Of all patients, 64% were at clinical stage III (specifically IIIB) and 70.4% had lymph node metastasis at the time of diagnosis. Only 4.3% of the patients were in clinical stage I at the time of diagnosis. Majority of the patients had invasive ductal carcinoma (91.5%) followed by mucinous carcinoma (5.2%), Invasive lobular carcinoma (3%) and in situ ductal carcinoma (0.3%). In all patients, 185 (56.4%) had tumor with histological grade 3. Breast cancer in this region show a trend towards relative young age at diagnosis with advanced stage at diagnosis and high rate of lymph node metastasis. Poor Referral system, lack of screening programs and natural aggressive biological behavior of tumor may contribute to advanced disease at the time of diagnosis

    The glyoxal budget and its contribution to organic aerosol for Los Angeles, California, during CalNex 2010

    Get PDF
    Recent laboratory and field studies have indicated that glyoxal is a potentially large contributor to secondary organic aerosol mass. We present in situ glyoxal measurements acquired with a recently developed, high sensitivity spectroscopic instrument during the CalNex 2010 field campaign in Pasadena, California. We use three methods to quantify the production and loss of glyoxal in Los Angeles and its contribution to organic aerosol. First, we calculate the difference between steady state sources and sinks of glyoxal at the Pasadena site, assuming that the remainder is available for aerosol uptake. Second, we use the Master Chemical Mechanism to construct a two-dimensional model for gas-phase glyoxal chemistry in Los Angeles, assuming that the difference between the modeled and measured glyoxal concentration is available for aerosol uptake. Third, we examine the nighttime loss of glyoxal in the absence of its photochemical sources and sinks. Using these methods we constrain the glyoxal loss to aerosol to be 0-5 × 10-5 s-1 during clear days and (1 ± 0.3) × 10-5 s-1 at night. Between 07:00-15:00 local time, the diurnally averaged secondary organic aerosol mass increases from 3.2 μg m-3 to a maximum of 8.8 μg m -3. The constraints on the glyoxal budget from this analysis indicate that it contributes 0-0.2 μg m-3 or 0-4% of the secondary organic aerosol mass. Copyright 2011 by the American Geophysical Union

    Atmospheric emissions from the deepwater Horizon spill constrain air-water partitioning, hydrocarbon fate, and leak rate

    Get PDF
    The fate of deepwater releases of gas and oil mixtures is initially determined by solubility and volatility of individual hydrocarbon species; these attributes determine partitioning between air and water. Quantifying this partitioning is necessary to constrain simulations of gas and oil transport, to predict marine bioavailability of different fractions of the gas-oil mixture, and to develop a comprehensive picture of the fate of leaked hydrocarbons in the marine environment. Analysis of airborne atmospheric data shows massive amounts (∼258,000 kg/day) of hydrocarbons evaporating promptly from the Deepwater Horizon spill; these data collected during two research flights constrain air-water partitioning, thus bioavailability and fate, of the leaked fluid. This analysis quantifies the fraction of surfacing hydrocarbons that dissolves in the water column (∼33% by mass), the fraction that does not dissolve, and the fraction that evaporates promptly after surfacing (∼14% by mass). We do not quantify the leaked fraction lacking a surface expression; therefore, calculation of atmospheric mass fluxes provides a lower limit to the total hydrocarbon leak rate of 32,600 to 47,700 barrels of fluid per day, depending on reservoir fluid composition information. This study demonstrates a new approach for rapid-response airborne assessment of future oil spills. Copyright 2011 by the American Geophysical Union

    Composite structural motifs of binding sites for delineating biological functions of proteins

    Get PDF
    Most biological processes are described as a series of interactions between proteins and other molecules, and interactions are in turn described in terms of atomic structures. To annotate protein functions as sets of interaction states at atomic resolution, and thereby to better understand the relation between protein interactions and biological functions, we conducted exhaustive all-against-all atomic structure comparisons of all known binding sites for ligands including small molecules, proteins and nucleic acids, and identified recurring elementary motifs. By integrating the elementary motifs associated with each subunit, we defined composite motifs which represent context-dependent combinations of elementary motifs. It is demonstrated that function similarity can be better inferred from composite motif similarity compared to the similarity of protein sequences or of individual binding sites. By integrating the composite motifs associated with each protein function, we define meta-composite motifs each of which is regarded as a time-independent diagrammatic representation of a biological process. It is shown that meta-composite motifs provide richer annotations of biological processes than sequence clusters. The present results serve as a basis for bridging atomic structures to higher-order biological phenomena by classification and integration of binding site structures.Comment: 34 pages, 7 figure

    FLORA: a novel method to predict protein function from structure in diverse superfamilies

    Get PDF
    Predicting protein function from structure remains an active area of interest, particularly for the structural genomics initiatives where a substantial number of structures are initially solved with little or no functional characterisation. Although global structure comparison methods can be used to transfer functional annotations, the relationship between fold and function is complex, particularly in functionally diverse superfamilies that have evolved through different secondary structure embellishments to a common structural core. The majority of prediction algorithms employ local templates built on known or predicted functional residues. Here, we present a novel method (FLORA) that automatically generates structural motifs associated with different functional sub-families (FSGs) within functionally diverse domain superfamilies. Templates are created purely on the basis of their specificity for a given FSG, and the method makes no prior prediction of functional sites, nor assumes specific physico-chemical properties of residues. FLORA is able to accurately discriminate between homologous domains with different functions and substantially outperforms (a 2–3 fold increase in coverage at low error rates) popular structure comparison methods and a leading function prediction method. We benchmark FLORA on a large data set of enzyme superfamilies from all three major protein classes (α, β, αβ) and demonstrate the functional relevance of the motifs it identifies. We also provide novel predictions of enzymatic activity for a large number of structures solved by the Protein Structure Initiative. Overall, we show that FLORA is able to effectively detect functionally similar protein domain structures by purely using patterns of structural conservation of all residues

    Perspective from a Younger Generation -- The Astro-Spectroscopy of Gisbert Winnewisser

    Full text link
    Gisbert Winnewisser's astronomical career was practically coextensive with the whole development of molecular radio astronomy. Here I would like to pick out a few of his many contributions, which I, personally, find particularly interesting and put them in the context of newer results.Comment: 14 pages. (Co)authored by members of the MPIfR (Sub)millimeter Astronomy Group. To appear in the Proceedings of the 4th Cologne-Bonn-Zermatt-Symposium "The Dense Interstellar Medium in Galaxies" eds. S. Pfalzner, C. Kramer, C. Straubmeier, & A. Heithausen (Springer: Berlin

    Measurement of the inclusive and dijet cross-sections of b-jets in pp collisions at sqrt(s) = 7 TeV with the ATLAS detector

    Get PDF
    The inclusive and dijet production cross-sections have been measured for jets containing b-hadrons (b-jets) in proton-proton collisions at a centre-of-mass energy of sqrt(s) = 7 TeV, using the ATLAS detector at the LHC. The measurements use data corresponding to an integrated luminosity of 34 pb^-1. The b-jets are identified using either a lifetime-based method, where secondary decay vertices of b-hadrons in jets are reconstructed using information from the tracking detectors, or a muon-based method where the presence of a muon is used to identify semileptonic decays of b-hadrons inside jets. The inclusive b-jet cross-section is measured as a function of transverse momentum in the range 20 < pT < 400 GeV and rapidity in the range |y| < 2.1. The bbbar-dijet cross-section is measured as a function of the dijet invariant mass in the range 110 < m_jj < 760 GeV, the azimuthal angle difference between the two jets and the angular variable chi in two dijet mass regions. The results are compared with next-to-leading-order QCD predictions. Good agreement is observed between the measured cross-sections and the predictions obtained using POWHEG + Pythia. MC@NLO + Herwig shows good agreement with the measured bbbar-dijet cross-section. However, it does not reproduce the measured inclusive cross-section well, particularly for central b-jets with large transverse momenta.Comment: 10 pages plus author list (21 pages total), 8 figures, 1 table, final version published in European Physical Journal

    Search for new phenomena in final states with an energetic jet and large missing transverse momentum in pp collisions at √ s = 8 TeV with the ATLAS detector

    Get PDF
    Results of a search for new phenomena in final states with an energetic jet and large missing transverse momentum are reported. The search uses 20.3 fb−1 of √ s = 8 TeV data collected in 2012 with the ATLAS detector at the LHC. Events are required to have at least one jet with pT > 120 GeV and no leptons. Nine signal regions are considered with increasing missing transverse momentum requirements between Emiss T > 150 GeV and Emiss T > 700 GeV. Good agreement is observed between the number of events in data and Standard Model expectations. The results are translated into exclusion limits on models with either large extra spatial dimensions, pair production of weakly interacting dark matter candidates, or production of very light gravitinos in a gauge-mediated supersymmetric model. In addition, limits on the production of an invisibly decaying Higgs-like boson leading to similar topologies in the final state are presente

    Search for direct pair production of the top squark in all-hadronic final states in proton-proton collisions at s√=8 TeV with the ATLAS detector

    Get PDF
    The results of a search for direct pair production of the scalar partner to the top quark using an integrated luminosity of 20.1fb−1 of proton–proton collision data at √s = 8 TeV recorded with the ATLAS detector at the LHC are reported. The top squark is assumed to decay via t˜→tχ˜01 or t˜→ bχ˜±1 →bW(∗)χ˜01 , where χ˜01 (χ˜±1 ) denotes the lightest neutralino (chargino) in supersymmetric models. The search targets a fully-hadronic final state in events with four or more jets and large missing transverse momentum. No significant excess over the Standard Model background prediction is observed, and exclusion limits are reported in terms of the top squark and neutralino masses and as a function of the branching fraction of t˜ → tχ˜01 . For a branching fraction of 100%, top squark masses in the range 270–645 GeV are excluded for χ˜01 masses below 30 GeV. For a branching fraction of 50% to either t˜ → tχ˜01 or t˜ → bχ˜±1 , and assuming the χ˜±1 mass to be twice the χ˜01 mass, top squark masses in the range 250–550 GeV are excluded for χ˜01 masses below 60 GeV
    corecore