933 research outputs found

    Multiple Instance Learning for Heterogeneous Images: Training a CNN for Histopathology

    Full text link
    Multiple instance (MI) learning with a convolutional neural network enables end-to-end training in the presence of weak image-level labels. We propose a new method for aggregating predictions from smaller regions of the image into an image-level classification by using the quantile function. The quantile function provides a more complete description of the heterogeneity within each image, improving image-level classification. We also adapt image augmentation to the MI framework by randomly selecting cropped regions on which to apply MI aggregation during each epoch of training. This provides a mechanism to study the importance of MI learning. We validate our method on five different classification tasks for breast tumor histology and provide a visualization method for interpreting local image classifications that could lead to future insights into tumor heterogeneity

    Joint and individual analysis of breast cancer histologic images and genomic covariates

    Get PDF
    A key challenge in modern data analysis is understanding connections between complex and differing modalities of data. For example, two of the main approaches to the study of breast cancer are histopathology (analyzing visual characteristics of tumors) and genetics. While histopathology is the gold standard for diagnostics and there have been many recent breakthroughs in genetics, there is little overlap between these two fields. We aim to bridge this gap by developing methods based on Angle-based Joint and Individual Variation Explained (AJIVE) to directly explore similarities and differences between these two modalities. Our approach exploits Convolutional Neural Networks (CNNs) as a powerful, automatic method for image feature extraction to address some of the challenges presented by statistical analysis of histopathology image data. CNNs raise issues of interpretability that we address by developing novel methods to explore visual modes of variation captured by statistical algorithms (e.g. PCA or AJIVE) applied to CNN features. Our results provide many interpretable connections and contrasts between histopathology and genetics

    Weight loss reduces basal-like breast cancer through kinome reprogramming

    Get PDF
    Additional file 1. Tumor burden and growth were not affected by diet. a. Tumor burden was quantified at sacrifice. b. Tumor volume was measured by calipers at detection and sacrifice. (N = 28 10 %; N = 31 60 %; N = 29, 60–10 %)

    Integrated RNA and DNA sequencing improves mutation detection in low purity tumors

    Get PDF
    Identifying somatic mutations is critical for cancer genome characterization and for prioritizing patient treatment. DNA whole exome sequencing (DNA-WES) is currently the most popular technology; however, this yields low sensitivity in low purity tumors. RNA sequencing (RNA-seq) covers the expressed exome with depth proportional to expression. We hypothesized that integrating DNA-WES and RNA-seq would enable superior mutation detection versus DNA-WES alone. We developed a first-of-its-kind method, called UNCeqR, that detects somatic mutations by integrating patient-matched RNA-seq and DNA-WES. In simulation, the integrated DNA and RNA model outperformed the DNA-WES only model. Validation by patient-matched whole genome sequencing demonstrated superior performance of the integrated model over DNA-WES only models, including a published method and published mutation profiles. Genome-wide mutational analysis of breast and lung cancer cohorts (n = 871) revealed remarkable tumor genomics properties. Low purity tumors experienced the largest gains in mutation detection by integrating RNA-seq and DNA-WES. RNA provided greater mutation signal than DNA in expressed mutations. Compared to earlier studies on this cohort, UNCeqR increased mutation rates of driver and therapeutically targeted genes (e.g. PIK3CA, ERBB2 and FGFR2). In summary, integrating RNA-seq with DNA-WES increases mutation detection performance, especially for low purity tumors

    Intratumoral heterogeneity as a source of discordance in breast cancer biomarker classification

    Get PDF
    Abstract Background Spatial heterogeneity in biomarker expression may impact breast cancer classification. The aims of this study were to estimate the frequency of spatial heterogeneity in biomarker expression within tumors, to identify technical and biological factors contributing to spatial heterogeneity, and to examine the impact of discordant biomarker status within tumors on clinical record agreement. Methods Tissue microarrays (TMAs) were constructed using two to four cores (1.0 mm) for each of 1085 invasive breast cancers from the Carolina Breast Cancer Study, which is part of the AMBER Consortium. Immunohistochemical staining for estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (HER2) was quantified using automated digital imaging analysis. The biomarker status for each core and for each case was assigned using clinical thresholds. Cases with core-to-core biomarker discordance were manually reviewed to distinguish intratumoral biomarker heterogeneity from misclassification of biomarker status by the automated algorithm. The impact of core-to-core biomarker discordance on case-level agreement between TMAs and the clinical record was evaluated. Results On the basis of automated analysis, discordant biomarker status between TMA cores occurred in 9 %, 16 %, and 18 % of cases for ER, PR, and HER2, respectively. Misclassification of benign epithelium and/or ductal carcinoma in situ as invasive carcinoma by the automated algorithm was implicated in discordance among cores. However, manual review of discordant cases confirmed spatial heterogeneity as a source of discordant biomarker status between cores in 2 %, 7 %, and 8 % of cases for ER, PR, and HER2, respectively. Overall, agreement between TMA and clinical record was high for ER (94 %), PR (89 %), and HER2 (88 %), but it was reduced in cases with core-to-core discordance (agreement 70 % for ER, 61 % for PR, and 57 % for HER2). Conclusions Intratumoral biomarker heterogeneity may impact breast cancer classification accuracy, with implications for clinical management. Both manually confirmed biomarker heterogeneity and misclassification of biomarker status by automated image analysis contribute to discordant biomarker status between TMA cores. Given that manually confirmed heterogeneity is uncommon (<10 % of cases), large studies are needed to study the impact of heterogeneous biomarker expression on breast cancer classification and outcomes

    Gene Expression Analysis of In Vitro Cocultures to Study Interactions between Breast Epithelium and Stroma

    Get PDF
    The interactions between breast epithelium and stroma are fundamental to normal tissue homeostasis and for tumor initiation and progression. Gene expression studies of in vitro coculture models demonstrate that in vitro models have relevance for tumor progression in vivo. For example, stromal gene expression has been shown to vary in association with tumor subtype in vivo, and analogous in vitro cocultures recapitulate subtype-specific biological interactions. Cocultures can be used to study cancer cell interactions with specific stromal components (e.g., immune cells, fibroblasts, endothelium) and different representative cell lines (e.g., cancer-associated versus normal-associated fibroblasts versus established, immortalized fibroblasts) can help elucidate the role of stromal variation in tumor phenotypes. Gene expression data can also be combined with cell-based assays to identify cellular phenotypes associated with gene expression changes. Coculture systems are manipulable systems that can yield important insights about cell-cell interactions and the cellular phenotypes that occur as tumor and stroma co-evolve

    Assessing Etiologic Heterogeneity for Multinomial Outcome with Two-Phase Outcome-Dependent Sampling Design

    Get PDF
    Etiologic heterogeneity occurs when distinct sets of events or exposures give rise to different subtypes of disease. Inference about subtype-specific exposure effects from two-phase outcome-dependent sampling data requires adjustment for both confounding and the sampling design. Common approaches to inference for these effects do not necessarily appropriately adjust for these sources of bias, or allow for formal comparisons of effects across different subtypes. Herein, using inverse probability weighting (IPW) to fit a multinomial model is shown to yield valid inference with this sampling design for subtype-specific exposure effects and contrasts thereof. The IPW approach is compared to common regression-based methods for assessing exposure effect heterogeneity using simulations. The methods are applied to estimate subtype-specific effects of various exposures on breast cancer risk in the Carolina Breast Cancer Study

    Alcohol intake and invasive breast cancer risk by molecular subtype and race in the Carolina Breast Cancer Study

    Get PDF
    Alcohol is an established breast cancer risk factor, but there is little evidence on whether the association differs between African Americans and whites

    Digital histologic analysis reveals morphometric patterns of age-related involution in breast epithelium and stroma

    Get PDF
    Complete age-related regression of mammary epithelium, often termed post-menopausal involution, is associated with decreased breast cancer risk. However, most studies have qualitatively assessed involution. We quantitatively analyzed epithelium, stroma, and adipose tissue from histologically normal breast tissue of 454 patients in the Normal Breast Study (NBS). High-resolution digital images of normal breast Hematoxylin & Eosin stained slides were partitioned into epithelium, adipose tissue, and non-fatty stroma. Percentage area and nuclei per unit area (nuclear density) were calculated for each component. Quantitative data were evaluated in association with age using linear regression and cubic spline models Stromal area decreased (p=0.0002) and adipose tissue area increased (p<0.0001), with an approximate 0.7% change in area for each component, until age 55 when these area measures reached a steady state. While epithelial area did not show linear changes with age, epithelial nuclear density decreased linearly beginning in the third decade of life. No significant age-related trends were observed for stromal or adipose nuclear density. Digital image analysis offers a high-throughput method for quantitatively measuring tissue morphometry and for objectively assessing age-related changes in adipose tissue, stroma, and epithelium. Epithelial nuclear density is a quantitative measure of age-related breast involution that begins to decline in the early premenopausal period

    Parity-related molecular signatures and breast cancer subtypes by estrogen receptor status

    Get PDF
    INTRODUCTION: Relationships of parity with breast cancer risk are complex. Parity is associated with decreased risk of postmenopausal hormone receptor–positive breast tumors, but may increase risk for basal-like breast cancers and early-onset tumors. Characterizing parity-related gene expression patterns in normal breast and breast tumor tissues may improve understanding of the biological mechanisms underlying this complex pattern of risk. METHODS: We developed a parity signature by analyzing microRNA microarray data from 130 reduction mammoplasty (RM) patients (54 nulliparous and 76 parous). This parity signature, together with published parity signatures, was evaluated in gene expression data from 150 paired tumors and adjacent benign breast tissues from the Polish Breast Cancer Study, both overall and by tumor estrogen receptor (ER) status. RESULTS: We identified 251 genes significantly upregulated by parity status in RM patients (parous versus nulliparous; false discovery rate = 0.008), including genes in immune, inflammation and wound response pathways. This parity signature was significantly enriched in normal and tumor tissues of parous breast cancer patients, specifically in ER-positive tumors. CONCLUSIONS: Our data corroborate epidemiologic data, suggesting that the etiology and pathogenesis of breast cancers vary by ER status, which may have implications for developing prevention strategies for these tumors
    corecore