Search CORE

447 research outputs found

SMART: Unique splitting-while-merging framework for gene clustering

Author: A Thalamuthu
AD Lanterman
AE Teschendorff
AK Jain
Asoke K. Nandi
B Abu-Jamous
B Fritzke
B Fritzke
CR Lin
CS Wallace
D Dembele
D Jiang
David J. Roberts
G Celeux
H Akaike
J Qin
J Rissanen
KY Yeung
L Hubert
L Mavridis
L Zhao
MAT Figueiredo
P Tamayo
PT Spellman
R Xu
R Xu
RJ Cho
Rui Fa
S Bandyopadhyay
S Monti
S Wu
Sergio Gómez
T Kohonen
T Pramila
TR Golub
WM Rand
YJ Zhang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 08/04/2014
Field of study

Copyright @ 2014 Fa et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.Successful clustering algorithms are highly dependent on parameter settings. The clustering performance degrades significantly unless parameters are properly set, and yet, it is difficult to set these parameters a priori. To address this issue, in this paper, we propose a unique splitting-while-merging clustering framework, named “splitting merging awareness tactics” (SMART), which does not require any a priori knowledge of either the number of clusters or even the possible range of this number. Unlike existing self-splitting algorithms, which over-cluster the dataset to a large number of clusters and then merge some similar clusters, our framework has the ability to split and merge clusters automatically during the process and produces the the most reliable clustering results, by intrinsically integrating many clustering techniques and tasks. The SMART framework is implemented with two distinct clustering paradigms in two algorithms: competitive learning and finite mixture model. Nevertheless, within the proposed SMART framework, many other algorithms can be derived for different clustering paradigms. The minimum message length algorithm is integrated into the framework as the clustering selection criterion. The usefulness of the SMART framework and its algorithms is tested in demonstration datasets and simulated gene expression datasets. Moreover, two real microarray gene expression datasets are studied using this approach. Based on the performance of many metrics, all numerical results show that SMART is superior to compared existing self-splitting algorithms and traditional algorithms. Three main properties of the proposed SMART framework are summarized as: (1) needing no parameters dependent on the respective dataset or a priori knowledge about the datasets, (2) extendible to many different applications, (3) offering superior performance compared with counterpart algorithms.National Institute for Health Researc

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Brunel University Research Archive

Statement of accounting principles

Author: Hatfield Henry Rand
Moore Wm.
Sanders Thomas H.
Publication venue: eGrove
Publication date: 01/01/1938
Field of study

American Institute of Accountants

eGrove (Univ. of Mississippi)

Recommended from our members

Genome-wide association study of primary open-angle glaucoma in continental and admixed African populations.

Primary open angle glaucoma (POAG) is a complex disease with a major genetic contribution. Its prevalence varies greatly among ethnic groups, and is up to five times more frequent in black African populations compared to Europeans. So far, worldwide efforts to elucidate the genetic complexity of POAG in African populations has been limited. We conducted a genome-wide association study in 1113 POAG cases and 1826 controls from Tanzanian, South African and African American study samples. Apart from confirming evidence of association at TXNRD2 (rs16984299; OR[T] 1.20; P = 0.003), we found that a genetic risk score combining the effects of the 15 previously reported POAG loci was significantly associated with POAG in our samples (OR 1.56; 95% CI 1.26-1.93; P = 4.79 × 10-5). By genome-wide association testing we identified a novel candidate locus, rs141186647, harboring EXOC4 (OR[A] 0.48; P = 3.75 × 10-8), a gene transcribing a component of the exocyst complex involved in vesicle transport. The low frequency and high degree of genetic heterogeneity at this region hampered validation of this finding in predominantly West-African replication sets. Our results suggest that established genetic risk factors play a role in African POAG, however, they do not explain the higher disease load. The high heterogeneity within Africans remains a challenge to identify the genetic commonalities for POAG in this ethnicity, and demands studies of extremely large size

eScholarship - University of California

Measuring gene similarity by means of the classification distance

Author: A Ben-Dor
A Statnikov
A Thalamuthu
Alessandro Fiori
BS Everitt
CC Chang
D Huang
D Jiang
D Jiang
Elena Baralis
FR Hampel
G Petrovics
Giulia Bruno
H Liu
J Gu
JJ Chen
JL Gregg
L Davies
L Fu
L Kaufman
L Wang
M Bouguessa
M Daszykowski
M Royuela
O Gevaert
P Rosini
P Yang
PR Bushel
RC Thompson
S Datta
S Mukkamala
SB Aicha
T Bo
T Chu
TF Cox
TR Golub
U Alon
WM Rand
X He
Y Torosyan
YH Yang
Publication venue: Springer London
Publication date: 01/01/2011
Field of study

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

PORTO Publications Open Repository TOrino

Sharp bounds and normalization of Wiener-type indices

Author: A Balaban
A Delprato
A Dobrynin
A Fronczak
AD Barbour
AL Barabasi
AL Barabási
BH Junker
D Plavšić
Dechao Tian
DJ Watts
F Brückler
Fabio Rapallo
G Csardi
G Ren
H Hosoya
H Wiener
H Wiener
I Gutman
I Gutman
I Gutman
JH Ward Jr
Kwok Pui Choi
L Hu
L Mueller
L Soltés
M Dehmer
M Dehmer
M Dehmer
M Randić
M Vidal
ME Newman
N Pržulj
NS Schmuck
O Resendis-Antonio
O Resendis-Antonio
P Erdős
P Erdős
S Wagnera
T Milenković
WM Rand
Y Zhang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 08/11/2013
Field of study

10.1371/journal.pone.0078448PLoS ONE811-POLN

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

ScholarBank@NUS

The Francis Crick Institute

Roto-Translation Covariant Convolutional Networks for Medical Image Analysis

Author: DC Cireşan
EJ Bekkers
I Arganda-Carreras
J Staal
M Veta
MW Lafarge
R Duits
WM Rand
Publication venue
Publication date: 01/01/2018
Field of study

We propose a framework for rotation and translation covariant deep learning using

SE(2)

group convolutions. The group product of the special Euclidean motion group

SE(2)

describes how a concatenation of two roto-translations results in a net roto-translation. We encode this geometric structure into convolutional neural networks (CNNs) via

SE(2)

group convolutional layers, which fit into the standard 2D CNN framework, and which allow to generically deal with rotated input samples without the need for data augmentation. We introduce three layers: a lifting layer which lifts a 2D (vector valued) image to an

SE(2)

-image, i.e., 3D (vector valued) data whose domain is

SE(2)

; a group convolution layer from and to an

SE(2)

-image; and a projection layer from an

SE(2)

-image to a 2D image. The lifting and group convolution layers are

SE(2)

covariant (the output roto-translates with the input). The final projection layer, a maximum intensity projection over rotations, makes the full CNN rotation invariant. We show with three different problems in histopathology, retinal imaging, and electron microscopy that with the proposed group CNNs, state-of-the-art performance can be achieved, without the need for data augmentation by rotation and with increased performance compared to standard CNNs that do rely on augmentation.Comment: 8 pages, 2 figures, 1 table, accepted at MICCAI 201

arXiv.org e-Print Archive

Crossref

Pure OAI Repository

An adaptive version of k-medoids to deal with the uncertainty in clustering heterogeneous data using an intermediary fusion approach

Author: A Oliva
A Strehl
Aalaa Mojahed
B Khaleghi
Beatriz de la Iglesia
BV Dasarathy
D Hall
DJ Berndt
E Acar
G Salton
GRG Lanckriet
GRG Lanckriet
H-S Park
L Kaufman
L Kaufman
LR Dice
M Žitnik
MA Abidi
MH Vliet van
N-EE Faouzi
OA Akeem
P Pavlidis
RA Baeza-Yates
S Jaccard
TN Manjunath
TY Chan
WM Rand
Y Shi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

This paper introduces Hk-medoids, a modified version of the standard k-medoids algorithm. The modification extends the algorithm for the problem of clustering complex heterogeneous objects that are described by a diversity of data types, e.g. text, images, structured data and time series. We first proposed an intermediary fusion approach to calculate fused similarities between objects, SMF, taking into account the similarities between the component elements of the objects using appropriate similarity measures. The fused approach entails uncertainty for incomplete objects or for objects which have diverging distances according to the different component. Our implementation of Hk-medoids proposed here works with the fused distances and deals with the uncertainty in the fusion process. We experimentally evaluate the potential of our proposed algorithm using five datasets with different combinations of data types that define the objects. Our results show the feasibility of the our algorithm, and also they show a performance enhancement when comparing to the application of the original SMF approach in combination with a standard k-medoids that does not take uncertainty into account. In addition, from a theoretical point of view, our proposed algorithm has lower computation complexity than the popular PAM implementation

Crossref

University of East Anglia digital repository

Biclustering models for two-mode ordinal data

Author: A Agresti
A Agresti
A McQuarrie
AP Dempster
BE Skolnick
Bergljot Gjelsvik
C Biernacki
C Biernacki
CM Hurvich
D Owens
Daniel Fernández
Eleni Matechou
G Govaert
G Govaert
G Schwarz
H Bozdogan
H Bozdogan
I Liu
Ivy Liu
J Cooper
J Molitor
J Wyse
JA Anderson
JA Hartigan
JD Banfield
JRS Fonseca
K Hawton
K Hawton
K Wiech
LM Furlanetto
M Lanfranchi
M Scharoun-Lee
M Tefera
Miguel Farias
N Eluru
P McCullagh
PJ Green
R Pechey
R Rocci
S Pledger
SM Desantis
SM Desantis
WM Rand
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

The work in this paper introduces finite mixture models that can be used to simul- taneously cluster the rows and columns of two-mode ordinal categorical response data, such as those resulting from Likert scale responses. We use the popular proportional odds parameterisation and propose models which provide insights into major patterns in the data. Model-fitting is performed using the EM algorithm and a fuzzy allocation of rows and columns to corresponding clusters is obtained. The clustering ability of the models is evaluated in a simulation study and demonstrated using two real data sets

Crossref

UPCommons. Portal del coneixement obert de la UPC

Springer - Publisher Connector

PubMed Central

Kent Academic Repository

Coventry University Pure Portal

UPCommons (Universitat Politècnica de Catalunya)

Whole-genome sequencing identifies genetic alterations in pediatric low-grade gliomas

Author: A Gajjar
A Korshunov
A Lin
A McPherson
A Peraud
A von Deimling
AJ Sievert
AK Gnekow
B Tang
C Bettegowda
CG Mullighan
CG Mullighan
D Dias-Santagata
D Singh
D Sturm
DA Persons
DN Louis
DN Louis
DT Jones
DT Jones
DT Jones
DW Ellison
DW Parsons
E Bouffet
F Li
G Schindler
G Wu
GT Armstrong
H Cin
H Li
H Ohgaki
H Ohgaki
H Ohgaki
H Yan
I Qaddoumi
IF Pollack
IF Pollack
J Jonkers
J Schwartzentruber
J Wang
J Zhang
J Zhang
J Zhang
JE DeClue
JH Wisoff
JR Downing
KT Flaherty
M Katoh
M Ren
M Wang
MJ Ciesielski
MJ Riemenschneider
ML Bajenaru
MN Edmonson
N Turner
N Turner
ND Dees
PG Fisher
PJ Stephens
PK Duffner
R Endersby
R Listernick
R Listernick
RG Tatevossian
RG Tatevossian
RS Arora
S Pfister
S Yip
T Forshew
T Stokland
TE Merchant
V Rand
W Müller
WM Lin
Y Okamoto
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

The most common pediatric brain tumors are low-grade gliomas (LGGs). We used whole-genome sequencing to identify multiple new genetic alterations involving BRAF, RAF1, FGFR1, MYB, MYBL1 and genes with histone-related functions, including H3F3A and ATRX, in 39 LGGs and low-grade glioneuronal tumors (LGGNTs). Only a single non-silent somatic alteration was detected in 24 of 39 (62%) tumors. Intragenic duplications of the portion of FGFR1 encoding the tyrosine kinase domain (TKD) and rearrangements of MYB were recurrent and mutually exclusive in 53% of grade II diffuse LGGs. Transplantation of Trp53-null neonatal astrocytes expressing FGFR1 with the duplication involving the TKD into the brains of nude mice generated high-grade astrocytomas with short latency and 100% penetrance. FGFR1 with the duplication induced FGFR1 autophosphorylation and upregulation of the MAPK/ERK and PI3K pathways, which could be blocked by specific inhibitors. Focusing on the therapeutically challenging diffuse LGGs, our study of 151 tumors has discovered genetic alterations and potential therapeutic targets across the entire range of pediatric LGGs and LGGNTs.Jinghui Zhang, Gang Wu, Claudia P Miller, Ruth G Tatevossian, James D Dalton, Bo Tang, Wilda Orisme, Chandanamali Punchihewa, Matthew Parker, Ibrahim Qaddoumi, Fredrick A Boop, Charles Lu, Cyriac Kandoth, Li Ding, Ryan Lee, Robert Huether, Xiang Chen, Erin Hedlund, Panduka Nagahawatte, Michael Rusch, Kristy Boggs, Jinjun Cheng, Jared Becksfort, Jing Ma, Guangchun Song, Yongjin Li, Lei Wei, Jianmin Wang, Sheila Shurtleff, John Easton, David Zhao, Robert S Fulton, Lucinda L Fulton, David J Dooling, Bhavin Vadodaria, Heather L Mulder, Chunlao Tang, Kerri Ochoa, Charles G Mullighan, Amar Gajjar, Richard Kriwacki, Denise Sheer, Richard J Gilbertson, Elaine R Mardis, Richard K Wilson, James R Downing, Suzanne J Baker and David W Elliso

Crossref

Adelaide Research & Scholarship

Queen Mary Research Online

Exploring the longitudinal dynamics of herd BVD antibody test results using model-based clustering

Author: A Komarek
A Reverter
A Reverter
C Genolini
C Heffernan
DC Koestler
F Brülisauer
GJ Gunn
JA Hartigan
JL Andrews
L Hubert
LG Fernandes
MC De Souto
N. Coffey
PD McNicholas
PD McNicholas
R. W. Humphry
S Vilcek
W Charoenlarp
WM Rand
X Chen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 06/08/2019
Field of study

Determining the Bovine Viral Diarrhoea (BVD) infection status of cattle herds is a challenge for control and eradication schemes. Given the changing dynamics of BVD virus (BVDV) antibody responses in cattle, classifying herds based on longitudinal changes in the results of BVDV antibody tests could offer a novel, complementary approach to categorising herds that is less likely than the present system to result in a herd’s status changing from year to year, as it is more likely to capture the true exposure dynamics of the farms. This paper describes the dynamics of BVDV antibody test values (measured as percentage positivity (PP)) obtained from 15,500 bovines between 2007 and 2010 from thirty nine cattle herds located in Scotland and Northern England. It explores approaches of classifying herds based on trend, magnitude and shape of their antibody PP trajectories and investigates the epidemiological similarities between farms within the same cluster. Gaussian mixture models were used for the magnitude and shape clustering. Epidemiologically meaningful clusters were obtained. Farm cluster membership depends on clustering approach used. Moderate concordance was found between the shape and magnitude clusters. These methods hold potential for application to enhance control efforts for BVD and other infectious livestock diseases

Crossref

Edinburgh Research Explorer

SRUC - Scotland's Rural College