17 research outputs found

Fast Gaussian Pairwise Constrained Spectral Clustering

Author: D.J. Klein
J. Shi
L. Hubert
M. Belkin
M. Saerens
S. Guattery
T. Bie De
U. Luxburg
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2014

We consider the problem of spectral clustering with partial supervision in the form of must-link and cannot-link constraints. Such pairwise constraints are common in problems like coreference resolution in natural language processing. The approach developed in this paper is to learn a new representation space for the data together with a distance in this new space. The representation space is obtained through a constraint-driven linear transformation of a spectral embedding of the data. Constraints are expressed with a Gaussian function that locally reweights the similarities in the projected space. A global, non-convex optimization objective is then derived and the model is learned via gradient descent techniques. Our algorithm is evaluated on standard datasets and compared with state-of-the-art algorithms such as [14,18,31]. Results on these datasets, as well as on the CoNLL-2012 coreference resolution shared task dataset, show that our algorithm significantly outperforms related approaches and is also much more scalable.

HAL - Lille 3

Crossref

INRIA a CCSD electronic archive server

HAL: Hyper Article en Ligne

Ensemble approach for generalized network dismantling

Author: A Braunstein
A Pothen
Aydın Buluç
BH Good
CM Schneider
D Marx
F Morone
G Dong
H-J Zhou
I Dinur
L Lü
L Tian
L Zdeborová
Lazaros K. Gallos
M Fiedler
M Matsumoto
N Antulov-Fantulin
R Albert
R Bar-Yehuda
R Cohen
R Lipton
R Pastor-Satorras
S Guattery
S Janson
S Mugisha
S Wandelt
S-M Qin
T Leighton
TN Bui
U Feige
W Ben-Ameur
X-L Ren
Y Chen
Publication date: 19/09/2019

Finding a set of nodes whose removal fragments a network below some target size at minimal cost is called the network dismantling problem, and it is NP-hard. In this paper, we explore the (generalized) network dismantling problem using a spectral approximation based on a variant of the power-iteration method. In particular, we explore the network dismantling solution landscape by creating an ensemble of possible solutions from different initial conditions and different numbers of iterations of the spectral approximation. Comment: 11 pages, 4 figures, 4 tables.
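
As a rough illustration of the kind of spectral approximation this abstract refers to, the sketch below estimates the second-smallest Laplacian eigenvalue and its eigenvector (the Fiedler vector) by plain power iteration on a shifted matrix. It is not the authors' algorithm: the shift `c`, the fixed starting vector, the iteration count, the helper names, and the six-vertex example graph are all assumptions made for this example, and the graph is assumed to be connected with at least two vertices.

```python
import math

def laplacian(n, edges):
    """Dense graph Laplacian as nested lists: degree on the diagonal, -1 per edge."""
    L = [[0.0] * n for _ in range(n)]
    for i, j in edges:
        L[i][j] -= 1.0
        L[j][i] -= 1.0
        L[i][i] += 1.0
        L[j][j] += 1.0
    return L

def center_and_normalize(x):
    """Remove the constant component (eigenvalue 0) and scale to unit length."""
    mean = sum(x) / len(x)
    x = [xi - mean for xi in x]
    norm = math.sqrt(sum(xi * xi for xi in x))
    return [xi / norm for xi in x]

def fiedler_power_iteration(L, iters=200):
    """Power iteration on c*I - L, restricted to vectors orthogonal to the all-ones vector."""
    n = len(L)
    # Assumed shift: 2 * max degree is an upper bound on the largest eigenvalue of L.
    c = 2.0 * max(L[i][i] for i in range(n))
    # Assumed deterministic start; a random start is the more common choice.
    x = center_and_normalize([float(i) for i in range(n)])
    for _ in range(iters):
        y = [c * x[i] - sum(L[i][j] * x[j] for j in range(n)) for i in range(n)]
        x = center_and_normalize(y)
    # Rayleigh quotient of the unit vector x gives the eigenvalue estimate.
    Lx = [sum(L[i][j] * x[j] for j in range(n)) for i in range(n)]
    lam2 = sum(x[i] * Lx[i] for i in range(n))
    return lam2, x

# Hypothetical example: two triangles joined by the single edge (2, 3).
edges = [(0, 1), (0, 2), (1, 2), (2, 3), (3, 4), (3, 5), (4, 5)]
lam2, vec = fiedler_power_iteration(laplacian(6, edges))
side_a = [i for i, value in enumerate(vec) if value < 0]
side_b = [i for i, value in enumerate(vec) if value >= 0]
print(round(lam2, 4), side_a, side_b)
```

On this example the sign pattern of the estimated vector separates the two triangles, so the bridge between vertices 2 and 3 is where a spectral cut would fall. Different starting vectors and iteration counts give different intermediate estimates, which is the kind of variation an ensemble of solutions could draw on.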

arXiv.org e-Print Archive

Crossref

Proteinortho: Detection of (Co-)orthologs in large-scale analysis

Author: A Alexeyenko
A Force
A Nakabachi
A Schneider
AE Hirsh
AJ Enright
C Lanczos
D Cornaz
DM Kristensen
E Pruesse
EV Koonin
IK Jordan
J Hopcroft
JP McCutcheon
L Li
Lydia Steiner
M Fiedler
M Remm
M Sikdar
Manja Marz
Marcus Lechner
MC Rivera
P Bork
Peter F Stadler
RL Tatusov
S Guattery
SM van Dongen
Sonja J Prohaska
Sven Findeiß
TJ Hubbard
WM Fitch
Z Fu
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: Orthology analysis is an important part of data analysis in many areas of bioinformatics such as comparative genomics and molecular phylogenetics. The ever-increasing flood of sequence data, and hence the rapidly increasing number of genomes that can be compared simultaneously, calls for efficient software tools, as brute-force approaches with quadratic memory requirements become infeasible in practice. The rapid pace at which new data become available, furthermore, makes it desirable to compute genome-wide orthology relations for a given dataset rather than relying on relations listed in databases. Results: The program Proteinortho described here is a stand-alone tool that is geared towards large datasets and makes use of distributed computing techniques when run on multi-core hardware. It implements an extended version of the reciprocal best alignment heuristic. We apply Proteinortho to compute orthologous proteins in the complete set of all 717 eubacterial genomes available at NCBI at the beginning of 2009. We identified thirty proteins present in 99% of all bacterial proteomes. Conclusions: Proteinortho significantly reduces the required amount of memory for orthology analysis compared to existing tools, allowing such computations to be performed on off-the-shelf hardware.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

Fraunhofer-Publica

PubMed Central

Permanent Hosting, Archiving and Indexing of Digital Resources and Assets

The Path Resistance Method for Bounding the Smallest Nontrivial Eigenvalue of a Laplacian

Author: G. L. Miller
S. Guattery
T. Leighton
Publication venue: Cambridge University Press (CUP)
Publication date: 01/09/1999

We introduce the path resistance method for lower bounds on the smallest nontrivial eigenvalue of the Laplacian matrix of a graph. The method is based on viewing the graph in terms of electrical circuits: it uses clique embeddings to produce lower bounds on λ2 and star embeddings to produce lower bounds on the smallest Rayleigh quotient when there is a zero Dirichlet boundary condition. The method assigns priorities to the paths in the embedding; we show that, for an unweighted tree T, using uniform priorities for a clique embedding produces a lower bound on λ2 that is off by at most an O(log diameter(T)) factor. We show that the best bounds this method can produce for clique embeddings are the same as for a related method that uses clique embeddings and edge lengths to produce bounds.

Crossref

The Path Resistance Method for Bounding the Smallest Nontrivial Eigenvalue of a Laplacian

Author: G. L. Miller
S. Guattery
T. Leighton

In this paper we consider methods based on graph embeddings for estimating the smallest nontrivial eigenvalue of the Laplacian matrix representation of a graph. The Laplacian is one of many ways to view a graph as a matrix; it is defined as follows: Let $G = (V, E)$ be an undirected graph with vertices $v_1, \ldots, v_n$. Then the Laplacian of $G$ is an $n \times n$ matrix $L$ such that $l_{ij} = \begin{cases} \mathrm{degree}(v_i) & \text{if } i = j \\ -1 & \text{if } (i, j) \in E \\ 0 & \text{otherwise} \end{cases}$. A version of this paper originally appeared in the Proceedings of the Eighth Annual ACM/SIAM Symposium on Discrete Algorithms.
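
The Laplacian definition in this abstract (vertex degree on the diagonal, -1 for each edge, 0 elsewhere) can be made concrete with a small sketch. The function names, the 0-based vertex numbering, and the four-vertex path graph are assumptions chosen for illustration; they do not come from the paper. The sketch also checks the standard identity that the quadratic form x^T L x equals the sum of (x_i - x_j)^2 over the edges, which is what Rayleigh-quotient bounds on the smallest nontrivial eigenvalue build on.

```python
def laplacian(n, edges):
    """Laplacian of an undirected simple graph on vertices 0..n-1, as nested lists."""
    L = [[0] * n for _ in range(n)]
    # Deduplicate edges so (i, j) and (j, i) count once; self-loops are skipped.
    for i, j in set(tuple(sorted(edge)) for edge in edges):
        if i == j:
            continue
        L[i][i] += 1      # each edge adds one to the degree of both endpoints
        L[j][j] += 1
        L[i][j] = -1      # off-diagonal entry for an edge
        L[j][i] = -1
    return L

def quadratic_form(L, x):
    """Compute x^T L x for a dense matrix L and a vector x."""
    n = len(L)
    return sum(x[i] * L[i][j] * x[j] for i in range(n) for j in range(n))

# Hypothetical example: the path graph 0 - 1 - 2 - 3.
path_edges = [(0, 1), (1, 2), (2, 3)]
L_path = laplacian(4, path_edges)
for row in L_path:
    print(row)

x = [3, 1, 4, 1]
edge_sum = sum((x[i] - x[j]) ** 2 for i, j in path_edges)
print(quadratic_form(L_path, x), edge_sum)  # both values are 22
```

Every row of the matrix sums to zero, so the all-ones vector is an eigenvector with eigenvalue 0; the "smallest nontrivial eigenvalue" is the next one up.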

CiteSeerX

The Path Resistance Method for Bounding the Smallest Nontrivial Eigenvalue of a Laplacian

Author: G. L. Miller
S. Guattery
T. Leighton
Publication venue: Cambridge University Press (CUP)

Crossref

Finding Cliques in Directed Weighted Graphs Using Complex Hermitian Adjacency Matrices

Author: A. Geyer-Schulz
G. Karypis
H.D. Simon
M. Fiedler
R. Kannan
S. Guattery
S.E. Karisch
T. Choe
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2007

Crossref

PageRank and random walks on graphs

Author: C. St. J. A. Nash-Williams
F. Chung
G. Kirchhoff
H. Haveliwala
L. Lovász
S. Brin
S. Guattery
Publication date: 01/01/2010

Dedicated to Lovász on the occasion of his sixtieth birthday. We examine the relationship between PageRank and several invariants occurring in the study of random walks and electrical networks. We consider a generalized version of hitting time and effective resistance with an additional parameter which controls the ‘speed’ of diffusion. We establish their connection with PageRank. Through these connections, a combinatorial interpretation of PageRank is given in terms of rooted spanning forests by using a generalized version of the matrix-tree theorem. Using PageRank, we illustrate that the generalized hitting time leads to finding sparse cuts, and that efficient approximation algorithms for PageRank can be used for approximating hitting time and effective resistance.
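
For readers unfamiliar with PageRank as a random-walk quantity, here is a minimal sketch of the textbook iteration with a uniform jump distribution. It does not implement the generalized hitting time or effective resistance studied in the paper. The jump probability `alpha`, the iteration count, the adjacency-dictionary input format, and the star-graph example are all assumptions made for illustration, and every vertex is assumed to have at least one neighbour.

```python
def pagerank(adj, alpha=0.15, iters=100):
    """Textbook PageRank on an undirected graph given as {vertex: [neighbours]}.

    With probability alpha the walk jumps to a uniformly random vertex;
    otherwise it moves to a uniformly random neighbour of the current vertex.
    """
    nodes = sorted(adj)
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}          # start from the uniform distribution
    for _ in range(iters):
        nxt = {v: alpha / n for v in nodes}     # mass arriving via the random jump
        for u in nodes:
            share = (1.0 - alpha) * rank[u] / len(adj[u])
            for v in adj[u]:
                nxt[v] += share                 # mass arriving via a walk step from u
        rank = nxt
    return rank

# Hypothetical example: a star with centre 0 and leaves 1, 2, 3.
star = {0: [1, 2, 3], 1: [0], 2: [0], 3: [0]}
ranks = pagerank(star)
for vertex in sorted(ranks):
    print(vertex, round(ranks[vertex], 4))
```

The values always sum to 1, and on the star the centre receives the largest share because every walk step from a leaf lands on it.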

CiteSeerX

Crossref

Empirical Evaluation of Graph Partitioning Using Spectral Embeddings and Flow

Author: A. Goldberg
B. Cherkassky
C. Walshaw
F. Chung
G. Karypis
N. Alon
S. Arora
S. Guattery
T. Leighton
Publication date: 01/01/2009

Abstract. We present initial results from the first empirical evaluation of a graph partitioning algorithm inspired by the Arora-Rao-Vazirani algorithm of [5], which combines spectral and flow methods in a novel way. We have studied the parameter space of this new algorithm, e.g., examining the extent to which different parameter settings interpolate between a more spectral and a more flow-based approach, and we have compared results of this algorithm to results from previously known and optimized algorithms such as Metis.

CiteSeerX

Crossref

On the maximal error of spectral approximation of graph bisection

Author: Fiedler M
Guattery S
John C. Urschel
Ludmil T. Zikatanov
Mohar B
Spielman DA
Wei Y-C
Publication venue: Informa UK Limited

Crossref