Convolutional Networks for Fast, Energy-Efficient Neuromorphic Computing
Deep networks are now able to achieve human-level performance on a broad
spectrum of recognition tasks. Independently, neuromorphic computing has now
demonstrated unprecedented energy-efficiency through a new chip architecture
based on spiking neurons, low precision synapses, and a scalable communication
network. Here, we demonstrate that neuromorphic computing, despite its novel
architectural primitives, can implement deep convolution networks that i)
approach state-of-the-art classification accuracy across 8 standard datasets,
encompassing vision and speech, ii) perform inference while preserving the
hardware's underlying energy-efficiency and high throughput, running on the
aforementioned datasets at between 1200 and 2600 frames per second and using
between 25 and 275 mW (effectively > 6000 frames / sec / W) and iii) can be
specified and trained using backpropagation with the same ease-of-use as
contemporary deep learning. For the first time, the algorithmic power of deep
learning can be merged with the efficiency of neuromorphic processors, bringing
the promise of embedded, intelligent, brain-inspired computing one step closer.
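As a quick sanity check on the efficiency figure quoted above, the sketch below converts throughput and power draw into frames/sec/W. The abstract does not pair a specific frame rate with a specific power figure per dataset, so the endpoints used here are purely illustrative.

```python
# Minimal sketch (not from the paper): the arithmetic behind a
# "frames / sec / W" efficiency figure, using the quoted ranges as bounds.

def frames_per_sec_per_watt(frames_per_sec: float, power_mw: float) -> float:
    """Convert throughput and power draw (in mW) into frames per second per watt."""
    return frames_per_sec / (power_mw / 1000.0)  # mW -> W

# Illustrative endpoints of the ranges quoted in the abstract; actual
# per-dataset pairings of throughput and power are not given there.
print(frames_per_sec_per_watt(1200, 25))   # ~48000 frames/s/W
print(frames_per_sec_per_watt(2600, 275))  # ~9455 frames/s/W, still > 6000
```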
On relay placement for deterministic line networks
We consider a unicast communication problem where a source transmits information to a destination through a wireless network with the help of k relays positioned on a line. We adopt the linear deterministic model to capture the wireless signal interactions and study the optimal placement of the relays so that the capacity from the source to the destination in the deterministic network is maximized. Analytical results are provided for a number of special cases, and the insights gained are used to provide a heuristic framework for designing large relay networks.
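A minimal sketch of the placement problem, under stated simplifications and not the paper's algorithm: it assumes a pure cascade (each node hears only its neighbors), a hypothetical link_capacity(d) mapping hop distance to integer bit levels, and a brute-force grid search over relay positions that maximizes the bottleneck (min-cut) hop. The full linear deterministic model with broadcast and interference is more involved.

```python
from itertools import combinations
import math

def link_capacity(distance: float, alpha: float = 6.0, gamma: float = 2.0) -> int:
    """Hypothetical distance-to-bit-levels map: capacity decays with path loss."""
    return max(0, math.floor(alpha - gamma * math.log2(max(distance, 1e-9))))

def best_placement(k: int, length: float = 1.0, grid: int = 100):
    """Exhaustively search grid positions for k relays maximizing the bottleneck hop."""
    candidates = [i * length / grid for i in range(1, grid)]
    best_cap, best_pos = -1, None
    for pos in combinations(candidates, k):
        nodes = [0.0, *pos, length]                    # source, relays, destination
        hops = [nodes[i + 1] - nodes[i] for i in range(len(nodes) - 1)]
        cap = min(link_capacity(d) for d in hops)      # series network: min-cut
        if cap > best_cap:
            best_cap, best_pos = cap, pos
    return best_cap, best_pos

print(best_placement(k=2))   # equal spacing wins under this monotone capacity model
```

The brute-force search is only meant to make the objective concrete; for large k one would replace it with the heuristics the abstract alludes to.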
Analyzing the impact of system architecture on the scalability of OLTP engines for high-contention workloads
Vispark: GPU-accelerated distributed visual computing using spark
With the growing need for big-data processing in diverse application domains, MapReduce (e.g., Hadoop) has become one of the standard computing paradigms for large-scale computing on cluster systems. Despite its popularity, the current MapReduce framework suffers from inflexibility and inefficiency inherent to its programming model and system architecture. To address these problems, we propose Vispark, a novel extension of Spark for GPU-accelerated MapReduce processing on array-based scientific computing and image processing tasks. Vispark provides an easy-to-use, Python-like high-level language syntax and a novel data abstraction for MapReduce programming on a GPU cluster system. Vispark introduces a programming abstraction for accessing neighbor data in the mapper function, which greatly simplifies many image processing tasks using MapReduce by reducing memory footprints and bypassing the reduce stage. Vispark provides socket-based halo communication that synchronizes data partitions transparently to the user, which is necessary for many scientific computing problems in distributed systems. Vispark also provides domain-specific functions and language support specifically designed for high-performance computing and image processing applications. We demonstrate the performance of our prototype system on several visual computing tasks, such as image processing, volume rendering, K-means clustering, and heat transfer simulation.
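A minimal NumPy sketch, not Vispark's actual syntax, of the neighbor-access and halo idea described above: each partition is padded with a one-cell halo copied from its neighbors, so a per-partition mapper (here a 3-point stencil, the core of a heat-diffusion step) can read neighbor data without a reduce stage. Vispark performs this exchange over sockets, transparently to the user.

```python
import numpy as np

def exchange_halos(parts):
    """Return partitions padded with one ghost cell taken from adjacent partitions."""
    padded = []
    for i, p in enumerate(parts):
        left = parts[i - 1][-1] if i > 0 else p[0]            # clamp at domain edges
        right = parts[i + 1][0] if i < len(parts) - 1 else p[-1]
        padded.append(np.concatenate(([left], p, [right])))
    return padded

def stencil_map(padded):
    """Per-partition mapper: 3-point average over the padded array."""
    return (padded[:-2] + padded[1:-1] + padded[2:]) / 3.0

field = np.linspace(0.0, 1.0, 16)
parts = np.array_split(field, 4)                              # 4 'workers'
result = np.concatenate([stencil_map(p) for p in exchange_halos(parts)])
print(result)
```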
Investigating Guided Extensive Reading and Vocabulary Knowledge Performance among Remedial ESL Learners in a Public University in Malaysia
Research supports extensive reading, which draws on incidental learning, as a primary tool for second/foreign language vocabulary knowledge development.
Joint timing synchronization and channel estimation based on ZCZ sequence set in SC-MIMO-FDE system
Sequence Alignment Through the Looking Glass
Rapid advances in sequencing technologies are producing genomic data on an unprecedented scale. The first, and often one of the most time-consuming, steps of genomic data analysis is sequence alignment, where sequenced reads must be aligned to a reference genome. Several years of research on alignment algorithms have led to the development of several state-of-the-art sequence aligners that can map tens of thousands of reads per second. In this work, we answer the question “How do sequence aligners utilize modern processors?” We examine four state-of-the-art aligners running on an Intel processor and identify that all aligners leave the processor substantially underutilized. We perform an in-depth microarchitectural analysis to explore the interaction between aligner software and processor hardware. We identify bottlenecks that lead to processor underutilization and discuss the implications of our analysis for next-generation sequence aligner design.
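A minimal sketch, purely illustrative and not taken from the paper, of the kind of measurement such an analysis starts from: run an aligner under Linux `perf stat` and derive instructions per cycle (IPC) as a first-order utilization signal. The aligner command line at the bottom is a placeholder; any read-mapping invocation could be substituted.

```python
import subprocess

def measure_ipc(cmd):
    """Run cmd under `perf stat` (CSV output via -x,) and return IPC and cache misses."""
    perf = ["perf", "stat", "-x,", "-e", "instructions,cycles,cache-misses", "--"] + cmd
    out = subprocess.run(perf, capture_output=True, text=True).stderr  # perf prints to stderr
    counts = {}
    for line in out.splitlines():
        fields = line.split(",")
        if len(fields) > 2 and fields[0].strip().isdigit():   # skip <not supported> etc.
            counts[fields[2]] = int(fields[0])
    ipc = counts.get("instructions", 0) / max(counts.get("cycles", 1), 1)
    return ipc, counts.get("cache-misses")

# Hypothetical invocation; replace with the aligner and inputs under study.
print(measure_ipc(["bwa", "mem", "ref.fa", "reads.fq"]))
```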
