Search CORE

2,631 research outputs found

Guaranteed Rank Minimization via Singular Value Projection

Author: Dhillon Inderjit S.
Jain Prateek
Meka Raghu
Publication venue
Publication date: 01/01/2009
Field of study

Minimizing the rank of a matrix subject to affine constraints is a fundamental problem with many important applications in machine learning and statistics. In this paper we propose a simple and fast algorithm SVP (Singular Value Projection) for rank minimization with affine constraints (ARMP) and show that SVP recovers the minimum rank solution for affine constraints that satisfy the "restricted isometry property" and show robustness of our method to noise. Our results improve upon a recent breakthrough by Recht, Fazel and Parillo (RFP07) and Lee and Bresler (LB09) in three significant ways: 1) our method (SVP) is significantly simpler to analyze and easier to implement, 2) we give recovery guarantees under strictly weaker isometry assumptions 3) we give geometric convergence guarantees for SVP even in presense of noise and, as demonstrated empirically, SVP is significantly faster on real-world and synthetic problems. In addition, we address the practically important problem of low-rank matrix completion (MCP), which can be seen as a special case of ARMP. We empirically demonstrate that our algorithm recovers low-rank incoherent matrices from an almost optimal number of uniformly sampled entries. We make partial progress towards proving exact recovery and provide some intuition for the strong performance of SVP applied to matrix completion by showing a more restricted isometry property. Our algorithm outperforms existing methods, such as those of \cite{RFP07,CR08,CT09,CCS08,KOM09,LB09}, for ARMP and the matrix-completion problem by an order of magnitude and is also significantly more robust to noise.Comment: An earlier version of this paper was submitted to NIPS-2009 on June 5, 200

arXiv.org e-Print Archive

CiteSeerX

A Divide-and-Conquer Solver for Kernel Support Vector Machines

Author: Dhillon Inderjit S.
Hsieh Cho-Jui
Si Si
Publication venue
Publication date: 04/11/2013
Field of study

The kernel support vector machine (SVM) is one of the most widely used classification methods; however, the amount of computation required becomes the bottleneck when facing millions of samples. In this paper, we propose and analyze a novel divide-and-conquer solver for kernel SVMs (DC-SVM). In the division step, we partition the kernel SVM problem into smaller subproblems by clustering the data, so that each subproblem can be solved independently and efficiently. We show theoretically that the support vectors identified by the subproblem solution are likely to be support vectors of the entire kernel SVM problem, provided that the problem is partitioned appropriately by kernel clustering. In the conquer step, the local solutions from the subproblems are used to initialize a global coordinate descent solver, which converges quickly as suggested by our analysis. By extending this idea, we develop a multilevel Divide-and-Conquer SVM algorithm with adaptive clustering and early prediction strategy, which outperforms state-of-the-art methods in terms of training speed, testing accuracy, and memory usage. As an example, on the covtype dataset with half-a-million samples, DC-SVM is 7 times faster than LIBSVM in obtaining the exact SVM solution (to within

10^{-6}

relative error) which achieves 96.15% prediction accuracy. Moreover, with our proposed early prediction strategy, DC-SVM achieves about 96% accuracy in only 12 minutes, which is more than 100 times faster than LIBSVM

arXiv.org e-Print Archive

CiteSeerX

IT and the NHS: Investigating different perspectives of IT using soft systems methodology

Author: Gillies Alan Cameron
Patel Inderjit
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 03/12/2009
Field of study

The UK NHS National Programme for IT has been criticized for a lack of clinical engagement. This paper uses a soft systems methodology (SSM) analysis of a case study from the use of electronic systems within a National Health Service (NHS) Mental Health Trust in the United Kingdom (UK) to explore the legal and ethical implications of the failure to develop clinical systems which are fit for purpose. Soft systems methodology (SSM) was used as a theoretical model both to derive deeper insights into the survey data and suggest how communication between those producing information and those using it, could be improved. Multiple methods were employed which included a postal survey and participant interviews to triangulate the data The use of SSM reinforced the concept that the national IT programme is based on a 'hard' systems view and does not take local factors (which are related to 'soft systems' thinking) into account. The study found administrative staff to be a crucial link between clinicians and information departments and highlighted the need for a joint-up information strategy and integrated systems. The article concludes with a discussion of the legal and ethical implications of the findings and the lessons for the broader UK national programme. It argues that the failure to deliver systems that are fit for purpose is not value neutral but an ethical issue

CLoK

Crossref

Online Embedding Compression for Text Classification using Low Rank Matrix Factorization

Author: Acharya Anish
Dhillon Inderjit
Goel Rahul
Metallinou Angeliki
Publication venue
Publication date: 01/11/2018
Field of study

Deep learning models have become state of the art for natural language processing (NLP) tasks, however deploying these models in production system poses significant memory constraints. Existing compression methods are either lossy or introduce significant latency. We propose a compression method that leverages low rank matrix factorization during training,to compress the word embedding layer which represents the size bottleneck for most NLP models. Our models are trained, compressed and then further re-trained on the downstream task to recover accuracy while maintaining the reduced size. Empirically, we show that the proposed method can achieve 90% compression with minimal impact in accuracy for sentence classification tasks, and outperforms alternative methods like fixed-point quantization or offline word embedding compression. We also analyze the inference time and storage space for our method through FLOP calculations, showing that we can compress DNN models by a configurable ratio and regain accuracy loss without introducing additional latency compared to fixed point quantization. Finally, we introduce a novel learning rate schedule, the Cyclically Annealed Learning Rate (CALR), which we empirically demonstrate to outperform other popular adaptive learning rate algorithms on a sentence classification benchmark.Comment: Accepted in Thirty-Third AAAI Conference on Artificial Intelligence (AAAI 2019

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Can Communist economies transform incrementally? China's experience

Author: Gelb Alan
Jefferson Gary
Singh Inderjit
Publication venue
Publication date
Field of study

The authors try to answer important questions. How important is the phasing of political and economic liberalization and the active (versus passive) role of the state in reform? What lessons can be learned about comprehensive top-down reform as opposed to experimental bottom-up reforms; fast versus slow liberalization and opening up of the economy; the need to establish full private property rights at the beginning of reform; and reform's implications for welfare and distribution? Can China's excellent performance be linked toparticular reform measure, or does it reflect distinctive initial conditions or social or demographic factors? Is China's performance sustainable without more comprehensive transformation, or does it reflect transient gains that are substantially exhausted? Among the lessons China offers are the following. Partial reform can succeed in raising productivity in agriculture and industry; industrial productivity has grown very rapidly in the nonstate sector but also in state enterprises. A"big bang"is not economically necessary unless justified by the need to address macroeconomic imbalances. There may be virtue in a decentralized"bottom-up"approach to reform. Rapid privatization is not necessary for successful reform, but it is important to diversify ownership and encourage the entry of new firms. Small scale privatization and the liberalization of distribution and service sectors are likely to have the fastest payoff in the reform of property rights. China's rapid growth momentum and macroeconomic stability cannot be sustained without further reforms, including the reform of banking, taxation, and property rights.Environmental Economics&Policies,Economic Theory&Research,Banks&Banking Reform,Health Monitoring&Evaluation,Municipal Financial Management

Research Papers in Economics