Fooling intersections of low-weight halfspaces

Authors: Rocco A. Servedio, Li-Yang Tan
Publication date: 16/04/2017

A weight-$t$ halfspace is a Boolean function $f(x)=\mathrm{sign}(w_1 x_1 + \cdots + w_n x_n - \theta)$ where each $w_i$ is an integer in $\{-t,\dots,t\}$. We give an explicit pseudorandom generator that $\delta$-fools any intersection of $k$ weight-$t$ halfspaces with seed length $\mathrm{poly}(\log n, \log k, t, 1/\delta)$. In particular, our result gives an explicit PRG that fools any intersection of any $\mathrm{quasipoly}(n)$ number of halfspaces of any $\mathrm{polylog}(n)$ weight to any $1/\mathrm{polylog}(n)$ accuracy using seed length $\mathrm{polylog}(n)$. Prior to this work no explicit PRG with non-trivial seed length was known even for fooling intersections of $n$ weight-1 halfspaces to constant accuracy. The analysis of our PRG fuses techniques from two different lines of work on unconditional pseudorandomness for different kinds of Boolean functions. We extend the approach of Harsha, Klivans and Meka \cite{HKM12} for fooling intersections of regular halfspaces, and combine this approach with results of Bazzi \cite{Bazzi:07} and Razborov \cite{Razborov:09} on bounded independence fooling CNF formulas. Our analysis introduces new coupling-based ingredients into the standard Lindeberg method for establishing quantitative central limit theorems and associated pseudorandomness results.

Comment: 27 pages
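As a concrete reading of the definition in the abstract above, the following minimal Python sketch evaluates a weight-$t$ halfspace and an intersection of several of them. It is only an illustration of the function class, not of the paper's pseudorandom generator. The function names, the choice of inputs in $\{-1,1\}^n$, and the convention that a value of exactly 0 maps to $+1$ are assumptions made here, not taken from the abstract.

```python
from typing import List, Sequence, Tuple


def is_weight_t(w: Sequence[int], t: int) -> bool:
    """Check that every weight is an integer in {-t, ..., t}."""
    return all(isinstance(wi, int) and -t <= wi <= t for wi in w)


def halfspace(w: Sequence[int], theta: float, x: Sequence[int]) -> int:
    """Return sign(w_1 x_1 + ... + w_n x_n - theta) as +1 or -1.

    Assumption: a value of exactly 0 is mapped to +1.
    """
    s = sum(wi * xi for wi, xi in zip(w, x)) - theta
    return 1 if s >= 0 else -1


def intersection(halfspaces: List[Tuple[Sequence[int], float]],
                 x: Sequence[int]) -> bool:
    """True iff x lies in every halfspace (each one evaluates to +1)."""
    return all(halfspace(w, theta, x) == 1 for w, theta in halfspaces)


# Two weight-1 halfspaces over three variables (hypothetical example data).
example = [([1, 1, 0], 0.5), ([-1, 0, 1], -0.5)]
```

With this example data, the point `(1, 1, 1)` satisfies both halfspaces, while `(1, 1, -1)` fails the second one.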

arXiv.org e-Print Archive

Crossref

Testing probability distributions using conditional samples

Authors: Clement Canonne, Dana Ron, Rocco A. Servedio
Publication date: 01/01/2015

We study a new framework for property testing of probability distributions, by considering distribution testing algorithms that have access to a conditional sampling oracle.* This is an oracle that takes as input a subset $S \subseteq [N]$ of the domain $[N]$ of the unknown probability distribution $D$ and returns a draw from the conditional probability distribution $D$ restricted to $S$. This new model allows considerable flexibility in the design of distribution testing algorithms; in particular, testing algorithms in this model can be adaptive. We study a wide range of natural distribution testing problems in this new framework and some of its variants, giving both upper and lower bounds on query complexity. These problems include testing whether $D$ is the uniform distribution $\mathcal{U}$; testing whether $D = D^\ast$ for an explicitly provided $D^\ast$; testing whether two unknown distributions $D_1$ and $D_2$ are equivalent; and estimating the variation distance between $D$ and the uniform distribution. At a high level our main finding is that the new "conditional sampling" framework we consider is a powerful one: while all the problems mentioned above have $\Omega(\sqrt{N})$ sample complexity in the standard model (and in some cases the complexity must be almost linear in $N$), we give $\mathrm{poly}(\log N, 1/\varepsilon)$-query algorithms (and in some cases $\mathrm{poly}(1/\varepsilon)$-query algorithms independent of $N$) for all these problems in our conditional sampling setting.

*Independently from our work, Chakraborty et al. also considered this framework. We discuss their work in Subsection [1.4].

Comment: Significant changes on Section 9 (detailing and expanding the proof of Theorem 16). Several clarifications and typos fixed in various places.
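The conditional sampling oracle described in the abstract above can be simulated when the distribution is known explicitly, which may help make the model concrete. The sketch below is a hedged illustration only: the name `cond_sample`, the dictionary representation of $D$, and the choice to raise an error when $D(S) = 0$ are assumptions made here (the abstract does not say how that case is handled).

```python
import random
from typing import Dict, Iterable, Optional


def cond_sample(D: Dict[int, float], S: Iterable[int],
                rng: Optional[random.Random] = None) -> int:
    """Draw one element from D conditioned on the subset S.

    D maps domain elements to probabilities. Assumption: if S carries
    no probability mass under D, we raise ValueError.
    """
    rng = rng or random.Random()
    # Keep only the elements of S that D can actually produce.
    support = [i for i in sorted(S) if D.get(i, 0.0) > 0.0]
    if not support:
        raise ValueError("D(S) = 0: conditional distribution is undefined")
    weights = [D[i] for i in support]
    # random.choices normalizes the weights, which is exactly conditioning on S.
    return rng.choices(support, weights=weights, k=1)[0]


# Hypothetical distribution over the domain {1, 2, 3}.
D_example = {1: 0.5, 2: 0.25, 3: 0.25}
```

A tester in this model would choose each query set `S` adaptively, based on earlier answers; for instance, querying `cond_sample(D_example, {2, 3})` compares the relative mass of two specific points without ever drawing element 1.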

arXiv.org e-Print Archive

Crossref

Efficient deterministic approximate counting for low-degree polynomial threshold functions

Authors: Anindya De, Rocco Servedio
Publication date: 27/11/2013

We give a deterministic algorithm for approximately counting satisfying assignments of a degree-$d$ polynomial threshold function (PTF). Given a degree-$d$ input polynomial $p(x_1,\dots,x_n)$ over $\mathbb{R}^n$ and a parameter $\epsilon > 0$, our algorithm approximates $\Pr_{x \sim \{-1,1\}^n}[p(x) \geq 0]$ to within an additive $\pm \epsilon$ in time $O_{d,\epsilon}(1)\cdot \mathrm{poly}(n^d)$. (Any sort of efficient multiplicative approximation is impossible even for randomized algorithms assuming $\mathsf{NP} \neq \mathsf{RP}$.) Note that the running time of our algorithm (as a function of $n^d$, the number of coefficients of a degree-$d$ PTF) is a \emph{fixed} polynomial. The fastest previous algorithm for this problem (due to Kane), based on constructions of unconditional pseudorandom generators for degree-$d$ PTFs, runs in time $n^{O_{d,c}(1) \cdot \epsilon^{-c}}$ for all $c > 0$.

The key novel contributions of this work are: (1) A new multivariate central limit theorem, proved using tools from Malliavin calculus and Stein's Method. This new CLT shows that any collection of Gaussian polynomials with small eigenvalues must have a joint distribution which is very close to a multidimensional Gaussian distribution. (2) A new decomposition of low-degree multilinear polynomials over Gaussian inputs. Roughly speaking we show that (up to some small error) any such polynomial can be decomposed into a bounded number of multilinear polynomials all of which have extremely small eigenvalues. We use these new ingredients to give a deterministic algorithm for a Gaussian-space version of the approximate counting problem, and then employ standard techniques for working with low-degree PTFs (invariance principles and regularity lemmas) to reduce the original approximate counting problem over the Boolean hypercube to the Gaussian version.
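To make the counted quantity concrete, the sketch below computes $\Pr_{x \sim \{-1,1\}^n}[p(x) \geq 0]$ exactly by enumerating all $2^n$ inputs. This is a brute-force baseline that takes exponential time and is in no way the algorithm of the paper; it only shows what is being approximated. The function name and the example polynomial are hypothetical.

```python
from fractions import Fraction
from itertools import product
from typing import Callable, Sequence


def ptf_acceptance_probability(p: Callable[[Sequence[int]], float],
                               n: int) -> Fraction:
    """Exact Pr over uniform x in {-1,1}^n that p(x) >= 0.

    Enumerates all 2^n points, so it is only usable for small n.
    """
    hits = sum(1 for x in product((-1, 1), repeat=n) if p(x) >= 0)
    return Fraction(hits, 2 ** n)


def example_poly(x: Sequence[int]) -> float:
    """A degree-2 polynomial over three variables: x0*x1 + x2 - 1."""
    return x[0] * x[1] + x[2] - 1
```

For `example_poly`, the threshold is met only when `x0 * x1 == 1` and `x2 == 1`, so the exact probability is 1/4.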

arXiv.org e-Print Archive

Crossref

An average-case depth hierarchy theorem for Boolean circuits

Authors: Benjamin Rossman, Rocco A. Servedio, Li-Yang Tan
Publication date: 13/04/2015

We prove an average-case depth hierarchy theorem for Boolean circuits over the standard basis of $\mathsf{AND}$, $\mathsf{OR}$, and $\mathsf{NOT}$ gates. Our hierarchy theorem says that for every $d \geq 2$, there is an explicit $n$-variable Boolean function $f$, computed by a linear-size depth-$d$ formula, which is such that any depth-$(d-1)$ circuit that agrees with $f$ on a $(1/2 + o_n(1))$ fraction of all inputs must have size $\exp({n^{\Omega(1/d)}})$. This answers an open question posed by Håstad in his Ph.D. thesis. Our average-case depth hierarchy theorem implies that the polynomial hierarchy is infinite relative to a random oracle with probability 1, confirming a conjecture of Håstad, Cai, and Babai. We also use our result to show that there is no "approximate converse" to the results of Linial, Mansour, Nisan and Boppana on the total influence of small-depth circuits, thus answering a question posed by O'Donnell, Kalai, and Hatami. A key ingredient in our proof is a notion of \emph{random projections} which generalize random restrictions.

arXiv.org e-Print Archive

Crossref

Efficiency versus Convergence of Boolean Kernels for On-Line Learning Algorithms

Authors: R. Khardon, D. Roth, R. A. Servedio
Publication venue: 'AI Access Foundation'
Publication date: 09/09/2011

The paper studies machine learning problems where each example is described using a set of Boolean features and where hypotheses are represented by linear threshold elements. One method of increasing the expressiveness of learned hypotheses in this context is to expand the feature set to include conjunctions of basic features. This can be done explicitly or, where possible, by using a kernel function. Focusing on the well-known Perceptron and Winnow algorithms, the paper demonstrates a tradeoff between the computational efficiency with which the algorithm can be run over the expanded feature space and the generalization ability of the corresponding learning algorithm. We first describe several kernel functions which capture either limited forms of conjunctions or all conjunctions. We show that these kernels can be used to efficiently run the Perceptron algorithm over a feature space of exponentially many conjunctions; however, we also show that using such kernels, the Perceptron algorithm can provably make an exponential number of mistakes even when learning simple functions. We then consider the question of whether kernel functions can analogously be used to run the multiplicative-update Winnow algorithm over an expanded feature space of exponentially many conjunctions. Known upper bounds imply that the Winnow algorithm can learn Disjunctive Normal Form (DNF) formulae with a polynomial mistake bound in this setting. However, we prove that it is computationally hard to simulate Winnow's behavior for learning DNF over such a feature set. This implies that the kernel functions which correspond to running Winnow for this problem are not efficiently computable, and that there is no general construction that can run Winnow with kernels.
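The idea of running Perceptron over exponentially many conjunction features through a kernel can be sketched as follows. The abstract does not spell out its kernels, so this example assumes one natural choice: for monotone conjunctions (subsets of variables, including the empty one) over inputs in $\{0,1\}^n$, the number of conjunctions satisfied by both $x$ and $y$ is $2^{c}$, where $c$ is the number of positions where both are 1. All names below are hypothetical, and this is an illustration rather than the paper's construction.

```python
from itertools import combinations
from typing import List, Sequence, Tuple

Example = Tuple[Sequence[int], int]  # (input in {0,1}^n, label in {-1,+1})


def monotone_conj_kernel(x: Sequence[int], y: Sequence[int]) -> int:
    """Count monotone conjunctions true on both x and y: 2^(#common ones)."""
    common = sum(1 for a, b in zip(x, y) if a == 1 and b == 1)
    return 2 ** common


def explicit_features(x: Sequence[int]) -> List[int]:
    """All 2^n monotone conjunction features, for checking the kernel."""
    n = len(x)
    return [int(all(x[i] == 1 for i in subset))
            for r in range(n + 1)
            for subset in combinations(range(n), r)]


def predict(mistakes: List[Example], x: Sequence[int]) -> int:
    """Sign of the kernel expansion over stored mistakes (0 maps to -1)."""
    score = sum(yi * monotone_conj_kernel(xi, x) for xi, yi in mistakes)
    return 1 if score > 0 else -1


def kernel_perceptron(examples: List[Example], epochs: int = 100) -> List[Example]:
    """Kernel Perceptron: store each misclassified example as an update."""
    mistakes: List[Example] = []
    for _ in range(epochs):
        clean = True
        for x, y in examples:
            if predict(mistakes, x) != y:
                mistakes.append((x, y))
                clean = False
        if clean:  # a full pass with no mistakes: training set is fit
            break
    return mistakes


# Hypothetical training set: the target is the conjunction x0 AND x1.
and_data: List[Example] = [((0, 0), -1), ((0, 1), -1), ((1, 0), -1), ((1, 1), 1)]
```

Each kernel evaluation costs time linear in $n$ even though the implicit feature space has $2^n$ coordinates, which is the efficiency side of the tradeoff the abstract describes; the mistake-bound side is not captured by this small example.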

arXiv.org e-Print Archive

Crossref