12,417 research outputs found
Defining Least Community as a Homogeneous Group in Complex Networks
This paper introduces a new concept of least community that is as homogeneous
as a random graph, and develops a new community detection algorithm from the
perspective of homogeneity or heterogeneity. Based on this concept, we adopt
head/tail breaks - a newly developed classification scheme for data with a
heavy-tailed distribution - and rely on edge betweenness given its heavy-tailed
distribution to iteratively partition a network into many heterogeneous and
homogeneous communities. Surprisingly, the derived communities for any
self-organized and/or self-evolved large networks demonstrate very striking
power laws, implying that there are far more small communities than large ones.
This notion of far more small things than large ones constitutes a new
fundamental way of thinking for community detection. Keywords: head/tail
breaks, ht-index, scaling, k-means, natural breaks, and classificationComment: 9 pages, 3 figures, 3 tables; Physica A, 2015, xx(x), xx-x
A Smooth Curve as a Fractal Under the Third Definition
It is commonly believed in the literature that smooth curves, such as
circles, are not fractal, and only non-smooth curves, such as coastlines, are
fractal. However, this paper demonstrates that a smooth curve can be fractal,
under the new, relaxed, third definition of fractal - a set or pattern is
fractal if the scaling of far more small things than large ones recurs at least
twice. The scaling can be rephrased as a hierarchy, consisting of numerous
smallest, a very few largest, and some in between the smallest and the largest.
The logarithmic spiral, as a smooth curve, is apparently fractal because it
bears the self-similar property, or the scaling of far more small squares than
large ones recurs multiple times, or the scaling of far more small bends than
large ones recurs multiple times. A half-circle or half-ellipse and the UK
coastline (before or after smooth processing) are fractal, if the scaling of
far more small bends than large ones recurs at least twice.
Keywords: Third definition of fractal, head/tail breaks, bends, ht-index,
scaling hierarchyComment: 8 pages, 5 figures, and 1 tabl
How Complex Is a Fractal? Head/tail Breaks and Fractional Hierarchy
A fractal bears a complex structure that is reflected in a scaling hierarchy,
indicating that there are far more small things than large ones. This scaling
hierarchy can be effectively derived using head/tail breaks - a clustering and
visualization tool for data with a heavy-tailed distribution - and quantified
by an ht-index, indicating the number of clusters or hierarchical levels, a
head/tail breaks-induced integer. However, this integral ht-index has been
found to be less precise for many fractals at their different phrases of
development. This paper refines the ht-index as a fraction to measure the
scaling hierarchy of a fractal more precisely within a coherent whole, and
further assigns a fractional ht-index - the fht-index - to an individual data
value of a data series that represents the fractal. We developed two case
studies to demonstrate the advantages of the fht-index, in comparison with the
ht-index. We found that the fractional ht-index or fractional hierarchy in
general can help characterize a fractal set or pattern in a much more precise
manner. The index may help create intermediate map scales between two
consecutive map scales.
Keywords: Ht-index, fractal, scaling, complexity, fht-indexComment: 7 pages, 2 figure
A sieve M-theorem for bundled parameters in semiparametric models, with application to the efficient estimation in a linear model for censored data
In many semiparametric models that are parameterized by two types of
parameters---a Euclidean parameter of interest and an infinite-dimensional
nuisance parameter---the two parameters are bundled together, that is, the
nuisance parameter is an unknown function that contains the parameter of
interest as part of its argument. For example, in a linear regression model for
censored survival data, the unspecified error distribution function involves
the regression coefficients. Motivated by developing an efficient estimating
method for the regression parameters, we propose a general sieve M-theorem for
bundled parameters and apply the theorem to deriving the asymptotic theory for
the sieve maximum likelihood estimation in the linear regression model for
censored survival data. The numerical implementation of the proposed estimating
method can be achieved through the conventional gradient-based search
algorithms such as the Newton--Raphson algorithm. We show that the proposed
estimator is consistent and asymptotically normal and achieves the
semiparametric efficiency bound. Simulation studies demonstrate that the
proposed method performs well in practical settings and yields more efficient
estimates than existing estimating equation based methods. Illustration with a
real data example is also provided.Comment: Published in at http://dx.doi.org/10.1214/11-AOS934 the Annals of
Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical
Statistics (http://www.imstat.org
Multivariate Bernoulli distribution
In this paper, we consider the multivariate Bernoulli distribution as a model
to estimate the structure of graphs with binary nodes. This distribution is
discussed in the framework of the exponential family, and its statistical
properties regarding independence of the nodes are demonstrated. Importantly
the model can estimate not only the main effects and pairwise interactions
among the nodes but also is capable of modeling higher order interactions,
allowing for the existence of complex clique effects. We compare the
multivariate Bernoulli model with existing graphical inference models - the
Ising model and the multivariate Gaussian model, where only the pairwise
interactions are considered. On the other hand, the multivariate Bernoulli
distribution has an interesting property in that independence and
uncorrelatedness of the component random variables are equivalent. Both the
marginal and conditional distributions of a subset of variables in the
multivariate Bernoulli distribution still follow the multivariate Bernoulli
distribution. Furthermore, the multivariate Bernoulli logistic model is
developed under generalized linear model theory by utilizing the canonical link
function in order to include covariate information on the nodes, edges and
cliques. We also consider variable selection techniques such as LASSO in the
logistic model to impose sparsity structure on the graph. Finally, we discuss
extending the smoothing spline ANOVA approach to the multivariate Bernoulli
logistic model to enable estimation of non-linear effects of the predictor
variables.Comment: Published in at http://dx.doi.org/10.3150/12-BEJSP10 the Bernoulli
(http://isi.cbs.nl/bernoulli/) by the International Statistical
Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm
Naturalness and a light
Models with a light, additional gauge boson are attractive extensions of the
standard model. Often these models are only considered as effective low energy
theory without any assumption about an UV completion. This leaves not only the
hierarchy problem of the SM unsolved, but introduces a copy of it because of
the new fundamental scalars responsible for breaking the new gauge group. A
possible solution is to embed these models into a supersymmetric framework.
However, this gives rise to an additional source of fine-tuning compared to the
MSSM and poses the question how natural such a setup is. One might expect that
the additional fine-tuning is huge, namely, . In
this paper we point out that this is not necessarily the case. We show that it
is possible to find a focus point behaviour also in the new sector in
co-existence to the MSSM focus point. We call this 'Double Focus Point
Supersymmetry'. Moreover, we stress the need for a proper inclusion of
radiative corrections in the fine-tuning calculation: a tree-level estimate
would lead to predictions for the tuning which can be wrong by many orders of
magnitude. As showcase, we use the extended MSSM and discuss
possible consequence of the observed anomaly. However, similar
features are expected for other models with an extended gauge group which
involve potentially large Yukawa-like interactions of the new scalars.Comment: 11 pages, 4 figures, two column format, reference update
Towards the Natural Gauge Mediation
The sweet spot supersymmetry (SUSY) solves the mu problem in the Minimal
Supersymmetric Standard Model (MSSM) with gauge mediated SUSY breaking (GMSB)
via the generalized Giudice-Masiero (GM) mechanism where only the mu-term and
soft Higgs masses are generated at the unification scale of the Grand Unified
Theory (GUT) due to the approximate PQ symmetry. Because all the other SUSY
breaking soft terms are generated via the GMSB below the GUT scale, there
exists SUSY electroweak (EW) fine-tuning problem to explain the 125 GeV Higgs
boson mass due to small trilinear soft term. Thus, to explain the Higgs boson
mass, we propose the GMSB with both the generalized GM mechanism and
Higgs-messenger interactions. The renormalization group equations are runnings
from the GUT scale down to EW scale. So the EW symmetry breaking can be
realized easier. We can keep the gauge coupling unification and solution to the
flavor problem in the GMSB, as well as solve the \mu/B_{\mu}-problem. Moreover,
there are only five free parameters in our model. So we can determine the
characteristic low energy spectra and explore its distinct phenomenology. The
low-scale fine-tuning measure can be as low as 20 with the light stop mass
below 1 TeV and gluino mass below 2 TeV. The gravitino dark matter can come
from a thermal production with the correct relic density and be consistent with
the thermal leptogenesis. Because gluino and stop can be relatively light in
our model, how to search for such GMSB at the upcoming run II of the LHC
experiment could be very interesting.Comment: 22 pages, 7 figures, Late
- …
