618 research outputs found
Finding Global Optimum for Truth Discovery: Entropy Based Geometric Variance
Truth Discovery is an important problem arising in data analytics related fields such as data mining, database, and big data. It concerns about finding the most trustworthy information from a dataset acquired from a number of unreliable sources. Due to its importance, the problem has been extensively studied in recent years and a number techniques have already been proposed. However, all of them are of heuristic nature and do not have any quality guarantee. In this paper, we formulate the problem as a high dimensional geometric optimization problem, called Entropy based Geometric Variance. Relying on a number of novel geometric techniques (such as Log-Partition and Modified Simplex Lemma), we further discover new insights to this problem. We show, for the first time, that the truth discovery problem can be solved with guaranteed quality of solution. Particularly, we show that it is possible to achieve a (1+eps)-approximation within nearly linear time under some reasonable assumptions. We expect that our algorithm will be useful for other data related applications
Distributed and Robust Support Vector Machine
In this paper, we consider the distributed version of Support Vector Machine (SVM) under the coordinator model, where all input data (i.e., points in R^d space) of SVM are arbitrarily distributed among k nodes in some network with a coordinator which can communicate with all nodes. We investigate two variants of this problem, with and without outliers. For distributed SVM without outliers, we prove a lower bound on the communication complexity and give a distributed (1-epsilon)-approximation algorithm to reach this lower bound, where epsilon is a user specified small constant. For distributed SVM with outliers, we present a (1-epsilon)-approximation algorithm to explicitly remove the influence of outliers. Our algorithm is based on a deterministic distributed top t selection algorithm with communication complexity of O(k log (t)) in the coordinator model. Experimental results on benchmark datasets confirm the theoretical guarantees of our algorithms
Postnatal dysregulation of Notch signal disrupts dendrite development of adult-born neurons in the hippocampus and contributes to memory impairment
Deficits in the Notch pathway are involved in a number of neurologic diseases associated with mental retardation or/and dementia. The mechanisms by which Notch dysregulation are associated with mental retardation and dementia are poorly understood. We found that Notch1 is highly expressed in the adult-born immature neurons in the hippocampus of mice. Retrovirus mediated knockout of notch1 in single adult-born immature neurons decreases mTOR signaling and compromises their dendrite morphogenesis. In contrast, overexpression of Notch1 intracellular domain (NICD), to constitutively activate Notch signaling in single adult-born immature neurons, promotes mTOR signaling and increases their dendrite arborization. Using a unique genetic approach to conditionally and selectively knockout notch 1 in the postnatally born immature neurons in the hippocampus decreases mTOR signaling, compromises their dendrite morphogenesis, and impairs spatial learning and memory. Conditional overexpression of NICD in the postnatally born immature neurons in the hippocampus increases mTOR signaling and promotes dendrite arborization. These data indicate that Notch signaling plays a critical role in dendrite development of immature neurons in the postnatal brain, and dysregulation of Notch signaling in the postnatally born neurons disrupts their development and thus contributes to the cognitive deficits associated with neurological diseases
Understanding Forgetting in Continual Learning with Linear Regression
Continual learning, focused on sequentially learning multiple tasks, has
gained significant attention recently. Despite the tremendous progress made in
the past, the theoretical understanding, especially factors contributing to
catastrophic forgetting, remains relatively unexplored. In this paper, we
provide a general theoretical analysis of forgetting in the linear regression
model via Stochastic Gradient Descent (SGD) applicable to both
underparameterized and overparameterized regimes. Our theoretical framework
reveals some interesting insights into the intricate relationship between task
sequence and algorithmic parameters, an aspect not fully captured in previous
studies due to their restrictive assumptions. Specifically, we demonstrate
that, given a sufficiently large data size, the arrangement of tasks in a
sequence, where tasks with larger eigenvalues in their population data
covariance matrices are trained later, tends to result in increased forgetting.
Additionally, our findings highlight that an appropriate choice of step size
will help mitigate forgetting in both underparameterized and overparameterized
settings. To validate our theoretical analysis, we conducted simulation
experiments on both linear regression models and Deep Neural Networks (DNNs).
Results from these simulations substantiate our theoretical findings.Comment: To be published in The 41st International Conference on Machine
Learnin
Hollow core fiber based interferometer for high temperature (1000 °C) measurement
A simple, cost effective high temperature sensor (up to 1000 °C) based on a hollow core fiber (HCF) structure is reported. It is configured by fusion splicing a short section of HCF with a length of few millimeters between two standard single mode fibers (SMF-28). Due to multiple beam interference introduced by the cladding of the HCF, periodic transmission dips with high spectral extinction ratio and high quality (Q) factor are excited. However, theoretical analysis shows that minor variations of the HCF cladding diameter may result in a significant decrease in the Q factor. Experimental results demonstrate that the position of periodic transmission dips are independent of the HCF length, but spectral Q factors and transmission power varies with different HCF lengths. A maximum Q factor of 3.3×104 has been demonstrated with large free spectral range of 23 nm and extinction ratio of 26 dB. Furthermore, the structure is proved to be an excellent high temperature sensor with advantages of high sensitivity (up to 33.4 pm/°C), wide working temperature range (from room temperature to 1000°C), high resolution, good stability, repeatability, relatively low strain sensitivity (0.46 pm/με), low cost and a simple and flexible fabrication process that offers a great potential for practical applications. A thorough theoretic analysis of the HCF based fiber structure has been proposed. The experimental results are demonstrated to be well matched with our simulation results
- …
