382 research outputs found
Spectral dimensionality reduction for HMMs
Hidden Markov Models (HMMs) can be accurately approximated using
co-occurrence frequencies of pairs and triples of observations by using a fast
spectral method in contrast to the usual slow methods like EM or Gibbs
sampling. We provide a new spectral method which significantly reduces the
number of model parameters that need to be estimated, and generates a sample
complexity that does not depend on the size of the observation vocabulary. We
present an elementary proof giving bounds on the relative accuracy of
probability estimates from our model. (Correlaries show our bounds can be
weakened to provide either L1 bounds or KL bounds which provide easier direct
comparisons to previous work.) Our theorem uses conditions that are checkable
from the data, instead of putting conditions on the unobservable Markov
transition matrix
Analisis Timbulan dan Karakteristik Sampah Mudah Terbakar di TPA Banjarsari Kabupaten Bojonegoro sebagai Bahan Baku Refuse Derived Fuel
The amount of waste entering the Banjarsari Bojonegoro landfill reaches 65 tons per day. Bojonegoro people have a waste generation coefficient of 0.4 kg/person/day. When multiplied by the entire population of Bojonegoro Regency, which is 1.3 million, the waste becomes 520 tons per day. This study aims to utilize the waste pile at TPA Banjarsari as RDF fuel. According to the research findings, the total waste generation taken into TPA Banjarsari was 53,420 kg/day, with plastics (22.09%), fabrics (4.16%), wood (5.07%), paper (5.15%), rubber/leather (1.15%), organics (56.21%), glass (0.74%), metal (1.18%), and others (4.24%). Combustible waste generation at TPA Banjarsari was 20,299.6 kg/day, or 37.6%. Based on an analysis of the waste characteristics, the types of wood that met the RDF standards of the Ministry of Industry in 2017 had a moisture content value of 18.06%, an ash content of 1.29%, a volatile matter content of 66.99%, and a calorific value of 3,788.51%. The waste reduction in scenario 1 (scavengers) yielded 400.07 kg/day, or 0.74%, and residue of 53,019.96 kg/day, or 99.26%. Meanwhile, the waste reduction in scenario 2 (RDF) gained 4,300.31 kg/day, or 8.05%, and residue of 49,199.69 kg/day, or 91.94%. Last, the optimal waste reduction obtained 4,813.19 kg/day, or 9.02%, and residue of 48,606.81 kg/day, or 90.98%.
When black box algorithms are (not) appropriate: a principled prediction-problem ontology
In the 1980s a new, extraordinarily productive way of reasoning about
algorithms emerged. Though this type of reasoning has come to dominate areas of
data science, it has been under-discussed and its impact under-appreciated. For
example, it is the primary way we reason about "black box" algorithms. In this
paper we analyze its current use (i.e., as "the common task framework") and its
limitations; we find a large class of prediction-problems are inappropriate for
this type of reasoning. Further, we find the common task framework does not
provide a foundation for the deployment of an algorithm in a real world
situation. Building off of its core features, we identify a class of problems
where this new form of reasoning can be used in deployment. We purposefully
develop a novel framework so both technical and non-technical people can
discuss and identify key features of their prediction problem and whether or
not it is suitable for this new kind of reasoning
Bridging the Usability Gap: Theoretical and Methodological Advances for Spectral Learning of Hidden Markov Models
The Baum-Welch (B-W) algorithm is the most widely accepted method for
inferring hidden Markov models (HMM). However, it is prone to getting stuck in
local optima, and can be too slow for many real-time applications. Spectral
learning of HMMs (SHMMs), based on the method of moments (MOM) has been
proposed in the literature to overcome these obstacles. Despite its promises,
asymptotic theory for SHMM has been elusive, and the long-run performance of
SHMM can degrade due to unchecked propogation of error. In this paper, we (1)
provide an asymptotic distribution for the approximate error of the likelihood
estimated by SHMM, and (2) propose a novel algorithm called projected SHMM
(PSHMM) that mitigates the problem of error propogation, and (3) develop online
learning variantions of both SHMM and PSHMM that accommodate potential
nonstationarity. We compare the performance of SHMM with PSHMM and estimation
through the B-W algorithm on both simulated data and data from real world
applications, and find that PSHMM not only retains the computational advantages
of SHMM, but also provides more robust estimation and forecasting
Nonlinear Permuted Granger Causality
Granger causal inference is a contentious but widespread method used in
fields ranging from economics to neuroscience. The original definition
addresses the notion of causality in time series by establishing functional
dependence conditional on a specified model. Adaptation of Granger causality to
nonlinear data remains challenging, and many methods apply in-sample tests that
do not incorporate out-of-sample predictability, leading to concerns of model
overfitting. To allow for out-of-sample comparison, a measure of functional
connectivity is explicitly defined using permutations of the covariate set.
Artificial neural networks serve as featurizers of the data to approximate any
arbitrary, nonlinear relationship, and consistent estimation of the variance
for each permutation is shown under certain conditions on the featurization
process and the model residuals. Performance of the permutation method is
compared to penalized variable selection, naive replacement, and omission
techniques via simulation, and it is applied to neuronal responses of acoustic
stimuli in the auditory cortex of anesthetized rats. Targeted use of the
Granger causal framework, when prior knowledge of the causal mechanisms in a
dataset are limited, can help to reveal potential predictive relationships
between sets of variables that warrant further study
Evaluasi Unit Koagulasi, Flokulasi, Sedimentasi, dan Filtrasi pada Instalasi Pengolahan Air (IPA) Semanggi Perumda Air Minum Toya Wening Kota Surakarta
Perumda Air Minum Toya Wening, yang merupakan Badan Usaha Milik Daerah (BUMD) di Kota Surakarta, bertanggung jawab sebagai penyedia air bersih untuk memenuhi kebutuhan masyarakat akan air bersih. Kualitas air minum yang dihasilkan harus memenuhi persyaratan parameter fisik, kimia, dan mikrobiologis sesuai dengan Peraturan Menteri Kesehatan Republik Indonesia Nomor 2 Tahun 2023 tentang Peraturan Pelaksanaan Peraturan Pemerintah Nomor 66 Tahun 2014 Tentang Kesehatan Lingkungan. Untuk meningkatkan efektivitas pengolahan air dan kualitas air yang dihasilkan oleh IPA Semanggi, perubahan pada unit IPA perlu dilakukan melalui evaluasi kinerja unit bangunan IPA. Proses evaluasi ini melibatkan observasi lapangan dan wawancara terkait Eksisting Instalasi Pengolahan Air (IPA), karakteristik unit-unit IPA, debit pengolahan, kualitas air, serta referensi literatur. Hasil penelitian menunjukkan bahwa Nilai waktu detensi pada unit koagulasi sebesar 461,53 dtk, flokulasi seesar 8,47 menit dan sedimentasi sebesar 2,1 detik dan tidak memenuhi kriteria desain. Pada unit flokulasi, gradient kecepatan bernilai 0,004 detik-1 sehingga belum memenuhi kriteria desain. Pada unit sedimentsi, nilai surface loading rate sebesar 1,19 m3/m2.jam dan belum memenuhi kriteria desain. Pada unit filtrasi, nilai kecepatan penyaringan sebesar 1,43 m/jam sehingga tidak sesuai dengan kriteria desai
Switching to smokeless tobacco as a smoking cessation method: evidence from the 2000 National Health Interview Survey
<p>Abstract</p> <p>Background</p> <p>Although smokeless tobacco (ST) use has played a major role in the low smoking prevalence among Swedish men, there is little information at the population level about ST as a smoking cessation aid in the U.S.</p> <p>Methods</p> <p>We used the 2000 National Health Interview Survey to derive population estimates for the number of smokers who had tried twelve methods in their most recent quit attempt, and for the numbers and proportions who were former or current smokers at the time of the survey.</p> <p>Results</p> <p>An estimated 359,000 men switched to smokeless tobacco in their most recent quit attempt. This method had the highest proportion of successes among those attempting it (73%), representing 261,000 successful quitters (switchers). In comparison, the nicotine patch was used by an estimated 2.9 million men in their most recent quit attempt, and almost one million (35%) were former smokers at the time of the survey. Of the 964,000 men using nicotine gum, about 323,000 (34%) became former smokers. Of the 98,000 men who used the nicotine inhaler, 27,000 quit successfully (28%). None of the estimated 14,000 men who tried the nicotine nasal spray became former smokers.</p> <p>Forty-two percent of switchers also reported quitting smoking all at once, which was higher than among former smokers who used medications (8–19%). Although 40% of switchers quit smoking less than 5 years before the survey, 21% quit over 20 years earlier. Forty-six percent of switchers were current ST users at the time of the survey.</p> <p>Conclusion</p> <p>Switching to ST compares very favorably with pharmaceutical nicotine as a quit-smoking aid among American men, despite the fact that few smokers know that the switch provides almost all of the health benefits of complete tobacco abstinence. The results of this study show that tobacco harm reduction is a viable cessation option for American smokers.</p
Tobacco harm reduction: an alternative cessation strategy for inveterate smokers
According to the Centers for Disease Control and Prevention, about 45 million Americans continue to smoke, even after one of the most intense public health campaigns in history, now over 40 years old. Each year some 438,000 smokers die from smoking-related diseases, including lung and other cancers, cardiovascular disorders and pulmonary diseases. Many smokers are unable – or at least unwilling – to achieve cessation through complete nicotine and tobacco abstinence; they continue smoking despite the very real and obvious adverse health consequences. Conventional smoking cessation policies and programs generally present smokers with two unpleasant alternatives: quit, or die. A third approach to smoking cessation, tobacco harm reduction, involves the use of alternative sources of nicotine, including modern smokeless tobacco products. A substantial body of research, much of it produced over the past decade, establishes the scientific and medical foundation for tobacco harm reduction using smokeless tobacco products. This report provides a description of traditional and modern smokeless tobacco products, and of the prevalence of their use in the United States and Sweden. It reviews the epidemiologic evidence for low health risks associated with smokeless use, both in absolute terms and in comparison to the much higher risks of smoking. The report also describes evidence that smokeless tobacco has served as an effective substitute for cigarettes among Swedish men, who consequently have among the lowest smoking-related mortality rates in the developed world. The report documents the fact that extensive misinformation about ST products is widely available from ostensibly reputable sources, including governmental health agencies and major health organizations. The American Council on Science and Health believes that strong support of tobacco harm reduction is fully consistent with its mission to promote sound science in regulation and in public policy, and to assist consumers in distinguishing real health threats from spurious health claims. As this report documents, there is a strong scientific and medical foundation for tobacco harm reduction, and it shows great potential as a public health strategy to help millions of smokers
Change Point Detection with Conceptors
Offline change point detection retrospectively locates change points in a
time series. Many nonparametric methods that target i.i.d. mean and variance
changes fail in the presence of nonlinear temporal dependence, and model based
methods require a known, rigid structure. For the at most one change point
problem, we propose use of a conceptor matrix to learn the characteristic
dynamics of a baseline training window with arbitrary dependence structure. The
associated echo state network acts as a featurizer of the data, and change
points are identified from the nature of the interactions between the features
and their relationship to the baseline state. This model agnostic method can
suggest potential locations of interest that warrant further study. We prove
that, under mild assumptions, the method provides a consistent estimate of the
true change point, and quantile estimates are produced via a moving block
bootstrap of the original data. The method is evaluated with clustering metrics
and Type 1 error control on simulated data, and applied to publicly available
neural data from rats experiencing bouts of non-REM sleep prior to exploration
of a radial maze. With sufficient spacing, the framework provides a simple
extension to the sparse, multiple change point problem
- …
