382 research outputs found

    Spectral dimensionality reduction for HMMs

    Get PDF
    Hidden Markov Models (HMMs) can be accurately approximated using co-occurrence frequencies of pairs and triples of observations by using a fast spectral method in contrast to the usual slow methods like EM or Gibbs sampling. We provide a new spectral method which significantly reduces the number of model parameters that need to be estimated, and generates a sample complexity that does not depend on the size of the observation vocabulary. We present an elementary proof giving bounds on the relative accuracy of probability estimates from our model. (Correlaries show our bounds can be weakened to provide either L1 bounds or KL bounds which provide easier direct comparisons to previous work.) Our theorem uses conditions that are checkable from the data, instead of putting conditions on the unobservable Markov transition matrix

    Analisis Timbulan dan Karakteristik Sampah Mudah Terbakar di TPA Banjarsari Kabupaten Bojonegoro sebagai Bahan Baku Refuse Derived Fuel

    Get PDF
    The amount of waste entering the Banjarsari Bojonegoro landfill reaches 65 tons per day. Bojonegoro people have a waste generation coefficient of 0.4 kg/person/day. When multiplied by the entire population of Bojonegoro Regency, which is 1.3 million, the waste becomes 520 tons per day. This study aims to utilize the waste pile at TPA Banjarsari as RDF fuel. According to the research findings, the total waste generation taken into TPA Banjarsari was 53,420 kg/day, with plastics (22.09%), fabrics (4.16%), wood (5.07%), paper (5.15%), rubber/leather (1.15%), organics (56.21%), glass (0.74%), metal (1.18%), and others (4.24%). Combustible waste generation at TPA Banjarsari was 20,299.6 kg/day, or 37.6%. Based on an analysis of the waste characteristics, the types of wood that met the RDF standards of the Ministry of Industry in 2017 had a moisture content value of 18.06%, an ash content of 1.29%, a volatile matter content of 66.99%, and a calorific value of 3,788.51%. The waste reduction in scenario 1 (scavengers) yielded 400.07 kg/day, or 0.74%, and residue of 53,019.96 kg/day, or 99.26%. Meanwhile, the waste reduction in scenario 2 (RDF) gained 4,300.31 kg/day, or 8.05%, and residue of 49,199.69 kg/day, or 91.94%. Last, the optimal waste reduction obtained 4,813.19 kg/day, or 9.02%, and residue of 48,606.81 kg/day, or 90.98%.

    When black box algorithms are (not) appropriate: a principled prediction-problem ontology

    Full text link
    In the 1980s a new, extraordinarily productive way of reasoning about algorithms emerged. Though this type of reasoning has come to dominate areas of data science, it has been under-discussed and its impact under-appreciated. For example, it is the primary way we reason about "black box" algorithms. In this paper we analyze its current use (i.e., as "the common task framework") and its limitations; we find a large class of prediction-problems are inappropriate for this type of reasoning. Further, we find the common task framework does not provide a foundation for the deployment of an algorithm in a real world situation. Building off of its core features, we identify a class of problems where this new form of reasoning can be used in deployment. We purposefully develop a novel framework so both technical and non-technical people can discuss and identify key features of their prediction problem and whether or not it is suitable for this new kind of reasoning

    Bridging the Usability Gap: Theoretical and Methodological Advances for Spectral Learning of Hidden Markov Models

    Full text link
    The Baum-Welch (B-W) algorithm is the most widely accepted method for inferring hidden Markov models (HMM). However, it is prone to getting stuck in local optima, and can be too slow for many real-time applications. Spectral learning of HMMs (SHMMs), based on the method of moments (MOM) has been proposed in the literature to overcome these obstacles. Despite its promises, asymptotic theory for SHMM has been elusive, and the long-run performance of SHMM can degrade due to unchecked propogation of error. In this paper, we (1) provide an asymptotic distribution for the approximate error of the likelihood estimated by SHMM, and (2) propose a novel algorithm called projected SHMM (PSHMM) that mitigates the problem of error propogation, and (3) develop online learning variantions of both SHMM and PSHMM that accommodate potential nonstationarity. We compare the performance of SHMM with PSHMM and estimation through the B-W algorithm on both simulated data and data from real world applications, and find that PSHMM not only retains the computational advantages of SHMM, but also provides more robust estimation and forecasting

    Nonlinear Permuted Granger Causality

    Full text link
    Granger causal inference is a contentious but widespread method used in fields ranging from economics to neuroscience. The original definition addresses the notion of causality in time series by establishing functional dependence conditional on a specified model. Adaptation of Granger causality to nonlinear data remains challenging, and many methods apply in-sample tests that do not incorporate out-of-sample predictability, leading to concerns of model overfitting. To allow for out-of-sample comparison, a measure of functional connectivity is explicitly defined using permutations of the covariate set. Artificial neural networks serve as featurizers of the data to approximate any arbitrary, nonlinear relationship, and consistent estimation of the variance for each permutation is shown under certain conditions on the featurization process and the model residuals. Performance of the permutation method is compared to penalized variable selection, naive replacement, and omission techniques via simulation, and it is applied to neuronal responses of acoustic stimuli in the auditory cortex of anesthetized rats. Targeted use of the Granger causal framework, when prior knowledge of the causal mechanisms in a dataset are limited, can help to reveal potential predictive relationships between sets of variables that warrant further study

    Evaluasi Unit Koagulasi, Flokulasi, Sedimentasi, dan Filtrasi pada Instalasi Pengolahan Air (IPA) Semanggi Perumda Air Minum Toya Wening Kota Surakarta

    Get PDF
    Perumda Air Minum Toya Wening, yang merupakan Badan Usaha Milik Daerah (BUMD) di Kota Surakarta, bertanggung jawab sebagai penyedia air bersih untuk memenuhi kebutuhan masyarakat akan air bersih. Kualitas air minum yang dihasilkan harus memenuhi persyaratan parameter fisik, kimia, dan mikrobiologis sesuai dengan Peraturan Menteri Kesehatan Republik Indonesia Nomor 2 Tahun 2023 tentang Peraturan Pelaksanaan Peraturan Pemerintah Nomor 66 Tahun 2014 Tentang Kesehatan Lingkungan. Untuk meningkatkan efektivitas pengolahan air dan kualitas air yang dihasilkan oleh IPA Semanggi, perubahan pada unit IPA perlu dilakukan melalui evaluasi kinerja unit bangunan IPA. Proses evaluasi ini melibatkan observasi lapangan dan wawancara terkait Eksisting Instalasi Pengolahan Air (IPA), karakteristik unit-unit IPA, debit pengolahan, kualitas air, serta referensi literatur. Hasil penelitian menunjukkan bahwa Nilai waktu detensi pada unit koagulasi sebesar 461,53 dtk, flokulasi seesar 8,47 menit dan sedimentasi sebesar 2,1 detik dan tidak memenuhi kriteria desain. Pada unit flokulasi, gradient kecepatan bernilai 0,004 detik-1 sehingga belum memenuhi kriteria desain. Pada unit sedimentsi, nilai surface loading rate sebesar 1,19 m3/m2.jam dan belum memenuhi kriteria desain. Pada unit filtrasi, nilai  kecepatan penyaringan sebesar 1,43 m/jam sehingga tidak sesuai dengan kriteria desai

    Switching to smokeless tobacco as a smoking cessation method: evidence from the 2000 National Health Interview Survey

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Although smokeless tobacco (ST) use has played a major role in the low smoking prevalence among Swedish men, there is little information at the population level about ST as a smoking cessation aid in the U.S.</p> <p>Methods</p> <p>We used the 2000 National Health Interview Survey to derive population estimates for the number of smokers who had tried twelve methods in their most recent quit attempt, and for the numbers and proportions who were former or current smokers at the time of the survey.</p> <p>Results</p> <p>An estimated 359,000 men switched to smokeless tobacco in their most recent quit attempt. This method had the highest proportion of successes among those attempting it (73%), representing 261,000 successful quitters (switchers). In comparison, the nicotine patch was used by an estimated 2.9 million men in their most recent quit attempt, and almost one million (35%) were former smokers at the time of the survey. Of the 964,000 men using nicotine gum, about 323,000 (34%) became former smokers. Of the 98,000 men who used the nicotine inhaler, 27,000 quit successfully (28%). None of the estimated 14,000 men who tried the nicotine nasal spray became former smokers.</p> <p>Forty-two percent of switchers also reported quitting smoking all at once, which was higher than among former smokers who used medications (8–19%). Although 40% of switchers quit smoking less than 5 years before the survey, 21% quit over 20 years earlier. Forty-six percent of switchers were current ST users at the time of the survey.</p> <p>Conclusion</p> <p>Switching to ST compares very favorably with pharmaceutical nicotine as a quit-smoking aid among American men, despite the fact that few smokers know that the switch provides almost all of the health benefits of complete tobacco abstinence. The results of this study show that tobacco harm reduction is a viable cessation option for American smokers.</p

    Tobacco harm reduction: an alternative cessation strategy for inveterate smokers

    Get PDF
    According to the Centers for Disease Control and Prevention, about 45 million Americans continue to smoke, even after one of the most intense public health campaigns in history, now over 40 years old. Each year some 438,000 smokers die from smoking-related diseases, including lung and other cancers, cardiovascular disorders and pulmonary diseases. Many smokers are unable – or at least unwilling – to achieve cessation through complete nicotine and tobacco abstinence; they continue smoking despite the very real and obvious adverse health consequences. Conventional smoking cessation policies and programs generally present smokers with two unpleasant alternatives: quit, or die. A third approach to smoking cessation, tobacco harm reduction, involves the use of alternative sources of nicotine, including modern smokeless tobacco products. A substantial body of research, much of it produced over the past decade, establishes the scientific and medical foundation for tobacco harm reduction using smokeless tobacco products. This report provides a description of traditional and modern smokeless tobacco products, and of the prevalence of their use in the United States and Sweden. It reviews the epidemiologic evidence for low health risks associated with smokeless use, both in absolute terms and in comparison to the much higher risks of smoking. The report also describes evidence that smokeless tobacco has served as an effective substitute for cigarettes among Swedish men, who consequently have among the lowest smoking-related mortality rates in the developed world. The report documents the fact that extensive misinformation about ST products is widely available from ostensibly reputable sources, including governmental health agencies and major health organizations. The American Council on Science and Health believes that strong support of tobacco harm reduction is fully consistent with its mission to promote sound science in regulation and in public policy, and to assist consumers in distinguishing real health threats from spurious health claims. As this report documents, there is a strong scientific and medical foundation for tobacco harm reduction, and it shows great potential as a public health strategy to help millions of smokers

    Change Point Detection with Conceptors

    Full text link
    Offline change point detection retrospectively locates change points in a time series. Many nonparametric methods that target i.i.d. mean and variance changes fail in the presence of nonlinear temporal dependence, and model based methods require a known, rigid structure. For the at most one change point problem, we propose use of a conceptor matrix to learn the characteristic dynamics of a baseline training window with arbitrary dependence structure. The associated echo state network acts as a featurizer of the data, and change points are identified from the nature of the interactions between the features and their relationship to the baseline state. This model agnostic method can suggest potential locations of interest that warrant further study. We prove that, under mild assumptions, the method provides a consistent estimate of the true change point, and quantile estimates are produced via a moving block bootstrap of the original data. The method is evaluated with clustering metrics and Type 1 error control on simulated data, and applied to publicly available neural data from rats experiencing bouts of non-REM sleep prior to exploration of a radial maze. With sufficient spacing, the framework provides a simple extension to the sparse, multiple change point problem
    corecore