1,158 research outputs found

    Which Surrogate Works for Empirical Performance Modelling? A Case Study with Differential Evolution

    Full text link
    It is not uncommon that meta-heuristic algorithms contain some intrinsic parameters, the optimal configuration of which is crucial for achieving their peak performance. However, evaluating the effectiveness of a configuration is expensive, as it involves many costly runs of the target algorithm. Perhaps surprisingly, it is possible to build a cheap-to-evaluate surrogate that models the algorithm's empirical performance as a function of its parameters. Such surrogates constitute an important building block for understanding algorithm performance, algorithm portfolio/selection, and the automatic algorithm configuration. In principle, many off-the-shelf machine learning techniques can be used to build surrogates. In this paper, we take the differential evolution (DE) as the baseline algorithm for proof-of-concept study. Regression models are trained to model the DE's empirical performance given a parameter configuration. In particular, we evaluate and compare four popular regression algorithms both in terms of how well they predict the empirical performance with respect to a particular parameter configuration, and also how well they approximate the parameter versus the empirical performance landscapes

    Towards Faster Training of Global Covariance Pooling Networks by Iterative Matrix Square Root Normalization

    Full text link
    Global covariance pooling in convolutional neural networks has achieved impressive improvement over the classical first-order pooling. Recent works have shown matrix square root normalization plays a central role in achieving state-of-the-art performance. However, existing methods depend heavily on eigendecomposition (EIG) or singular value decomposition (SVD), suffering from inefficient training due to limited support of EIG and SVD on GPU. Towards addressing this problem, we propose an iterative matrix square root normalization method for fast end-to-end training of global covariance pooling networks. At the core of our method is a meta-layer designed with loop-embedded directed graph structure. The meta-layer consists of three consecutive nonlinear structured layers, which perform pre-normalization, coupled matrix iteration and post-compensation, respectively. Our method is much faster than EIG or SVD based ones, since it involves only matrix multiplications, suitable for parallel implementation on GPU. Moreover, the proposed network with ResNet architecture can converge in much less epochs, further accelerating network training. On large-scale ImageNet, we achieve competitive performance superior to existing counterparts. By finetuning our models pre-trained on ImageNet, we establish state-of-the-art results on three challenging fine-grained benchmarks. The source code and network models will be available at http://www.peihuali.org/iSQRT-COVComment: Accepted to CVPR 201

    TypeSQL: Knowledge-based Type-Aware Neural Text-to-SQL Generation

    Full text link
    Interacting with relational databases through natural language helps users of any background easily query and analyze a vast amount of data. This requires a system that understands users' questions and converts them to SQL queries automatically. In this paper we present a novel approach, TypeSQL, which views this problem as a slot filling task. Additionally, TypeSQL utilizes type information to better understand rare entities and numbers in natural language questions. We test this idea on the WikiSQL dataset and outperform the prior state-of-the-art by 5.5% in much less time. We also show that accessing the content of databases can significantly improve the performance when users' queries are not well-formed. TypeSQL gets 82.6% accuracy, a 17.5% absolute improvement compared to the previous content-sensitive model.Comment: NAACL 201

    Nonlinear Friction-Induced Vibration of a Slider-Belt System

    Get PDF
    A mass–spring–damper slider excited into vibration in a plane by a moving rigid belt through friction is a major paradigm of friction-induced vibration. This paradigm has two aspects that can be improved: (1) the contact stiffness at the slider–belt interface is often assumed to be linear and (2) this contact is usually assumed to be maintained during vibration (even when the vibration becomes unbounded at certain conditions). In this paper, a cubic contact spring is included; loss of contact (separation) at the slider–belt interface is allowed and importantly reattachment of the slider to the belt after separation is also considered. These two features make a more realistic model of friction-induced vibration and are shown to lead to very rich dynamic behavior even though a simple Coulomb friction law is used. Both complex eigenvalue analyses of the linearized system and transient analysis of the full nonlinear system are conducted. Eigenvalue analysis indicates that the nonlinear system can become unstable at increasing levels of the preload and the nonlinear stiffness, even if the corresponding linear part of the system is stable. However, they at a high enough level become stabilizing factors. Transient analysis shows that separation and reattachment could happen. Vibration can grow with the preload and vertical nonlinear stiffness when separation is considered, while this trend is different when separation is ignored. Finally, it is found that the vibration magnitudes of the model with separation are greater than the corresponding model without considering separation in certain conditions. Thus, ignoring the separation is unsafe.</jats:p

    Dampak Pembangunan Infrastruktur Perdesaan Pada Program PNPM Mandiri Perdesaan Di Kabupaten Toli Toli

    Full text link
    The purpose of this study was to determine the Development Impact of Rural Infrastructure in PNPM RuralProgram in Toli-Toli. Research conducted on the implementation of PNPM Rural Program in Toli-Toli forfiscal year 2007 and 2008.Primary data obtained from interviews with relevant parties and direct observation in the field, then the datais processed with Descriptive Analysis.The results showed the impact of rural infrastructure development in poor communities in Toli Toli, namely:increasing revenue, impoving public education, improving health and improving the public midset. Impact onvillage institutions, namely: the function and role of local government to be effective, institutions ofparticipatory development and improvement of the quality of facilities.and social infrastructure andeconomic base of societ

    Traffic Prediction considering Multiple Levels of Spatial-temporal Information: A Multi-scale Graph Wavelet-based Approach

    Full text link
    Although traffic prediction has been receiving considerable attention with a number of successes in the context of intelligent transportation systems, the prediction of traffic states over a complex transportation network that contains different road types has remained a challenge. This study proposes a multi-scale graph wavelet temporal convolution network (MSGWTCN) to predict the traffic states in complex transportation networks. Specifically, a multi-scale spatial block is designed to simultaneously capture the spatial information at different levels, and the gated temporal convolution network is employed to extract the temporal dependencies of the data. The model jointly learns to mount multiple levels of the spatial interactions by stacking graph wavelets with different scales. Two real-world datasets are used in this study to investigate the model performance, including a highway network in Seattle and a dense road network of Manhattan in New York City. Experiment results show that the proposed model outperforms other baseline models. Furthermore, different scales of graph wavelets are found to be effective in extracting local, intermediate and global information at the same time and thus enable the model to learn a complex transportation network topology with various types of road segments. By carefully customizing the scales of wavelets, the model is able to improve the prediction performance and better adapt to different network configurations
    corecore