Search CORE

138 research outputs found

Towards a new evolutionary subsampling technique for heuristic optimisation of load disaggregators

Author: CD Manning
D Vine
GW Hart
J Derrac
J Hernández-Orallo
L Breiman
L Torgo
M Zeifman
NV Chawla
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

In this paper we present some preliminary work towards the development of a new evolutionary subsampling technique for solving the non-intrusive load monitoring (NILM) problem. The NILM problem concerns using predictive algorithms to analyse whole-house energy usage measurements, so that individual appliance energy usages can be disaggregated. The motivation is to educate home owners about their energy usage. However, by their very nature, the datasets used in this research are massively imbalanced in their target value distributions. Consequently standard machine learning techniques, which often rely on optimising for root mean squared error (RMSE), typically fail. We therefore propose the target-weighted RMSE (TW-RMSE) metric as an alternative fitness function for optimising load disaggregators, and show in a simple initial study in which random search is utilised that TW-RMSE is a metric that can be optimised, and therefore has the potential to be included in a larger evolutionary subsampling-based solution to this problem

Crossref

Research Commons@Waikato

Inducing Polynomial Equations for Regression

Author: B. Falkenhainer
E. Frank
J. Friedman
L. Todorovski
L. Torgo
P. Chaudhuri
P. Langley
S. Džeroski
S. Geman
T. Hastie
Y. Chen
Y. Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2004
Field of study

Crossref

Subgroup Analysis via Recursive Partitioning

Author: A Ciampi
A D R Mcquarrie
A Negassa
Bogong Li
Chih-Ling Tsai
David M. Nickerson
G Schwarz
H Akaike
Hansheng Wang
I Kononenko
J Morgan
J Ye
L Breiman
L Breiman
L Torgo
M Gail
M Leblanc
P Sleight
R Tibshirani
S F Assmann
S W Lagakos
S.-C Chow
X G Su
Xiaogang Su
Publication venue: 'Elsevier BV'
Publication date: 01/01/2009
Field of study

Subgroup analysis is an integral part of comparative analysis where assessing the treatment effect on a response is of central interest. Its goal is to determine the heterogeneity of the treatment effect across subpopulations. In this paper, we adapt the idea of recursive partitioning and introduce an interaction tree (IT) procedure to conduct subgroup analysis. The IT procedure automatically facilitates a number of objectively defined subgroups, in some of which the treatment effect is found prominent while in others the treatment has a negligible or even negative effect. The standard CART (Breiman et al., 1984) methodology is inherited to construct the tree structure. Also, in order to extract factors that contribute to the heterogeneity of the treatment effect, variable importance measure is made available via random forests of the interaction trees. Both simulated experiments and analysis of census wage data are presented for illustration.http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000270824200001&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=8e1609b174ce4e31116a60747a720701Automation & Control SystemsComputer Science, Artificial IntelligenceSCI(E)EI38ARTICLE141-1581

Crossref

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)

Error reduction through learning multiple descriptions

Author: A. Danyluk
D. Howell
E.B. Kong
G. Towell
H. Drucker
I. Kononenko
K. Ali
K. Ali
K. Ali
K. Ali
K. Spackman
Kamal M. Ali
L. Torgo
L.K. Hansen
M Kovacic
M. Gams
M. Pazzani
M. Pazzani
M.H. Groot De
Michael J. Pazzani
N. Lavrac
P. Smyth
P. Smyth
R. Duda
R. Holte
R. Quinlan
R. Quinlan
R. Quinlan
R. Schapire
S. Dzeroski
S. Kwok
S. Muggleton
S. Muggleton
W.G. Baxt
W.H. Kruskal
Y. Freund
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Resampling Approaches to Improve News Importance Prediction

Author: G. Szabo
I. Feinerer
L. Torgo
L. Torgo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Crossref

Regression using classification algorithms

Author: J GAMA
L TORGO
Publication venue: 'Elsevier BV'
Publication date: 01/01/1997
Field of study

Crossref

Detecting Errors in Foreign Trade Transactions: Dealing with Insufficient Data

Author: E.M. Knorr
F. Murtagh
J.S. Milton
L. Torgo
L. Torgo
M.M. Breunig
V. Hodge
W.D. Fisher
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Crossref

Shapley-Value Data Valuation for Semi-supervised Learning

Author: Courtnage Christie
Smirnov Evgueni
Soares C.
Torgo L.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021
Field of study

Semi-supervised learning aims at training accurate prediction models on labeled and unlabeled data. Its realization strongly depends on selecting pseudo-labeled data. The standard approach is to select instances based on the pseudo-label confidence values that they receive from the prediction models. In this paper we argue that this is an indirect approach w.r.t. the main goal of semi-supervised learning. Instead, we propose a direct approach that selects the pseudo-labeled instances based on their individual contributions for the performance of the prediction models. The individual instance contributions are computed as Shapley values w.r.t. characteristic functions related to the model performance. Experiments show that our approach outperforms the standard one when used in semi-supervised wrappers