22 research outputs found

    Improving fairness in machine learning systems: What do industry practitioners need?

    The potential for machine learning (ML) systems to amplify social inequities and unfairness is receiving increasing popular and academic attention. A surge of recent work has focused on the development of algorithmic tools to assess and mitigate such unfairness. If these tools are to have a positive impact on industry practice, however, it is crucial that their design be informed by an understanding of real-world needs. Through 35 semi-structured interviews and an anonymous survey of 267 ML practitioners, we conduct the first systematic investigation of commercial product teams' challenges and needs for support in developing fairer ML systems. We identify areas of alignment and disconnect between the challenges faced by industry practitioners and solutions proposed in the fair ML research literature. Based on these findings, we highlight directions for future ML and HCI research that will better address industry practitioners' needs. Comment: To appear in the 2019 ACM CHI Conference on Human Factors in Computing Systems (CHI 2019).

    Interactive Visual Labelling versus Active Learning: An Experimental Comparison

    Methods from supervised machine learning allow the classification of new data automatically and are tremendously helpful for data analysis. The quality of supervised machine learning depends not only on the type of algorithm used, but also on the quality of the labelled dataset used to train the classifier. Labelling instances in a training dataset is often done manually, relying on selections and annotations by expert analysts, and is often a tedious and time-consuming process. Active learning algorithms can automatically determine a subset of data instances for which labels would provide useful input to the learning process. Interactive visual labelling techniques are a promising alternative, providing effective visual overviews from which an analyst can simultaneously explore data records and select items to label. By putting the analyst in the loop, higher accuracy can be achieved in the resulting classifier. While initial results of interactive visual labelling techniques are promising in the sense that user labelling can improve supervised learning, many aspects of these techniques are still largely unexplored. This paper presents a study conducted using the mVis tool to compare three interactive visualisations, similarity map, scatterplot matrix (SPLOM), and parallel coordinates, with each other and with active learning for the purpose of labelling a multivariate dataset. The results show that all three interactive visual labelling techniques surpass active learning algorithms in terms of classifier accuracy, and that users subjectively prefer the similarity map over SPLOM and parallel coordinates for labelling. Users also employ different labelling strategies depending on the visualisation used.
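    The active-learning baseline and the mVis labelling pipeline are not specified in this abstract; purely as a point of reference, the sketch below shows what a standard uncertainty-sampling loop looks like. The classifier choice, function names, and the simulated oracle are assumptions for illustration, not details from the paper.

        # Minimal uncertainty-sampling active learning loop (illustrative sketch only;
        # the choice of classifier and the simulated oracle are assumptions, not the
        # baseline used in the mVis study).
        import numpy as np
        from sklearn.ensemble import RandomForestClassifier

        def uncertainty_sampling(model, X_pool, batch_size=10):
            # Pick the pool instances whose top-class probability is lowest (least confident).
            proba = model.predict_proba(X_pool)
            confidence = proba.max(axis=1)
            return np.argsort(confidence)[:batch_size]

        def active_learning_loop(X_labelled, y_labelled, X_pool, y_pool_oracle, rounds=20):
            # y_pool_oracle stands in for the human analyst providing labels on request.
            model = RandomForestClassifier(n_estimators=100, random_state=0)
            for _ in range(rounds):
                model.fit(X_labelled, y_labelled)
                query = uncertainty_sampling(model, X_pool)
                # Move the queried instances, with their newly obtained labels, to the training set.
                X_labelled = np.vstack([X_labelled, X_pool[query]])
                y_labelled = np.concatenate([y_labelled, y_pool_oracle[query]])
                X_pool = np.delete(X_pool, query, axis=0)
                y_pool_oracle = np.delete(y_pool_oracle, query, axis=0)
            return model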

    Inactive learning?

    No full text

    Flock

    No full text

    From Topic Models to Semi-Supervised Learning: Biasing Mixed-membership Models to Exploit Topic-Indicative Features in Entity Clustering

    No full text
    We present methods to introduce different forms of supervision into mixed-membership latent variable models. First, we introduce a technique to bias the models to exploit topic-indicative features, i.e. features which are a priori known to be good indicators of the latent topics that generated them. Next, we present methods to modify the Gibbs sampler used for approximate inference in such models to permit injection of stronger forms of supervision in the form of labels for features and documents, along with a description of the corresponding change in the underlying generative process. This ability allows us to span the range from unsupervised topic models to semi-supervised learning in the same mixed-membership model. Experimental results from an entity-clustering task demonstrate that the biasing technique and the introduction of feature and document labels provide a significant increase in clustering performance over baseline mixed-membership methods.
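    The abstract describes the biasing technique only at a high level. One common way to realise a bias toward topic-indicative features in a collapsed Gibbs sampler for an LDA-style mixed-membership model is an asymmetric topic-word prior that is boosted for (topic, word) pairs where the word is known a priori to indicate that topic. The sketch below illustrates that general idea; the variable names, prior values, and the indicative mapping are hypothetical and not taken from the paper.

        # Collapsed Gibbs sampling step for an LDA-style model with an asymmetric
        # topic-word prior that favours topic-indicative features.
        # Illustrative sketch of the general idea only, not the authors' implementation.
        import numpy as np

        def biased_beta(num_topics, vocab_size, indicative, base=0.01, boost=5.0):
            # Start from a small symmetric prior and boost the entries for features
            # known a priori to indicate a particular topic (indicative: {word_id: topic_id}).
            beta = np.full((num_topics, vocab_size), base)
            for word_id, topic_id in indicative.items():
                beta[topic_id, word_id] += boost
            return beta

        def sample_topic(d, w, n_dk, n_kw, n_k, alpha, beta, beta_sum, rng):
            # Resample the topic of one token of word w in document d, with the counts
            # for that token already decremented:
            #   n_dk[d, k] -- topic counts per document
            #   n_kw[k, w] -- word counts per topic
            #   n_k[k]     -- total tokens per topic
            #   beta_sum[k] = beta[k, :].sum(), precomputed
            p = (n_dk[d] + alpha) * (n_kw[:, w] + beta[:, w]) / (n_k + beta_sum)
            p /= p.sum()
            return rng.choice(len(p), p=p)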