Search CORE

27 research outputs found

Evaluating and supervising vision models with multi-level similarity judgments

Author: Born F.
Greff K.
Lampinen A.
Mozer M.
Muttenthaler L.
Müller K.
Unterthiner T.
Publication venue
Publication date: 06/08/2024
Field of study

Getting Aligned on Representational Alignment

Author: Achterberg J.
Bobu A.
Chen S.
Collins K.
Geirhos R.
Grant E.
Greff K.
Griffiths T.
Groen I.
Hebart M.
Hermann K.
Jacoby N.
Kim B.
Konkle T.
Kornblith S.
Lampinen A.
Love B.
Marjieh R.
Muttenthaler L.
Müller K.
O'Connell T.
Oktar K.
Peng A.
Rane S.
Sucholutsky I.
Tenenbaum J.
Toneva M.
Unterthiner T.
Weller A.
Zhang Q.
Publication venue
Publication date: 01/01/2023
Field of study

Biological and artificial information processing systems form representationsthat they can use to categorize, reason, plan, navigate, and make decisions.How can we measure the extent to which the representations formed by thesediverse systems agree? Do similarities in representations then translate intosimilar behavior? How can a system's representations be modified to bettermatch those of another system? These questions pertaining to the study ofrepresentational alignment are at the heart of some of the most active researchareas in cognitive science, neuroscience, and machine learning. For example,cognitive scientists measure the representational alignment of multipleindividuals to identify shared cognitive priors, neuroscientists align fMRIresponses from multiple individuals into a shared representational space forgroup-level analyses, and ML researchers distill knowledge from teacher modelsinto student models by increasing their alignment. Unfortunately, there islimited knowledge transfer between research communities interested inrepresentational alignment, so progress in one field often ends up beingrediscovered independently in another. Thus, greater cross-field communicationwould be advantageous. To improve communication between these fields, wepropose a unifying framework that can serve as a common language betweenresearchers studying representational alignment. We survey the literature fromall three fields and demonstrate how prior work fits into this framework.Finally, we lay out open problems in representational alignment where progresscan benefit all three of these fields. We hope that our work can catalyzecross-disciplinary collaboration and accelerate progress for all communitiesstudying and developing information processing systems. We note that this is aworking paper and encourage readers to reach out with their suggestions forfuture revisions.<br

MPG.PuRe

Effect of missing data on multitask prediction methods

Author: A Anighoro
A Mayr
A Tropsha
Antonio de la Vega de León
AP Bento
B Chen
B Ramsundar
Beining Chen
D Fourches
D Rogers
D Weininger
G Harper
J Ma
J Simm
JG Moffat
KY Helal
L Breiman
M Glick
MR Berthold
S Kim
S Knapp
SL Kinnings
SM Wilhelm
T Unterthiner
TWH Backman
Valerie J. Gillet
Y LeCun
Y Wang
Y Xu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/05/2018
Field of study

There has been a growing interest in multitask prediction in chemoinformatics, helped by the increasing use of deep neural networks in this field. This technique is applied to multitarget data sets, where compounds have been tested against different targets, with the aim of developing models to predict a profile of biological activities for a given compound. However, multitarget data sets tend to be sparse; i.e., not all compound-target combinations have experimental values. There has been little research on the effect of missing data on the performance of multitask methods. We have used two complete data sets to simulate sparseness by removing data from the training set. Different models to remove the data were compared. These sparse sets were used to train two different multitask methods, deep neural networks and Macau, which is a Bayesian probabilistic matrix factorization technique. Results from both methods were remarkably similar and showed that the performance decrease because of missing data is at first small before accelerating after large amounts of data are removed. This work provides a first approximation to assess how much data is required to produce good performance in multitask prediction exercises

Crossref

Directory of Open Access Journals

White Rose Research Online

A model-based information sharing protocol for profile Hidden Markov Models used for HIV-1 recombination detection

Author: A Krogh
A Viterbi
AB Abecasis
AG Murzin
AK Schultz
AK Schultz
Anne-Kathrin Schultz
B Korber
BH Hahn
Christophe Chesneau
CS Hahn
D Paraskevis
DL Robertson
DL Robertson
DP Brown
EJ Feil
EM Goss
F Gao
FE McCutchan
Florin Serea
I Bulla
I Bulla
Ingo Bulla
J Fan
J Maydt
JC Plantier
JM Smith
K Sjölander
KE Ashelford
M Bergmann
M Hoelscher
M Orlich
M Seifert
M Worobey
M Zhang
M Zhang
MM Lai
MO Salminen
P Hraber
P Hugenholtz
P Lemey
R Durbin
R Shaikh
S Bolling
S Eddy
S Laht
S Young
SL Kosakovsky Pond
T de Oliveira
T Leitner
T Unterthiner
Tanya Mark
TC Bruen
TC Jarvis
WM Bolstad
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Low Data Drug Discovery with One-Shot Learning

Author: Duvenaud D. K.
Unterthiner T.
Publication venue: 'American Chemical Society (ACS)'
Publication date
Field of study

Crossref

The current limits in virtual screening and property prediction

Author: Imbernón B
Michael C Hutter
Trott O
Unterthiner T
Weisel M
Publication venue: 'Future Science Ltd'
Publication date
Field of study

Crossref

Object-Centric Learning with Slot Attention

Author: Dosovitskiy A.
Heigold G.
Kipf T.
Locatello F.
Mahendran A.
Unterthiner T.
Uszkoreit J.
Weissenborn D.
Publication venue
Publication date: 01/07/2021
Field of study

MPG.PuRe

Comparative study of multitask toxicity modeling on a broad chemical space.

Author: Dahl G. E.
Katzung B. G.
Martel B.
Srivastava N.
Todeschini R.
Unterthiner T.
van der Maaten L.
Publication venue: 'American Chemical Society (ACS)'
Publication date: 01/01/2019
Field of study

Acute toxicity is one of the most challenging properties to predict purely with computational methods due to its direct relationship to biological interactions. Moreover, toxicity can be represented by different end points: it can be measured for different species using different types of administration, etc., and it is questionable if the knowledge transfer between end points is possible. We performed a comparative study of prediction multitask toxicity for a broad chemical space using different descriptors and modeling algorithms and applied multitask learning for a large toxicity data set extracted from the Registry of Toxic Effects of Chemical Substances (RTECS). We demonstrated that multitask modeling provides significant improvement over single-output models and other machine learning methods. Our research reveals that multitask learning can be very useful to improve the quality of acute toxicity modeling and raises a discussion about the usage of multitask approaches for regulation purposes

Crossref

PuSH

The Francis Crick Institute

Aligning machine and human visual representations across abstraction levels

Author: Born F.
Greff K.
Kornblith S.
Lampinen A.
Mozer M.
Muttenthaler L.
Müller K.
Spitzer B.
Unterthiner T.
Publication venue
Publication date: 10/09/2024
Field of study

MPG.PuRe

Emulating Docking Results Using a Deep Neural Network: A New Perspective for Virtual Screening

Author: Agnieszka Pocha
Andrzej J. Bojarski
Jacek Tabor
LeCun Y.
Maciej Szymczak
Sabina Podlewska
Stanisław Jastrzębski
Stefan Mordalski
Unterthiner T.
Publication venue: 'American Chemical Society (ACS)'
Publication date: 01/01/2020
Field of study

Crossref

Copenhagen University Research Information System

Jagiellonian Univeristy Repository