Conference

Record

Production location

Centre Inria de l'Université de Rennes

Language:

English

Credits

Anne-Laure Boulesteix (Speaker)

Image credit: Centre Inria de l'Université de Rennes

Rights holder

Centre Inria de l'Université de Rennes

Terms of use

CC BY-NC-SA (Attribution - NonCommercial - ShareAlike)

DOI : 10.60527/5hkk-8e64

Cite this resource:

Anne-Laure Boulesteix. Inria. (2024, June 19). Keynote: Replicable empirical machine learning research. [Video]. Canal-U. https://doi.org/10.60527/5hkk-8e64. (Accessed October 30, 2025)

Keynote: Replicable empirical machine learning research

Recorded: June 19, 2024 - Published online: June 19, 2024


Description

In the absence of mathematical theory addressing complex real-life settings beyond simplifying assumptions, the behavior and performance of machine learning methods often have to be assessed by applying them to real or simulated data and observing what happens. In this sense, methodological machine learning research can be viewed as an empirical science. Are the results published in this field reliable? When authors claim that their (new) method performs better than existing ones, should readers trust them? Is an independent study likely to obtain similar results? The answer to all these questions is probably "not always". The so-called replication crisis in science has drawn increasing attention across empirical research fields such as medicine or psychological science. What about good practice issues in methodological empirical research, that is, research that considers methods as research objects? When developing and evaluating new machine learning methods, do we adhere to the good practice principles typically promoted in other fields? I argue that the machine learning community should make substantial efforts to address what may be called the replication crisis in methodological research, in particular by trying to avoid bias in comparison studies based on simulated or real data. I discuss topics such as publication bias, cherry-picking/over-optimism, experimental design and the necessity of neutral comparison studies, and review recent positive developments towards more reliable empirical evidence. Benchmark studies comparing statistical learning methods with a focus on high-dimensional biological data will be used as examples.

Speaker / Scientific lead

Boulesteix

Anne-Laure

PU, HDR - University of Munich Marchioninistr - Munich (Germany)

Topic

Discipline:

Computer science

Reproducibility (science)

On the same topic

Course/Seminar

01:54:39

Pop

Sorina

Computational reproducibility: an overview illustrated with examples from the medical imaging research community
Medical imaging -- Image quality
Data analysis -- Software
Image processing
Open science
Computational reproducibility
17.10.2024
Conference

02:58:19

Tutorial Track 1: Reproducible distributed environments with NixOS Compose

Presented by Quentin Guilloteau, Postdoctoral Fellow, Fernando Ayats Llamas, Research Engineer and Olivier Richard, Assistant Professor.
Reproducibility (science)
20.06.2024
Conference

03:04:37

Tutorial Track 1: Reproducibility of Scientific Results using E4S Containers

Presented by Sameer Shende, Research Professor.
Reproducibility (science)
20.06.2024
Conference

02:51:35

Tutorial Track 2: Fostering Reproducibility By Integrating Large Language Model and Scholarly Knowl…

Presented by Hassan Hussein, PhD Student, Vindoh Ilangovan, Researcher and Kaouter Kebaili, PhD Student.
Reproducibility (science)
20.06.2024
Conference

01:23:24

Tutorial Track 3: Managing HPC Software Complexity with Spack

Presented by Massimiliano Culpo, Researcher.
Reproducibility (science)
20.06.2024
Conference

01:47:06

Tutorial Track 4: Practical strategies for teaching reproducibility

Presented by Fraida Fund, Research Assistant Professor, Sarah Cohen-Boulakia, Professor and Bogdan Alexandru Stoica, PhD Student.
Reproducibility (science)
20.06.2024
Conference

00:46:34

Session 1: Provenance and Reproducibility

Chair: Victoria Stodden
Reproducibility (science)
19.06.2024
Conference

00:20:34

Session 2: The Human Side of Reproducibility

Chair: Camille Maumet
Reproducibility (science)
19.06.2024
Conference

01:10:28

Session 3: Computational Experiment Preservation

Chair: Arnaud Legrand
Reproducibility (science)
19.06.2024
Conference

00:11:16

Session 4: Poster Lightning Talks

Talk 1 [00:00]: NPF: orchestrate and reproduce network experiments. By Tom Barbette. Presented by Tom Barbette, Assistant Professor. Talk 2 [03:10]: From reproducible to reusable bioinformatics
Reproducibility (science)
19.06.2024
Conference

00:53:34

Keynote: Reproducibility and replicability of computer simulations

Hinsen

Konrad

Since the early days of the reproducibility crisis, much progress has been made in understanding and improving computational reproducibility and replicability (R and R)...
Reproducibility (science)
18.06.2024
Conference

01:11:33

Session 5: Reproducibility Enhancing Frameworks

Chair: Sameer Shende
Reproducibility (science)
18.06.2024