PhD Position F/M Modeling and data curation in robot policy learning

Télécharger l'offre au format PDF

Le descriptif de l’offre ci-dessous est en Anglais

Type de contrat : CDD

Niveau de diplôme exigé : Bac + 5 ou équivalent

Fonction : Doctorant

Niveau d'expérience souhaité : Jeune diplômé

Contexte et atouts du poste

The Phd will be done at Inria in the Willow research team.

Mission confiée

This PhD addressses the challenge of data in robotic policy learning. The focus is on two key research directions: 1. understanding the structure and utility of existing robotic datasets, and 2. scaling robot learning via human demonstration videos.

The first line of research involves a comprehensive analysis of robotics datasets to identify which samples contribute the most significantly to policy performance and which are redundant. This involves techniques such as influence functions and diversity metrics to support data curation strategies that enhance learning efficiency. Beyond per-sample importance, this work will investigate which types of data features (semantic labels, 2D vs. 3D structure, or specific modalities like touch, language, or speech) most effectively support generalizable policy learning. In turn, this analysis aims to guide the construction of more comprehensive datasets for robot learning, spanning different embodiments and skill domains.

The second line of work aims to unlock large-scale learning from human task demonstrations. Rather than relying on manual data collection or simulation, this research will develop methods to extract structured, task-relevant information (end-effector state changes, contact dynamics, or long-term intent) directly from real-world videos of humans performing tasks. Key challenges such as embodiment mismatch, contact ambiguity, and occlusions will be addressed through morphology-aware embeddings, and policy distillation strategies that adapt human-derived policies to robotic embodiments via reinforcement learning, or embodiment-agnostic models. The research will further investigate how to leverage human error examples to teach robots what not to do, and how to extract rich physical attributes from video using motion cues and learned dynamics priors.

Together, these efforts seek to lay the groundwork for data-efficient, scalable learning pipelines, paving the way for generalist robot policies. This research aims to integrate deep insights from datasets with supervisory signals from human behavior for generalist robotic policy learning, bridging the gap between data availability and robotic capability.

Principales activités

Main activities:

Analyse and implement related work.
Design novel innovative solutions.
Write progress reports and papers.
Present work at conferences.

Compétences

Technical skills and level required : programming skills are required.

Languages : English and possibly French.

Relational skills : Good communication skills.

Avantages

Subsidized meals
Partial reimbursement of public transport costs
Leave: 7 weeks of annual leave + 10 extra days off due to RTT (statutory reduction in working hours) + possibility of exceptional leave (sick children, moving home, etc.)
Possibility of teleworking and flexible organization of working hours
Professional equipment available (videoconferencing, loan of computer equipment, etc.)
Social, cultural and sports events and activities
Access to vocational training
Social security coverage

Postuler à cette offre

Informations générales

Thème/Domaine : Robotique et environnements intelligents
Statistiques (Big data) (BAP E)
Ville : Paris
Centre Inria : Centre Inria de Paris
Date de prise de fonction souhaitée : 2025-10-01
Durée de contrat : 3 ans
Date limite pour postuler : 2025-07-24

Attention: Les candidatures doivent être déposées en ligne sur le site Inria. Le traitement des candidatures adressées par d'autres canaux n'est pas garanti.

Consignes pour postuler

Sécurité défense :
Ce poste est susceptible d’être affecté dans une zone à régime restrictif (ZRR), telle que définie dans le décret n°2011-1425 relatif à la protection du potentiel scientifique et technique de la nation (PPST). L’autorisation d’accès à une zone est délivrée par le chef d’établissement, après avis ministériel favorable, tel que défini dans l’arrêté du 03 juillet 2012, relatif à la PPST. Un avis ministériel défavorable pour un poste affecté dans une ZRR aurait pour conséquence l’annulation du recrutement.

Politique de recrutement :
Dans le cadre de sa politique diversité, tous les postes Inria sont accessibles aux personnes en situation de handicap.

Contacts

Équipe Inria : WILLOW
Directeur de thèse :
Schmid Cordelia / cordelia.schmid@inria.fr

L'essentiel pour réussir

Essential qualities in order to fulfil this assignment are feeling at ease in an environment of scientific dynamics and wanting to learn and listen. Prior experience in research is a plus.

A propos d'Inria

Inria est l’institut national de recherche dédié aux sciences et technologies du numérique. Il emploie 2600 personnes. Ses 215 équipes-projets agiles, en général communes avec des partenaires académiques, impliquent plus de 3900 scientifiques pour relever les défis du numérique, souvent à l’interface d’autres disciplines. L’institut fait appel à de nombreux talents dans plus d’une quarantaine de métiers différents. 900 personnels d’appui à la recherche et à l’innovation contribuent à faire émerger et grandir des projets scientifiques ou entrepreneuriaux qui impactent le monde. Inria travaille avec de nombreuses entreprises et a accompagné la création de plus de 200 start-up. L'institut s'eﬀorce ainsi de répondre aux enjeux de la transformation numérique de la science, de la société et de l'économie.