Post-Doctoral Research Visit F/M Postdoctoral position Reinforcement Learning for Collaborative Annotation
Contract type: Fixed-term contract (CDD)
Required degree level: PhD or equivalent
Role: Postdoctoral researcher
About the centre or functional department
Context and advantages of the position
This postdoctoral position is part of the national PEPR (Programme et Equipement Prioritaire de Recherche) PlantAgroEco project, coordinated by Alexis Joly. The PEPR involves several teams from various institutes (Inria ZENITH, CIRAD AMAP, CIRAD PHIM, CIRAD PBVMT, INRAE ePhytia, INRAE IGEPP, INRAE LISAH, IRD EGCE, IRD IEES, Univ. Paris Saclay, TelaBotanica). We are seeking a highly motivated and skilled postdoctoral fellow in Machine Learning, with a specific focus on Reinforcement Learning, to join the project. The position is initially funded for 18 months and can easily be extended.
The starting date is flexible: ideally February 1st, 2024, but it can be earlier or later. The candidate will be based at Inria Lille - Nord Europe under the guidance of Odalric-Ambrym Maillard.
About Us: The PEPR PlantAgroEco project brings together multidisciplinary teams from esteemed institutes, including Inria ZENITH, CIRAD AMAP, CIRAD PHIM, and more. Our mission is to address intriguing theoretical challenges in the application of agroecological practices in agriculture through cutting-edge Machine Learning techniques.
Collaborative Environment: You will collaborate closely with a team of dedicated Engineers responsible for the actual implementations. Hence, your primary focus will be on the creation of sound algorithms and methods, ensuring their theoretical integrity and applicability to real-world scenarios.
Odalric-Ambrym Maillard is a researcher at Inria. He has worked for over a decade on advancing the theoretical foundations of reinforcement learning, using a combination of tools from statistics, optimization and control, in order to build more efficient algorithms able to better estimate uncertainty, exploit structures, or adapt to non-stationary contexts.
He was the PI of the ANR-JCJC project BADASS (BAnDits Against non-Stationarity and Structure) until Oct. 2021. He is also leading the Inria Action Exploratoire SR4SG (Sequential Recommendation for Sustainable Gardening) and the Inria-Japan associate team RELIANT (Reliable multi-armed bandits), and is involved in a series of other projects, from more applied to more theoretical ones, all related to the grand challenge of reinforcement learning: making it applicable in real-life situations.
See http://odalricambrymmaillard.neowordpress.fr for further details.
Scool (Sequential COntinual and Online Learning) is an Inria project-team. It was created on November 1st, 2020 as the follow-up of the team SequeL. In a nutshell, the research topic of Scool is the study of sequential decision making under uncertainty. Most of our activities are related to either bandit problems or reinforcement learning problems. Through collaborations, we are working on their application in various fields, mainly health, agriculture and ecology, and sustainable development. See our Projects page (https://team.inria.fr/scool/projects/) for more information.
Assignment
Your Mission: As a key member of our team, you will embark on an enriching journey to tackle complex theoretical challenges, applying your expertise to a real open-science application. This role offers a unique opportunity for a young researcher to make valuable and visible contributions in an ambitious project.
The project is organized around three high-level tasks and research questions:
- User Annotation-Expertise Profiling: Your expertise will be instrumental in estimating and tracking user annotation profiles, adapting contextual bandit strategies to provide tailored support, and leveraging change-point detection techniques. These innovations will have wide-ranging applications beyond the scope of PlantNet, contributing to top-tier conferences and journals related to recommender systems.
- Rapid Annotation Assistance: You will devise efficient techniques for rapid annotation, customizing approaches based on users' estimated expertise. This task involves pioneering sample-efficient hypothesis testing and personalizing assistance for optimal outcomes. Your work will provide generic-purpose approaches applicable to diverse domains.
- Complementary Expert Query Strategies: You will pioneer adaptive query strategies for a diverse pool of experts, ensuring reliable collective labeling and adaptive stopping mechanisms. This research will not only benefit PlantNet but also have implications for other applications.
These tasks can be explored in various ways and lead to other challenges but should be considered the backbone of the project. The research, though focused on the PlantNet example, should be considered from a broader perspective, and be beneficial to recommender systems at large.
Main activities
Making reinforcement learning techniques applicable to real-life applications (such as the recommendation of agroecological practices in agriculture) requires overcoming several scientific bottlenecks. Within the scope of the PEPR PlantAgroEco project, this 18-month postdoc will focus on providing novel reinforcement learning strategies to improve the collaborative annotation process of the PlantNet (https://plantnet.org) data acquisition platform, from both a theoretical and an applied perspective. This project raises appealing challenges around contextual multi-armed bandits that are relevant to collaborative decision making and recommendation at large, with a unique opportunity to interact with a real data platform used by millions. Solving these challenges in a sound and effective way requires special attention from both mathematical and computational standpoints.
Skills
- PhD in machine learning or statistics, with a focus on multi-armed bandits or recommender systems.
- Proficiency in English.
- Strong coding abilities, coupled with analytical and statistical expertise.
- Proven background in areas such as probability, Markov chains, and concentration of measure.
- Adeptness with contextual bandits, active sampling, and recommender systems.
- Ability to work collaboratively within a dynamic scientific environment.
Benefits
- Subsidized meals
- Partial reimbursement of public transport costs
- Leave: 7 weeks of annual leave + 10 extra days off due to RTT (statutory reduction in working hours) + possibility of exceptional leave (sick children, moving home, etc.)
- Possibility of teleworking and flexible organization of working hours
- Professional equipment available (videoconferencing, loan of computer equipment, etc.)
- Social, cultural and sports events and activities
- Access to vocational training
- Social security coverage
Remuneration
General information
- Theme/Domain: Optimization, machine learning and statistical methods; Information systems (BAP E)
- City: Villeneuve d'Ascq
- Inria centre: Centre Inria de l'Université de Lille
- Desired starting date: 2024-02-01
- Contract duration: 1 year, 6 months
- Application deadline: 2024-05-31
Warning: Applications must be submitted online via the Inria website. Processing of applications sent through other channels is not guaranteed.
Application instructions
Resume + cover letter
Defence security:
This position may be assigned to a restricted-access area (zone à régime restrictif, ZRR), as defined in Decree No. 2011-1425 on the protection of the nation's scientific and technical potential (PPST). Authorization to access such an area is granted by the head of the institution, following a favourable ministerial opinion, as defined in the order of 3 July 2012 relating to the PPST. An unfavourable ministerial opinion for a position assigned to a ZRR would result in the cancellation of the recruitment.
Recruitment policy:
As part of its diversity policy, all Inria positions are accessible to people with disabilities.
Contacts
- Inria team: SCOOL
- Recruiter: Odalric-Ambrym Maillard / Odalric.Maillard@inria.fr
Keys to success
This position provides a unique opportunity for a young researcher to contribute meaningfully to both theory and practical application. Your work will lead to impactful publications and modules for the PlantNet platform, while also influencing the broader landscape of agroecological recommendations.
If you are a curious, proactive, and open-minded researcher with a passion for learning, we invite you to join us in this exciting endeavor.
Note: A certain level of autonomy is encouraged in performing your tasks, allowing for creative exploration and innovation.
We look forward to welcoming a dedicated researcher who is ready to make significant contributions in this dynamic and forward-thinking environment. Apply now and be a part of our journey towards advancing agroecological recommendations through cutting-edge Machine Learning techniques.
About Inria
Inria is the French national research institute dedicated to digital science and technology. It employs 2,600 people. Its 215 agile project-teams, generally run jointly with academic partners, involve more than 3,900 scientists in meeting the challenges of digital technology, often at the interface with other disciplines. The institute draws on many talents in over forty different professions. 900 research and innovation support staff help scientific and entrepreneurial projects with worldwide impact to emerge and grow. Inria works with many companies and has supported the creation of more than 200 start-ups. The institute thus strives to meet the challenges of the digital transformation of science, society and the economy.