PhD Position F/M [Campagne doctorants] on distributed automated machine learning with application on IoT data
Type de contrat : CDD
Niveau de diplôme exigé : Bac + 5 ou équivalent
Fonction : Doctorant
Contexte et atouts du poste
The MIMOVE team at Inria Paris undertakes research enabling next-generation mobile distributed systems, from their conception and design to their runtime support, focusing on middleware and data. MIMOVE has longstanding expertise in mobile and service-oriented computing, semantic technologies, interoperability, system emergence and evolution, and edge/fog computing. MIMOVE works on these topics through many national and international collaborations with academia and industry, including large-scale software development of real-world systems. MIMOVE’s research results impact various application domains; MIMOVE focuses in particular on the application areas of IoT and smart cities.
The PhD student will be an employee of Inria and will be supervised by Nikolaos Georgantas (nikolaos.georgantas@inria.fr) and Maroua Bahri (maroua.bahri@inria.fr).
Mission confiée
Automated Machine Learning (AutoML) is an approach that optimizes the machine learning process, making it more accessible and efficient for users with varying levels of expertise [1]. AutoML leverages algorithms and computational capabilities to automate key aspects of the machine learning pipeline, such as feature engineering, model selection, and hyperparameter tuning. This enables individuals with limited machine learning expertise to build and deploy impactful models. However, autoML is still in its infancy stage and struggles to keep pace with the growing volumes of heterogeneous IoT data in continuous contexts. Moreover, existing automated solutions are primarily designed for batch setting and are centralized, making them unsuitable for handling continuous IoT data streams at scale.
The objective of this PhD thesis is to investigate the autoML problems and enhance its adaptability and efficiency in distributed environments, particularly with IoT data. This will involve researching and integrating algorithms into the autoML pipeline, with a focus on optimizing hyperparameters for real-time IoT data [2][3] in distributed settings. Additionally, two distributed aspects will be examined to improve resource utilization; (i) distributing autoML processing tasks to manage the computational complexity of the autoML challenges on IoT data streams [4], and (ii) processing heterogeneous distributed IoT data in edge computing environments with nodes of varying capabilities, where data are processed near their sources to reduce communication, networks delay, network bandwidth, and even enforce data privacy [5].
References:
[1] Hutter, F, Kotthoff, L, & Vanschoren, J. Automated machine learning: methods, systems, challenges. Springer Nature, 2019.
[2] Kulbach, C, Montiel, J, Bahri, M, Heyden, M, & Bifet, A. "Evolution-Based Online Automated Machine Learning." Pacific-Asia Conference on Knowledge Discovery and Data Mining. Cham: Springer International Publishing, 2022.
[3] Carnein, M, Trautmann, H, Bifet, A, & Pfahringer, B. "confstream: Automated algorithm selection and configuration of stream clustering algorithms." Learning and Intelligent Optimization: 14th International Conference, LION, 2020.
[4] A. Abd Elrahman, M. El Helw, R. Elshawi and S. Sakr, "D-SmartML: A Distributed Automated Machine Learning Framework," IEEE 40th International Conference on Distributed Computing Systems (ICDCS), 2020.
[5] Preuveneers D. “AutoFL: Towards AutoML in a Federated Learning Context”. Applied Sciences. 2023.
Principales activités
The PhD student will conduct original research on the topic described above. The expected activities include, but are not limited to:
- Bibliographical study on autoML, distributed computing, edge analytics
- Formulation of autoML for data streams in a distributed context
- Development of distributed autoML systems
- Assessment of the novel proposed approach(es)
- Scientific publications and presentation of results at conferences
Compétences
- Sound knowledge of machine learning/distributed systems/optimization concepts
- Software development skills: Python and Java
- Relational skills: team worker (verbal communication, active listening, motivation and commitment)
- Good level of spoken and written English
Avantages
- Subsidized meals
- Partial reimbursement of public transport costs
- Leave: 7 weeks of annual leave + 10 extra days off due to RTT (statutory reduction in working hours) + possibility of exceptional leave (sick children, moving home, etc.)
- Possibility of teleworking (after 6 months of employment) and flexible organization of working hours
- Professional equipment available (videoconferencing, loan of computer equipment, etc.)
- Social, cultural and sports events and activities
- Access to vocational training
- Social security coverage
Rémunération
Monthly gross salary : 2100 € during the first and second years. 2190 € the last year.
Informations générales
- Thème/Domaine :
Systèmes distribués et intergiciels
Système & réseaux (BAP E) - Ville : Paris
- Centre Inria : Centre Inria de Paris
- Date de prise de fonction souhaitée : 2024-05-06
- Durée de contrat : 13 jours
- Date limite pour postuler : 2024-05-19
Attention: Les candidatures doivent être déposées en ligne sur le site Inria. Le traitement des candidatures adressées par d'autres canaux n'est pas garanti.
Consignes pour postuler
The documents required for the candidate's file (to put in the same pdf file):
- Motivational letter highlighting the alignment of the candidate's education with the proposed subject
- CV
- Master's transcripts
- Letters of recommendation
Sécurité défense :
Ce poste est susceptible d’être affecté dans une zone à régime restrictif (ZRR), telle que définie dans le décret n°2011-1425 relatif à la protection du potentiel scientifique et technique de la nation (PPST). L’autorisation d’accès à une zone est délivrée par le chef d’établissement, après avis ministériel favorable, tel que défini dans l’arrêté du 03 juillet 2012, relatif à la PPST. Un avis ministériel défavorable pour un poste affecté dans une ZRR aurait pour conséquence l’annulation du recrutement.
Politique de recrutement :
Dans le cadre de sa politique diversité, tous les postes Inria sont accessibles aux personnes en situation de handicap.
Contacts
- Équipe Inria : MIMOVE
-
Directeur de thèse :
Bahri Maroua / maroua.bahri@inria.fr
A propos d'Inria
Inria est l’institut national de recherche dédié aux sciences et technologies du numérique. Il emploie 2600 personnes. Ses 215 équipes-projets agiles, en général communes avec des partenaires académiques, impliquent plus de 3900 scientifiques pour relever les défis du numérique, souvent à l’interface d’autres disciplines. L’institut fait appel à de nombreux talents dans plus d’une quarantaine de métiers différents. 900 personnels d’appui à la recherche et à l’innovation contribuent à faire émerger et grandir des projets scientifiques ou entrepreneuriaux qui impactent le monde. Inria travaille avec de nombreuses entreprises et a accompagné la création de plus de 200 start-up. L'institut s'efforce ainsi de répondre aux enjeux de la transformation numérique de la science, de la société et de l'économie.