Post-Doctoral Research Visit F/M Designing a foundation model for unified video representation learning for multiple camera conditions such as CCTV, dash cam, UAV cam, movies camera, etc...
Contract type : Fixed-term contract
Level of qualifications required : Graduate degree or equivalent
Fonction : Post-Doctoral Research Visit
About the research centre or Inria department
The Inria center at Université Côte d'Azur includes 42 research teams and 9 support services. The center’s staff (about 500 people) is made up of scientists of different nationalities, engineers, technicians and administrative staff. The teams are mainly located on the university campuses of Sophia Antipolis and Nice as well as Montpellier, in close collaboration with research and higher education laboratories and establishments (Université Côte d'Azur, CNRS, INRAE, INSERM ...), but also with the regional economic players.
With a presence in the fields of computational neuroscience and biology, data science and modeling, software engineering and certification, as well as collaborative robotics, the Inria Centre at Université Côte d'Azur is a major player in terms of scientific excellence through its results and collaborations at both European and international levels.
Context
Inria, the French National Institute for Computer Science and Applied Mathematics, promotes “scientific excellence for technology transfer and society”. Graduates from the world’s top universities, Inria's 2,700 employees rise to the challenges of digital sciences. With its open, agile model, Inria can explore original approaches with its partners in industry and academia and provide an efficient response to the multidisciplinary and application challenges of digital transformation. Inria is the source of many innovations that add value and create jobs.
Team
The STARS research team combines advanced theory with cutting-edge practice focusing on cognitive vision systems.
Team web site : https://team.inria.fr/stars/
Scientific context
STARS group works on automatic video monitoring and human behavior understanding for health applications. The Deep Learning platform developed in STARS, detects mobile objects, tracks their trajectory, and recognizes related behaviors predefined by experts. This platform contains several techniques for detecting people and for recognizing human postures/gestures using conventional cameras. However, there are scientific challenges in people tracking when dealing with real-world scenes: cluttered scenes, handling wrong and incomplete person segmentation, handling static and dynamic occlusions, low contrasted objects, moving contextual objects (e.g. chairs), similar appearance of clothes among different people ...
This Project aims to detect critical situations in the daily life of elderly people living at home alone. We believe that a system that can detect potentially dangerous situations will give peace of mind to frail older people as well as to their caregivers. This will require not only recognition of ADLs but also an evaluation of the way and timing in which they are being carried out.
The users we are targeting should be in relatively good health condition. We do not want to address serious dementia problems but intend to give support to people who suffer from age-related forgetfulness (in many cases the start of an Alzheimer's condition). The users should live in their own house by themselves and generally be able to carry out their day-to-day activities. The system we want to develop is intended to help them and their relatives feel more comfortable because they know potentially dangerous situations will be detected and reported to caregivers if necessary.
Assignment
Video data is generated by a wide range of camera types, each with unique characteristics and applications. From static surveillance cameras monitoring urban spaces to UAV (Unmanned Aerial Vehicle) cameras capturing aerial perspectives and cinematic cameras producing dynamic scenes, the variety of video data sources presents a unique challenge for video analysis models. Traditional approaches, often specialised for a single camera type, struggle to generalise across different sources. This limitation reduces the efficiency and adaptability of video analytics systems, particularly in real-world applications where footage from multiple camera types needs to be processed cohesively. This proposal seeks to develop a Unified Video Representation Learning (UVRL) framework capable of generating robust and transferable video representations for multiple camera types, including static, moving, UAV, and cinematic cameras. By designing a flexible model architecture that adapts to the distinct characteristics of each camera type, this project aims to create a versatile framework capable of delivering high-quality, context-aware representations across diverse video sources.
Main activities
The Inria STARS team is seeking an engineer with a strong background in computer vision, deep learning, and machine learning.
The candidate is expected to conduct research related to the development of computer vision algorithms for video understanding.
Main activities:
- Analyze the requirements of doctors and patients/end-users and Study the limitations of existing solutions.
- Propose a new algorithm for detecting the behaviors of patients/end-users
- Evaluate and optimize the proposed algorithm on the targeted video datasets
- Oral presentation and Write reports
- Submit a scientific paper to a conference
Skills
Candidates must hold a Master's degree or equivalent in Computer Science or a closely related discipline by the start date.
The candidate must be grounded in computer vision basics and have solid mathematical and programming skills.
With theoretical knowledge in Computer Vision, OpenCV, Mathematics, Deep Learning (PyTorch, TensorFlow), and technical background in C++ and Python programming, and Linux.
The candidate must be committed to scientific research and substantial publications.
In order to protect its scientific and technological assets, Inria is a restricted-access establishment. Consequently, it follows special regulations for welcoming any person who wishes to work with the institute. The final acceptance of each candidate thus depends on applying this security and defense procedure.
Benefits package
- Subsidized meals
- Partial reimbursement of public transport costs
- Leave: 7 weeks of annual leave + 10 extra days off due to RTT (statutory reduction in working hours) + possibility of exceptional leave (sick children, moving home, etc.)
- Possibility of teleworking and flexible organization of working hours
- Professional equipment available (videoconferencing, loan of computer equipment, etc.)
- Social, cultural and sports events and activities
- Access to vocational training
- Contribution to mutual insurance (subject to conditions)
Remuneration
Gross Salary : 2788 € per month
General Information
- Theme/Domain : Vision, perception and multimedia interpretation
- Town/city : Sophia Antipolis
- Inria Center : Centre Inria d'Université Côte d'Azur
- Starting date : 2025-02-01
- Duration of contract : 12 months
- Deadline to apply : 2025-01-31
Warning : you must enter your e-mail address in order to save your application to Inria. Applications must be submitted online on the Inria website. Processing of applications sent from other channels is not guaranteed.
Instruction to apply
Applications must be submitted online on the Inria website. Collecting applications by other channels is not guaranteed.
Defence Security :
This position is likely to be situated in a restricted area (ZRR), as defined in Decree No. 2011-1425 relating to the protection of national scientific and technical potential (PPST).Authorisation to enter an area is granted by the director of the unit, following a favourable Ministerial decision, as defined in the decree of 3 July 2012 relating to the PPST. An unfavourable Ministerial decision in respect of a position situated in a ZRR would result in the cancellation of the appointment.
Recruitment Policy :
As part of its diversity policy, all Inria positions are accessible to people with disabilities.
Contacts
- Inria Team : STARS
-
Recruiter :
Brémond François / Francois.Bremond@inria.fr
The keys to success
- Essential qualities in order to fulfil this assignment are feeling at ease in an environment of scientific dynamics and wanting to learn and listen.
- Passionate about innovation, willing to go for a PhD thesis in the field of Computer Vision and Machine Learning.
Languages: English
- Relational skills: team work
- Other valued appreciated: leadership
About Inria
Inria is the French national research institute dedicated to digital science and technology. It employs 2,600 people. Its 200 agile project teams, generally run jointly with academic partners, include more than 3,500 scientists and engineers working to meet the challenges of digital technology, often at the interface with other disciplines. The Institute also employs numerous talents in over forty different professions. 900 research support staff contribute to the preparation and development of scientific and entrepreneurial projects that have a worldwide impact.