Post-Doctoral Research Visit F/M Topological Methods for Spatial Transcriptomics Data

Le descriptif de l’offre ci-dessous est en Anglais

Type de contrat : CDD

Niveau de diplôme exigé : Thèse ou équivalent

Fonction : Post-Doctorant

A propos du centre ou de la direction fonctionnelle

The Inria center at Université Côte d'Azur includes 42 research teams and 9 support services. The center’s staff (about 500 people) is made up of scientists of different nationalities, engineers, technicians and administrative staff. The teams are mainly located on the university campuses of Sophia Antipolis and Nice as well as Montpellier, in close collaboration with research and higher education laboratories and establishments (Université Côte d'Azur, CNRS, INRAE, INSERM ...), but also with the regional economic players.

With a presence in the fields of computational neuroscience and biology, data science and modeling, software engineering and certification, as well as collaborative robotics, the Inria Centre at Université Côte d'Azur  is a major player in terms of scientific excellence through its results and collaborations at both European and international levels.

Contexte et atouts du poste

With the launch of the AI4scMed project (multiscale AI for single-cell based precision medicine) of the PEPR Santé Numérique, we seek to hire an excellent candidate for a post-doctoral research position of 2 years specialized in data science and single-cell data. The successful candidate will be primarily based at Centre Inria d’Université Côte d’Azur in Biot, France, with collaborations at ENS Lyon, EC Nantes and INSERM Marseille.

Mission confiée

The goal of this postdoctoral research position is to develop the theoretical and practical applications of Topological Data Analysis (TDA) to spatial transcriptomics data, with a particular focus on multi-parameter persistence methods. TDA is a field of data science that gained a lot of attention during the last few years due to its ability to generate new descriptors and features for data sets (such as persistence diagrams and Mapper complexes), which encode topological information—for instance, the number and sizes of topological structures in the data, including, e.g., connected components, loops and cavities—that is usually complementary to traditional features, and can thus greatly improve performances of standard methods in Machine Learning. Moreover, as TDA is firmly rooted in algebraic topology, its descriptors enjoy many properties, such as being invariant to any continuous deformations, as well as being robust to potentially non-linear noise, as proved by the celebrated stability theorem. These properties have made TDA a useful tool in the analysis of spatial transcriptomics data, which jointly measures the RNA expression of all genes and the spatial coordinates,
of single cells in a piece of tissue. As the geometric arrangements of single cells is known to correlate to more complex biological phenomenon, such as the immune response in the presence of tumors, descriptors have recently as powerful summaries, that can be seen as generalizations of other state-of-the-art descriptors based on spatial statistics, such as Ripley’s K-function. However, an inherent limitation of current approaches based on topology is their inability to study, capture and encode the patterns formed by the joint interactions of spatial coordinates and several marker genes: as of today, all topological approaches can only characterize
the topological arrangements of single cells for a given single gene or cell type. This is problematic as restricting to single genes or cell types is highly restrictive in terms of information content, and can lead to unreliable results, as it is prone to errors as long as the cell type inference method (with, e.g., transfer learning) is not perfect.

Principales activités

Hence, the successful candidate will work on multi-parameter TDA, a powerful field that emerged recently in the TDA community, for designing new analysis tools for spatial transcriptomics data. This includes the statistical analysis and design of confidence models for
multi-parameter Mapper complexes, which are visualization tools based on the topology of data. This will allow for efficient and robust biological inference based on the topological structures exhibited in Mapper complexes that are associated to interesting subpopulations of cells, following the preliminary work. Another line of work will be based on the use of multi-parameter persistence descriptors, which are useful descriptors of the multiparameter topological patterns that are present in (batches of) cells, and which take the form of images encoding the presence and sizes of these patterns. Again, the design of generative and predictive models fueled by these descriptors is expected for biological inference. Then, the candidate will apply its methods and models on real spatial transcriptomics datasets obtained within the PEPR consortium.

The application will focus on the study of long-range spatial interactions of receptor-ligand pairs in the tissue. The research will be jointly supervised by Bertrand Michel (LMJL, EC Nantes), Franck Picard (LBMC, ENS-Lyon), Paul Villoutreix (Marseille Medical Genetics, INSERM, Marseille), and Mathieu Carrière (DataShape, Centre Inria d’Université Côte d’Azur)

Compétences

Any candidate should have a PhD degree in applied mathematics and/or computer science, excellent programming skills with an extensive knowledge of the principal data science libraries for (deep) machine learning and optimization (such as Scikit-Learn,
TensorFlow, PyTorch) and a lot of research experience on topics including topological data analysis and statistical machine learning. Some experience in single cell biology and spatial transcriptomics is a plus. Teamworking, communication and collaboration skills are also essential.

Avantages

  • Subsidized meals
  • Partial reimbursement of public transport costs
  • Leave: 7 weeks of annual leave + 10 extra days off due to RTT (statutory reduction in working hours) + possibility of exceptional leave (sick children, moving home, etc.)
  • Possibility of teleworking (after 6 months of employment) and flexible organization of working hours
  • Professional equipment available (videoconferencing, loan of computer equipment, etc.)
  • Social, cultural and sports events and activities
  • Access to vocational training
  • Contribution to mutual insurance (subject to conditions)

Rémunération

Gross Salary : 2788 € per month.