Post-Doctoral Research Visit F/M: A Multi-Modal Language Model for Earth Observation - 34
Job description
Inria - Montpellier (34) - Fixed-term contract (CDD) - Published 4 September 2025
About Inria
Inria is the national research institute dedicated to digital science and technology. It employs 2,600 people. Its 215 agile project teams, generally run jointly with academic partners, involve more than 3,900 scientists in meeting the challenges of digital technology, often at the interface with other disciplines. The institute draws on a wide range of talent in more than forty different professions. Its 900 research and innovation support staff help scientific and entrepreneurial projects with real-world impact to emerge and grow. Inria works with many companies and has supported the creation of more than 200 start-ups. The institute thus strives to meet the challenges of the digital transformation of science, society and the economy.
Post-Doctoral Research Visit F/M: A multi-modal language model for Earth observation
The offer description below is in English
Contract type: Fixed-term contract (CDD)
Required degree level: PhD or equivalent
Role: Post-doctoral researcher
About the centre or functional department
The Inria centre at Université Côte d'Azur includes 42 research teams and 9 support services. The centre's staff (about 500 people) is made up of scientists of different nationalities, engineers, technicians and administrative staff. The teams are mainly located on the university campuses of Sophia Antipolis and Nice as well as Montpellier, in close collaboration with research and higher education laboratories and establishments (Université Côte d'Azur, CNRS, INRAE, INSERM ...), but also with the regional economic players.
With a presence in the fields of computational neuroscience and biology, data science and modeling, software engineering and certification, as well as collaborative robotics, the Inria Centre at Université Côte d'Azur is a major player in terms of scientific excellence through its results and collaborations at both European and international levels.
Context and assets of the position
This post-doctoral offer is funded by the GEO-ReSeT ANR project, representing a collaboration between Inria (team, Montpellier) and Université de Paris Cité (team, Paris).
Leveraging the large amounts of geo-spatial data available from different sources, the project (Generalized Earth Observation with Remote Sensing and Text) aims to learn a rich representation of any geo-spatial location and to convey a semantic representation of that information, improving on existing models and providing a better experience to end users. By using location on the Earth's surface as the common link between different modalities, a geo-spatial foundation model would be able to incorporate a variety of data sources, including remote sensing imagery, textual descriptions of places, and other generic features.
Such a foundation model has the potential to open up a whole new set of possibilities for Earth observation applications, by allowing few-shot or zero-shot solutions to classical problems such as land-cover and land-use mapping, target detection, and visual question answering. It will also be useful for a wide range of applications with a geo-spatial component, including environmental monitoring, urban planning and agriculture.
By leveraging several data modalities, this foundation model could provide a comprehensive and accurate understanding of the Earth's surface, enabling informed decisions and actions. This will be particularly valuable for new potential users in sectors such as journalism, social sciences or environmental monitoring, who may not have the resources or expertise to collect their own training datasets and develop their own methods, thus moving beyond open Earth observation data and democratizing the access to Earth observation information.
Assignment
The work to be conducted during the proposed post-doc project will contribute to the ambition of the GEO-ReSeT ANR project by linking textual descriptions of places (e.g., collected from heterogeneous online sources, such as news articles or search engine results), to their approximate geo-location, a task known as geoparsing.
This text-location link will then be used in combination with other geospatial data modalities, with a focus on remote sensing data from sensors such as Sentinel-1 and -2, in order to train multi-modal models that are aware of the way in which people describe locations.
This will be done by first combining information from different databases containing geographic named entities, such as OpenStreetMap, Wikipedia and gazetteers, so that geographic points or polygons can be linked to each named entity.
In a second step, a Natural Language Processing (NLP) pipeline will be developed to obtain the most likely geographic named entities that are referred to in any piece of text that describes a place.
In contrast to existing Named Entity Recognition (NER) methodologies, and in order to avoid restricting ourselves to cases where entities' names appear exactly as in the databases or gazetteers, we will leverage pre-trained Large Language Models (LLMs) to resolve ambiguities and gather evidence towards the most likely entities being described in the text. Such an approach will be trained and validated using the cases that do match the names in the gazetteer.
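As a rough illustration of the two steps described above (merging entity databases, then resolving a place mention against them), here is a minimal Python sketch. It is not the project's method: the names `Place`, `build_gazetteer` and `resolve` are hypothetical, the entries are toy data with approximate coordinates, and fuzzy string matching from the standard library's `difflib` stands in for the LLM-based disambiguation mentioned in the offer.

```python
from dataclasses import dataclass
from difflib import get_close_matches
from typing import Optional


@dataclass(frozen=True)
class Place:
    """Hypothetical gazetteer entry: a name linked to a point footprint."""
    name: str
    lat: float
    lon: float
    source: str  # which database the entry came from


def build_gazetteer(*sources: list[Place]) -> dict[str, Place]:
    """Merge several entity lists into one lookup keyed by lower-cased name.

    When the same name appears in several sources, the first one seen wins.
    """
    gazetteer: dict[str, Place] = {}
    for entries in sources:
        for place in entries:
            gazetteer.setdefault(place.name.lower(), place)
    return gazetteer


def resolve(mention: str, gazetteer: dict[str, Place],
            cutoff: float = 0.8) -> Optional[Place]:
    """Resolve a mention: exact match first, then the closest fuzzy match."""
    key = mention.strip().lower()
    if key in gazetteer:
        return gazetteer[key]
    # Stand-in for LLM-based disambiguation (assumption, not the real pipeline).
    close = get_close_matches(key, list(gazetteer), n=1, cutoff=cutoff)
    return gazetteer[close[0]] if close else None


# Toy entries with approximate coordinates, for illustration only.
osm = [Place("Montpellier", 43.61, 3.88, "osm")]
wiki = [
    Place("Sophia Antipolis", 43.62, 7.05, "wikipedia"),
    Place("Montpellier", 43.61, 3.88, "wikipedia"),
]

gazetteer = build_gazetteer(osm, wiki)
exact = resolve("Montpellier", gazetteer)   # name appears exactly as stored
fuzzy = resolve("Montpelier", gazetteer)    # misspelled mention
unknown = resolve("Atlantis", gazetteer)    # no plausible entry
```

The exact-match cases are the ones that, according to the offer, would serve to train and validate the approach; the fuzzy branch only hints at the harder cases that motivate the use of LLMs.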
We will then move on, in collaboration with the rest of the GEO-ReSeT consortium, to train a multi-modal large language model (MMLLM) that will serve as a foundation model for Earth observation tasks.
This model will finally be evaluated on several agro-environmental tasks.
Main activities
- Description of the state-of-the-art in unstructured text geoparsing, with a focus on approaches leveraging LLMs.
- Collection of a database of geographic named entities linked to their geographic footprint (e.g. point or polygon). Collection of a database of unstructured online text that is likely to contain a reference to a geographic location.
- Development of an NLP pipeline to link each piece of geographic text to its likely geographic footprint.
- Participation in the design and training of a multi-modal large language model (MMLLM) using remote sensing and geoparsed text.
- Evaluation of the final model on two of the following case studies at a national or continental scale: ecosystem type mapping, crop type mapping or land-use mapping.
Skills
- Python programming.
- Deep Learning with Python (preferably with PyTorch).
- Experience with NLP.
- Experience with GIS would be a plus.
Benefits
- Subsidized meals
- Partial reimbursement of public transport costs
- Leave: 7 weeks of annual leave + 10 extra days off due to RTT (statutory reduction in working hours) + possibility of exceptional leave (sick children, moving home, etc.)
- Possibility of teleworking and flexible organization of working hours
- Professional equipment available (videoconferencing, loan of computer equipment, etc.)
- Social, cultural and sports events and activities
- Access to vocational training
- Contribution to mutual insurance (subject to conditions)
Remuneration
Gross Salary: 2788 € per month
