Pause
Read
CEA vacancy search engine

Internalisation of external knowledge by foundation models


Thesis topic details

General information

Organisation

The French Alternative Energies and Atomic Energy Commission (CEA) is a key player in research, development and innovation in four main areas :
• defence and security,
• nuclear energy (fission and fusion),
• technological research for industry,
• fundamental research in the physical sciences and life sciences.

Drawing on its widely acknowledged expertise, and thanks to its 16000 technicians, engineers, researchers and staff, the CEA actively participates in collaborative projects with a large number of academic and industrial partners.

The CEA is established in ten centers spread throughout France
  

Reference

SL-DRT-25-0854  

Direction

DRT

Thesis topic details

Category

Technological challenges

Thesis topics

Internalisation of external knowledge by foundation models

Contract

Thèse

Job description

To perform an unknown task, a subject (human or robot) has to consult external information, which involves a cognitive cost. After several similar experiments, it masters the situation and can act automatically. The 1980s and 1990s saw explorations in AI using conceptual graphs and schemas, but their large-scale implementation was limited by the technology available at the time.

Today's neural models, including transformers and LLM/VLMs, learn universal representations through pre-training on huge amounts of data. They can be used with prompts to provide local context. Fine-tuning allows these models to be specialised for specific tasks.

RAG and GraphRAG methods can be used to exploit external knowledge, but their use for inference is resource-intensive. This thesis proposes a cognitivist approach in which the system undergoes continuous learning. It consults external sources during inference and uses this information to refine itself regularly, as it does during sleep. This method aims to improve performance and reduce resource consumption.

In humans, these processes are linked to the spatial organisation of the brain. The thesis will also study network architectures inspired by this organisation, with dedicated but interconnected “zones”, such as the vision-language and language models.


These concepts can be applied to the Astir and Ridder projects, which aim to exploit foundation models for software engineering in robotics and the development of generative AI methods for the safe control of robots.

University / doctoral school

Mathématiques - Information - Ingénierie des Systèmes (MIIS)
Caen

Thesis topic location

Site

Saclay

Requester

Position start date

01/10/2025

Person to be contacted by the applicant

DE CHALENDAR Gaël Gael.de-Chalendar@cea.fr
CEA
DRT/DIASI//LASTI
CEA LIST
Laboratoire Vision et Ingénierie des Contenus
(Vision and Content Engineering Laboratory)

CEA SACLAY - NANO INNOV
BAT. 861
Point courier 173
91191 GIF SUR YVETTE
01.69.08.01.50

Tutor / Responsible thesis director

HERAULT Romain romain.herault@unicaen.fr
Université de Caen Normandie
Département de Mathématiques/Informatique (Laboratoire GREYC - Équipe IMAGE)
Bureau FA 213, au Bâtiment F de l'ENSICAEN, GREYC (UMR-CNRS 6072),
6, Bd du Maréchal Juin, F-14050 CAEN Cedex, France

+33 (0)2 31 45 27

En savoir plus


https://kalisteo.cea.fr/index.php/semantique-texte-et-image/