General information
Organisation
The French Alternative Energies and Atomic Energy Commission (CEA) is a key player in research, development and innovation in four main areas :
• defence and security,
• nuclear energy (fission and fusion),
• technological research for industry,
• fundamental research in the physical sciences and life sciences.
Drawing on its widely acknowledged expertise, and thanks to its 16000 technicians, engineers, researchers and staff, the CEA actively participates in collaborative projects with a large number of academic and industrial partners.
The CEA is established in ten centers spread throughout France
Reference
SL-DRT-25-0854
Direction
DRT
Thesis topic details
Category
Technological challenges
Thesis topics
Internalisation of external knowledge by foundation models
Contract
Thèse
Job description
To perform an unknown task, a subject (human or robot) has to consult external information, which involves a cognitive cost. After several similar experiments, it masters the situation and can act automatically. The 1980s and 1990s saw explorations in AI using conceptual graphs and schemas, but their large-scale implementation was limited by the technology available at the time.
Today's neural models, including transformers and LLM/VLMs, learn universal representations through pre-training on huge amounts of data. They can be used with prompts to provide local context. Fine-tuning allows these models to be specialised for specific tasks.
RAG and GraphRAG methods can be used to exploit external knowledge, but their use for inference is resource-intensive. This thesis proposes a cognitivist approach in which the system undergoes continuous learning. It consults external sources during inference and uses this information to refine itself regularly, as it does during sleep. This method aims to improve performance and reduce resource consumption.
In humans, these processes are linked to the spatial organisation of the brain. The thesis will also study network architectures inspired by this organisation, with dedicated but interconnected “zones”, such as the vision-language and language models.
These concepts can be applied to the Astir and Ridder projects, which aim to exploit foundation models for software engineering in robotics and the development of generative AI methods for the safe control of robots.
University / doctoral school
Mathématiques - Information - Ingénierie des Systèmes (MIIS)
Caen
Thesis topic location
Site
Saclay
Requester
Position start date
01/10/2025
Person to be contacted by the applicant
DE CHALENDAR Gaël
Gael.de-Chalendar@cea.fr
CEA
DRT/DIASI//LASTI
CEA LIST
Laboratoire Vision et Ingénierie des Contenus
(Vision and Content Engineering Laboratory)
CEA SACLAY - NANO INNOV
BAT. 861
Point courier 173
91191 GIF SUR YVETTE
01.69.08.01.50
Tutor / Responsible thesis director
HERAULT Romain
romain.herault@unicaen.fr
Université de Caen Normandie
Département de Mathématiques/Informatique (Laboratoire GREYC - Équipe IMAGE)
Bureau FA 213, au Bâtiment F de l'ENSICAEN, GREYC (UMR-CNRS 6072),
6, Bd du Maréchal Juin, F-14050 CAEN Cedex, France
+33 (0)2 31 45 27
En savoir plus
https://kalisteo.cea.fr/index.php/semantique-texte-et-image/