Ultra-low Power Evolutionary Reinforcement Learning - Rennes, France - INSTITUT NATIONAL DES SCIENCES APPLIQUEES
il y a 1 jour
Description
Ultra-low Power Evolutionary Reinforcement Learning PhDs:
- Réf
ABG-113376
- Sujet de Thèse 14/04/2023
- Financement public/privé
- INSTITUT NATIONAL DES SCIENCES APPLIQUEES
- Lieu de travail
- Rennes
- Bretagne
- France
- Intitulé du sujet
- Ultralow Power Evolutionary Reinforcement Learning PhDs
- Champs scientifiques
- Energie
- Matériaux
- Mots clés
- Artificial Intelligence, Reinforcement Learning, Ultra Low Power, Tangled Program Graphs, Open-Source
Description du sujet:
This project will recruit 2 PhDs in the domain of frugal Machine Learning (ML). The aim of the PhDs is to propose
full-stack methods and open-source tools to train and infer ultra-lightweight AIs, by extending, implementing, and
optimizing a new ML technique that relies on the light-by-construction and adaptive Tangled Program Graph (TPG)
model. The two PhD students will work in tandem to optimize energy during the whole life cycle, from training to
inference.
Keywords:
Artificial Intelligence, Reinforcement Learning, Ultra Low Power, Tangled Program Graphs ,
Open-Source.
Context
Reinforcement Learning with Tangled Program Graphs
Reinforcement Learning (RL) is a branch of Machine Learning (ML) techniques where an autonomous artificial
intelligence learns how to interact with its environment. As depicted in the figure below, using a trial and error
mechanism, the artificial intelligence learns from its own experience, by interacting with the environment, and by
getting rewards for each attempt. The purpose of the artificial intelligence is to maximize this reward.
Proposed in 2017, Tangled Program Graphs (TPGs) are a new way to power reinforcement learning AI, based on
evolutionary concepts.
The main strength of TPGs, compared to state-of-the-art deep learning-based techniques, is the lightweightness of their model, which confers them a low computational complexity, and very high performance on regular desktop CPU.
Compared to deep-learning at equivalent accuracy, TPGs execute 1000x faster with 100x less memory.GEGELATI
Generic Evolvable Graphs for Efficient Learning of Artificial Tangled Intelligence
GEGELATI [dʒedʒelati] is a fresh open-source reinforcement learning framework for training artificial intelligence based on TPGs.
The purpose of this framework, developed as a C++ shared library, is to make it as easy and as fast as possible and to train an agent on a new learning environment.
The C++ library is developed to be portable, fully documented, and thoroughly unit tested to ensure its maintainability.GEGELATI is developed at the Institut d'Electronique et des Technologies du numéRique (IETR).
Objectives
The main scientific objectives pursued by the PhD students are to:
Integrate energy optimizations at the core of the TPG training process: overall energy optimization will minimize
the energy consumption of the computing system for TPG training and inference, as well as, when relevant, the
energy consumption of physical actuators of the controlled cyber-physical systems.
Extend the TPG learning capabilities:
on top of improving the TPG efficiency on existing environments, model
extensions will unlock new types of learning environments like continuous action spaces or non-reinforcement
learning environments required for extending TPGs to real-world use cases.
Propose highly-efficient implementation techniques: in order to find the most suited hardware plarform for TPGs,
implementation will be pursued and compared on multiple state-of-the-art hardware. Implementation will be
studied for both efficient training, and inference on battery-powered ULP devices and reconfigurable devices for
nanoseconds reaction time on the factory floor.
The foreseen focus of the two PhDs are:
PhD 1:
Energy-aware low-complexity reinforcement learning.
PhD 2:
HW-SW co-optimization for ultra-low power reinforcement learning.
Prise de fonction:
- 02/10/2023
Nature du financement: - Financement public/privé
Précisions sur le financement:
- ANR FOUTICS Research Project
Présentation établissement et labo d'accueil: - INSTITUT NATIONAL DES SCIENCES APPLIQUEES
IETR :
UMR CNRS 6164
Cette Unité Mixte de Recherche regroupe les équipes de recherche de l'INSA Rennes, l'Université de Rennes, Supélec, l'Université de Nantes et le CNRS.
Unité Mixte de Recherche CNRS 6164.
Établissements tutelles du laboratoire :
CentraleSupélec / CNRS / INSA Rennes / Université de Nantes / Université de Rennes
Effectifs du site INSA Rennes de l'
IETR :
100 personnes au sein de 6 équipes de recherche:
ASIC :
Architecture, Systems, Infrastructure and electroniCs
SIGNAL :
SIgnal processinG aNd ALgorithm
e
WAVES :
Electromagnetic WAVES in complex media
SurfWave :
Periodic and quasi-periodic SURFaces fo WAVE control
POLARIS :
Propagation Localisation Radar:
instrumentation & signal
VAADER :
Video Algorithms and Architecture Design for Embe
Plus d'emplois de INSTITUT NATIONAL DES SCIENCES APPLIQUEES
-
Phd Position at Ietr
Rennes, France - il y a 3 semaines