Grading format
Numerical, out of 20
Letter grade / European grade
For students of the M2 DATAAI - Data and Artificial Intelligence degree
The course unit (UE) is passed if the final grade is >= 10
ECTS credits earned: 2 ECTS
Detailed programme
The course is roughly organised into four approaches to the theme of depth in Deep Reinforcement Learning:
1\. Depth in value function (DQN and variants, distributional RL, ...)
2\. Depth in policy (PPO, SAC, imitation learning, ...)
3\. Depth in environment model (Monte Carlo Tree Search, model-based reinforcement learning)
4\. Depth in reward model (reward shaping, inverse reinforcement learning, transfer learning, ...)
Keywords
reinforcement learning, autonomous agents, probabilistic decision-making, intelligent agents, sequential decision-making