Differences

This shows you the differences between two versions of the page.

--- m1r2017 [2025/07/02 22:58]
20.171.207.208 old revision restored (2025/06/15 16:59)
+++ m1r2017 [2025/07/03 05:46] (current)
20.171.207.121 old revision restored (2025/07/02 22:59)
@@ Line 9: / Line 9: @@
   * cours David Silver : [[http://www0.cs.ucl.ac.uk/staff/D.Silver/web/Teaching.html]]
   * livre de Sutton mis à jour:  [[https://webdocs.cs.ualberta.ca/~sutton/book/bookdraft2016sep.pdf]]
+  * Multi-Agent RL :
+      * en premier, lire le chapitre 4 de [[https://tel.archives-ouvertes.fr/file/index/docid/362529/filename/these_matignon.pdf]]
+      * puis lire [[http://liris.cnrs.fr/laetitia.matignon/index/matignon2012KER.pdf]]
+  * Travaux de De Hauwere: Learning multi-agent state space representations
+      * [[http://www.aamas-conference.org/Proceedings/aamas2010/pdf/01%20Full%20Papers/15_02_FP_0421.pdf]]
+      * [[https://ai.vub.ac.be/ALA2012/downloads/paper5.pdf]]
 === App Constructiviste ===
@@ Line 20: / Line 29: @@
 ===== Mémentos  =====
-=== App Constructiviste ===
+==== App Constructiviste ====
    * [[compte-rendu-etat-art-these | Etat de l'art (Thèse S. Mazac)]]
-===RL et Inspirations Constructivistes===
+==== RL ====
+   * [[memento-Learning-multi-agent-state-space-representations | Learning multi-agent state space representations]]
+=== Inspirations Constructivistes ===
    * [[memento-Intrinsically-Motivated-RL | Intrinsically Motivated RL [Singh2005]]]
+==== Value function approximation ====
+   * [[memento-Value-function-approximation | Quelques infos]]
+==== Temporal Difference - Growing Neural Gas ====
+   * [[memento-td-gng | TD-GNG]]
+===== Réflexions  =====
+   * [[reflexion-gng-qc | CQ-Learning et TD-GNG]]
+===== Comptes-rendu de réunion  =====
+   * [[ reu02-03-17 |02/03/17]]
+   * 14/03/17

DokuWiki

Site Tools

Differences

Page Tools