Site Tools


Hotfix release available: 2024-02-06b "Kaos". upgrade now! [55.2] (what's this?)
Hotfix release available: 2024-02-06a "Kaos". upgrade now! [55.1] (what's this?)
New release available: 2024-02-06 "Kaos". upgrade now! [55] (what's this?)
Hotfix release available: 2023-04-04b "Jack Jackrum". upgrade now! [54.2] (what's this?)
Hotfix release available: 2023-04-04a "Jack Jackrum". upgrade now! [54.1] (what's this?)
New release available: 2023-04-04 "Jack Jackrum". upgrade now! [54] (what's this?)
Hotfix release available: 2022-07-31b "Igor". upgrade now! [53.1] (what's this?)
Hotfix release available: 2022-07-31a "Igor". upgrade now! [53] (what's this?)
New release available: 2022-07-31 "Igor". upgrade now! [52.2] (what's this?)
New release candidate 2 available: rc2022-06-26 "Igor". upgrade now! [52.1] (what's this?)
New release candidate available: 2022-06-26 "Igor". upgrade now! [52] (what's this?)
Hotfix release available: 2020-07-29a "Hogfather". upgrade now! [51.4] (what's this?)
m1r2017

Stage M1R 2017

Pointeurs

RL

  • cours M1: MDP et planif, RL
  • cours David Silver : http://www0.cs.ucl.ac.uk/staff/D.Silver/web/Teaching.html
  • livre de Sutton mis à jour: https://webdocs.cs.ualberta.ca/~sutton/book/bookdraft2016sep.pdf
  • Multi-Agent RL :
    • en premier, lire le chapitre 4 de https://tel.archives-ouvertes.fr/file/index/docid/362529/filename/these_matignon.pdf
    • puis lire http://liris.cnrs.fr/laetitia.matignon/index/matignon2012KER.pdf
  • Travaux de De Hauwere: Learning multi-agent state space representations
    • http://www.aamas-conference.org/Proceedings/aamas2010/pdf/01%20Full%20Papers/15_02_FP_0421.pdf
    • https://ai.vub.ac.be/ALA2012/downloads/paper5.pdf

App Constructiviste

  • Thèse S. Mazac: https://tel.archives-ouvertes.fr/tel-01310583/file/TH2015MazacSebastien.pdf

RL et Inspirations Constructivistes

  • Intrinsically Motivated RL [Singh2005] https://web.eecs.umich.edu/~baveja/Papers/FinalNIPSIMRL.pdf

Mémentos

App Constructiviste

RL et Inspirations Constructivistes

Value function approximation

Temporal Difference - Growing Neural Gas

Comptes-rendu de réunion

m1r2017.txt · Last modified: 2024/10/26 15:33 by 47.128.122.23