Site Tools


Hotfix release available: 2024-02-06b "Kaos". upgrade now! [55.2] (what's this?)
Hotfix release available: 2024-02-06a "Kaos". upgrade now! [55.1] (what's this?)
New release available: 2024-02-06 "Kaos". upgrade now! [55] (what's this?)
Hotfix release available: 2023-04-04b "Jack Jackrum". upgrade now! [54.2] (what's this?)
Hotfix release available: 2023-04-04a "Jack Jackrum". upgrade now! [54.1] (what's this?)
New release available: 2023-04-04 "Jack Jackrum". upgrade now! [54] (what's this?)
Hotfix release available: 2022-07-31b "Igor". upgrade now! [53.1] (what's this?)
Hotfix release available: 2022-07-31a "Igor". upgrade now! [53] (what's this?)
New release available: 2022-07-31 "Igor". upgrade now! [52.2] (what's this?)
New release candidate 2 available: rc2022-06-26 "Igor". upgrade now! [52.1] (what's this?)
New release candidate available: 2022-06-26 "Igor". upgrade now! [52] (what's this?)
Hotfix release available: 2020-07-29a "Hogfather". upgrade now! [51.4] (what's this?)
m1r2017

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
m1r2017 [2025/02/28 17:02]
47.128.18.34 old revision restored (2024/12/28 18:34)
m1r2017 [2025/04/20 08:41] (current)
18.117.87.50 old revision restored (2025/04/20 02:02)
Line 33: Line 33:
  
 ==== RL ==== ==== RL ====
-=== Multi-agents === +   * [[memento-Learning-multi-agent-state-space-representations | Learning multi-agent state space representations]]
-   * [[memento-Learning-multi-agent-state-space-representations | Learning multi-agent state space representations (CQLearning)]] +
-   * [[memento-Processus-décisionnels-de-Markov-et-systèmes-multiagents | Processus décisionnels de Markov et systèmes multiagents (Thèse L. Matignon)]] +
-   * [[memento-Independent-reinforcement-learners-cooperative-Markov-games:-a-survey-regarding-coordination-problems | Independent reinforcement learners in cooperative Markov games: a survey regarding coordination problems (A terminer)]] +
-   * [[memento-Context-Sensitive-Reward-Shaping-for-Sparse-Inter-action-Multi-Agent-Systems | Context-Sensitive Reward Shaping for Sparse Inter-action Multi-Agent Systems]]+
  
 === Inspirations Constructivistes === === Inspirations Constructivistes ===
 +
    * [[memento-Intrinsically-Motivated-RL | Intrinsically Motivated RL [Singh2005]]]    * [[memento-Intrinsically-Motivated-RL | Intrinsically Motivated RL [Singh2005]]]
  
 ==== Value function approximation ==== ==== Value function approximation ====
 +
    * [[memento-Value-function-approximation | Quelques infos]]    * [[memento-Value-function-approximation | Quelques infos]]
  
 ==== Temporal Difference - Growing Neural Gas ==== ==== Temporal Difference - Growing Neural Gas ====
 +
    * [[memento-td-gng | TD-GNG]]    * [[memento-td-gng | TD-GNG]]
  
m1r2017.1740758579.txt.gz · Last modified: 2025/02/28 17:02 by 47.128.18.34