memento-value-function-approximation

Differences

This shows you the differences between two versions of the page.

memento-value-function-approximation [2025/03/03 18:03]
47.128.54.94 old revision restored (2025/02/25 18:24)
memento-value-function-approximation [2025/04/04 20:10] (current)
3.144.99.0 old revision restored (2025/03/11 17:51)
Line 52:
    * Minimizes the MSE (mean squared error) between the QNetwork's predictions and the Q-learning targets
    * Uses a variant of stochastic gradient descent
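
The two points above can be illustrated with a minimal sketch. It assumes a linear Q-function instead of a neural network, plain SGD rather than any particular variant, and a single made-up transition; the names `q_values`, `td_target`, and `sgd_step` are hypothetical and do not come from the page.

```python
import numpy as np

def q_values(weights, state):
    """Linear Q-function (assumption): one row of weights per action."""
    return weights @ state

def td_target(weights, reward, next_state, done, gamma=0.99):
    """Q-learning target: r + gamma * max_a' Q(s', a'), or just r at episode end."""
    if done:
        return reward
    return reward + gamma * np.max(q_values(weights, next_state))

def sgd_step(weights, state, action, target, lr=0.1):
    """One gradient step on 0.5 * (target - Q(s, a))^2, with the target held fixed.

    Returns the updated weights and the squared error measured before the update.
    """
    error = target - q_values(weights, state)[action]
    new_weights = weights.copy()
    # Gradient of the loss w.r.t. the chosen action's weights is -error * state.
    new_weights[action] += lr * error * state
    return new_weights, error ** 2

# Toy usage: 2 actions, 3 features, one invented transition replayed 50 times.
weights = np.zeros((2, 3))
state = np.array([1.0, 0.0, 0.5])
next_state = np.array([0.0, 1.0, 0.5])
target = td_target(weights, reward=1.0, next_state=next_state, done=False)

losses = []
for _ in range(50):
    weights, loss = sgd_step(weights, state, action=0, target=target)
    losses.append(loss)
```

Here the target is computed once and kept fixed, so the squared error on this transition shrinks at every step. In a full Q-learning loop the target would be recomputed from new transitions, typically with minibatches and an optimizer such as Adam or RMSProp in place of plain SGD.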
memento-value-function-approximation.1741021388.txt.gz · Last modified: 2025/03/03 18:03 by 47.128.54.94