This shows you the differences between two versions of the page.
memento-value-function-approximation [2025/02/13 17:54] 47.128.112.242 old revision restored (2025/01/25 18:04) |
memento-value-function-approximation [2025/04/15 03:59] (current) 20.171.207.142 old revision restored (2025/04/12 10:35) |
||
---|---|---|---|
Line 52: | Line 52: | ||
* Optimizes the MSE (mean squared error) between the QNetwork's estimates and the QLearning targets | * Optimizes the MSE (mean squared error) between the QNetwork's estimates and the QLearning targets | ||
* Uses a variant of stochastic gradient descent | * Uses a variant of stochastic gradient descent | ||
- | + | | |
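
The two bullets above can be illustrated with a minimal sketch. It assumes a linear Q-function with one weight vector per action instead of a neural network, and plain SGD rather than the unspecified variant the page mentions. The names `q_values`, `td_target` and `sgd_step`, the feature vectors and the hyperparameters are all hypothetical and do not come from the page.

```python
def q_values(weights, features):
    """Linear Q-function (assumption): one weight vector per action."""
    return [sum(w * x for w, x in zip(row, features)) for row in weights]


def td_target(weights, reward, next_features, gamma, done):
    """Q-learning target: r + gamma * max_a' Q(s', a'), or just r on a terminal step."""
    if done:
        return reward
    return reward + gamma * max(q_values(weights, next_features))


def sgd_step(weights, features, action, target, lr):
    """One SGD step on 0.5 * (target - Q(s, a))^2, treating the target as a constant.

    Updates `weights` in place and returns the error measured before the update.
    """
    error = target - q_values(weights, features)[action]
    weights[action] = [w + lr * error * x for w, x in zip(weights[action], features)]
    return error


# Toy transition: 2 actions, 2 features, all weights start at zero.
weights = [[0.0, 0.0], [0.0, 0.0]]
target = td_target(weights, reward=1.0, next_features=[0.0, 1.0], gamma=0.9, done=False)
error = sgd_step(weights, [1.0, 0.0], action=0, target=target, lr=0.5)
```

Only the weights of the action actually taken move, and they move in proportion to the gap between the target and the current estimate. A real Q-network would compute the same squared error over a mini-batch and backpropagate it through all layers.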