Code and Results for Chapter 7:
Introduction:
These are results and code for the
problems and examples found in Chapter 7 of this famous book.
Various Figures and Problems:
- Using TD(n) learning to learn random walks:
- Using TD(lambda) learning to learn random walks:
- Online TD(lambda) with eligability traces:
- Example 7.4 Implementing the Grid World example with eligability traces:
- gw_w_et.m (implements the windy grid world learning example with elagability traces)
- gw_w_et_Script.m (a driver for the above function)
- Accumulating traces v.s. Replacing traces
- Example 7.5 Learning from a one directional Markov chain
John Weatherwax