Code and Results for Chapter 2:
Introduction:
These are results and code for the
problems and examples found in Chapter 2 of this famous book.
Various Figures and Problems:
- sample_discrete.m (Utility function to sample from a discrete distribution)
- n_armed_testbed.m (Matlab code to generate the n-armed bandit testbed)
- n_armed_testbed_softmax.m (Matlab code to generate the softmax n-armed bandit)
- sample output (results obtained when running n_armed_testbed_softmax.m)
- The Binary Bandit:
- exercise_2_5.m (Exercise 2.5)
- Exercise 2.7:
- Optimistic Initial Values:
- Reinforcement Comparison Methods:
- Exercise 2.11:
- Pursuit Methods:
John Weatherwax