**Introduction:**

These are results and code for the problems and examples found in Chapter 2 of this famous book.

- sample_discrete.m (Utility function to sample from a discrete distribution)

- n_armed_testbed.m (Matlab code to generate the n-armed bandit testbed)

- sample output (results obtained when running n_armed_testbed.m)

- n_armed_testbed_softmax.m (Matlab code to generate the softmax n-armed bandit)

- sample output (results obtained when running n_armed_testbed_softmax.m)

- The Binary Bandit:
- binary_bandit_exps.m (some of the binary bandit problems)

- binary_bandit_exps_Script.m (runs the binary bandit problems)

- sample output (results obtained when running the above)

- exercise_2_5.m (Exercise 2.5)

- sample output (results obtained when running the above)

- Exercise 2.7:
- exercise_2_7.m (Exercise 2.7)

- exercise_2_7_Script.m (runs exercise 2.7)

- sample output (results obtained when running the above)

- Optimistic Initial Values:
- opt_initial_values.m (the n-armed bandit testbed with optimistic initial values)

- opt_initial_values_Script.m (runs the n-armed bandit testbed with optimistic initial values)

- sample output (results of the n-armed bandit testbed with optimistic initial values)

- Reinforcement Comparison Methods:
- reinforcement_comparison_methods.m (the n-armed bandit testbed with reinforcement comparison)

- reinforment_comparison_methods_Script.m (runs the n-armed bandit testbed with reinforcement comparison)

- sample output (results obtained when running the above)

- Exercise 2.11:
- exercise_2_11.m (Exercise 2.11)

- exercise_2_11_Script.m (runs exercise 2.11)

- sample output (results obtained when running the above)

- Pursuit Methods:
- persuit_method.m (Pursuit methods)

- persuit_method_Script.m (runs the pursuit methods)

- sample output (results obtained when running the above)

John Weatherwax