NOTES

Dynamic Programming and Optimal Control - Bertsekas

1. General Issues of Cost Approximation

2. Direct Policy Evaluation - Gradient Methods

3. Projected Equation Methods

4. Aggregation Methods

5. Q-Learning

6. Stochastic Shortest Path

7. Average Cost Problems

8. Simulation Based Solution

9. Approximation in Policy Space