Passive Reinforcement Learning
CSC 261 - Artificial Intelligence - Weinman
Answer the following questions. Record your answers in your Reading
Journal.
- Select the sentence from today's reading that you find best
distinguishes between direct utility estimation and adaptive
dynamic programming. Briefly (3-5 sentences) explain your selection.
- Explain the meaning and intuition of the TD update Equation (21.3)
as you would to a colleague who is a (non-CS) science major.
- Identify the sentence, section, or concept from the current or
previous reading that remains the most confusing to you.
Briefly (3-5 sentences) explain what you find confusing about it.