Passive Reinforcement Learning
CSC 261 - Artificial Intelligence - Weinman
Answer the following questions. Record your answers in your Reading
Journal.
- Select the sentence from today's reading that you feel highlights
the most important consideration for direct utility
estimation. Briefly (3-5 sentences) explain your selection.
- Select the sentence from today's reading that you feel best
distinguishes between direct utility estimation and adaptive
dynamic programming. Briefly (3-5 sentences) explain your selection.
- How would you explain the temporal-difference (TD) update, Eq. (21.3),
to a fellow student who knows the perceptron learning rule, Eq. (18.7)?
- Which piece(s) of the utility-based agent in Figure 2.14 (p. 54)
do you believe are missing from a PASSIVE-TD-AGENT?
- Identify the sentence, section, or concept from the current or previous
reading that remains the most confusing to you. Briefly explain what
you find confusing about it.