Passive Reinforcement Learning

CSC 261 - Artificial Intelligence - Weinman



Answer the following questions. Record your answers in your Reading Journal.
  1. Select the sentence from today's reading that you find best distinguishes between direct utility estimation and adaptive dynamic programming. Briefly (3-5 sentences) explain your selection.
  2. Explain the TD update Equation (21.3) as you would to a fellow student who knows the perceptron learning rule, Equation (18.7).
  3. Identify the sentence, section, or concept from the current or previous reading that remains the most confusing to you. Briefly (3-5 sentences) explain what you find confusing about it.