Passive Reinforcement Learning

CSC 261 - Artificial Intelligence - Weinman



Answer the following questions. Record your answers in your Reading Journal.
  1. Select the sentence from today's reading that you find best distinguishes between direct utility estimation and adaptive dynamic programming. Briefly (3-5 sentences) explain your selection.
  2. Explain the meaning and intuition of the TD update Equation (21.3) as you would to a colleague who is a (non-CS) science major.
  3. Identify the sentence, section, or concept from the current or previous reading that remains the most confusing to you. Briefly (3-5 sentences) explain what you find confusing about it.