What is the difference between 'on-policy' and 'off-policy' learning?

Practice Questions

1 question
Q1
What is the difference between 'on-policy' and 'off-policy' learning?
  1. On-policy learns from the current policy, off-policy learns from a different policy
  2. On-policy uses supervised learning, off-policy uses unsupervised learning
  3. On-policy is faster than off-policy
  4. There is no difference

Questions & Step-by-step Solutions

1 item
Q
Q: What is the difference between 'on-policy' and 'off-policy' learning?
Solution: On-policy learning updates the policy based on actions taken by the current policy, while off-policy learning can learn from actions taken by a different policy.
Steps: 4

Related Questions

Soulshift Feedback ×

On a scale of 0–10, how likely are you to recommend The Soulshift Academy?

Not likely Very likely