Invariant_rl
When working with policy data from several environments, some predictors may be highly informative in some environments but not in others. In our new paper, Invariant Policy Learning, we show that learning invariant sets of predictors allows us to learn policies that generalize well to new environments. [PDF]