Learning to Self-Modify Rewards with Bi-Level Gradients

EasyChair Preprint no. 8260, version history

VersionDatePagesVersion notes
1June 12, 202210
2January 21, 202310

Corrected meta data for indexing. Scholar was not reading abstract or text

Keyphrases: meta-learning, Reinforcement Learning, Reward Shaping

BibTeX entry
BibTeX does not have the right entry for preprints. This is a hack for producing the correct reference:
  author = {Aiden Boyd and Shibani and Will Callaghan},
  title = {Learning to Self-Modify Rewards with Bi-Level Gradients},
  howpublished = {EasyChair Preprint no. 8260},

  year = {EasyChair, 2023}}