Optimistic policy iteration and natural actor-critic: A unifying view and a non-optimality result (2013)
  • BOOKTITLE: Advances in Neural Information Processing Systems 26