|
D. Steckelmacher, D. Roijers, A. Harutyunyan, P. Vrancx, H. Plisnier and A. Nowé, Reinforcement Learning in POMDPs with Memoryless Options and Option-Observation Initiation Sets, in Thirty-Second AAAI Conference on Artificial Intelligence (AAAI 2018), AAAI Press, Feb. 2018, pp. 8.
|
|