ETRO-VUB Department of Electronics and Informatics

About ETRO | News | Events | Vacancies | Contact

ETRO Publications

Full Details


	Conference Publication


	Learning with options that terminate off-policy Host Publication: 32nd AAAI Conference on Artificial Intelligence, AAAI 2018 Authors: A. Harutyunyan, P. Vrancx, P. Luc Bacon, D. Precup and A. Nowé Publisher: AAAI Press Publication Date: Jan. 2018 Number of Pages: 10 Abstract: A temporally abstract action, or an option, is specified by a policy and a termination condition: the policy guides the option behavior, and the termination condition roughly determines its length. Generally, learning with longer options (like learning with multi-step returns) is known to be more efficient. However, if the option set for the task is not ideal, and cannot express the primitive optimal policy well, shorter options offer more flexibility and can yield a better solution. Thus, the termination condition puts learning efficiency at odds with solution quality. We propose to resolve this dilemma by decoupling the behavior and target terminations, just like it is done with policies in off-policy learning. To this end, we give a new algorithm, Q(�), that learns the solution with respect to any termination condition, regardless of how the options actually terminate. We derive Q(�) by casting learning with options into a common framework with well-studied multi-step off-policy learning. We validate our algorithm empirically, and show that it holds up to its motivating claims.

	Other Reference Styles

	Full Details IEEE Style BibTex Style EndNote Style

Search ETRO Publications

Author:
Keyword:
Type:	Journals Conferences Books Reports Laymen Other


	Research - Contact person - IRIS - AVSP - LAMI	Education - Contact person - Thesis proposals - ETRO Courses	Industry - Contact person - Spin-offs - Know How	Publications - Journals - Conferences - Books	About ETRO - Vacancies - News - Events - Press	Contact ETRO Department Tel: +32 2 629 29 30


	©2024 • Vrije Universiteit Brussel • ETRO Dept. • Pleinlaan 2 • 1050 Brussels • Tel: +32 2 629 2930 (secretariat) • Fax: +32 2 629 2883 • Webmaster • Disclaimer