Peter Stone's Selected Publications

Classified by TopicClassified by Publication TypeSorted by DateSorted by First Author Last NameClassified by Funding Source


Value-Function-Based Transfer for Reinforcement Learning Using Structure Mapping

Value-Function-Based Transfer for Reinforcement Learning Using Structure Mapping.
Yaxin Liu and Peter Stone.
In Proceedings of the Twenty-First National Conference on Artificial Intelligence, pp. 415–20, July 2006.
AAAI 2006

Download

[PDF]151.7kB  [postscript]1.6MB  

Abstract

Transfer learning concerns applying knowledge learned in one task (the source) to improve learning another related task (the target). In this paper, we use structure mapping, a psychological and computational theory about analogy making, to find mappings between the source and target tasks and thus construct the transfer functional automatically. Our structure mapping algorithm is a specialized and optimized version of the structure mapping engine and uses heuristic search to find the best maximal mapping. The algorithm takes as input the source and target task specifications represented as qualitative dynamic Bayes networks, which do not need probability information. We apply this method to the Keepaway task from RoboCup simulated soccer and compare the result from automated transfer to that from handcoded transfer.

BibTeX Entry

@InProceedings{AAAI06-yaxin,
	author="Yaxin Liu and Peter Stone",
	title="Value-Function-Based Transfer for Reinforcement Learning Using Structure Mapping",
        booktitle="Proceedings of the Twenty-First National Conference on Artificial Intelligence",
        month="July",year="2006",
	pages="415--20",
	abstract={Transfer learning concerns applying knowledge learned in one task (the source) to improve learning another related task (the target). In this paper, we use structure mapping, a psychological and computational theory about analogy making, to find mappings between the source and target tasks and thus construct the transfer functional automatically. Our structure mapping algorithm is a specialized and optimized version of the structure mapping engine and uses heuristic search to find the best maximal mapping. The algorithm takes as input the source and target task specifications represented as qualitative dynamic Bayes networks, which do not need probability information. We apply this method to the Keepaway task from RoboCup simulated soccer and compare the result from automated transfer to that from handcoded transfer.},
        wwwnote={<a href="http://www.aaai.org/Conferences/AAAI/aaai06.php">AAAI 2006</a>},
}

Generated by bib2html.pl (written by Patrick Riley ) on Fri Jun 26, 2026 17:47:46