Home > Publications . Search All . Browse All . Country . Browse PSC Pubs . PSC Report Series

PSC In The News

RSS Feed icon

Frey's Scenario F simulation mentioned in account of the Democratic Party's tribulations

U-M Poverty Solutions funds nine projects

Dynarski says NY's Excelsior Scholarship Program could crowd out low-income and minority students

More News

Highlights

Workshops on EndNote, NIH reporting, and publication altmetrics, Jan 26 through Feb 7, ISR

2017 PAA Annual Meeting, April 27-29, Chicago

NIH funding opportunity: Etiology of Health Disparities and Health Advantages among Immigrant Populations (R01 and R21), open Jan 2017

Russell Sage 2017 Summer Institute in Computational Social Science, June 18-July 1. Application deadline Feb 17.

More Highlights

Next Brown Bag

Mon, Jan 23, 2017 at noon:
Decline of cash assistance and child well-being, Luke Shaefer

Sampling strategies for batch mode reinforcement learning

Publication Abstract

Fonteneau, Raphael, Susan A. Murphy, L. Wehenkel, and D. Ernst. 2013. "Sampling strategies for batch mode reinforcement learning." Revue d'Intelligence Artificielle, 27(2): 171-194.

We propose two strategies for experiment selection in the context of batch mode reinforcement learning. The first strategy is based on the idea that the most interesting experiments to carry out at some stage are those that are the most liable to falsify the current hypothesis about the optimal control policy. We cast this idea in a context where a policy learning algorithm and a model identification method are given a priori. The second strategy exploits recently published methods for computing bounds on the return of control policies from a set of trajectories in order to sample the state-action space so as to be able to discriminate between optimal and non-optimal policies. Both strategies are experimentally validated, showing promising results. © 2013 Lavoisier.

DOI:10.3166/RIA.27.171-194 (Full Text)

Browse | Search : All Pubs | Next