Batch mode reinforcement learning based on the synthesis of artificial trajectories

Publication Abstract

Fonteneau, R., Susan A. Murphy, L. Wehenkel, and D. Ernst. 2013. "Batch mode reinforcement learning based on the synthesis of artificial trajectories." Annals of Operations Research, 208(1): 383-416.

In this paper, we consider the batch mode reinforcement learning setting, where the central problem is to learn from a sample of trajectories a policy that satisfies or optimizes a performance criterion. We focus on the continuous state space case for which usual resolution schemes rely on function approximators either to represent the underlying control problem or to represent its value function. As an alternative to the use of function approximators, we rely on the synthesis of "artificial trajectories" from the given sample of trajectories, and show that this idea opens new avenues for designing and analyzing algorithms for batch mode reinforcement learning.
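To make the idea of an "artificial trajectory" concrete, here is a minimal sketch of one way such a trajectory could be pieced together from a batch of one-step transitions. The abstract does not spell out the construction, so everything here is an assumption made for illustration: scalar states and actions, a Euclidean distance on (state, action) pairs, nearest-neighbor selection without replacement, and the hypothetical helper names `synthesize_trajectory` and `estimated_return`.

```python
import math

def synthesize_trajectory(transitions, policy, x0, horizon):
    """Piece together one artificial trajectory from one-step transitions.

    transitions: list of (x, u, r, y) tuples (state, action, reward, next state),
                 all scalars in this illustrative sketch.
    policy:      function mapping a state to the action to evaluate.
    Returns the list of selected transitions, in order.
    """
    unused = list(range(len(transitions)))  # each transition is used at most once
    x = x0
    picked = []
    for _ in range(horizon):
        if not unused:
            break  # batch exhausted before reaching the horizon
        u = policy(x)
        # Assumed selection rule: the unused transition whose (state, action)
        # pair is closest to the current state and the policy's action.
        best = min(
            unused,
            key=lambda i: math.hypot(transitions[i][0] - x, transitions[i][1] - u),
        )
        unused.remove(best)
        picked.append(transitions[best])
        # Continue from the next state recorded in the selected transition.
        x = transitions[best][3]
    return picked

def estimated_return(trajectory):
    """Sum of rewards along an artificial trajectory (undiscounted)."""
    return sum(r for _, _, r, _ in trajectory)

# Toy batch: three transitions under action 1.0 and one under action -1.0.
batch = [
    (0.0, 1.0, 1.0, 1.0),
    (1.0, 1.0, 2.0, 2.0),
    (2.0, 1.0, 3.0, 3.0),
    (0.1, -1.0, -5.0, -1.0),
]
trajectory = synthesize_trajectory(batch, policy=lambda x: 1.0, x0=0.0, horizon=3)
print(estimated_return(trajectory))
```

In this sketch no function approximator is fitted: the policy's return is estimated directly from rewards recorded in the sample, which is the contrast the abstract draws with approximator-based schemes.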

DOI: 10.1007/s10479-012-1248-5

PMCID: PMC3773886
