A generalization error for Q-learning

Publication Abstract

Murphy, Susan A. 2005. "A generalization error for Q-learning." Journal of Machine Learning Research, 6(July): 1073-1097.

Planning problems that involve learning a policy from a single training set of finite horizon trajectories arise in both social science and medical fields. We consider Q-learning with function approximation for this setting and derive an upper bound on the generalization error. This upper bound is in terms of quantities minimized by a Q-learning algorithm, the complexity of the approximation space and an approximation term due to the mismatch between Q-learning and the goal of learning a policy that maximizes the value function.
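As a rough illustration of the setting described in the abstract, the sketch below fits Q-functions with linear function approximation to a single batch of finite-horizon trajectories, working backward from the last time step with least squares. This is one common batch variant of Q-learning and is not necessarily the exact algorithm analyzed in the paper. The feature map, the two-action set {-1, 1}, and the function names `features`, `fit_q_functions`, and `greedy_action` are all assumptions made for the example.

```python
import numpy as np

def features(obs, action):
    # Hypothetical feature map (an assumption, not from the paper):
    # intercept, observation, action, and their interaction.
    obs, action = np.broadcast_arrays(np.asarray(obs, dtype=float),
                                      np.asarray(action, dtype=float))
    return np.stack([np.ones_like(obs), obs, action, obs * action], axis=-1)

def fit_q_functions(obs, actions, rewards, action_set=(-1.0, 1.0)):
    """Fit one linear Q-function per time step by backward least squares.

    obs, actions, rewards: arrays of shape (n_trajectories, horizon).
    Returns a list of coefficient vectors, one per time step.
    """
    n, horizon = rewards.shape
    thetas = [None] * horizon
    future_value = np.zeros(n)  # no value beyond the final time step
    for t in reversed(range(horizon)):
        # Regression target: immediate reward plus estimated best future value.
        target = rewards[:, t] + future_value
        X = features(obs[:, t], actions[:, t])
        thetas[t], *_ = np.linalg.lstsq(X, target, rcond=None)
        # Value of acting greedily at step t, used as the target at step t - 1.
        q_all = np.stack([features(obs[:, t], a) @ thetas[t] for a in action_set],
                         axis=-1)
        future_value = q_all.max(axis=-1)
    return thetas

def greedy_action(theta, obs, action_set=(-1.0, 1.0)):
    # The learned policy picks the action with the largest estimated Q-value.
    q = np.stack([features(obs, a) @ theta for a in action_set], axis=-1)
    return np.asarray(action_set)[q.argmax(axis=-1)]
```

Calling `fit_q_functions` on arrays of observations, actions, and rewards returns one coefficient vector per time step, and `greedy_action` then reads off the estimated policy at that step. The policy is derived from fitted Q-functions rather than optimized directly, which is the mismatch the abstract's approximation term refers to.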

PMCID: PMC1475741 (PubMed Central)
