Home > Publications . Search All . Browse All . Country . Browse PSC Pubs . PSC Report Series

PSC In The News

RSS Feed icon

Miech on 'generational forgetting' about drug-use dangers

Impacts of H-1B visas: Lower prices and higher production - or lower wages and higher profits?

MTF data show 10% of 19-20 year-olds report bouts of drinking 10-plus alcoholic beverages

More News

Highlights

Call for papers: Conference on computational social science, April 2017, U-M

Sioban Harlow honored with 2017 Sarah Goddard Power Award for commitment to women's health

Post-doc fellowship in computational social science for summer or fall 2017, U-Penn

ICPSR Summer Program scholarships to support training in statistics, quantitative methods, research design, and data analysis

More Highlights

Next Brown Bag

Mon, Feb 13, 2017, noon:
Daniel Almirall, "Getting SMART about adaptive interventions"

Susan A. Murphy photo

A generalization error for Q-learning

Publication Abstract

Murphy, Susan A. 2005. "A generalization error for Q-learning." Journal of Machine Learning Research, 6(July): 1073-1097.

Planning problems that involve learning a policy from a single training set of finite horizon trajectories arise in both social science and medical fields. We consider Q-learning with function approximation for this setting and derive an upper bound on the generalization error. This upper bound is in terms of quantities minimized by a Q-learning algorithm, the complexity of the approximation space and an approximation term due to the mismatch between Q-learning and the goal of learning a policy that maximizes the value function.

PMCID: PMC1475741. (Pub Med Central)

Browse | Search : All Pubs | Next