Home > Publications . Search All . Browse All . Country . Browse PSC Pubs . PSC Report Series

PSC In The News

RSS Feed icon

Hindustan Times points out high value of H-1B visas for US innovation, welfare, and tech firm profits

Novak, Geronimus, Martinez-Cardoso: Threat of deportation harmful to immigrants' health

Students from two worlds learn from one another in Morenoff's Inside-Out class

More News

Highlights

Heather Ann Thompson wins Pulitzer Prize for book on Attica uprising

Lam explores dimensions of the projected 4 billion increase in world population before 2100

ISR's Nick Prieur wins UMOR award for exceptional contribution to U-M's research mission

How effectively can these nations handle outside investments in health R&D?

More Highlights

Next Brown Bag

Mon, April 10, 2017, noon:
Elizabeth Bruch

Susan A. Murphy photo

A generalization error for Q-learning

Publication Abstract

Murphy, Susan A. 2005. "A generalization error for Q-learning." Journal of Machine Learning Research, 6(July): 1073-1097.

Planning problems that involve learning a policy from a single training set of finite horizon trajectories arise in both social science and medical fields. We consider Q-learning with function approximation for this setting and derive an upper bound on the generalization error. This upper bound is in terms of quantities minimized by a Q-learning algorithm, the complexity of the approximation space and an approximation term due to the mismatch between Q-learning and the goal of learning a policy that maximizes the value function.

PMCID: PMC1475741. (Pub Med Central)

Browse | Search : All Pubs | Next