A generalization error for Q-learning

Publication Abstract

Murphy, Susan A. 2005. "A generalization error for Q-learning." Journal of Machine Learning Research, 6(July): 1073-1097.

Planning problems that involve learning a policy from a single training set of finite horizon trajectories arise in both social science and medical fields. We consider Q-learning with function approximation for this setting and derive an upper bound on the generalization error. This upper bound is in terms of quantities minimized by a Q-learning algorithm, the complexity of the approximation space and an approximation term due to the mismatch between Q-learning and the goal of learning a policy that maximizes the value function.
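As a rough illustration of the setting the abstract describes, the sketch below fits Q-functions from a single batch of finite horizon trajectories by backward least squares with a linear approximation space. It is not the paper's method or notation: the feature map `features`, the binary action set, and the use of only the current observation (rather than the full history) as the state are simplifying assumptions made here for illustration.

```python
import numpy as np


def features(obs, action):
    """Hypothetical linear feature map: intercept, observation, action, interaction."""
    obs = np.asarray(obs, dtype=float)
    action = np.asarray(action, dtype=float)
    return np.column_stack([np.ones_like(obs), obs, action, obs * action])


def batch_q_learning(obs, actions, rewards, action_set=(0.0, 1.0)):
    """Fit one linear Q-function per stage, working backward from the last stage.

    obs, actions, rewards: arrays of shape (n_trajectories, horizon).
    Returns a list of weight vectors, one per stage.
    """
    n, horizon = obs.shape
    weights = [None] * horizon
    future_value = np.zeros(n)  # no reward is collected after the final stage
    for t in reversed(range(horizon)):
        # Regression target: immediate reward plus estimated value of the next stage.
        target = rewards[:, t] + future_value
        X = features(obs[:, t], actions[:, t])
        weights[t], *_ = np.linalg.lstsq(X, target, rcond=None)
        # Estimated value of acting greedily at stage t; feeds the stage t-1 target.
        q_all = np.column_stack(
            [features(obs[:, t], np.full(n, a)) @ weights[t] for a in action_set]
        )
        future_value = q_all.max(axis=1)
    return weights


def greedy_policy(weights, t, obs_t, action_set=(0.0, 1.0)):
    """Return the action maximizing the fitted stage-t Q-function for each observation."""
    obs_t = np.atleast_1d(np.asarray(obs_t, dtype=float))
    q_all = np.column_stack(
        [features(obs_t, np.full(obs_t.shape, a)) @ weights[t] for a in action_set]
    )
    return np.asarray(action_set)[q_all.argmax(axis=1)]
```

The squared errors minimized at each stage are the kind of "quantities minimized by a Q-learning algorithm" the abstract refers to; the learned policy is the greedy one with respect to the fitted Q-functions, which is where the mismatch with directly maximizing the value function can enter.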

PMCID: PMC1475741 (PubMed Central)
