Department of
Applied Mathematics & Statistics (AMS)
Reading 2017, Day 5: Bayesian Non-Parametric Modelling and Case Studies in Bayesian Data Science
Topics will include the value and importance of Bayesian non-parametric (BNP) methods in data science; Dirichlet-Process (DP) and Polya-Tree (PT) priors for CDFs; the stick-breaking algorithm; DP mixture modeling; and the value of the frequentist bootstrap as an approximate BNP method, and how this opens up BNP for data science with enormous data sets.
The case studies will be drawn from work I did at eBay Research Labs on large-scale A/B testing (randomized controlled trials) and large-scale observational studies.
Through one or more practicals (computer labs), the course will liberally illustrate user-friendly implementations of MCMC sampling via the freeware programs WinBUGS (for Windows platforms), R and rjags (for all platforms) when closed-form solutions are not possible.
The course is intended mainly for people who often use statistics in their research or other work in academia, government or industry; as noted above, a first, second and third course in Bayesian analysis equivalent to the content of day 4 of this five-day course will provide sufficient background for participants.
He is a Fellow of the American Association for the Advancement of Science, the American Statistical Association (ASA), the Institute of Mathematical Statistics, and the Royal Statistical Society; from 2001 to 2003 he served as the President-Elect, President, and Past President of the International Society for Bayesian Analysis (ISBA).
He is the author or co-author of about 145 contributions to the methodological and applied statistical literature, including articles in the Journal of the Royal Statistical Society (Series A, B and C), the Journal of the American Statistical Association, the Annals of Applied Statistics, Bayesian Analysis, Statistical Science, the New England Journal of Medicine, and the Journal of the American Medical Association; his 1995 JRSS-B article on assessment and propagation of model uncertainty has been cited about 1,650 times, and taken together his publications have been cited about 14,500 times.
His research is in the areas of Bayesian inference and prediction, model uncertainty and empirical model-building, hierarchical modeling, Markov Chain Monte Carlo methods, and Bayesian nonparametric methods, with applications mainly in medicine, health policy, education, environmental risk assessment and data science.
His short courses have received Excellence in Continuing Education Awards from the American Statistical Association on two occasions. He has won or been nominated for major teaching awards everywhere he has taught (the University of Chicago; the RAND Graduate School of Public Policy Studies; the University of California, Los Angeles; the University of Bath (UK); and the University of California, Santa Cruz).
He has a particular interest in the exposition of complex statistical methods and ideas in the context of real-world applications.
(last modified: 18 November 2017)