Category Archives: statistics

June 27, 2014 · 8:00 am

MCMC in Python: Another thing that turns out to be hard

Here is an interesting StackOverflow question, about a model for data distributed as the sum of two uniforms with unknown support. I was surprised how hard it was for me.

http://stackoverflow.com/questions/24379868/estimate-the-parameters-of-a-random-variable-which-is-the-sum-of-uniform-random/24397044#24397044

I think the future of probabilistic programming should be to make a model for this easy to code.

Comments Off on MCMC in Python: Another thing that turns out to be hard

Filed under statistics

Tagged as MCMC, pymc

June 26, 2014 · 8:00 am

MCMC in Python: Never… no… always check for convergence

I’ve had no teaching responsibilities over the last quarter, and I must miss it. I’ve found myself responding to PyMC questions on StackOverflow more than ever before. It is an interesting window into what is hard in Bayesian computation. Checking (and achieving) MCMC convergence is one thing that is hard. Here are some questions and answers that include it:

http://stackoverflow.com/questions/24294203/difference-between-bugs-model-and-pymc/24347102#24347102
http://stackoverflow.com/questions/24242660/pymc3-multiple-observed-values/24271760#24271760
http://stackoverflow.com/questions/24402834/fitting-power-law-function-with-pymc/24413323#24413323

Comments Off on MCMC in Python: Never… no… always check for convergence

Filed under statistics

Tagged as MCMC, pymc

May 16, 2014 · 8:00 am

MCMC in Python: Estimating failure rates from observed data

A question and answer on CrossValidated, which make me reflect on the danger of knowing enough statistics to be dangerous.

Comments Off on MCMC in Python: Estimating failure rates from observed data

Filed under statistics

Tagged as Bayesian, pymc, python

March 5, 2014 · 8:00 am

Journal Club: Efﬁcient mapping and geographic disparities in breast cancer mortality at the county-level by race and age in the U.S.

Last week, we read Chien et al, Efﬁcient mapping and geographic disparities in breast cancer mortality at the county-level by race and age in the U.S. I’ve been very interested in these sort of “small-areas” spatial statistical methods recently, so it was good to see what is out there as the state of the art. I think I’ve got something to contribute along these lines some day soon.

Maps in small multiples look just lovely, too:

1 Comment

Filed under statistics

Tagged as journal club

February 17, 2014 · 8:00 am

MCMC in Python: random effects logistic regression in PyMC3

It has been a while since I visited my pymc-examples repository, but I got a request there a few weeks ago about the feasibility of upgrading the Seeds Example of a random effects logistic regression model for PyMC3. It turns out that this was not very time consuming, which must mean I’m starting to understand the changes between PyMC2 and PyMC3.

See them side-by-side here (PyMC2) and here (PyMC3).

Comments Off on MCMC in Python: random effects logistic regression in PyMC3

Filed under statistics

Tagged as pymc

February 8, 2014 · 8:00 am

IHME Seminar: Unifying the Counterfactual and Graphical Approaches to Causality via Single World Intervention Graphs (SWIGs)

Thomas Richardson gave a recent seminar at IHME about how the potential outcomes crowd can make sense of graphical models and vice versa. It also has a CSSS working paper to complement it, a trend in our recent seminars: http://www.csss.washington.edu/Papers/wp128.pdf

This talk had some great graphics, as I would hope for in a talk on graphical modeling:

Comments Off on IHME Seminar: Unifying the Counterfactual and Graphical Approaches to Causality via Single World Intervention Graphs (SWIGs)

Filed under statistics

Tagged as seminar

February 6, 2014 · 8:00 am

IHME Seminar: Bayesian reconstruction: estimating past populations and vital rates by age with uncertainty in a variety of data-quality contexts

A recent IHME seminar by Mark Wheldon described a Bayesian approach to estimating past populations and vital rates by age. I like this stuff. The talk is online, and there is a CSSS working paper on it, too: http://www.csss.washington.edu/Papers/wp117.pdf

Comments Off on IHME Seminar: Bayesian reconstruction: estimating past populations and vital rates by age with uncertainty in a variety of data-quality contexts

Filed under global health, statistics

Tagged as seminar

February 5, 2014 · 8:00 am

Journal Club: Repeat Bone Mineral Density Screening and Prediction of Hip and Major Osteoporotic Fracture

Last week we read Repeat Bone Mineral Density Screening and Prediction of Hip and Major Osteoporotic Fracture by Berry et al. It argues that repeat screening does not improve predictions. But I think the world needs a better way to measure the quality of predictions like these. Area-under-the-curve doesn’t cut it when you are predicting the unpredictable.

Comments Off on Journal Club: Repeat Bone Mineral Density Screening and Prediction of Hip and Major Osteoporotic Fracture

Filed under global health, statistics

Tagged as journal club

January 10, 2014 · 1:20 pm

Journal Club: Meat, fish, and esophageal cancer risk: a systematic review and dose-response meta-analysis

For our final journal club paper of last semester, we read Meat, fish, and esophageal cancer risk: a systematic review and dose-response meta-analysis. I am a vegetarian.

That brings me up to date for fall quarter, and the winter quarter is just finishing its first week. Good!

1 Comment

Filed under statistics

Tagged as journal club

December 31, 2013 · 8:00 am

What qualifies as probabilistic programming?

I just went through the classic paper on WinBUGS, which might or might not be called probabilistic programming. It is listed on the probabilistic programming resource page, and it is certainly interesting. The WinBUGS “hello, world” is a linear regression model:

Comments Off on What qualifies as probabilistic programming?

Filed under software engineering, statistics

Tagged as probabilistic programming

Category Archives: statistics

MCMC in Python: Another thing that turns out to be hard

MCMC in Python: Never… no… always check for convergence

MCMC in Python: Estimating failure rates from observed data

Journal Club: Efﬁcient mapping and geographic disparities in breast cancer mortality at the county-level by race and age in the U.S.

MCMC in Python: random effects logistic regression in PyMC3

IHME Seminar: Unifying the Counterfactual and Graphical Approaches to Causality via Single World Intervention Graphs (SWIGs)

IHME Seminar: Bayesian reconstruction: estimating past populations and vital rates by age with uncertainty in a variety of data-quality contexts

Journal Club: Repeat Bone Mineral Density Screening and Prediction of Hip and Major Osteoporotic Fracture

Journal Club: Meat, fish, and esophageal cancer risk: a systematic review and dose-response meta-analysis

What qualifies as probabilistic programming?

Posts

Theory Blogs

some rights reserved

Pages

Archives

Meta