Cool project/resource: Lab in the Wild

http://www.labinthewild.org/studies/color_age/#null

Also, they think I am young.

Comments Off on Cool project/resource: Lab in the Wild

Filed under Uncategorized

Beyond the Sum-Difference Plot

Bland–Altman plot
From Wikipedia, the free encyclopedia

Bland–Altman plot example
A Bland–Altman plot (Difference plot) in analytical chemistry and biostatistics is a method of data plotting used in analyzing the agreement between two different assays. It is identical to a Tukey mean-difference plot, the name by which it is known in other fields, but was popularised in medical statistics by J. Martin Bland and Douglas G. Altman.[1][2]

1 Comment

Filed under dataviz

Brief survey on sequence classification

hi Abie,

It was great speaking with you. This is the paper I was talking about.

http://dl.acm.org/citation.cfm?id=1882478

Looking forward to know more about each other’s work.

Thanks,

2 Comments

Filed under disease modeling, machine learning

Using the sklearn grid_search tools

Scikit-learn has a really nice grid search module. It will soon be called model_selection, because it has grown beyond simple grid search. But here is the spirit of it:

import sklearn.svm, sklearn.grid_search, sklearn.datasets.samples_generator
parameters = {'kernel':('poly', 'rbf'), 'C':[.01, .1, 1, 10, 100]}
clf = sklearn.grid_search.GridSearchCV(
    sklearn.svm.SVC(probability=True),
    parameters,
    n_jobs=64)
X, y = sklearn.datasets.samples_generator.make_classification(n_samples=200, n_features=5, random_state=12345)
clf.fit(X, y)
clf.best_params_

And say you want to take a careful look at the results? They are all in there, too. http://nbviewer.ipython.org/gist/aflaxman/cb0660e602d361d06599

Comments Off on Using the sklearn grid_search tools

Filed under machine learning, software engineering

To read in JAMA: Trends in Prescription Drug Use Among Adults in the United States From 1999-2012

Trends in Prescription Drug Use Among Adults in the United States From 1999-2012
http://jama.jamanetwork.com/article.aspx?articleID=2467552

Comments Off on To read in JAMA: Trends in Prescription Drug Use Among Adults in the United States From 1999-2012

Filed under global health

Projecting ACHD Cases

My new paper, which I believe is the first published application of DisMod-PDE: Projected growth of the adult congenital heart disease population in the United States to 2050: an integrative systems modeling approach
http://www.pophealthmetrics.com/content/13/1/29

Comments Off on Projecting ACHD Cases

Filed under global health

My top 20 numpy calls

One fun thing about using the IPython Notebook as my lab book for all my research is that I can do “me”-search in my copious spare time, for example to see the top 25 `numpy` calls I’ve used this year:

In [1]:
import glob

In [2]:
lines = ''
for fname in glob.glob('*.py'):
    with file(fname) as f:
        lines += f.read()
        lines += '\n'

In [3]:
import re

# Find top np.* calls

In [9]:
np_calls = re.findall('np\.[\w\.]+', lines)
np_calls[:5]
Out[9]:
['np.linspace',
 'np.random.random',
 'np.random.normal',
 'np.sqrt',
 'np.random.normal']

In [10]:
import pandas as pd

In [12]:
pd.Series(np_calls).value_counts().head(20)
Out[12]:
np.array            219
np.random.normal    170
np.mean             130
np.random.seed      126
np.round            124
np.log              119
np.exp              114
np.linspace          96
np.random.choice     84
np.where             84
np.zeros             78
np.ones              65
np.dot               62
np.empty             62
np.sum               52
np.absolute          49
np.nan               47
np.arange            45
np.inf               38
np.sqrt              37

Number one thing: `np.array`! I wonder why I use that.

Comments Off on My top 20 numpy calls

Filed under software engineering

CCCSMF Accuracy

I have a new verbal autopsy paper out, which goes deep into the weeds of how to measure the accuracy of a method for identifying the underlying cause of death at the population level from survey data. http://www.pophealthmetrics.com/content/13/1/28

A fun online appendix includes probabilistic combinatorial calculations of the sort that I was actually trained in: http://www.pophealthmetrics.com/content/supplementary/s12963-015-0061-1-s2.zip

Comments Off on CCCSMF Accuracy

Filed under global health

DM-MR book photo

Finally got one!

t

Comments Off on DM-MR book photo

Filed under global health

Deep Dream and Smudge

I would like to combine the Google Deep Dream with Smudge. This is a way to remind me to try sometime.

Comments Off on Deep Dream and Smudge

Filed under Uncategorized