http://www.labinthewild.org/studies/color_age/#null
Also, they think I am young.
http://www.labinthewild.org/studies/color_age/#null
Also, they think I am young.
Comments Off on Cool project/resource: Lab in the Wild
Filed under Uncategorized
Bland–Altman plot
From Wikipedia, the free encyclopedia
Bland–Altman plot example
A Bland–Altman plot (Difference plot) in analytical chemistry and biostatistics is a method of data plotting used in analyzing the agreement between two different assays. It is identical to a Tukey mean-difference plot, the name by which it is known in other fields, but was popularised in medical statistics by J. Martin Bland and Douglas G. Altman.[1][2]
hi Abie,
It was great speaking with you. This is the paper I was talking about.
http://dl.acm.org/citation.cfm?id=1882478
Looking forward to know more about each other’s work.
Thanks,
Filed under disease modeling, machine learning
Scikit-learn has a really nice grid search module. It will soon be called model_selection, because it has grown beyond simple grid search. But here is the spirit of it:
import sklearn.svm, sklearn.grid_search, sklearn.datasets.samples_generator
parameters = {'kernel':('poly', 'rbf'), 'C':[.01, .1, 1, 10, 100]}
clf = sklearn.grid_search.GridSearchCV(
sklearn.svm.SVC(probability=True),
parameters,
n_jobs=64)
X, y = sklearn.datasets.samples_generator.make_classification(n_samples=200, n_features=5, random_state=12345)
clf.fit(X, y)
clf.best_params_
And say you want to take a careful look at the results? They are all in there, too. http://nbviewer.ipython.org/gist/aflaxman/cb0660e602d361d06599
Comments Off on Using the sklearn grid_search tools
Filed under machine learning, software engineering
Trends in Prescription Drug Use Among Adults in the United States From 1999-2012
http://jama.jamanetwork.com/article.aspx?articleID=2467552
Comments Off on To read in JAMA: Trends in Prescription Drug Use Among Adults in the United States From 1999-2012
Filed under global health
My new paper, which I believe is the first published application of DisMod-PDE: Projected growth of the adult congenital heart disease population in the United States to 2050: an integrative systems modeling approach
http://www.pophealthmetrics.com/content/13/1/29
Comments Off on Projecting ACHD Cases
Filed under global health
One fun thing about using the IPython Notebook as my lab book for all my research is that I can do “me”-search in my copious spare time, for example to see the top 25 `numpy` calls I’ve used this year:
In [1]:
import glob
In [2]:
lines = ''
for fname in glob.glob('*.py'):
with file(fname) as f:
lines += f.read()
lines += '\n'
In [3]:
import re
# Find top np.* calls
In [9]:
np_calls = re.findall('np\.[\w\.]+', lines)
np_calls[:5]
Out[9]:
['np.linspace',
'np.random.random',
'np.random.normal',
'np.sqrt',
'np.random.normal']
In [10]:
import pandas as pd
In [12]:
pd.Series(np_calls).value_counts().head(20)
Out[12]:
np.array 219
np.random.normal 170
np.mean 130
np.random.seed 126
np.round 124
np.log 119
np.exp 114
np.linspace 96
np.random.choice 84
np.where 84
np.zeros 78
np.ones 65
np.dot 62
np.empty 62
np.sum 52
np.absolute 49
np.nan 47
np.arange 45
np.inf 38
np.sqrt 37
Number one thing: `np.array`! I wonder why I use that.
Comments Off on My top 20 numpy calls
Filed under software engineering
I have a new verbal autopsy paper out, which goes deep into the weeds of how to measure the accuracy of a method for identifying the underlying cause of death at the population level from survey data. http://www.pophealthmetrics.com/content/13/1/28
A fun online appendix includes probabilistic combinatorial calculations of the sort that I was actually trained in: http://www.pophealthmetrics.com/content/supplementary/s12963-015-0061-1-s2.zip
Comments Off on CCCSMF Accuracy
Filed under global health
I would like to combine the Google Deep Dream with Smudge. This is a way to remind me to try sometime.
Comments Off on Deep Dream and Smudge
Filed under Uncategorized