Tag Archives: numpy

October 15, 2015 · 8:00 am

My top 20 numpy calls

One fun thing about using the IPython Notebook as my lab book for all my research is that I can do “me”-search in my copious spare time, for example to see the top 20 `numpy` calls I’ve used this year:

In [1]:
import glob

In [2]:
lines = ''
for fname in glob.glob('*.py'):
    with open(fname) as f:
        lines += f.read()
        lines += '\n'

In [3]:
import re

# Find top np.* calls

In [9]:
np_calls = re.findall(r'np\.[\w\.]+', lines)
np_calls[:5]
Out[9]:
['np.linspace',
 'np.random.random',
 'np.random.normal',
 'np.sqrt',
 'np.random.normal']

In [10]:
import pandas as pd

In [12]:
pd.Series(np_calls).value_counts().head(20)
Out[12]:
np.array            219
np.random.normal    170
np.mean             130
np.random.seed      126
np.round            124
np.log              119
np.exp              114
np.linspace          96
np.random.choice     84
np.where             84
np.zeros             78
np.ones              65
np.dot               62
np.empty             62
np.sum               52
np.absolute          49
np.nan               47
np.arange            45
np.inf               38
np.sqrt              37

Number one thing: `np.array`! I wonder why I use that.
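The same tally can be sketched without pandas, using only the standard library. This is a minimal illustration rather than the notebook's actual code: the helper name `count_np_calls` and the `sample` string are made up for the example, and the leading `\b` in the pattern is an added assumption so that identifiers merely ending in `np` (say, a hypothetical `snp.mean`) are not counted.

```python
import re
from collections import Counter


def count_np_calls(source, top=20):
    """Return the `top` most common np.* names found in `source`.

    Hypothetical helper for illustration; it counts attribute names
    such as np.nan as well as function calls, like the regex above.
    """
    # \b requires a word boundary before "np" (an assumption added here)
    names = re.findall(r'\bnp\.[\w.]+', source)
    return Counter(names).most_common(top)


# Made-up source text standing in for the concatenated *.py files
sample = """
x = np.array([1, 2, 3])
y = np.array([4, 5, 6])
z = np.sqrt(np.dot(x, y))
"""

top_calls = count_np_calls(sample)
print(top_calls)
```

`Counter.most_common` plays the role of `value_counts().head(20)` here, returning `(name, count)` pairs sorted by count.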


Filed under software engineering

Tagged as numpy

October 8, 2013 · 8:00 am

Statistics in Python: Bootstrap resampling with numpy and, optionally, pandas

I’m almost ready to do all my writing in the IPython Notebook. If only there were a drag-and-drop solution to move it into a WordPress blog. The next closest thing: an IPython Notebook on GitHub’s Gist, linked from here. This one is about bootstrap resampling with numpy and, optionally, pandas.
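For readers who want the gist without opening the notebook, here is a minimal sketch of percentile bootstrap resampling with numpy. It is not the linked notebook's code: the function name `bootstrap_ci`, its parameters, and the synthetic `sample` data are all assumptions made for illustration.

```python
import numpy as np


def bootstrap_ci(data, stat=np.mean, n_resamples=2000, alpha=0.05, seed=12345):
    """Percentile bootstrap confidence interval for `stat` (illustrative sketch)."""
    rng = np.random.default_rng(seed)
    data = np.asarray(data)
    # Each row of idx holds len(data) indices drawn with replacement,
    # so data[idx] is one resampled data set per row
    idx = rng.integers(0, len(data), size=(n_resamples, len(data)))
    stats = stat(data[idx], axis=1)
    # The interval endpoints are percentiles of the resampled statistics
    lo, hi = np.percentile(stats, [100 * alpha / 2, 100 * (1 - alpha / 2)])
    return lo, hi


# Synthetic data, made up for this example
sample = np.random.default_rng(0).normal(loc=10.0, scale=2.0, size=200)
lo, hi = bootstrap_ci(sample)
print(lo, hi)
```

With pandas, a single resample of a Series `s` can be drawn with `s.sample(frac=1, replace=True)`, which is one way the "optionally, pandas" part could look.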


Filed under statistics

Tagged as bootstrap, numpy, pandas, python

some rights reserved

This material is released under the Creative Commons Noncommercial Attribution Share-Alike 3.0 License