Optimising Python

There are a number of optimisations and shortcuts you can take to speed up code; here we focus on Python specifically.

Analysis

The first step in speeding up code is profiling it, so you know where the time is actually being spent. This can be done in a number of ways: the standard library's cProfile module gives a per-function breakdown, and in Jupyter the %timeit magic times individual statements.
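As a minimal sketch, using cProfile (the function here is just a stand-in for real code):

import cProfile

def slow_function():
    # deliberately wasteful work, standing in for the code being profiled
    return sum(i**2 for i in range(100_000))

# prints how many times each function was called and where the time went
cProfile.run("slow_function()")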

Numpy

When working with arrays or lists and performing operations on them, it is recommended to use numpy broadcasting instead of for loops. This is because numpy's operations are implemented in optimised, compiled C code, which executes far faster than an interpreted Python loop.

For example, compare the two cells below (assuming numpy has already been imported as np).

%%timeit
x = np.arange(1, 5+1)
for i in range(len(x)):
    x[i] = x[i]**2

# > 4.45 µs ± 972 ns per loop (mean ± std. dev. of 7 runs, 100,000 loops each)

%%timeit
x = np.arange(1, 5+1)
x = np.power(x, 2)  # broadcast: squares every element in one call

# > 2.29 µs ± 350 ns per loop (mean ± std. dev. of 7 runs, 100,000 loops each)

As you can see, the vectorised version takes roughly half the time even for a five-element array, and the gap widens considerably as arrays grow.

However, note that numpy arrays are not always better than lists: iteratively appending items is much faster with native Python lists, because np.append has to copy the entire array on every call.
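As a minimal sketch of the difference, compare these two cells (timing results omitted; the asymptotics are the point):

%%timeit
out = []
for i in range(10_000):
    out.append(i)            # amortised O(1): the list grows in place

%%timeit
out = np.array([], dtype=int)
for i in range(10_000):
    out = np.append(out, i)  # O(n): copies the whole array each iteration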

Cython

Cython is an extension of Python that is compiled into optimised C or C++ code, which can provide a significant speedup.

The steps are as follows:

  1. Ensure the code is type annotated
  2. Save it as a .pyx file
  3. The .pyx file is translated into C
  4. The C is compiled into a shared library, which Python can load

The cythonize shell command performs steps 3-4 for you. If you're working in a Jupyter notebook, the %%cython cell magic (available after running %load_ext Cython) does the same.
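For example, assuming the annotated code lives in a file called fast_module.pyx (the filename is hypothetical):

$ cythonize -i fast_module.pyx   # translates to C and compiles a shared library in place

# the compiled extension is then imported like any other module
import fast_module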

The type annotations must be done using C types, not Python types!

To type variables, use cdef, for example

cdef double complex value # value is the variable name

To type function parameters and return values, use cpdef, which produces a function callable from both C and Python:

cpdef double square(double x=5):
    return x*x

It’s also possible to use numpy in Cython. You need to import twice:

import numpy as np
cimport numpy as np

Then, use numpy as normal, with a couple of alterations: most importantly, declare the element type and number of dimensions of array arguments, so that Cython can generate fast C-level indexing.
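A minimal sketch of a typed-array function (the function and variable names are illustrative):

import numpy as np
cimport numpy as np

cpdef double sum_squares(np.ndarray[np.double_t, ndim=1] arr):
    cdef double total = 0.0
    cdef Py_ssize_t i
    for i in range(arr.shape[0]):
        total += arr[i] * arr[i]  # plain C indexing, no Python overhead
    return total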

Numba

An alternative to all this rewriting is numba, a just-in-time (JIT) compiler that translates a subset of Python and numpy code into machine code.

To use it, import numba. The @numba.njit decorator can be applied to a function to speed it up. However, the first call will be slow, as that is when the function gets compiled.
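A minimal sketch (the summing function is just an illustration):

import numba
import numpy as np

@numba.njit
def total(x):
    s = 0.0
    for i in range(x.shape[0]):
        s += x[i]
    return s

x = np.random.rand(1_000_000)
total(x)  # first call: slow, triggers compilation
total(x)  # subsequent calls: run the compiled machine code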

numba also supports parallelisation. For example, to operate on all elements of an array independently, we need to do two things: pass parallel=True to the decorator, and use numba.prange instead of range for the loop to be parallelised, as in the sketch below.
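A minimal sketch (the function name is illustrative):

import numba
import numpy as np

@numba.njit(parallel=True)              # 1. enable automatic parallelisation
def square_all(x):
    out = np.empty_like(x)
    for i in numba.prange(x.shape[0]):  # 2. prange marks this loop as parallel
        out[i] = x[i] ** 2
    return out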

Data Analysis

If .csv files get too large, they can be stored far more compactly using Parquet, a compressed, column-oriented file format.

Parquet files are column-based rather than row-based, and so are better suited to column-wise operations.

You can achieve a significant speedup just with this change, for example:

import pandas as pd

df = pd.DataFrame(...)

df.to_parquet("data.parquet")

df = pd.read_parquet("data.parquet", engine="pyarrow")

# column operations are now much faster

Note that this requires the pyarrow package, which can be installed first with pip install pyarrow.
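Because Parquet stores each column contiguously, you can also load just the columns you need rather than the whole file (the column names here are hypothetical):

# reads only the two named columns from disk
df = pd.read_parquet("data.parquet", columns=["price", "volume"])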

Inbuilts

functools.reduce is a useful function that applies a two-argument function cumulatively over a list, collapsing it to a single value.

For example:

xvals = [1,2,3,4,5,4,3,2,1]
max_val = 0
for i in xvals:
    if i>max_val:
        max_val = i

is equivalent (for non-negative values, since max_val starts at 0) to

from functools import reduce
reduce(max, xvals)
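In this particular case max(xvals) is the idiomatic one-liner; reduce earns its keep when there is no dedicated built-in for the accumulation. For example, a running product with an explicit starting value (the final argument, 1, is the initialiser):

from functools import reduce

xvals = [1, 2, 3, 4, 5, 4, 3, 2, 1]
product = reduce(lambda acc, x: acc * x, xvals, 1)  # ((((1*1)*2)*3)*...) = 2880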