SciPy Reference Guide

User Manual:

Open the PDF directly: View PDF .
Page Count: 1229 [warning: Documents this large are best viewed by clicking the View PDF Link!]

SciPy Tutorial
Contributing to SciPy
API - importing from Scipy
- Guidelines for importing functions from Scipy
- API definition
Release Notes
Reference
Bibliography
Index

Release 0.13.0

Written by the SciPy community

October 21, 2013

CONTENTS

1 SciPy Tutorial 3

1.1 Introduction ............................................... 3

1.2 Basic functions .............................................. 5

1.3 Special functions (scipy.special)................................. 9

1.4 Integration (scipy.integrate)................................... 11

1.5 Optimization (scipy.optimize)................................... 16

1.6 Interpolation (scipy.interpolate)................................ 29

1.7 Fourier Transforms (scipy.fftpack)................................ 41

1.8 Signal Processing (scipy.signal).................................. 44

1.9 Linear Algebra (scipy.linalg)................................... 52

1.10 Sparse Eigenvalue Problems with ARPACK ............................... 65

1.11 Compressed Sparse Graph Routines scipy.sparse.csgraph .................. 68

1.12 Spatial data structures and algorithms (scipy.spatial)...................... 71

1.13 Statistics (scipy.stats)....................................... 77

1.14 Multidimensional image processing (scipy.ndimage)....................... 96

1.15 File IO (scipy.io)........................................... 117

1.16 Weave (scipy.weave)........................................ 123

2 Contributing to SciPy 159

2.1 Contributing new code .......................................... 159

2.2 Contributing by helping maintain existing code ............................. 160

2.3 Other ways to contribute ......................................... 161

2.4 Recommended development setup .................................... 161

2.5 SciPy structure .............................................. 162

2.6 Useful links, FAQ, checklist ....................................... 162

3 API - importing from Scipy 165

3.1 Guidelines for importing functions from Scipy ............................. 165

3.2 API deﬁnition .............................................. 166

4 Release Notes 169

4.1 SciPy 0.13.0 Release Notes ....................................... 169

4.2 SciPy 0.12.0 Release Notes ....................................... 176

4.3 SciPy 0.11.0 Release Notes ....................................... 181

4.4 SciPy 0.10.0 Release Notes ....................................... 187

4.5 SciPy 0.9.0 Release Notes ........................................ 191

4.6 SciPy 0.8.0 Release Notes ........................................ 195

4.7 SciPy 0.7.2 Release Notes ........................................ 199

4.8 SciPy 0.7.1 Release Notes ........................................ 199

4.9 SciPy 0.7.0 Release Notes ........................................ 201

5 Reference 207

5.1 Clustering package (scipy.cluster)................................ 207

5.2 K-means clustering and vector quantization (scipy.cluster.vq)................. 207

5.3 Hierarchical clustering (scipy.cluster.hierarchy)...................... 211

5.4 Constants (scipy.constants).................................... 226

5.5 Discrete Fourier transforms (scipy.fftpack)............................ 241

5.6 Integration and ODEs (scipy.integrate)............................. 255

5.7 Interpolation (scipy.interpolate)................................ 273

5.8 Input and output (scipy.io)...................................... 320

5.9 Linear algebra (scipy.linalg)................................... 331

5.10 Low-level BLAS functions ........................................ 373

5.11 Finding functions ............................................. 373

5.12 All functions ............................................... 373

5.13 Low-level LAPACK functions ...................................... 398

5.14 Finding functions ............................................. 398

5.15 All functions ............................................... 398

5.16 Interpolative matrix decomposition (scipy.linalg.interpolative)............. 448

5.17 Miscellaneous routines (scipy.misc)................................ 457

5.18 Multi-dimensional image processing (scipy.ndimage)....................... 466

5.19 Orthogonal distance regression (scipy.odr)............................. 520

5.20 Optimization and root ﬁnding (scipy.optimize).......................... 530

5.21 Nonlinear solvers ............................................. 593

5.22 Signal processing (scipy.signal).................................. 595

5.23 Sparse matrices (scipy.sparse)................................... 692

5.24 Sparse linear algebra (scipy.sparse.linalg).......................... 784

5.25 Compressed Sparse Graph Routines (scipy.sparse.csgraph).................. 809

5.26 Spatial algorithms and data structures (scipy.spatial)...................... 819

5.27 Distance computations (scipy.spatial.distance)....................... 853

5.28 Special functions (scipy.special)................................. 867

5.29 Statistical functions (scipy.stats)................................. 897

5.30 Statistical functions for masked arrays (scipy.stats.mstats).................. 1162

5.31 C/C++ integration (scipy.weave).................................. 1188

Bibliography 1193

Index 1205

SciPy Reference Guide, Release 0.13.0

Release 0.13.0

Date October 21, 2013

SciPy (pronounced “Sigh Pie”) is open-source software for mathematics, science, and engineering.

CONTENTS 1

SciPy Reference Guide, Release 0.13.0

2 CONTENTS

CHAPTER

ONE

SCIPY TUTORIAL

1.1 Introduction

Contents

•Introduction

–SciPy Organization

–Finding Documentation

SciPy is a collection of mathematical algorithms and convenience functions built on the Numpy extension of Python. It

adds signiﬁcant power to the interactive Python session by providing the user with high-level commands and classes for

manipulating and visualizing data. With SciPy an interactive Python session becomes a data-processing and system-

prototyping environment rivaling sytems such as MATLAB, IDL, Octave, R-Lab, and SciLab.

The additional beneﬁt of basing SciPy on Python is that this also makes a powerful programming language available

for use in developing sophisticated programs and specialized applications. Scientiﬁc applications using SciPy beneﬁt

from the development of additional modules in numerous niche’s of the software landscape by developers across the

world. Everything from parallel programming to web and data-base subroutines and classes have been made available

to the Python programmer. All of this power is available in addition to the mathematical libraries in SciPy.

This tutorial will acquaint the ﬁrst-time user of SciPy with some of its most important features. It assumes that the

user has already installed the SciPy package. Some general Python facility is also assumed, such as could be acquired

by working through the Python distribution’s Tutorial. For further introductory help the user is directed to the Numpy

documentation.

For brevity and convenience, we will often assume that the main packages (numpy, scipy, and matplotlib) have been

imported as:

>>> import numpy as np

>>> import scipy as sp

>>> import matplotlib as mpl

>>> import matplotlib.pyplot as plt

These are the import conventions that our community has adopted after discussion on public mailing lists. You will

see these conventions used throughout NumPy and SciPy source code and documentation. While we obviously don’t

require you to follow these conventions in your own code, it is highly recommended.

1.1.1 SciPy Organization

SciPy is organized into subpackages covering different scientiﬁc computing domains. These are summarized in the

following table:

SciPy Reference Guide, Release 0.13.0

Subpackage Description

cluster Clustering algorithms

constants Physical and mathematical constants

fftpack Fast Fourier Transform routines

integrate Integration and ordinary differential equation solvers

interpolate Interpolation and smoothing splines

io Input and Output

linalg Linear algebra

ndimage N-dimensional image processing

odr Orthogonal distance regression

optimize Optimization and root-ﬁnding routines

signal Signal processing

sparse Sparse matrices and associated routines

spatial Spatial data structures and algorithms

special Special functions

stats Statistical distributions and functions

weave C/C++ integration

Scipy sub-packages need to be imported separately, for example:

>>> from scipy import linalg, optimize

Because of their ubiquitousness, some of the functions in these subpackages are also made available in the scipy

namespace to ease their use in interactive sessions and programs. In addition, many basic array functions from numpy

are also available at the top-level of the scipy package. Before looking at the sub-packages individually, we will ﬁrst

look at some of these common functions.

1.1.2 Finding Documentation

SciPy and NumPy have documentation versions in both HTML and PDF format available at http://docs.scipy.org/, that

cover nearly all available functionality. However, this documentation is still work-in-progress and some parts may be

incomplete or sparse. As we are a volunteer organization and depend on the community for growth, your participation

- everything from providing feedback to improving the documentation and code - is welcome and actively encouraged.

Python’s documentation strings are used in SciPy for on-line documentation. There are two methods for reading

them and getting help. One is Python’s command help in the pydoc module. Entering this command with no

arguments (i.e. >>> help ) launches an interactive help session that allows searching through the keywords and

modules available to all of Python. Secondly, running the command help(obj) with an object as the argument displays

that object’s calling signature, and documentation string.

The pydoc method of help is sophisticated but uses a pager to display the text. Sometimes this can interfere with

the terminal you are running the interactive session within. A scipy-speciﬁc help system is also available under the

command sp.info. The signature and documentation string for the object passed to the help command are printed

to standard output (or to a writeable object passed as the third argument). The second keyword argument of sp.info

deﬁnes the maximum width of the line for printing. If a module is passed as the argument to help than a list of the

functions and classes deﬁned in that module is printed. For example:

>>> sp.info(optimize.fmin)

fmin(func, x0, args=(), xtol=0.0001, ftol=0.0001, maxiter=None, maxfun=None,

full_output=0, disp=1, retall=0, callback=None)

Minimize a function using the downhill simplex algorithm.

Parameters

----------

func : callable func(x,*args)

4 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

The objective function to be minimized.

x0 : ndarray

Initial guess.

args : tuple

Extra arguments passed to func, i.e. ‘‘f(x,*args)‘‘.

callback : callable

Called after each iteration, as callback(xk), where xk is the

current parameter vector.

Returns

-------

xopt : ndarray

Parameter that minimizes function.

fopt : float

Value of function at minimum: ‘‘fopt = func(xopt)‘‘.

iter : int

Number of iterations performed.

funcalls : int

Number of function calls made.

warnflag : int

1 : Maximum number of function evaluations made.

2 : Maximum number of iterations reached.

allvecs : list

Solution at each iteration.

Other parameters

----------------

xtol : float

Relative error in xopt acceptable for convergence.

ftol : number

Relative error in func(xopt) acceptable for convergence.

maxiter : int

Maximum number of iterations to perform.

maxfun : number

Maximum number of function evaluations to make.

full_output : bool

Set to True if fopt and warnflag outputs are desired.

disp : bool

Set to True to print convergence messages.

retall : bool

Set to True to return list of solutions at each iteration.

Notes

-----

Uses a Nelder-Mead simplex algorithm to find the minimum of function of

one or more variables.

Another useful command is source. When given a function written in Python as an argument, it prints out a listing

of the source code for that function. This can be helpful in learning about an algorithm or understanding exactly what

a function is doing with its arguments. Also don’t forget about the Python command dir which can be used to look

at the namespace of a module or package.

1.2 Basic functions

1.2. Basic functions 5

SciPy Reference Guide, Release 0.13.0

Contents

•Basic functions

–Interaction with Numpy

*Index Tricks

*Shape manipulation

*Polynomials

*Vectorizing functions (vectorize)

*Type handling

*Other useful functions

1.2.1 Interaction with Numpy

Scipy builds on Numpy, and for all basic array handling needs you can use Numpy functions:

>>> import numpy as np

>>> np.some_function()

Rather than giving a detailed description of each of these functions (which is available in the Numpy Reference Guide

or by using the help,info and source commands), this tutorial will discuss some of the more useful commands

which require a little introduction to use to their full potential.

To use functions from some of the Scipy modules, you can do:

>>> from scipy import some_module

>>> some_module.some_function()

The top level of scipy also contains functions from numpy and numpy.lib.scimath. However, it is better to

use them directly from the numpy module instead.

Index Tricks

There are some class instances that make special use of the slicing functionality to provide efﬁcient means for array

construction. This part will discuss the operation of np.mgrid ,np.ogrid ,np.r_ , and np.c_ for quickly

constructing arrays.

For example, rather than writing something like the following

>>> concatenate(([3],[0]*5,arange(-1,1.002,2/9.0)))

with the r_ command one can enter this as

>>> r_[3,[0]*5,-1:1:10j]

which can ease typing and make for more readable code. Notice how objects are concatenated, and the slicing syntax

is (ab)used to construct ranges. The other term that deserves a little explanation is the use of the complex number

10j as the step size in the slicing syntax. This non-standard use allows the number to be interpreted as the number of

points to produce in the range rather than as a step size (note we would have used the long integer notation, 10L, but

this notation may go away in Python as the integers become uniﬁed). This non-standard usage may be unsightly to

some, but it gives the user the ability to quickly construct complicated vectors in a very readable fashion. When the

number of points is speciﬁed in this way, the end- point is inclusive.

The “r” stands for row concatenation because if the objects between commas are 2 dimensional arrays, they are stacked

by rows (and thus must have commensurate columns). There is an equivalent command c_ that stacks 2d arrays by

columns but works identically to r_ for 1d arrays.

6 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

Another very useful class instance which makes use of extended slicing notation is the function mgrid. In the simplest

case, this function can be used to construct 1d ranges as a convenient substitute for arange. It also allows the use of

complex-numbers in the step-size to indicate the number of points to place between the (inclusive) end-points. The real

purpose of this function however is to produce N, N-d arrays which provide coordinate arrays for an N-dimensional

volume. The easiest way to understand this is with an example of its usage:

>>> mgrid[0:5,0:5]

array([[[0, 0, 0, 0, 0],

[1, 1, 1, 1, 1],

[2, 2, 2, 2, 2],

[3, 3, 3, 3, 3],

[4, 4, 4, 4, 4]],

[[0, 1, 2, 3, 4],

[0, 1, 2, 3, 4],

[0, 1, 2, 3, 4]]])

>>> mgrid[0:5:4j,0:5:4j]

array([[[ 0. , 0. , 0. , 0. ],

[ 1.6667, 1.6667, 1.6667, 1.6667],

[ 3.3333, 3.3333, 3.3333, 3.3333],

[ 5. , 5. , 5. , 5. ]],

[[ 0. , 1.6667, 3.3333, 5. ],

[ 0. , 1.6667, 3.3333, 5. ],

[ 0. , 1.6667, 3.3333, 5. ]]])

Having meshed arrays like this is sometimes very useful. However, it is not always needed just to evaluate some N-

dimensional function over a grid due to the array-broadcasting rules of Numpy and SciPy. If this is the only purpose for

generating a meshgrid, you should instead use the function ogrid which generates an “open” grid using newaxis

judiciously to create N, N-d arrays where only one dimension in each array has length greater than 1. This will save

memory and create the same result if the only purpose for the meshgrid is to generate sample points for evaluation of

an N-d function.

Shape manipulation

In this category of functions are routines for squeezing out length- one dimensions from N-dimensional arrays, ensur-

ing that an array is at least 1-, 2-, or 3-dimensional, and stacking (concatenating) arrays by rows, columns, and “pages

“(in the third dimension). Routines for splitting arrays (roughly the opposite of stacking arrays) are also available.

Polynomials

There are two (interchangeable) ways to deal with 1-d polynomials in SciPy. The ﬁrst is to use the poly1d class from

Numpy. This class accepts coefﬁcients or polynomial roots to initialize a polynomial. The polynomial object can then

be manipulated in algebraic expressions, integrated, differentiated, and evaluated. It even prints like a polynomial:

>>> p=poly1d([3,4,5])

>>> print p

3x+4x+5

>>> print p*p

432

9x+24x+46x+40x+25

>>> print p.integ(k=6)

3 2

x+2x+5x+6

1.2. Basic functions 7

SciPy Reference Guide, Release 0.13.0

>>> print p.deriv()

6x+4

>>> p([4,5])

array([ 69, 100])

The other way to handle polynomials is as an array of coefﬁcients with the ﬁrst element of the array giving the

coefﬁcient of the highest power. There are explicit functions to add, subtract, multiply, divide, integrate, differentiate,

and evaluate polynomials represented as sequences of coefﬁcients.

Vectorizing functions (vectorize)

One of the features that NumPy provides is a class vectorize to convert an ordinary Python function which accepts

scalars and returns scalars into a “vectorized-function” with the same broadcasting rules as other Numpy functions

(i.e. the Universal functions, or ufuncs). For example, suppose you have a Python function named addsubtract

deﬁned as:

>>> def addsubtract(a,b):

... if a>b:

... return a-b

... else:

... return a+b

which deﬁnes a function of two scalar variables and returns a scalar result. The class vectorize can be used to “vectorize

“this function so that

>>> vec_addsubtract =vectorize(addsubtract)

returns a function which takes array arguments and returns an array result:

>>> vec_addsubtract([0,3,6,9],[1,3,5,7])

array([1, 6, 1, 2])

This particular function could have been written in vector form without the use of vectorize . But, what if the

function you have written is the result of some optimization or integration routine. Such functions can likely only be

vectorized using vectorize.

Type handling

Note the difference between np.iscomplex/np.isreal and np.iscomplexobj/np.isrealobj. The for-

mer command is array based and returns byte arrays of ones and zeros providing the result of the element-wise test.

The latter command is object based and returns a scalar describing the result of the test on the entire object.

Often it is required to get just the real and/or imaginary part of a complex number. While complex numbers and arrays

have attributes that return those values, if one is not sure whether or not the object will be complex-valued, it is better

to use the functional forms np.real and np.imag . These functions succeed for anything that can be turned into

a Numpy array. Consider also the function np.real_if_close which transforms a complex-valued number with

tiny imaginary part into a real number.

Occasionally the need to check whether or not a number is a scalar (Python (long)int, Python ﬂoat, Python complex,

or rank-0 array) occurs in coding. This functionality is provided in the convenient function np.isscalar which

returns a 1 or a 0.

Finally, ensuring that objects are a certain Numpy type occurs often enough that it has been given a convenient interface

in SciPy through the use of the np.cast dictionary. The dictionary is keyed by the type it is desired to cast to and

the dictionary stores functions to perform the casting. Thus, np.cast[’f’](d) returns an array of np.float32

from d. This function is also useful as an easy way to get a scalar of a certain type:

8 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

>>> np.cast[’f’](np.pi)

array(3.1415927410125732, dtype=float32)

Other useful functions

There are also several other useful functions which should be mentioned. For doing phase processing, the functions

angle, and unwrap are useful. Also, the linspace and logspace functions return equally spaced samples in a

linear or log scale. Finally, it’s useful to be aware of the indexing capabilities of Numpy. Mention should be made of

the function select which extends the functionality of where to include multiple conditions and multiple choices.

The calling convention is select(condlist,choicelist,default=0). select is a vectorized form of

the multiple if-statement. It allows rapid construction of a function which returns an array of results based on a list

of conditions. Each element of the return array is taken from the array in a choicelist corresponding to the ﬁrst

condition in condlist that is true. For example

>>> x=r_[-2:3]

>>> x

array([-2, -1, 0, 1, 2])

>>> np.select([x >3,x>= 0],[0,x+2])

array([0, 0, 2, 3, 4])

Some additional useful functions can also be found in the module scipy.misc. For example the factorial and

comb functions compute n!and n!/k!(n−k)! using either exact integer arithmetic (thanks to Python’s Long integer

object), or by using ﬂoating-point precision and the gamma function. Another function returns a common image used

in image processing: lena.

Finally, two functions are provided that are useful for approximating derivatives of functions using discrete-differences.

The function central_diff_weights returns weighting coefﬁcients for an equally-spaced N-point approxima-

tion to the derivative of order o. These weights must be multiplied by the function corresponding to these points and

the results added to obtain the derivative approximation. This function is intended for use when only samples of the

function are avaiable. When the function is an object that can be handed to a routine and evaluated, the function

derivative can be used to automatically evaluate the object at the correct points to obtain an N-point approxima-

tion to the o-th derivative at a given point.

1.3 Special functions (scipy.special)

The main feature of the scipy.special package is the deﬁnition of numerous special functions of mathematical

physics. Available functions include airy, elliptic, bessel, gamma, beta, hypergeometric, parabolic cylinder, mathieu,

spheroidal wave, struve, and kelvin. There are also some low-level stats functions that are not intended for general

use as an easier interface to these functions is provided by the stats module. Most of these functions can take array

arguments and return array results following the same broadcasting rules as other math functions in Numerical Python.

Many of these functions also accept complex numbers as input. For a complete list of the available functions with a

one-line description type >>> help(special). Each function also has its own documentation accessible using

help. If you don’t see a function you need, consider writing it and contributing it to the library. You can write the

function in either C, Fortran, or Python. Look in the source code of the library for examples of each of these kinds of

functions.

1.3. Special functions (scipy.special) 9

SciPy Reference Guide, Release 0.13.0

1.3.1 Bessel functions of real order(jn,jn_zeros)

Bessel functions are a family of solutions to Bessel’s differential equation with real or complex order alpha:

x2d2y

dx2+xdy

dx + (x2−α2)y= 0

Among other uses, these functions arise in wave propagation problems such as the vibrational modes of a thin drum

head. Here is an example of a circular drum head anchored at the edge:

>>> from scipy import *

>>> from scipy.special import jn, jn_zeros

>>> def drumhead_height(n, k, distance, angle, t):

... nth_zero =jn_zeros(n, k)

... return cos(t)*cos(n*angle)*jn(n, distance*nth_zero)

>>> theta =r_[0:2*pi:50j]

>>> radius =r_[0:1:50j]

>>> x=array([r*cos(theta) for rin radius])

>>> y=array([r*sin(theta) for rin radius])

>>> z=array([drumhead_height(1,1, r, theta, 0.5)for rin radius])

>>> import pylab

>>> from mpl_toolkits.mplot3d import Axes3D

>>> from matplotlib import cm

>>> fig =pylab.figure()

>>> ax =Axes3D(fig)

>>> ax.plot_surface(x, y, z, rstride=1, cstride=1, cmap=cm.jet)

>>> ax.set_xlabel(’X’)

>>> ax.set_ylabel(’Y’)

>>> ax.set_zlabel(’Z’)

>>> pylab.show()

10 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

1.4 Integration (scipy.integrate)

The scipy.integrate sub-package provides several integration techniques including an ordinary differential

equation integrator. An overview of the module is provided by the help command:

>>> help(integrate)

Methods for Integrating Functions given function object.

quad -- General purpose integration.

dblquad -- General purpose double integration.

tplquad -- General purpose triple integration.

fixed_quad -- Integrate func(x) using Gaussian quadrature of order n.

quadrature -- Integrate with given tolerance using Gaussian quadrature.

romberg -- Integrate func using Romberg integration.

Methods for Integrating Functions given fixed samples.

trapz -- Use trapezoidal rule to compute integral from samples.

cumtrapz -- Use trapezoidal rule to cumulatively compute integral.

simps -- Use Simpson’s rule to compute integral from samples.

romb -- Use Romberg Integration to compute integral from

(2**k + 1) evenly-spaced samples.

See the special module’s orthogonal polynomials (special) for Gaussian

quadrature roots and weights for other weighting factors and regions.

Interface to numerical integrators of ODE systems.

odeint -- General integration of ordinary differential equations.

ode -- Integrate ODE using VODE and ZVODE routines.

1.4.1 General integration (quad)

The function quad is provided to integrate a function of one variable between two points. The points can be ±∞ (±

inf) to indicate inﬁnite limits. For example, suppose you wish to integrate a bessel function jv(2.5,x) along the

interval [0,4.5].

I=Z4.5

J2.5(x)dx.

This could be computed using quad:

>>> result =integrate.quad(lambda x: special.jv(2.5,x), 0,4.5)

>>> print result

(1.1178179380783249, 7.8663172481899801e-09)

>>> I=sqrt(2/pi)*(18.0/27*sqrt(2)*cos(4.5)-4.0/27*sqrt(2)*sin(4.5)+

sqrt(2*pi)*special.fresnel(3/sqrt(pi))[0])

>>> print I

1.117817938088701

>>> print abs(result[0]-I)

1.03761443881e-11

The ﬁrst argument to quad is a “callable” Python object (i.e a function, method, or class instance). Notice the use of a

lambda- function in this case as the argument. The next two arguments are the limits of integration. The return value

1.4. Integration (scipy.integrate) 11

SciPy Reference Guide, Release 0.13.0

is a tuple, with the ﬁrst element holding the estimated value of the integral and the second element holding an upper

bound on the error. Notice, that in this case, the true value of this integral is

I=r2

π18

27√2 cos (4.5) −4

27√2 sin (4.5) + √2πSi 3

√π,

where

Si (x) = Zx

sin π

2t2dt.

is the Fresnel sine integral. Note that the numerically-computed integral is within 1.04 ×10−11 of the exact result —

well below the reported error bound.

If the function to integrate takes additional parameters, the can be provided in the args argument. Suppose that the

following integral shall be calculated:

I(a, b) = Z1

ax2+b dx.

This integral can be evaluated by using the following code:

>>> from scipy.integrate import quad

>>> def integrand(x, a, b):

... return a*x+b

>>> a=2

>>> b=1

>>> I=quad(integrand, 0,1, args=(a,b))

>>> I=(2.0,2.220446049250313e-14)

Inﬁnite inputs are also allowed in quad by using ±inf as one of the arguments. For example, suppose that a

numerical value for the exponential integral:

En(x) = Z∞

e−xt

tndt.

is desired (and the fact that this integral can be computed as special.expn(n,x) is forgotten). The functionality

of the function special.expn can be replicated by deﬁning a new function vec_expint based on the routine

quad:

>>> from scipy.integrate import quad

>>> def integrand(t,n,x):

... return exp(-x*t) /t**n

>>> def expint(n,x):

... return quad(integrand, 1, Inf, args=(n, x))[0]

>>> vec_expint =vectorize(expint)

>>> vec_expint(3,arange(1.0,4.0,0.5))

array([ 0.1097, 0.0567, 0.0301, 0.0163, 0.0089, 0.0049])

>>> special.expn(3,arange(1.0,4.0,0.5))

array([ 0.1097, 0.0567, 0.0301, 0.0163, 0.0089, 0.0049])

The function which is integrated can even use the quad argument (though the error bound may underestimate the error

due to possible numerical error in the integrand from the use of quad ). The integral in this case is

In=Z∞

0Z∞

e−xt

tndt dx =1

12 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

>>> result =quad(lambda x: expint(3, x), 0, inf)

>>> print result

(0.33333333324560266, 2.8548934485373678e-09)

>>> I3 =1.0/3.0

>>> print I3

0.333333333333

>>> print I3 -result[0]

8.77306560731e-11

This last example shows that multiple integration can be handled using repeated calls to quad.

1.4.2 General multiple integration (dblquad,tplquad,nquad)

The mechanics for double and triple integration have been wrapped up into the functions dblquad and tplquad.

These functions take the function to integrate and four, or six arguments, respecively. The limits of all inner integrals

need to be deﬁned as functions.

An example of using double integration to compute several values of Inis shown below:

>>> from scipy.integrate import quad, dblquad

>>> def I(n):

... return dblquad(lambda t, x: exp(-x*t)/t**n, 0, Inf, lambda x: 1,lambda x: Inf)

>>> print I(4)

(0.25000000000435768, 1.0518245707751597e-09)

>>> print I(3)

(0.33333333325010883, 2.8604069919261191e-09)

>>> print I(2)

(0.49999999999857514, 1.8855523253868967e-09)

As example for non-constant limits consider the integral

I=Z1/2

y=0 Z1−2y

x=0

xy dx dy =1

96.

This integral can be evaluated using the expression below (Note the use of the non-constant lambda functions for the

upper limit of the inner integral):

>>> from scipy.integrate import dblquad

>>> area =dblquad(lambda x, y: x*y, 0,0.5,lambda x: 0,lambda x: 1-2*x)

>>> area

(0.010416666666666668, 1.1564823173178715e-16)

For n-fold integration, scipy provides the function nquad. The integration bounds are an iterable object: either a

list of constant bounds, or a list of functions for the non-constant integration bounds. The order of integration (and

therefore the bounds) is from the innermost integral to the outermost one.

The integral from above

In=Z∞

0Z∞

e−xt

tndt dx =1

can be calculated as

1.4. Integration (scipy.integrate) 13

SciPy Reference Guide, Release 0.13.0

>>> from scipy import integrate

>>> N=5

>>> def f(t, x):

>>> return np.exp(-x*t) /t**N

>>> integrate.nquad(f, [[1, np.inf],[0, np.inf]])

(0.20000000000002294, 1.2239614263187945e-08)

Note that the order of arguments for fmust match the order of the integration bounds; i.e. the inner integral with

respect to tis on the interval [1,∞]and the outer integral with respect to xis on the interval [0,∞].

Non-constant integration bounds can be treated in a similar manner; the example from above

I=Z1/2

y=0 Z1−2y

x=0

xy dx dy =1

96.

can be evaluated by means of

>>> from scipy import integrate

>>> def f(x,y):

>>> return x*y

>>> def bounds_y():

>>> return [0,0.5]

>>> def bounds_x(y):

>>> return [0,1-2*y]

>>> integrate.nquad(f, [bounds_x, bounds_y])

(0.010416666666666668, 4.101620128472366e-16)

which is the same result as before.

1.4.3 Gaussian quadrature

A few functions are also provided in order to perform simple Gaussian quadrature over a ﬁxed interval. The ﬁrst

is fixed_quad which performs ﬁxed-order Gaussian quadrature. The second function is quadrature which

performs Gaussian quadrature of multiple orders until the difference in the integral estimate is beneath some tolerance

supplied by the user. These functions both use the module special.orthogonal which can calculate the roots

and quadrature weights of a large variety of orthogonal polynomials (the polynomials themselves are available as

special functions returning instances of the polynomial class — e.g. special.legendre).

1.4.4 Romberg Integration

Romberg’s method [WPR] is another method for numerically evaluating an integral. See the help function for

romberg for further details.

1.4.5 Integrating using Samples

If the samples are equally-spaced and the number of samples available is 2k+ 1 for some integer k, then Romberg

romb integration can be used to obtain high-precision estimates of the integral using the available samples. Romberg

integration uses the trapezoid rule at step-sizes related by a power of two and then performs Richardson extrapolation

on these estimates to approximate the integral with a higher-degree of accuracy.

In case of arbitrary spaced samples, the two functions trapz (deﬁned in numpy [NPT]) and simps are available.

They are using Newton-Coates formulas of order 1 and 2 respectively to perform integration. The trapezoidal rule

approximates the function as a straight line between adjacent points, while Simpson’s rule approximates the function

between three adjacent points as a parabola.

14 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

For an odd number of samples that are equally spaced Simpson’s rule is exact if the function is a polynomial of order

3 or less. If the samples are not equally spaced, then the result is exact only if the function is a polynomial of order 2

or less.

>>> from scipy.integrate import simps

>>> import numpy as np

>>> def f(x):

... return x**2

>>> def f2(x):

... return x**3

>>> x=np.array([1,3,4])

>>> y1 =f1(x)

>>> I1 =integrate.simps(y1,x)

>>> print(I1)

21.0

This corresponds exactly to

x2dx = 21,

whereas integrating the second function

>>> y2 =f2(x)

>>> I2 =integrate.simps(y2,x)

>>> print(I2)

61.5

does not correspond to

x3dx = 63.75

because the order of the polynomial in f2 is larger than two.

1.4.6 Ordinary differential equations (odeint)

Integrating a set of ordinary differential equations (ODEs) given initial conditions is another useful example. The

function odeint is available in SciPy for integrating a ﬁrst-order vector differential equation:

dt =f(y, t),

given initial conditions y(0) = y0, where yis a length Nvector and fis a mapping from RNto RN.A higher-order

ordinary differential equation can always be reduced to a differential equation of this type by introducing intermediate

derivatives into the yvector.

For example suppose it is desired to ﬁnd the solution to the following second-order differential equation:

d2w

dz2−zw(z) = 0

with initial conditions w(0) = 1

√32Γ(2

3)and dw

dz z=0 =−1

√3Γ(1

3).It is known that the solution to this differential

equation with these boundary conditions is the Airy function

w=Ai (z),

which gives a means to check the integrator using special.airy.

1.4. Integration (scipy.integrate) 15

SciPy Reference Guide, Release 0.13.0

First, convert this ODE into standard form by setting y=dw

dz , wand t=z. Thus, the differential equation becomes

dt =ty1

y0=0t

1 0  y0

y1=0t

1 0 y.

In other words,

f(y, t) = A(t)y.

As an interesting reminder, if A(t)commutes with Rt

0A(τ)dτ under matrix multiplication, then this linear differen-

tial equation has an exact solution using the matrix exponential:

y(t) = exp Zt

A(τ)dτy(0) ,

However, in this case, A(t)and its integral do not commute.

There are many optional inputs and outputs available when using odeint which can help tune the solver. These ad-

ditional inputs and outputs are not needed much of the time, however, and the three required input arguments and

the output solution sufﬁce. The required inputs are the function deﬁning the derivative, fprime, the initial conditions

vector, y0, and the time points to obtain a solution, t, (with the initial value point as the ﬁrst element of this sequence).

The output to odeint is a matrix where each row contains the solution vector at each requested time point (thus, the

initial conditions are given in the ﬁrst output row).

The following example illustrates the use of odeint including the usage of the Dfun option which allows the user to

specify a gradient (with respect to y) of the function, f(y, t).

>>> from scipy.integrate import odeint

>>> from scipy.special import gamma, airy

>>> y1_0 =1.0/3**(2.0/3.0)/gamma(2.0/3.0)

>>> y0_0 = -1.0/3**(1.0/3.0)/gamma(1.0/3.0)

>>> y0 =[y0_0, y1_0]

>>> def func(y, t):

... return [t*y[1],y[0]]

>>> def gradient(y,t):

... return [[0,t],[1,0]]

>>> x=arange(0,4.0,0.01)

>>> t=x

>>> ychk =airy(x)[0]

>>> y=odeint(func, y0, t)

>>> y2 =odeint(func, y0, t, Dfun=gradient)

>>> print ychk[:36:6]

[ 0.355028 0.339511 0.324068 0.308763 0.293658 0.278806]

>>> print y[:36:6,1]

[ 0.355028 0.339511 0.324067 0.308763 0.293658 0.278806]

>>> print y2[:36:6,1]

[ 0.355028 0.339511 0.324067 0.308763 0.293658 0.278806]

References

1.5 Optimization (scipy.optimize)

The scipy.optimize package provides several commonly used optimization algorithms. A detailed listing is

available: scipy.optimize (can also be found by help(scipy.optimize)).

16 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

The module contains:

1. Unconstrained and constrained minimization of multivariate scalar functions (minimize) using a variety of

algorithms (e.g. BFGS, Nelder-Mead simplex, Newton Conjugate Gradient, COBYLA or SLSQP)

2. Global (brute-force) optimization routines (e.g., anneal,basinhopping)

3. Least-squares minimization (leastsq) and curve ﬁtting (curve_fit) algorithms

4. Scalar univariate functions minimizers (minimize_scalar) and root ﬁnders (newton)

5. Multivariate equation system solvers (root) using a variety of algorithms (e.g. hybrid Powell, Levenberg-

Marquardt or large-scale methods such as Newton-Krylov).

Below, several examples demonstrate their basic usage.

1.5.1 Unconstrained minimization of multivariate scalar functions (minimize)

The minimize function provides a common interface to unconstrained and constrained minimization algorithms for

multivariate scalar functions in scipy.optimize. To demonstrate the minimization function consider the problem

of minimizing the Rosenbrock function of Nvariables:

f(x) =

N−1

i=1

100 xi−x2

i−12+ (1 −xi−1)2.

The minimum value of this function is 0 which is achieved when xi= 1.

Note that the Rosenbrock function and its derivatives are included in scipy.optimize. The implementations

shown in the following sections provide examples of how to deﬁne an objective function as well as its jacobian and

hessian functions.

Nelder-Mead Simplex algorithm (method=’Nelder-Mead’)

In the example below, the minimize routine is used with the Nelder-Mead simplex algorithm (selected through the

method parameter):

>>> import numpy as np

>>> from scipy.optimize import minimize

>>> def rosen(x):

... """The Rosenbrock function"""

... return sum(100.0*(x[1:]-x[:-1]**2.0)**2.0 +(1-x[:-1])**2.0)

>>> x0 =np.array([1.3,0.7,0.8,1.9,1.2])

>>> res =minimize(rosen, x0, method=’nelder-mead’,

... options={’xtol’:1e-8,’disp’:True})

Optimization terminated successfully.

Current function value: 0.000000

Iterations: 339

Function evaluations: 571

>>> print(res.x)

[ 1. 1. 1. 1. 1.]

The simplex algorithm is probably the simplest way to minimize a fairly well-behaved function. It requires only

function evaluations and is a good choice for simple minimization problems. However, because it does not use any

gradient evaluations, it may take longer to ﬁnd the minimum.

1.5. Optimization (scipy.optimize) 17

SciPy Reference Guide, Release 0.13.0

Another optimization algorithm that needs only function calls to ﬁnd the minimum is Powell‘s method available by

setting method=’powell’ in minimize.

Broyden-Fletcher-Goldfarb-Shanno algorithm (method=’BFGS’)

In order to converge more quickly to the solution, this routine uses the gradient of the objective function. If the gradient

is not given by the user, then it is estimated using ﬁrst-differences. The Broyden-Fletcher-Goldfarb-Shanno (BFGS)

method typically requires fewer function calls than the simplex algorithm even when the gradient must be estimated.

To demonstrate this algorithm, the Rosenbrock function is again used. The gradient of the Rosenbrock function is the

vector:

∂f

∂xj

i=1

200 xi−x2

i−1(δi,j −2xi−1δi−1,j )−2 (1 −xi−1)δi−1,j .

= 200 xj−x2

j−1−400xjxj+1 −x2

j−2 (1 −xj).

This expression is valid for the interior derivatives. Special cases are

∂f

∂x0

=−400x0x1−x2

0−2 (1 −x0),

∂f

∂xN−1

= 200 xN−1−x2

N−2.

A Python function which computes this gradient is constructed by the code-segment:

>>> def rosen_der(x):

... xm =x[1:-1]

... xm_m1 =x[:-2]

... xm_p1 =x[2:]

... der =np.zeros_like(x)

... der[1:-1]=200*(xm-xm_m1**2)-400*(xm_p1 -xm**2)*xm -2*(1-xm)

... der[0]= -400*x[0]*(x[1]-x[0]**2)-2*(1-x[0])

... der[-1]=200*(x[-1]-x[-2]**2)

... return der

This gradient information is speciﬁed in the minimize function through the jac parameter as illustrated below.

>>> res =minimize(rosen, x0, method=’BFGS’, jac=rosen_der,

... options={’disp’:True})

Optimization terminated successfully.

Current function value: 0.000000

Iterations: 51

Function evaluations: 63

Gradient evaluations: 63

>>> print(res.x)

[ 1. 1. 1. 1. 1.]

Newton-Conjugate-Gradient algorithm (method=’Newton-CG’)

The method which requires the fewest function calls and is therefore often the fastest method to minimize functions

of many variables uses the Newton-Conjugate Gradient algorithm. This method is a modiﬁed Newton’s method and

uses a conjugate gradient algorithm to (approximately) invert the local Hessian. Newton’s method is based on ﬁtting

the function locally to a quadratic form:

f(x)≈f(x0) + ∇f(x0)·(x−x0) + 1

2(x−x0)TH(x0) (x−x0).

18 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

where H(x0)is a matrix of second-derivatives (the Hessian). If the Hessian is positive deﬁnite then the local minimum

of this function can be found by setting the gradient of the quadratic form to zero, resulting in

xopt =x0−H−1∇f.

The inverse of the Hessian is evaluated using the conjugate-gradient method. An example of employing this method

to minimizing the Rosenbrock function is given below. To take full advantage of the Newton-CG method, a function

which computes the Hessian must be provided. The Hessian matrix itself does not need to be constructed, only a

vector which is the product of the Hessian with an arbitrary vector needs to be available to the minimization routine.

As a result, the user can provide either a function to compute the Hessian matrix, or a function to compute the product

of the Hessian with an arbitrary vector.

Full Hessian example:

The Hessian of the Rosenbrock function is

Hij =∂2f

∂xi∂xj

= 200 (δi,j −2xi−1δi−1,j )−400xi(δi+1,j −2xiδi,j )−400δi,j xi+1 −x2

i+ 2δi,j ,

=202 + 1200x2

i−400xi+1δi,j −400xiδi+1,j −400xi−1δi−1,j ,

if i, j ∈[1, N −2] with i, j ∈[0, N −1] deﬁning the N×Nmatrix. Other non-zero entries of the matrix are

∂2f

∂x2

= 1200x2

0−400x1+ 2,

∂2f

∂x0∂x1

=∂2f

∂x1∂x0

=−400x0,

∂2f

∂xN−1∂xN−2

=∂2f

∂xN−2∂xN−1

=−400xN−2,

∂2f

∂x2

N−1

= 200.

For example, the Hessian when N= 5 is

H=





1200x2

0−400x1+ 2 −400x00 0 0

−400x0202 + 1200x2

1−400x2−400x10 0

0−400x1202 + 1200x2

2−400x3−400x20

0−400x2202 + 1200x2

3−400x4−400x3

0 0 0 −400x3200







The code which computes this Hessian along with the code to minimize the function using Newton-CG method is

shown in the following example:

>>> def rosen_hess(x):

... x=np.asarray(x)

... H=np.diag(-400*x[:-1],1)-np.diag(400*x[:-1],-1)

... diagonal =np.zeros_like(x)

... diagonal[0]=1200*x[0]**2-400*x[1]+2

... diagonal[-1]=200

... diagonal[1:-1]=202 +1200*x[1:-1]**2-400*x[2:]

... H=H+np.diag(diagonal)

... return H

>>> res =minimize(rosen, x0, method=’Newton-CG’,

... jac=rosen_der, hess=rosen_hess,

... options={’avextol’:1e-8,’disp’:True})

Optimization terminated successfully.

Current function value: 0.000000

1.5. Optimization (scipy.optimize) 19

SciPy Reference Guide, Release 0.13.0

Iterations: 19

Function evaluations: 22

Gradient evaluations: 19

Hessian evaluations: 19

>>> print(res.x)

[ 1. 1. 1. 1. 1.]

Hessian product example:

For larger minimization problems, storing the entire Hessian matrix can consume considerable time and memory. The

Newton-CG algorithm only needs the product of the Hessian times an arbitrary vector. As a result, the user can supply

code to compute this product rather than the full Hessian by giving a hess function which take the minimization

vector as the ﬁrst argument and the arbitrary vector as the second argument (along with extra arguments passed to the

function to be minimized). If possible, using Newton-CG with the Hessian product option is probably the fastest way

to minimize the function.

In this case, the product of the Rosenbrock Hessian with an arbitrary vector is not difﬁcult to compute. If pis the

arbitrary vector, then H(x)phas elements:

H(x)p=







1200x2

0−400x1+ 2p0−400x0p1

−400xi−1pi−1+202 + 1200x2

i−400xi+1pi−400xipi+1

−400xN−2pN−2+ 200pN−1







Code which makes use of this Hessian product to minimize the Rosenbrock function using minimize follows:

>>> def rosen_hess_p(x,p):

... x=np.asarray(x)

... Hp =np.zeros_like(x)

... Hp[0]=(1200*x[0]**2-400*x[1]+2)*p[0]-400*x[0]*p[1]

... Hp[1:-1]= -400*x[:-2]*p[:-2]+(202+1200*x[1:-1]**2-400*x[2:])*p[1:-1] \

... -400*x[1:-1]*p[2:]

... Hp[-1]= -400*x[-2]*p[-2]+200*p[-1]

... return Hp

>>> res =minimize(rosen, x0, method=’Newton-CG’,

... jac=rosen_der, hess=rosen_hess_p,

... options={’avextol’:1e-8,’disp’:True})

Optimization terminated successfully.

Current function value: 0.000000

Iterations: 20

Function evaluations: 23

Gradient evaluations: 20

Hessian evaluations: 44

>>> print(res.x)

[ 1. 1. 1. 1. 1.]

1.5.2 Constrained minimization of multivariate scalar functions (minimize)

The minimize function also provides an interface to several constrained minimization algorithm. As an example,

the Sequential Least SQuares Programming optimization algorithm (SLSQP) will be considered here. This algorithm

20 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

allows to deal with constrained minimization problems of the form:

min F(x)

subject to Cj(X) = 0, j = 1, ..., MEQ

Cj(x)≥0, j =MEQ + 1, ..., M

XL ≤x≤XU, I = 1, ..., N.

As an example, let us consider the problem of maximizing the function:

f(x, y)=2xy + 2x−x2−2y2

subject to an equality and an inequality constraints deﬁned as:

x3−y= 0

y−1≥0

The objective function and its derivative are deﬁned as follows.

>>> def func(x, sign=1.0):

... """ Objective function """

... return sign*(2*x[0]*x[1]+2*x[0]-x[0]**2-2*x[1]**2)

>>> def func_deriv(x, sign=1.0):

... """ Derivative of objective function """

... dfdx0 =sign*(-2*x[0]+2*x[1]+2)

... dfdx1 =sign*(2*x[0]-4*x[1])

... return np.array([ dfdx0, dfdx1 ])

Note that since minimize only minimizes functions, the sign parameter is introduced to multiply the objective

function (and its derivative by -1) in order to perform a maximization.

Then constraints are deﬁned as a sequence of dictionaries, with keys type,fun and jac.

>>> cons =({’type’:’eq’,

... ’fun’ :lambda x: np.array([x[0]**3-x[1]]),

... ’jac’ :lambda x: np.array([3.0*(x[0]**2.0), -1.0])},

... {’type’:’ineq’,

... ’fun’ :lambda x: np.array([x[1]-1]),

... ’jac’ :lambda x: np.array([0.0,1.0])})

Now an unconstrained optimization can be performed as:

>>> res =minimize(func, [-1.0,1.0], args=(-1.0,), jac=func_deriv,

... method=’SLSQP’, options={’disp’:True})

Optimization terminated successfully. (Exit mode 0)

Current function value: -2.0

Iterations: 4

Function evaluations: 5

Gradient evaluations: 4

>>> print(res.x)

[ 2. 1.]

and a constrained optimization as:

>>> res =minimize(func, [-1.0,1.0], args=(-1.0,), jac=func_deriv,

... constraints=cons, method=’SLSQP’, options={’disp’:True})

Optimization terminated successfully. (Exit mode 0)

Current function value: -1.00000018311

Iterations: 9

1.5. Optimization (scipy.optimize) 21

SciPy Reference Guide, Release 0.13.0

Function evaluations: 14

Gradient evaluations: 9

>>> print(res.x)

[ 1.00000009 1. ]

1.5.3 Least-square ﬁtting (leastsq)

All of the previously-explained minimization procedures can be used to solve a least-squares problem provided the

appropriate objective function is constructed. For example, suppose it is desired to ﬁt a set of data {xi,yi}to a known

model, y=f(x,p)where pis a vector of parameters for the model that need to be found. A common method for

determining which parameter vector gives the best ﬁt to the data is to minimize the sum of squares of the residuals.

The residual is usually deﬁned for each observed data-point as

ei(p,yi,xi) = kyi−f(xi,p)k.

An objective function to pass to any of the previous minization algorithms to obtain a least-squares ﬁt is.

J(p) =

N−1

i=0

i(p).

The leastsq algorithm performs this squaring and summing of the residuals automatically. It takes as an input

argument the vector function e(p)and returns the value of pwhich minimizes J(p) = eTedirectly. The user is also

encouraged to provide the Jacobian matrix of the function (with derivatives down the columns or across the rows). If

the Jacobian is not provided, it is estimated.

An example should clarify the usage. Suppose it is believed some measured data follow a sinusoidal pattern

yi=Asin (2πkxi+θ)

where the parameters A, k , and θare unknown. The residual vector is

ei=|yi−Asin (2πkxi+θ)|.

By deﬁning a function to compute the residuals and (selecting an appropriate starting position), the least-squares ﬁt

routine can be used to ﬁnd the best-ﬁt parameters ˆ

A, ˆ

k, ˆ

θ. This is shown in the following example:

>>> from numpy import *

>>> x=arange(0,6e-2,6e-2/30)

>>> A,k,theta =10,1.0/3e-2, pi/6

>>> y_true =A*sin(2*pi*k*x+theta)

>>> y_meas =y_true +2*random.randn(len(x))

>>> def residuals(p, y, x):

... A,k,theta =p

... err =y-A*sin(2*pi*k*x+theta)

... return err

>>> def peval(x, p):

... return p[0]*sin(2*pi*p[1]*x+p[2])

>>> p0 =[8,1/2.3e-2, pi/3]

>>> print(array(p0))

[ 8. 43.4783 1.0472]

>>> from scipy.optimize import leastsq

>>> plsq =leastsq(residuals, p0, args=(y_meas, x))

>>> print(plsq[0])

[ 10.9437 33.3605 0.5834]

22 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

>>> print(array([A, k, theta]))

[ 10. 33.3333 0.5236]

>>> import matplotlib.pyplot as plt

>>> plt.plot(x,peval(x,plsq[0]),x,y_meas,’o’,x,y_true)

>>> plt.title(’Least-squares fit to noisy data’)

>>> plt.legend([’Fit’,’Noisy’,’True’])

>>> plt.show()

1.5.4 Univariate function minimizers (minimize_scalar)

Often only the minimum of an univariate function (i.e. a function that takes a scalar as input) is needed. In these

circumstances, other optimization techniques have been developed that can work faster. These are accessible from the

minimize_scalar function which proposes several algorithms.

Unconstrained minimization (method=’brent’)

There are actually two methods that can be used to minimize an univariate function: brent and golden, but

golden is included only for academic purposes and should rarely be used. These can be respectively selected through

the method parameter in minimize_scalar. The brent method uses Brent’s algorithm for locating a minimum.

Optimally a bracket (the bs parameter) should be given which contains the minimum desired. A bracket is a triple

(a, b, c)such that f(a)> f (b)< f (c)and a<b<c. If this is not given, then alternatively two starting points can

be chosen and a bracket will be found from these points using a simple marching algorithm. If these two starting points

are not provided 0and 1will be used (this may not be the right choice for your function and result in an unexpected

minimum being returned).

Here is an example:

>>> from scipy.optimize import minimize_scalar

>>> f=lambda x: (x -2)*(x +1)**2

>>> res =minimize_scalar(f, method=’brent’)

>>> print(res.x)

1.0

1.5. Optimization (scipy.optimize) 23

SciPy Reference Guide, Release 0.13.0

Bounded minimization (method=’bounded’)

Very often, there are constraints that can be placed on the solution space before minimization occurs. The bounded

method in minimize_scalar is an example of a constrained minimization procedure that provides a rudimentary

interval constraint for scalar functions. The interval constraint allows the minimization to occur only between two

ﬁxed endpoints, speciﬁed using the mandatory bs parameter.

For example, to ﬁnd the minimum of J1(x)near x= 5 ,minimize_scalar can be called using the interval [4,7]

as a constraint. The result is xmin = 5.3314 :

>>> from scipy.special import j1

>>> res =minimize_scalar(j1, bs=(4,7), method=’bounded’)

>>> print(res.x)

5.33144184241

1.5.5 Root ﬁnding

Scalar functions

If one has a single-variable equation, there are four different root ﬁnding algorithms that can be tried. Each of these

algorithms requires the endpoints of an interval in which a root is expected (because the function changes signs). In

general brentq is the best choice, but the other methods may be useful in certain circumstances or for academic

purposes.

Fixed-point solving

A problem closely related to ﬁnding the zeros of a function is the problem of ﬁnding a ﬁxed-point of a function. A

ﬁxed point of a function is the point at which evaluation of the function returns the point: g(x) = x. Clearly the ﬁxed

point of gis the root of f(x) = g(x)−x. Equivalently, the root of fis the ﬁxed_point of g(x) = f(x) + x. The

routine fixed_point provides a simple iterative method using Aitkens sequence acceleration to estimate the ﬁxed

point of ggiven a starting point.

Sets of equations

Finding a root of a set of non-linear equations can be achieve using the root function. Several methods are available,

amongst which hybr (the default) and lm which respectively use the hybrid method of Powell and the Levenberg-

Marquardt method from MINPACK.

The following example considers the single-variable transcendental equation

x+ 2 cos (x) = 0,

a root of which can be found as follows:

>>> import numpy as np

>>> from scipy.optimize import root

>>> def func(x):

... return x+2*np.cos(x)

>>> sol =root(func, 0.3)

>>> sol.x

array([-1.02986653])

>>> sol.fun

array([ -6.66133815e-16])

24 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

Consider now a set of non-linear equations

x0cos (x1)=4,

x0x1−x1= 5.

We deﬁne the objective function so that it also returns the Jacobian and indicate this by setting the jac parameter to

True. Also, the Levenberg-Marquardt solver is used here.

>>> def func2(x):

... f=[x[0]*np.cos(x[1]) -4,

... x[1]*x[0]-x[1]-5]

... df =np.array([[np.cos(x[1]), -x[0]*np.sin(x[1])],

... [x[1], x[0]-1]])

... return f, df

>>> sol =root(func2, [1,1], jac=True, method=’lm’)

>>> sol.x

array([ 6.50409711, 0.90841421])

Root ﬁnding for large problems

Methods hybr and lm in root cannot deal with a very large number of variables (N), as they need to calculate and

invert a dense NxNJacobian matrix on every Newton step. This becomes rather inefﬁcient when Ngrows.

Consider for instance the following problem: we need to solve the following integrodifferential equation on the square

[0,1] ×[0,1]:

(∂2

x+∂2

y)P+ 5 Z1

0Z1

cosh(P)dx dy2

= 0

with the boundary condition P(x, 1) = 1 on the upper edge and P= 0 elsewhere on the boundary of the square. This

can be done by approximating the continuous function Pby its values on a grid, Pn,m ≈P(nh, mh), with a small

grid spacing h. The derivatives and integrals can then be approximated; for instance ∂2

xP(x, y)≈(P(x+h, y)−

2P(x, y) + P(x−h, y))/h2. The problem is then equivalent to ﬁnding the root of some function residual(P),

where Pis a vector of length NxNy.

Now, because NxNycan be large, methods hybr or lm in root will take a long time to solve this problem. The

solution can however be found using one of the large-scale solvers, for example krylov,broyden2, or anderson.

These use what is known as the inexact Newton method, which instead of computing the Jacobian matrix exactly, forms

an approximation for it.

The problem we have can now be solved as follows:

import numpy as np

from scipy.optimize import root

from numpy import cosh, zeros_like, mgrid, zeros

# parameters

nx, ny =75,75

hx, hy =1./(nx-1), 1./(ny-1)

P_left, P_right =0,0

P_top, P_bottom =1,0

def residual(P):

d2x =zeros_like(P)

d2y =zeros_like(P)

1.5. Optimization (scipy.optimize) 25

SciPy Reference Guide, Release 0.13.0

d2x[1:-1]=(P[2:] -2*P[1:-1]+P[:-2]) /hx/hx

d2x[0]=(P[1]-2*P[0]+P_left)/hx/hx

d2x[-1]=(P_right -2*P[-1]+P[-2])/hx/hx

d2y[:,1:-1]=(P[:,2:] -2*P[:,1:-1]+P[:,:-2])/hy/hy

d2y[:,0]=(P[:,1]-2*P[:,0]+P_bottom)/hy/hy

d2y[:,-1]=(P_top -2*P[:,-1]+P[:,-2])/hy/hy

return d2x +d2y +5*cosh(P).mean()**2

# solve

guess =zeros((nx, ny), float)

sol =root(residual, guess, method=’krylov’, options={’disp’:True})

#sol = root(residual, guess, method=’broyden2’, options={’disp’: True, ’max_rank’: 50})

#sol = root(residual, guess, method=’anderson’, options={’disp’: True, ’M’: 10})

print(’Residual: %g’%abs(residual(sol.x)).max())

# visualize

import matplotlib.pyplot as plt

x,y=mgrid[0:1:(nx*1j), 0:1:(ny*1j)]

plt.pcolor(x, y, sol.x)

plt.colorbar()

plt.show()

Still too slow? Preconditioning.

When looking for the zero of the functions fi(x)=0,i = 1, 2, ..., N, the krylov solver spends most of its time

inverting the Jacobian matrix,

Jij =∂fi

∂xj

If you have an approximation for the inverse matrix M≈J−1, you can use it for preconditioning the linear inversion

problem. The idea is that instead of solving Js=yone solves MJs=My: since matrix MJ is “closer” to the

identity matrix than Jis, the equation should be easier for the Krylov method to deal with.

26 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

The matrix Mcan be passed to root with method krylov as an op-

tion options[’jac_options’][’inner_M’]. It can be a (sparse) matrix or a

scipy.sparse.linalg.LinearOperator instance.

For the problem in the previous section, we note that the function to solve consists of two parts: the ﬁrst one is

application of the Laplace operator, [∂2

x+∂2

y]P, and the second is the integral. We can actually easily compute the

Jacobian corresponding to the Laplace operator part: we know that in one dimension

∂2

x≈1







−2 1 0 0 ···

1−210···

0 1 −2 1 ···

. . .







=h−2

so that the whole 2-D operator is represented by

J1=∂2

x+∂2

y'h−2

xL⊗I+h−2

yI⊗L

The matrix J2of the Jacobian corresponding to the integral is more difﬁcult to calculate, and since all of it entries

are nonzero, it will be difﬁcult to invert. J1on the other hand is a relatively simple matrix, and can be inverted by

scipy.sparse.linalg.splu (or the inverse can be approximated by scipy.sparse.linalg.spilu).

So we are content to take M≈J−1

1and hope for the best.

In the example below, we use the preconditioner M=J−1

import numpy as np

from scipy.optimize import root

from scipy.sparse import spdiags, kron

from scipy.sparse.linalg import spilu, LinearOperator

from numpy import cosh, zeros_like, mgrid, zeros, eye

# parameters

nx, ny =75,75

hx, hy =1./(nx-1), 1./(ny-1)

P_left, P_right =0,0

P_top, P_bottom =1,0

def get_preconditioner():

"""Compute the preconditioner M"""

diags_x =zeros((3, nx))

diags_x[0,:] =1/hx/hx

diags_x[1,:] = -2/hx/hx

diags_x[2,:] =1/hx/hx

Lx =spdiags(diags_x, [-1,0,1], nx, nx)

diags_y =zeros((3, ny))

diags_y[0,:] =1/hy/hy

diags_y[1,:] = -2/hy/hy

diags_y[2,:] =1/hy/hy

Ly =spdiags(diags_y, [-1,0,1], ny, ny)

J1 =kron(Lx, eye(ny)) +kron(eye(nx), Ly)

# Now we have the matrix ‘J_1‘. We need to find its inverse ‘M‘ --

# however, since an approximate inverse is enough, we can use

# the *incomplete LU*decomposition

J1_ilu =spilu(J1)

1.5. Optimization (scipy.optimize) 27

SciPy Reference Guide, Release 0.13.0

# This returns an object with a method .solve() that evaluates

# the corresponding matrix-vector product. We need to wrap it into

# a LinearOperator before it can be passed to the Krylov methods:

M=LinearOperator(shape=(nx*ny, nx*ny), matvec=J1_ilu.solve)

return M

def solve(preconditioning=True):

"""Compute the solution"""

count =[0]

def residual(P):

count[0]+= 1

d2x =zeros_like(P)

d2y =zeros_like(P)

d2x[1:-1]=(P[2:] -2*P[1:-1]+P[:-2])/hx/hx

d2x[0]=(P[1]-2*P[0]+P_left)/hx/hx

d2x[-1]=(P_right -2*P[-1]+P[-2])/hx/hx

d2y[:,1:-1]=(P[:,2:] -2*P[:,1:-1]+P[:,:-2])/hy/hy

d2y[:,0]=(P[:,1]-2*P[:,0]+P_bottom)/hy/hy

d2y[:,-1]=(P_top -2*P[:,-1]+P[:,-2])/hy/hy

return d2x +d2y +5*cosh(P).mean()**2

# preconditioner

if preconditioning:

M=get_preconditioner()

else:

M=None

# solve

guess =zeros((nx, ny), float)

sol =root(residual, guess, method=’krylov’,

options={’disp’:True,

’jac_options’: {’inner_M’: M}})

print ’Residual’,abs(residual(sol.x)).max()

print ’Evaluations’, count[0]

return sol.x

def main():

sol =solve(preconditioning=True)

# visualize

import matplotlib.pyplot as plt

x,y=mgrid[0:1:(nx*1j), 0:1:(ny*1j)]

plt.clf()

plt.pcolor(x, y, sol)

plt.clim(0,1)

plt.colorbar()

plt.show()

if __name__ == "__main__":

main()

28 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

Resulting run, ﬁrst without preconditioning:

0: |F(x)| = 803.614; step 1; tol 0.000257947

1: |F(x)| = 345.912; step 1; tol 0.166755

2: |F(x)| = 139.159; step 1; tol 0.145657

3: |F(x)| = 27.3682; step 1; tol 0.0348109

4: |F(x)| = 1.03303; step 1; tol 0.00128227

5: |F(x)| = 0.0406634; step 1; tol 0.00139451

6: |F(x)| = 0.00344341; step 1; tol 0.00645373

7: |F(x)| = 0.000153671; step 1; tol 0.00179246

8: |F(x)| = 6.7424e-06; step 1; tol 0.00173256

Residual 3.57078908664e-07

Evaluations 317

and then with preconditioning:

0: |F(x)| = 136.993; step 1; tol 7.49599e-06

1: |F(x)| = 4.80983; step 1; tol 0.00110945

2: |F(x)| = 0.195942; step 1; tol 0.00149362

3: |F(x)| = 0.000563597; step 1; tol 7.44604e-06

4: |F(x)| = 1.00698e-09; step 1; tol 2.87308e-12

Residual 9.29603061195e-11

Evaluations 77

Using a preconditioner reduced the number of evaluations of the residual function by a factor of 4. For problems

where the residual is expensive to compute, good preconditioning can be crucial — it can even decide whether the

problem is solvable in practice or not.

Preconditioning is an art, science, and industry. Here, we were lucky in making a simple choice that worked reasonably

well, but there is a lot more depth to this topic than is shown here.

References

Some further reading and related software:

1.6 Interpolation (scipy.interpolate)

Contents

•Interpolation (scipy.interpolate)

–1-D interpolation (interp1d)

–Multivariate data interpolation (griddata)

–Spline interpolation

*Spline interpolation in 1-d: Procedural (interpolate.splXXX)

*Spline interpolation in 1-d: Object-oriented (UnivariateSpline)

*Two-dimensional spline representation: Procedural (bisplrep)

*Two-dimensional spline representation: Object-oriented (BivariateSpline)

–Using radial basis functions for smoothing/interpolation

*1-d Example

*2-d Example

There are several general interpolation facilities available in SciPy, for data in 1, 2, and higher dimensions:

• A class representing an interpolant (interp1d) in 1-D, offering several interpolation methods.

1.6. Interpolation (scipy.interpolate) 29

SciPy Reference Guide, Release 0.13.0

• Convenience function griddata offering a simple interface to interpolation in N dimensions (N = 1, 2, 3, 4,

...). Object-oriented interface for the underlying routines is also available.

• Functions for 1- and 2-dimensional (smoothed) cubic-spline interpolation, based on the FORTRAN library

FITPACK. There are both procedural and object-oriented interfaces for the FITPACK library.

• Interpolation using Radial Basis Functions.

1.6.1 1-D interpolation (interp1d)

The interp1d class in scipy.interpolate is a convenient method to create a function based on ﬁxed data points which can

be evaluated anywhere within the domain deﬁned by the given data using linear interpolation. An instance of this class

is created by passing the 1-d vectors comprising the data. The instance of this class deﬁnes a __call__ method and

can therefore by treated like a function which interpolates between known data values to obtain unknown values (it

also has a docstring for help). Behavior at the boundary can be speciﬁed at instantiation time. The following example

demonstrates its use, for linear and cubic spline interpolation:

>>> from scipy.interpolate import interp1d

>>> x=np.linspace(0,10,10)

>>> y=np.cos(-x**2/8.0)

>>> f=interp1d(x, y)

>>> f2 =interp1d(x, y, kind=’cubic’)

>>> xnew =np.linspace(0,10,40)

>>> import matplotlib.pyplot as plt

>>> plt.plot(x,y,’o’,xnew,f(xnew),’-’, xnew, f2(xnew),’--’)

>>> plt.legend([’data’,’linear’,’cubic’], loc=’best’)

>>> plt.show()

1.6.2 Multivariate data interpolation (griddata)

Suppose you have multidimensional data, for instance for an underlying function f(x, y) you only know the values at

points (x[i], y[i]) that do not form a regular grid.

Suppose we want to interpolate the 2-D function

30 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

>>> def func(x, y):

>>> return x*(1-x)*np.cos(4*np.pi*x) *np.sin(4*np.pi*y**2)**2

on a grid in [0, 1]x[0, 1]

>>> grid_x, grid_y =np.mgrid[0:1:100j,0:1:200j]

but we only know its values at 1000 data points:

>>> points =np.random.rand(1000,2)

>>> values =func(points[:,0], points[:,1])

This can be done with griddata – below we try out all of the interpolation methods:

>>> from scipy.interpolate import griddata

>>> grid_z0 =griddata(points, values, (grid_x, grid_y), method=’nearest’)

>>> grid_z1 =griddata(points, values, (grid_x, grid_y), method=’linear’)

>>> grid_z2 =griddata(points, values, (grid_x, grid_y), method=’cubic’)

One can see that the exact result is reproduced by all of the methods to some degree, but for this smooth function the

piecewise cubic interpolant gives the best results:

>>> import matplotlib.pyplot as plt

>>> plt.subplot(221)

>>> plt.imshow(func(grid_x, grid_y).T, extent=(0,1,0,1), origin=’lower’)

>>> plt.plot(points[:,0], points[:,1], ’k.’, ms=1)

>>> plt.title(’Original’)

>>> plt.subplot(222)

>>> plt.imshow(grid_z0.T, extent=(0,1,0,1), origin=’lower’)

>>> plt.title(’Nearest’)

>>> plt.subplot(223)

>>> plt.imshow(grid_z1.T, extent=(0,1,0,1), origin=’lower’)

>>> plt.title(’Linear’)

>>> plt.subplot(224)

>>> plt.imshow(grid_z2.T, extent=(0,1,0,1), origin=’lower’)

>>> plt.title(’Cubic’)

>>> plt.gcf().set_size_inches(6,6)

>>> plt.show()

1.6. Interpolation (scipy.interpolate) 31

SciPy Reference Guide, Release 0.13.0

1.6.3 Spline interpolation

Spline interpolation in 1-d: Procedural (interpolate.splXXX)

Spline interpolation requires two essential steps: (1) a spline representation of the curve is computed, and (2) the spline

is evaluated at the desired points. In order to ﬁnd the spline representation, there are two different ways to represent

a curve and obtain (smoothing) spline coefﬁcients: directly and parametrically. The direct method ﬁnds the spline

representation of a curve in a two- dimensional plane using the function splrep. The ﬁrst two arguments are the

only ones required, and these provide the xand ycomponents of the curve. The normal output is a 3-tuple, (t, c, k),

containing the knot-points, t, the coefﬁcients cand the order kof the spline. The default spline order is cubic, but this

can be changed with the input keyword, k.

For curves in N-dimensional space the function splprep allows deﬁning the curve parametrically. For this function

only 1 input argument is required. This input is a list of N-arrays representing the curve in N-dimensional space. The

length of each array is the number of curve points, and each array provides one component of the N-dimensional data

point. The parameter variable is given with the keword argument, u, which defaults to an equally-spaced monotonic

32 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

sequence between 0and 1. The default output consists of two objects: a 3-tuple, (t, c, k), containing the spline

representation and the parameter variable u.

The keyword argument, s, is used to specify the amount of smoothing to perform during the spline ﬁt. The default

value of sis s=m−√2mwhere mis the number of data-points being ﬁt. Therefore, if no smoothing is desired a

value of s= 0 should be passed to the routines.

Once the spline representation of the data has been determined, functions are available for evaluating the spline

(splev) and its derivatives (splev,spalde) at any point and the integral of the spline between any two points

(splint). In addition, for cubic splines ( k= 3 ) with 8 or more knots, the roots of the spline can be estimated (

sproot). These functions are demonstrated in the example that follows.

>>> import numpy as np

>>> import matplotlib.pyplot as plt

>>> from scipy import interpolate

Cubic-spline

>>> x=np.arange(0,2*np.pi+np.pi/4,2*np.pi/8)

>>> y=np.sin(x)

>>> tck =interpolate.splrep(x,y,s=0)

>>> xnew =np.arange(0,2*np.pi,np.pi/50)

>>> ynew =interpolate.splev(xnew,tck,der=0)

>>> plt.figure()

>>> plt.plot(x,y,’x’,xnew,ynew,xnew,np.sin(xnew),x,y,’b’)

>>> plt.legend([’Linear’,’Cubic Spline’,’True’])

>>> plt.axis([-0.05,6.33,-1.05,1.05])

>>> plt.title(’Cubic-spline interpolation’)

>>> plt.show()

Derivative of spline

>>> yder =interpolate.splev(xnew,tck,der=1)

>>> plt.figure()

>>> plt.plot(xnew,yder,xnew,np.cos(xnew),’--’)

>>> plt.legend([’Cubic Spline’,’True’])

>>> plt.axis([-0.05,6.33,-1.05,1.05])

>>> plt.title(’Derivative estimation from spline’)

1.6. Interpolation (scipy.interpolate) 33

SciPy Reference Guide, Release 0.13.0

>>> plt.show()

Integral of spline

>>> def integ(x,tck,constant=-1):

>>> x=np.atleast_1d(x)

>>> out =np.zeros(x.shape, dtype=x.dtype)

>>> for nin xrange(len(out)):

>>> out[n] =interpolate.splint(0,x[n],tck)

>>> out += constant

>>> return out

>>>

>>> yint =integ(xnew,tck)

>>> plt.figure()

>>> plt.plot(xnew,yint,xnew,-np.cos(xnew),’--’)

>>> plt.legend([’Cubic Spline’,’True’])

>>> plt.axis([-0.05,6.33,-1.05,1.05])

>>> plt.title(’Integral estimation from spline’)

>>> plt.show()

34 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

Roots of spline

>>> print(interpolate.sproot(tck))

[ 0. 3.1416]

Parametric spline

>>> t=np.arange(0,1.1,.1)

>>> x=np.sin(2*np.pi*t)

>>> y=np.cos(2*np.pi*t)

>>> tck,u =interpolate.splprep([x,y],s=0)

>>> unew =np.arange(0,1.01,0.01)

>>> out =interpolate.splev(unew,tck)

>>> plt.figure()

>>> plt.plot(x,y,’x’,out[0],out[1],np.sin(2*np.pi*unew),np.cos(2*np.pi*unew),x,y,’b’)

>>> plt.legend([’Linear’,’Cubic Spline’,’True’])

>>> plt.axis([-1.05,1.05,-1.05,1.05])

>>> plt.title(’Spline of parametrically-defined curve’)

>>> plt.show()

1.6. Interpolation (scipy.interpolate) 35

SciPy Reference Guide, Release 0.13.0

Spline interpolation in 1-d: Object-oriented (UnivariateSpline)

The spline-ﬁtting capabilities described above are also available via an objected-oriented interface. The one dimen-

sional splines are objects of the UnivariateSpline class, and are created with the xand ycomponents of the

curve provided as arguments to the constructor. The class deﬁnes __call__, allowing the object to be called with the

x-axis values at which the spline should be evaluated, returning the interpolated y-values. This is shown in the ex-

ample below for the subclass InterpolatedUnivariateSpline. The methods integral,derivatives,

and roots methods are also available on UnivariateSpline objects, allowing deﬁnite integrals, derivatives, and

roots to be computed for the spline.

The UnivariateSpline class can also be used to smooth data by providing a non-zero value of the smoothing parameter

s, with the same meaning as the skeyword of the splrep function described above. This results in a spline that

has fewer knots than the number of data points, and hence is no longer strictly an interpolating spline, but rather a

smoothing spline. If this is not desired, the InterpolatedUnivariateSpline class is available. It is a subclass

of UnivariateSpline that always passes through all points (equivalent to forcing the smoothing parameter to 0).

This class is demonstrated in the example below.

The LSQUnivarateSpline is the other subclass of UnivarateSpline. It allows the user to specify the number and location

of internal knots as explicitly with the parameter t. This allows creation of customized splines with non-linear spacing,

to interpolate in some domains and smooth in others, or change the character of the spline.

>>> import numpy as np

>>> import matplotlib.pyplot as plt

>>> from scipy import interpolate

InterpolatedUnivariateSpline

>>> x=np.arange(0,2*np.pi+np.pi/4,2*np.pi/8)

>>> y=np.sin(x)

>>> s=interpolate.InterpolatedUnivariateSpline(x,y)

>>> xnew =np.arange(0,2*np.pi,np.pi/50)

>>> ynew =s(xnew)

>>> plt.figure()

>>> plt.plot(x,y,’x’,xnew,ynew,xnew,np.sin(xnew),x,y,’b’)

>>> plt.legend([’Linear’,’InterpolatedUnivariateSpline’,’True’])

>>> plt.axis([-0.05,6.33,-1.05,1.05])

36 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

>>> plt.title(’InterpolatedUnivariateSpline’)

>>> plt.show()

LSQUnivarateSpline with non-uniform knots

>>> t=[np.pi/2-.1,np.pi/2+.1,3*np.pi/2-.1,3*np.pi/2+.1]

>>> s=interpolate.LSQUnivariateSpline(x,y,t,k=2)

>>> ynew =s(xnew)

>>> plt.figure()

>>> plt.plot(x,y,’x’,xnew,ynew,xnew,np.sin(xnew),x,y,’b’)

>>> plt.legend([’Linear’,’LSQUnivariateSpline’,’True’])

>>> plt.axis([-0.05,6.33,-1.05,1.05])

>>> plt.title(’Spline with Specified Interior Knots’)

>>> plt.show()

1.6. Interpolation (scipy.interpolate) 37

SciPy Reference Guide, Release 0.13.0

Two-dimensional spline representation: Procedural (bisplrep)

For (smooth) spline-ﬁtting to a two dimensional surface, the function bisplrep is available. This function takes as

required inputs the 1-D arrays x,y, and zwhich represent points on the surface z=f(x, y).The default output is a

list [tx, ty, c, kx, ky]whose entries represent respectively, the components of the knot positions, the coefﬁcients of the

spline, and the order of the spline in each coordinate. It is convenient to hold this list in a single object, tck, so that

it can be passed easily to the function bisplev. The keyword, s, can be used to change the amount of smoothing

performed on the data while determining the appropriate spline. The default value is s=m−√2mwhere mis the

number of data points in the x, y, and zvectors. As a result, if no smoothing is desired, then s= 0 should be passed to

bisplrep .

To evaluate the two-dimensional spline and it’s partial derivatives (up to the order of the spline), the function bisplev

is required. This function takes as the ﬁrst two arguments two 1-D arrays whose cross-product speciﬁes the domain

over which to evaluate the spline. The third argument is the tck list returned from bisplrep. If desired, the fourth

and ﬁfth arguments provide the orders of the partial derivative in the xand ydirection respectively.

It is important to note that two dimensional interpolation should not be used to ﬁnd the spline representation of

images. The algorithm used is not amenable to large numbers of input points. The signal processing toolbox contains

more appropriate algorithms for ﬁnding the spline representation of an image. The two dimensional interpolation

commands are intended for use when interpolating a two dimensional function as shown in the example that follows.

This example uses the mgrid command in SciPy which is useful for deﬁning a “mesh-grid “in many dimensions.

(See also the ogrid command if the full-mesh is not needed). The number of output arguments and the number of

dimensions of each argument is determined by the number of indexing objects passed in mgrid.

>>> import numpy as np

>>> from scipy import interpolate

>>> import matplotlib.pyplot as plt

Deﬁne function over sparse 20x20 grid

>>> x,y =np.mgrid[-1:1:20j,-1:1:20j]

>>> z=(x+y)*np.exp(-6.0*(x*x+y*y))

>>> plt.figure()

>>> plt.pcolor(x,y,z)

>>> plt.colorbar()

>>> plt.title("Sparsely sampled function.")

>>> plt.show()

38 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

Interpolate function over new 70x70 grid

>>> xnew,ynew =np.mgrid[-1:1:70j,-1:1:70j]

>>> tck =interpolate.bisplrep(x,y,z,s=0)

>>> znew =interpolate.bisplev(xnew[:,0],ynew[0,:],tck)

>>> plt.figure()

>>> plt.pcolor(xnew,ynew,znew)

>>> plt.colorbar()

>>> plt.title("Interpolated function.")

>>> plt.show()

Two-dimensional spline representation: Object-oriented (BivariateSpline)

The BivariateSpline class is the 2-dimensional analog of the UnivariateSpline class. It and its subclasses

implement the FITPACK functions described above in an object oriented fashion, allowing objects to be instantiated

that can be called to compute the spline value by passing in the two coordinates as the two arguments.

1.6.4 Using radial basis functions for smoothing/interpolation

Radial basis functions can be used for smoothing/interpolating scattered data in n-dimensions, but should be used with

caution for extrapolation outside of the observed data range.

1-d Example

This example compares the usage of the Rbf and UnivariateSpline classes from the scipy.interpolate module.

>>> import numpy as np

>>> from scipy.interpolate import Rbf, InterpolatedUnivariateSpline

>>> import matplotlib.pyplot as plt

>>> # setup data

>>> x=np.linspace(0,10,9)

>>> y=np.sin(x)

>>> xi =np.linspace(0,10,101)

1.6. Interpolation (scipy.interpolate) 39

SciPy Reference Guide, Release 0.13.0

>>> # use fitpack2 method

>>> ius =InterpolatedUnivariateSpline(x, y)

>>> yi =ius(xi)

>>> plt.subplot(2,1,1)

>>> plt.plot(x, y, ’bo’)

>>> plt.plot(xi, yi, ’g’)

>>> plt.plot(xi, np.sin(xi), ’r’)

>>> plt.title(’Interpolation using univariate spline’)

>>> # use RBF method

>>> rbf =Rbf(x, y)

>>> fi =rbf(xi)

>>> plt.subplot(2,1,2)

>>> plt.plot(x, y, ’bo’)

>>> plt.plot(xi, fi, ’g’)

>>> plt.plot(xi, np.sin(xi), ’r’)

>>> plt.title(’Interpolation using RBF - multiquadrics’)

>>> plt.show()

2-d Example

This example shows how to interpolate scattered 2d data.

>>> import numpy as np

>>> from scipy.interpolate import Rbf

>>> import matplotlib.pyplot as plt

>>> from matplotlib import cm

>>> # 2-d tests - setup scattered data

>>> x=np.random.rand(100)*4.0-2.0

>>> y=np.random.rand(100)*4.0-2.0

>>> z=x*np.exp(-x**2-y**2)

>>> ti =np.linspace(-2.0,2.0,100)

>>> XI, YI =np.meshgrid(ti, ti)

40 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

>>> # use RBF

>>> rbf =Rbf(x, y, z, epsilon=2)

>>> ZI =rbf(XI, YI)

>>> # plot the result

>>> n=plt.normalize(-2.,2.)

>>> plt.subplot(1,1,1)

>>> plt.pcolor(XI, YI, ZI, cmap=cm.jet)

>>> plt.scatter(x, y, 100, z, cmap=cm.jet)

>>> plt.title(’RBF interpolation - multiquadrics’)

>>> plt.xlim(-2,2)

>>> plt.ylim(-2,2)

>>> plt.colorbar()

1.7 Fourier Transforms (scipy.fftpack)

Warning: This is currently a stub page

1.7. Fourier Transforms (scipy.fftpack) 41

SciPy Reference Guide, Release 0.13.0

Contents

•Fourier Transforms (scipy.fftpack)

–Fast Fourier transforms

–One dimensional discrete Fourier transforms

–Two and n dimensional discrete Fourier transforms

–Discrete Cosine Transforms

*type I

*type II

*type III

–Discrete Sine Transforms

*type I

*type II

*type III

*References

–FFT convolution

–Cache Destruction

Fourier analysis is fundamentally a method for expressing a function as a sum of periodic components, and for recover-

ing the signal from those components. When both the function and its Fourier transform are replaced with discretized

counterparts, it is called the discrete Fourier transform (DFT). The DFT has become a mainstay of numerical com-

puting in part because of a very fast algorithm for computing it, called the Fast Fourier Transform (FFT), which was

known to Gauss (1805) and was brought to light in its current form by Cooley and Tukey [CT]. Press et al. [NR]

provide an accessible introduction to Fourier analysis and its applications.

1.7.1 Fast Fourier transforms

1.7.2 One dimensional discrete Fourier transforms

fft, ifft, rfft, irfft

1.7.3 Two and n dimensional discrete Fourier transforms

fft in more than one dimension

1.7.4 Discrete Cosine Transforms

Return the Discrete Cosine Transform [Mak] of arbitrary type sequence x.

For a single dimension array x,dct(x, norm=’ortho’) is equal to MATLAB dct(x).

There are theoretically 8 types of the DCT [WPC], only the ﬁrst 3 types are implemented in scipy. ‘The’ DCT generally

refers to DCT type 2, and ‘the’ Inverse DCT generally refers to DCT type 3.

type I

There are several deﬁnitions of the DCT-I; we use the following (for norm=None):

yk=x0+ (−1)kxN−1+ 2

N−2

n=1

xncos πnk

N−1,0≤k < N.

42 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

Only None is supported as normalization mode for DCT-I. Note also that the DCT-I is only supported for input size >

type II

There are several deﬁnitions of the DCT-II; we use the following (for norm=None):

yk= 2

N−1

n=0

xncos π(2n+ 1)k

2N0≤k < N.

If norm=’ortho’,ykis multiplied by a scaling factor f:

f=(p1/(4N),if k= 0

p1/(2N),otherwise

Which makes the corresponding matrix of coefﬁcients orthonormal (OO’ = Id).

type III

There are several deﬁnitions of the DCT-III, we use the following (for norm=None):

yk=x0+ 2

N−1

n=1

xncos πn(2k+ 1)

2N0≤k < N,

or, for norm=’ortho’:

yk=x0

√N+1

√N

N−1

n=1

xncos πn(2k+ 1)

2N0≤k < N.

The (unnormalized) DCT-III is the inverse of the (unnormalized) DCT-II, up to a factor 2N. The orthonormalized

DCT-III is exactly the inverse of the orthonormalized DCT-II.

1.7.5 Discrete Sine Transforms

Return the Discrete Sine Transform [Mak] of arbitrary type sequence x.

There are theoretically 8 types of the DST for different combinations of even/odd boundary conditions and boundary

off sets [WPS], only the ﬁrst 3 types are implemented in scipy.

type I

There are several deﬁnitions of the DST-I; we use the following for norm=None. DST-I assumes the input is odd

around n=-1 and n=N.

yk= 2

N−1

n=0

xnsin π(n+ 1)(k+ 1)

N+ 1 ,0≤k < N.

Only None is supported as normalization mode for DST-I. Note also that the DCT-I is only supported for input size >

1. The (unnormalized) DCT-I is its own inverse, up to a factor 2(N+1).

1.7. Fourier Transforms (scipy.fftpack) 43

SciPy Reference Guide, Release 0.13.0

type II

There are several deﬁnitions of the DST-II; we use the following (for norm=None). DST-II assumes the input is odd

around n=-1/2 and even around n=N

yk= 2

N−1

n=0

xnsin π(n+ 1/2)(k+ 1)

N,0≤k < N.

type III

There are several deﬁnitions of the DST-III, we use the following (for norm=None). DST-III assumes the input is

odd around n=-1 and even around n=N-1

yk= (−1)kxN−1+ 2

N−2

n=0

xnsin π(n+ 1)(k+ 1/2)

N,0≤k < N.

The (unnormalized) DCT-III is the inverse of the (unnormalized) DCT-II, up to a factor 2N.

References

1.7.6 FFT convolution

scipy.fftpack.convolve performs a convolution of two one-dimensional arrays in frequency domain.

1.7.7 Cache Destruction

To accelerate repeat transforms on arrays of the same shape and dtype, scipy.fftpack keeps a cache of the prime

factorization of length of the array and pre-computed trigonometric functions. These caches can be destroyed by

calling the appropriate function in scipy.fftpack._fftpack. dst(type=1) and idst(type=1) share a cache (*dst1_cache).

As do dst(type=2), dst(type=3), idst(type=3), and idst(type=3) (*dst2_cache).

1.8 Signal Processing (scipy.signal)

The signal processing toolbox currently contains some ﬁltering functions, a limited set of ﬁlter design tools, and a few

B-spline interpolation algorithms for one- and two-dimensional data. While the B-spline algorithms could technically

be placed under the interpolation category, they are included here because they only work with equally-spaced data and

make heavy use of ﬁlter-theory and transfer-function formalism to provide a fast B-spline transform. To understand

this section you will need to understand that a signal in SciPy is an array of real or complex numbers.

1.8.1 B-splines

A B-spline is an approximation of a continuous function over a ﬁnite- domain in terms of B-spline coefﬁcients and knot

points. If the knot- points are equally spaced with spacing ∆x, then the B-spline approximation to a 1-dimensional

function is the ﬁnite-basis expansion.

y(x)≈X

cjβox

∆x−j.

44 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

In two dimensions with knot-spacing ∆xand ∆y, the function representation is

z(x, y)≈X

cjkβox

∆x−jβoy

∆y−k.

In these expressions, βo(·)is the space-limited B-spline basis function of order, o. The requirement of equally-

spaced knot-points and equally-spaced data points, allows the development of fast (inverse-ﬁltering) algorithms for

determining the coefﬁcients, cj, from sample-values, yn. Unlike the general spline interpolation algorithms, these

algorithms can quickly ﬁnd the spline coefﬁcients for large images.

The advantage of representing a set of samples via B-spline basis functions is that continuous-domain operators

(derivatives, re- sampling, integral, etc.) which assume that the data samples are drawn from an underlying con-

tinuous function can be computed with relative ease from the spline coefﬁcients. For example, the second-derivative

of a spline is

y00 (x) = 1

∆x2X

cjβo00 x

∆x−j.

Using the property of B-splines that

d2βo(w)

dw2=βo−2(w+ 1) −2βo−2(w) + βo−2(w−1)

it can be seen that

y00 (x) = 1

∆x2X

cjhβo−2x

∆x−j+ 1−2βo−2x

∆x−j+βo−2x

∆x−j−1i.

If o= 3 , then at the sample points,

∆x2y0(x)|x=n∆x=X

cjδn−j+1 −2cjδn−j+cjδn−j−1,

=cn+1 −2cn+cn−1.

Thus, the second-derivative signal can be easily calculated from the spline ﬁt. if desired, smoothing splines can be

found to make the second-derivative less sensitive to random-errors.

The savvy reader will have already noticed that the data samples are related to the knot coefﬁcients via a convolution

operator, so that simple convolution with the sampled B-spline function recovers the original data from the spline coef-

ﬁcients. The output of convolutions can change depending on how boundaries are handled (this becomes increasingly

more important as the number of dimensions in the data- set increases). The algorithms relating to B-splines in the

signal- processing sub package assume mirror-symmetric boundary conditions. Thus, spline coefﬁcients are computed

based on that assumption, and data-samples can be recovered exactly from the spline coefﬁcients by assuming them

to be mirror-symmetric also.

Currently the package provides functions for determining second- and third-order cubic spline coefﬁcients

from equally spaced samples in one- and two-dimensions (signal.qspline1d,signal.qspline2d,

signal.cspline1d,signal.cspline2d). The package also supplies a function ( signal.bspline ) for

evaluating the bspline basis function, βo(x)for arbitrary order and x. For large o, the B-spline basis function can be

approximated well by a zero-mean Gaussian function with standard-deviation equal to σo= (o+ 1) /12 :

βo(x)≈1

p2πσ2

exp −x2

2σo.

A function to compute this Gaussian for arbitrary xand ois also available ( signal.gauss_spline ). The

following code and Figure uses spline-ﬁltering to compute an edge-image (the second-derivative of a smoothed

spline) of Lena’s face which is an array returned by the command lena. The command signal.sepfir2d

was used to apply a separable two-dimensional FIR ﬁlter with mirror- symmetric boundary conditions to the spline

coefﬁcients. This function is ideally suited for reconstructing samples from spline coefﬁcients and is faster than

signal.convolve2d which convolves arbitrary two-dimensional ﬁlters and allows for choosing mirror-symmetric

boundary conditions.

1.8. Signal Processing (scipy.signal) 45

SciPy Reference Guide, Release 0.13.0

>>> from numpy import *

>>> from scipy import signal, misc

>>> import matplotlib.pyplot as plt

>>> image =misc.lena().astype(float32)

>>> derfilt =array([1.0,-2,1.0],float32)

>>> ck =signal.cspline2d(image,8.0)

>>> deriv =signal.sepfir2d(ck, derfilt, [1]) +\

>>> signal.sepfir2d(ck, [1], derfilt)

Alternatively we could have done:

laplacian =array([[0,1,0],[1,-4,1],[0,1,0]],float32)

deriv2 =signal.convolve2d(ck,laplacian,mode=’same’,boundary=’symm’)

>>> plt.figure()

>>> plt.imshow(image)

>>> plt.gray()

>>> plt.title(’Original image’)

>>> plt.show()

>>> plt.figure()

>>> plt.imshow(deriv)

>>> plt.gray()

>>> plt.title(’Output of spline edge filter’)

>>> plt.show()

46 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

1.8.2 Filtering

Filtering is a generic name for any system that modiﬁes an input signal in some way. In SciPy a signal can be thought

of as a Numpy array. There are different kinds of ﬁlters for different kinds of operations. There are two broad kinds

of ﬁltering operations: linear and non-linear. Linear ﬁlters can always be reduced to multiplication of the ﬂattened

Numpy array by an appropriate matrix resulting in another ﬂattened Numpy array. Of course, this is not usually the

best way to compute the ﬁlter as the matrices and vectors involved may be huge. For example ﬁltering a 512 ×512

image with this method would require multiplication of a 5122×5122matrix with a 5122vector. Just trying to store the

5122×5122matrix using a standard Numpy array would require 68,719,476,736 elements. At 4 bytes per element

this would require 256GB of memory. In most applications most of the elements of this matrix are zero and a different

method for computing the output of the ﬁlter is employed.

Convolution/Correlation

Many linear ﬁlters also have the property of shift-invariance. This means that the ﬁltering operation is the same at

different locations in the signal and it implies that the ﬁltering matrix can be constructed from knowledge of one row

(or column) of the matrix alone. In this case, the matrix multiplication can be accomplished using Fourier transforms.

Let x[n]deﬁne a one-dimensional signal indexed by the integer n. Full convolution of two one-dimensional signals

can be expressed as

y[n] = ∞

k=−∞

x[k]h[n−k].

This equation can only be implemented directly if we limit the sequences to ﬁnite support sequences that can be stored

in a computer, choose n= 0 to be the starting point of both sequences, let K+ 1 be that value for which y[n] = 0

for all n>K+ 1 and M+ 1 be that value for which x[n] = 0 for all n>M+ 1 , then the discrete convolution

expression is

y[n] =

min(n,K)

k=max(n−M,0)

x[k]h[n−k].

1.8. Signal Processing (scipy.signal) 47

SciPy Reference Guide, Release 0.13.0

For convenience assume K≥M. Then, more explicitly the output of this operation is

y[0] = x[0] h[0]

y[1] = x[0] h[1] + x[1] h[0]

y[2] = x[0] h[2] + x[1] h[1] + x[2] h[0]

y[M] = x[0] h[M] + x[1] h[M−1] + ··· +x[M]h[0]

y[M+ 1] = x[1] h[M] + x[2] h[M−1] + ··· +x[M+ 1] h[0]

y[K] = x[K−M]h[M] + ··· +x[K]h[0]

y[K+ 1] = x[K+ 1 −M]h[M] + ··· +x[K]h[1]

y[K+M−1] = x[K−1] h[M] + x[K]h[M−1]

y[K+M] = x[K]h[M].

Thus, the full discrete convolution of two ﬁnite sequences of lengths K+ 1 and M+ 1 respectively results in a ﬁnite

sequence of length K+M+ 1 = (K+ 1) + (M+ 1) −1.

One dimensional convolution is implemented in SciPy with the function signal.convolve . This function takes

as inputs the signals x, h , and an optional ﬂag and returns the signal y. The optional ﬂag allows for speciﬁcation of

which part of the output signal to return. The default value of ‘full’ returns the entire signal. If the ﬂag has a value of

‘same’ then only the middle Kvalues are returned starting at yM−1

2so that the output has the same length as the

largest input. If the ﬂag has a value of ‘valid’ then only the middle K−M+ 1 = (K+ 1) −(M+ 1) + 1 output

values are returned where zdepends on all of the values of the smallest input from h[0] to h[M].In other words only

the values y[M]to y[K]inclusive are returned.

This same function signal.convolve can actually take N-dimensional arrays as inputs and will return the N

-dimensional convolution of the two arrays. The same input ﬂags are available for that case as well.

Correlation is very similar to convolution except for the minus sign becomes a plus sign. Thus

w[n] = ∞

k=−∞

y[k]x[n+k]

is the (cross) correlation of the signals yand x. For ﬁnite-length signals with y[n] = 0 outside of the range [0, K]and

x[n] = 0 outside of the range [0, M ],the summation can simplify to

w[n] =

min(K,M−n)

k=max(0,−n)

y[k]x[n+k].

48 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

Assuming again that K≥Mthis is

w[−K] = y[K]x[0]

w[−K+ 1] = y[K−1] x[0] + y[K]x[1]

w[M−K] = y[K−M]x[0] + y[K−M+ 1] x[1] + ··· +y[K]x[M]

w[M−K+ 1] = y[K−M−1] x[0] + ··· +y[K−1] x[M]

w[−1] = y[1] x[0] + y[2] x[1] + ··· +y[M+ 1] x[M]

w[0] = y[0] x[0] + y[1] x[1] + ··· +y[M]x[M]

w[1] = y[0] x[1] + y[1] x[2] + ··· +y[M−1] x[M]

w[2] = y[0] x[2] + y[1] x[3] + ··· +y[M−2] x[M]

w[M−1] = y[0] x[M−1] + y[1] x[M]

w[M] = y[0] x[M].

The SciPy function signal.correlate implements this operation. Equivalent ﬂags are available for this operation

to return the full K+M+1 length sequence (‘full’) or a sequence with the same size as the largest sequence starting at

w−K+M−1

2(‘same’) or a sequence where the values depend on all the values of the smallest sequence (‘valid’).

This ﬁnal option returns the K−M+ 1 values w[M−K]to w[0] inclusive.

The function signal.correlate can also take arbitrary N-dimensional arrays as input and return the N-

dimensional convolution of the two arrays on output.

When N= 2,signal.correlate and/or signal.convolve can be used to construct arbitrary image ﬁlters

to perform actions such as blurring, enhancing, and edge-detection for an image.

Convolution is mainly used for ﬁltering when one of the signals is much smaller than the other ( KM), otherwise

linear ﬁltering is more easily accomplished in the frequency domain (see Fourier Transforms).

Difference-equation ﬁltering

A general class of linear one-dimensional ﬁlters (that includes convolution ﬁlters) are ﬁlters described by the difference

equation

k=0

aky[n−k] =

k=0

bkx[n−k]

where x[n]is the input sequence and y[n]is the output sequence. If we assume initial rest so that y[n] = 0 for n < 0

, then this kind of ﬁlter can be implemented using convolution. However, the convolution ﬁlter sequence h[n]could

be inﬁnite if ak6= 0 for k≥1.In addition, this general class of linear ﬁlter allows initial conditions to be placed on

y[n]for n < 0resulting in a ﬁlter that cannot be expressed using convolution.

The difference equation ﬁlter can be thought of as ﬁnding y[n]recursively in terms of it’s previous values

a0y[n] = −a1y[n−1] − ··· − aNy[n−N] + ··· +b0x[n] + ··· +bMx[n−M].

Often a0= 1 is chosen for normalization. The implementation in SciPy of this general difference equation ﬁlter is

a little more complicated then would be implied by the previous equation. It is implemented so that only one signal

1.8. Signal Processing (scipy.signal) 49

SciPy Reference Guide, Release 0.13.0

needs to be delayed. The actual implementation equations are (assuming a0= 1 ).

y[n] = b0x[n] + z0[n−1]

z0[n] = b1x[n] + z1[n−1] −a1y[n]

z1[n] = b2x[n] + z2[n−1] −a2y[n]

zK−2[n] = bK−1x[n] + zK−1[n−1] −aK−1y[n]

zK−1[n] = bKx[n]−aKy[n],

where K= max (N, M).Note that bK= 0 if K > M and aK= 0 if K > N. In this way, the output at time n

depends only on the input at time nand the value of z0at the previous time. This can always be calculated as long as

the Kvalues z0[n−1] . . . zK−1[n−1] are computed and stored at each time step.

The difference-equation ﬁlter is called using the command signal.lfilter in SciPy. This command takes as

inputs the vector b, the vector, a, a signal xand returns the vector y(the same length as x) computed using the

equation given above. If xis N-dimensional, then the ﬁlter is computed along the axis provided. If, desired, initial

conditions providing the values of z0[−1] to zK−1[−1] can be provided or else it will be assumed that they are all

zero. If initial conditions are provided, then the ﬁnal conditions on the intermediate variables are also returned. These

could be used, for example, to restart the calculation in the same state.

Sometimes it is more convenient to express the initial conditions in terms of the signals x[n]and y[n].In other words,

perhaps you have the values of x[−M]to x[−1] and the values of y[−N]to y[−1] and would like to determine what

values of zm[−1] should be delivered as initial conditions to the difference-equation ﬁlter. It is not difﬁcult to show

that for 0≤m < K,

zm[n] =

K−m−1

p=0

(bm+p+1x[n−p]−am+p+1y[n−p]) .

Using this formula we can ﬁnd the intial condition vector z0[−1] to zK−1[−1] given initial conditions on y(and x).

The command signal.lfiltic performs this function.

Other ﬁlters

The signal processing package provides many more ﬁlters as well.

Median Filter

A median ﬁlter is commonly applied when noise is markedly non-Gaussian or when it is desired to preserve edges. The

median ﬁlter works by sorting all of the array pixel values in a rectangular region surrounding the point of interest.

The sample median of this list of neighborhood pixel values is used as the value for the output array. The sample

median is the middle array value in a sorted list of neighborhood values. If there are an even number of elements in the

neighborhood, then the average of the middle two values is used as the median. A general purpose median ﬁlter that

works on N-dimensional arrays is signal.medfilt . A specialized version that works only for two-dimensional

arrays is available as signal.medfilt2d .

Order Filter

A median ﬁlter is a speciﬁc example of a more general class of ﬁlters called order ﬁlters. To compute the output

at a particular pixel, all order ﬁlters use the array values in a region surrounding that pixel. These array values are

sorted and then one of them is selected as the output value. For the median ﬁlter, the sample median of the list of

array values is used as the output. A general order ﬁlter allows the user to select which of the sorted values will be

used as the output. So, for example one could choose to pick the maximum in the list or the minimum. The order

ﬁlter takes an additional argument besides the input array and the region mask that speciﬁes which of the elements

in the sorted list of neighbor array values should be used as the output. The command to perform an order ﬁlter is

signal.order_filter .

50 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

Wiener ﬁlter

The Wiener ﬁlter is a simple deblurring ﬁlter for denoising images. This is not the Wiener ﬁlter commonly described

in image reconstruction problems but instead it is a simple, local-mean ﬁlter. Let xbe the input signal, then the output

y=(σ2

σ2

xmx+1−σ2

σ2

xx σ2

x≥σ2,

mxσ2

x< σ2,

where mxis the local estimate of the mean and σ2

xis the local estimate of the variance. The window for these estimates

is an optional input parameter (default is 3×3). The parameter σ2is a threshold noise parameter. If σis not given

then it is estimated as the average of the local variances.

Hilbert ﬁlter

The Hilbert transform constructs the complex-valued analytic signal from a real signal. For example if x= cos ωn

then y=hilbert (x)would return (except near the edges) y= exp (jωn).In the frequency domain, the hilbert

transform performs

Y=X·H

where His 2 for positive frequencies, 0for negative frequencies and 1for zero-frequencies.

1.8.3 Least-Squares Spectral Analysis

Least-squares spectral analysis (LSSA) is a method of estimating a frequency spectrum, based on a least squares ﬁt

of sinusoids to data samples, similar to Fourier analysis. Fourier analysis, the most used spectral method in science,

generally boosts long-periodic noise in long gapped records; LSSA mitigates such problems.

Lomb-Scargle Periodograms (lombscargle)

The Lomb-Scargle method performs spectral analysis on unevenly sampled data and is known to be a powerful way

to ﬁnd, and test the signiﬁcance of, weak periodic signals.

For a time series comprising Ntmeasurements Xj≡X(tj)sampled at times tjwhere (j= 1, . . . , Nt), assumed

to have been scaled and shifted such that its mean is zero and its variance is unity, the normalized Lomb-Scargle

periodogram at frequency fis

Pn(f)1

2









hPNt

jXjcos ω(tj−τ)i2

PNt

jcos2ω(tj−τ)+hPNt

jXjsin ω(tj−τ)i2

PNt

jsin2ω(tj−τ)









Here, ω≡2πf is the angular frequency. The frequency dependent time offset τis given by

tan 2ωτ =PNt

jsin 2ωtj

PNt

jcos 2ωtj

The lombscargle function calculates the periodogram using a slightly modiﬁed algorithm due to Townsend 1which

allows the periodogram to be calculated using only a single pass through the input arrays for each frequency.

The equation is refactored as:

Pn(f) = 1

2(cτXC +sτXS)2

τCC + 2cτsτCS +s2

τSS +(cτXS −sτXC)2

τSS −2cτsτCS +s2

τCC 

1R.H.D. Townsend, “Fast calculation of the Lomb-Scargle periodogram using graphics processing units.”, The Astrophysical Journal Supple-

ment Series, vol 191, pp. 247-253, 2010

1.8. Signal Processing (scipy.signal) 51

SciPy Reference Guide, Release 0.13.0

and

tan 2ωτ =2CS

CC −SS .

Here,

cτ= cos ωτ, sτ= sin ωτ

while the sums are

XC =

Xjcos ωtj

XS =

Xjsin ωtj

CC =

cos2ωtj

SS =

sin2ωtj

CS =

cos ωtjsin ωtj.

This requires Nf(2Nt+ 3) trigonometric function evaluations giving a factor of ∼2speed increase over the straight-

forward implementation.

References

Some further reading and related software:

1.9 Linear Algebra (scipy.linalg)

When SciPy is built using the optimized ATLAS LAPACK and BLAS libraries, it has very fast linear algebra capabil-

ities. If you dig deep enough, all of the raw lapack and blas libraries are available for your use for even more speed.

In this section, some easier-to-use interfaces to these routines are described.

All of these linear algebra routines expect an object that can be converted into a 2-dimensional array. The output of

these routines is also a two-dimensional array.

scipy.linalg contains all the functions in numpy.linalg. plus some other more advanced ones not contained

in numpy.linalg

Another advantage of using scipy.linalg over numpy.linalg is that it is always compiled with

BLAS/LAPACK support, while for numpy this is optional. Therefore, the scipy version might be faster depending

on how numpy was installed.

Therefore, unless you don’t want to add scipy as a dependency to your numpy program, use scipy.linalg

instead of numpy.linalg

1.9.1 numpy.matrix vs 2D numpy.ndarray

The classes that represent matrices, and basic operations such as matrix multiplications and transpose are a part of

numpy. For convenience, we summarize the differences between numpy.matrix and numpy.ndarray here.

52 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

numpy.matrix is matrix class that has a more convenient interface than numpy.ndarray for matrix operations.

This class supports for example MATLAB-like creation syntax via the, has matrix multiplication as default for the *

operator, and contains Iand Tmembers that serve as shortcuts for inverse and transpose:

>>> import numpy as np

>>> A=np.mat(’[1 2;3 4]’)

>>> A

matrix([[1, 2],

[3, 4]])

>>> A.I

matrix([[-2. , 1. ],

[ 1.5, -0.5]])

>>> b=np.mat(’[5 6]’)

>>> b

matrix([[5, 6]])

>>> b.T

matrix([[5],

[6]])

>>> A*b.T

matrix([[17],

[39]])

Despite its convenience, the use of the numpy.matrix class is discouraged, since it adds nothing that cannot be

accomplished with 2D numpy.ndarray objects, and may lead to a confusion of which class is being used. For

example, the above code can be rewritten as:

>>> import numpy as np

>>> from scipy import linalg

>>> A=np.array([[1,2],[3,4]])

>>> A

array([[1, 2],

[3, 4]])

>>> linalg.inv(A)

array([[-2. , 1. ],

[ 1.5, -0.5]])

>>> b=np.array([[5,6]]) #2D array

>>> b

array([[5, 6]])

>>> b.T

array([[5],

[6]])

>>> A*b#not matrix multiplication!

array([[ 5, 12],

[15, 24]])

>>> A.dot(b.T) #matrix multiplication

array([[17],

[39]])

>>> b=np.array([5,6]) #1D array

>>> b

array([5, 6])

>>> b.T#not matrix transpose!

array([5, 6])

>>> A.dot(b) #does not matter for multiplication

array([17, 39])

scipy.linalg operations can be applied equally to numpy.matrix or to 2D numpy.ndarray objects.

1.9. Linear Algebra (scipy.linalg) 53

SciPy Reference Guide, Release 0.13.0

1.9.2 Basic routines

Finding Inverse

The inverse of a matrix Ais the matrix Bsuch that AB =Iwhere Iis the identity matrix consisting of ones down

the main diagonal. Usually Bis denoted B=A−1. In SciPy, the matrix inverse of the Numpy array, A, is obtained

using linalg.inv (A) , or using A.I if Ais a Matrix. For example, let

A=



135

251

238



then

A−1=1

25 

−37 9 22

14 2 −9

4−3 1 

=

−1.48 0.36 0.88

0.56 0.08 −0.36

0.16 −0.12 0.04 

.

The following example demonstrates this computation in SciPy

>>> import numpy as np

>>> from scipy import linalg

>>> A=np.array([[1,2],[3,4]])

array([[1, 2],

[3, 4]])

>>> linalg.inv(A)

array([[-2. , 1. ],

[ 1.5, -0.5]])

>>> A.dot(linalg.inv(A)) #double check

array([[ 1.00000000e+00, 0.00000000e+00],

[ 4.44089210e-16, 1.00000000e+00]])

Solving linear system

Solving linear systems of equations is straightforward using the scipy command linalg.solve. This command

expects an input matrix and a right-hand-side vector. The solution vector is then computed. An option for entering a

symmetrix matrix is offered which can speed up the processing when applicable. As an example, suppose it is desired

to solve the following simultaneous equations:

x+ 3y+ 5z= 10

2x+ 5y+z= 8

2x+ 3y+ 8z= 3

We could ﬁnd the solution vector using a matrix inverse:





z

=



135

251

238



−1



3

=1

25 

−232

129

19 

=

−9.28

5.16

0.76 

.

However, it is better to use the linalg.solve command which can be faster and more numerically stable. In this case it

however gives the same answer as shown in the following example:

>>> import numpy as np

>>> from scipy import linalg

>>> A=np.array([[1,2],[3,4]])

>>> A

array([[1, 2],

54 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

[3, 4]])

>>> b=np.array([[5],[6]])

>>> b

array([[5],

[6]])

>>> linalg.inv(A).dot(b) #slow

array([[-4. ],

[ 4.5]]

>>> A.dot(linalg.inv(A).dot(b))-b#check

array([[ 8.88178420e-16],

[ 2.66453526e-15]])

>>> np.linalg.solve(A,b) #fast

array([[-4. ],

[ 4.5]])

>>> A.dot(np.linalg.solve(A,b))-b#check

array([[ 0.],

[ 0.]])

Finding Determinant

The determinant of a square matrix Ais often denoted |A|and is a quantity often used in linear algebra. Suppose aij

are the elements of the matrix Aand let Mij =|Aij |be the determinant of the matrix left by removing the ith row

and jth column from A. Then for any row i,

|A|=X

(−1)i+jaij Mij .

This is a recursive way to deﬁne the determinant where the base case is deﬁned by accepting that the determinant of a

1×1matrix is the only matrix element. In SciPy the determinant can be calculated with linalg.det . For example,

the determinant of

A=



135

251

238



|A|= 1 

5 1

3 8 −3

2 1

2 8 

+ 5 

2 5

2 3 

= 1 (5 ·8−3·1) −3 (2 ·8−2·1) + 5 (2 ·3−2·5) = −25.

In SciPy this is computed as shown in this example:

>>> import numpy as np

>>> from scipy import linalg

>>> A=np.array([[1,2],[3,4]])

>>> A

array([[1, 2],

[3, 4]])

>>> linalg.det(A)

-2.0

Computing norms

Matrix and vector norms can also be computed with SciPy. A wide range of norm deﬁnitions are available using

different parameters to the order argument of linalg.norm . This function takes a rank-1 (vectors) or a rank-2

1.9. Linear Algebra (scipy.linalg) 55

SciPy Reference Guide, Release 0.13.0

(matrices) array and an optional order argument (default is 2). Based on these inputs a vector or matrix norm of the

requested order is computed.

For vector x, the order parameter can be any real number including inf or -inf. The computed norm is

kxk=









max |xi|ord =inf

min |xi|ord =−inf

Pi|xi|ord1/ord

|ord|<∞.

For matrix Athe only valid values for norm are ±2,±1,±inf, and ‘fro’ (or ‘f’) Thus,

kAk=











maxiPj|aij |ord =inf

miniPj|aij |ord =−inf

maxjPi|aij |ord = 1

minjPi|aij |ord =−1

max σiord = 2

min σiord =−2

ptrace (AHA)ord =’fro’

where σiare the singular values of A.

Examples:

>>> import numpy as np

>>> from scipy import linalg

>>> A=np.array([[1,2],[3,4]])

>>> A

array([[1, 2],

[3, 4]])

>>> linalg.norm(A)

5.4772255750516612

>>> linalg.norm(A,’fro’)# frobenius norm is the default

5.4772255750516612

>>> linalg.norm(A,1)# L1 norm (max column sum)

>>> linalg.norm(A,-1)

>>> linalg.norm(A,inf) # L inf norm (max row sum)

Solving linear least-squares problems and pseudo-inverses

Linear least-squares problems occur in many branches of applied mathematics. In this problem a set of linear scaling

coefﬁcients is sought that allow a model to ﬁt data. In particular it is assumed that data yiis related to data xithrough

a set of coefﬁcients cjand model functions fj(xi)via the model

yi=X

cjfj(xi) + i

where irepresents uncertainty in the data. The strategy of least squares is to pick the coefﬁcients cjto minimize

J(c) = X

i

yi−X

cjfj(xi)

Theoretically, a global minimum will occur when

∂J

∂c∗

=0=X

i

yi−X

cjfj(xi)

(−f∗

n(xi))

56 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

cjX

fj(xi)f∗

n(xi) = X

yif∗

n(xi)

AHAc =AHy

where

{A}ij =fj(xi).

When AHAis invertible, then

c=AHA−1AHy=A†y

where A†is called the pseudo-inverse of A.Notice that using this deﬁnition of Athe model can be written

y=Ac +.

The command linalg.lstsq will solve the linear least squares problem for cgiven Aand y. In addition

linalg.pinv or linalg.pinv2 (uses a different method based on singular value decomposition) will ﬁnd A†

given A.

The following example and ﬁgure demonstrate the use of linalg.lstsq and linalg.pinv for solving a data-

ﬁtting problem. The data shown below were generated using the model:

yi=c1e−xi+c2xi

where xi= 0.1ifor i= 1 . . . 10 ,c1= 5 , and c2= 4.Noise is added to yiand the coefﬁcients c1and c2are estimated

using linear least squares.

>>> from numpy import *

>>> from scipy import linalg

>>> import matplotlib.pyplot as plt

>>> c1,c2=5.0,2.0

>>> i=r_[1:11]

>>> xi =0.1*i

>>> yi =c1*exp(-xi)+c2*xi

>>> zi =yi +0.05*max(yi)*random.randn(len(yi))

>>> A=c_[exp(-xi)[:,newaxis],xi[:,newaxis]]

>>> c,resid,rank,sigma =linalg.lstsq(A,zi)

>>> xi2 =r_[0.1:1.0:100j]

>>> yi2 =c[0]*exp(-xi2) +c[1]*xi2

>>> plt.plot(xi,zi,’x’,xi2,yi2)

>>> plt.axis([0,1.1,3.0,5.5])

>>> plt.xlabel(’$x_i$’)

>>> plt.title(’Data fitting with linalg.lstsq’)

>>> plt.show()

1.9. Linear Algebra (scipy.linalg) 57

SciPy Reference Guide, Release 0.13.0

Generalized inverse

The generalized inverse is calculated using the command linalg.pinv or linalg.pinv2. These two commands

differ in how they compute the generalized inverse. The ﬁrst uses the linalg.lstsq algorithm while the second uses

singular value decomposition. Let Abe an M×Nmatrix, then if M > N the generalized inverse is

A†=AHA−1AH

while if M < N matrix the generalized inverse is

A#=AHAAH−1.

In both cases for M=N, then

A†=A#=A−1

as long as Ais invertible.

1.9.3 Decompositions

In many applications it is useful to decompose a matrix using other representations. There are several decompositions

supported by SciPy.

Eigenvalues and eigenvectors

The eigenvalue-eigenvector problem is one of the most commonly employed linear algebra operations. In one popular

form, the eigenvalue-eigenvector problem is to ﬁnd for some square matrix Ascalars λand corresponding vectors v

such that

Av =λv.

For an N×Nmatrix, there are N(not necessarily distinct) eigenvalues — roots of the (characteristic) polynomial

|A−λI|= 0.

The eigenvectors, v, are also sometimes called right eigenvectors to distinguish them from another set of left eigen-

vectors that satisfy

LA=λvH

58 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

AHvL=λ∗vL.

With it’s default optional arguments, the command linalg.eig returns λand v.However, it can also return vLand

just λby itself ( linalg.eigvals returns just λas well).

In addtion, linalg.eig can also solve the more general eigenvalue problem

Av =λBv

AHvL=λ∗BHvL

for square matrices Aand B.The standard eigenvalue problem is an example of the general eigenvalue problem for

B=I.When a generalized eigenvalue problem can be solved, then it provides a decomposition of Aas

A=BVΛV−1

where Vis the collection of eigenvectors into columns and Λis a diagonal matrix of eigenvalues.

By deﬁnition, eigenvectors are only deﬁned up to a constant scale factor. In SciPy, the scaling factor for the eigenvec-

tors is chosen so that kvk2=Piv2

i= 1.

As an example, consider ﬁnding the eigenvalues and eigenvectors of the matrix

A=



152

241

362

.

The characteristic polynomial is

|A−λI|= (1 −λ) [(4 −λ) (2 −λ)−6] −

5 [2 (2 −λ)−3] + 2 [12 −3 (4 −λ)]

=−λ3+ 7λ2+ 8λ−3.

The roots of this polynomial are the eigenvalues of A:

λ1= 7.9579

λ2=−1.2577

λ3= 0.2997.

The eigenvectors corresponding to each eigenvalue can be found using the original equation. The eigenvectors associ-

ated with these eigenvalues can then be found.

>>> import numpy as np

>>> from scipy import linalg

>>> A=np.array([[1,2],[3,4]])

>>> la,v =linalg.eig(A)

>>> l1,l2 =la

>>> print l1, l2 #eigenvalues

(-0.372281323269+0j) (5.37228132327+0j)

>>> print v[:,0]#first eigenvector

[-0.82456484 0.56576746]

>>> print v[:,1]#second eigenvector

[-0.41597356 -0.90937671]

>>> print np.sum(abs(v**2),axis=0)#eigenvectors are unitary

[ 1. 1. ]

>>> v1 =np.array(v[:,0]).T

>>> print linalg.norm(A.dot(v1)-l1*v1) #check the computation

3.23682852457e-16

1.9. Linear Algebra (scipy.linalg) 59

SciPy Reference Guide, Release 0.13.0

Singular value decomposition

Singular Value Decompostion (SVD) can be thought of as an extension of the eigenvalue problem to matrices that are

not square. Let Abe an M×Nmatrix with Mand Narbitrary. The matrices AHAand AAHare square hermitian

matrices 2of size N×Nand M×Mrespectively. It is known that the eigenvalues of square hermitian matrices are

real and non-negative. In addtion, there are at most min (M, N)identical non-zero eigenvalues of AHAand AAH.

Deﬁne these positive eigenvalues as σ2

i.The square-root of these are called singular values of A.The eigenvectors of

AHAare collected by columns into an N×Nunitary 3matrix Vwhile the eigenvectors of AAHare collected by

columns in the unitary matrix U, the singular values are collected in an M×Nzero matrix Σwith main diagonal

entries set to the singular values. Then

A=UΣVH

is the singular-value decomposition of A.Every matrix has a singular value decomposition. Sometimes, the singular

values are called the spectrum of A.The command linalg.svd will return U,VH, and σias an array of the

singular values. To obtain the matrix Σuse linalg.diagsvd. The following example illustrates the use of

linalg.svd .

>>> import numpy as np

>>> from scipy import linalg

>>> A=np.array([[1,2,3],[4,5,6]])

>>> A

array([[1, 2, 3],

[4, 5, 6]])

>>> M,N =A.shape

>>> U,s,Vh =linalg.svd(A)

>>> Sig =linalg.diagsvd(s,M,N)

>>> U, Vh =U, Vh

>>> U

array([[-0.3863177 , -0.92236578],

[-0.92236578, 0.3863177 ]])

>>> Sig

array([[ 9.508032 , 0. , 0. ],

[ 0. , 0.77286964, 0. ]])

>>> Vh

array([[-0.42866713, -0.56630692, -0.7039467 ],

[ 0.80596391, 0.11238241, -0.58119908],

[ 0.40824829, -0.81649658, 0.40824829]])

>>> U.dot(Sig.dot(Vh)) #check computation

array([[ 1., 2., 3.],

[ 4., 5., 6.]])

LU decomposition

The LU decompostion ﬁnds a representation for the M×Nmatrix Aas

A=PLU

where Pis an M×Mpermutation matrix (a permutation of the rows of the identity matrix), Lis in M×Klower

triangular or trapezoidal matrix ( K= min (M, N)) with unit-diagonal, and Uis an upper triangular or trapezoidal

matrix. The SciPy command for this decomposition is linalg.lu .

Such a decomposition is often useful for solving many simultaneous equations where the left-hand-side does not

change but the right hand side does. For example, suppose we are going to solve

Axi=bi

2A hermitian matrix Dsatisﬁes DH=D.

3A unitary matrix Dsatisﬁes DHD=I=DDHso that D−1=DH

60 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

for many different bi. The LU decomposition allows this to be written as

PLUxi=bi.

Because Lis lower-triangular, the equation can be solved for Uxiand ﬁnally xivery rapidly using forward- and

back-substitution. An initial time spent factoring Aallows for very rapid solution of similar systems of equa-

tions in the future. If the intent for performing LU decomposition is for solving linear systems then the command

linalg.lu_factor should be used followed by repeated applications of the command linalg.lu_solve to

solve the system for each new right-hand-side.

Cholesky decomposition

Cholesky decomposition is a special case of LU decomposition applicable to Hermitian positive deﬁnite matrices.

When A=AHand xHAx ≥0for all x, then decompositions of Acan be found so that

A=UHU

A=LLH

where Lis lower-triangular and Uis upper triangular. Notice that L=UH.The command linagl.cholesky

computes the cholesky factorization. For using cholesky factorization to solve systems of equations there are also

linalg.cho_factor and linalg.cho_solve routines that work similarly to their LU decomposition coun-

terparts.

QR decomposition

The QR decomposition (sometimes called a polar decomposition) works for any M×Narray and ﬁnds an M×M

unitary matrix Qand an M×Nupper-trapezoidal matrix Rsuch that

A=QR.

Notice that if the SVD of Ais known then the QR decomposition can be found

A=UΣVH=QR

implies that Q=Uand R=ΣVH.Note, however, that in SciPy independent algorithms are used to ﬁnd QR and

SVD decompositions. The command for QR decomposition is linalg.qr .

Schur decomposition

For a square N×Nmatrix, A, the Schur decomposition ﬁnds (not-necessarily unique) matrices Tand Zsuch that

A=ZTZH

where Zis a unitary matrix and Tis either upper-triangular or quasi-upper triangular depending on whether or not a

real schur form or complex schur form is requested. For a real schur form both Tand Zare real-valued when Ais

real-valued. When Ais a real-valued matrix the real schur form is only quasi-upper triangular because 2×2blocks

extrude from the main diagonal corresponding to any complex- valued eigenvalues. The command linalg.schur

ﬁnds the Schur decomposition while the command linalg.rsf2csf converts Tand Zfrom a real Schur form to

a complex Schur form. The Schur form is especially useful in calculating functions of matrices.

The following example illustrates the schur decomposition:

1.9. Linear Algebra (scipy.linalg) 61

SciPy Reference Guide, Release 0.13.0

>>> from scipy import linalg

>>> A=mat(’[132;145;236]’)

>>> T,Z =linalg.schur(A)

>>> T1,Z1 =linalg.schur(A,’complex’)

>>> T2,Z2 =linalg.rsf2csf(T,Z)

>>> print T

[[ 9.90012467 1.78947961 -0.65498528]

[ 0. 0.54993766 -1.57754789]

[ 0. 0.51260928 0.54993766]]

>>> print T2

[[ 9.90012467 +0.00000000e+00j -0.32436598 +1.55463542e+00j

-0.88619748 +5.69027615e-01j]

[ 0.00000000 +0.00000000e+00j 0.54993766 +8.99258408e-01j

1.06493862 +1.37016050e-17j]

[ 0.00000000 +0.00000000e+00j 0.00000000 +0.00000000e+00j

0.54993766 -8.99258408e-01j]]

>>> print abs(T1-T2) # different

[[ 1.24357637e-14 2.09205364e+00 6.56028192e-01]

[ 0.00000000e+00 4.00296604e-16 1.83223097e+00]

[ 0.00000000e+00 0.00000000e+00 4.57756680e-16]]

>>> print abs(Z1-Z2) # different

[[ 0.06833781 1.10591375 0.23662249]

[ 0.11857169 0.5585604 0.29617525]

[ 0.12624999 0.75656818 0.22975038]]

>>> T,Z,T1,Z1,T2,Z2 =map(mat,(T,Z,T1,Z1,T2,Z2))

>>> print abs(A-Z*T*Z.H) # same

[[ 1.11022302e-16 4.44089210e-16 4.44089210e-16]

[ 4.44089210e-16 1.33226763e-15 8.88178420e-16]

[ 8.88178420e-16 4.44089210e-16 2.66453526e-15]]

>>> print abs(A-Z1*T1*Z1.H) # same

[[ 1.00043248e-15 2.22301403e-15 5.55749485e-15]

[ 2.88899660e-15 8.44927041e-15 9.77322008e-15]

[ 3.11291538e-15 1.15463228e-14 1.15464861e-14]]

>>> print abs(A-Z2*T2*Z2.H) # same

[[ 3.34058710e-16 8.88611201e-16 4.18773089e-18]

[ 1.48694940e-16 8.95109973e-16 8.92966151e-16]

[ 1.33228956e-15 1.33582317e-15 3.55373104e-15]]

Interpolative Decomposition

scipy.linalg.interpolative contains routines for computing the interpolative decomposition (ID) of a ma-

trix. For a matrix A∈Cm×nof rank k≤min{m, n}this is a factorization

AΠ = AΠ1AΠ2=AΠ1I T ,

where Π = [Π1,Π2]is a permutation matrix with Π1∈ {0,1}n×k, i.e., AΠ2=AΠ1T. This can equivalently be

written as A=BP , where B=AΠ1and P= [I, T ]ΠTare the skeleton and interpolation matrices, respectively.

See Also

scipy.linalg.interpolative — for more information.

62 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

1.9.4 Matrix Functions

Consider the function f(x)with Taylor series expansion

f(x) = ∞

k=0

f(k)(0)

k!xk.

A matrix function can be deﬁned using this Taylor series for the square matrix Aas

f(A) = ∞

k=0

f(k)(0)

k!Ak.

While, this serves as a useful representation of a matrix function, it is rarely the best way to calculate a matrix function.

Exponential and logarithm functions

The matrix exponential is one of the more common matrix functions. It can be deﬁned for square matrices as

eA=∞

k=0

k!Ak.

The command linalg.expm3 uses this Taylor series deﬁnition to compute the matrix exponential. Due to poor

convergence properties it is not often used.

Another method to compute the matrix exponential is to ﬁnd an eigenvalue decomposition of A:

A=VΛV−1

and note that

eA=VeΛV−1

where the matrix exponential of the diagonal matrix Λis just the exponential of its elements. This method is imple-

mented in linalg.expm2 .

The preferred method for implementing the matrix exponential is to use scaling and a Padé approximation for ex.

This algorithm is implemented as linalg.expm .

The inverse of the matrix exponential is the matrix logarithm deﬁned as the inverse of the matrix exponential.

A≡exp (log (A)) .

The matrix logarithm can be obtained with linalg.logm .

Trigonometric functions

The trigonometric functions sin ,cos , and tan are implemented for matrices in linalg.sinm,linalg.cosm,

and linalg.tanm respectively. The matrix sin and cosine can be deﬁned using Euler’s identity as

sin (A) = ejA−e−jA

cos (A) = ejA+e−jA

The tangent is

tan (x) = sin (x)

cos (x)= [cos (x)]−1sin (x)

and so the matrix tangent is deﬁned as

[cos (A)]−1sin (A).

1.9. Linear Algebra (scipy.linalg) 63

SciPy Reference Guide, Release 0.13.0

Hyperbolic trigonometric functions

The hyperbolic trigonemetric functions sinh ,cosh , and tanh can also be deﬁned for matrices using the familiar

deﬁnitions:

sinh (A) = eA−e−A

cosh (A) = eA+e−A

tanh (A) = [cosh (A)]−1sinh (A).

These matrix functions can be found using linalg.sinhm,linalg.coshm , and linalg.tanhm.

Arbitrary function

Finally, any arbitrary function that takes one complex number and returns a complex number can be called as a matrix

function using the command linalg.funm. This command takes the matrix and an arbitrary Python function. It

then implements an algorithm from Golub and Van Loan’s book “Matrix Computations “to compute function applied

to the matrix using a Schur decomposition. Note that the function needs to accept complex numbers as input in order

to work with this algorithm. For example the following code computes the zeroth-order Bessel function applied to a

matrix.

>>> from scipy import special, random, linalg

>>> A=random.rand(3,3)

>>> B=linalg.funm(A,lambda x: special.jv(0,x))

>>> print A

[[ 0.72578091 0.34105276 0.79570345]

[ 0.65767207 0.73855618 0.541453 ]

[ 0.78397086 0.68043507 0.4837898 ]]

>>> print B

[[ 0.72599893 -0.20545711 -0.22721101]

[-0.27426769 0.77255139 -0.23422637]

[-0.27612103 -0.21754832 0.7556849 ]]

>>> print linalg.eigvals(A)

[ 1.91262611+0.j 0.21846476+0.j -0.18296399+0.j]

>>> print special.jv(0, linalg.eigvals(A))

[ 0.27448286+0.j 0.98810383+0.j 0.99164854+0.j]

>>> print linalg.eigvals(B)

[ 0.27448286+0.j 0.98810383+0.j 0.99164854+0.j]

Note how, by virtue of how matrix analytic functions are deﬁned, the Bessel function has acted on the matrix eigen-

values.

1.9.5 Special matrices

SciPy and NumPy provide several functions for creating special matrices that are frequently used in engineering and

science.

64 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

Type Function Description

block diagonal scipy.linalg.block_diag Create a block diagonal matrix from the provided arrays.

circulant scipy.linalg.circulant Construct a circulant matrix.

companion scipy.linalg.companion Create a companion matrix.

Hadamard scipy.linalg.hadamard Construct a Hadamard matrix.

Hankel scipy.linalg.hankel Construct a Hankel matrix.

Hilbert scipy.linalg.hilbert Construct a Hilbert matrix.

Inverse Hilbert scipy.linalg.invhilbert Construct the inverse of a Hilbert matrix.

Leslie scipy.linalg.leslie Create a Leslie matrix.

Pascal scipy.linalg.pascal Create a Pascal matrix.

Toeplitz scipy.linalg.toeplitz Construct a Toeplitz matrix.

Van der Monde numpy.vander Generate a Van der Monde matrix.

For examples of the use of these functions, see their respective docstrings.

1.10 Sparse Eigenvalue Problems with ARPACK

1.10.1 Introduction

ARPACK is a Fortran package which provides routines for quickly ﬁnding a few eigenvalues/eigenvectors of large

sparse matrices. In order to ﬁnd these solutions, it requires only left-multiplication by the matrix in question. This

operation is performed through a reverse-communication interface. The result of this structure is that ARPACK is able

to ﬁnd eigenvalues and eigenvectors of any linear function mapping a vector to a vector.

All of the functionality provided in ARPACK is contained within the two high-level interfaces

scipy.sparse.linalg.eigs and scipy.sparse.linalg.eigsh.eigs provides interfaces to

ﬁnd the eigenvalues/vectors of real or complex nonsymmetric square matrices, while eigsh provides interfaces for

real-symmetric or complex-hermitian matrices.

1.10.2 Basic Functionality

ARPACK can solve either standard eigenvalue problems of the form

Ax=λx

or general eigenvalue problems of the form

Ax=λMx

The power of ARPACK is that it can compute only a speciﬁed subset of eigenvalue/eigenvector pairs. This is accom-

plished through the keyword which. The following values of which are available:

•which = ’LM’ : Eigenvalues with largest magnitude (eigs,eigsh), that is, largest eigenvalues in the

euclidean norm of complex numbers.

•which = ’SM’ : Eigenvalues with smallest magnitude (eigs,eigsh), that is, smallest eigenvalues in the

euclidean norm of complex numbers.

•which = ’LR’ : Eigenvalues with largest real part (eigs)

•which = ’SR’ : Eigenvalues with smallest real part (eigs)

•which = ’LI’ : Eigenvalues with largest imaginary part (eigs)

1.10. Sparse Eigenvalue Problems with ARPACK 65

SciPy Reference Guide, Release 0.13.0

•which = ’SI’ : Eigenvalues with smallest imaginary part (eigs)

•which = ’LA’ : Eigenvalues with largest algebraic value (eigsh), that is, largest eigenvalues inclusive of

any negative sign.

•which = ’SA’ : Eigenvalues with smallest algebraic value (eigsh), that is, smallest eigenvalues inclusive

of any negative sign.

•which = ’BE’ : Eigenvalues from both ends of the spectrum (eigsh)

Note that ARPACK is generally better at ﬁnding extremal eigenvalues: that is, eigenvalues with large magnitudes. In

particular, using which = ’SM’ may lead to slow execution time and/or anomalous results. A better approach is to

use shift-invert mode.

1.10.3 Shift-Invert Mode

Shift invert mode relies on the following observation. For the generalized eigenvalue problem

Ax=λMx

it can be shown that

(A−σM)−1Mx=νx

where

ν=1

λ−σ

1.10.4 Examples

Imagine you’d like to ﬁnd the smallest and largest eigenvalues and the corresponding eigenvectors for a

large matrix. ARPACK can handle many forms of input: dense matrices such as numpy.ndarray in-

stances, sparse matrices such as scipy.sparse.csr_matrix, or a general linear operator derived from

scipy.sparse.linalg.LinearOperator. For this example, for simplicity, we’ll construct a symmetric,

positive-deﬁnite matrix.

>>> import numpy as np

>>> from scipy.linalg import eigh

>>> from scipy.sparse.linalg import eigsh

>>> np.set_printoptions(suppress=True)

>>>

>>> np.random.seed(0)

>>> X=np.random.random((100,100)) -0.5

>>> X=np.dot(X, X.T) #create a symmetric matrix

We now have a symmetric matrix Xwith which to test the routines. First compute a standard eigenvalue decomposition

using eigh:

>>> evals_all, evecs_all =eigh(X)

As the dimension of Xgrows, this routine becomes very slow. Especially if only a few eigenvectors and eigenvalues

are needed, ARPACK can be a better option. First let’s compute the largest eigenvalues (which = ’LM’) of Xand

compare them to the known results:

66 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

>>> evals_large, evecs_large =eigsh(X, 3, which=’LM’)

>>> print evals_all[-3:]

[ 29.1446102 30.05821805 31.19467646]

>>> print evals_large

[ 29.1446102 30.05821805 31.19467646]

>>> print np.dot(evecs_large.T, evecs_all[:,-3:])

[[-1. 0. 0.]

[ 0. 1. 0.]

[-0. 0. -1.]]

The results are as expected. ARPACK recovers the desired eigenvalues, and they match the previously known results.

Furthermore, the eigenvectors are orthogonal, as we’d expect. Now let’s attempt to solve for the eigenvalues with

smallest magnitude:

>>> evals_small, evecs_small =eigsh(X, 3, which=’SM’)

scipy.sparse.linalg.eigen.arpack.arpack.ArpackNoConvergence:

ARPACK error -1: No convergence (1001 iterations, 0/3 eigenvectors converged)

Oops. We see that as mentioned above, ARPACK is not quite as adept at ﬁnding small eigenvalues. There are a few

ways this problem can be addressed. We could increase the tolerance (tol) to lead to faster convergence:

>>> evals_small, evecs_small =eigsh(X, 3, which=’SM’, tol=1E-2)

>>> print evals_all[:3]

[ 0.0003783 0.00122714 0.00715878]

>>> print evals_small

[ 0.00037831 0.00122714 0.00715881]

>>> print np.dot(evecs_small.T, evecs_all[:,:3])

[[ 0.99999999 0.00000024 -0.00000049]

[-0.00000023 0.99999999 0.00000056]

[ 0.00000031 -0.00000037 0.99999852]]

This works, but we lose the precision in the results. Another option is to increase the maximum number of iterations

(maxiter) from 1000 to 5000:

>>> evals_small, evecs_small =eigsh(X, 3, which=’SM’, maxiter=5000)

>>> print evals_all[:3]

[ 0.0003783 0.00122714 0.00715878]

>>> print evals_small

[ 0.0003783 0.00122714 0.00715878]

>>> print np.dot(evecs_small.T, evecs_all[:,:3])

[[ 1. 0. 0.]

[-0. 1. 0.]

[ 0. 0. -1.]]

We get the results we’d hoped for, but the computation time is much longer. Fortunately, ARPACK contains a mode that

allows quick determination of non-external eigenvalues: shift-invert mode. As mentioned above, this mode involves

transforming the eigenvalue problem to an equivalent problem with different eigenvalues. In this case, we hope to ﬁnd

eigenvalues near zero, so we’ll choose sigma = 0. The transformed eigenvalues will then satisfy ν= 1/(σ−λ) =

1/λ, so our small eigenvalues λbecome large eigenvalues ν.

>>> evals_small, evecs_small =eigsh(X, 3, sigma=0, which=’LM’)

>>> print evals_all[:3]

[ 0.0003783 0.00122714 0.00715878]

>>> print evals_small

[ 0.0003783 0.00122714 0.00715878]

>>> print np.dot(evecs_small.T, evecs_all[:,:3])

[[ 1. 0. 0.]

[ 0. -1. -0.]

[-0. -0. 1.]]

1.10. Sparse Eigenvalue Problems with ARPACK 67

SciPy Reference Guide, Release 0.13.0

We get the results we were hoping for, with much less computational time. Note that the transformation from ν→λ

takes place entirely in the background. The user need not worry about the details.

The shift-invert mode provides more than just a fast way to obtain a few small eigenvalues. Say you desire to ﬁnd

internal eigenvalues and eigenvectors, e.g. those nearest to λ= 1. Simply set sigma = 1 and ARPACK takes care

of the rest:

>>> evals_mid, evecs_mid =eigsh(X, 3, sigma=1, which=’LM’)

>>> i_sort =np.argsort(abs(1. /(1-evals_all)))[-3:]

>>> print evals_all[i_sort]

[ 1.16577199 0.85081388 1.06642272]

>>> print evals_mid

[ 0.85081388 1.06642272 1.16577199]

>>> print np.dot(evecs_mid.T, evecs_all[:,i_sort])

[[-0. 1. 0.]

[-0. -0. 1.]

[ 1. 0. 0.]]

The eigenvalues come out in a different order, but they’re all there. Note that the shift-invert mode requires

the internal solution of a matrix inverse. This is taken care of automatically by eigsh and eigs, but the

operation can also be speciﬁed by the user. See the docstring of scipy.sparse.linalg.eigsh and

scipy.sparse.linalg.eigs for details.

1.10.5 References

1.11 Compressed Sparse Graph Routines scipy.sparse.csgraph

1.11.1 Example: Word Ladders

AWord Ladder is a word game invented by Lewis Carroll in which players ﬁnd paths between words by switching

one letter at a time. For example, one can link “ape” and “man” in the following way:

ape →apt →ait →bit →big →bag →mag →man

Note that each step involves changing just one letter of the word. This is just one possible path from “ape” to “man”,

but is it the shortest possible path? If we desire to ﬁnd the shortest word ladder path between two given words, the

sparse graph submodule can help.

First we need a list of valid words. Many operating systems have such a list built-in. For example, on linux, a word

list can often be found at one of the following locations:

/usr/share/dict

/var/lib/dict

Another easy source for words are the scrabble word lists available at various sites around the internet (search with

your favorite search engine). We’ll ﬁrst create this list. The system word lists consist of a ﬁle with one word per line.

The following should be modiﬁed to use the particular word list you have available:

>>> word_list =open(’/usr/share/dict/words’).readlines()

>>> word_list =map(str.strip, word_list)

We want to look at words of length 3, so let’s select just those words of the correct length. We’ll also eliminate words

which start with upper-case (proper nouns) or contain non alpha-numeric characters like apostrophes and hyphens.

Finally, we’ll make sure everything is lower-case for comparison later:

68 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

>>> word_list =[word for word in word_list if len(word) == 3]

>>> word_list =[word for word in word_list if word[0].islower()]

>>> word_list =[word for word in word_list if word.isalpha()]

>>> word_list =map(str.lower, word_list)

>>> len(word_list)

586

Now we have a list of 586 valid three-letter words (the exact number may change depending on the particular list

used). Each of these words will become a node in our graph, and we will create edges connecting the nodes associated

with each pair of words which differs by only one letter.

There are efﬁcient ways to do this, and inefﬁcient ways to do this. To do this as efﬁciently as possible, we’re going to

use some sophisticated numpy array manipulation:

>>> import numpy as np

>>> word_list =np.asarray(word_list)

>>> word_list.dtype

dtype(’|S3’)

>>> word_list.sort() # sort for quick searching later

We have an array where each entry is three bytes. We’d like to ﬁnd all pairs where exactly one byte is different. We’ll

start by converting each word to a three-dimensional vector:

>>> word_bytes =np.ndarray((word_list.size, word_list.itemsize),

... dtype=’int8’,

... buffer=word_list.data)

>>> word_bytes.shape

(586, 3)

Now we’ll use the Hamming distance between each point to determine which pairs of words are connected. The

Hamming distance measures the fraction of entries between two vectors which differ: any two words with a hamming

distance equal to 1/N, where Nis the number of letters, are connected in the word ladder:

>>> from scipy.spatial.distance import pdist, squareform

>>> from scipy.sparse import csr_matrix

>>> hamming_dist =pdist(word_bytes, metric=’hamming’)

>>> graph =csr_matrix(squareform(hamming_dist <1.5 /word_list.itemsize))

When comparing the distances, we don’t use an equality because this can be unstable for ﬂoating point values. The

inequality produces the desired result as long as no two entries of the word list are identical. Now that our graph is set

up, we’ll use a shortest path search to ﬁnd the path between any two words in the graph:

>>> i1 =word_list.searchsorted(’ape’)

>>> i2 =word_list.searchsorted(’man’)

>>> word_list[i1]

’ape’

>>> word_list[i2]

’man’

We need to check that these match, because if the words are not in the list that will not be the case. Now all we need

is to ﬁnd the shortest path between these two indices in the graph. We’ll use dijkstra’s algorithm, because it allows us

to ﬁnd the path for just one node:

>>> from scipy.sparse.csgraph import dijkstra

>>> distances, predecessors =dijkstra(graph, indices=i1,

... return_predecessors=True)

>>> print distances[i2]

5.0

1.11. Compressed Sparse Graph Routines scipy.sparse.csgraph 69

SciPy Reference Guide, Release 0.13.0

So we see that the shortest path between ‘ape’ and ‘man’ contains only ﬁve steps. We can use the predecessors returned

by the algorithm to reconstruct this path:

>>> path =[]

>>> i=i2

>>> while i!= i1:

>>> path.append(word_list[i])

>>> i=predecessors[i]

>>> path.append(word_list[i1])

>>> print path[::-1]

[’ape’, ’apt’, ’opt’, ’oat’, ’mat’, ’man’]

This is three fewer links than our initial example: the path from ape to man is only ﬁve steps.

Using other tools in the module, we can answer other questions. For example, are there three-letter words which are

not linked in a word ladder? This is a question of connected components in the graph:

>>> from scipy.sparse.csgraph import connected_components

>>> N_components, component_list =connected_components(graph)

>>> print N_components

In this particular sample of three-letter words, there are 15 connected components: that is, 15 distinct sets of words with

no paths between the sets. How many words are in each of these sets? We can learn this from the list of components:

>>> [np.sum(component_list == i) for iin range(15)]

[571, 1, 1, 1, 2, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]

There is one large connected set, and 14 smaller ones. Let’s look at the words in the smaller ones:

>>> [list(word_list[np.where(component_list == i)]) for iin range(1,15)]

[[’aha’],

[’chi’],

[’ebb’],

[’ems’, ’emu’],

[’gnu’],

[’ism’],

[’khz’],

[’nth’],

[’ova’],

[’qua’],

[’ugh’],

[’ups’],

[’urn’],

[’use’]]

These are all the three-letter words which do not connect to others via a word ladder.

We might also be curious about which words are maximally separated. Which two words take the most links to

connect? We can determine this by computing the matrix of all shortest paths. Note that by convention, the distance

between two non-connected points is reported to be inﬁnity, so we’ll need to remove these before ﬁnding the maximum:

>>> distances, predecessors =dijkstra(graph, return_predecessors=True)

>>> np.max(distances[~np.isinf(distances)])

13.0

So there is at least one pair of words which takes 13 steps to get from one to the other! Let’s determine which these

are:

>>> i1, i2 =np.where(distances == 13)

>>> zip(word_list[i1], word_list[i2])

70 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

[(’imp’, ’ohm’),

(’imp’, ’ohs’),

(’ohm’, ’imp’),

(’ohm’, ’ump’),

(’ohs’, ’imp’),

(’ohs’, ’ump’),

(’ump’, ’ohm’),

(’ump’, ’ohs’)]

We see that there are two pairs of words which are maximally separated from each other: ‘imp’ and ‘ump’ on one

hand, and ‘ohm’ and ‘ohs’ on the other hand. We can ﬁnd the connecting list in the same way as above:

>>> path =[]

>>> i=i2[0]

>>> while i!= i1[0]:

>>> path.append(word_list[i])

>>> i=predecessors[i1[0], i]

>>> path.append(word_list[i1[0]])

>>> print path[::-1]

[’imp’, ’amp’, ’asp’, ’ask’, ’ark’, ’are’, ’aye’, ’rye’, ’roe’, ’woe’, ’woo’, ’who’, ’oho’, ’ohm’]

This gives us the path we desired to see.

Word ladders are just one potential application of scipy’s fast graph algorithms for sparse matrices. Graph theory

makes appearances in many areas of mathematics, data analysis, and machine learning. The sparse graph tools are

ﬂexible enough to handle many of these situations.

1.12 Spatial data structures and algorithms (scipy.spatial)

scipy.spatial can compute triangulations, Voronoi diagrams, and convex hulls of a set of points, by leveraging

the Qhull library.

Moreover, it contains KDTree implementations for nearest-neighbor point queries, and utilities for distance compu-

tations in various metrics.

1.12.1 Delaunay triangulations

The Delaunay triangulation is a subdivision of a set of points into a non-overlapping set of triangles, such that no point

is inside the circumcircle of any triangle. In practice, such triangulations tend to avoid triangles with small angles.

Delaunay triangulation can be computed using scipy.spatial as follows:

>>> from scipy.spatial import Delaunay

>>> points =np.array([[0,0], [0,1.1], [1,0], [1,1]])

>>> tri =Delaunay(points)

We can visualize it:

>>> import matplotlib.pyplot as plt

>>> plt.triplot(points[:,0], points[:,1], tri.simplices.copy())

>>> plt.plot(points[:,0], points[:,1], ’o’)

And add some further decorations:

>>> for j, p in enumerate(points):

... plt.text(p[0]-0.03, p[1]+0.03, j, ha=’right’)# label the points

1.12. Spatial data structures and algorithms (scipy.spatial) 71

SciPy Reference Guide, Release 0.13.0

>>> for j, s in enumerate(tri.simplices):

... p=points[s].mean(axis=0)

... plt.text(p[0], p[1], ’#%d’%j, ha=’center’)# label triangles

>>> plt.xlim(-0.5,1.5); plt.ylim(-0.5,1.5)

>>> plt.show()

The structure of the triangulation is encoded in the following way: the simplices attribute contains the indices of

the points in the points array that make up the triangle. For instance:

>>> i=1

>>> tri.simplices[i,:]

array([3, 1, 0], dtype=int32)

>>> points[tri.simplices[i,:]]

array([[ 1. , 1. ],

[ 0. , 1.1],

[ 0. , 0. ]])

Moreover, neighboring triangles can also be found out:

>>> tri.neighbors[i]

array([-1, 0, -1], dtype=int32)

What this tells us is that this triangle has triangle #0 as a neighbor, but no other neighbors. Moreover, it tells us that

neighbor 0 is opposite the vertex 1 of the triangle:

>>> points[tri.simplices[i, 1]]

array([ 0. , 1.1])

Indeed, from the ﬁgure we see that this is the case.

Qhull can also perform tesselations to simplices also for higher-dimensional point sets (for instance, subdivision into

tetrahedra in 3-D).

Coplanar points

It is important to note that not all points necessarily appear as vertices of the triangulation, due to numerical precision

issues in forming the triangulation. Consider the above with a duplicated point:

72 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

>>> points =np.array([[0,0], [0,1], [1,0], [1,1], [1,1]])

>>> tri =Delaunay(points)

>>> np.unique(tri.simplices.ravel())

array([0, 1, 2, 3], dtype=int32)

Observe that point #4, which is a duplicate, does not occur as a vertex of the triangulation. That this happened is

recorded:

>>> tri.coplanar

array([[4, 0, 3]], dtype=int32)

This means that point 4 resides near triangle 0 and vertex 3, but is not included in the triangulation.

Note that such degeneracies can occur not only because of duplicated points, but also for more complicated geometrical

reasons, even in point sets that at ﬁrst sight seem well-behaved.

However, Qhull has the “QJ” option, which instructs it to perturb the input data randomly until degeneracies are

resolved:

>>> tri =Delaunay(points, qhull_options="QJ Pp")

>>> points[tri.simplices]

array([[[1, 1],

[1, 0],

[0, 0]],

[[1, 1],

[1, 1],

[1, 0]],

[[0, 1],

[1, 1],

[0, 0]],

[[0, 1],

[1, 1],

[1, 1]]])

Two new triangles appeared. However, we see that they are degenerate and have zero area.

1.12.2 Convex hulls

Convex hull is the smallest convex object containing all points in a given point set.

These can be computed via the Qhull wrappers in scipy.spatial as follows:

>>> from scipy.spatial import ConvexHull

>>> points =np.random.rand(30,2)# 30 random points in 2-D

>>> hull =ConvexHull(points)

The convex hull is represented as a set of N-1 dimensional simplices, which in 2-D means line segments. The storage

scheme is exactly the same as for the simplices in the Delaunay triangulation discussed above.

We can illustrate the above result:

>>> import matplotlib.pyplot as plt

>>> plt.plot(points[:,0], points[:,1], ’o’)

>>> for simplex in hull.simplices:

>>> plt.plot(points[simplex,0], points[simplex,1], ’k-’)

>>> plt.show()

1.12. Spatial data structures and algorithms (scipy.spatial) 73

SciPy Reference Guide, Release 0.13.0

The same can be achieved with scipy.spatial.convex_hull_plot_2d.

1.12.3 Voronoi diagrams

A Voronoi diagram is a subdivision of the space into the nearest neighborhoods of a given set of points.

There are two ways to approach this object using scipy.spatial. First, one can use the KDTree to answer the

question “which of the points is closest to this one”, and deﬁne the regions that way:

>>> from scipy.spatial import KDTree

>>> points =np.array([[0,0], [0,1], [0,2], [1,0], [1,1], [1,2],

... [2,0], [2,1], [2,2]])

>>> tree =KDTree(points)

>>> tree.query([0.1,0.1])

(0.14142135623730953, 0)

So the point (0.1, 0.1) belongs to region 0. In color:

>>> x=np.linspace(-0.5,2.5,31)

>>> y=np.linspace(-0.5,2.5,33)

>>> xx, yy =np.meshgrid(x, y)

>>> xy =np.c_[xx.ravel(), yy.ravel()]

>>> import matplotlib.pyplot as plt

>>> plt.pcolor(x, y, tree.query(xy)[1].reshape(33,31))

>>> plt.plot(points[:,0], points[:,1], ’ko’)

>>> plt.show()

74 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

This does not, however, give the Voronoi diagram as a geometrical object.

The representation in terms of lines and points can be again obtained via the Qhull wrappers in scipy.spatial:

>>> from scipy.spatial import Voronoi

>>> vor =Voronoi(points)

>>> vor.vertices

array([[ 0.5, 0.5],

[ 1.5, 0.5],

[ 0.5, 1.5],

[ 1.5, 1.5]])

The Voronoi vertices denote the set of points forming the polygonal edges of the Voronoi regions. In this case, there

are 9 different regions:

>>> vor.regions

[[-1, 0], [-1, 1], [1, -1, 0], [3, -1, 2], [-1, 3], [-1, 2], [3, 1, 0, 2], [2, -1, 0], [3, -1, 1]]

Negative value -1 again indicates a point at inﬁnity. Indeed, only one of the regions, [3, 1, 0, 2], is bounded.

Note here that due to similar numerical precision issues as in Delaunay triangulation above, there may be fewer

Voronoi regions than input points.

The ridges (lines in 2-D) separating the regions are described as a similar collection of simplices as the convex hull

pieces:

>>> vor.ridge_vertices

[[-1, 0], [-1, 0], [-1, 1], [-1, 1], [0, 1], [-1, 3], [-1, 2], [2, 3], [-1, 3], [-1, 2], [0, 2], [1, 3]]

These numbers indicate indices of the Voronoi vertices making up the line segments. -1 is again a point at inﬁnity —

only four of the 12 lines is a bounded line segment while the others extend to inﬁnity.

The Voronoi ridges are perpendicular to lines drawn between the input points. Which two points each ridge corre-

sponds to is also recorded:

>>> vor.ridge_points

array([[0, 3],

[0, 1],

[6, 3],

[6, 7],

[3, 4],

1.12. Spatial data structures and algorithms (scipy.spatial) 75

SciPy Reference Guide, Release 0.13.0

[5, 8],

[5, 2],

[5, 4],

[8, 7],

[2, 1],

[4, 1],

[4, 7]], dtype=int32)

This information, taken together, is enough to construct the full diagram.

We can plot it as follows. First the points and the Voronoi vertices:

>>> plt.plot(points[:,0], points[:,1], ’o’)

>>> plt.plot(vor.vertices[:,0], vor.vertices[:,1], ’*’)

>>> plt.xlim(-1,3); plt.ylim(-1,3)

Plotting the ﬁnite line segments goes as for the convex hull, but now we have to guard for the inﬁnite edges:

>>> for simplex in vor.ridge_vertices:

>>> simplex =np.asarray(simplex)

>>> if np.all(simplex >= 0):

>>> plt.plot(vor.vertices[simplex,0], vor.vertices[simplex,1], ’k-’)

The ridges extending to inﬁnity require a bit more care:

>>> center =points.mean(axis=0)

>>> for pointidx, simplex in zip(vor.ridge_points, vor.ridge_vertices):

>>> simplex =np.asarray(simplex)

>>> if np.any(simplex <0):

>>> i=simplex[simplex >= 0][0]# finite end Voronoi vertex

>>> t=points[pointidx[1]] -points[pointidx[0]] # tangent

>>> t/= np.linalg.norm(t)

>>> n=np.array([-t[1], t[0]]) # normal

>>> midpoint =points[pointidx].mean(axis=0)

>>> far_point =vor.vertices[i] +np.sign(np.dot(midpoint -center, n)) *n*100

>>> plt.plot([vor.vertices[i,0], far_point[0]],

... [vor.vertices[i,1], far_point[1]], ’k--’)

>>> plt.show()

76 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

This plot can also be created using scipy.spatial.voronoi_plot_2d.

1.13 Statistics (scipy.stats)

1.13.1 Introduction

In this tutorial we discuss many, but certainly not all, features of scipy.stats. The intention here is to provide a

user with a working knowledge of this package. We refer to the reference manual for further details.

Note: This documentation is work in progress.

1.13.2 Random Variables

There are two general distribution classes that have been implemented for encapsulating continuous random variables

and discrete random variables . Over 80 continuous random variables (RVs) and 10 discrete random variables have

been implemented using these classes. Besides this, new routines and distributions can easily added by the end user.

(If you create one, please contribute it).

All of the statistics functions are located in the sub-package scipy.stats and a fairly complete listing of these

functions can be obtained using info(stats). The list of the random variables available can also be obtained from

the docstring for the stats sub-package.

In the discussion below we mostly focus on continuous RVs. Nearly all applies to discrete variables also, but we point

out some differences here: Speciﬁc Points for Discrete Distributions.

Getting Help

First of all, all distributions are accompanied with help functions. To obtain just some basic information we can call

>>> from scipy import stats

>>> from scipy.stats import norm

>>> print norm.__doc__

To ﬁnd the support, i.e., upper and lower bound of the distribution, call:

>>> print ’bounds of distribution lower: %s, upper: %s’%(norm.a,norm.b)

bounds of distribution lower: -inf, upper: inf

We can list all methods and properties of the distribution with dir(norm). As it turns out, some of the methods

are private methods although they are not named as such (their name does not start with a leading underscore), for

example veccdf, are only available for internal calculation.

To obtain the real main methods, we list the methods of the frozen distribution. (We explain the meaning of a frozen

distribution below).

>>> rv =norm()

>>> dir(rv) #reformatted

[’__class__’, ’__delattr__’, ’__dict__’, ’__doc__’, ’__getattribute__’,

’__hash__’, ’__init__’, ’__module__’, ’__new__’, ’__reduce__’, ’__reduce_ex__’,

’__repr__’, ’__setattr__’, ’__str__’, ’__weakref__’, ’args’, ’cdf’, ’dist’,

’entropy’, ’isf’, ’kwds’, ’moment’, ’pdf’, ’pmf’, ’ppf’, ’rvs’, ’sf’, ’stats’]

Finally, we can obtain the list of available distribution through introspection:

1.13. Statistics (scipy.stats) 77

SciPy Reference Guide, Release 0.13.0

>>> import warnings

>>> warnings.simplefilter(’ignore’,DeprecationWarning)

>>> dist_continu =[d for din dir(stats) if

... isinstance(getattr(stats,d), stats.rv_continuous)]

>>> dist_discrete =[d for din dir(stats) if

... isinstance(getattr(stats,d), stats.rv_discrete)]

>>> print ’number of continuous distributions:’,len(dist_continu)

number of continuous distributions: 84

>>> print ’number of discrete distributions: ’,len(dist_discrete)

number of discrete distributions: 12

Common Methods

The main public methods for continuous RVs are:

• rvs: Random Variates

• pdf: Probability Density Function

• cdf: Cumulative Distribution Function

• sf: Survival Function (1-CDF)

• ppf: Percent Point Function (Inverse of CDF)

• isf: Inverse Survival Function (Inverse of SF)

• stats: Return mean, variance, (Fisher’s) skew, or (Fisher’s) kurtosis

• moment: non-central moments of the distribution

Let’s take a normal RV as an example.

>>> norm.cdf(0)

0.5

To compute the cdf at a number of points, we can pass a list or a numpy array.

>>> norm.cdf([-1.,0,1])

array([ 0.15865525, 0.5 , 0.84134475])

>>> import numpy as np

>>> norm.cdf(np.array([-1.,0,1]))

array([ 0.15865525, 0.5 , 0.84134475])

Thus, the basic methods such as pdf,cdf, and so on are vectorized with np.vectorize.

Other generally useful methods are supported too:

>>> norm.mean(), norm.std(), norm.var()

(0.0, 1.0, 1.0)

>>> norm.stats(moments ="mv")

(array(0.0), array(1.0))

To ﬁnd the median of a distribution we can use the percent point function ppf, which is the inverse of the cdf:

>>> norm.ppf(0.5)

0.0

To generate a set of random variates:

>>> norm.rvs(size=5)

array([-0.35687759, 1.34347647, -0.11710531, -1.00725181, -0.51275702])

78 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

Don’t think that norm.rvs(5) generates 5 variates:

>>> norm.rvs(5)

7.131624370075814

This brings us, in fact, to the topic of the next subsection.

Shifting and Scaling

All continuous distributions take loc and scale as keyword parameters to adjust the location and scale of the

distribution, e.g. for the standard normal distribution the location is the mean and the scale is the standard deviation.

>>> norm.stats(loc =3, scale =4, moments ="mv")

(array(3.0), array(16.0))

In general the standardized distribution for a random variable Xis obtained through the transformation (X - loc)

/ scale. The default values are loc = 0 and scale = 1.

Smart use of loc and scale can help modify the standard distributions in many ways. To illustrate the scaling

further, the cdf of an exponentially distributed RV with mean 1/λ is given by

F(x) = 1 −exp(−λx)

By applying the scaling rule above, it can be seen that by taking scale = 1./lambda we get the proper scale.

>>> from scipy.stats import expon

>>> expon.mean(scale =3.)

3.0

The uniform distribution is also interesting:

>>> from scipy.stats import uniform

>>> uniform.cdf([0,1,2,3,4,5], loc =1, scale =4)

array([ 0. , 0. , 0.25, 0.5 , 0.75, 1. ])

Finally, recall from the previous paragraph that we are left with the problem of the meaning of norm.rvs(5). As it

turns out, calling a distribution like this, the ﬁrst argument, i.e., the 5, gets passed to set the loc parameter. Let’s see:

>>> np.mean(norm.rvs(5, size=500))

4.983550784784704

Thus, to explain the output of the example of the last section: norm.rvs(5)‘ generates a normally

distributed random variate with mean ‘‘loc=5.

I prefer to set the loc and scale parameter explicitly, by passing the values as keywords rather than as arguments.

This is less of a hassle as it may seem. We clarify this below when we explain the topic of freezing a RV.

Shape Parameters

While a general continuous random variable can be shifted and scaled with the loc and scale parameters, some

distributions require additional shape parameters. For instance, the gamma distribution, with density

γ(x, n) = λ(λx)n−1

Γ(n)e−λx ,

requires the shape parameter n. Observe that setting λcan be obtained by setting the scale keyword to 1/λ.

Let’s check the number and name of the shape parameters of the gamma distribution. (We know from the above that

this should be 1.)

1.13. Statistics (scipy.stats) 79

SciPy Reference Guide, Release 0.13.0

>>> from scipy.stats import gamma

>>> gamma.numargs

>>> gamma.shapes

’a’

Now we set the value of the shape variable to 1 to obtain the exponential distribution, so that we compare easily

whether we get the results we expect.

>>> gamma(1, scale=2.).stats(moments="mv")

(array(2.0), array(4.0))

Notice that we can also specify shape parameters as keywords:

>>> gamma(a=1, scale=2.).stats(moments="mv")

(array(2.0), array(4.0))

Freezing a Distribution

Passing the loc and scale keywords time and again can become quite bothersome. The concept of freezing a RV is

used to solve such problems.

>>> rv =gamma(1, scale=2.)

By using rv we no longer have to include the scale or the shape parameters anymore. Thus, distributions can be used

in one of two ways, either by passing all distribution parameters to each method call (such as we did earlier) or by

freezing the parameters for the instance of the distribution. Let us check this:

>>> rv.mean(), rv.std()

(2.0, 2.0)

This is indeed what we should get.

Broadcasting

The basic methods pdf and so on satisfy the usual numpy broadcasting rules. For example, we can calculate the

critical values for the upper tail of the t distribution for different probabilites and degrees of freedom.

>>> stats.t.isf([0.1,0.05,0.01], [[10], [11]])

array([[ 1.37218364, 1.81246112, 2.76376946],

[ 1.36343032, 1.79588482, 2.71807918]])

Here, the ﬁrst row are the critical values for 10 degrees of freedom and the second row for 11 degrees of freedom

(d.o.f.). Thus, the broadcasting rules give the same result of calling isf twice:

>>> stats.t.isf([0.1,0.05,0.01], 10)

array([ 1.37218364, 1.81246112, 2.76376946])

>>> stats.t.isf([0.1,0.05,0.01], 11)

array([ 1.36343032, 1.79588482, 2.71807918])

If the array with probabilities, i.e, [0.1, 0.05, 0.01] and the array of degrees of freedom i.e., [10, 11,

12], have the same array shape, then element wise matching is used. As an example, we can obtain the 10% tail for

10 d.o.f., the 5% tail for 11 d.o.f. and the 1% tail for 12 d.o.f. by calling

>>> stats.t.isf([0.1,0.05,0.01], [10,11,12])

array([ 1.37218364, 1.79588482, 2.68099799])

80 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

Speciﬁc Points for Discrete Distributions

Discrete distribution have mostly the same basic methods as the continuous distributions. However pdf is replaced

the probability mass function pmf, no estimation methods, such as ﬁt, are available, and scale is not a valid keyword

parameter. The location parameter, keyword loc can still be used to shift the distribution.

The computation of the cdf requires some extra attention. In the case of continuous distribution the cumulative distri-

bution function is in most standard cases strictly monotonic increasing in the bounds (a,b) and has therefore a unique

inverse. The cdf of a discrete distribution, however, is a step function, hence the inverse cdf, i.e., the percent point

function, requires a different deﬁnition:

ppf(q) = min{x : cdf(x) >= q, x integer}

For further info, see the docs here.

We can look at the hypergeometric distribution as an example

>>> from scipy.stats import hypergeom

>>> [M, n, N] =[20,7,12]

If we use the cdf at some integer points and then evaluate the ppf at those cdf values, we get the initial integers back,

for example

>>> x=np.arange(4)*2

>>> x

array([0, 2, 4, 6])

>>> prb =hypergeom.cdf(x, M, n, N)

>>> prb

array([ 0.0001031991744066, 0.0521155830753351, 0.6083591331269301,

0.9897832817337386])

>>> hypergeom.ppf(prb, M, n, N)

array([ 0., 2., 4., 6.])

If we use values that are not at the kinks of the cdf step function, we get the next higher integer back:

>>> hypergeom.ppf(prb+1e-8, M, n, N)

array([ 1., 3., 5., 7.])

>>> hypergeom.ppf(prb-1e-8, M, n, N)

array([ 0., 2., 4., 6.])

Fitting Distributions

The main additional methods of the not frozen distribution are related to the estimation of distribution parameters:

•ﬁt: maximum likelihood estimation of distribution parameters, including location

and scale

• ﬁt_loc_scale: estimation of location and scale when shape parameters are given

• nnlf: negative log likelihood function

• expect: Calculate the expectation of a function against the pdf or pmf

Performance Issues and Cautionary Remarks

The performance of the individual methods, in terms of speed, varies widely by distribution and method. The results of

a method are obtained in one of two ways: either by explicit calculation, or by a generic algorithm that is independent

of the speciﬁc distribution.

1.13. Statistics (scipy.stats) 81

SciPy Reference Guide, Release 0.13.0

Explicit calculation, on the one hand, requires that the method is directly speciﬁed for the given distribution, either

through analytic formulas or through special functions in scipy.special or numpy.random for rvs. These are

usually relatively fast calculations.

The generic methods, on the other hand, are used if the distribution does not specify any explicit calcula-

tion. To deﬁne a distribution, only one of pdf or cdf is necessary; all other methods can be derived using nu-

meric integration and root ﬁnding. However, these indirect methods can be very slow. As an example, rgh =

stats.gausshyper.rvs(0.5, 2, 2, 2, size=100) creates random variables in a very indirect way and

takes about 19 seconds for 100 random variables on my computer, while one million random variables from the

standard normal or from the t distribution take just above one second.

Remaining Issues

The distributions in scipy.stats have recently been corrected and improved and gained a considerable test suite,

however a few issues remain:

• skew and kurtosis, 3rd and 4th moments and entropy are not thoroughly tested and some coarse testing indicates

that there are still some incorrect results left.

• the distributions have been tested over some range of parameters, however in some corner ranges, a few incorrect

results may remain.

• the maximum likelihood estimation in ﬁt does not work with default starting parameters for all distributions

and the user needs to supply good starting parameters. Also, for some distribution using a maximum likelihood

estimator might inherently not be the best choice.

1.13.3 Building Speciﬁc Distributions

The next examples shows how to build your own distributions. Further examples show the usage of the distributions

and some statistical tests.

Making a Continuous Distribution, i.e., Subclassing rv_continuous

Making continuous distributions is fairly simple.

>>> from scipy import stats

>>> class deterministic_gen(stats.rv_continuous):

... def _cdf(self, x ):

... return np.where(x<0,0.,1.)

... def _stats(self):

... return 0.,0.,0.,0.

>>> deterministic =deterministic_gen(name="deterministic")

>>> deterministic.cdf(np.arange(-3,3,0.5))

array([ 0., 0., 0., 0., 0., 0., 1., 1., 1., 1., 1., 1.])

Interestingly, the pdf is now computed automatically:

>>> deterministic.pdf(np.arange(-3,3,0.5))

array([ 0.00000000e+00, 0.00000000e+00, 0.00000000e+00,

0.00000000e+00, 0.00000000e+00, 0.00000000e+00,

5.83333333e+04, 4.16333634e-12, 4.16333634e-12,

4.16333634e-12, 4.16333634e-12, 4.16333634e-12])

82 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

Be aware of the performance issues mentions in Performance Issues and Cautionary Remarks. The computation of

unspeciﬁed common methods can become very slow, since only general methods are called which, by their very nature,

cannot use any speciﬁc information about the distribution. Thus, as a cautionary example:

>>> from scipy.integrate import quad

>>> quad(deterministic.pdf, -1e-1,1e-1)

(4.163336342344337e-13, 0.0)

But this is not correct: the integral over this pdf should be 1. Let’s make the integration interval smaller:

>>> quad(deterministic.pdf, -1e-3,1e-3)# warning removed

(1.000076872229173, 0.0010625571718182458)

This looks better. However, the problem originated from the fact that the pdf is not speciﬁed in the class deﬁnition of

the deterministic distribution.

Subclassing rv_discrete

In the following we use stats.rv_discrete to generate a discrete distribution that has the probabilities of the

truncated normal for the intervals centered around the integers.

General Info

From the docstring of rv_discrete, i.e.,

>>> from scipy.stats import rv_discrete

>>> help(rv_discrete)

we learn that:

“You can construct an aribtrary discrete rv where P{X=xk} = pk by passing to the rv_discrete initialization

method (through the values= keyword) a tuple of sequences (xk, pk) which describes only those values of X

(xk) that occur with nonzero probability (pk).”

Next to this, there are some further requirements for this approach to work:

• The keyword name is required.

• The support points of the distribution xk have to be integers.

• The number of signiﬁcant digits (decimals) needs to be speciﬁed.

In fact, if the last two requirements are not satisﬁed an exception may be raised or the resulting numbers may be

incorrect.

An Example

Let’s do the work. First

>>> npoints =20 # number of integer support points of the distribution minus 1

>>> npointsh =npoints /2

>>> npointsf =float(npoints)

>>> nbound =4# bounds for the truncated normal

>>> normbound =(1+1/npointsf) *nbound # actual bounds of truncated normal

>>> grid =np.arange(-npointsh, npointsh+2,1)# integer grid

>>> gridlimitsnorm =(grid-0.5)/npointsh *nbound # bin limits for the truncnorm

>>> gridlimits =grid -0.5 # used later in the analysis

>>> grid =grid[:-1]

>>> probs =np.diff(stats.truncnorm.cdf(gridlimitsnorm, -normbound, normbound))

>>> gridint =grid

And ﬁnally we can subclass rv_discrete:

1.13. Statistics (scipy.stats) 83

SciPy Reference Guide, Release 0.13.0

>>> normdiscrete =stats.rv_discrete(values=(gridint,

... np.round(probs, decimals=7)), name=’normdiscrete’)

Now that we have deﬁned the distribution, we have access to all common methods of discrete distributions.

>>> print ’mean = %6.4f, variance = %6.4f, skew = %6.4f, kurtosis = %6.4f’%\

... normdiscrete.stats(moments =’mvsk’)

mean = -0.0000, variance = 6.3302, skew = 0.0000, kurtosis = -0.0076

>>> nd_std =np.sqrt(normdiscrete.stats(moments=’v’))

Testing the Implementation

Let’s generate a random sample and compare observed frequencies with the probabilities.

>>> n_sample =500

>>> np.random.seed(87655678)# fix the seed for replicability

>>> rvs =normdiscrete.rvs(size=n_sample)

>>> rvsnd =rvs

>>> f,l=np.histogram(rvs, bins=gridlimits)

>>> sfreq =np.vstack([gridint, f, probs*n_sample]).T

>>> print sfreq

[[ -1.00000000e+01 0.00000000e+00 2.95019349e-02]

[ -9.00000000e+00 0.00000000e+00 1.32294142e-01]

[ -8.00000000e+00 0.00000000e+00 5.06497902e-01]

[ -7.00000000e+00 2.00000000e+00 1.65568919e+00]

[ -6.00000000e+00 1.00000000e+00 4.62125309e+00]

[ -5.00000000e+00 9.00000000e+00 1.10137298e+01]

[ -4.00000000e+00 2.60000000e+01 2.24137683e+01]

[ -3.00000000e+00 3.70000000e+01 3.89503370e+01]

[ -2.00000000e+00 5.10000000e+01 5.78004747e+01]

[ -1.00000000e+00 7.10000000e+01 7.32455414e+01]

[ 0.00000000e+00 7.40000000e+01 7.92618251e+01]

[ 1.00000000e+00 8.90000000e+01 7.32455414e+01]

[ 2.00000000e+00 5.50000000e+01 5.78004747e+01]

[ 3.00000000e+00 5.00000000e+01 3.89503370e+01]

[ 4.00000000e+00 1.70000000e+01 2.24137683e+01]

[ 5.00000000e+00 1.10000000e+01 1.10137298e+01]

[ 6.00000000e+00 4.00000000e+00 4.62125309e+00]

[ 7.00000000e+00 3.00000000e+00 1.65568919e+00]

[ 8.00000000e+00 0.00000000e+00 5.06497902e-01]

[ 9.00000000e+00 0.00000000e+00 1.32294142e-01]

[ 1.00000000e+01 0.00000000e+00 2.95019349e-02]]

84 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

Next, we can test, whether our sample was generated by our normdiscrete distribution. This also veriﬁes whether the

random numbers are generated correctly.

The chisquare test requires that there are a minimum number of observations in each bin. We combine the tail bins

into larger bins so that they contain enough observations.

>>> f2 =np.hstack([f[:5].sum(), f[5:-5], f[-5:].sum()])

>>> p2 =np.hstack([probs[:5].sum(), probs[5:-5], probs[-5:].sum()])

>>> ch2, pval =stats.chisquare(f2, p2*n_sample)

>>> print ’chisquare for normdiscrete: chi2 = %6.3f pvalue = %6.4f’%(ch2, pval)

chisquare for normdiscrete: chi2 = 12.466 pvalue = 0.4090

The pvalue in this case is high, so we can be quite conﬁdent that our random sample was actually generated by the

distribution.

1.13. Statistics (scipy.stats) 85

SciPy Reference Guide, Release 0.13.0

1.13.4 Analysing One Sample

First, we create some random variables. We set a seed so that in each run we get identical results to look at. As an

example we take a sample from the Student t distribution:

>>> np.random.seed(282629734)

>>> x=stats.t.rvs(10, size=1000)

Here, we set the required shape parameter of the t distribution, which in statistics corresponds to the degrees of

freedom, to 10. Using size=1000 means that our sample consists of 1000 independently drawn (pseudo) random

numbers. Since we did not specify the keyword arguments loc and scale, those are set to their default values zero and

one.

Descriptive Statistics

xis a numpy array, and we have direct access to all array methods, e.g.

>>> print x.max(), x.min() # equivalent to np.max(x), np.min(x)

5.26327732981 -3.78975572422

>>> print x.mean(), x.var() # equivalent to np.mean(x), np.var(x)

0.0140610663985 1.28899386208

How do the some sample properties compare to their theoretical counterparts?

>>> m,v,s,k=stats.t.stats(10, moments=’mvsk’)

>>> n, (smin, smax), sm, sv, ss, sk =stats.describe(x)

>>> print ’distribution:’,

distribution:

>>> sstr =’mean = %6.4f, variance = %6.4f, skew = %6.4f, kurtosis = %6.4f’

>>> print sstr %(m, v, s ,k)

mean = 0.0000, variance = 1.2500, skew = 0.0000, kurtosis = 1.0000

>>> print ’sample: ’,

sample:

>>> print sstr %(sm, sv, ss, sk)

mean = 0.0141, variance = 1.2903, skew = 0.2165, kurtosis = 1.0556

Note: stats.describe uses the unbiased estimator for the variance, while np.var is the biased estimator.

For our sample the sample statistics differ a by a small amount from their theoretical counterparts.

T-test and KS-test

We can use the t-test to test whether the mean of our sample differs in a statistcally signiﬁcant way from the theoretical

expectation.

>>> print ’t-statistic = %6.3f pvalue = %6.4f’%stats.ttest_1samp(x, m)

t-statistic = 0.391 pvalue = 0.6955

The pvalue is 0.7, this means that with an alpha error of, for example, 10%, we cannot reject the hypothesis that the

sample mean is equal to zero, the expectation of the standard t-distribution.

As an exercise, we can calculate our ttest also directly without using the provided function, which should give us the

same answer, and so it does:

>>> tt =(sm-m)/np.sqrt(sv/float(n)) # t-statistic for mean

>>> pval =stats.t.sf(np.abs(tt), n-1)*2# two-sided pvalue = Prob(abs(t)>tt)

86 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

>>> print ’t-statistic = %6.3f pvalue = %6.4f’%(tt, pval)

t-statistic = 0.391 pvalue = 0.6955

The Kolmogorov-Smirnov test can be used to test the hypothesis that the sample comes from the standard t-distribution

>>> print ’KS-statistic D = %6.3f pvalue = %6.4f’%stats.kstest(x, ’t’, (10,))

KS-statistic D = 0.016 pvalue = 0.9606

Again the p-value is high enough that we cannot reject the hypothesis that the random sample really is distributed

according to the t-distribution. In real applications, we don’t know what the underlying distribution is. If we perform

the Kolmogorov-Smirnov test of our sample against the standard normal distribution, then we also cannot reject the

hypothesis that our sample was generated by the normal distribution given that in this example the p-value is almost

40%.

>>> print ’KS-statistic D = %6.3f pvalue = %6.4f’%stats.kstest(x,’norm’)

KS-statistic D = 0.028 pvalue = 0.3949

However, the standard normal distribution has a variance of 1, while our sample has a variance of 1.29. If we stan-

dardize our sample and test it against the normal distribution, then the p-value is again large enough that we cannot

reject the hypothesis that the sample came form the normal distribution.

>>> d, pval =stats.kstest((x-x.mean())/x.std(), ’norm’)

>>> print ’KS-statistic D = %6.3f pvalue = %6.4f’%(d, pval)

KS-statistic D = 0.032 pvalue = 0.2402

Note: The Kolmogorov-Smirnov test assumes that we test against a distribution with given parameters, since in the

last case we estimated mean and variance, this assumption is violated, and the distribution of the test statistic on which

the p-value is based, is not correct.

Tails of the distribution

Finally, we can check the upper tail of the distribution. We can use the percent point function ppf, which is the inverse

of the cdf function, to obtain the critical values, or, more directly, we can use the inverse of the survival function

>>> crit01, crit05, crit10 =stats.t.ppf([1-0.01,1-0.05,1-0.10], 10)

>>> print ’critical values from ppf at 1%%, 5%% and 10%% %8.4f %8.4f %8.4f’%(crit01, crit05, crit10)

critical values from ppf at 1%, 5% and 10% 2.7638 1.8125 1.3722

>>> print ’critical values from isf at 1%%, 5%% and 10%% %8.4f %8.4f %8.4f’%tuple(stats.t.isf([0.01,0.05,0.10],10))

critical values from isf at 1%, 5% and 10% 2.7638 1.8125 1.3722

>>> freq01 =np.sum(x>crit01) /float(n) *100

>>> freq05 =np.sum(x>crit05) /float(n) *100

>>> freq10 =np.sum(x>crit10) /float(n) *100

>>> print ’sample %%-frequency at 1%%, 5%% and 10%% tail %8.4f %8.4f %8.4f’%(freq01, freq05, freq10)

sample %-frequency at 1%, 5% and 10% tail 1.4000 5.8000 10.5000

In all three cases, our sample has more weight in the top tail than the underlying distribution. We can brieﬂy check

a larger sample to see if we get a closer match. In this case the empirical frequency is quite close to the theoretical

probability, but if we repeat this several times the ﬂuctuations are still pretty large.

>>> freq05l =np.sum(stats.t.rvs(10, size=10000)>crit05) /10000.0 *100

>>> print ’larger sample %%-frequency at 5%% tail %8.4f’%freq05l

larger sample %-frequency at 5% tail 4.8000

We can also compare it with the tail of the normal distribution, which has less weight in the tails:

1.13. Statistics (scipy.stats) 87

SciPy Reference Guide, Release 0.13.0

>>> print ’tail prob. of normal at 1%%, 5%% and 10%% %8.4f %8.4f %8.4f’%\

... tuple(stats.norm.sf([crit01, crit05, crit10])*100)

tail prob. of normal at 1%, 5% and 10% 0.2857 3.4957 8.5003

The chisquare test can be used to test, whether for a ﬁnite number of bins, the observed frequencies differ signiﬁcantly

from the probabilites of the hypothesized distribution.

>>> quantiles =[0.0,0.01,0.05,0.1,1-0.10,1-0.05,1-0.01,1.0]

>>> crit =stats.t.ppf(quantiles, 10)

>>> print crit

[ -Inf -2.76376946 -1.81246112 -1.37218364 1.37218364 1.81246112

2.76376946 Inf]

>>> n_sample =x.size

>>> freqcount =np.histogram(x, bins=crit)[0]

>>> tprob =np.diff(quantiles)

>>> nprob =np.diff(stats.norm.cdf(crit))

>>> tch, tpval =stats.chisquare(freqcount, tprob*n_sample)

>>> nch, npval =stats.chisquare(freqcount, nprob*n_sample)

>>> print ’chisquare for t: chi2 = %6.3f pvalue = %6.4f’%(tch, tpval)

chisquare for t: chi2 = 2.300 pvalue = 0.8901

>>> print ’chisquare for normal: chi2 = %6.3f pvalue = %6.4f’%(nch, npval)

chisquare for normal: chi2 = 64.605 pvalue = 0.0000

We see that the standard normal distribution is clearly rejected while the standard t-distribution cannot be rejected.

Since the variance of our sample differs from both standard distribution, we can again redo the test taking the estimate

for scale and location into account.

The ﬁt method of the distributions can be used to estimate the parameters of the distribution, and the test is repeated

using probabilites of the estimated distribution.

>>> tdof, tloc, tscale =stats.t.fit(x)

>>> nloc, nscale =stats.norm.fit(x)

>>> tprob =np.diff(stats.t.cdf(crit, tdof, loc=tloc, scale=tscale))

>>> nprob =np.diff(stats.norm.cdf(crit, loc=nloc, scale=nscale))

>>> tch, tpval =stats.chisquare(freqcount, tprob*n_sample)

>>> nch, npval =stats.chisquare(freqcount, nprob*n_sample)

>>> print ’chisquare for t: chi2 = %6.3f pvalue = %6.4f’%(tch, tpval)

chisquare for t: chi2 = 1.577 pvalue = 0.9542

>>> print ’chisquare for normal: chi2 = %6.3f pvalue = %6.4f’%(nch, npval)

chisquare for normal: chi2 = 11.084 pvalue = 0.0858

Taking account of the estimated parameters, we can still reject the hypothesis that our sample came from a normal

distribution (at the 5% level), but again, with a p-value of 0.95, we cannot reject the t distribution.

Special tests for normal distributions

Since the normal distribution is the most common distribution in statistics, there are several additional functions

available to test whether a sample could have been drawn from a normal distribution

First we can test if skew and kurtosis of our sample differ signiﬁcantly from those of a normal distribution:

>>> print ’normal skewtest teststat = %6.3f pvalue = %6.4f’%stats.skewtest(x)

normal skewtest teststat = 2.785 pvalue = 0.0054

>>> print ’normal kurtosistest teststat = %6.3f pvalue = %6.4f’%stats.kurtosistest(x)

normal kurtosistest teststat = 4.757 pvalue = 0.0000

These two tests are combined in the normality test

88 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

>>> print ’normaltest teststat = %6.3f pvalue = %6.4f’%stats.normaltest(x)

normaltest teststat = 30.379 pvalue = 0.0000

In all three tests the p-values are very low and we can reject the hypothesis that the our sample has skew and kurtosis

of the normal distribution.

Since skew and kurtosis of our sample are based on central moments, we get exactly the same results if we test the

standardized sample:

>>> print ’normaltest teststat = %6.3f pvalue = %6.4f’%\

... stats.normaltest((x-x.mean())/x.std())

normaltest teststat = 30.379 pvalue = 0.0000

Because normality is rejected so strongly, we can check whether the normaltest gives reasonable results for other

cases:

>>> print ’normaltest teststat = %6.3f pvalue = %6.4f’%stats.normaltest(stats.t.rvs(10, size=100))

normaltest teststat = 4.698 pvalue = 0.0955

>>> print ’normaltest teststat = %6.3f pvalue = %6.4f’%stats.normaltest(stats.norm.rvs(size=1000))

normaltest teststat = 0.613 pvalue = 0.7361

When testing for normality of a small sample of t-distributed observations and a large sample of normal distributed

observation, then in neither case can we reject the null hypothesis that the sample comes from a normal distribution.

In the ﬁrst case this is because the test is not powerful enough to distinguish a t and a normally distributed random

variable in a small sample.

1.13.5 Comparing two samples

In the following, we are given two samples, which can come either from the same or from different distribution, and

we want to test whether these samples have the same statistical properties.

Comparing means

Test with sample with identical means:

>>> rvs1 =stats.norm.rvs(loc=5, scale=10, size=500)

>>> rvs2 =stats.norm.rvs(loc=5, scale=10, size=500)

>>> stats.ttest_ind(rvs1, rvs2)

(-0.54890361750888583, 0.5831943748663857)

Test with sample with different means:

>>> rvs3 =stats.norm.rvs(loc=8, scale=10, size=500)

>>> stats.ttest_ind(rvs1, rvs3)

(-4.5334142901750321, 6.507128186505895e-006)

Kolmogorov-Smirnov test for two samples ks_2samp

For the example where both samples are drawn from the same distribution, we cannot reject the null hypothesis since

the pvalue is high

>>> stats.ks_2samp(rvs1, rvs2)

(0.025999999999999995, 0.99541195173064878)

In the second example, with different location, i.e. means, we can reject the null hypothesis since the pvalue is below

1.13. Statistics (scipy.stats) 89

SciPy Reference Guide, Release 0.13.0

>>> stats.ks_2samp(rvs1, rvs3)

(0.11399999999999999, 0.0027132103661283141)

1.13.6 Kernel Density Estimation

A common task in statistics is to estimate the probability density function (PDF) of a random variable from a set

of data samples. This task is called density estimation. The most well-known tool to do this is the histogram. A

histogram is a useful tool for visualization (mainly because everyone understands it), but doesn’t use the available data

very efﬁciently. Kernel density estimation (KDE) is a more efﬁcient tool for the same task. The gaussian_kde

estimator can be used to estimate the PDF of univariate as well as multivariate data. It works best if the data is

unimodal.

Univariate estimation

We start with a minimal amount of data in order to see how gaussian_kde works, and what the different options

for bandwidth selection do. The data sampled from the PDF is show as blue dashes at the bottom of the ﬁgure (this is

called a rug plot):

>>> from scipy import stats

>>> import matplotlib.pyplot as plt

>>> x1 =np.array([-7,-5,1,4,5], dtype=np.float)

>>> kde1 =stats.gaussian_kde(x1)

>>> kde2 =stats.gaussian_kde(x1, bw_method=’silverman’)

>>> fig =plt.figure()

>>> ax =fig.add_subplot(111)

>>> ax.plot(x1, np.zeros(x1.shape), ’b+’, ms=20)# rug plot

>>> x_eval =np.linspace(-10,10, num=200)

>>> ax.plot(x_eval, kde1(x_eval), ’k-’, label="Scott’s Rule")

>>> ax.plot(x_eval, kde1(x_eval), ’r-’, label="Silverman’s Rule")

>>> plt.show()

90 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

We see that there is very little difference between Scott’s Rule and Silverman’s Rule, and that the bandwidth selection

with a limited amount of data is probably a bit too wide. We can deﬁne our own bandwidth function to get a less

smoothed out result.

>>> def my_kde_bandwidth(obj, fac=1./5):

... """We use Scott’s Rule, multiplied by a constant factor."""

... return np.power(obj.n, -1./(obj.d+4)) *fac

>>> fig =plt.figure()

>>> ax =fig.add_subplot(111)

>>> ax.plot(x1, np.zeros(x1.shape), ’b+’, ms=20)# rug plot

>>> kde3 =stats.gaussian_kde(x1, bw_method=my_kde_bandwidth)

>>> ax.plot(x_eval, kde3(x_eval), ’g-’, label="With smaller BW")

>>> plt.show()

We see that if we set bandwidth to be very narrow, the obtained estimate for the probability density function (PDF) is

simply the sum of Gaussians around each data point.

We now take a more realistic example, and look at the difference between the two available bandwidth selection rules.

Those rules are known to work well for (close to) normal distributions, but even for unimodal distributions that are

quite strongly non-normal they work reasonably well. As a non-normal distribution we take a Student’s T distribution

with 5 degrees of freedom.

import numpy as np

import matplotlib.pyplot as plt

from scipy import stats

np.random.seed(12456)

x1 =np.random.normal(size=200)# random data, normal distribution

xs =np.linspace(x1.min()-1, x1.max()+1,200)

kde1 =stats.gaussian_kde(x1)

kde2 =stats.gaussian_kde(x1, bw_method=’silverman’)

fig =plt.figure(figsize=(8,6))

1.13. Statistics (scipy.stats) 91

SciPy Reference Guide, Release 0.13.0

ax1 =fig.add_subplot(211)

ax1.plot(x1, np.zeros(x1.shape), ’b+’, ms=12)# rug plot

ax1.plot(xs, kde1(xs), ’k-’, label="Scott’s Rule")

ax1.plot(xs, kde2(xs), ’b-’, label="Silverman’s Rule")

ax1.plot(xs, stats.norm.pdf(xs), ’r--’, label="True PDF")

ax1.set_xlabel(’x’)

ax1.set_ylabel(’Density’)

ax1.set_title("Normal (top) and Student’s T$_{df=5}$ (bottom) distributions")

ax1.legend(loc=1)

x2 =stats.t.rvs(5, size=200)# random data, T distribution

xs =np.linspace(x2.min() -1, x2.max() +1,200)

kde3 =stats.gaussian_kde(x2)

kde4 =stats.gaussian_kde(x2, bw_method=’silverman’)

ax2 =fig.add_subplot(212)

ax2.plot(x2, np.zeros(x2.shape), ’b+’, ms=12)# rug plot

ax2.plot(xs, kde3(xs), ’k-’, label="Scott’s Rule")

ax2.plot(xs, kde4(xs), ’b-’, label="Silverman’s Rule")

ax2.plot(xs, stats.t.pdf(xs, 5), ’r--’, label="True PDF")

ax2.set_xlabel(’x’)

ax2.set_ylabel(’Density’)

plt.show()

92 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

We now take a look at a bimodal distribution with one wider and one narrower Gaussian feature. We expect that this

will be a more difﬁcult density to approximate, due to the different bandwidths required to accurately resolve each

feature.

>>> from functools import partial

>>> loc1, scale1, size1 =(-2,1,175)

>>> loc2, scale2, size2 =(2,0.2,50)

>>> x2 =np.concatenate([np.random.normal(loc=loc1, scale=scale1, size=size1),

... np.random.normal(loc=loc2, scale=scale2, size=size2)])

>>> x_eval =np.linspace(x2.min() -1, x2.max() +1,500)

>>> kde =stats.gaussian_kde(x2)

>>> kde2 =stats.gaussian_kde(x2, bw_method=’silverman’)

>>> kde3 =stats.gaussian_kde(x2, bw_method=partial(my_kde_bandwidth, fac=0.2))

>>> kde4 =stats.gaussian_kde(x2, bw_method=partial(my_kde_bandwidth, fac=0.5))

>>> pdf =stats.norm.pdf

>>> bimodal_pdf =pdf(x_eval, loc=loc1, scale=scale1) *float(size1) /x2.size +\

... pdf(x_eval, loc=loc2, scale=scale2) *float(size2) /x2.size

>>> fig =plt.figure(figsize=(8,6))

>>> ax =fig.add_subplot(111)

1.13. Statistics (scipy.stats) 93

SciPy Reference Guide, Release 0.13.0

>>> ax.plot(x2, np.zeros(x2.shape), ’b+’, ms=12)

>>> ax.plot(x_eval, kde(x_eval), ’k-’, label="Scott’s Rule")

>>> ax.plot(x_eval, kde2(x_eval), ’b-’, label="Silverman’s Rule")

>>> ax.plot(x_eval, kde3(x_eval), ’g-’, label="Scott *0.2")

>>> ax.plot(x_eval, kde4(x_eval), ’c-’, label="Scott *0.5")

>>> ax.plot(x_eval, bimodal_pdf, ’r--’, label="Actual PDF")

>>> ax.set_xlim([x_eval.min(), x_eval.max()])

>>> ax.legend(loc=2)

>>> ax.set_xlabel(’x’)

>>> ax.set_ylabel(’Density’)

>>> plt.show()

As expected, the KDE is not as close to the true PDF as we would like due to the different characteristic size of the

two features of the bimodal distribution. By halving the default bandwidth (Scott *0.5) we can do somewhat

better, while using a factor 5 smaller bandwidth than the default doesn’t smooth enough. What we really need though

in this case is a non-uniform (adaptive) bandwidth.

Multivariate estimation

With gaussian_kde we can perform multivariate as well as univariate estimation. We demonstrate the bivariate

case. First we generate some random data with a model in which the two variates are correlated.

>>> def measure(n):

... """Measurement model, return two coupled measurements."""

94 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

... m1 =np.random.normal(size=n)

... m2 =np.random.normal(scale=0.5, size=n)

... return m1+m2, m1-m2

>>> m1, m2 =measure(2000)

>>> xmin =m1.min()

>>> xmax =m1.max()

>>> ymin =m2.min()

>>> ymax =m2.max()

Then we apply the KDE to the data:

>>> X,Y=np.mgrid[xmin:xmax:100j, ymin:ymax:100j]

>>> positions =np.vstack([X.ravel(), Y.ravel()])

>>> values =np.vstack([m1, m2])

>>> kernel =stats.gaussian_kde(values)

>>> Z=np.reshape(kernel.evaluate(positions).T, X.shape)

Finally we plot the estimated bivariate distribution as a colormap, and plot the individual data points on top.

>>> fig =plt.figure(figsize=(8,6))

>>> ax =fig.add_subplot(111)

>>> ax.imshow(np.rot90(Z), cmap=plt.cm.gist_earth_r,

... extent=[xmin, xmax, ymin, ymax])

>>> ax.plot(m1, m2, ’k.’, markersize=2)

>>> ax.set_xlim([xmin, xmax])

>>> ax.set_ylim([ymin, ymax])

>>> plt.show()

1.13. Statistics (scipy.stats) 95

SciPy Reference Guide, Release 0.13.0

1.14 Multidimensional image processing (scipy.ndimage)

1.14.1 Introduction

Image processing and analysis are generally seen as operations on two-dimensional arrays of values. There are how-

ever a number of ﬁelds where images of higher dimensionality must be analyzed. Good examples of these are medical

imaging and biological imaging. numpy is suited very well for this type of applications due its inherent multidimen-

sional nature. The scipy.ndimage packages provides a number of general image processing and analysis functions

that are designed to operate with arrays of arbitrary dimensionality. The packages currently includes functions for lin-

ear and non-linear ﬁltering, binary morphology, B-spline interpolation, and object measurements.

1.14.2 Properties shared by all functions

All functions share some common properties. Notably, all functions allow the speciﬁcation of an output array with the

output argument. With this argument you can specify an array that will be changed in-place with the result with the

operation. In this case the result is not returned. Usually, using the output argument is more efﬁcient, since an existing

array is used to store the result.

The type of arrays returned is dependent on the type of operation, but it is in most cases equal to the type of the input.

If, however, the output argument is used, the type of the result is equal to the type of the speciﬁed output argument.

96 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

If no output argument is given, it is still possible to specify what the result of the output should be. This is done by

simply assigning the desired numpy type object to the output argument. For example:

>>> correlate(np.arange(10), [1,2.5])

array([ 0, 2, 6, 9, 13, 16, 20, 23, 27, 30])

>>> correlate(np.arange(10), [1,2.5], output=np.float64)

array([ 0. , 2.5, 6. , 9.5, 13. , 16.5, 20. , 23.5, 27. , 30.5])

1.14.3 Filter functions

The functions described in this section all perform some type of spatial ﬁltering of the the input array: the elements

in the output are some function of the values in the neighborhood of the corresponding input element. We refer to

this neighborhood of elements as the ﬁlter kernel, which is often rectangular in shape but may also have an arbitrary

footprint. Many of the functions described below allow you to deﬁne the footprint of the kernel, by passing a mask

through the footprint parameter. For example a cross shaped kernel can be deﬁned as follows:

>>> footprint =array([[0,1,0],[1,1,1],[0,1,0]])

>>> footprint

array([[0, 1, 0],

[1, 1, 1],

[0, 1, 0]])

Usually the origin of the kernel is at the center calculated by dividing the dimensions of the kernel shape by two.

For instance, the origin of a one-dimensional kernel of length three is at the second element. Take for example the

correlation of a one-dimensional array with a ﬁlter of length 3 consisting of ones:

>>> a=[0,0,0,1,0,0,0]

>>> correlate1d(a, [1,1,1])

array([0, 0, 1, 1, 1, 0, 0])

Sometimes it is convenient to choose a different origin for the kernel. For this reason most functions support the origin

parameter which gives the origin of the ﬁlter relative to its center. For example:

>>> a=[0,0,0,1,0,0,0]

>>> correlate1d(a, [1,1,1], origin = -1)

array([0 1 1 1 0 0 0])

The effect is a shift of the result towards the left. This feature will not be needed very often, but it may be useful

especially for ﬁlters that have an even size. A good example is the calculation of backward and forward differences:

>>> a=[0,0,1,1,1,0,0]

>>> correlate1d(a, [-1,1]) # backward difference

array([ 0 0 1 0 0 -1 0])

>>> correlate1d(a, [-1,1], origin = -1)# forward difference

array([ 0 1 0 0 -1 0 0])

We could also have calculated the forward difference as follows:

>>> correlate1d(a, [0,-1,1])

array([ 0 1 0 0 -1 0 0])

However, using the origin parameter instead of a larger kernel is more efﬁcient. For multidimensional kernels origin

can be a number, in which case the origin is assumed to be equal along all axes, or a sequence giving the origin along

each axis.

Since the output elements are a function of elements in the neighborhood of the input elements, the borders of the

array need to be dealt with appropriately by providing the values outside the borders. This is done by assuming that

the arrays are extended beyond their boundaries according certain boundary conditions. In the functions described

1.14. Multidimensional image processing (scipy.ndimage) 97

SciPy Reference Guide, Release 0.13.0

below, the boundary conditions can be selected using the mode parameter which must be a string with the name of the

boundary condition. Following boundary conditions are currently supported:

“nearest” Use the value at the boundary [1 2 3]->[1 1 2 3 3]

“wrap” Periodically replicate the array [1 2 3]->[3 1 2 3 1]

“reﬂect” Reﬂect the array at the boundary [1 2 3]->[1 1 2 3 3]

“constant” Use a constant value, default is 0.0 [1 2 3]->[0 1 2 3 0]

The “constant” mode is special since it needs an additional parameter to specify the constant value that should be used.

Note: The easiest way to implement such boundary conditions would be to copy the data to a larger array and extend

the data at the borders according to the boundary conditions. For large arrays and large ﬁlter kernels, this would be

very memory consuming, and the functions described below therefore use a different approach that does not require

allocating large temporary buffers.

Correlation and convolution

The correlate1d function calculates a one-dimensional correlation along the given axis. The lines of the ar-

ray along the given axis are correlated with the given weights. The weights parameter must be a one-dimensional

sequences of numbers.

The function correlate implements multidimensional correlation of the input array with a given kernel.

The convolve1d function calculates a one-dimensional convolution along the given axis. The lines of the

array along the given axis are convoluted with the given weights. The weights parameter must be a one-

dimensional sequences of numbers.

Note: A convolution is essentially a correlation after mirroring the kernel. As a result, the origin parameter

behaves differently than in the case of a correlation: the result is shifted in the opposite directions.

The function convolve implements multidimensional convolution of the input array with a given kernel.

Note: A convolution is essentially a correlation after mirroring the kernel. As a result, the origin parameter

behaves differently than in the case of a correlation: the results is shifted in the opposite direction.

Smoothing ﬁlters

The gaussian_filter1d function implements a one-dimensional Gaussian ﬁlter. The standard-deviation

of the Gaussian ﬁlter is passed through the parameter sigma. Setting order = 0 corresponds to convolution with

a Gaussian kernel. An order of 1, 2, or 3 corresponds to convolution with the ﬁrst, second or third derivatives of

a Gaussian. Higher order derivatives are not implemented.

The gaussian_filter function implements a multidimensional Gaussian ﬁlter. The standard-deviations of

the Gaussian ﬁlter along each axis are passed through the parameter sigma as a sequence or numbers. If sigma

is not a sequence but a single number, the standard deviation of the ﬁlter is equal along all directions. The order

of the ﬁlter can be speciﬁed separately for each axis. An order of 0 corresponds to convolution with a Gaussian

kernel. An order of 1, 2, or 3 corresponds to convolution with the ﬁrst, second or third derivatives of a Gaussian.

Higher order derivatives are not implemented. The order parameter must be a number, to specify the same order

for all axes, or a sequence of numbers to specify a different order for each axis.

Note: The multidimensional ﬁlter is implemented as a sequence of one-dimensional Gaussian ﬁlters. The

intermediate arrays are stored in the same data type as the output. Therefore, for output types with a lower

precision, the results may be imprecise because intermediate results may be stored with insufﬁcient precision.

This can be prevented by specifying a more precise output type.

98 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

The uniform_filter1d function calculates a one-dimensional uniform ﬁlter of the given size along the

given axis.

The uniform_filter implements a multidimensional uniform ﬁlter. The sizes of the uniform ﬁlter are given

for each axis as a sequence of integers by the size parameter. If size is not a sequence, but a single number, the

sizes along all axis are assumed to be equal.

Note: The multidimensional ﬁlter is implemented as a sequence of one-dimensional uniform ﬁlters. The

intermediate arrays are stored in the same data type as the output. Therefore, for output types with a lower

precision, the results may be imprecise because intermediate results may be stored with insufﬁcient precision.

This can be prevented by specifying a more precise output type.

Filters based on order statistics

The minimum_filter1d function calculates a one-dimensional minimum ﬁlter of given size along the given

axis.

The maximum_filter1d function calculates a one-dimensional maximum ﬁlter of given size along the given

axis.

The minimum_filter function calculates a multidimensional minimum ﬁlter. Either the sizes of a rectangu-

lar kernel or the footprint of the kernel must be provided. The size parameter, if provided, must be a sequence of

sizes or a single number in which case the size of the ﬁlter is assumed to be equal along each axis. The footprint,

if provided, must be an array that deﬁnes the shape of the kernel by its non-zero elements.

The maximum_filter function calculates a multidimensional maximum ﬁlter. Either the sizes of a rectangu-

lar kernel or the footprint of the kernel must be provided. The size parameter, if provided, must be a sequence of

sizes or a single number in which case the size of the ﬁlter is assumed to be equal along each axis. The footprint,

if provided, must be an array that deﬁnes the shape of the kernel by its non-zero elements.

The rank_filter function calculates a multidimensional rank ﬁlter. The rank may be less then zero, i.e.,

rank = -1 indicates the largest element. Either the sizes of a rectangular kernel or the footprint of the kernel must

be provided. The size parameter, if provided, must be a sequence of sizes or a single number in which case the

size of the ﬁlter is assumed to be equal along each axis. The footprint, if provided, must be an array that deﬁnes

the shape of the kernel by its non-zero elements.

The percentile_filter function calculates a multidimensional percentile ﬁlter. The percentile may be

less then zero, i.e., percentile = -20 equals percentile = 80. Either the sizes of a rectangular kernel or the

footprint of the kernel must be provided. The size parameter, if provided, must be a sequence of sizes or a single

number in which case the size of the ﬁlter is assumed to be equal along each axis. The footprint, if provided,

must be an array that deﬁnes the shape of the kernel by its non-zero elements.

The median_filter function calculates a multidimensional median ﬁlter. Either the sizes of a rectangular

kernel or the footprint of the kernel must be provided. The size parameter, if provided, must be a sequence of

sizes or a single number in which case the size of the ﬁlter is assumed to be equal along each axis. The footprint

if provided, must be an array that deﬁnes the shape of the kernel by its non-zero elements.

Derivatives

Derivative ﬁlters can be constructed in several ways. The function gaussian_filter1d described in Smoothing

ﬁlters can be used to calculate derivatives along a given axis using the order parameter. Other derivative ﬁlters are the

Prewitt and Sobel ﬁlters:

The prewitt function calculates a derivative along the given axis.

The sobel function calculates a derivative along the given axis.

The Laplace ﬁlter is calculated by the sum of the second derivatives along all axes. Thus, different Laplace ﬁlters

can be constructed using different second derivative functions. Therefore we provide a general function that takes a

function argument to calculate the second derivative along a given direction and to construct the Laplace ﬁlter:

1.14. Multidimensional image processing (scipy.ndimage) 99

SciPy Reference Guide, Release 0.13.0

The function generic_laplace calculates a laplace ﬁlter using the function passed through derivative2

to calculate second derivatives. The function derivative2 should have the following signature:

derivative2(input, axis, output, mode, cval, *extra_arguments, **extra_keywords)

It should calculate the second derivative along the dimension axis. If output is not None it should use that for

the output and return None, otherwise it should return the result. mode,cval have the usual meaning.

The extra_arguments and extra_keywords arguments can be used to pass a tuple of extra arguments and a dic-

tionary of named arguments that are passed to derivative2 at each call.

For example:

>>> def d2(input, axis, output, mode, cval):

... return correlate1d(input, [1,-2,1], axis, output, mode, cval, 0)

...

>>> a=zeros((5,5))

>>> a[2,2]=1

>>> generic_laplace(a, d2)

array([[ 0., 0., 0., 0., 0.],

[ 0., 0., 1., 0., 0.],

[ 0., 1., -4., 1., 0.],

[ 0., 0., 1., 0., 0.],

[ 0., 0., 0., 0., 0.]])

To demonstrate the use of the extra_arguments argument we could do:

>>> def d2(input, axis, output, mode, cval, weights):

... return correlate1d(input, weights, axis, output, mode, cval, 0,)

...

>>> a=zeros((5,5))

>>> a[2,2]=1

>>> generic_laplace(a, d2, extra_arguments =([1,-2,1],))

array([[ 0., 0., 0., 0., 0.],

[ 0., 0., 1., 0., 0.],

[ 0., 1., -4., 1., 0.],

[ 0., 0., 1., 0., 0.],

[ 0., 0., 0., 0., 0.]])

or:

>>> generic_laplace(a, d2, extra_keywords ={’weights’: [1,-2,1]})

array([[ 0., 0., 0., 0., 0.],

[ 0., 0., 1., 0., 0.],

[ 0., 1., -4., 1., 0.],

[ 0., 0., 1., 0., 0.],

[ 0., 0., 0., 0., 0.]])

The following two functions are implemented using generic_laplace by providing appropriate functions for the

second derivative function:

The function laplace calculates the Laplace using discrete differentiation for the second derivative (i.e. con-

volution with [1, -2, 1]).

The function gaussian_laplace calculates the Laplace using gaussian_filter to calculate the second

derivatives. The standard-deviations of the Gaussian ﬁlter along each axis are passed through the parameter

sigma as a sequence or numbers. If sigma is not a sequence but a single number, the standard deviation of the

ﬁlter is equal along all directions.

The gradient magnitude is deﬁned as the square root of the sum of the squares of the gradients in all directions. Similar

to the generic Laplace function there is a generic_gradient_magnitude function that calculated the gradient

magnitude of an array:

The function generic_gradient_magnitude calculates a gradient magnitude using the function passed

through derivative to calculate ﬁrst derivatives. The function derivative should have the following

100 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

signature:

derivative(input, axis, output, mode, cval, *extra_arguments, **extra_keywords)

It should calculate the derivative along the dimension axis. If output is not None it should use that for the output

and return None, otherwise it should return the result. mode,cval have the usual meaning.

The extra_arguments and extra_keywords arguments can be used to pass a tuple of extra arguments and a dic-

tionary of named arguments that are passed to derivative at each call.

For example, the sobel function ﬁts the required signature:

>>> a=zeros((5,5))

>>> a[2,2]=1

>>> generic_gradient_magnitude(a, sobel)

array([[ 0. , 0. , 0. , 0. , 0. ],

[ 0. , 1.41421356, 2. , 1.41421356, 0. ],

[ 0. , 2. , 0. , 2. , 0. ],

[ 0. , 1.41421356, 2. , 1.41421356, 0. ],

[ 0. , 0. , 0. , 0. , 0. ]])

See the documentation of generic_laplace for examples of using the extra_arguments and extra_keywords

arguments.

The sobel and prewitt functions ﬁt the required signature and can therefore directly be used with

generic_gradient_magnitude. The following function implements the gradient magnitude using Gaussian

derivatives:

The function gaussian_gradient_magnitude calculates the gradient magnitude using

gaussian_filter to calculate the ﬁrst derivatives. The standard-deviations of the Gaussian ﬁlter

along each axis are passed through the parameter sigma as a sequence or numbers. If sigma is not a sequence

but a single number, the standard deviation of the ﬁlter is equal along all directions.

Generic ﬁlter functions

To implement ﬁlter functions, generic functions can be used that accept a callable object that implements the ﬁltering

operation. The iteration over the input and output arrays is handled by these generic functions, along with such

details as the implementation of the boundary conditions. Only a callable object implementing a callback function

that does the actual ﬁltering work must be provided. The callback function can also be written in C and passed using

aPyCObject (see Extending ndimage in C for more information).

The generic_filter1d function implements a generic one-dimensional ﬁlter function, where the actual

ﬁltering operation must be supplied as a python function (or other callable object). The generic_filter1d

function iterates over the lines of an array and calls function at each line. The arguments that are passed to

function are one-dimensional arrays of the tFloat64 type. The ﬁrst contains the values of the current line.

It is extended at the beginning end the end, according to the ﬁlter_size and origin arguments. The second array

should be modiﬁed in-place to provide the output values of the line. For example consider a correlation along

one dimension:

>>> a=arange(12).reshape(3,4)

>>> correlate1d(a, [1,2,3])

array([[ 3, 8, 14, 17],

[27, 32, 38, 41],

[51, 56, 62, 65]])

The same operation can be implemented using generic_filter1d as follows:

>>> def fnc(iline, oline):

... oline[...]=iline[:-2]+2*iline[1:-1]+3*iline[2:]

...

>>> generic_filter1d(a, fnc, 3)

array([[ 3, 8, 14, 17],

1.14. Multidimensional image processing (scipy.ndimage) 101

SciPy Reference Guide, Release 0.13.0

[27, 32, 38, 41],

[51, 56, 62, 65]])

Here the origin of the kernel was (by default) assumed to be in the middle of the ﬁlter of length 3. Therefore,

each input line was extended by one value at the beginning and at the end, before the function was called.

Optionally extra arguments can be deﬁned and passed to the ﬁlter function. The extra_arguments and ex-

tra_keywords arguments can be used to pass a tuple of extra arguments and/or a dictionary of named arguments

that are passed to derivative at each call. For example, we can pass the parameters of our ﬁlter as an argument:

>>> def fnc(iline, oline, a, b):

... oline[...]=iline[:-2]+a*iline[1:-1]+b*iline[2:]

...

>>> generic_filter1d(a, fnc, 3, extra_arguments =(2,3))

array([[ 3, 8, 14, 17],

[27, 32, 38, 41],

[51, 56, 62, 65]])

or:

>>> generic_filter1d(a, fnc, 3, extra_keywords ={’a’:2,’b’:3})

array([[ 3, 8, 14, 17],

[27, 32, 38, 41],

[51, 56, 62, 65]])

The generic_filter function implements a generic ﬁlter function, where the actual ﬁltering operation must

be supplied as a python function (or other callable object). The generic_filter function iterates over the

array and calls function at each element. The argument of function is a one-dimensional array of the

tFloat64 type, that contains the values around the current element that are within the footprint of the ﬁlter.

The function should return a single value that can be converted to a double precision number. For example

consider a correlation:

>>> a=arange(12).reshape(3,4)

>>> correlate(a, [[1,0], [0,3]])

array([[ 0, 3, 7, 11],

[12, 15, 19, 23],

[28, 31, 35, 39]])

The same operation can be implemented using generic_ﬁlter as follows:

>>> def fnc(buffer):

... return (buffer *array([1,3])).sum()

...

>>> generic_filter(a, fnc, footprint =[[1,0], [0,1]])

array([[ 0 3 7 11],

[12 15 19 23],

[28 31 35 39]])

Here a kernel footprint was speciﬁed that contains only two elements. Therefore the ﬁlter function receives a

buffer of length equal to two, which was multiplied with the proper weights and the result summed.

When calling generic_filter, either the sizes of a rectangular kernel or the footprint of the kernel must be

provided. The size parameter, if provided, must be a sequence of sizes or a single number in which case the size

of the ﬁlter is assumed to be equal along each axis. The footprint, if provided, must be an array that deﬁnes the

shape of the kernel by its non-zero elements.

Optionally extra arguments can be deﬁned and passed to the ﬁlter function. The extra_arguments and ex-

tra_keywords arguments can be used to pass a tuple of extra arguments and/or a dictionary of named arguments

that are passed to derivative at each call. For example, we can pass the parameters of our ﬁlter as an argument:

>>> def fnc(buffer, weights):

... weights =asarray(weights)

... return (buffer *weights).sum()

...

102 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

>>> generic_filter(a, fnc, footprint =[[1,0], [0,1]], extra_arguments =([1,3],))

array([[ 0, 3, 7, 11],

[12, 15, 19, 23],

[28, 31, 35, 39]])

or:

>>> generic_filter(a, fnc, footprint =[[1,0], [0,1]], extra_keywords={’weights’: [1,3]})

array([[ 0, 3, 7, 11],

[12, 15, 19, 23],

[28, 31, 35, 39]])

These functions iterate over the lines or elements starting at the last axis, i.e. the last index changes the fastest. This

order of iteration is guaranteed for the case that it is important to adapt the ﬁlter depending on spatial location. Here

is an example of using a class that implements the ﬁlter and keeps track of the current coordinates while iterating.

It performs the same ﬁlter operation as described above for generic_filter, but additionally prints the current

coordinates:

>>> a=arange(12).reshape(3,4)

>>>

>>> class fnc_class:

... def __init__(self, shape):

... # store the shape:

... self.shape =shape

... # initialize the coordinates:

... self.coordinates =[0]*len(shape)

...

... def filter(self,buffer):

... result =(buffer *array([1,3])).sum()

... print self.coordinates

... # calculate the next coordinates:

... axes =range(len(self.shape))

... axes.reverse()

... for jj in axes:

... if self.coordinates[jj] <self.shape[jj] -1:

... self.coordinates[jj] += 1

... break

... else:

... self.coordinates[jj] =0

... return result

...

>>> fnc =fnc_class(shape =(3,4))

>>> generic_filter(a, fnc.filter, footprint =[[1,0], [0,1]])

[0, 0]

[0, 1]

[0, 2]

[0, 3]

[1, 0]

[1, 1]

[1, 2]

[1, 3]

[2, 0]

[2, 1]

[2, 2]

[2, 3]

array([[ 0, 3, 7, 11],

[12, 15, 19, 23],

[28, 31, 35, 39]])

1.14. Multidimensional image processing (scipy.ndimage) 103

SciPy Reference Guide, Release 0.13.0

For the generic_filter1d function the same approach works, except that this function does not iterate over the

axis that is being ﬁltered. The example for generic_filter1d then becomes this:

>>> a=arange(12).reshape(3,4)

>>>

>>> class fnc1d_class:

... def __init__(self, shape, axis = -1):

... # store the filter axis:

... self.axis =axis

... # store the shape:

... self.shape =shape

... # initialize the coordinates:

... self.coordinates =[0]*len(shape)

...

... def filter(self, iline, oline):

... oline[...]=iline[:-2]+2*iline[1:-1]+3*iline[2:]

... print self.coordinates

... # calculate the next coordinates:

... axes =range(len(self.shape))

... # skip the filter axis:

... del axes[self.axis]

... axes.reverse()

... for jj in axes:

... if self.coordinates[jj] <self.shape[jj] -1:

... self.coordinates[jj] += 1

... break

... else:

... self.coordinates[jj] =0

...

>>> fnc =fnc1d_class(shape =(3,4))

>>> generic_filter1d(a, fnc.filter, 3)

[0, 0]

[1, 0]

[2, 0]

array([[ 3, 8, 14, 17],

[27, 32, 38, 41],

[51, 56, 62, 65]])

Fourier domain ﬁlters

The functions described in this section perform ﬁltering operations in the Fourier domain. Thus, the input array

of such a function should be compatible with an inverse Fourier transform function, such as the functions from the

numpy.fft module. We therefore have to deal with arrays that may be the result of a real or a complex Fourier

transform. In the case of a real Fourier transform only half of the of the symmetric complex transform is stored.

Additionally, it needs to be known what the length of the axis was that was transformed by the real fft. The functions

described here provide a parameter nthat in the case of a real transform must be equal to the length of the real

transform axis before transformation. If this parameter is less than zero, it is assumed that the input array was the

result of a complex Fourier transform. The parameter axis can be used to indicate along which axis the real transform

was executed.

The fourier_shift function multiplies the input array with the multidimensional Fourier transform of a

shift operation for the given shift. The shift parameter is a sequences of shifts for each dimension, or a single

value for all dimensions.

The fourier_gaussian function multiplies the input array with the multidimensional Fourier transform of

a Gaussian ﬁlter with given standard-deviations sigma. The sigma parameter is a sequences of values for each

dimension, or a single value for all dimensions.

104 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

The fourier_uniform function multiplies the input array with the multidimensional Fourier transform of a

uniform ﬁlter with given sizes size. The size parameter is a sequences of values for each dimension, or a single

value for all dimensions.

The fourier_ellipsoid function multiplies the input array with the multidimensional Fourier transform of

a elliptically shaped ﬁlter with given sizes size. The size parameter is a sequences of values for each dimension,

or a single value for all dimensions. This function is only implemented for dimensions 1, 2, and 3.

1.14.4 Interpolation functions

This section describes various interpolation functions that are based on B-spline theory. A good introduction to B-

splines can be found in: M. Unser, “Splines: A Perfect Fit for Signal and Image Processing,” IEEE Signal Processing

Magazine, vol. 16, no. 6, pp. 22-38, November 1999.

Spline pre-ﬁlters

Interpolation using splines of an order larger than 1 requires a pre- ﬁltering step. The interpolation functions described

in section Interpolation functions apply pre-ﬁltering by calling spline_filter, but they can be instructed not to

do this by setting the preﬁlter keyword equal to False. This is useful if more than one interpolation operation is done

on the same array. In this case it is more efﬁcient to do the pre-ﬁltering only once and use a preﬁltered array as the

input of the interpolation functions. The following two functions implement the pre-ﬁltering:

The spline_filter1d function calculates a one-dimensional spline ﬁlter along the given axis. An output

array can optionally be provided. The order of the spline must be larger then 1 and less than 6.

The spline_filter function calculates a multidimensional spline ﬁlter.

Note: The multidimensional ﬁlter is implemented as a sequence of one-dimensional spline ﬁlters. The inter-

mediate arrays are stored in the same data type as the output. Therefore, if an output with a limited precision is

requested, the results may be imprecise because intermediate results may be stored with insufﬁcient precision.

This can be prevented by specifying a output type of high precision.

Interpolation functions

Following functions all employ spline interpolation to effect some type of geometric transformation of the input array.

This requires a mapping of the output coordinates to the input coordinates, and therefore the possibility arises that input

values outside the boundaries are needed. This problem is solved in the same way as described in Filter functions for

the multidimensional ﬁlter functions. Therefore these functions all support a mode parameter that determines how the

boundaries are handled, and a cval parameter that gives a constant value in case that the ‘constant’ mode is used.

The geometric_transform function applies an arbitrary geometric transform to the input. The given map-

ping function is called at each point in the output to ﬁnd the corresponding coordinates in the input. mapping

must be a callable object that accepts a tuple of length equal to the output array rank and returns the correspond-

ing input coordinates as a tuple of length equal to the input array rank. The output shape and output type can

optionally be provided. If not given they are equal to the input shape and type.

For example:

>>> a=arange(12).reshape(4,3).astype(np.float64)

>>> def shift_func(output_coordinates):

... return (output_coordinates[0]-0.5, output_coordinates[1]-0.5)

...

>>> geometric_transform(a, shift_func)

array([[ 0. , 0. , 0. ],

[ 0. , 1.3625, 2.7375],

1.14. Multidimensional image processing (scipy.ndimage) 105

SciPy Reference Guide, Release 0.13.0

[ 0. , 4.8125, 6.1875],

[ 0. , 8.2625, 9.6375]])

Optionally extra arguments can be deﬁned and passed to the ﬁlter function. The extra_arguments and ex-

tra_keywords arguments can be used to pass a tuple of extra arguments and/or a dictionary of named arguments

that are passed to derivative at each call. For example, we can pass the shifts in our example as arguments:

>>> def shift_func(output_coordinates, s0, s1):

... return (output_coordinates[0]-s0, output_coordinates[1]-s1)

...

>>> geometric_transform(a, shift_func, extra_arguments =(0.5,0.5))

array([[ 0. , 0. , 0. ],

[ 0. , 1.3625, 2.7375],

[ 0. , 4.8125, 6.1875],

[ 0. , 8.2625, 9.6375]])

or:

>>> geometric_transform(a, shift_func, extra_keywords ={’s0’:0.5,’s1’:0.5})

array([[ 0. , 0. , 0. ],

[ 0. , 1.3625, 2.7375],

[ 0. , 4.8125, 6.1875],

[ 0. , 8.2625, 9.6375]])

Note: The mapping function can also be written in C and passed using a PyCObject. See Extending ndimage

in C for more information.

The function map_coordinates applies an arbitrary coordinate transformation using the given array of

coordinates. The shape of the output is derived from that of the coordinate array by dropping the ﬁrst axis. The

parameter coordinates is used to ﬁnd for each point in the output the corresponding coordinates in the input.

The values of coordinates along the ﬁrst axis are the coordinates in the input array at which the output value is

found. (See also the numarray coordinates function.) Since the coordinates may be non- integer coordinates,

the value of the input at these coordinates is determined by spline interpolation of the requested order. Here is

an example that interpolates a 2D array at (0.5, 0.5) and (1, 2):

>>> a=arange(12).reshape(4,3).astype(np.float64)

>>> a

array([[ 0., 1., 2.],

[ 3., 4., 5.],

[ 6., 7., 8.],

[ 9., 10., 11.]])

>>> map_coordinates(a, [[0.5,2], [0.5,1]])

array([ 1.3625 7. ])

The affine_transform function applies an afﬁne transformation to the input array. The given transforma-

tion matrix and offset are used to ﬁnd for each point in the output the corresponding coordinates in the input. The

value of the input at the calculated coordinates is determined by spline interpolation of the requested order. The

transformation matrix must be two-dimensional or can also be given as a one-dimensional sequence or array. In

the latter case, it is assumed that the matrix is diagonal. A more efﬁcient interpolation algorithm is then applied

that exploits the separability of the problem. The output shape and output type can optionally be provided. If

not given they are equal to the input shape and type.

The shift function returns a shifted version of the input, using spline interpolation of the requested order.

The zoom function returns a rescaled version of the input, using spline interpolation of the requested order.

The rotate function returns the input array rotated in the plane deﬁned by the two axes given by the parameter

axes, using spline interpolation of the requested order. The angle must be given in degrees. If reshape is true,

then the size of the output array is adapted to contain the rotated input.

106 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

1.14.5 Morphology

Binary morphology

Binary morphology (need something to put here).

The generate_binary_structure functions generates a binary structuring element for use in binary

morphology operations. The rank of the structure must be provided. The size of the structure that is returned is

equal to three in each direction. The value of each element is equal to one if the square of the Euclidean distance

from the element to the center is less or equal to connectivity. For instance, two dimensional 4-connected and

8-connected structures are generated as follows:

>>> generate_binary_structure(2,1)

array([[False, True, False],

[ True, True, True],

[False, True, False]], dtype=bool)

>>> generate_binary_structure(2,2)

array([[ True, True, True],

[ True, True, True],

[ True, True, True]], dtype=bool)

Most binary morphology functions can be expressed in terms of the basic operations erosion and dilation:

The binary_erosion function implements binary erosion of arrays of arbitrary rank with the given struc-

turing element. The origin parameter controls the placement of the structuring element as described in Filter

functions. If no structuring element is provided, an element with connectivity equal to one is generated using

generate_binary_structure. The border_value parameter gives the value of the array outside bound-

aries. The erosion is repeated iterations times. If iterations is less than one, the erosion is repeated until the result

does not change anymore. If a mask array is given, only those elements with a true value at the corresponding

mask element are modiﬁed at each iteration.

The binary_dilation function implements binary dilation of arrays of arbitrary rank with the given struc-

turing element. The origin parameter controls the placement of the structuring element as described in Filter

functions. If no structuring element is provided, an element with connectivity equal to one is generated using

generate_binary_structure. The border_value parameter gives the value of the array outside bound-

aries. The dilation is repeated iterations times. If iterations is less than one, the dilation is repeated until the

result does not change anymore. If a mask array is given, only those elements with a true value at the corre-

sponding mask element are modiﬁed at each iteration.

Here is an example of using binary_dilation to ﬁnd all elements that touch the border, by repeatedly

dilating an empty array from the border using the data array as the mask:

>>> struct =array([[0,1,0], [1,1,1], [0,1,0]])

>>> a=array([[1,0,0,0,0], [1,1,0,1,0], [0,0,1,1,0], [0,0,0,0,0]])

>>> a

array([[1, 0, 0, 0, 0],

[1, 1, 0, 1, 0],

[0, 0, 1, 1, 0],

[0, 0, 0, 0, 0]])

>>> binary_dilation(zeros(a.shape), struct, -1, a, border_value=1)

array([[ True, False, False, False, False],

[ True, True, False, False, False],

[False, False, False, False, False],

[False, False, False, False, False]], dtype=bool)

The binary_erosion and binary_dilation functions both have an iterations parameter which allows the

erosion or dilation to be repeated a number of times. Repeating an erosion or a dilation with a given structure ntimes

is equivalent to an erosion or a dilation with a structure that is n-1 times dilated with itself. A function is provided that

allows the calculation of a structure that is dilated a number of times with itself:

1.14. Multidimensional image processing (scipy.ndimage) 107

SciPy Reference Guide, Release 0.13.0

The iterate_structure function returns a structure by dilation of the input structure iteration - 1 times

with itself. For instance:

>>> struct =generate_binary_structure(2,1)

>>> struct

array([[False, True, False],

[ True, True, True],

[False, True, False]], dtype=bool)

>>> iterate_structure(struct, 2)

array([[False, False, True, False, False],

[False, True, True, True, False],

[ True, True, True, True, True],

[False, True, True, True, False],

[False, False, True, False, False]], dtype=bool)

If the origin of the original structure is equal to 0, then it is also equal to 0 for the iterated structure. If not,

the origin must also be adapted if the equivalent of the iterations erosions or dilations must be achieved with

the iterated structure. The adapted origin is simply obtained by multiplying with the number of iterations. For

convenience the iterate_structure also returns the adapted origin if the origin parameter is not None:

>>> iterate_structure(struct, 2,-1)

(array([[False, False, True, False, False],

[False, True, True, True, False],

[ True, True, True, True, True],

[False, True, True, True, False],

[False, False, True, False, False]], dtype=bool), [-2, -2])

Other morphology operations can be deﬁned in terms of erosion and d dilation. Following functions provide a few of

these operations for convenience:

The binary_opening function implements binary opening of arrays of arbitrary rank with the given struc-

turing element. Binary opening is equivalent to a binary erosion followed by a binary dilation with the same

structuring element. The origin parameter controls the placement of the structuring element as described in Fil-

ter functions. If no structuring element is provided, an element with connectivity equal to one is generated using

generate_binary_structure. The iterations parameter gives the number of erosions that is performed

followed by the same number of dilations.

The binary_closing function implements binary closing of arrays of arbitrary rank with the given struc-

turing element. Binary closing is equivalent to a binary dilation followed by a binary erosion with the same

structuring element. The origin parameter controls the placement of the structuring element as described in Fil-

ter functions. If no structuring element is provided, an element with connectivity equal to one is generated using

generate_binary_structure. The iterations parameter gives the number of dilations that is performed

followed by the same number of erosions.

The binary_fill_holes function is used to close holes in objects in a binary image, where the structure

deﬁnes the connectivity of the holes. The origin parameter controls the placement of the structuring element as

described in Filter functions. If no structuring element is provided, an element with connectivity equal to one is

generated using generate_binary_structure.

The binary_hit_or_miss function implements a binary hit-or-miss transform of arrays of arbitrary rank

with the given structuring elements. The hit-or-miss transform is calculated by erosion of the input with

the ﬁrst structure, erosion of the logical not of the input with the second structure, followed by the logi-

cal and of these two erosions. The origin parameters control the placement of the structuring elements as

described in Filter functions. If origin2 equals None it is set equal to the origin1 parameter. If the ﬁrst

structuring element is not provided, a structuring element with connectivity equal to one is generated using

generate_binary_structure, if structure2 is not provided, it is set equal to the logical not of struc-

ture1.

108 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

Grey-scale morphology

Grey-scale morphology operations are the equivalents of binary morphology operations that operate on arrays with

arbitrary values. Below we describe the grey-scale equivalents of erosion, dilation, opening and closing. These

operations are implemented in a similar fashion as the ﬁlters described in Filter functions, and we refer to this section

for the description of ﬁlter kernels and footprints, and the handling of array borders. The grey-scale morphology

operations optionally take a structure parameter that gives the values of the structuring element. If this parameter

is not given the structuring element is assumed to be ﬂat with a value equal to zero. The shape of the structure

can optionally be deﬁned by the footprint parameter. If this parameter is not given, the structure is assumed to be

rectangular, with sizes equal to the dimensions of the structure array, or by the size parameter if structure is not given.

The size parameter is only used if both structure and footprint are not given, in which case the structuring element

is assumed to be rectangular and ﬂat with the dimensions given by size. The size parameter, if provided, must be a

sequence of sizes or a single number in which case the size of the ﬁlter is assumed to be equal along each axis. The

footprint parameter, if provided, must be an array that deﬁnes the shape of the kernel by its non-zero elements.

Similar to binary erosion and dilation there are operations for grey-scale erosion and dilation:

The grey_erosion function calculates a multidimensional grey- scale erosion.

The grey_dilation function calculates a multidimensional grey- scale dilation.

Grey-scale opening and closing operations can be deﬁned similar to their binary counterparts:

The grey_opening function implements grey-scale opening of arrays of arbitrary rank. Grey-scale opening

is equivalent to a grey-scale erosion followed by a grey-scale dilation.

The grey_closing function implements grey-scale closing of arrays of arbitrary rank. Grey-scale opening

is equivalent to a grey-scale dilation followed by a grey-scale erosion.

The morphological_gradient function implements a grey-scale morphological gradient of arrays of

arbitrary rank. The grey-scale morphological gradient is equal to the difference of a grey-scale dilation and a

grey-scale erosion.

The morphological_laplace function implements a grey-scale morphological laplace of arrays of arbi-

trary rank. The grey-scale morphological laplace is equal to the sum of a grey-scale dilation and a grey-scale

erosion minus twice the input.

The white_tophat function implements a white top-hat ﬁlter of arrays of arbitrary rank. The white top-hat

is equal to the difference of the input and a grey-scale opening.

The black_tophat function implements a black top-hat ﬁlter of arrays of arbitrary rank. The black top-hat

is equal to the difference of the a grey-scale closing and the input.

1.14.6 Distance transforms

Distance transforms are used to calculate the minimum distance from each element of an object to the background.

The following functions implement distance transforms for three different distance metrics: Euclidean, City Block,

and Chessboard distances.

The function distance_transform_cdt uses a chamfer type algorithm to calculate the distance trans-

form of the input, by replacing each object element (deﬁned by values larger than zero) with the shortest dis-

tance to the background (all non-object elements). The structure determines the type of chamfering that is

done. If the structure is equal to ‘cityblock’ a structure is generated using generate_binary_structure

with a squared distance equal to 1. If the structure is equal to ‘chessboard’, a structure is generated using

generate_binary_structure with a squared distance equal to the rank of the array. These choices cor-

respond to the common interpretations of the cityblock and the chessboard distancemetrics in two dimensions.

In addition to the distance transform, the feature transform can be calculated. In this case the index of the closest

background element is returned along the ﬁrst axis of the result. The return_distances, and return_indices ﬂags

can be used to indicate if the distance transform, the feature transform, or both must be returned.

The distances and indices arguments can be used to give optional output arrays that must be of the correct size

and type (both Int32).

1.14. Multidimensional image processing (scipy.ndimage) 109

SciPy Reference Guide, Release 0.13.0

The basics of the algorithm used to implement this function is described in: G. Borgefors, “Distance transfor-

mations in arbitrary dimensions.”, Computer Vision, Graphics, and Image Processing, 27:321-345, 1984.

The function distance_transform_edt calculates the exact euclidean distance transform of the input, by

replacing each object element (deﬁned by values larger than zero) with the shortest euclidean distance to the

background (all non-object elements).

In addition to the distance transform, the feature transform can be calculated. In this case the index of the closest

background element is returned along the ﬁrst axis of the result. The return_distances, and return_indices ﬂags

can be used to indicate if the distance transform, the feature transform, or both must be returned.

Optionally the sampling along each axis can be given by the sampling parameter which should be a sequence of

length equal to the input rank, or a single number in which the sampling is assumed to be equal along all axes.

The distances and indices arguments can be used to give optional output arrays that must be of the correct size

and type (Float64 and Int32).

The algorithm used to implement this function is described in: C. R. Maurer, Jr., R. Qi, and V. Raghavan, “A lin-

ear time algorithm for computing exact euclidean distance transforms of binary images in arbitrary dimensions.

IEEE Trans. PAMI 25, 265-270, 2003.

The function distance_transform_bf uses a brute-force algorithm to calculate the distance transform of

the input, by replacing each object element (deﬁned by values larger than zero) with the shortest distance to the

background (all non-object elements). The metric must be one of “euclidean”, “cityblock”, or “chessboard”.

In addition to the distance transform, the feature transform can be calculated. In this case the index of the closest

background element is returned along the ﬁrst axis of the result. The return_distances, and return_indices ﬂags

can be used to indicate if the distance transform, the feature transform, or both must be returned.

Optionally the sampling along each axis can be given by the sampling parameter which should be a sequence of

length equal to the input rank, or a single number in which the sampling is assumed to be equal along all axes.

This parameter is only used in the case of the euclidean distance transform.

The distances and indices arguments can be used to give optional output arrays that must be of the correct size

and type (Float64 and Int32).

Note: This function uses a slow brute-force algorithm, the function distance_transform_cdt

can be used to more efﬁciently calculate cityblock and chessboard distance transforms. The function

distance_transform_edt can be used to more efﬁciently calculate the exact euclidean distance trans-

form.

1.14.7 Segmentation and labeling

Segmentation is the process of separating objects of interest from the background. The most simple approach is

probably intensity thresholding, which is easily done with numpy functions:

>>> a=array([[1,2,2,1,1,0],

... [0,2,3,1,2,0],

... [1,1,1,3,3,2],

... [1,1,1,1,2,1]])

>>> where(a >1,1,0)

array([[0, 1, 1, 0, 0, 0],

[0, 1, 1, 0, 1, 0],

[0, 0, 0, 1, 1, 1],

[0, 0, 0, 0, 1, 0]])

The result is a binary image, in which the individual objects still need to be identiﬁed and labeled. The function

label generates an array where each object is assigned a unique number:

The label function generates an array where the objects in the input are labeled with an integer index. It returns

a tuple consisting of the array of object labels and the number of objects found, unless the output parameter is

given, in which case only the number of objects is returned. The connectivity of the objects is deﬁned by a

structuring element. For instance, in two dimensions using a four-connected structuring element gives:

110 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

>>> a=array([[0,1,1,0,0,0],[0,1,1,0,1,0],[0,0,0,1,1,1],[0,0,0,0,1,0]])

>>> s=[[0,1,0], [1,1,1], [0,1,0]]

>>> label(a, s)

(array([[0, 1, 1, 0, 0, 0],

[0, 1, 1, 0, 2, 0],

[0, 0, 0, 2, 2, 2],

[0, 0, 0, 0, 2, 0]]), 2)

These two objects are not connected because there is no way in which we can place the structuring element such

that it overlaps with both objects. However, an 8-connected structuring element results in only a single object:

>>> a=array([[0,1,1,0,0,0],[0,1,1,0,1,0],[0,0,0,1,1,1],[0,0,0,0,1,0]])

>>> s=[[1,1,1], [1,1,1], [1,1,1]]

>>> label(a, s)[0]

array([[0, 1, 1, 0, 0, 0],

[0, 1, 1, 0, 1, 0],

[0, 0, 0, 1, 1, 1],

[0, 0, 0, 0, 1, 0]])

If no structuring element is provided, one is generated by calling generate_binary_structure (see

Binary morphology) using a connectivity of one (which in 2D is the 4-connected structure of the ﬁrst example).

The input can be of any type, any value not equal to zero is taken to be part of an object. This is useful if you

need to ‘re-label’ an array of object indices, for instance after removing unwanted objects. Just apply the label

function again to the index array. For instance:

>>> l,n=label([1,0,1,0,1])

>>> l

array([1 0 2 0 3])

>>> l=where(l != 2,l,0)

>>> l

array([1 0 0 0 3])

>>> label(l)[0]

array([1 0 0 0 2])

Note: The structuring element used by label is assumed to be symmetric.

There is a large number of other approaches for segmentation, for instance from an estimation of the borders of

the objects that can be obtained for instance by derivative ﬁlters. One such an approach is watershed segmentation.

The function watershed_ift generates an array where each object is assigned a unique label, from an array that

localizes the object borders, generated for instance by a gradient magnitude ﬁlter. It uses an array containing initial

markers for the objects:

The watershed_ift function applies a watershed from markers algorithm, using an Iterative Forest Trans-

form, as described in: P. Felkel, R. Wegenkittl, and M. Bruckschwaiger, “Implementation and Complexity of the

Watershed-from-Markers Algorithm Computed as a Minimal Cost Forest.”, Eurographics 2001, pp. C:26-35.

The inputs of this function are the array to which the transform is applied, and an array of markers that designate

the objects by a unique label, where any non-zero value is a marker. For instance:

>>> input =array([[0,0,0,0,0,0,0],

... [0,1,1,1,1,1,0],

... [0,1,0,0,0,1,0],

... [0,1,1,1,1,1,0],

... [0,0,0,0,0,0,0]], np.uint8)

>>> markers =array([[1,0,0,0,0,0,0],

... [0,0,0,0,0,0,0],

... [0,0,0,2,0,0,0],

1.14. Multidimensional image processing (scipy.ndimage) 111

SciPy Reference Guide, Release 0.13.0

... [0,0,0,0,0,0,0],

... [0,0,0,0,0,0,0]], np.int8)

>>> watershed_ift(input, markers)

array([[1, 1, 1, 1, 1, 1, 1],

[1, 1, 2, 2, 2, 1, 1],

[1, 2, 2, 2, 2, 2, 1],

[1, 1, 2, 2, 2, 1, 1],

[1, 1, 1, 1, 1, 1, 1]], dtype=int8)

Here two markers were used to designate an object (marker = 2) and the background (marker = 1). The order

in which these are processed is arbitrary: moving the marker for the background to the lower right corner of the

array yields a different result:

>>> markers =array([[0,0,0,0,0,0,0],

... [0,0,0,0,0,0,0],

... [0,0,0,2,0,0,0],

... [0,0,0,0,0,0,0],

... [0,0,0,0,0,0,1]], np.int8)

>>> watershed_ift(input, markers)

array([[1, 1, 1, 1, 1, 1, 1],

[1, 1, 1, 1, 1, 1, 1],

[1, 1, 2, 2, 2, 1, 1],

[1, 1, 1, 1, 1, 1, 1],

[1, 1, 1, 1, 1, 1, 1]], dtype=int8)

The result is that the object (marker = 2) is smaller because the second marker was processed earlier. This

may not be the desired effect if the ﬁrst marker was supposed to designate a background object. Therefore

watershed_ift treats markers with a negative value explicitly as background markers and processes them

after the normal markers. For instance, replacing the ﬁrst marker by a negative marker gives a result similar to

the ﬁrst example:

>>> markers =array([[0,0,0,0,0,0,0],

... [0,0,0,0,0,0,0],

... [0,0,0,2,0,0,0],

... [0,0,0,0,0,0,0],

... [0,0,0,0,0,0,-1]], np.int8)

>>> watershed_ift(input, markers)

array([[-1, -1, -1, -1, -1, -1, -1],

[-1, -1, 2, 2, 2, -1, -1],

[-1, 2, 2, 2, 2, 2, -1],

[-1, -1, 2, 2, 2, -1, -1],

[-1, -1, -1, -1, -1, -1, -1]], dtype=int8)

The connectivity of the objects is deﬁned by a structuring element. If no structuring element is provided, one

is generated by calling generate_binary_structure (see Binary morphology) using a connectivity of

one (which in 2D is a 4-connected structure.) For example, using an 8-connected structure with the last example

yields a different object:

112 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

>>> watershed_ift(input, markers,

... structure =[[1,1,1], [1,1,1], [1,1,1]])

array([[-1, -1, -1, -1, -1, -1, -1],

[-1, 2, 2, 2, 2, 2, -1],

[-1, -1, -1, -1, -1, -1, -1]], dtype=int8)

Note: The implementation of watershed_ift limits the data types of the input to UInt8 and UInt16.

1.14.8 Object measurements

Given an array of labeled objects, the properties of the individual objects can be measured. The find_objects

function can be used to generate a list of slices that for each object, give the smallest sub-array that fully contains the

object:

The find_objects function ﬁnds all objects in a labeled array and returns a list of slices that correspond to

the smallest regions in the array that contains the object. For instance:

>>> a=array([[0,1,1,0,0,0],[0,1,1,0,1,0],[0,0,0,1,1,1],[0,0,0,0,1,0]])

>>> l,n=label(a)

>>> f=find_objects(l)

>>> a[f[0]]

array([[1 1],

[1 1]])

>>> a[f[1]]

array([[0, 1, 0],

[1, 1, 1],

[0, 1, 0]])

find_objects returns slices for all objects, unless the max_label parameter is larger then zero, in which case

only the ﬁrst max_label objects are returned. If an index is missing in the label array, None is return instead of

a slice. For example:

>>> find_objects([1,0,3,4], max_label =3)

[(slice(0, 1, None),), None, (slice(2, 3, None),)]

The list of slices generated by find_objects is useful to ﬁnd the position and dimensions of the objects in the

array, but can also be used to perform measurements on the individual objects. Say we want to ﬁnd the sum of the

intensities of an object in image:

>>> image =arange(4*6).reshape(4,6)

>>> mask =array([[0,1,1,0,0,0],[0,1,1,0,1,0],[0,0,0,1,1,1],[0,0,0,0,1,0]])

>>> labels =label(mask)[0]

>>> slices =find_objects(labels)

Then we can calculate the sum of the elements in the second object:

>>> where(labels[slices[1]] == 2, image[slices[1]], 0).sum()

That is however not particularly efﬁcient, and may also be more complicated for other types of measurements. There-

fore a few measurements functions are deﬁned that accept the array of object labels and the index of the object to be

measured. For instance calculating the sum of the intensities can be done by:

1.14. Multidimensional image processing (scipy.ndimage) 113

SciPy Reference Guide, Release 0.13.0

>>> sum(image, labels, 2)

For large arrays and small objects it is more efﬁcient to call the measurement functions after slicing the array:

>>> sum(image[slices[1]], labels[slices[1]], 2)

Alternatively, we can do the measurements for a number of labels with a single function call, returning a list of results.

For instance, to measure the sum of the values of the background and the second object in our example we give a list

of labels:

>>> sum(image, labels, [0,2])

array([178.0, 80.0])

The measurement functions described below all support the index parameter to indicate which object(s) should be

measured. The default value of index is None. This indicates that all elements where the label is larger than zero

should be treated as a single object and measured. Thus, in this case the labels array is treated as a mask deﬁned by

the elements that are larger than zero. If index is a number or a sequence of numbers it gives the labels of the objects

that are measured. If index is a sequence, a list of the results is returned. Functions that return more than one result,

return their result as a tuple if index is a single number, or as a tuple of lists, if index is a sequence.

The sum function calculates the sum of the elements of the object with label(s) given by index, using the labels

array for the object labels. If index is None, all elements with a non-zero label value are treated as a single

object. If label is None, all elements of input are used in the calculation.

The mean function calculates the mean of the elements of the object with label(s) given by index, using the

labels array for the object labels. If index is None, all elements with a non-zero label value are treated as a

single object. If label is None, all elements of input are used in the calculation.

The variance function calculates the variance of the elements of the object with label(s) given by index, using

the labels array for the object labels. If index is None, all elements with a non-zero label value are treated as a

single object. If label is None, all elements of input are used in the calculation.

The standard_deviation function calculates the standard deviation of the elements of the object with

label(s) given by index, using the labels array for the object labels. If index is None, all elements with a non-

zero label value are treated as a single object. If label is None, all elements of input are used in the calculation.

The minimum function calculates the minimum of the elements of the object with label(s) given by index, using

the labels array for the object labels. If index is None, all elements with a non-zero label value are treated as a

single object. If label is None, all elements of input are used in the calculation.

The maximum function calculates the maximum of the elements of the object with label(s) given by index, using

the labels array for the object labels. If index is None, all elements with a non-zero label value are treated as a

single object. If label is None, all elements of input are used in the calculation.

The minimum_position function calculates the position of the minimum of the elements of the object with

label(s) given by index, using the labels array for the object labels. If index is None, all elements with a non-zero

label value are treated as a single object. If label is None, all elements of input are used in the calculation.

The maximum_position function calculates the position of the maximum of the elements of the object with

label(s) given by index, using the labels array for the object labels. If index is None, all elements with a non-zero

label value are treated as a single object. If label is None, all elements of input are used in the calculation.

The extrema function calculates the minimum, the maximum, and their positions, of the elements of the

object with label(s) given by index, using the labels array for the object labels. If index is None, all elements

with a non-zero label value are treated as a single object. If label is None, all elements of input are used in

the calculation. The result is a tuple giving the minimum, the maximum, the position of the minimum and the

postition of the maximum. The result is the same as a tuple formed by the results of the functions minimum,

maximum,minimum_position, and maximum_position that are described above.

The center_of_mass function calculates the center of mass of the of the object with label(s) given by index,

using the labels array for the object labels. If index is None, all elements with a non-zero label value are treated

as a single object. If label is None, all elements of input are used in the calculation.

114 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

The histogram function calculates a histogram of the of the object with label(s) given by index, using the

labels array for the object labels. If index is None, all elements with a non-zero label value are treated as a

single object. If label is None, all elements of input are used in the calculation. Histograms are deﬁned by their

minimum (min), maximum (max) and the number of bins (bins). They are returned as one-dimensional arrays

of type Int32.

1.14.9 Extending ndimage in C

A few functions in the scipy.ndimage take a call-back argument. This can be a python function, but also a

PyCObject containing a pointer to a C function. To use this feature, you must write your own C extension that

deﬁnes the function, and deﬁne a Python function that returns a PyCObject containing a pointer to this function.

An example of a function that supports this is geometric_transform (see Interpolation functions). You can pass

it a python callable object that deﬁnes a mapping from all output coordinates to corresponding coordinates in the input

array. This mapping function can also be a C function, which generally will be much more efﬁcient, since the overhead

of calling a python function at each element is avoided.

For example to implement a simple shift function we deﬁne the following function:

static int

_shift_function(int *output_coordinates, double*input_coordinates,

int output_rank, int input_rank, void *callback_data)

{

int ii;

/*get the shift from the callback data pointer: */

double shift =*(double*)callback_data;

/*calculate the coordinates: */

for(ii =0;ii<irank; ii++)

icoor[ii] =ocoor[ii] -shift;

/*return OK status: */

return 1;

}

This function is called at every element of the output array, passing the current coordinates in the output_coordinates

array. On return, the input_coordinates array must contain the coordinates at which the input is interpolated. The ranks

of the input and output array are passed through output_rank and input_rank. The value of the shift is passed through

the callback_data argument, which is a pointer to void. The function returns an error status, in this case always 1,

since no error can occur.

A pointer to this function and a pointer to the shift value must be passed to geometric_transform. Both are

passed by a single PyCObject which is created by the following python extension function:

static PyObject *

py_shift_function(PyObject *obj, PyObject *args)

{

double shift =0.0;

if (!PyArg_ParseTuple(args, "d",&shift)) {

PyErr_SetString(PyExc_RuntimeError, "invalid parameters");

return NULL;

}else {

/*assign the shift to a dynamically allocated location: */

double *cdata =(double*)malloc(sizeof(double));

*cdata =shift;

/*wrap function and callback_data in a CObject: */

return PyCObject_FromVoidPtrAndDesc(_shift_function, cdata,

_destructor);

}

1.14. Multidimensional image processing (scipy.ndimage) 115

SciPy Reference Guide, Release 0.13.0

The value of the shift is obtained and then assigned to a dynamically allocated memory location. Both this data pointer

and the function pointer are then wrapped in a PyCObject, which is returned. Additionally, a pointer to a destructor

function is given, that will free the memory we allocated for the shift value when the PyCObject is destroyed. This

destructor is very simple:

static void

_destructor(void*cobject, void *cdata)

{

if (cdata)

free(cdata);

}

To use these functions, an extension module is built:

static PyMethodDef methods[] ={

{"shift_function", (PyCFunction)py_shift_function, METH_VARARGS, ""},

{NULL,NULL,0,NULL}

};

void

initexample(void)

{

Py_InitModule("example", methods);

}

This extension can then be used in Python, for example:

>>> import example

>>> array =arange(12).reshape=(4,3).astype(np.float64)

>>> fnc =example.shift_function(0.5)

>>> geometric_transform(array, fnc)

array([[ 0. 0. 0. ],

[ 0. 1.3625 2.7375],

[ 0. 4.8125 6.1875],

[ 0. 8.2625 9.6375]])

C callback functions for use with ndimage functions must all be written according to this scheme. The next section

lists the ndimage functions that acccept a C callback function and gives the prototype of the callback function.

1.14.10 Functions that support C callback functions

The ndimage functions that support C callback functions are described here. Obviously, the prototype of the func-

tion that is provided to these functions must match exactly that what they expect. Therefore we give here the pro-

totypes of the callback functions. All these callback functions accept a void callback_data pointer that must be

wrapped in a PyCObject using the Python PyCObject_FromVoidPtrAndDesc function, which can also ac-

cept a pointer to a destructor function to free any memory allocated for callback_data. If callback_data is not needed,

PyCObject_FromVoidPtr may be used instead. The callback functions must return an integer error status that is

equal to zero if something went wrong, or 1 otherwise. If an error occurs, you should normally set the python error

status with an informative message before returning, otherwise, a default error message is set by the calling function.

The function generic_filter (see Generic ﬁlter functions) accepts a callback function with the following proto-

type:

The calling function iterates over the elements of the input and output arrays, calling the callback function at

each element. The elements within the footprint of the ﬁlter at the current element are passed through the buffer

parameter, and the number of elements within the footprint through ﬁlter_size. The calculated valued should be

returned in the return_value argument.

116 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

The function generic_filter1d (see Generic ﬁlter functions) accepts a callback function with the following

prototype:

The calling function iterates over the lines of the input and output arrays, calling the callback function at each

line. The current line is extended according to the border conditions set by the calling function, and the result is

copied into the array that is passed through the input_line array. The length of the input line (after extension) is

passed through input_length. The callback function should apply the 1D ﬁlter and store the result in the array

passed through output_line. The length of the output line is passed through output_length.

The function geometric_transform (see Interpolation functions) expects a function with the following proto-

type:

The calling function iterates over the elements of the output array, calling the callback function at each element.

The coordinates of the current output element are passed through output_coordinates. The callback function

must return the coordinates at which the input must be interpolated in input_coordinates. The rank of the input

and output arrays are given by input_rank and output_rank respectively.

1.15 File IO (scipy.io)

See Also

numpy-reference.routines.io (in numpy)

1.15.1 MATLAB ﬁles

loadmat(ﬁle_name[, mdict, appendmat]) Load MATLAB ﬁle

savemat(ﬁle_name, mdict[, appendmat, ...]) Save a dictionary of names and arrays into a MATLAB-style .mat ﬁle.

whosmat(ﬁle_name[, appendmat]) List variables inside a MATLAB ﬁle

The basic functions

We’ll start by importing scipy.io and calling it sio for convenience:

>>> import scipy.io as sio

If you are using IPython, try tab completing on sio. Among the many options, you will ﬁnd:

sio.loadmat

sio.savemat

sio.whosmat

These are the high-level functions you will most likely use when working with MATLAB ﬁles. You’ll also ﬁnd:

sio.matlab

This is the package from which loadmat,savemat and whosmat are imported. Within sio.matlab, you will

ﬁnd the mio module This module contains the machinery that loadmat and savemat use. From time to time you

may ﬁnd yourself re-using this machinery.

How do I start?

You may have a .mat ﬁle that you want to read into Scipy. Or, you want to pass some variables from Scipy / Numpy

into MATLAB.

1.15. File IO (scipy.io) 117

SciPy Reference Guide, Release 0.13.0

To save us using a MATLAB license, let’s start in Octave. Octave has MATLAB-compatible save and load functions.

Start Octave (octave at the command line for me):

octave:1>a=1:12

a =

123456789101112

octave:2>a=reshape(a, [134])

a =

ans(:,:,1) =

123

ans(:,:,2) =

456

ans(:,:,3) =

789

ans(:,:,4) =

10 11 12

octave:3>save -6octave_a.mat a % MATLAB 6 compatible

octave:4>ls octave_a.mat

octave_a.mat

Now, to Python:

>>> mat_contents =sio.loadmat(’octave_a.mat’)

>>> mat_contents

{’a’: array([[[ 1., 4., 7., 10.],

[ 2., 5., 8., 11.],

[ 3., 6., 9., 12.]]]),

’__version__’: ’1.0’,

’__header__’: ’MATLAB 5.0 MAT-file, written by

Octave 3.6.3, 2013-02-17 21:02:11 UTC’,

’__globals__’: []}

>>> oct_a =mat_contents[’a’]

>>> oct_a

array([[[ 1., 4., 7., 10.],

[ 2., 5., 8., 11.],

[ 3., 6., 9., 12.]]])

>>> oct_a.shape

(1, 3, 4)

Now let’s try the other way round:

>>> import numpy as np

>>> vect =np.arange(10)

>>> vect.shape

(10,)

>>> sio.savemat(’np_vector.mat’, {’vect’:vect})

Then back to Octave:

118 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

octave:8>load np_vector.mat

octave:9>vect

vect =

0123456789

octave:10>size(vect)

ans =

1 10

If you want to inspect the contents of a MATLAB ﬁle without reading the data into memory, use the whosmat

command:

>>> sio.whosmat(’octave_a.mat’)

[(’a’, (1, 3, 4), ’double’)]

whosmat returns a list of tuples, one for each array (or other object) in the ﬁle. Each tuple contains the name, shape

and data type of the array.

MATLAB structs

MATLAB structs are a little bit like Python dicts, except the ﬁeld names must be strings. Any MATLAB object can be

a value of a ﬁeld. As for all objects in MATLAB, structs are in fact arrays of structs, where a single struct is an array

of shape (1, 1).

octave:11>my_struct = struct(’field1’,1,’field2’,2)

my_struct =

{

field1 = 1

field2 = 2

}

octave:12>save -6octave_struct.mat my_struct

We can load this in Python:

>>> mat_contents =sio.loadmat(’octave_struct.mat’)

>>> mat_contents

{’my_struct’: array([[([[1.0]], [[2.0]])]],

dtype=[(’field1’, ’O’), (’field2’, ’O’)]), ’__version__’: ’1.0’, ’__header__’: ’MATLAB 5.0 MAT-file, written by Octave 3.6.3, 2013-02-17 21:23:14 UTC’, ’__globals__’: []}

>>> oct_struct =mat_contents[’my_struct’]

>>> oct_struct.shape

(1, 1)

>>> val =oct_struct[0,0]

>>> val

([[1.0]], [[2.0]])

>>> val[’field1’]

array([[ 1.]])

>>> val[’field2’]

array([[ 2.]])

>>> val.dtype

dtype([(’field1’, ’O’), (’field2’, ’O’)])

In versions of Scipy from 0.12.0, MATLAB structs come back as numpy structured arrays, with ﬁelds named for the

struct ﬁelds. You can see the ﬁeld names in the dtype output above. Note also:

1.15. File IO (scipy.io) 119

SciPy Reference Guide, Release 0.13.0

>>> val =oct_struct[0,0]

and:

octave:13>size(my_struct)

ans =

1 1

So, in MATLAB, the struct array must be at least 2D, and we replicate that when we read into Scipy. If you want all

length 1 dimensions squeezed out, try this:

>>> mat_contents =sio.loadmat(’octave_struct.mat’, squeeze_me=True)

>>> oct_struct =mat_contents[’my_struct’]

>>> oct_struct.shape

()

Sometimes, it’s more convenient to load the MATLAB structs as python objects rather than numpy structured ar-

rays - it can make the access syntax in python a bit more similar to that in MATLAB. In order to do this, use the

struct_as_record=False parameter setting to loadmat.

>>> mat_contents =sio.loadmat(’octave_struct.mat’, struct_as_record=False)

>>> oct_struct =mat_contents[’my_struct’]

>>> oct_struct[0,0].field1

array([[ 1.]])

struct_as_record=False works nicely with squeeze_me:

>>> mat_contents =sio.loadmat(’octave_struct.mat’, struct_as_record=False, squeeze_me=True)

>>> oct_struct =mat_contents[’my_struct’]

>>> oct_struct.shape # but no - it’s a scalar

Traceback (most recent call last):

File "<stdin>", line 1, in <module>

AttributeError: ’mat_struct’ object has no attribute ’shape’

>>> type(oct_struct)

>>> oct_struct.field1

1.0

Saving struct arrays can be done in various ways. One simple method is to use dicts:

>>> a_dict ={’field1’:0.5,’field2’:’a string’}

>>> sio.savemat(’saved_struct.mat’, {’a_dict’: a_dict})

loaded as:

octave:21>load saved_struct

octave:22>a_dict

a_dict =

scalar structure containing the fields:

field2 = a string

field1 = 0.50000

You can also save structs back again to MATLAB (or Octave in our case) like this:

>>> dt =[(’f1’,’f8’), (’f2’,’S10’)]

>>> arr =np.zeros((2,), dtype=dt)

>>> arr

120 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

array([(0.0, ’’), (0.0, ’’)],

dtype=[(’f1’, ’<f8’), (’f2’, ’S10’)])

>>> arr[0][’f1’]=0.5

>>> arr[0][’f2’]=’python’

>>> arr[1][’f1’]=99

>>> arr[1][’f2’]=’not perl’

>>> sio.savemat(’np_struct_arr.mat’, {’arr’: arr})

MATLAB cell arrays

Cell arrays in MATLAB are rather like python lists, in the sense that the elements in the arrays can contain any type

of MATLAB object. In fact they are most similar to numpy object arrays, and that is how we load them into numpy.

octave:14>my_cells = {1, [2,3]}

my_cells =

{

[1,1] = 1

[1,2] =

2 3

}

octave:15>save -6octave_cells.mat my_cells

Back to Python:

>>> mat_contents =sio.loadmat(’octave_cells.mat’)

>>> oct_cells =mat_contents[’my_cells’]

>>> print(oct_cells.dtype)

object

>>> val =oct_cells[0,0]

>>> val

array([[ 1.]])

>>> print(val.dtype)

float64

Saving to a MATLAB cell array just involves making a numpy object array:

>>> obj_arr =np.zeros((2,), dtype=np.object)

>>> obj_arr[0]=1

>>> obj_arr[1]=’a string’

>>> obj_arr

array([1, ’a string’], dtype=object)

>>> sio.savemat(’np_cells.mat’, {’obj_arr’:obj_arr})

octave:16>load np_cells.mat

octave:17>obj_arr

obj_arr =

{

[1,1]=1

[2,1] = a string

}

1.15.2 IDL ﬁles

1.15. File IO (scipy.io) 121

SciPy Reference Guide, Release 0.13.0

readsav(ﬁle_name[, idict, python_dict, ...]) Read an IDL .sav ﬁle

1.15.3 Matrix Market ﬁles

mminfo(source) Queries the contents of the Matrix Market ﬁle ‘ﬁlename’ to

mmread(source) Reads the contents of a Matrix Market ﬁle ‘ﬁlename’ into a matrix.

mmwrite(target, a[, comment, ﬁeld, precision]) Writes the sparse or dense matrix A to a Matrix Market formatted ﬁle.

1.15.4 Wav sound ﬁles (scipy.io.wavfile)

read(ﬁlename[, mmap]) Return the sample rate (in samples/sec) and data from a WAV ﬁle

write(ﬁlename, rate, data) Write a numpy array as a WAV ﬁle

1.15.5 Arff ﬁles (scipy.io.arff)

Module to read ARFF ﬁles, which are the standard data format for WEKA.

ARFF is a text ﬁle format which support numerical, string and data values. The format can also represent missing data

and sparse data.

See the WEKA website for more details about arff format and available datasets.

Examples

>>> from scipy.io import arff

>>> from cStringIO import StringIO

>>> content ="""

... @relation foo

... @attribute width numeric

... @attribute height numeric

... @attribute color {red,green,blue,yellow,black}

... @data

... 5.0,3.25,blue

... 4.5,3.75,green

... 3.0,4.00,red

... """

>>> f=StringIO(content)

>>> data, meta =arff.loadarff(f)

>>> data

array([(5.0, 3.25, ’blue’), (4.5, 3.75, ’green’), (3.0, 4.0, ’red’)],

dtype=[(’width’, ’<f8’), (’height’, ’<f8’), (’color’, ’|S6’)])

>>> meta

Dataset: foo

width’s type is numeric

height’s type is numeric

color’s type is nominal, range is (’red’, ’green’, ’blue’, ’yellow’, ’black’)

loadarff(f) Read an arff ﬁle.

122 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

1.15.6 Netcdf (scipy.io.netcdf)

netcdf_file(ﬁlename[, mode, mmap, version]) A ﬁle object for NetCDF data.

Allows reading of NetCDF ﬁles (version of pupynere package)

1.16 Weave (scipy.weave)

1.16.1 Outline

1.16. Weave (scipy.weave) 123

SciPy Reference Guide, Release 0.13.0

Contents

•Weave (scipy.weave)

–Outline

–Introduction

–Requirements

–Installation

–Testing

*Testing Notes:

–Benchmarks

–Inline

*More with printf

*More examples

·Binary search

·Dictionary Sort

·NumPy – cast/copy/transpose

·wxPython

*Keyword Option

*Inline Arguments

*Distutils keywords

·Keyword Option Examples

·Returning Values

·The issue with locals()

·A quick look at the code

*Technical Details

*Passing Variables in/out of the C/C++ code

*Type Conversions

·NumPy Argument Conversion

·String, List, Tuple, and Dictionary Conversion

·File Conversion

·Callable, Instance, and Module Conversion

·Customizing Conversions

*The Catalog

·Function Storage

·Catalog search paths and the PYTHONCOMPILED variable

–Blitz

*Requirements

*Limitations

*NumPy efﬁciency issues: What compilation buys you

*The Tools

·Parser

·Blitz and NumPy

*Type deﬁnitions and coersion

*Cataloging Compiled Functions

*Checking Array Sizes

*Creating the Extension Module

–Extension Modules

*A Simple Example

*Fibonacci Example

–Customizing Type Conversions – Type Factories

–Things I wish weave did

124 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

1.16.2 Introduction

The scipy.weave (below just weave) package provides tools for including C/C++ code within in Python code.

This offers both another level of optimization to those who need it, and an easy way to modify and extend any supported

extension libraries such as wxPython and hopefully VTK soon. Inlining C/C++ code within Python generally results

in speed ups of 1.5x to 30x speed-up over algorithms written in pure Python (However, it is also possible to slow things

down...). Generally algorithms that require a large number of calls to the Python API don’t beneﬁt as much from the

conversion to C/C++ as algorithms that have inner loops completely convertable to C.

There are three basic ways to use weave. The weave.inline() function executes C code directly within Python,

and weave.blitz() translates Python NumPy expressions to C++ for fast execution. blitz() was the original

reason weave was built. For those interested in building extension libraries, the ext_tools module provides classes

for building extension modules within Python.

Most of weave’s functionality should work on Windows and Unix, although some of its functionality requires gcc

or a similarly modern C++ compiler that handles templates well. Up to now, most testing has been done on Windows

2000 with Microsoft’s C++ compiler (MSVC) and with gcc (mingw32 2.95.2 and 2.95.3-6). All tests also pass on

Linux (RH 7.1 with gcc 2.96), and I’ve had reports that it works on Debian also (thanks Pearu).

The inline and blitz provide new functionality to Python (although I’ve recently learned about the PyInline

project which may offer similar functionality to inline). On the other hand, tools for building Python extension

modules already exists (SWIG, SIP, pycpp, CXX, and others). As of yet, I’m not sure where weave ﬁts in this

spectrum. It is closest in ﬂavor to CXX in that it makes creating new C/C++ extension modules pretty easy. However,

if you’re wrapping a gaggle of legacy functions or classes, SWIG and friends are deﬁnitely the better choice. weave

is set up so that you can customize how Python types are converted to C types in weave. This is great for inline(),

but, for wrapping legacy code, it is more ﬂexible to specify things the other way around – that is how C types map to

Python types. This weave does not do. I guess it would be possible to build such a tool on top of weave, but with

good tools like SWIG around, I’m not sure the effort produces any new capabilities. Things like function overloading

are probably easily implemented in weave and it might be easier to mix Python/C code in function calls, but nothing

beyond this comes to mind. So, if you’re developing new extension modules or optimizing Python functions in C,

weave.ext_tools() might be the tool for you. If you’re wrapping legacy code, stick with SWIG.

The next several sections give the basics of how to use weave. We’ll discuss what’s happening under the covers in

more detail later on. Serious users will need to at least look at the type conversion section to understand how Python

variables map to C/C++ types and how to customize this behavior. One other note. If you don’t know C or C++ then

these docs are probably of very little help to you. Further, it’d be helpful if you know something about writing Python

extensions. weave does quite a bit for you, but for anything complex, you’ll need to do some conversions, reference

counting, etc.

Note: weave is actually part of the SciPy package. However, it also works ﬁne as a standalone package (you can

install from scipy/weave with python setup.py install). The examples here are given as if it is used as

a stand alone package. If you are using from within scipy, you can use from scipy import weave and the

examples will work identically.

1.16.3 Requirements

• Python

I use 2.1.1. Probably 2.0 or higher should work.

• C++ compiler

weave uses distutils to actually build extension modules, so it uses whatever compiler was originally

used to build Python. weave itself requires a C++ compiler. If you used a C++ compiler to build Python, your

probably ﬁne.

1.16. Weave (scipy.weave) 125

SciPy Reference Guide, Release 0.13.0

On Unix gcc is the preferred choice because I’ve done a little testing with it. All testing has been done with gcc,

but I expect the majority of compilers should work for inline and ext_tools. The one issue I’m not sure

about is that I’ve hard coded things so that compilations are linked with the stdc++ library. Is this standard

across Unix compilers, or is this a gcc-ism?

For blitz(), you’ll need a reasonably recent version of gcc. 2.95.2 works on windows and 2.96 looks ﬁne on

Linux. Other versions are likely to work. Its likely that KAI’s C++ compiler and maybe some others will work,

but I haven’t tried. My advice is to use gcc for now unless your willing to tinker with the code some.

On Windows, either MSVC or gcc (mingw32) should work. Again, you’ll need gcc for blitz() as the MSVC

compiler doesn’t handle templates well.

I have not tried Cygwin, so please report success if it works for you.

• NumPy

The python NumPy module is required for blitz() to work and for numpy.distutils which is used by weave.

1.16.4 Installation

There are currently two ways to get weave. First, weave is part of SciPy and installed automatically (as a sub-

package) whenever SciPy is installed. Second, since weave is useful outside of the scientiﬁc community, it has been

setup so that it can be used as a stand-alone module.

The stand-alone version can be downloaded from here. Instructions for installing should be found there as well.

setup.py ﬁle to simplify installation.

1.16.5 Testing

Once weave is installed, ﬁre up python and run its unit tests.

>>> import weave

>>> weave.test()

runs long time... spews tons of output and a few warnings

..............................................................

................................................................

..................................................

----------------------------------------------------------------------

Ran 184 tests in 158.418s

>>>

This takes a while, usually several minutes. On Unix with remote ﬁle systems, I’ve had it take 15 or so minutes. In the

end, it should run about 180 tests and spew some speed results along the way. If you get errors, they’ll be reported at

the end of the output. Please report errors that you ﬁnd. Some tests are known to fail at this point.

If you only want to test a single module of the package, you can do this by running test() for that speciﬁc module.

>>> import weave.scalar_spec

>>> weave.scalar_spec.test()

.......

----------------------------------------------------------------------

Ran 7 tests in 23.284s

126 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

Testing Notes:

• Windows 1

I’ve had some test fail on windows machines where I have msvc, gcc-2.95.2 (in c:gcc-2.95.2), and gcc-2.95.3-6

(in c:gcc) all installed. My environment has c:gcc in the path and does not have c:gcc-2.95.2 in the path. The test

process runs very smoothly until the end where several test using gcc fail with cpp0 not found by g++. If I check

os.system(‘gcc -v’) before running tests, I get gcc-2.95.3-6. If I check after running tests (and after failure), I

get gcc-2.95.2. ??huh??. The os.environ[’PATH’] still has c:gcc ﬁrst in it and is not corrupted (msvc/distutils

messes with the environment variables, so we have to undo its work in some places). If anyone else sees this, let

me know - - it may just be an quirk on my machine (unlikely). Testing with the gcc- 2.95.2 installation always

works.

• Windows 2

If you run the tests from PythonWin or some other GUI tool, you’ll get a ton of DOS windows popping up

periodically as weave spawns the compiler multiple times. Very annoying. Anyone know how to ﬁx this?

• wxPython

wxPython tests are not enabled by default because importing wxPython on a Unix machine without access to a

X-term will cause the program to exit. Anyone know of a safe way to detect whether wxPython can be imported

and whether a display exists on a machine?

1.16.6 Benchmarks

This section has not been updated from old scipy weave and Numeric....

This section has a few benchmarks – thats all people want to see anyway right? These are mostly taken from running

ﬁles in the weave/example directory and also from the test scripts. Without more information about what the test

actually do, their value is limited. Still, their here for the curious. Look at the example scripts for more speciﬁcs about

what problem was actually solved by each run. These examples are run under windows 2000 using Microsoft Visual

C++ and python2.1 on a 850 MHz PIII laptop with 320 MB of RAM. Speed up is the improvement (degredation)

factor of weave compared to conventional Python functions. The blitz() comparisons are shown compared to

NumPy.

Table 1.7: inline and ext_tools

Algorithm Speed up

binary search 1.50

ﬁbonacci (recursive) 82.10

ﬁbonacci (loop) 9.17

return None 0.14

map 1.20

dictionary sort 2.54

vector quantization 37.40

Table 1.8: blitz – double precision

Algorithm Speed up

a = b + c 512x512 3.05

a = b + c + d 512x512 4.59

5 pt avg. ﬁlter, 2D Image 512x512 9.01

Electromagnetics (FDTD) 100x100x100 8.61

The benchmarks shown blitz in the best possible light. NumPy (at least on my machine) is signiﬁcantly worse for

double precision than it is for single precision calculations. If your interested in single precision results, you can pretty

much divide the double precision speed up by 3 and you’ll be close.

1.16. Weave (scipy.weave) 127

SciPy Reference Guide, Release 0.13.0

1.16.7 Inline

inline() compiles and executes C/C++ code on the ﬂy. Variables in the local and global Python scope are also

available in the C/C++ code. Values are passed to the C/C++ code by assignment much like variables are passed into

a standard Python function. Values are returned from the C/C++ code through a special argument called return_val.

Also, the contents of mutable objects can be changed within the C/C++ code and the changes remain after the C code

exits and returns to Python. (more on this later)

Here’s a trivial printf example using inline():

>>> import weave

>>> a=1

>>> weave.inline(’printf("%d\\n",a);’,[’a’])

In this, its most basic form, inline(c_code, var_list) requires two arguments. c_code is a string of valid

C/C++ code. var_list is a list of variable names that are passed from Python into C/C++. Here we have a simple

printf statement that writes the Python variable ato the screen. The ﬁrst time you run this, there will be a pause

while the code is written to a .cpp ﬁle, compiled into an extension module, loaded into Python, cataloged for future

use, and executed. On windows (850 MHz PIII), this takes about 1.5 seconds when using Microsoft’s C++ compiler

(MSVC) and 6-12 seconds using gcc (mingw32 2.95.2). All subsequent executions of the code will happen very

quickly because the code only needs to be compiled once. If you kill and restart the interpreter and then execute the

same code fragment again, there will be a much shorter delay in the fractions of seconds range. This is because weave

stores a catalog of all previously compiled functions in an on disk cache. When it sees a string that has been compiled,

it loads the already compiled module and executes the appropriate function.

Note: If you try the printf example in a GUI shell such as IDLE, PythonWin, PyShell, etc., you’re unlikely to

see the output. This is because the C code is writing to stdout, instead of to the GUI window. This doesn’t mean that

inline doesn’t work in these environments – it only means that standard out in C is not the same as the standard out for

Python in these cases. Non input/output functions will work as expected.

Although effort has been made to reduce the overhead associated with calling inline, it is still less efﬁcient for simple

code snippets than using equivalent Python code. The simple printf example is actually slower by 30% or so

than using Python print statement. And, it is not difﬁcult to create code fragments that are 8-10 times slower

using inline than equivalent Python. However, for more complicated algorithms, the speedup can be worthwhile –

anywhere from 1.5-30 times faster. Algorithms that have to manipulate Python objects (sorting a list) usually only see

a factor of 2 or so improvement. Algorithms that are highly computational or manipulate NumPy arrays can see much

larger improvements. The examples/vq.py ﬁle shows a factor of 30 or more improvement on the vector quantization

algorithm that is used heavily in information theory and classiﬁcation problems.

More with printf

MSVC users will actually see a bit of compiler output that distutils does not suppress the ﬁrst time the code executes:

>>> weave.inline(r’printf("%d\n",a);’,[’a’])

sc_e013937dbc8c647ac62438874e5795131.cpp

Creating library C:\DOCUME~1\eric\LOCALS~1\Temp\python21_compiled\temp

\Release\sc_e013937dbc8c647ac62438874e5795131.lib and

object C:\DOCUME~1\eric\LOCALS~1\Temp\python21_compiled\temp\Release\sc_e013937dbc8c647ac62438874e5795131.exp

Nothing bad is happening, its just a bit annoying. * Anyone know how to turn this off?*

This example also demonstrates using ‘raw strings’. The rpreceeding the code string in the last example denotes

that this is a ‘raw string’. In raw strings, the backslash character is not interpreted as an escape character, and so it

isn’t necessary to use a double backslash to indicate that the ‘n’ is meant to be interpreted in the C printf statement

128 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

instead of by Python. If your C code contains a lot of strings and control characters, raw strings might make things

easier. Most of the time, however, standard strings work just as well.

The printf statement in these examples is formatted to print out integers. What happens if ais a string? inline

will happily, compile a new version of the code to accept strings as input, and execute the code. The result?

>>> a=’string’

>>> weave.inline(r’printf("%d\n",a);’,[’a’])

32956972

In this case, the result is non-sensical, but also non-fatal. In other situations, it might produce a compile time error

because ais required to be an integer at some point in the code, or it could produce a segmentation fault. Its possible

to protect against passing inline arguments of the wrong data type by using asserts in Python.

>>> a=’string’

>>> def protected_printf(a):

... assert(type(a) == type(1))

... weave.inline(r’printf("%d\n",a);’,[’a’])

>>> protected_printf(1)

>>> protected_printf(’string’)

AssertError...

For printing strings, the format statement needs to be changed. Also, weave doesn’t convert strings to char*. Instead

it uses CXX Py::String type, so you have to do a little more work. Here we convert it to a C++ std::string and then ask

cor the char* version.

>>> a=’string’

>>> weave.inline(r’printf("%s\n",std::string(a).c_str());’,[’a’])

string

XXX

This is a little convoluted. Perhaps strings should convert to std::string objects instead of CXX objects. Or

maybe to char*.

As in this case, C/C++ code fragments often have to change to accept different types. For the given printing task,

however, C++ streams provide a way of a single statement that works for integers and strings. By default, the stream

objects live in the std (standard) namespace and thus require the use of std::.

>>> weave.inline(’std::cout << a << std::endl;’,[’a’])

>>> a=’string’

>>> weave.inline(’std::cout << a << std::endl;’,[’a’])

string

Examples using printf and cout are included in examples/print_example.py.

More examples

This section shows several more advanced uses of inline. It includes a few algorithms from the Python Cookbook

that have been re-written in inline C to improve speed as well as a couple examples using NumPy and wxPython.

Binary search

Lets look at the example of searching a sorted list of integers for a value. For inspiration, we’ll use Kalle Svensson’s

binary_search() algorithm from the Python Cookbook. His recipe follows:

1.16. Weave (scipy.weave) 129

SciPy Reference Guide, Release 0.13.0

def binary_search(seq, t):

min =0;max =len(seq) -1

while 1:

if max <min:

return -1

m=(min +max)/2

if seq[m] <t:

min =m+1

elif seq[m] >t:

max =m-1

else:

return m

This Python version works for arbitrary Python data types. The C version below is specialized to handle integer values.

There is a little type checking done in Python to assure that we’re working with the correct data types before heading

into C. The variables seq and tdon’t need to be declared beacuse weave handles converting and declaring them in

the C code. All other temporary variables such as min, max, etc. must be declared – it is C after all. Here’s the new

mixed Python/C function:

def c_int_binary_search(seq,t):

# do a little type checking in Python

assert(type(t) == type(1))

assert(type(seq) == type([]))

# now the C code

code ="""

#line 29 "binary_search.py"

int val, m, min = 0;

int max = seq.length() - 1;

PyObject *py_val;

for(;;)

{

if (max < min )

{

return_val = Py::new_reference_to(Py::Int(-1));

break;

}

m = (min + max) /2;

val = py_to_int(PyList_GetItem(seq.ptr(),m),"val");

if (val < t)

min = m + 1;

else if (val > t)

max = m - 1;

else

{

return_val = Py::new_reference_to(Py::Int(m));

break;

}

"""

return inline(code,[’seq’,’t’])

We have two variables seq and tpassed in. tis guaranteed (by the assert) to be an integer. Python integers are

converted to C int types in the transition from Python to C. seq is a Python list. By default, it is translated to a CXX

list object. Full documentation for the CXX library can be found at its website. The basics are that the CXX provides

C++ class equivalents for Python objects that simplify, or at least object orientify, working with Python objects in

C/C++. For example, seq.length() returns the length of the list. A little more about CXX and its class methods,

etc. is in the Type Conversions section.

130 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

Note: CXX uses templates and therefore may be a little less portable than another alternative by Gordan McMillan

called SCXX which was inspired by CXX. It doesn’t use templates so it should compile faster and be more portable.

SCXX has a few less features, but it appears to me that it would mesh with the needs of weave quite well. Hopefully

xxx_spec ﬁles will be written for SCXX in the future, and we’ll be able to compare on a more empirical basis. Both

sets of spec ﬁles will probably stick around, it just a question of which becomes the default.

Most of the algorithm above looks similar in C to the original Python code. There are two main differences. The ﬁrst is

the setting of return_val instead of directly returning from the C code with a return statement. return_val

is an automatically deﬁned variable of type PyObject*that is returned from the C code back to Python. You’ll

have to handle reference counting issues when setting this variable. In this example, CXX classes and functions

handle the dirty work. All CXX functions and classes live in the namespace Py::. The following code con-

verts the integer mto a CXX Int() object and then to a PyObject*with an incremented reference count using

Py::new_reference_to().

return_val = Py::new_reference_to(Py::Int(m));

The second big differences shows up in the retrieval of integer values from the Python list. The simple Python seq[i]

call balloons into a C Python API call to grab the value out of the list and then a separate call to py_to_int() that

converts the PyObject* to an integer. py_to_int() includes both a NULL cheack and a PyInt_Check() call as

well as the conversion call. If either of the checks fail, an exception is raised. The entire C++ code block is executed

with in a try/catch block that handles exceptions much like Python does. This removes the need for most error

checking code.

It is worth note that CXX lists do have indexing operators that result in code that looks much like Python. However,

the overhead in using them appears to be relatively high, so the standard Python API was used on the seq.ptr()

which is the underlying PyObject*of the List object.

The #line directive that is the ﬁrst line of the C code block isn’t necessary, but it’s nice for debugging. If the

compilation fails because of the syntax error in the code, the error will be reported as an error in the Python ﬁle

“binary_search.py” with an offset from the given line number (29 here).

So what was all our effort worth in terms of efﬁciency? Well not a lot in this case. The examples/binary_search.py ﬁle

runs both Python and C versions of the functions As well as using the standard bisect module. If we run it on a 1

million element list and run the search 3000 times (for 0- 2999), here are the results we get:

C:\home\ej\wrk\scipy\weave\examples> python binary_search.py

Binary search for 3000 items in 1000000 length list of integers:

speed in python: 0.159999966621

speed of bisect: 0.121000051498

speed up: 1.32

speed in c: 0.110000014305

speed up: 1.45

speed in c(no asserts): 0.0900000333786

speed up: 1.78

So, we get roughly a 50-75% improvement depending on whether we use the Python asserts in our C version. If

we move down to searching a 10000 element list, the advantage evaporates. Even smaller lists might result in the

Python version being faster. I’d like to say that moving to NumPy lists (and getting rid of the GetItem() call) offers a

substantial speed up, but my preliminary efforts didn’t produce one. I think the log(N) algorithm is to blame. Because

the algorithm is nice, there just isn’t much time spent computing things, so moving to C isn’t that big of a win. If

there are ways to reduce conversion overhead of values, this may improve the C/Python speed up. Anyone have other

explanations or faster code, please let me know.

1.16. Weave (scipy.weave) 131

SciPy Reference Guide, Release 0.13.0

Dictionary Sort

The demo in examples/dict_sort.py is another example from the Python CookBook. This submission, by Alex Martelli,

demonstrates how to return the values from a dictionary sorted by their keys:

def sortedDictValues3(adict):

keys =adict.keys()

keys.sort()

return map(adict.get, keys)

Alex provides 3 algorithms and this is the 3rd and fastest of the set. The C version of this same algorithm follows:

def c_sort(adict):

assert(type(adict) == type({}))

code ="""

#line 21 "dict_sort.py"

Py::List keys = adict.keys();

Py::List items(keys.length()); keys.sort();

PyObject*item = NULL;

for(int i = 0; i < keys.length();i++)

{

item = PyList_GET_ITEM(keys.ptr(),i);

item = PyDict_GetItem(adict.ptr(),item);

Py_XINCREF(item);

PyList_SetItem(items.ptr(),i,item);

}

return_val = Py::new_reference_to(items);

"""

return inline_tools.inline(code,[’adict’],verbose=1)

Like the original Python function, the C++ version can handle any Python dictionary regardless of the key/value pair

types. It uses CXX objects for the most part to declare python types in C++, but uses Python API calls to manipulate

their contents. Again, this choice is made for speed. The C++ version, while more complicated, is about a factor of 2

faster than Python.

C:\home\ej\wrk\scipy\weave\examples> python dict_sort.py

Dict sort of 1000 items for 300 iterations:

speed in python: 0.319999933243

[0, 1, 2, 3, 4]

speed in c: 0.151000022888

speed up: 2.12

[0, 1, 2, 3, 4]

NumPy – cast/copy/transpose

CastCopyTranspose is a function called quite heavily by Linear Algebra routines in the NumPy library. Its needed

in part because of the row-major memory layout of multi-demensional Python (and C) arrays vs. the col-major order

of the underlying Fortran algorithms. For small matrices (say 100x100 or less), a signiﬁcant portion of the common

routines such as LU decompisition or singular value decompostion are spent in this setup routine. This shouldn’t

happen. Here is the Python version of the function using standard NumPy operations.

def _castCopyAndTranspose(type, array):

if a.typecode() == type:

cast_array =copy.copy(NumPy.transpose(a))

else:

cast_array =copy.copy(NumPy.transpose(a).astype(type))

return cast_array

And the following is a inline C version of the same function:

132 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

from weave.blitz_tools import blitz_type_factories

from weave import scalar_spec

from weave import inline

def _cast_copy_transpose(type,a_2d):

assert(len(shape(a_2d)) == 2)

new_array =zeros(shape(a_2d),type)

NumPy_type =scalar_spec.NumPy_to_blitz_type_mapping[type]

code =\

"""

for(int i = 0;i < _Na_2d[0]; i++)

for(int j = 0; j < _Na_2d[1]; j++)

new_array(i,j) = (%s) a_2d(j,i);

""" %NumPy_type

inline(code,[’new_array’,’a_2d’],

type_factories =blitz_type_factories,compiler=’gcc’)

return new_array

This example uses blitz++ arrays instead of the standard representation of NumPy arrays so that indexing is simplier

to write. This is accomplished by passing in the blitz++ “type factories” to override the standard Python to C++ type

conversions. Blitz++ arrays allow you to write clean, fast code, but they also are sloooow to compile (20 seconds

or more for this snippet). This is why they aren’t the default type used for Numeric arrays (and also because most

compilers can’t compile blitz arrays...). inline() is also forced to use ‘gcc’ as the compiler because the default

compiler on Windows (MSVC) will not compile blitz code. (‘gcc’ I think will use the standard compiler on Unix

machine instead of explicitly forcing gcc (check this)) Comparisons of the Python vs inline C++ code show a factor

of 3 speed up. Also shown are the results of an “inplace” transpose routine that can be used if the output of the

linear algebra routine can overwrite the original matrix (this is often appropriate). This provides another factor of 2

improvement.

#C:\home\ej\wrk\scipy\weave\examples> python cast_copy_transpose.py

# Cast/Copy/Transposing (150,150)array 1 times

# speed in python: 0.870999932289

# speed in c: 0.25

# speed up: 3.48

# inplace transpose c: 0.129999995232

# speed up: 6.70

wxPython

inline knows how to handle wxPython objects. Thats nice in and of itself, but it also demonstrates that the type

conversion mechanism is reasonably ﬂexible. Chances are, it won’t take a ton of effort to support special types you

might have. The examples/wx_example.py borrows the scrolled window example from the wxPython demo, accept

that it mixes inline C code in the middle of the drawing function.

def DoDrawing(self, dc):

red =wxNamedColour("RED");

blue =wxNamedColour("BLUE");

grey_brush =wxLIGHT_GREY_BRUSH;

code =\

"""

#line 108 "wx_example.py"

dc->BeginDrawing();

dc->SetPen(wxPen(*red,4,wxSOLID));

dc->DrawRectangle(5,5,50,50);

dc->SetBrush(*grey_brush);

dc->SetPen(wxPen(*blue,4,wxSOLID));

dc->DrawRectangle(15, 15, 50, 50);

"""

1.16. Weave (scipy.weave) 133

SciPy Reference Guide, Release 0.13.0

inline(code,[’dc’,’red’,’blue’,’grey_brush’])

dc.SetFont(wxFont(14, wxSWISS, wxNORMAL, wxNORMAL))

dc.SetTextForeground(wxColour(0xFF,0x20,0xFF))

te =dc.GetTextExtent("Hello World")

dc.DrawText("Hello World",60,65)

dc.SetPen(wxPen(wxNamedColour(’VIOLET’), 4))

dc.DrawLine(5,65+te[1], 60+te[0], 65+te[1])

...

Here, some of the Python calls to wx objects were just converted to C++ calls. There isn’t any beneﬁt, it just demon-

strates the capabilities. You might want to use this if you have a computationally intensive loop in your drawing code

that you want to speed up. On windows, you’ll have to use the MSVC compiler if you use the standard wxPython

DLLs distributed by Robin Dunn. Thats because MSVC and gcc, while binary compatible in C, are not binary com-

patible for C++. In fact, its probably best, no matter what platform you’re on, to specify that inline use the same

compiler that was used to build wxPython to be on the safe side. There isn’t currently a way to learn this info from the

library – you just have to know. Also, at least on the windows platform, you’ll need to install the wxWindows libraries

and link to them. I think there is a way around this, but I haven’t found it yet – I get some linking errors dealing with

wxString. One ﬁnal note. You’ll probably have to tweak weave/wx_spec.py or weave/wx_info.py for your machine’s

conﬁguration to point at the correct directories etc. There. That should sufﬁciently scare people into not even looking

at this... :)

Keyword Option

The basic deﬁnition of the inline() function has a slew of optional variables. It also takes keyword arguments that

are passed to distutils as compiler options. The following is a formatted cut/paste of the argument section of

inline’s doc-string. It explains all of the variables. Some examples using various options will follow.

def inline(code,arg_names,local_dict = None, global_dict = None,

force = 0,

compiler=’’,

verbose = 0,

support_code = None,

customize=None,

type_factories = None,

auto_downcast=1,

**kw):

inline has quite a few options as listed below. Also, the keyword arguments for distutils extension modules are

accepted to specify extra information needed for compiling.

Inline Arguments

code string. A string of valid C++ code. It should not specify a return statement. Instead it should assign results that

need to be returned to Python in the return_val. arg_names list of strings. A list of Python variable names that should

be transferred from Python into the C/C++ code. local_dict optional. dictionary. If speciﬁed, it is a dictionary of

values that should be used as the local scope for the C/C++ code. If local_dict is not speciﬁed the local dictionary of

the calling function is used. global_dict optional. dictionary. If speciﬁed, it is a dictionary of values that should be

used as the global scope for the C/C++ code. If global_dict is not speciﬁed the global dictionary of the calling function

is used. force optional. 0 or 1. default 0. If 1, the C++ code is compiled every time inline is called. This is really only

useful for debugging, and probably only useful if you’re editing support_code a lot. compiler optional. string. The

name of compiler to use when compiling. On windows, it understands ‘msvc’ and ‘gcc’ as well as all the compiler

names understood by distutils. On Unix, it’ll only understand the values understoof by distutils. (I should add ‘gcc’

though to this).

134 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

On windows, the compiler defaults to the Microsoft C++ compiler. If this isn’t available, it looks for mingw32 (the

gcc compiler).

On Unix, it’ll probably use the same compiler that was used when compiling Python. Cygwin’s behavior should be

similar.

verbose optional. 0,1, or 2. defualt 0. Speﬁcies how much much information is printed during the compile phase

of inlining code. 0 is silent (except on windows with msvc where it still prints some garbage). 1 informs you when

compiling starts, ﬁnishes, and how long it took. 2 prints out the command lines for the compilation process and can

be useful if you’re having problems getting code to work. Its handy for ﬁnding the name of the .cpp ﬁle if you need

to examine it. verbose has no affect if the compilation isn’t necessary. support_code optional. string. A string of

valid C++ code declaring extra code that might be needed by your compiled function. This could be declarations of

functions, classes, or structures. customize optional. base_info.custom_info object. An alternative way to speciﬁy

support_code, headers, etc. needed by the function see the weave.base_info module for more details. (not sure this’ll

be used much). type_factories optional. list of type speciﬁcation factories. These guys are what convert Python data

types to C/C++ data types. If you’d like to use a different set of type conversions than the default, specify them here.

Look in the type conversions section of the main documentation for examples. auto_downcast optional. 0 or 1. default

1. This only affects functions that have Numeric arrays as input variables. Setting this to 1 will cause all ﬂoating point

values to be cast as ﬂoat instead of double if all the NumPy arrays are of type ﬂoat. If even one of the arrays has type

double or double complex, all variables maintain there standard types.

Distutils keywords

inline() also accepts a number of distutils keywords for controlling how the code is compiled. The following

descriptions have been copied from Greg Ward’s distutils.extension.Extension class doc- strings for

convenience: sources [string] list of source ﬁlenames, relative to the distribution root (where the setup script lives), in

Unix form (slash- separated) for portability. Source ﬁles may be C, C++, SWIG (.i), platform- speciﬁc resource ﬁles,

or whatever else is recognized by the “build_ext” command as source for a Python extension. Note: The module_path

ﬁle is always appended to the front of this list include_dirs [string] list of directories to search for C/C++ header ﬁles

(in Unix form for portability) deﬁne_macros [(name : string, value : string|None)] list of macros to deﬁne; each macro

is deﬁned using a 2-tuple, where ‘value’ is either the string to deﬁne it to or None to deﬁne it without a particular value

(equivalent of “#deﬁne FOO” in source or -DFOO on Unix C compiler command line) undef_macros [string] list of

macros to undeﬁne explicitly library_dirs [string] list of directories to search for C/C++ libraries at link time libraries

[string] list of library names (not ﬁlenames or paths) to link against runtime_library_dirs [string] list of directories to

search for C/C++ libraries at run time (for shared extensions, this is when the extension is loaded) extra_objects [string]

list of extra ﬁles to link with (eg. object ﬁles not implied by ‘sources’, static library that must be explicitly speciﬁed,

binary resource ﬁles, etc.) extra_compile_args [string] any extra platform- and compiler-speciﬁc information to use

when compiling the source ﬁles in ‘sources’. For platforms and compilers where “command line” makes sense, this is

typically a list of command-line arguments, but for other platforms it could be anything. extra_link_args [string] any

extra platform- and compiler-speciﬁc information to use when linking object ﬁles together to create the extension (or

to create a new static Python interpreter). Similar interpretation as for ‘extra_compile_args’. export_symbols [string]

list of symbols to be exported from a shared extension. Not used on all platforms, and not generally necessary for

Python extensions, which typically export exactly one symbol: “init” + extension_name.

Keyword Option Examples

We’ll walk through several examples here to demonstrate the behavior of inline and also how the various arguments

are used. In the simplest (most) cases, code and arg_names are the only arguments that need to be speciﬁed. Here’s

a simple example run on Windows machine that has Microsoft VC++ installed.

>>> from weave import inline

>>> a=’string’

>>> code ="""

... int l = a.length();

... return_val = Py::new_reference_to(Py::Int(l));

1.16. Weave (scipy.weave) 135

SciPy Reference Guide, Release 0.13.0

... """

>>> inline(code,[’a’])

sc_86e98826b65b047ffd2cd5f479c627f12.cpp

Creating

library C:\DOCUME~1\eric\LOCALS~1\Temp\python21_compiled\temp\Release\sc_86e98826b65b047ffd2cd5f479c627f12.lib

and object C:\DOCUME~1\eric\LOCALS~1\Temp\python21_compiled\temp\Release\sc_86e98826b65b047ff

d2cd5f479c627f12.exp

>>> inline(code,[’a’])

When inline is ﬁrst run, you’ll notice that pause and some trash printed to the screen. The “trash” is actually part of

the compiler’s output that distutils does not supress. The name of the extension ﬁle, sc_bighonkingnumber.cpp,

is generated from the SHA-256 check sum of the C/C++ code fragment. On Unix or windows machines with only gcc

installed, the trash will not appear. On the second call, the code fragment is not compiled since it already exists, and

only the answer is returned. Now kill the interpreter and restart, and run the same code with a different string.

>>> from weave import inline

>>> a=’a longer string’

>>> code ="""

... int l = a.length();

... return_val = Py::new_reference_to(Py::Int(l));

... """

>>> inline(code,[’a’])

Notice this time, inline() did not recompile the code because it found the compiled function in the persistent

catalog of functions. There is a short pause as it looks up and loads the function, but it is much shorter than compiling

would require.

You can specify the local and global dictionaries if you’d like (much like exec or eval() in Python), but if they

aren’t speciﬁed, the “expected” ones are used – i.e. the ones from the function that called inline(). This is

accomplished through a little call frame trickery. Here is an example where the local_dict is speciﬁed using the same

code example from above:

>>> a=’a longer string’

>>> b=’an even longer string’

>>> my_dict ={’a’:b}

>>> inline(code,[’a’])

>>> inline(code,[’a’],my_dict)

Everytime, the code is changed, inline does a recompile. However, changing any of the other options in inline

does not force a recompile. The force option was added so that one could force a recompile when tinkering with

other variables. In practice, it is just as easy to change the code by a single character (like adding a space some place)

to force the recompile.

Note: It also might be nice to add some methods for purging the cache and on disk catalogs.

I use verbose sometimes for debugging. When set to 2, it’ll output all the information (including the name of

the .cpp ﬁle) that you’d expect from running a make ﬁle. This is nice if you need to examine the generated code to

see where things are going haywire. Note that error messages from failed compiles are printed to the screen even if

verbose is set to 0.

The following example demonstrates using gcc instead of the standard msvc compiler on windows using same code

fragment as above. Because the example has already been compiled, the force=1 ﬂag is needed to make inline()

136 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

ignore the previously compiled version and recompile using gcc. The verbose ﬂag is added to show what is printed

out:

>>>inline(code,[’a’],compiler=’gcc’,verbose=2,force=1)

running build_ext

building ’sc_86e98826b65b047ffd2cd5f479c627f13’ extension

c:\gcc-2.95.2\bin\g++.exe -mno-cygwin -mdll -O2 -w -Wstrict-prototypes -IC:

\home\ej\wrk\scipy\weave -IC:\Python21\Include -c C:\DOCUME~1\eric\LOCAL

S~1\Temp\python21_compiled\sc_86e98826b65b047ffd2cd5f479c627f13.cpp

-o C:\DOCUME~1\eric\LOCALS~1\Temp\python21_compiled\temp\Release\sc_86e98826b65b04ffd2cd5f479c627f13.o

skipping C:\home\ej\wrk\scipy\weave\CXX\cxxextensions.c

(C:\DOCUME~1\eric\LOCALS~1\Temp\python21_compiled\temp\Release\cxxextensions.o up-to-date)

skipping C:\home\ej\wrk\scipy\weave\CXX\cxxsupport.cxx

(C:\DOCUME~1\eric\LOCALS~1\Temp\python21_compiled\temp\Release\cxxsupport.o up-to-date)

skipping C:\home\ej\wrk\scipy\weave\CXX\IndirectPythonInterface.cxx

(C:\DOCUME~1\eric\LOCALS~1\Temp\python21_compiled\temp\Release\indirectpythoninterface.o up-to-date)

skipping C:\home\ej\wrk\scipy\weave\CXX\cxx_extensions.cxx

(C:\DOCUME~1\eric\LOCALS~1\Temp\python21_compiled\temp\Release\cxx_extensions.o

up-to-date)

writing C:\DOCUME~1\eric\LOCALS~1\Temp\python21_compiled\temp\Release\sc_86e98826b65b047ffd2cd5f479c627f13.def

c:\gcc-2.95.2\bin\dllwrap.exe --driver-name g++ -mno-cygwin

-mdll -static --output-lib

C:\DOCUME~1\eric\LOCALS~1\Temp\python21_compiled\temp\Release\libsc_86e98826b65b047ffd2cd5f479c627f13.a --def

C:\DOCUME~1\eric\LOCALS~1\Temp\python21_compiled\temp\Release\sc_86e98826b65b047ffd2cd5f479c627f13.def

-sC:\DOCUME~1\eric\LOCALS~1\Temp\python21_compiled\temp\Release\sc_86e98826b65b047ffd2cd5f479c627f13.o

C:\DOCUME~1\eric\LOCALS~1\Temp\python21_compiled\temp\Release\cxxextensions.o

C:\DOCUME~1\eric\LOCALS~1\Temp\python21_compiled\temp\Release\cxxsupport.o

C:\DOCUME~1\eric\LOCALS~1\Temp\python21_compiled\temp\Release\indirectpythoninterface.o

C:\DOCUME~1\eric\LOCALS~1\Temp\python21_compiled\temp\Release\cxx_extensions.o -LC:\Python21\libs

-lpython21 -o

C:\DOCUME~1\eric\LOCALS~1\Temp\python21_compiled\sc_86e98826b65b047ffd2cd5f479c627f13.pyd

That’s quite a bit of output. verbose=1 just prints the compile time.

>>>inline(code,[’a’],compiler=’gcc’,verbose=1,force=1)

Compiling code...

finished compiling (sec): 6.00800001621

Note: I’ve only used the compiler option for switching between ‘msvc’ and ‘gcc’ on windows. It may have use on

Unix also, but I don’t know yet.

The support_code argument is likely to be used a lot. It allows you to specify extra code fragments such as

function, structure or class deﬁnitions that you want to use in the code string. Note that changes to support_code

do not force a recompile. The catalog only relies on code (for performance reasons) to determine whether recompiling

is necessary. So, if you make a change to support_code, you’ll need to alter code in some way or use the force

argument to get the code to recompile. I usually just add some inocuous whitespace to the end of one of the lines in

code somewhere. Here’s an example of deﬁning a separate method for calculating the string length:

>>> from weave import inline

>>> a=’a longer string’

>>> support_code ="""

... PyObject*length(Py::String a)

... {

... int l = a.length();

... return Py::new_reference_to(Py::Int(l));

... }

1.16. Weave (scipy.weave) 137

SciPy Reference Guide, Release 0.13.0

... """

>>> inline("return_val = length(a);",[’a’],

... support_code =support_code)

customize is a left over from a previous way of specifying compiler options. It is a custom_info object that can

specify quite a bit of information about how a ﬁle is compiled. These info objects are the standard way of deﬁning

compile information for type conversion classes. However, I don’t think they are as handy here, especially since we’ve

exposed all the keyword arguments that distutils can handle. Between these keywords, and the support_code

option, I think customize may be obsolete. We’ll see if anyone cares to use it. If not, it’ll get axed in the next

version.

The type_factories variable is important to people who want to customize the way arguments are converted

from Python to C. We’ll talk about this in the next chapter xx of this document when we discuss type conversions.

auto_downcast handles one of the big type conversion issues that is common when using NumPy arrays in con-

junction with Python scalar values. If you have an array of single precision values and multiply that array by a Python

scalar, the result is upcast to a double precision array because the scalar value is double precision. This is not usu-

ally the desired behavior because it can double your memory usage. auto_downcast goes some distance towards

changing the casting precedence of arrays and scalars. If your only using single precision arrays, it will automatically

downcast all scalar values from double to single precision when they are passed into the C++ code. This is the default

behavior. If you want all values to keep there default type, set auto_downcast to 0.

Returning Values

Python variables in the local and global scope transfer seemlessly from Python into the C++ snippets. And, if inline

were to completely live up to its name, any modiﬁcations to variables in the C++ code would be reﬂected in the Python

variables when control was passed back to Python. For example, the desired behavior would be something like:

# THIS DOES NOT WORK

>>> a = 1

>>> weave.inline("a++;",[’a’])

>>> a

Instead you get:

>>> a=1

>>> weave.inline("a++;",[’a’])

>>> a

Variables are passed into C++ as if you are calling a Python function. Python’s calling convention is sometimes called

“pass by assignment”. This means its as if a c_a = a assignment is made right before inline call is made and the

c_a variable is used within the C++ code. Thus, any changes made to c_a are not reﬂected in Python’s avariable.

Things do get a little more confusing, however, when looking at variables with mutable types. Changes made in C++

to the contents of mutable types are reﬂected in the Python variables.

>>> a=[1,2]

>>> weave.inline("PyList_SetItem(a.ptr(),0,PyInt_FromLong(3));",[’a’])

>>> print a

[3, 2]

So modiﬁcations to the contents of mutable types in C++ are seen when control is returned to Python. Modiﬁcations

to immutable types such as tuples, strings, and numbers do not alter the Python variables. If you need to make changes

to an immutable variable, you’ll need to assign the new value to the “magic” variable return_val in C++. This

value is returned by the inline() function:

138 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

>>> a=1

>>> a=weave.inline("return_val = Py::new_reference_to(Py::Int(a+1));",[’a’])

>>> a

The return_val variable can also be used to return newly created values. This is possible by returning a tuple. The

following trivial example illustrates how this can be done:

# python version

def multi_return():

return 1,’2nd’

# C version.

def c_multi_return():

code ="""

py::tuple results(2);

results[0] = 1;

results[1] = "2nd";

return_val = results;

"""

return inline_tools.inline(code)

The example is available in examples/tuple_return.py. It also has the dubious honor of demonstrating how

much inline() can slow things down. The C version here is about 7-10 times slower than the Python version. Of

course, something so trivial has no reason to be written in C anyway.

The issue with locals() inline passes the locals() and globals() dictionaries from Python into the

C++ function from the calling function. It extracts the variables that are used in the C++ code from these dictionaries,

converts then to C++ variables, and then calculates using them. It seems like it would be trivial, then, after the

calculations were ﬁnished to then insert the new values back into the locals() and globals() dictionaries so

that the modiﬁed values were reﬂected in Python. Unfortunately, as pointed out by the Python manual, the locals()

dictionary is not writable.

I suspect locals() is not writable because there are some optimizations done to speed lookups of the local names-

pace. I’m guessing local lookups don’t always look at a dictionary to ﬁnd values. Can someone “in the know” conﬁrm

or correct this? Another thing I’d like to know is whether there is a way to write to the local namespace of another

stack frame from C/C++. If so, it would be possible to have some clean up code in compiled functions that wrote

ﬁnal values of variables in C++ back to the correct Python stack frame. I think this goes a long way toward making

inline truely live up to its name. I don’t think we’ll get to the point of creating variables in Python for variables

created in C – although I suppose with a C/C++ parser you could do that also.

A quick look at the code

weave generates a C++ ﬁle holding an extension function for each inline code snippet. These ﬁle names are

generated using from the SHA-256 signature of the code snippet and saved to a location speciﬁed by the PYTHON-

COMPILED environment variable (discussed later). The cpp ﬁles are generally about 200-400 lines long and include

quite a few functions to support type conversions, etc. However, the actual compiled function is pretty simple. Below

is the familiar printf example:

>>> import weave

>>> a=1

>>> weave.inline(’printf("%d\\n",a);’,[’a’])

And here is the extension function generated by inline:

1.16. Weave (scipy.weave) 139

SciPy Reference Guide, Release 0.13.0

static PyObject*compiled_func(PyObject*self, PyObject*args)

{

py::object return_val;

int exception_occured = 0;

PyObject *py__locals = NULL;

PyObject *py__globals = NULL;

PyObject *py_a;

py_a = NULL;

if(!PyArg_ParseTuple(args,"OO:compiled_func",&py__locals,&py__globals))

return NULL;

try

{

PyObject*raw_locals = py_to_raw_dict(py__locals,"_locals");

PyObject*raw_globals = py_to_raw_dict(py__globals,"_globals");

/*argument conversion code */

py_a = get_variable("a",raw_locals,raw_globals);

int a = convert_to_int(py_a,"a");

/*inline code */

/*NDARRAY API VERSION 90907 */

printf("%d\n",a); /*I would like to fill in changed locals and globals here...*/

}

catch(...)

{

return_val = py::object();

exception_occured = 1;

}

/*cleanup code */

if(!(PyObject*)return_val && !exception_occured)

{

return_val = Py_None;

}

return return_val.disown();

}

Every inline function takes exactly two arguments – the local and global dictionaries for the current scope. All variable

values are looked up out of these dictionaries. The lookups, along with all inline code execution, are done within

a C++ try block. If the variables aren’t found, or there is an error converting a Python variable to the appropriate

type in C++, an exception is raised. The C++ exception is automatically converted to a Python exception by SCXX

and returned to Python. The py_to_int() function illustrates how the conversions and exception handling works.

py_to_int ﬁrst checks that the given PyObject* pointer is not NULL and is a Python integer. If all is well, it calls the

Python API to convert the value to an int. Otherwise, it calls handle_bad_type() which gathers information

about what went wrong and then raises a SCXX TypeError which returns to Python as a TypeError.

int py_to_int(PyObject*py_obj,char*name)

{

if (!py_obj || !PyInt_Check(py_obj))

handle_bad_type(py_obj,"int", name);

return (int) PyInt_AsLong(py_obj);

}

void handle_bad_type(PyObject*py_obj, char*good_type, char*var_name)

{

char msg[500];

sprintf(msg,"received ’%s’ type instead of ’%s’ for variable ’%s’",

find_type(py_obj),good_type,var_name);

throw Py::TypeError(msg);

}

140 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

char*find_type(PyObject*py_obj)

{

if(py_obj == NULL) return "C NULL value";

if(PyCallable_Check(py_obj)) return "callable";

if(PyString_Check(py_obj)) return "string";

if(PyInt_Check(py_obj)) return "int";

if(PyFloat_Check(py_obj)) return "float";

if(PyDict_Check(py_obj)) return "dict";

if(PyList_Check(py_obj)) return "list";

if(PyTuple_Check(py_obj)) return "tuple";

if(PyFile_Check(py_obj)) return "file";

if(PyModule_Check(py_obj)) return "module";

//should probably do more interagation (and thinking) on these.

if(PyCallable_Check(py_obj) && PyInstance_Check(py_obj)) return "callable";

if(PyInstance_Check(py_obj)) return "instance";

if(PyCallable_Check(py_obj)) return "callable";

return "unknown type";

}

Since the inline is also executed within the try/catch block, you can use CXX exceptions within your code. It

is usually a bad idea to directly return from your code, even if an error occurs. This skips the clean up section of

the extension function. In this simple example, there isn’t any clean up code, but in more complicated examples, there

may be some reference counting that needs to be taken care of here on converted variables. To avoid this, either uses

exceptions or set return_val to NULL and use if/then’s to skip code after errors.

Technical Details

There are several main steps to using C/C++ code within Python:

1. Type conversion

2. Generating C/C++ code

3. Compile the code to an extension module

4. Catalog (and cache) the function for future use

Items 1 and 2 above are related, but most easily discussed separately. Type conversions are customizable by the user if

needed. Understanding them is pretty important for anything beyond trivial uses of inline. Generating the C/C++

code is handled by ext_function and ext_module classes and . For the most part, compiling the code is handled

by distutils. Some customizations were needed, but they were relatively minor and do not require changes to distutils

itself. Cataloging is pretty simple in concept, but surprisingly required the most code to implement (and still likely

needs some work). So, this section covers items 1 and 4 from the list. Item 2 is covered later in the chapter covering

the ext_tools module, and distutils is covered by a completely separate document xxx.

Passing Variables in/out of the C/C++ code

Note: Passing variables into the C code is pretty straight forward, but there are subtlties to how variable modiﬁcations

in C are returned to Python. see Returning Values for a more thorough discussion of this issue.

Type Conversions

1.16. Weave (scipy.weave) 141

SciPy Reference Guide, Release 0.13.0

Note: Maybe xxx_converter instead of xxx_specification is a more descriptive name. Might change in

future version?

By default, inline() makes the following type conversions between Python and C++ types.

Table 1.9: Default Data Type Conver-

sions

Python C++

int int

ﬂoat double

complex std::complex

string py::string

list py::list

dict py::dict

tuple py::tuple

ﬁle FILE*

callable py::object

instance py::object

numpy.ndarray PyArrayObject*

wxXXX wxXXX*

The Py:: namespace is deﬁned by the SCXX library which has C++ class equivalents for many Python types. std::

is the namespace of the standard library in C++.

Note:

• I haven’t ﬁgured out how to handle long int yet (I think they are currenlty converted to int - - check this).

• Hopefully VTK will be added to the list soon

Python to C++ conversions ﬁll in code in several locations in the generated inline extension function. Below is the

basic template for the function. This is actually the exact code that is generated by calling weave.inline("").

The /*inline code */section is ﬁlled with the code passed to the inline() function call. The

/*argument conversion code*/and /*cleanup code */sections are ﬁlled with code that handles

conversion from Python to C++ types and code that deallocates memory or manipulates reference counts before the

function returns. The following sections demonstrate how these two areas are ﬁlled in by the default conversion meth-

ods. * Note: I’m not sure I have reference counting correct on a few of these. The only thing I increase/decrease the

ref count on is NumPy arrays. If you see an issue, please let me know.

NumPy Argument Conversion

Integer, ﬂoating point, and complex arguments are handled in a very similar fashion. Consider the following inline

function that has a single integer variable passed in:

>>> a=1

>>> inline("",[’a’])

The argument conversion code inserted for ais:

/*argument conversion code */

int a = py_to_int (get_variable("a",raw_locals,raw_globals),"a");

get_variable() reads the variable afrom the local and global namespaces. py_to_int() has the following

form:

142 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

static int py_to_int(PyObject*py_obj,char*name)

{

if (!py_obj || !PyInt_Check(py_obj))

handle_bad_type(py_obj,"int", name);

return (int) PyInt_AsLong(py_obj);

}

Similarly, the ﬂoat and complex conversion routines look like:

static double py_to_float(PyObject*py_obj,char*name)

{

if (!py_obj || !PyFloat_Check(py_obj))

handle_bad_type(py_obj,"float", name);

return PyFloat_AsDouble(py_obj);

}

static std::complex py_to_complex(PyObject*py_obj,char*name)

{

if (!py_obj || !PyComplex_Check(py_obj))

handle_bad_type(py_obj,"complex", name);

return std::complex(PyComplex_RealAsDouble(py_obj),

PyComplex_ImagAsDouble(py_obj));

}

NumPy conversions do not require any clean up code.

String, List, Tuple, and Dictionary Conversion

Strings, Lists, Tuples and Dictionary conversions are all converted to SCXX types by default. For the following code,

>>> a=[1]

>>> inline("",[’a’])

The argument conversion code inserted for ais:

/*argument conversion code */

Py::List a = py_to_list(get_variable("a",raw_locals,raw_globals),"a");

get_variable() reads the variable afrom the local and global namespaces. py_to_list() and its friends

have the following form:

static Py::List py_to_list(PyObject*py_obj,char*name)

{

if (!py_obj || !PyList_Check(py_obj))

handle_bad_type(py_obj,"list", name);

return Py::List(py_obj);

}

static Py::String py_to_string(PyObject*py_obj,char*name)

{

if (!PyString_Check(py_obj))

handle_bad_type(py_obj,"string", name);

return Py::String(py_obj);

}

static Py::Dict py_to_dict(PyObject*py_obj,char*name)

{

if (!py_obj || !PyDict_Check(py_obj))

handle_bad_type(py_obj,"dict", name);

return Py::Dict(py_obj);

1.16. Weave (scipy.weave) 143

SciPy Reference Guide, Release 0.13.0

}

static Py::Tuple py_to_tuple(PyObject*py_obj,char*name)

{

if (!py_obj || !PyTuple_Check(py_obj))

handle_bad_type(py_obj,"tuple", name);

return Py::Tuple(py_obj);

}

SCXX handles reference counts on for strings, lists, tuples, and dictionaries, so clean up code isn’t necessary.

File Conversion

For the following code,

>>> a=open("bob",’w’)

>>> inline("",[’a’])

The argument conversion code is:

/*argument conversion code */

PyObject*py_a = get_variable("a",raw_locals,raw_globals);

FILE*a = py_to_file(py_a,"a");

get_variable() reads the variable afrom the local and global namespaces. py_to_file() converts PyObject*

to a FILE* and increments the reference count of the PyObject*:

FILE*py_to_file(PyObject*py_obj, char*name)

{

if (!py_obj || !PyFile_Check(py_obj))

handle_bad_type(py_obj,"file", name);

Py_INCREF(py_obj);

return PyFile_AsFile(py_obj);

}

Because the PyObject* was incremented, the clean up code needs to decrement the counter

/*cleanup code */

Py_XDECREF(py_a);

Its important to understand that ﬁle conversion only works on actual ﬁles – i.e. ones created using the open()

command in Python. It does not support converting arbitrary objects that support the ﬁle interface into C FILE*

pointers. This can affect many things. For example, in initial printf() examples, one might be tempted to solve the

problem of C and Python IDE’s (PythonWin, PyCrust, etc.) writing to different stdout and stderr by using fprintf()

and passing in sys.stdout and sys.stderr. For example, instead of

>>> weave.inline(’printf("hello\\n");’)

You might try:

>>> buf =sys.stdout

>>> weave.inline(’fprintf(buf,"hello\\n");’,[’buf’])

This will work as expected from a standard python interpreter, but in PythonWin, the following occurs:

>>> buf =sys.stdout

>>> weave.inline(’fprintf(buf,"hello\\n");’,[’buf’])

144 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

The traceback tells us that inline() was unable to convert ‘buf’ to a C++ type (If instance conversion was imple-

mented, the error would have occurred at runtime instead). Why is this? Let’s look at what the buf object really

is:

>>> buf

pywin.framework.interact.InteractiveView instance at 00EAD014

PythonWin has reassigned sys.stdout to a special object that implements the Python ﬁle interface. This works

great in Python, but since the special object doesn’t have a FILE* pointer underlying it, fprintf doesn’t know what

to do with it (well this will be the problem when instance conversion is implemented...).

Callable, Instance, and Module Conversion

Note: Need to look into how ref counts should be handled. Also, Instance and Module conversion are not currently

implemented.

>>> def a():

pass

>>> inline("",[’a’])

Callable and instance variables are converted to PyObject*. Nothing is done to their reference counts.

/*argument conversion code */

PyObject*a = py_to_callable(get_variable("a",raw_locals,raw_globals),"a");

get_variable() reads the variable afrom the local and global namespaces. The py_to_callable() and

py_to_instance() don’t currently increment the ref count.

PyObject*py_to_callable(PyObject*py_obj, char*name)

{

if (!py_obj || !PyCallable_Check(py_obj))

handle_bad_type(py_obj,"callable", name);

return py_obj;

}

PyObject*py_to_instance(PyObject*py_obj, char*name)

{

if (!py_obj || !PyFile_Check(py_obj))

handle_bad_type(py_obj,"instance", name);

return py_obj;

}

There is no cleanup code for callables, modules, or instances.

Customizing Conversions

Converting from Python to C++ types is handled by xxx_specification classes. A type speciﬁcation class

actually serve in two related but different roles. The ﬁrst is in determining whether a Python variable that needs to be

converted should be represented by the given class. The second is as a code generator that generates C++ code needed

to convert from Python to C++ types for a speciﬁc variable.

When

>>> a=1

>>> weave.inline(’printf("%d",a);’,[’a’])

is called for the ﬁrst time, the code snippet has to be compiled. In this process, the variable ‘a’ is tested against a

list of type speciﬁcations (the default list is stored in weave/ext_tools.py). The ﬁrst speciﬁcation in the list is used to

represent the variable.

1.16. Weave (scipy.weave) 145

SciPy Reference Guide, Release 0.13.0

Examples of xxx_specification are scattered throughout numerous “xxx_spec.py” ﬁles in the weave pack-

age. Closely related to the xxx_specification classes are yyy_info classes. These classes contain compiler,

header, and support code information necessary for including a certain set of capabilities (such as blitz++ or CXX

support) in a compiled module. xxx_specification classes have one or more yyy_info classes associated

with them. If you’d like to deﬁne your own set of type speciﬁcations, the current best route is to examine some of the

existing spec and info ﬁles. Maybe looking over sequence_spec.py and cxx_info.py are a good place to start. After

deﬁning speciﬁcation classes, you’ll need to pass them into inline using the type_factories argument. A

lot of times you may just want to change how a speciﬁc variable type is represented. Say you’d rather have Python

strings converted to std::string or maybe char*instead of using the CXX string object, but would like all other

type conversions to have default behavior. This requires that a new speciﬁcation class that handles strings is written

and then prepended to a list of the default type speciﬁcations. Since it is closer to the front of the list, it effectively

overrides the default string speciﬁcation. The following code demonstrates how this is done: ...

The Catalog

catalog.py has a class called catalog that helps keep track of previously compiled functions. This prevents

inline() and related functions from having to compile functions everytime they are called. Instead, catalog will

check an in memory cache to see if the function has already been loaded into python. If it hasn’t, then it starts searching

through persisent catalogs on disk to see if it ﬁnds an entry for the given function. By saving information about

compiled functions to disk, it isn’t necessary to re-compile functions everytime you stop and restart the interpreter.

Functions are compiled once and stored for future use.

When inline(cpp_code) is called the following things happen:

1. A fast local cache of functions is checked for the last function called for cpp_code. If an entry for cpp_code

doesn’t exist in the cache or the cached function call fails (perhaps because the function doesn’t have compatible

types) then the next step is to check the catalog.

2. The catalog class also keeps an in-memory cache with a list of all the functions compiled for cpp_code. If

cpp_code has ever been called, then this cache will be present (loaded from disk). If the cache isn’t present,

then it is loaded from disk.

If the cache is present, each function in the cache is called until one is found that was compiled for the correct

argument types. If none of the functions work, a new function is compiled with the given argument types. This

function is written to the on-disk catalog as well as into the in-memory cache.

3. When a lookup for cpp_code fails, the catalog looks through the on-disk function catalogs for the en-

tries. The PYTHONCOMPILED variable determines where to search for these catalogs and in what order.

If PYTHONCOMPILED is not present several platform dependent locations are searched. All functions found

for cpp_code in the path are loaded into the in-memory cache with functions found earlier in the search path

closer to the front of the call list.

If the function isn’t found in the on-disk catalog, then the function is compiled, written to the ﬁrst writable

directory in the PYTHONCOMPILED path, and also loaded into the in-memory cache.

Function Storage

Function caches are stored as dictionaries where the key is the entire C++ code string and the value is either a single

function (as in the “level 1” cache) or a list of functions (as in the main catalog cache). On disk catalogs are stored in

the same manor using standard Python shelves.

Early on, there was a question as to whether md5 checksums of the C++ code strings should be used instead of the

actual code strings. I think this is the route inline Perl took. Some (admittedly quick) tests of the md5 vs. the entire

string showed that using the entire string was at least a factor of 3 or 4 faster for Python. I think this is because it is

more time consuming to compute the md5 value than it is to do look-ups of long strings in the dictionary. Look at the

examples/md5_speed.py ﬁle for the test run.

146 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

Catalog search paths and the PYTHONCOMPILED variable

The default location for catalog ﬁles on Unix is is ~/.pythonXX_compiled where XX is version of Python being used.

If this directory doesn’t exist, it is created the ﬁrst time a catalog is used. The directory must be writable. If, for any

reason it isn’t, then the catalog attempts to create a directory based on your user id in the /tmp directory. The directory

permissions are set so that only you have access to the directory. If this fails, I think you’re out of luck. I don’t think

either of these should ever fail though. On Windows, a directory called pythonXX_compiled is created in the user’s

temporary directory.

The actual catalog ﬁle that lives in this directory is a Python shelf with a platform speciﬁc name such as

“nt21compiled_catalog” so that multiple OSes can share the same ﬁle systems without trampling on each other. Along

with the catalog ﬁle, the .cpp and .so or .pyd ﬁles created by inline will live in this directory. The catalog ﬁle simply

contains keys which are the C++ code strings with values that are lists of functions. The function lists point at func-

tions within these compiled modules. Each function in the lists executes the same C++ code string, but compiled for

different input variables.

You can use the PYTHONCOMPILED environment variable to specify alternative locations for compiled functions.

On Unix this is a colon (‘:’) separated list of directories. On windows, it is a (‘;’) separated list of directories. These

directories will be searched prior to the default directory for a compiled function catalog. Also, the ﬁrst writable

directory in the list is where all new compiled function catalogs, .cpp and .so or .pyd ﬁles are written. Relative

directory paths (‘.’ and ‘..’) should work ﬁne in the PYTHONCOMPILED variable as should environement variables.

There is a “special” path variable called MODULE that can be placed in the PYTHONCOMPILED variable. It

speciﬁes that the compiled catalog should reside in the same directory as the module that called it. This is useful if an

admin wants to build a lot of compiled functions during the build of a package and then install them in site-packages

along with the package. User’s who specify MODULE in their PYTHONCOMPILED variable will have access to

these compiled functions. Note, however, that if they call the function with a set of argument types that it hasn’t

previously been built for, the new function will be stored in their default directory (or some other writable directory in

the PYTHONCOMPILED path) because the user will not have write access to the site-packages directory.

An example of using the PYTHONCOMPILED path on bash follows:

PYTHONCOMPILED=MODULE:/some/path;export PYTHONCOMPILED;

If you are using python21 on linux, and the module bob.py in site-packages has a compiled function in it, then the

catalog search order when calling that function for the ﬁrst time in a python session would be:

/usr/lib/python21/site-packages/linuxpython_compiled

/some/path/linuxpython_compiled

~/.python21_compiled/linuxpython_compiled

The default location is always included in the search path.

Note: hmmm. see a possible problem here. I should probably make a sub- directory such as /usr/lib/python21/site-

packages/python21_compiled/linuxpython_compiled so that library ﬁles compiled with python21 are tried to link with

python22 ﬁles in some strange scenarios. Need to check this.

The in-module cache (in weave.inline_tools reduces the overhead of calling inline functions by about a factor

of 2. It can be reduced a little more for type loop calls where the same function is called over and over again if the

cache was a single value instead of a dictionary, but the beneﬁt is very small (less than 5%) and the utility is quite a bit

less. So, we’ll stick with a dictionary as the cache.

1.16.8 Blitz

Note: most of this section is lifted from old documentation. It should be pretty accurate, but there may be a few

discrepancies.

1.16. Weave (scipy.weave) 147

SciPy Reference Guide, Release 0.13.0

weave.blitz() compiles NumPy Python expressions for fast execution. For most applications, compiled expres-

sions should provide a factor of 2-10 speed-up over NumPy arrays. Using compiled expressions is meant to be as

unobtrusive as possible and works much like pythons exec statement. As an example, the following code fragment

takes a 5 point average of the 512x512 2d image, b, and stores it in array, a:

from scipy import *# or from NumPy import *

a=ones((512,512), Float64)

b=ones((512,512), Float64)

# ...do some stuff to fill in b...

# now average

a[1:-1,1:-1]=(b[1:-1,1:-1]+b[2:,1:-1]+b[:-2,1:-1] \

+b[1:-1,2:] +b[1:-1,:-2]) /5.

To compile the expression, convert the expression to a string by putting quotes around it and then use weave.blitz:

import weave

expr ="a[1:-1,1:-1] = (b[1:-1,1:-1] + b[2:,1:-1] + b[:-2,1:-1]" \

"+ b[1:-1,2:] + b[1:-1,:-2]) / 5."

weave.blitz(expr)

The ﬁrst time weave.blitz is run for a given expression and set of arguements, C++ code that accomplishes the

exact same task as the Python expression is generated and compiled to an extension module. This can take up to a

couple of minutes depending on the complexity of the function. Subsequent calls to the function are very fast. Futher,

the generated module is saved between program executions so that the compilation is only done once for a given

expression and associated set of array types. If the given expression is executed with a new set of array types, the

code most be compiled again. This does not overwrite the previously compiled function – both of them are saved and

available for exectution.

The following table compares the run times for standard NumPy code and compiled code for the 5 point averaging.

Method Run Time (seconds) Standard NumPy 0.46349 blitz (1st time compiling) 78.95526 blitz (subsequent calls)

0.05843 (factor of 8 speedup)

These numbers are for a 512x512 double precision image run on a 400 MHz Celeron processor under RedHat Linux

6.2.

Because of the slow compile times, its probably most effective to develop algorithms as you usually do using the

capabilities of scipy or the NumPy module. Once the algorithm is perfected, put quotes around it and execute it using

weave.blitz. This provides the standard rapid prototyping strengths of Python and results in algorithms that run

close to that of hand coded C or Fortran.

Requirements

Currently, the weave.blitz has only been tested under Linux with gcc-2.95-3 and on Windows with Mingw32

(2.95.2). Its compiler requirements are pretty heavy duty (see the blitz++ home page), so it won’t work with just any

compiler. Particularly MSVC++ isn’t up to snuff. A number of other compilers such as KAI++ will also work, but my

suspicions are that gcc will get the most use.

Limitations

1. Currently, weave.blitz handles all standard mathematical operators except for the ** power operator. The

built-in trigonmetric, log, ﬂoor/ceil, and fabs functions might work (but haven’t been tested). It also handles all

types of array indexing supported by the NumPy module. numarray’s NumPy compatible array indexing modes

are likewise supported, but numarray’s enhanced (array based) indexing modes are not supported.

148 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

weave.blitz does not currently support operations that use array broadcasting, nor have any of the special

purpose functions in NumPy such as take, compress, etc. been implemented. Note that there are no obvious

reasons why most of this functionality cannot be added to scipy.weave, so it will likely trickle into future

versions. Using slice() objects directly instead of start:stop:step is also not supported.

2. Currently Python only works on expressions that include assignment such as

>>> result =b+c+d

This means that the result array must exist before calling weave.blitz. Future versions will allow the

following:

>>> result =weave.blitz_eval("b+c+d")

3. weave.blitz works best when algorithms can be expressed in a “vectorized” form. Algorithms that have a

large number of if/thens and other conditions are better hand-written in C or Fortran. Further, the restrictions

imposed by requiring vectorized expressions sometimes preclude the use of more efﬁcient data structures or

algorithms. For maximum speed in these cases, hand-coded C or Fortran code is the only way to go.

4. weave.blitz can produce different results than NumPy in certain situations. It can happen when the array

receiving the results of a calculation is also used during the calculation. The NumPy behavior is to carry out the

entire calculation on the right hand side of an equation and store it in a temporary array. This temprorary array is

assigned to the array on the left hand side of the equation. blitz, on the other hand, does a “running” calculation

of the array elements assigning values from the right hand side to the elements on the left hand side immediately

after they are calculated. Here is an example, provided by Prabhu Ramachandran, where this happens:

# 4 point average.

>>> expr = "u[1:-1, 1:-1] = (u[0:-2, 1:-1] + u[2:, 1:-1] + \

... "u[1:-1,0:-2] + u[1:-1, 2:])*0.25"

>>> u = zeros((5, 5), ’d’); u[0,:] = 100

>>> exec (expr)

>>> u

array([[ 100., 100., 100., 100., 100.],

[ 0., 25., 25., 25., 0.],

[ 0., 0., 0., 0., 0.],

[ 0., 0., 0., 0., 0.]])

>>> u = zeros((5, 5), ’d’); u[0,:] = 100

>>> weave.blitz (expr)

>>> u

array([[ 100. , 100. , 100. , 100. , 100. ],

[ 0. , 25. , 31.25 , 32.8125 , 0. ],

[ 0. , 6.25 , 9.375 , 10.546875 , 0. ],

[ 0. , 1.5625 , 2.734375 , 3.3203125, 0. ],

[ 0. , 0. , 0. , 0. , 0. ]])

You can prevent this behavior by using a temporary array.

>>> u=zeros((5,5), ’d’); u[0,:] =100

>>> temp =zeros((4,4), ’d’);

>>> expr ="temp = (u[0:-2, 1:-1] + u[2:, 1:-1] + "\

... "u[1:-1,0:-2] + u[1:-1, 2:])*0.25;"\

... "u[1:-1,1:-1] = temp"

>>> weave.blitz (expr)

>>> u

array([[ 100., 100., 100., 100., 100.],

[ 0., 25., 25., 25., 0.],

[ 0., 0., 0., 0., 0.],

1.16. Weave (scipy.weave) 149

SciPy Reference Guide, Release 0.13.0

[ 0., 0., 0., 0., 0.],

[ 0., 0., 0., 0., 0.]])

5. One other point deserves mention lest people be confused. weave.blitz is not a general purpose Python->C

compiler. It only works for expressions that contain NumPy arrays and/or Python scalar values. This focused

scope concentrates effort on the compuationally intensive regions of the program and sidesteps the difﬁcult

issues associated with a general purpose Python->C compiler.

NumPy efﬁciency issues: What compilation buys you

Some might wonder why compiling NumPy expressions to C++ is beneﬁcial since operations on NumPy array op-

erations are already executed within C loops. The problem is that anything other than the simplest expression are

executed in less than optimal fashion. Consider the following NumPy expression:

a=1.2 *b+c*d

When NumPy calculates the value for the 2d array, a, it does the following steps:

temp1 =1.2 *b

temp2 =c*d

a=temp1 +temp2

Two things to note. Since cis an (perhaps large) array, a large temporary array must be created to store the results of

1.2 *b. The same is true for temp2. Allocation is slow. The second thing is that we have 3 loops executing, one

to calculate temp1, one for temp2 and one for adding them up. A C loop for the same problem might look like:

for(int i = 0; i < M; i++)

for(int j = 0; j < N; j++)

a[i,j] = 1.2 *b[i,j] + c[i,j] *d[i,j]

Here, the 3 loops have been fused into a single loop and there is no longer a need for a temporary array. This provides

a signiﬁcant speed improvement over the above example (write me and tell me what you get).

So, converting NumPy expressions into C/C++ loops that fuse the loops and eliminate temporary arrays can provide big

gains. The goal, then, is to convert NumPy expression to C/C++ loops, compile them in an extension module, and then

call the compiled extension function. The good news is that there is an obvious correspondence between the NumPy

expression above and the C loop. The bad news is that NumPy is generally much more powerful than this simple

example illustrates and handling all possible indexing possibilities results in loops that are less than straightforward to

write. (Take a peek at NumPy for conﬁrmation). Luckily, there are several available tools that simplify the process.

The Tools

weave.blitz relies heavily on several remarkable tools. On the Python side, the main facilitators are Jermey

Hylton’s parser module and Travis Oliphant’s NumPy module. On the compiled language side, Todd Veldhuizen’s

blitz++ array library, written in C++ (shhhh. don’t tell David Beazley), does the heavy lifting. Don’t assume that,

because it’s C++, it’s much slower than C or Fortran. Blitz++ uses a jaw dropping array of template techniques

(metaprogramming, template expression, etc) to convert innocent-looking and readable C++ expressions into to code

that usually executes within a few percentage points of Fortran code for the same problem. This is good. Unfortunately

all the template raz-ma-taz is very expensive to compile, so the 200 line extension modules often take 2 or more

minutes to compile. This isn’t so good. weave.blitz works to minimize this issue by remembering where compiled

modules live and reusing them instead of re-compiling every time a program is re-run.

Parser

Tearing NumPy expressions apart, examining the pieces, and then rebuilding them as C++ (blitz) expressions requires

a parser of some sort. I can imagine someone attacking this problem with regular expressions, but it’d likely be ugly

150 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

and fragile. Amazingly, Python solves this problem for us. It actually exposes its parsing engine to the world through

the parser module. The following fragment creates an Abstract Syntax Tree (AST) object for the expression and

then converts to a (rather unpleasant looking) deeply nested list representation of the tree.

>>> import parser

>>> import scipy.weave.misc

>>> ast =parser.suite("a = b *c + d")

>>> ast_list =ast.tolist()

>>> sym_list =scipy.weave.misc.translate_symbols(ast_list)

>>> pprint.pprint(sym_list)

[’file_input’,

[’stmt’,

[’simple_stmt’,

[’small_stmt’,

[’expr_stmt’,

[’testlist’,

[’test’,

[’and_test’,

[’not_test’,

[’comparison’,

[’expr’,

[’xor_expr’,

[’and_expr’,

[’shift_expr’,

[’arith_expr’,

[’term’,

[’factor’, [’power’, [’atom’, [’NAME’, ’a’]]]]]]]]]]]]]]],

[’EQUAL’, ’=’],

[’testlist’,

[’test’,

[’and_test’,

[’not_test’,

[’comparison’,

[’expr’,

[’xor_expr’,

[’and_expr’,

[’shift_expr’,

[’arith_expr’,

[’term’,

[’factor’, [’power’, [’atom’, [’NAME’, ’b’]]]],

[’STAR’, ’*’],

[’factor’, [’power’, [’atom’, [’NAME’, ’c’]]]]],

[’PLUS’, ’+’],

[’term’,

[’factor’, [’power’, [’atom’, [’NAME’, ’d’]]]]]]]]]]]]]]]]],

[’NEWLINE’, ’’]]],

[’ENDMARKER’, ’’]]

Despite its looks, with some tools developed by Jermey H., it’s possible to search these trees for speciﬁc patterns

(sub-trees), extract the sub-tree, manipulate them converting python speciﬁc code fragments to blitz code fragments,

and then re-insert it in the parse tree. The parser module documentation has some details on how to do this. Traversing

the new blitziﬁed tree, writing out the terminal symbols as you go, creates our new blitz++ expression string.

Blitz and NumPy

The other nice discovery in the project is that the data structure used for NumPy arrays and blitz arrays is nearly

identical. NumPy stores “strides” as byte offsets and blitz stores them as element offsets, but other than that, they are

the same. Further, most of the concept and capabilities of the two libraries are remarkably similar. It is satisfying that

two completely different implementations solved the problem with similar basic architectures. It is also fortuitous.

1.16. Weave (scipy.weave) 151

SciPy Reference Guide, Release 0.13.0

The work involved in converting NumPy expressions to blitz expressions was greatly diminished. As an example,

consider the code for slicing an array in Python with a stride:

>>> a=b[0:4:2]+c

>>> a

[0,2,4]

In Blitz it is as follows:

Array<2,int> b(10);

Array<2,int> c(3);

// ...

Array<2,int> a = b(Range(0,3,2)) + c;

Here the range object works exactly like Python slice objects with the exception that the top index (3) is inclusive

where as Python’s (4) is exclusive. Other differences include the type declarations in C++ and parentheses instead of

brackets for indexing arrays. Currently, weave.blitz handles the inclusive/exclusive issue by subtracting one from

upper indices during the translation. An alternative that is likely more robust/maintainable in the long run is to write a

PyRange class that behaves like Python’s range. This is likely very easy.

The stock blitz also doesn’t handle negative indices in ranges. The current implementation of the blitz() has a

partial solution to this problem. It calculates and index that starts with a ‘-‘ sign by subtracting it from the maximum

index in the array so that:

upper index limit

/-----\

b[:-1] -> b(Range(0,Nb[0]-1-1))

This approach fails, however, when the top index is calculated from other values. In the following scenario, if i+j

evaluates to a negative value, the compiled code will produce incorrect results and could even core-dump. Right now,

all calculated indices are assumed to be positive.

b[:i-j] -> b(Range(0,i+j))

A solution is to calculate all indices up front using if/then to handle the +/- cases. This is a little work and results in

more code, so it hasn’t been done. I’m holding out to see if blitz++ can be modiﬁed to handle negative indexing, but

haven’t looked into how much effort is involved yet. While it needs ﬁxin’, I don’t think there is a ton of code where

this is an issue.

The actual translation of the Python expressions to blitz expressions is currently a two part process. First, all x:y:z

slicing expression are removed from the AST, converted to slice(x,y,z) and re-inserted into the tree. Any math needed

on these expressions (subtracting from the maximum index, etc.) are also preformed here. _beg and _end are used as

special variables that are deﬁned as blitz::fromBegin and blitz::toEnd.

a[i+j:i+j+1,:] =b[2:3,:]

becomes a more verbose:

a[slice(i+j,i+j+1),slice(_beg,_end)] =b[slice(2,3),slice(_beg,_end)]

The second part does a simple string search/replace to convert to a blitz expression with the following translations:

slice(_beg,_end) -> _all # not strictly needed, but cuts down on code.

slice -> blitz::Range

[ -> (

] -> )

_stp -> 1

_all is deﬁned in the compiled function as blitz::Range.all(). These translations could of course happen

directly in the syntax tree. But the string replacement is slightly easier. Note that namespaces are maintained in the

152 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

C++ code to lessen the likelihood of name clashes. Currently no effort is made to detect name clashes. A good rule of

thumb is don’t use values that start with ‘_’ or ‘py_’ in compiled expressions and you’ll be ﬁne.

Type deﬁnitions and coersion

So far we’ve glossed over the dynamic vs. static typing issue between Python and C++. In Python, the type of value

that a variable holds can change through the course of program execution. C/C++, on the other hand, forces you to

declare the type of value a variables will hold prior at compile time. weave.blitz handles this issue by examining

the types of the variables in the expression being executed, and compiling a function for those explicit types. For

example:

a=ones((5,5),Float32)

b=ones((5,5),Float32)

weave.blitz("a=a+b")

When compiling this expression to C++, weave.blitz sees that the values for a and b in the local scope have type

Float32, or ‘ﬂoat’ on a 32 bit architecture. As a result, it compiles the function using the ﬂoat type (no attempt has

been made to deal with 64 bit issues).

What happens if you call a compiled function with array types that are different than the ones for which it was

originally compiled? No biggie, you’ll just have to wait on it to compile a new version for your new types. This

doesn’t overwrite the old functions, as they are still accessible. See the catalog section in the inline() documentation

to see how this is handled. Sufﬁce to say, the mechanism is transparent to the user and behaves like dynamic typing

with the occasional wait for compiling newly typed functions.

When working with combined scalar/array operations, the type of the array is always used. This is similar to the saves-

pace ﬂag that was recently added to NumPy. This prevents issues with the following expression perhaps unexpectedly

being calculated at a higher (more expensive) precision that can occur in Python:

>>> a=array((1,2,3),typecode =Float32)

>>> b=a*2.1 # results in b being a Float64 array.

In this example,

>>> a=ones((5,5),Float32)

>>> b=ones((5,5),Float32)

>>> weave.blitz("b = a *2.1")

the 2.1 is cast down to a float before carrying out the operation. If you really want to force the calculation to be a

double, deﬁne aand bas double arrays.

One other point of note. Currently, you must include both the right hand side and left hand side (assignment side)

of your equation in the compiled expression. Also, the array being assigned to must be created prior to calling

weave.blitz. I’m pretty sure this is easily changed so that a compiled_eval expression can be deﬁned, but no

effort has been made to allocate new arrays (and decern their type) on the ﬂy.

Cataloging Compiled Functions

See The Catalog section in the weave.inline() documentation.

Checking Array Sizes

Surprisingly, one of the big initial problems with compiled code was making sure all the arrays in an operation were

of compatible type. The following case is trivially easy:

1.16. Weave (scipy.weave) 153

SciPy Reference Guide, Release 0.13.0

a=b+c

It only requires that arrays a,b, and chave the same shape. However, expressions like:

a[i+j:i+j+1,:] =b[2:3,:] +c

are not so trivial. Since slicing is involved, the size of the slices, not the input arrays, must be checked. Broadcasting

complicates things further because arrays and slices with different dimensions and shapes may be compatible for math

operations (broadcasting isn’t yet supported by weave.blitz). Reductions have a similar effect as their results are

different shapes than their input operand. The binary operators in NumPy compare the shapes of their two operands just

before they operate on them. This is possible because NumPy treats each operation independently. The intermediate

(temporary) arrays created during sub-operations in an expression are tested for the correct shape before they are

combined by another operation. Because weave.blitz fuses all operations into a single loop, this isn’t possible.

The shape comparisons must be done and guaranteed compatible before evaluating the expression.

The solution chosen converts input arrays to “dummy arrays” that only represent the dimensions of the arrays, not the

data. Binary operations on dummy arrays check that input array sizes are comptible and return a dummy array with

the size correct size. Evaluating an expression of dummy arrays traces the changing array sizes through all operations

and fails if incompatible array sizes are ever found.

The machinery for this is housed in weave.size_check. It basically involves writing a new class (dummy array)

and overloading its math operators to calculate the new sizes correctly. All the code is in Python and there is a fair

amount of logic (mainly to handle indexing and slicing) so the operation does impose some overhead. For large arrays

(ie. 50x50x50), the overhead is negligible compared to evaluating the actual expression. For small arrays (ie. 16x16),

the overhead imposed for checking the shapes with this method can cause the weave.blitz to be slower than

evaluating the expression in Python.

What can be done to reduce the overhead? (1) The size checking code could be moved into C. This would likely

remove most of the overhead penalty compared to NumPy (although there is also some calling overhead), but no effort

has been made to do this. (2) You can also call weave.blitz with check_size=0 and the size checking isn’t

done. However, if the sizes aren’t compatible, it can cause a core-dump. So, foregoing size_checking isn’t advisable

until your code is well debugged.

Creating the Extension Module

weave.blitz uses the same machinery as weave.inline to build the extension module. The only difference is

the code included in the function is automatically generated from the NumPy array expression instead of supplied by

the user.

1.16.9 Extension Modules

weave.inline and weave.blitz are high level tools that generate extension modules automatically. Under

the covers, they use several classes from weave.ext_tools to help generate the extension module. The main two

classes are ext_module and ext_function (I’d like to add ext_class and ext_method also). These classes

simplify the process of generating extension modules by handling most of the “boiler plate” code automatically.

Note: inline actually sub-classes weave.ext_tools.ext_function to generate slightly different code

than the standard ext_function. The main difference is that the standard class converts function arguments to C

types, while inline always has two arguments, the local and global dicts, and the grabs the variables that need to be

convereted to C from these.

154 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

A Simple Example

The following simple example demonstrates how to build an extension module within a Python function:

# examples/increment_example.py

from weave import ext_tools

def build_increment_ext():

""" Build a simple extension with functions that increment numbers.

The extension will be built in the local directory.

"""

mod =ext_tools.ext_module(’increment_ext’)

a=1# effectively a type declaration for ’a’ in the

# following functions.

ext_code ="return_val = Py::new_reference_to(Py::Int(a+1));"

func =ext_tools.ext_function(’increment’,ext_code,[’a’])

mod.add_function(func)

ext_code ="return_val = Py::new_reference_to(Py::Int(a+2));"

func =ext_tools.ext_function(’increment_by_2’,ext_code,[’a’])

mod.add_function(func)

mod.compile()

The function build_increment_ext() creates an extension module named increment_ext and compiles

it to a shared library (.so or .pyd) that can be loaded into Python.. increment_ext contains two functions,

increment and increment_by_2. The ﬁrst line of build_increment_ext(),

mod = ext_tools.ext_module(‘increment_ext’)

creates an ext_module instance that is ready to have ext_function instances added to it. ext_function

instances are created much with a calling convention similar to weave.inline(). The most common call includes

a C/C++ code snippet and a list of the arguments for the function. The following:

ext_code ="return_val = Py::new_reference_to(Py::Int(a+1));"

func =ext_tools.ext_function(’increment’,ext_code,[’a’])

creates a C/C++ extension function that is equivalent to the following Python function:

def increment(a):

return a+1

A second method is also added to the module and then,

mod.compile()

is called to build the extension module. By default, the module is created in the current working directory. This exam-

ple is available in the examples/increment_example.py ﬁle found in the weave directory. At the bottom of

the ﬁle in the module’s “main” program, an attempt to import increment_ext without building it is made. If this

fails (the module doesn’t exist in the PYTHONPATH), the module is built by calling build_increment_ext().

This approach only takes the time-consuming (a few seconds for this example) process of building the module if it

hasn’t been built before.

if __name__ == "__main__":

try:

import increment_ext

except ImportError:

build_increment_ext()

1.16. Weave (scipy.weave) 155

SciPy Reference Guide, Release 0.13.0

import increment_ext

a=1

print ’a, a+1:’, a, increment_ext.increment(a)

print ’a, a+2:’, a, increment_ext.increment_by_2(a)

Note: If we were willing to always pay the penalty of building the C++ code for a module, we could store

the SHA-256 checksum of the C++ code along with some information about the compiler, platform, etc. Then,

ext_module.compile() could try importing the module before it actually compiles it, check the SHA-256 check-

sum and other meta-data in the imported module with the meta-data of the code it just produced and only compile the

code if the module didn’t exist or the meta-data didn’t match. This would reduce the above code to:

if __name__ == "__main__":

build_increment_ext()

a=1

print ’a, a+1:’, a, increment_ext.increment(a)

print ’a, a+2:’, a, increment_ext.increment_by_2(a)

Note: There would always be the overhead of building the C++ code, but it would only actually compile the code

once. You pay a little in overhead and get cleaner “import” code. Needs some thought.

If you run increment_example.py from the command line, you get the following:

[eric@n0]$ python increment_example.py

a, a+1: 1 2

a, a+2: 1 3

If the module didn’t exist before it was run, the module is created. If it did exist, it is just imported and used.

Fibonacci Example

examples/fibonacci.py provides a little more complex example of how to use ext_tools. Fibonacci num-

bers are a series of numbers where each number in the series is the sum of the previous two: 1, 1, 2, 3, 5, 8, etc. Here,

the ﬁrst two numbers in the series are taken to be 1. One approach to calculating Fibonacci numbers uses recursive

function calls. In Python, it might be written as:

def fib(a):

if a<= 2:

return 1

else:

return fib(a-2)+fib(a-1)

In C, the same function would look something like this:

int fib(int a)

{

if(a <= 2)

return 1;

else

return fib(a-2) + fib(a-1);

}

Recursion is much faster in C than in Python, so it would be beneﬁcial to use the C version for ﬁbonacci number

calculations instead of the Python version. We need an extension function that calls this C function to do this. This

156 Chapter 1. SciPy Tutorial

SciPy Reference Guide, Release 0.13.0

is possible by including the above code snippet as “support code” and then calling it from the extension function.

Support code snippets (usually structure deﬁnitions, helper functions and the like) are inserted into the extension

module C/C++ ﬁle before the extension function code. Here is how to build the C version of the ﬁbonacci number

generator:

def build_fibonacci():

""" Builds an extension module with fibonacci calculators.

"""

mod =ext_tools.ext_module(’fibonacci_ext’)

a=1# this is effectively a type declaration

# recursive fibonacci in C

fib_code ="""

int fib1(int a)

{

if(a <= 2)

return 1;

else

return fib1(a-2) + fib1(a-1);

}

"""

ext_code ="""

int val = fib1(a);

return_val = Py::new_reference_to(Py::Int(val));

"""

fib =ext_tools.ext_function(’fib’,ext_code,[’a’])

fib.customize.add_support_code(fib_code)

mod.add_function(fib)

mod.compile()

XXX More about custom_info, and what xxx_info instances are good for.

Note: recursion is not the fastest way to calculate ﬁbonacci numbers, but this approach serves nicely for this example.

1.16.10 Customizing Type Conversions – Type Factories

not written

1.16.11 Things I wish weave did

It is possible to get name clashes if you uses a variable name that is already deﬁned in a header automatically included

(such as stdio.h) For instance, if you try to pass in a variable named stdout, you’ll get a cryptic error report due

to the fact that stdio.h also deﬁnes the name. weave should probably try and handle this in some way. Other

things...

1.16. Weave (scipy.weave) 157

SciPy Reference Guide, Release 0.13.0

158 Chapter 1. SciPy Tutorial

CHAPTER

TWO

CONTRIBUTING TO SCIPY

This document aims to give an overview of how to contribute to SciPy. It tries to answer commonly asked questions,

and provide some insight into how the community process works in practice. Readers who are familiar with the SciPy

community and are experienced Python coders may want to jump straight to the git workﬂow documentation.

Note: You may want to check the latest version of this guide, which is available at:

https://github.com/scipy/scipy/blob/master/HACKING.rst.txt

2.1 Contributing new code

If you have been working with the scientiﬁc Python toolstack for a while, you probably have some code lying around

of which you think “this could be useful for others too”. Perhaps it’s a good idea then to contribute it to SciPy or

another open source project. The ﬁrst question to ask is then, where does this code belong? That question is hard

to answer here, so we start with a more speciﬁc one: what code is suitable for putting into SciPy? Almost all of

the new code added to scipy has in common that it’s potentially useful in multiple scientiﬁc domains and it ﬁts in

the scope of existing scipy submodules. In principle new submodules can be added too, but this is far less common.

For code that is speciﬁc to a single application, there may be an existing project that can use the code. Some scikits

(scikit-learn,scikits-image,statsmodels, etc.) are good examples here; they have a narrower focus and because of that

more domain-speciﬁc code than SciPy.

Now if you have code that you would like to see included in SciPy, how do you go about it? After checking that your

code can be distributed in SciPy under a compatible license (see FAQ for details), the ﬁrst step is to discuss on the

scipy-dev mailing list. All new features, as well as changes to existing code, are discussed and decided on there. You

can, and probably should, already start this discussion before your code is ﬁnished.

Assuming the outcome of the discussion on the mailing list is positive and you have a function or piece of code that

does what you need it to do, what next? Before code is added to SciPy, it at least has to have good documentation, unit

tests and correct code style.

1. Unit tests In principle you should aim to create unit tests that exercise all the code that you are adding.

This gives some degree of conﬁdence that your code runs correctly, also on Python versions and

hardware or OSes that you don’t have available yourself. An extensive description of how to

write unit tests is given in the NumPy testing guidelines.

2. Documentation

Clear and complete documentation is essential in order for users to be able to ﬁnd and under-

stand the code. Documentation for individual functions and classes – which includes at least a

basic description, type and meaning of all parameters and returns values, and usage examples in

doctest format – is put in docstrings. Those docstrings can be read within the interpreter, and

are compiled into a reference guide in html and pdf format. Higher-level documentation for key

159

SciPy Reference Guide, Release 0.13.0

(areas of) functionality is provided in tutorial format and/or in module docstrings. A guide on

how to write documentation is given in how to document.

3. Code style Uniformity of style in which code is written is important to others trying to understand the code.

SciPy follows the standard Python guidelines for code style, PEP8. In order to check that your

code conforms to PEP8, you can use the pep8 package style checker. Most IDEs and text editors

have settings that can help you follow PEP8, for example by translating tabs by four spaces.

Using pyﬂakes to check your code is also a good idea.

At the end of this document a checklist is given that may help to check if your code fulﬁlls all requirements for

inclusion in SciPy.

Another question you may have is: where exactly do I put my code? To answer this, it is useful to understand

how the SciPy public API (application programming interface) is deﬁned. For most modules the API is two levels

deep, which means your new function should appear as scipy.submodule.my_new_func.my_new_func

can be put in an existing or new ﬁle under /scipy/<submodule>/, its name is added to the __all__

list in that ﬁle (which lists all public functions in the ﬁle), and those public functions are then imported in

/scipy/<submodule>/__init__.py. Any private functions/classes should have a leading underscore (_) in

their name. A more detailed description of what the public API of SciPy is, is given in SciPy API.

Once you think your code is ready for inclusion in SciPy, you can send a pull request (PR) on Github. We won’t

go into the details of how to work with git here, this is described well in the git workﬂow section of the NumPy

documentation and on the Github help pages. When you send the PR for a new feature, be sure to also mention this on

the scipy-dev mailing list. This can prompt interested people to help review your PR. Assuming that you already got

positive feedback before on the general idea of your code/feature, the purpose of the code review is to ensure that the

code is correct, efﬁcient and meets the requirements outlined above. In many cases the code review happens relatively

quickly, but it’s possible that it stalls. If you have addressed all feedback already given, it’s perfectly ﬁne to ask on the

mailing list again for review (after a reasonable amount of time, say a couple of weeks, has passed). Once the review

is completed, the PR is merged into the “master” branch of SciPy.

The above describes the requirements and process for adding code to SciPy. It doesn’t yet answer the question though

how decisions are made exactly. The basic answer is: decisions are made by consensus, by everyone who chooses

to participate in the discussion on the mailing list. This includes developers, other users and yourself. Aiming for

consensus in the discussion is important – SciPy is a project by and for the scientiﬁc Python community. In those rare

cases that agreement cannot be reached, the maintainers of the module in question can decide the issue.

2.2 Contributing by helping maintain existing code

The previous section talked speciﬁcally about adding new functionality to SciPy. A large part of that discussion also

applies to maintenance of existing code. Maintenance means ﬁxing bugs, improving code quality or style, documenting

existing functionality better, adding missing unit tests, keeping build scripts up-to-date, etc. The SciPy issue list

contains all reported bugs, build/documentation issues, etc. Fixing issues helps improve the overall quality of SciPy,

and is also a good way of getting familiar with the project. You may also want to ﬁx a bug because you ran into it and

need the function in question to work correctly.

The discussion on code style and unit testing above applies equally to bug ﬁxes. It is usually best to start by writing a

unit test that shows the problem, i.e. it should pass but doesn’t. Once you have that, you can ﬁx the code so that the

test does pass. That should be enough to send a PR for this issue. Unlike when adding new code, discussing this on

the mailing list may not be necessary - if the old behavior of the code is clearly incorrect, no one will object to having

it ﬁxed. It may be necessary to add some warning or deprecation message for the changed behavior. This should be

part of the review process.

160 Chapter 2. Contributing to SciPy

SciPy Reference Guide, Release 0.13.0

2.3 Other ways to contribute

There are many ways to contribute other than contributing code. Participating in discussions on the scipy-user and

scipy-dev mailing lists is a contribution in itself. The scipy.org website contains a lot of information on the SciPy

community and can always use a new pair of hands. A redesign of this website is ongoing, see scipy.github.com. The

redesigned website is a static site based on Sphinx, the sources for it are also on Github at scipy.org-new.

The SciPy documentation is constantly being improved by many developers and users. You can contribute by sending

a PR on Github that improves the documentation, but there’s also a documentation wiki that is very convenient for

making edits to docstrings (and doesn’t require git knowledge). Anyone can register a username on that wiki, ask on

the scipy-dev mailing list for edit rights and make edits. The documentation there is updated every day with the latest

changes in the SciPy master branch, and wiki edits are regularly reviewed and merged into master. Another advantage

of the documentation wiki is that you can immediately see how the reStructuredText (reST) of docstrings and other

docs is rendered as html, so you can easily catch formatting errors.

Code that doesn’t belong in SciPy itself or in another package but helps users accomplish a certain task is valuable.

SciPy Central is the place to share this type of code (snippets, examples, plotting code, etc.).

2.4 Recommended development setup

Since Scipy contains parts written in C, C++, and Fortran that need to be compiled before use, make sure you have

the necessary compilers and Python development headers installed. Having compiled code also means that importing

Scipy from the development sources needs some additional steps, which are explained below.

First fork a copy of the main Scipy repository in Github onto your own account and then create your local repository

via:

$ git clone git@github.com:YOURUSERNAME/scipy.git scipy

$ cd scipy

$ git remote add upstream git://github.com/scipy/scipy.git

To build the development version of Scipy and run tests, spawn interactive shells with the Python import paths properly

set up etc., do one of:

$ python runtests.py -v

$ python runtests.py -v -s optimize

$ python runtests.py -v -t scipy/special/tests/test_basic.py:test_xlogy

$ python runtests.py --ipython

$ python runtests.py --python somescript.py

This builds Scipy ﬁrst, so the ﬁrst time it may take some time. If you specify -n, the tests are run against the version

of Scipy (if any) found on current PYTHONPATH.

You may want to set up an in-place build so that changes made to .py ﬁles have effect without rebuild. First, run:

$ python setup.py build_ext -i

Then you need to point your PYTHONPATH environment variable to this directory. Some IDEs (Spyder for example)

have utilities to manage PYTHONPATH. On Linux and OSX, you can run the command:

$ export PYTHONPATH=$PWD

and on Windows

$ set PYTHONPATH=/path/to/scipy

2.3. Other ways to contribute 161

SciPy Reference Guide, Release 0.13.0

Now editing a Python source ﬁle in SciPy allows you to immediately test and use your changes (in .py ﬁles), by

simply restarting the interpreter.

Another good approach is to install Scipy normally in an isolated test environment using virtualenv, and work from

there (see FAQ below).

2.5 SciPy structure

All SciPy modules should follow the following conventions. In the following, a SciPy module is deﬁned as a Python

package, say yyy, that is located in the scipy/ directory.

• Ideally, each SciPy module should be as self-contained as possible. That is, it should have minimal dependencies

on other packages or modules. Even dependencies on other SciPy modules should be kept to a minimum. A

dependency on NumPy is of course assumed.

• Directory yyy/ contains:

–A ﬁle setup.py that deﬁnes configuration(parent_package=”,top_path=None) func-

tion for numpy.distutils.

–A directory tests/ that contains ﬁles test_<name>.py corresponding to modules

yyy/<name>{.py,.so,/}.

–A directory benchmarks/ that contains ﬁles bench_<name>.py corresponding to modules

yyy/<name>{.py,.so,/}.

• Private modules should be preﬁxed with an underscore _, for instance yyy/_somemodule.py.

• User-visible functions should have good documentation following the Numpy documentation style, see how to

document

• The __init__.py of the module should contain the main reference documentation in its docstring. This is

connected to the Sphinx documentation under doc/ via Sphinx’s automodule directive.

The reference documentation should ﬁrst give a categorized list of the contents of the module using

autosummary:: directives, and after that explain points essential for understanding the use of the module.

Tutorial-style documentation with extensive examples should be separate, and put under

doc/source/tutorial/

See the existing Scipy submodules for guidance.

For further details on Numpy distutils, see:

https://github.com/numpy/numpy/blob/master/doc/DISTUTILS.rst.txt

2.6 Useful links, FAQ, checklist

2.6.1 Checklist before submitting a PR

• Are there unit tests with good code coverage?

• Do all public function have docstrings including examples?

• Is the code style correct (PEP8, pyﬂakes)

• Is the new functionality tagged with .. versionadded:: X.Y.Z (with X.Y.Z the version number of

the next release - can be found in setup.py)?

162 Chapter 2. Contributing to SciPy

SciPy Reference Guide, Release 0.13.0

• Is the new functionality mentioned in the release notes of the next release?

• Is the new functionality added to the reference guide?

• In case of larger additions, is there a tutorial or more extensive module-level description?

• In case compiled code is added, is it integrated correctly via setup.py (and preferably also Bento conﬁguration

ﬁles - bento.info and bscript)?

• If you are a ﬁrst-time contributor, did you add yourself to THANKS.txt? Please note that this is perfectly normal

and desirable - the aim is to give every single contributor credit, and if you don’t add yourself it’s simply extra

work for the reviewer (or worse, the reviewer may forget).

• Did you check that the code can be distributed under a BSD license?

2.6.2 Useful SciPy documents

• The how to document guidelines

• NumPy/SciPy testing guidelines

•SciPy API

• SciPy maintainers

• NumPy/SciPy git workﬂow

2.6.3 FAQ

I based my code on existing Matlab/R/... code I found online, is this OK?

It depends. SciPy is distributed under a BSD license, so if the code that you based your code on is also BSD licensed

or has a BSD-compatible license (MIT, Apache, ...) then it’s OK. Code which is GPL-licensed, has no clear license,

requires citation or is free for academic use only can’t be included in SciPy. Therefore if you copied existing code with

such a license or made a direct translation to Python of it, your code can’t be included. See also license compatibility.

Why is SciPy under the BSD license and not, say, the GPL?

Like Python, SciPy uses a “permissive” open source license, which allows proprietary re-use. While this allows

companies to use and modify the software without giving anything back, it is felt that the larger user base results in

more contributions overall, and companies often publish their modiﬁcations anyway, without being required to. See

John Hunter’s BSD pitch.

How do I set up a development version of SciPy in parallel to a released version that I use to do my job/research?

One simple way to achieve this is to install the released version in site-packages, by using a binary installer or pip for

example, and set up the development version in a virtualenv. First install virtualenv (optionally use virtualenvwrapper),

then create your virtualenv (named scipy-dev here) with:

$ virtualenv scipy-dev

Now, whenever you want to switch to the virtual environment, you can use the command source

scipy-dev/bin/activate, and deactivate to exit from the virtual environment and back to your previ-

ous shell. With scipy-dev activated, install ﬁrst Scipy’s dependencies:

$ pip install Numpy Nose Cython

After that, you can install a development version of Scipy, for example via:

$ python setup.py install

2.6. Useful links, FAQ, checklist 163

SciPy Reference Guide, Release 0.13.0

The installation goes to the virtual environment.

Can I use a programming language other than Python to speed up my code?

Yes. The languages used in SciPy are Python, Cython, C, C++ and Fortran. All of these have their pros and cons.

If Python really doesn’t offer enough performance, one of those languages can be used. Important concerns when

using compiled languages are maintainability and portability. For maintainability, Cython is clearly preferred over

C/C++/Fortran. Cython and C are more portable than C++/Fortran. A lot of the existing C and Fortran code in SciPy

is older, battle-tested code that was only wrapped in (but not speciﬁcally written for) Python/SciPy. Therefore the

basic advice is: use Cython. If there’s speciﬁc reasons why C/C++/Fortran should be preferred, please discuss those

reasons ﬁrst.

How do I debug code written in C/C++/Fortran inside Scipy?

The easiest way to do this is to ﬁrst write a Python script that invokes the C code whose execution you want to debug.

For instance mytest.py:

from scipy.special import hyp2f1

print(hyp2f1(5.0,1.0,-1.8,0.95))

Now, you can run:

gdb --args python runtests.py -g --python mytest.py

If you didn’t compile with debug symbols enabled before, remove the build directory ﬁrst. While in the debugger:

(gdb) break cephes_hyp2f1

(gdb) run

The execution will now stop at the corresponding C function and you can step through it as usual. Instead of

plain gdb you can of course use your favourite alternative debugger; run it on the python binary with arguments

runtests.py -g --python mytest.py.

164 Chapter 2. Contributing to SciPy

CHAPTER

THREE

API - IMPORTING FROM SCIPY

In Python the distinction between what is the public API of a library and what are private implementation details is

not always clear. Unlike in other languages like Java, it is possible in Python to access “private” function or objects.

Occasionally this may be convenient, but be aware that if you do so your code may break without warning in future

releases. Some widely understood rules for what is and isn’t public in Python are:

• Methods / functions / classes and module attributes whose names begin with a leading underscore are private.

• If a class name begins with a leading underscore none of its members are public, whether or not they begin with

a leading underscore.

• If a module name in a package begins with a leading underscore none of its members are public, whether or not

they begin with a leading underscore.

• If a module or package deﬁnes __all__ that authoritatively deﬁnes the public interface.

• If a module or package doesn’t deﬁne __all__ then all names that don’t start with a leading underscore are

public.

Note: Reading the above guidelines one could draw the conclusion that every private module or object starts with

an underscore. This is not the case; the presence of underscores do mark something as private, but the absence of

underscores do not mark something as public.

In Scipy there are modules whose names don’t start with an underscore, but that should be considered private. To

clarify which modules these are we deﬁne below what the public API is for Scipy, and give some recommendations

for how to import modules/functions/objects from Scipy.

3.1 Guidelines for importing functions from Scipy

The scipy namespace itself only contains functions imported from numpy. These functions still exist for backwards

compatibility, but should be imported from numpy directly.

Everything in the namespaces of scipy submodules is public. In general, it is recommended to import functions from

submodule namespaces. For example, the function curve_fit (deﬁned in scipy/optimize/minpack.py) should be

imported like this:

from scipy import optimize

result =optimize.curve_fit(...)

This form of importing submodules is preferred for all submodules except scipy.io (because io is also the name

of a module in the Python stdlib):

165

SciPy Reference Guide, Release 0.13.0

from scipy import interpolate

from scipy import integrate

import scipy.io as spio

In some cases, the public API is one level deeper. For example the scipy.sparse.linalg module is public, and

the functions it contains are not available in the scipy.sparse namespace. Sometimes it may result in more easily

understandable code if functions are imported from one level deeper. For example, in the following it is immediately

clear that lomax is a distribution if the second form is chosen:

# first form

from scipy import stats

stats.lomax(...)

# second form

from scipy.stats import distributions

distributions.lomax(...)

In that case the second form can be chosen, if it is documented in the next section that the submodule in question is

public.

3.2 API deﬁnition

Every submodule listed below is public. That means that these submodules are unlikely to be renamed or changed

in an incompatible way, and if that is necessary a deprecation warning will be raised for one Scipy release before the

change is made.

• scipy.cluster

–vq

–hierarchy

• scipy.constants

• scipy.fftpack

• scipy.integrate

• scipy.interpolate

• scipy.io

–arff

–harwell_boeing

–idl

–matlab

–netcdf

–wavﬁle

• scipy.linalg

–scipy.linalg.blas

–scipy.linalg.lapack

–scipy.linalg.interpolative

• scipy.misc

166 Chapter 3. API - importing from Scipy

SciPy Reference Guide, Release 0.13.0

• scipy.ndimage

• scipy.odr

• scipy.optimize

• scipy.signal

• scipy.sparse

–linalg

–csgraph

• scipy.spatial

–distance

• scipy.special

• scipy.stats

–distributions

–mstats

• scipy.weave

3.2. API deﬁnition 167

SciPy Reference Guide, Release 0.13.0

168 Chapter 3. API - importing from Scipy

CHAPTER

FOUR

RELEASE NOTES

4.1 SciPy 0.13.0 Release Notes

169

SciPy Reference Guide, Release 0.13.0

Contents

•SciPy 0.13.0 Release Notes

–New features

*scipy.integrate improvements

·N-dimensional numerical integration

·dopri*improvements

*scipy.linalg improvements

·Interpolative decompositions

·Polar decomposition

·BLAS level 3 functions

·Matrix functions

*scipy.optimize improvements

·Trust-region unconstrained minimization algorithms

*scipy.sparse improvements

·Boolean comparisons and sparse matrices

·CSR and CSC fancy indexing

*scipy.sparse.linalg improvements

*scipy.spatial improvements

*scipy.signal improvements

*scipy.special improvements

*scipy.io improvements

·Unformatted Fortran ﬁle reader

·scipy.io.wavfile enhancements

*scipy.interpolate improvements

·B-spline derivatives and antiderivatives

*scipy.stats improvements

–Deprecated features

*expm2 and expm3

*scipy.stats functions

–Backwards incompatible changes

*LIL matrix assignment

*Deprecated radon function removed

*Removed deprecated keywords xa and xb from stats.distributions

*Changes to MATLAB ﬁle readers / writers

–Other changes

–Authors

SciPy 0.13.0 is the culmination of 7 months of hard work. It contains many new features, numerous bug-ﬁxes,

improved test coverage and better documentation. There have been a number of deprecations and API changes in

this release, which are documented below. All users are encouraged to upgrade to this release, as there are a large

number of bug-ﬁxes and optimizations. Moreover, our development attention will now shift to bug-ﬁx releases on the

0.13.x branch, and on adding new features on the master branch.

This release requires Python 2.6, 2.7 or 3.1-3.3 and NumPy 1.5.1 or greater. Highlights of this release are:

• support for fancy indexing and boolean comparisons with sparse matrices

• interpolative decompositions and matrix functions in the linalg module

• two new trust-region solvers for unconstrained minimization

170 Chapter 4. Release Notes

SciPy Reference Guide, Release 0.13.0

4.1.1 New features

scipy.integrate improvements

N-dimensional numerical integration

A new function scipy.integrate.nquad, which provides N-dimensional integration functionality with a more

ﬂexible interface than dblquad and tplquad, has been added.

dopri*improvements

The intermediate results from the dopri family of ODE solvers can now be accessed by a solout callback function.

scipy.linalg improvements

Interpolative decompositions

Scipy now includes a new module scipy.linalg.interpolative containing routines for computing interpola-

tive matrix decompositions (ID). This feature is based on the ID software package by P.G. Martinsson, V. Rokhlin, Y.

Shkolnisky, and M. Tygert, previously adapted for Python in the PymatrixId package by K.L. Ho.

Polar decomposition

A new function scipy.linalg.polar, to compute the polar decomposition of a matrix, was added.

BLAS level 3 functions

The BLAS functions symm,syrk,syr2k,hemm,herk and her2k are now wrapped in scipy.linalg.

Matrix functions

Several matrix function algorithms have been implemented or updated following detailed descriptions in recent pa-

pers of Nick Higham and his co-authors. These include the matrix square root (sqrtm), the matrix logarithm

(logm), the matrix exponential (expm) and its Frechet derivative (expm_frechet), and fractional matrix powers

(fractional_matrix_power).

scipy.optimize improvements

Trust-region unconstrained minimization algorithms

The minimize function gained two trust-region solvers for unconstrained minimization: dogleg and trust-ncg.

scipy.sparse improvements

Boolean comparisons and sparse matrices

All sparse matrix types now support boolean data, and boolean operations. Two sparse matrices Aand Bcan be com-

pared in all the expected ways A < B,A >= B,A != B, producing similar results as dense Numpy arrays. Comparisons

with dense matrices and scalars are also supported.

CSR and CSC fancy indexing

Compressed sparse row and column sparse matrix types now support fancy indexing with boolean matrices, slices,

and lists. So where A is a (CSC or CSR) sparse matrix, you can do things like:

4.1. SciPy 0.13.0 Release Notes 171

SciPy Reference Guide, Release 0.13.0

>>> A[A >0.5]=1# since Boolean sparse matrices work

>>> A[:2, :3]=2

>>> A[[1,2], 2]=3

scipy.sparse.linalg improvements

The new function onenormest provides a lower bound of the 1-norm of a linear operator and has been implemented

according to Higham and Tisseur (2000). This function is not only useful for sparse matrices, but can also be used to

estimate the norm of products or powers of dense matrices without explictly building the intermediate matrix.

The multiplicative action of the matrix exponential of a linear operator (expm_multiply) has been implemented

following the description in Al-Mohy and Higham (2011).

Abstract linear operators (scipy.sparse.linalg.LinearOperator) can now be multiplied, added to each

other, and exponentiated, producing new linear operators. This enables easier construction of composite linear opera-

tions.

scipy.spatial improvements

The vertices of a ConvexHull can now be accessed via the vertices attribute, which gives proper orientation in 2-D.

scipy.signal improvements

The cosine window function scipy.signal.cosine was added.

scipy.special improvements

New functions scipy.special.xlogy and scipy.special.xlog1py were added. These functions can

simplify and speed up code that has to calculate x*log(y) and give 0 when x == 0.

scipy.io improvements

Unformatted Fortran ﬁle reader

The new class scipy.io.FortranFile facilitates reading unformatted sequential ﬁles written by Fortran code.

scipy.io.wavfile enhancements

scipy.io.wavfile.write now accepts a ﬁle buffer. Previously it only accepted a ﬁlename.

scipy.io.wavfile.read and scipy.io.wavfile.write can now handle ﬂoating point WAV ﬁles.

scipy.interpolate improvements

B-spline derivatives and antiderivatives

scipy.interpolate.splder and scipy.interpolate.splantider functions for comput-

ing B-splines that represent derivatives and antiderivatives of B-splines were added. These functions

are also available in the class-based FITPACK interface as UnivariateSpline.derivative and

UnivariateSpline.antiderivative.

172 Chapter 4. Release Notes

SciPy Reference Guide, Release 0.13.0

scipy.stats improvements

Distributions now allow using keyword parameters in addition to positional parameters in all methods.

The function scipy.stats.power_divergence has been added for the Cressie-Read power divergence statistic

and goodness of ﬁt test. Included in this family of statistics is the “G-test” (http://en.wikipedia.org/wiki/G-test).

scipy.stats.mood now accepts multidimensional input.

An option was added to scipy.stats.wilcoxon for continuity correction.

scipy.stats.chisquare now has an axis argument.

scipy.stats.mstats.chisquare now has axis and ddof arguments.

4.1.2 Deprecated features

expm2 and expm3

The matrix exponential functions scipy.linalg.expm2 and scipy.linalg.expm3 are deprecated. All users

should use the numerically more robust scipy.linalg.expm function instead.

scipy.stats functions

scipy.stats.oneway is deprecated; scipy.stats.f_oneway should be used instead.

scipy.stats.glm is deprecated. scipy.stats.ttest_ind is an equivalent function; more full-featured

general (and generalized) linear model implementations can be found in statsmodels.

scipy.stats.cmedian is deprecated; numpy.median should be used instead.

4.1.3 Backwards incompatible changes

LIL matrix assignment

Assigning values to LIL matrices with two index arrays now works similarly as assigning into ndarrays:

>>> x=lil_matrix((3,3))

>>> x[[0,1,2],[0,1,2]]=[0,1,2]

>>> x.todense()

matrix([[ 0., 0., 0.],

[ 0., 1., 0.],

[ 0., 0., 2.]])

rather than giving the result:

>>> x.todense()

matrix([[ 0., 1., 2.],

[ 0., 1., 2.],

[ 0., 1., 2.]])

Users relying on the previous behavior will need to revisit their code. The previous behavior is obtained by

‘‘x[numpy.ix_([0,1,2],[0,1,2])] = ...‘.

4.1. SciPy 0.13.0 Release Notes 173

SciPy Reference Guide, Release 0.13.0

Deprecated radon function removed

The misc.radon function, which was deprecated in scipy 0.11.0, has been removed. Users can ﬁnd a more full-

featured radon function in scikit-image.

Removed deprecated keywords xa and xb from stats.distributions

The keywords xa and xb, which were deprecated since 0.11.0, have been removed from the distributions in

scipy.stats.

Changes to MATLAB ﬁle readers / writers

The major change is that 1D arrays in numpy now become row vectors (shape 1, N) when saved to a MATLAB 5 format

ﬁle. Previously 1D arrays saved as column vectors (N, 1). This is to harmonize the behavior of writing MATLAB 4

and 5 formats, and adapt to the defaults of numpy and MATLAB - for example np.atleast_2d returns 1D arrays

as row vectors.

Trying to save arrays of greater than 2 dimensions in MATLAB 4 format now raises an error instead of silently

reshaping the array as 2D.

scipy.io.loadmat(’afile’) used to look for aﬁle on the Python system path (sys.path); now loadmat

only looks in the current directory for a relative path ﬁlename.

4.1.4 Other changes

Security ﬁx: scipy.weave previously used temporary directories in an insecure manner under certain circum-

stances.

Cython is now required to build unreleased versions of scipy. The C ﬁles generated from Cython sources are not

included in the git repo anymore. They are however still shipped in source releases.

The code base received a fairly large PEP8 cleanup. A tox pep8 command has been added; new code should pass

this test command.

Scipy cannot be compiled with gfortran 4.1 anymore (at least on RH5), likely due to that compiler version not sup-

porting entry constructs well.

4.1.5 Authors

This release contains work by the following people (contributed at least one patch to this release, names in alphabetical

order):

• Jorge Cañardo Alastuey +

• Tom Aldcroft +

• Max Bolingbroke +

• Joseph Jon Booker +

• François Boulogne

• Matthew Brett

• Christian Brodbeck +

• Per Brodtkorb +

174 Chapter 4. Release Notes

SciPy Reference Guide, Release 0.13.0

• Christian Brueffer +

• Lars Buitinck

• Evgeni Burovski +

• Tim Cera

• Lawrence Chan +

• David Cournapeau

• Drazen Lucanin +

• Alexander J. Dunlap +

• endolith

• André Gaul +

• Christoph Gohlke

• Ralf Gommers

• Alex Grifﬁng +

• Blake Grifﬁth +

• Charles Harris

• Bob Helmbold +

• Andreas Hilboll

• Kat Huang +

• Oleksandr (Sasha) Huziy +

• Gert-Ludwig Ingold +

• Thouis (Ray) Jones

• Juan Luis Cano Rodríguez +

• Robert Kern

• Andreas Kloeckner +

• Sytse Knypstra +

• Gustav Larsson +

• Denis Laxalde

• Christopher Lee

• Tim Leslie

• Wendy Liu +

• Clemens Novak +

• Takuya Oshima +

• Josef Perktold

• Illia Polosukhin +

• Przemek Porebski +

• Steve Richardson +

4.1. SciPy 0.13.0 Release Notes 175

SciPy Reference Guide, Release 0.13.0

• Branden Rolston +

• Skipper Seabold

• Fazlul Shahriar

• Leo Singer +

• Rohit Sivaprasad +

• Daniel B. Smith +

• Julian Taylor

• Louis Thibault +

• Tomas Tomecek +

• John Travers

• Richard Tsai +

• Jacob Vanderplas

• Patrick Varilly

• Pauli Virtanen

• Stefan van der Walt

• Warren Weckesser

• Pedro Werneck +

• Nils Werner +

• Michael Wimmer +

• Nathan Woods +

• Tony S. Yu +

A total of 65 people contributed to this release. People with a “+” by their names contributed a patch for the ﬁrst time.

4.2 SciPy 0.12.0 Release Notes

176 Chapter 4. Release Notes

SciPy Reference Guide, Release 0.13.0

Contents

•SciPy 0.12.0 Release Notes

–New features

*scipy.spatial improvements

·cKDTree feature-complete

·Voronoi diagrams and convex hulls

·Delaunay improvements

*Spectral estimators (scipy.signal)

*scipy.optimize improvements

·Callback functions in L-BFGS-B and TNC

·Basin hopping global optimization (scipy.optimize.basinhopping)

*scipy.special improvements

·Revised complex error functions

·Faster orthogonal polynomials

*scipy.sparse.linalg features

*Listing Matlab(R) ﬁle contents in scipy.io

*Documented BLAS and LAPACK low-level interfaces (scipy.linalg)

*Polynomial interpolation improvements (scipy.interpolate)

–Deprecated features

*scipy.lib.lapack

*fblas and cblas

–Backwards incompatible changes

*Removal of scipy.io.save_as_module

–Other changes

–Authors

SciPy 0.12.0 is the culmination of 7 months of hard work. It contains many new features, numerous bug-ﬁxes,

improved test coverage and better documentation. There have been a number of deprecations and API changes in

this release, which are documented below. All users are encouraged to upgrade to this release, as there are a large

number of bug-ﬁxes and optimizations. Moreover, our development attention will now shift to bug-ﬁx releases on the

0.12.x branch, and on adding new features on the master branch.

Some of the highlights of this release are:

• Completed QHull wrappers in scipy.spatial.

• cKDTree now a drop-in replacement for KDTree.

• A new global optimizer, basinhopping.

• Support for Python 2 and Python 3 from the same code base (no more 2to3).

This release requires Python 2.6, 2.7 or 3.1-3.3 and NumPy 1.5.1 or greater. Support for Python 2.4 and 2.5 has been

dropped as of this release.

4.2.1 New features

scipy.spatial improvements

cKDTree feature-complete

Cython version of KDTree, cKDTree, is now feature-complete. Most operations (construction, query,

query_ball_point, query_pairs, count_neighbors and sparse_distance_matrix) are between 200 and 1000 times faster

in cKDTree than in KDTree. With very minor caveats, cKDTree has exactly the same interface as KDTree, and can be

used as a drop-in replacement.

4.2. SciPy 0.12.0 Release Notes 177

SciPy Reference Guide, Release 0.13.0

Voronoi diagrams and convex hulls

scipy.spatial now contains functionality for computing Voronoi diagrams and convex hulls using the Qhull

library. (Delaunay triangulation was available since Scipy 0.9.0.)

Delaunay improvements

It’s now possible to pass in custom Qhull options in Delaunay triangulation. Coplanar points are now also recorded, if

present. Incremental construction of Delaunay triangulations is now also possible.

Spectral estimators (scipy.signal)

The functions scipy.signal.periodogram and scipy.signal.welch were added, providing DFT-based

spectral estimators.

scipy.optimize improvements

Callback functions in L-BFGS-B and TNC

A callback mechanism was added to L-BFGS-B and TNC minimization solvers.

Basin hopping global optimization (scipy.optimize.basinhopping)

A new global optimization algorithm. Basinhopping is designed to efﬁciently ﬁnd the global minimum of a smooth

function.

scipy.special improvements

Revised complex error functions

The computation of special functions related to the error function now uses a new Faddeeva library from MIT which

increases their numerical precision. The scaled and imaginary error functions erfcx and erfi were also added, and

the Dawson integral dawsn can now be evaluated for a complex argument.

Faster orthogonal polynomials

Evaluation of orthogonal polynomials (the eval_*routines) in now faster in scipy.special, and their out=

argument functions properly.

scipy.sparse.linalg features

• In scipy.sparse.linalg.spsolve, the bargument can now be either a vector or a matrix.

•scipy.sparse.linalg.inv was added. This uses spsolve to compute a sparse matrix inverse.

•scipy.sparse.linalg.expm was added. This computes the exponential of a sparse matrix using a similar

algorithm to the existing dense array implementation in scipy.linalg.expm.

Listing Matlab(R) ﬁle contents in scipy.io

A new function whosmat is available in scipy.io for inspecting contents of MAT ﬁles without reading them to

memory.

178 Chapter 4. Release Notes

SciPy Reference Guide, Release 0.13.0

Documented BLAS and LAPACK low-level interfaces (scipy.linalg)

The modules scipy.linalg.blas and scipy.linalg.lapack can be used to access low-level BLAS and

LAPACK functions.

Polynomial interpolation improvements (scipy.interpolate)

The barycentric, Krogh, piecewise and pchip polynomial interpolators in scipy.interpolate accept now an

axis argument.

4.2.2 Deprecated features

scipy.lib.lapack

The module scipy.lib.lapack is deprecated. You can use scipy.linalg.lapack instead. The module

scipy.lib.blas was deprecated earlier in Scipy 0.10.0.

fblas and cblas

Accessing the modules scipy.linalg.fblas,cblas,ﬂapack,clapack is deprecated. Instead, use the modules

scipy.linalg.lapack and scipy.linalg.blas.

4.2.3 Backwards incompatible changes

Removal of scipy.io.save_as_module

The function scipy.io.save_as_module was deprecated in Scipy 0.11.0, and is now removed.

Its private support modules scipy.io.dumbdbm_patched and scipy.io.dumb_shelve are also removed.

4.2.4 Other changes

4.2.5 Authors

• Anton Akhmerov +

• Alexander Eberspächer +

• Anne Archibald

• Jisk Attema +

• K.-Michael Aye +

• bemasc +

• Sebastian Berg +

• François Boulogne +

• Matthew Brett

• Lars Buitinck

• Steven Byrnes +

4.2. SciPy 0.12.0 Release Notes 179

SciPy Reference Guide, Release 0.13.0

• Tim Cera +

• Christian +

• Keith Clawson +

• David Cournapeau

• Nathan Crock +

• endolith

• Bradley M. Froehle +

• Matthew R Goodman

• Christoph Gohlke

• Ralf Gommers

• Robert David Grant +

• Yaroslav Halchenko

• Charles Harris

• Jonathan Helmus

• Andreas Hilboll

• Hugo +

• Oleksandr Huziy

• Jeroen Demeyer +

• Johannes Schönberger +

• Steven G. Johnson +

• Chris Jordan-Squire

• Jonathan Taylor +

• Niklas Kroeger +

• Jerome Kieffer +

• kingson +

• Josh Lawrence

• Denis Laxalde

• Alex Leach +

• Tim Leslie

• Richard Lindsley +

• Lorenzo Luengo +

• Stephen McQuay +

• MinRK

• Sturla Molden +

• Eric Moore +

• mszep +

180 Chapter 4. Release Notes

SciPy Reference Guide, Release 0.13.0

• Matt Newville +

• Vlad Niculae

• Travis Oliphant

• David Parker +

• Fabian Pedregosa

• Josef Perktold

• Zach Ploskey +

• Alex Reinhart +

• Gilles Rochefort +

• Ciro Duran Santillli +

• Jan Schlueter +

• Jonathan Scholz +

• Anthony Scopatz

• Skipper Seabold

• Fabrice Silva +

• Scott Sinclair

• Jacob Stevenson +

• Sturla Molden +

• Julian Taylor +

• thorstenkranz +

• John Travers +

• True Price +

• Nicky van Foreest

• Jacob Vanderplas

• Patrick Varilly

• Daniel Velkov +

• Pauli Virtanen

• Stefan van der Walt

• Warren Weckesser

A total of 75 people contributed to this release. People with a “+” by their names contributed a patch for the ﬁrst time.

4.3 SciPy 0.11.0 Release Notes

4.3. SciPy 0.11.0 Release Notes 181

SciPy Reference Guide, Release 0.13.0

Contents

•SciPy 0.11.0 Release Notes

–New features

*Sparse Graph Submodule

*scipy.optimize improvements

·Uniﬁed interfaces to minimizers

·Uniﬁed interface to root ﬁnding algorithms

*scipy.linalg improvements

·New matrix equation solvers

·QZ and QR Decomposition

·Pascal matrices

*Sparse matrix construction and operations

*LSMR iterative solver

*Discrete Sine Transform

*scipy.interpolate improvements

*Binned statistics (scipy.stats)

–Deprecated features

–Backwards incompatible changes

*Removal of scipy.maxentropy

*Minor change in behavior of splev

*Behavior of scipy.integrate.complex_ode

*Minor change in behavior of T-tests

–Other changes

–Authors

SciPy 0.11.0 is the culmination of 8 months of hard work. It contains many new features, numerous bug-ﬁxes,

improved test coverage and better documentation. Highlights of this release are:

• A new module has been added which provides a number of common sparse graph algorithms.

• New uniﬁed interfaces to the existing optimization and root ﬁnding functions have been added.

All users are encouraged to upgrade to this release, as there are a large number of bug-ﬁxes and optimizations. Our

development attention will now shift to bug-ﬁx releases on the 0.11.x branch, and on adding new features on the master

branch.

This release requires Python 2.4-2.7 or 3.1-3.2 and NumPy 1.5.1 or greater.

4.3.1 New features

Sparse Graph Submodule

The new submodule scipy.sparse.csgraph implements a number of efﬁcient graph algorithms for graphs

stored as sparse adjacency matrices. Available routines are:

•connected_components - determine connected components of a graph

•laplacian - compute the laplacian of a graph

•shortest_path - compute the shortest path between points on a positive graph

•dijkstra - use Dijkstra’s algorithm for shortest path

•floyd_warshall - use the Floyd-Warshall algorithm for shortest path

•breadth_first_order - compute a breadth-ﬁrst order of nodes

182 Chapter 4. Release Notes

SciPy Reference Guide, Release 0.13.0

•depth_first_order - compute a depth-ﬁrst order of nodes

•breadth_first_tree - construct the breadth-ﬁrst tree from a given node

•depth_first_tree - construct a depth-ﬁrst tree from a given node

•minimum_spanning_tree - construct the minimum spanning tree of a graph

scipy.optimize improvements

The optimize module has received a lot of attention this release. In addition to added tests, documentation improve-

ments, bug ﬁxes and code clean-up, the following improvements were made:

• A uniﬁed interface to minimizers of univariate and multivariate functions has been added.

• A uniﬁed interface to root ﬁnding algorithms for multivariate functions has been added.

• The L-BFGS-B algorithm has been updated to version 3.0.

Uniﬁed interfaces to minimizers

Two new functions scipy.optimize.minimize and scipy.optimize.minimize_scalar were added

to provide a common interface to minimizers of multivariate and univariate functions respectively. For multivari-

ate functions, scipy.optimize.minimize provides an interface to methods for unconstrained optimization

(fmin,fmin_powell,fmin_cg,fmin_ncg,fmin_bfgs and anneal) or constrained optimization (fmin_l_bfgs_b,fmin_tnc,

fmin_cobyla and fmin_slsqp). For univariate functions, scipy.optimize.minimize_scalar provides an in-

terface to methods for unconstrained and bounded optimization (brent,golden,fminbound). This allows for easier

comparing and switching between solvers.

Uniﬁed interface to root ﬁnding algorithms

The new function scipy.optimize.root provides a common interface to root ﬁnding algorithms for multivariate

functions, embeding fsolve,leastsq and nonlin solvers.

scipy.linalg improvements

New matrix equation solvers

Solvers for the Sylvester equation (scipy.linalg.solve_sylvester, discrete and continuous Lya-

punov equations (scipy.linalg.solve_lyapunov,scipy.linalg.solve_discrete_lyapunov)

and discrete and continuous algebraic Riccati equations (scipy.linalg.solve_continuous_are,

scipy.linalg.solve_discrete_are) have been added to scipy.linalg. These solvers are often used in

the ﬁeld of linear control theory.

QZ and QR Decomposition

It is now possible to calculate the QZ, or Generalized Schur, decomposition using scipy.linalg.qz. This func-

tion wraps the LAPACK routines sgges, dgges, cgges, and zgges.

The function scipy.linalg.qr_multiply, which allows efﬁcient computation of the matrix product of Q (from

a QR decompostion) and a vector, has been added.

Pascal matrices

A function for creating Pascal matrices, scipy.linalg.pascal, was added.

4.3. SciPy 0.11.0 Release Notes 183

SciPy Reference Guide, Release 0.13.0

Sparse matrix construction and operations

Two new functions, scipy.sparse.diags and scipy.sparse.block_diag, were added to easily construct

diagonal and block-diagonal sparse matrices respectively.

scipy.sparse.csc_matrix and csr_matrix now support the operations sin,tan,arcsin,arctan,

sinh,tanh,arcsinh,arctanh,rint,sign,expm1,log1p,deg2rad,rad2deg,floor,ceil and

trunc. Previously, these operations had to be performed by operating on the matrices’ data attribute.

LSMR iterative solver

LSMR, an iterative method for solving (sparse) linear and linear least-squares systems, was added as

scipy.sparse.linalg.lsmr.

Discrete Sine Transform

Bindings for the discrete sine transform functions have been added to scipy.fftpack.

scipy.interpolate improvements

For interpolation in spherical coordinates, the three classes scipy.interpolate.SmoothSphereBivariateSpline,

scipy.interpolate.LSQSphereBivariateSpline, and scipy.interpolate.RectSphereBivariateSpline

have been added.

Binned statistics (scipy.stats)

The stats module has gained functions to do binned statistics, which are a generalization of histograms, in 1-D, 2-D

and multiple dimensions: scipy.stats.binned_statistic,scipy.stats.binned_statistic_2d

and scipy.stats.binned_statistic_dd.

4.3.2 Deprecated features

scipy.sparse.cs_graph_components has been made a part of the sparse graph submodule, and renamed to

scipy.sparse.csgraph.connected_components. Calling the former routine will result in a deprecation

warning.

scipy.misc.radon has been deprecated. A more full-featured radon transform can be found in scikits-image.

scipy.io.save_as_module has been deprecated. A better way to save multiple Numpy arrays is the

numpy.savez function.

The xa and xb parameters for all distributions in scipy.stats.distributions already weren’t used; they have

now been deprecated.

4.3.3 Backwards incompatible changes

Removal of scipy.maxentropy

The scipy.maxentropy module, which was deprecated in the 0.10.0 release, has been removed. Logistic regres-

sion in scikits.learn is a good and modern alternative for this functionality.

184 Chapter 4. Release Notes

SciPy Reference Guide, Release 0.13.0

Minor change in behavior of splev

The spline evaluation function now behaves similarly to interp1d for size-1 arrays. Previous behavior:

>>> from scipy.interpolate import splev, splrep, interp1d

>>> x=[1,2,3,4,5]

>>> y=[4,5,6,7,8]

>>> tck =splrep(x, y)

>>> splev([1], tck)

>>> splev(1, tck)

Corrected behavior:

>>> splev([1], tck)

array([ 4.])

>>> splev(1, tck)

array(4.)

This affects also the UnivariateSpline classes.

Behavior of scipy.integrate.complex_ode

The behavior of the yattribute of complex_ode is changed. Previously, it expressed the complex-valued solution

in the form:

z=ode.y[::2]+1j *ode.y[1::2]

Now, it is directly the complex-valued solution:

z=ode.y

Minor change in behavior of T-tests

The T-tests scipy.stats.ttest_ind,scipy.stats.ttest_rel and scipy.stats.ttest_1samp

have been changed so that 0 / 0 now returns NaN instead of 1.

4.3.4 Other changes

The SuperLU sources in scipy.sparse.linalg have been updated to version 4.3 from upstream.

The function scipy.signal.bode, which calculates magnitude and phase data for a continuous-time system, has

been added.

The two-sample T-test scipy.stats.ttest_ind gained an option to compare samples with unequal variances,

i.e. Welch’s T-test.

scipy.misc.logsumexp now takes an optional axis keyword argument.

4.3.5 Authors

This release contains work by the following people (contributed at least one patch to this release, names in alphabetical

order):

• Jeff Armstrong

4.3. SciPy 0.11.0 Release Notes 185

SciPy Reference Guide, Release 0.13.0

• Chad Baker

• Brandon Beacher +

• behrisch +

• borishim +

• Matthew Brett

• Lars Buitinck

• Luis Pedro Coelho +

• Johann Cohen-Tanugi

• David Cournapeau

• dougal +

• Ali Ebrahim +

• endolith +

• Bjørn Forsman +

• Robert Gantner +

• Sebastian Gassner +

• Christoph Gohlke

• Ralf Gommers

• Yaroslav Halchenko

• Charles Harris

• Jonathan Helmus +

• Andreas Hilboll +

• Marc Honnorat +

• Jonathan Hunt +

• Maxim Ivanov +

• Thouis (Ray) Jones

• Christopher Kuster +

• Josh Lawrence +

• Denis Laxalde +

• Travis Oliphant

• Joonas Paalasmaa +

• Fabian Pedregosa

• Josef Perktold

• Gavin Price +

• Jim Radford +

• Andrew Schein +

• Skipper Seabold

186 Chapter 4. Release Notes

SciPy Reference Guide, Release 0.13.0

• Jacob Silterra +

• Scott Sinclair

• Alexis Tabary +

• Martin Teichmann

• Matt Terry +

• Nicky van Foreest +

• Jacob Vanderplas

• Patrick Varilly +

• Pauli Virtanen

• Nils Wagner +

• Darryl Wally +

• Stefan van der Walt

• Liming Wang +

• David Warde-Farley +

• Warren Weckesser

• Sebastian Werk +

• Mike Wimmer +

• Tony S Yu +

A total of 55 people contributed to this release. People with a “+” by their names contributed a patch for the ﬁrst time.

4.4 SciPy 0.10.0 Release Notes

Contents

•SciPy 0.10.0 Release Notes

–New features

*Bento: new optional build system

*Generalized and shift-invert eigenvalue problems in scipy.sparse.linalg

*Discrete-Time Linear Systems (scipy.signal)

*Enhancements to scipy.signal

*Additional decomposition options (scipy.linalg)

*Additional special matrices (scipy.linalg)

*Enhancements to scipy.stats

*Basic support for Harwell-Boeing ﬁle format for sparse matrices

–Deprecated features

*scipy.maxentropy

*scipy.lib.blas

*Numscons build system

–Backwards-incompatible changes

–Other changes

–Authors

4.4. SciPy 0.10.0 Release Notes 187

SciPy Reference Guide, Release 0.13.0

SciPy 0.10.0 is the culmination of 8 months of hard work. It contains many new features, numerous bug-ﬁxes,

improved test coverage and better documentation. There have been a limited number of deprecations and backwards-

incompatible changes in this release, which are documented below. All users are encouraged to upgrade to this release,

as there are a large number of bug-ﬁxes and optimizations. Moreover, our development attention will now shift to bug-

ﬁx releases on the 0.10.x branch, and on adding new features on the development master branch.

Release highlights:

• Support for Bento as optional build system.

• Support for generalized eigenvalue problems, and all shift-invert modes available in ARPACK.

This release requires Python 2.4-2.7 or 3.1- and NumPy 1.5 or greater.

4.4.1 New features

Bento: new optional build system

Scipy can now be built with Bento. Bento has some nice features like parallel builds and partial rebuilds, that are not

possible with the default build system (distutils). For usage instructions see BENTO_BUILD.txt in the scipy top-level

directory.

Currently Scipy has three build systems, distutils, numscons and bento. Numscons is deprecated and is planned and

will likely be removed in the next release.

Generalized and shift-invert eigenvalue problems in scipy.sparse.linalg

The sparse eigenvalue problem solver functions scipy.sparse.eigs/eigh now support generalized eigenvalue

problems, and all shift-invert modes available in ARPACK.

Discrete-Time Linear Systems (scipy.signal)

Support for simulating discrete-time linear systems, including scipy.signal.dlsim,

scipy.signal.dimpulse, and scipy.signal.dstep, has been added to SciPy. Conver-

sion of linear systems from continuous-time to discrete-time representations is also present via the

scipy.signal.cont2discrete function.

Enhancements to scipy.signal

A Lomb-Scargle periodogram can now be computed with the new function scipy.signal.lombscargle.

The forward-backward ﬁlter function scipy.signal.filtfilt can now ﬁlter the data in a given axis of an n-

dimensional numpy array. (Previously it only handled a 1-dimensional array.) Options have been added to allow more

control over how the data is extended before ﬁltering.

FIR ﬁlter design with scipy.signal.firwin2 now has options to create ﬁlters of type III (zero at zero and

Nyquist frequencies) and IV (zero at zero frequency).

Additional decomposition options (scipy.linalg)

A sort keyword has been added to the Schur decomposition routine (scipy.linalg.schur) to allow the sorting

of eigenvalues in the resultant Schur form.

188 Chapter 4. Release Notes

SciPy Reference Guide, Release 0.13.0

Additional special matrices (scipy.linalg)

The functions hilbert and invhilbert were added to scipy.linalg.

Enhancements to scipy.stats

• The one-sided form of Fisher’s exact test is now also implemented in stats.fisher_exact.

• The function stats.chi2_contingency for computing the chi-square test of independence of factors in a

contingency table has been added, along with the related utility functions stats.contingency.margins

and stats.contingency.expected_freq.

Basic support for Harwell-Boeing ﬁle format for sparse matrices

Both read and write are support through a simple function-based API, as well as a more complete API to control

number format. The functions may be found in scipy.sparse.io.

The following features are supported:

• Read and write sparse matrices in the CSC format

• Only real, symmetric, assembled matrix are supported (RUA format)

4.4.2 Deprecated features

scipy.maxentropy

The maxentropy module is unmaintained, rarely used and has not been functioning well for several releases. There-

fore it has been deprecated for this release, and will be removed for scipy 0.11. Logistic regression in scikits.learn

is a good alternative for this functionality. The scipy.maxentropy.logsumexp function has been moved to

scipy.misc.

scipy.lib.blas

There are similar BLAS wrappers in scipy.linalg and scipy.lib. These have now been consolidated as

scipy.linalg.blas, and scipy.lib.blas is deprecated.

Numscons build system

The numscons build system is being replaced by Bento, and will be removed in one of the next scipy releases.

4.4.3 Backwards-incompatible changes

The deprecated name invnorm was removed from scipy.stats.distributions, this distribution is available

as invgauss.

The following deprecated nonlinear solvers from scipy.optimize have been removed:

4.4. SciPy 0.10.0 Release Notes 189

SciPy Reference Guide, Release 0.13.0

- ‘‘broyden_modified‘‘ (bad performance)

- ‘‘broyden1_modified‘‘ (bad performance)

- ‘‘broyden_generalized‘‘ (equivalent to ‘‘anderson‘‘)

- ‘‘anderson2‘‘ (equivalent to ‘‘anderson‘‘)

- ‘‘broyden3‘‘ (obsoleted by new limited-memory broyden methods)

- ‘‘vackar‘‘ (renamed to ‘‘diagbroyden‘‘)

4.4.4 Other changes

scipy.constants has been updated with the CODATA 2010 constants.

__all__ dicts have been added to all modules, which has cleaned up the namespaces (particularly useful for inter-

active work).

An API section has been added to the documentation, giving recommended import guidelines and specifying which

submodules are public and which aren’t.

4.4.5 Authors

This release contains work by the following people (contributed at least one patch to this release, names in alphabetical

order):

• Jeff Armstrong +

• Matthew Brett

• Lars Buitinck +

• David Cournapeau

• FI$H 2000 +

• Michael McNeil Forbes +

• Matty G +

• Christoph Gohlke

• Ralf Gommers

• Yaroslav Halchenko

• Charles Harris

• Thouis (Ray) Jones +

• Chris Jordan-Squire +

• Robert Kern

• Chris Lasher +

• Wes McKinney +

• Travis Oliphant

• Fabian Pedregosa

• Josef Perktold

• Thomas Robitaille +

• Pim Schellart +

190 Chapter 4. Release Notes

SciPy Reference Guide, Release 0.13.0

• Anthony Scopatz +

• Skipper Seabold +

• Fazlul Shahriar +

• David Simcha +

• Scott Sinclair +

• Andrey Smirnov +

• Collin RM Stocks +

• Martin Teichmann +

• Jake Vanderplas +

• Gaël Varoquaux +

• Pauli Virtanen

• Stefan van der Walt

• Warren Weckesser

• Mark Wiebe +

A total of 35 people contributed to this release. People with a “+” by their names contributed a patch for the ﬁrst time.

4.5 SciPy 0.9.0 Release Notes

Contents

•SciPy 0.9.0 Release Notes

–Python 3

–Scipy source code location to be changed

–New features

*Delaunay tesselations (scipy.spatial)

*N-dimensional interpolation (scipy.interpolate)

*Nonlinear equation solvers (scipy.optimize)

*New linear algebra routines (scipy.linalg)

*Improved FIR ﬁlter design functions (scipy.signal)

*Improved statistical tests (scipy.stats)

–Deprecated features

*Obsolete nonlinear solvers (in scipy.optimize)

–Removed features

*Old correlate/convolve behavior (in scipy.signal)

*scipy.stats

*scipy.sparse

*scipy.sparse.linalg.arpack.speigs

–Other changes

*ARPACK interface changes

SciPy 0.9.0 is the culmination of 6 months of hard work. It contains many new features, numerous bug-ﬁxes, improved

test coverage and better documentation. There have been a number of deprecations and API changes in this release,

4.5. SciPy 0.9.0 Release Notes 191

SciPy Reference Guide, Release 0.13.0

which are documented below. All users are encouraged to upgrade to this release, as there are a large number of bug-

ﬁxes and optimizations. Moreover, our development attention will now shift to bug-ﬁx releases on the 0.9.x branch,

and on adding new features on the development trunk.

This release requires Python 2.4 - 2.7 or 3.1 - and NumPy 1.5 or greater.

Please note that SciPy is still considered to have “Beta” status, as we work toward a SciPy 1.0.0 release. The 1.0.0

release will mark a major milestone in the development of SciPy, after which changing the package structure or API

will be much more difﬁcult. Whilst these pre-1.0 releases are considered to have “Beta” status, we are committed to

making them as bug-free as possible.

However, until the 1.0 release, we are aggressively reviewing and reﬁning the functionality, organization, and interface.

This is being done in an effort to make the package as coherent, intuitive, and useful as possible. To achieve this, we

need help from the community of users. Speciﬁcally, we need feedback regarding all aspects of the project - everything

- from which algorithms we implement, to details about our function’s call signatures.

4.5.1 Python 3

Scipy 0.9.0 is the ﬁrst SciPy release to support Python 3. The only module that is not yet ported is scipy.weave.

4.5.2 Scipy source code location to be changed

Soon after this release, Scipy will stop using SVN as the version control system, and move to Git. The development

source code for Scipy can from then on be found at

http://github.com/scipy/scipy

4.5.3 New features

Delaunay tesselations (scipy.spatial)

Scipy now includes routines for computing Delaunay tesselations in N dimensions, powered by the Qhull computa-

tional geometry library. Such calculations can now make use of the new scipy.spatial.Delaunay interface.

N-dimensional interpolation (scipy.interpolate)

Support for scattered data interpolation is now signiﬁcantly improved. This version includes a

scipy.interpolate.griddata function that can perform linear and nearest-neighbour interpolation for

N-dimensional scattered data, in addition to cubic spline (C1-smooth) interpolation in 2D and 1D. An object-oriented

interface to each interpolator type is also available.

Nonlinear equation solvers (scipy.optimize)

Scipy includes new routines for large-scale nonlinear equation solving in scipy.optimize. The following methods

are implemented:

• Newton-Krylov (scipy.optimize.newton_krylov)

• (Generalized) secant methods:

–Limited-memory Broyden methods (scipy.optimize.broyden1,

scipy.optimize.broyden2)

–Anderson method (scipy.optimize.anderson)

192 Chapter 4. Release Notes

SciPy Reference Guide, Release 0.13.0

• Simple iterations (scipy.optimize.diagbroyden,scipy.optimize.excitingmixing,

scipy.optimize.linearmixing)

The scipy.optimize.nonlin module was completely rewritten, and some of the functions were deprecated (see

above).

New linear algebra routines (scipy.linalg)

Scipy now contains routines for effectively solving triangular equation systems

(scipy.linalg.solve_triangular).

Improved FIR ﬁlter design functions (scipy.signal)

The function scipy.signal.firwin was enhanced to allow the design of highpass, bandpass, bandstop and

multi-band FIR ﬁlters.

The function scipy.signal.firwin2 was added. This function uses the window method to create a linear phase

FIR ﬁlter with an arbitrary frequency response.

The functions scipy.signal.kaiser_atten and scipy.signal.kaiser_beta were added.

Improved statistical tests (scipy.stats)

A new function scipy.stats.fisher_exact was added, that provides Fisher’s exact test for 2x2 contingency

tables.

The function scipy.stats.kendalltau was rewritten to make it much faster (O(n log(n)) vs O(n^2)).

4.5.4 Deprecated features

Obsolete nonlinear solvers (in scipy.optimize)

The following nonlinear solvers from scipy.optimize are deprecated:

•broyden_modified (bad performance)

•broyden1_modified (bad performance)

•broyden_generalized (equivalent to anderson)

•anderson2 (equivalent to anderson)

•broyden3 (obsoleted by new limited-memory broyden methods)

•vackar (renamed to diagbroyden)

4.5.5 Removed features

The deprecated modules helpmod,pexec and ppimport were removed from scipy.misc.

The output_type keyword in many scipy.ndimage interpolation functions has been removed.

The econ keyword in scipy.linalg.qr has been removed. The same functionality is still available by specifying

mode=’economic’.

4.5. SciPy 0.9.0 Release Notes 193

SciPy Reference Guide, Release 0.13.0

Old correlate/convolve behavior (in scipy.signal)

The old behavior for scipy.signal.convolve,scipy.signal.convolve2d,

scipy.signal.correlate and scipy.signal.correlate2d was deprecated in 0.8.0 and has now

been removed. Convolve and correlate used to swap their arguments if the second argument has dimensions larger

than the ﬁrst one, and the mode was relative to the input with the largest dimension. The current behavior is to never

swap the inputs, which is what most people expect, and is how correlation is usually deﬁned.

scipy.stats

Many functions in scipy.stats that are either available from numpy or have been superseded, and have been

deprecated since version 0.7, have been removed: std,var,mean,median,cov,corrcoef,z,zs,stderr,samplestd,

samplevar,pdfapprox,pdf_moments and erfc. These changes are mirrored in scipy.stats.mstats.

scipy.sparse

Several methods of the sparse matrix classes in scipy.sparse which had been deprecated since version 0.7 were

removed: save,rowcol,getdata,listprint,ensure_sorted_indices,matvec,matmat and rmatvec.

The functions spkron,speye,spidentity,lil_eye and lil_diags were removed from scipy.sparse.

The ﬁrst three functions are still available as scipy.sparse.kron,scipy.sparse.eye and

scipy.sparse.identity.

The dims and nzmax keywords were removed from the sparse matrix constructor. The colind and rowind attributes

were removed from CSR and CSC matrices respectively.

scipy.sparse.linalg.arpack.speigs

A duplicated interface to the ARPACK library was removed.

4.5.6 Other changes

ARPACK interface changes

The interface to the ARPACK eigenvalue routines in scipy.sparse.linalg was changed for more robustness.

The eigenvalue and SVD routines now raise ArpackNoConvergence if the eigenvalue iteration fails to converge.

If partially converged results are desired, they can be accessed as follows:

import numpy as np

from scipy.sparse.linalg import eigs, ArpackNoConvergence

m=np.random.randn(30,30)

try:

w,v=eigs(m, 6)

except ArpackNoConvergence, err:

partially_converged_w =err.eigenvalues

partially_converged_v =err.eigenvectors

Several bugs were also ﬁxed.

The routines were moreover renamed as follows:

• eigen –> eigs

194 Chapter 4. Release Notes

SciPy Reference Guide, Release 0.13.0

• eigen_symmetric –> eigsh

• svd –> svds

4.6 SciPy 0.8.0 Release Notes

Contents

•SciPy 0.8.0 Release Notes

–Python 3

–Major documentation improvements

–Deprecated features

*Swapping inputs for correlation functions (scipy.signal)

*Obsolete code deprecated (scipy.misc)

*Additional deprecations

–New features

*DCT support (scipy.fftpack)

*Single precision support for fft functions (scipy.fftpack)

*Correlation functions now implement the usual deﬁnition (scipy.signal)

*Additions and modiﬁcation to LTI functions (scipy.signal)

*Improved waveform generators (scipy.signal)

*New functions and other changes in scipy.linalg

*New function and changes in scipy.optimize

*New sparse least squares solver

*ARPACK-based sparse SVD

*Alternative behavior available for scipy.constants.find

*Incomplete sparse LU decompositions

*Faster matlab ﬁle reader and default behavior change

*Faster evaluation of orthogonal polynomials

*Lambert W function

*Improved hypergeometric 2F1 function

*More ﬂexible interface for Radial basis function interpolation

–Removed features

*scipy.io

SciPy 0.8.0 is the culmination of 17 months of hard work. It contains many new features, numerous bug-ﬁxes,

improved test coverage and better documentation. There have been a number of deprecations and API changes in

this release, which are documented below. All users are encouraged to upgrade to this release, as there are a large

number of bug-ﬁxes and optimizations. Moreover, our development attention will now shift to bug-ﬁx releases on the

0.8.x branch, and on adding new features on the development trunk. This release requires Python 2.4 - 2.6 and NumPy

1.4.1 or greater.

Please note that SciPy is still considered to have “Beta” status, as we work toward a SciPy 1.0.0 release. The 1.0.0

release will mark a major milestone in the development of SciPy, after which changing the package structure or API

will be much more difﬁcult. Whilst these pre-1.0 releases are considered to have “Beta” status, we are committed to

making them as bug-free as possible.

However, until the 1.0 release, we are aggressively reviewing and reﬁning the functionality, organization, and interface.

This is being done in an effort to make the package as coherent, intuitive, and useful as possible. To achieve this, we

need help from the community of users. Speciﬁcally, we need feedback regarding all aspects of the project - everything

- from which algorithms we implement, to details about our function’s call signatures.

4.6. SciPy 0.8.0 Release Notes 195

SciPy Reference Guide, Release 0.13.0

4.6.1 Python 3

Python 3 compatibility is planned and is currently technically feasible, since Numpy has been ported. However, since

the Python 3 compatible Numpy 1.5 has not been released yet, support for Python 3 in Scipy is not yet included in

Scipy 0.8. SciPy 0.9, planned for fall 2010, will very likely include experimental support for Python 3.

4.6.2 Major documentation improvements

SciPy documentation is greatly improved.

4.6.3 Deprecated features

Swapping inputs for correlation functions (scipy.signal)

Concern correlate, correlate2d, convolve and convolve2d. If the second input is larger than the ﬁrst input, the inputs

are swapped before calling the underlying computation routine. This behavior is deprecated, and will be removed in

scipy 0.9.0.

Obsolete code deprecated (scipy.misc)

The modules helpmod,ppimport and pexec from scipy.misc are deprecated. They will be removed from SciPy in

version 0.9.

Additional deprecations

• linalg: The function solveh_banded currently returns a tuple containing the Cholesky factorization and the

solution to the linear system. In SciPy 0.9, the return value will be just the solution.

• The function constants.codata.ﬁnd will generate a DeprecationWarning. In Scipy version 0.8.0, the keyword

argument ‘disp’ was added to the function, with the default value ‘True’. In 0.9.0, the default will be ‘False’.

• The qshape keyword argument of signal.chirp is deprecated. Use the argument vertex_zero instead.

• Passing the coefﬁcients of a polynomial as the argument f0 to signal.chirp is deprecated. Use the function

signal.sweep_poly instead.

• The io.recaster module has been deprecated and will be removed in 0.9.0.

4.6.4 New features

DCT support (scipy.fftpack)

New realtransforms have been added, namely dct and idct for Discrete Cosine Transform; type I, II and III are available.

Single precision support for fft functions (scipy.fftpack)

fft functions can now handle single precision inputs as well: fft(x) will return a single precision array if x is single

precision.

At the moment, for FFT sizes that are not composites of 2, 3, and 5, the transform is computed internally in double

precision to avoid rounding error in FFTPACK.

196 Chapter 4. Release Notes

SciPy Reference Guide, Release 0.13.0

Correlation functions now implement the usual deﬁnition (scipy.signal)

The outputs should now correspond to their matlab and R counterparts, and do what most people expect if the

old_behavior=False argument is passed:

• correlate, convolve and their 2d counterparts do not swap their inputs depending on their relative shape anymore;

• correlation functions now conjugate their second argument while computing the slided sum-products, which

correspond to the usual deﬁnition of correlation.

Additions and modiﬁcation to LTI functions (scipy.signal)

• The functions impulse2 and step2 were added to scipy.signal. They use the function

scipy.signal.lsim2 to compute the impulse and step response of a system, respectively.

• The function scipy.signal.lsim2 was changed to pass any additional keyword arguments to the ODE

solver.

Improved waveform generators (scipy.signal)

Several improvements to the chirp function in scipy.signal were made:

• The waveform generated when method=”logarithmic” was corrected; it now generates a waveform that is also

known as an “exponential” or “geometric” chirp. (See http://en.wikipedia.org/wiki/Chirp.)

• A new chirp method, “hyperbolic”, was added.

• Instead of the keyword qshape,chirp now uses the keyword vertex_zero, a boolean.

•chirp no longer handles an arbitrary polynomial. This functionality has been moved to a new function,

sweep_poly.

A new function, sweep_poly, was added.

New functions and other changes in scipy.linalg

The functions cho_solve_banded,circulant,companion,hadamard and leslie were added to scipy.linalg.

The function block_diag was enhanced to accept scalar and 1D arguments, along with the usual 2D arguments.

New function and changes in scipy.optimize

The curve_ﬁt function has been added; it takes a function and uses non-linear least squares to ﬁt that to the provided

data.

The leastsq and fsolve functions now return an array of size one instead of a scalar when solving for a single parameter.

New sparse least squares solver

The lsqr function was added to scipy.sparse.This routine ﬁnds a least-squares solution to a large, sparse, linear

system of equations.

4.6. SciPy 0.8.0 Release Notes 197

SciPy Reference Guide, Release 0.13.0

ARPACK-based sparse SVD

A naive implementation of SVD for sparse matrices is available in scipy.sparse.linalg.eigen.arpack. It is based on

using an symmetric solver on <A, A>, and as such may not be very precise.

Alternative behavior available for scipy.constants.find

The keyword argument disp was added to the function scipy.constants.find, with the default value True.

When disp is True, the behavior is the same as in Scipy version 0.7. When False, the function returns the list of keys

instead of printing them. (In SciPy version 0.9, the default will be reversed.)

Incomplete sparse LU decompositions

Scipy now wraps SuperLU version 4.0, which supports incomplete sparse LU decompositions. These can be accessed

via scipy.sparse.linalg.spilu. Upgrade to SuperLU 4.0 also ﬁxes some known bugs.

Faster matlab ﬁle reader and default behavior change

We’ve rewritten the matlab ﬁle reader in Cython and it should now read matlab ﬁles at around the same speed that

Matlab does.

The reader reads matlab named and anonymous functions, but it can’t write them.

Until scipy 0.8.0 we have returned arrays of matlab structs as numpy object arrays, where the objects have attributes

named for the struct ﬁelds. As of 0.8.0, we return matlab structs as numpy structured arrays. You can get the older

behavior by using the optional struct_as_record=False keyword argument to scipy.io.loadmat and

friends.

There is an inconsistency in the matlab ﬁle writer, in that it writes numpy 1D arrays as column vectors in matlab 5

ﬁles, and row vectors in matlab 4 ﬁles. We will change this in the next version, so both write row vectors. There is

aFutureWarning when calling the writer to warn of this change; for now we suggest using the oned_as=’row’

keyword argument to scipy.io.savemat and friends.

Faster evaluation of orthogonal polynomials

Values of orthogonal polynomials can be evaluated with new vectorized functions in scipy.special:

eval_legendre,eval_chebyt,eval_chebyu,eval_chebyc,eval_chebys,eval_jacobi,eval_laguerre,eval_genlaguerre,

eval_hermite,eval_hermitenorm,eval_gegenbauer,eval_sh_legendre,eval_sh_chebyt,eval_sh_chebyu,

eval_sh_jacobi. This is faster than constructing the full coefﬁcient representation of the polynomials, which

was previously the only available way.

Note that the previous orthogonal polynomial routines will now also invoke this feature, when possible.

Lambert W function

scipy.special.lambertw can now be used for evaluating the Lambert W function.

Improved hypergeometric 2F1 function

Implementation of scipy.special.hyp2f1 for real parameters was revised. The new version should produce

accurate values for all real parameters.

198 Chapter 4. Release Notes

SciPy Reference Guide, Release 0.13.0

More ﬂexible interface for Radial basis function interpolation

The scipy.interpolate.Rbf class now accepts a callable as input for the “function” argument, in addition to

the built-in radial basis functions which can be selected with a string argument.

4.6.5 Removed features

scipy.stsci: the package was removed

The module scipy.misc.limits was removed.

The IO code in both NumPy and SciPy is being extensively reworked. NumPy will be where basic code for reading

and writing NumPy arrays is located, while SciPy will house ﬁle readers and writers for various data formats (data,

audio, video, images, matlab, etc.).

Several functions in scipy.io are removed in the 0.8.0 release including: npﬁle,save,load,create_module,cre-

ate_shelf,objload,objsave,fopen,read_array,write_array,fread,fwrite,bswap,packbits,unpackbits, and con-

vert_objectarray. Some of these functions have been replaced by NumPy’s raw reading and writing capabilities,

memory-mapping capabilities, or array methods. Others have been moved from SciPy to NumPy, since basic array

reading and writing capability is now handled by NumPy.

4.7 SciPy 0.7.2 Release Notes

Contents

•SciPy 0.7.2 Release Notes

SciPy 0.7.2 is a bug-ﬁx release with no new features compared to 0.7.1. The only change is that all C sources from

Cython code have been regenerated with Cython 0.12.1. This ﬁxes the incompatibility between binaries of SciPy 0.7.1

and NumPy 1.4.

4.8 SciPy 0.7.1 Release Notes

Contents

•SciPy 0.7.1 Release Notes

–scipy.io

–scipy.odr

–scipy.signal

–scipy.sparse

–scipy.special

–scipy.stats

–Windows binaries for python 2.6

–Universal build for scipy

SciPy 0.7.1 is a bug-ﬁx release with no new features compared to 0.7.0.

Bugs ﬁxed:

• Several ﬁxes in Matlab ﬁle IO

4.7. SciPy 0.7.2 Release Notes 199

SciPy Reference Guide, Release 0.13.0

Bugs ﬁxed:

• Work around a failure with Python 2.6

Memory leak in lﬁlter have been ﬁxed, as well as support for array object

Bugs ﬁxed:

• #880, #925: lﬁlter ﬁxes

• #871: bicgstab fails on Win32

Bugs ﬁxed:

• #883: scipy.io.mmread with scipy.sparse.lil_matrix broken

• lil_matrix and csc_matrix reject now unexpected sequences, cf. http://thread.gmane.org/gmane.comp.python.scientiﬁc.user/19996

Several bugs of varying severity were ﬁxed in the special functions:

• #503, #640: iv: problems at large arguments ﬁxed by new implementation

• #623: jv: ﬁx errors at large arguments

• #679: struve: ﬁx wrong output for v < 0

• #803: pbdv produces invalid output

• #804: lqmn: ﬁx crashes on some input

• #823: betainc: ﬁx documentation

• #834: exp1 strange behavior near negative integer values

• #852: jn_zeros: more accurate results for large s, also in jnp/yn/ynp_zeros

• #853: jv, yv, iv: invalid results for non-integer v < 0, complex x

• #854: jv, yv, iv, kv: return nan more consistently when out-of-domain

• #927: ellipj: ﬁx segfault on Windows

• #946: ellpj: ﬁx segfault on Mac OS X/python 2.6 combination.

• ive, jve, yve, kv, kve: with real-valued input, return nan for out-of-domain instead of returning only the real part

of the result.

Also, when scipy.special.errprint(1) has been enabled, warning messages are now issued as Python warn-

ings instead of printing them to stderr.

• linregress, mannwhitneyu, describe: errors ﬁxed

• kstwobign, norm, expon, exponweib, exponpow, frechet, genexpon, rdist, truncexpon, planck: improvements to

numerical accuracy in distributions

4.8.1 Windows binaries for python 2.6

python 2.6 binaries for windows are now included. The binary for python 2.5 requires numpy 1.2.0 or above, and and

the one for python 2.6 requires numpy 1.3.0 or above.

4.8.2 Universal build for scipy

Mac OS X binary installer is now a proper universal build, and does not depend on gfortran anymore (libgfortran is

statically linked). The python 2.5 version of scipy requires numpy 1.2.0 or above, the python 2.6 version requires

numpy 1.3.0 or above.

200 Chapter 4. Release Notes

SciPy Reference Guide, Release 0.13.0

4.9 SciPy 0.7.0 Release Notes

Contents

•SciPy 0.7.0 Release Notes

–Python 2.6 and 3.0

–Major documentation improvements

–Running Tests

–Building SciPy

–Sandbox Removed

–Sparse Matrices

–Statistics package

–Reworking of IO package

–New Hierarchical Clustering module

–New Spatial package

–Reworked fftpack package

–New Constants package

–New Radial Basis Function module

–New complex ODE integrator

–New generalized symmetric and hermitian eigenvalue problem solver

–Bug ﬁxes in the interpolation package

–Weave clean up

–Known problems

SciPy 0.7.0 is the culmination of 16 months of hard work. It contains many new features, numerous bug-ﬁxes,

improved test coverage and better documentation. There have been a number of deprecations and API changes in

this release, which are documented below. All users are encouraged to upgrade to this release, as there are a large

number of bug-ﬁxes and optimizations. Moreover, our development attention will now shift to bug-ﬁx releases on

the 0.7.x branch, and on adding new features on the development trunk. This release requires Python 2.4 or 2.5 and

NumPy 1.2 or greater.

Please note that SciPy is still considered to have “Beta” status, as we work toward a SciPy 1.0.0 release. The 1.0.0

release will mark a major milestone in the development of SciPy, after which changing the package structure or API

will be much more difﬁcult. Whilst these pre-1.0 releases are considered to have “Beta” status, we are committed to

making them as bug-free as possible. For example, in addition to ﬁxing numerous bugs in this release, we have also

doubled the number of unit tests since the last release.

However, until the 1.0 release, we are aggressively reviewing and reﬁning the functionality, organization, and interface.

This is being done in an effort to make the package as coherent, intuitive, and useful as possible. To achieve this, we

need help from the community of users. Speciﬁcally, we need feedback regarding all aspects of the project - everything

- from which algorithms we implement, to details about our function’s call signatures.

Over the last year, we have seen a rapid increase in community involvement, and numerous infrastructure improve-

ments to lower the barrier to contributions (e.g., more explicit coding standards, improved testing infrastructure, better

documentation tools). Over the next year, we hope to see this trend continue and invite everyone to become more

involved.

4.9.1 Python 2.6 and 3.0

A signiﬁcant amount of work has gone into making SciPy compatible with Python 2.6; however, there are still some

issues in this regard. The main issue with 2.6 support is NumPy. On UNIX (including Mac OS X), NumPy 1.2.1

mostly works, with a few caveats. On Windows, there are problems related to the compilation process. The upcoming

4.9. SciPy 0.7.0 Release Notes 201

SciPy Reference Guide, Release 0.13.0

NumPy 1.3 release will ﬁx these problems. Any remaining issues with 2.6 support for SciPy 0.7 will be addressed in

a bug-ﬁx release.

Python 3.0 is not supported at all; it requires NumPy to be ported to Python 3.0. This requires immense effort, since a

lot of C code has to be ported. The transition to 3.0 is still under consideration; currently, we don’t have any timeline

or roadmap for this transition.

4.9.2 Major documentation improvements

SciPy documentation is greatly improved; you can view a HTML reference manual online or download it as a PDF

ﬁle. The new reference guide was built using the popular Sphinx tool.

This release also includes an updated tutorial, which hadn’t been available since SciPy was ported to NumPy in

2005. Though not comprehensive, the tutorial shows how to use several essential parts of Scipy. It also includes the

ndimage documentation from the numarray manual.

Nevertheless, more effort is needed on the documentation front. Luckily, contributing to Scipy documentation is now

easier than before: if you ﬁnd that a part of it requires improvements, and want to help us out, please register a user

name in our web-based documentation editor at http://docs.scipy.org/ and correct the issues.

4.9.3 Running Tests

NumPy 1.2 introduced a new testing framework based on nose. Starting with this release, SciPy now uses the new

NumPy test framework as well. Taking advantage of the new testing framework requires nose version 0.10, or later.

One major advantage of the new framework is that it greatly simpliﬁes writing unit tests - which has all ready paid off,

given the rapid increase in tests. To run the full test suite:

>>> import scipy

>>> scipy.test(’full’)

For more information, please see The NumPy/SciPy Testing Guide.

We have also greatly improved our test coverage. There were just over 2,000 unit tests in the 0.6.0 release; this release

nearly doubles that number, with just over 4,000 unit tests.

4.9.4 Building SciPy

Support for NumScons has been added. NumScons is a tentative new build system for NumPy/SciPy, using SCons at

its core.

SCons is a next-generation build system, intended to replace the venerable Make with the integrated functionality

of autoconf/automake and ccache. Scons is written in Python and its conﬁguration ﬁles are Python scripts.

NumScons is meant to replace NumPy’s custom version of distutils providing more advanced functionality, such

as autoconf, improved fortran support, more tools, and support for numpy.distutils/scons cooperation.

4.9.5 Sandbox Removed

While porting SciPy to NumPy in 2005, several packages and modules were moved into scipy.sandbox. The

sandbox was a staging ground for packages that were undergoing rapid development and whose APIs were in ﬂux. It

was also a place where broken code could live. The sandbox has served its purpose well, but was starting to create

confusion. Thus scipy.sandbox was removed. Most of the code was moved into scipy, some code was made

into a scikit, and the remaining code was just deleted, as the functionality had been replaced by other code.

202 Chapter 4. Release Notes

SciPy Reference Guide, Release 0.13.0

4.9.6 Sparse Matrices

Sparse matrices have seen extensive improvements. There is now support for integer dtypes such int8,uint32, etc.

Two new sparse formats were added:

• new class dia_matrix : the sparse DIAgonal format

• new class bsr_matrix : the Block CSR format

Several new sparse matrix construction functions were added:

•sparse.kron : sparse Kronecker product

•sparse.bmat : sparse version of numpy.bmat

•sparse.vstack : sparse version of numpy.vstack

•sparse.hstack : sparse version of numpy.hstack

Extraction of submatrices and nonzero values have been added:

•sparse.tril : extract lower triangle

•sparse.triu : extract upper triangle

•sparse.find : nonzero values and their indices

csr_matrix and csc_matrix now support slicing and fancy indexing (e.g., A[1:3, 4:7] and

A[[3,2,6,8],:]). Conversions among all sparse formats are now possible:

• using member functions such as .tocsr() and .tolil()

• using the .asformat() member function, e.g. A.asformat(’csr’)

• using constructors A = lil_matrix([[1,2]]); B = csr_matrix(A)

All sparse constructors now accept dense matrices and lists of lists. For example:

•A = csr_matrix( rand(3,3) ) and B = lil_matrix( [[1,2],[3,4]] )

The handling of diagonals in the spdiags function has been changed. It now agrees with the MATLAB(TM) function

of the same name.

Numerous efﬁciency improvements to format conversions and sparse matrix arithmetic have been made. Finally, this

release contains numerous bugﬁxes.

4.9.7 Statistics package

Statistical functions for masked arrays have been added, and are accessible through scipy.stats.mstats. The

functions are similar to their counterparts in scipy.stats but they have not yet been veriﬁed for identical interfaces

and algorithms.

Several bugs were ﬁxed for statistical functions, of those, kstest and percentileofscore gained new keyword

arguments.

Added deprecation warning for mean,median,var,std,cov, and corrcoef. These functions should

be replaced by their numpy counterparts. Note, however, that some of the default options differ between the

scipy.stats and numpy versions of these functions.

Numerous bug ﬁxes to stats.distributions: all generic methods now work correctly, several methods in

individual distributions were corrected. However, a few issues remain with higher moments (skew,kurtosis)

and entropy. The maximum likelihood estimator, fit, does not work out-of-the-box for some distributions - in

some cases, starting values have to be carefully chosen, in other cases, the generic implementation of the maximum

likelihood method might not be the numerically appropriate estimation method.

4.9. SciPy 0.7.0 Release Notes 203

SciPy Reference Guide, Release 0.13.0

We expect more bugﬁxes, increases in numerical precision and enhancements in the next release of scipy.

4.9.8 Reworking of IO package

The IO code in both NumPy and SciPy is being extensively reworked. NumPy will be where basic code for reading

and writing NumPy arrays is located, while SciPy will house ﬁle readers and writers for various data formats (data,

audio, video, images, matlab, etc.).

Several functions in scipy.io have been deprecated and will be removed in the 0.8.0 release including

npfile,save,load,create_module,create_shelf,objload,objsave,fopen,read_array,

write_array,fread,fwrite,bswap,packbits,unpackbits, and convert_objectarray. Some

of these functions have been replaced by NumPy’s raw reading and writing capabilities, memory-mapping capabili-

ties, or array methods. Others have been moved from SciPy to NumPy, since basic array reading and writing capability

is now handled by NumPy.

The Matlab (TM) ﬁle readers/writers have a number of improvements:

• default version 5

• v5 writers for structures, cell arrays, and objects

• v5 readers/writers for function handles and 64-bit integers

• new struct_as_record keyword argument to loadmat, which loads struct arrays in matlab as record arrays in

numpy

• string arrays have dtype=’U...’ instead of dtype=object

•loadmat no longer squeezes singleton dimensions, i.e. squeeze_me=False by default

4.9.9 New Hierarchical Clustering module

This module adds new hierarchical clustering functionality to the scipy.cluster package. The function inter-

faces are similar to the functions provided MATLAB(TM)’s Statistics Toolbox to help facilitate easier migration to

the NumPy/SciPy framework. Linkage methods implemented include single, complete, average, weighted, centroid,

median, and ward.

In addition, several functions are provided for computing inconsistency statistics, cophenetic distance, and maximum

distance between descendants. The fcluster and fclusterdata functions transform a hierarchical clustering

into a set of ﬂat clusters. Since these ﬂat clusters are generated by cutting the tree into a forest of trees, the leaders

function takes a linkage and a ﬂat clustering, and ﬁnds the root of each tree in the forest. The ClusterNode class

represents a hierarchical clusterings as a ﬁeld-navigable tree object. to_tree converts a matrix-encoded hierarchical

clustering to a ClusterNode object. Routines for converting between MATLAB and SciPy linkage encodings are

provided. Finally, a dendrogram function plots hierarchical clusterings as a dendrogram, using matplotlib.

4.9.10 New Spatial package

The new spatial package contains a collection of spatial algorithms and data structures, useful for spatial statistics and

clustering applications. It includes rapidly compiled code for computing exact and approximate nearest neighbors, as

well as a pure-python kd-tree with the same interface, but that supports annotation and a variety of other algorithms.

The API for both modules may change somewhat, as user requirements become clearer.

It also includes a distance module, containing a collection of distance and dissimilarity functions for computing

distances between vectors, which is useful for spatial statistics, clustering, and kd-trees. Distance and dissimilar-

ity functions provided include Bray-Curtis, Canberra, Chebyshev, City Block, Cosine, Dice, Euclidean, Hamming,

204 Chapter 4. Release Notes

SciPy Reference Guide, Release 0.13.0

Jaccard, Kulsinski, Mahalanobis, Matching, Minkowski, Rogers-Tanimoto, Russell-Rao, Squared Euclidean, Stan-

dardized Euclidean, Sokal-Michener, Sokal-Sneath, and Yule.

The pdist function computes pairwise distance between all unordered pairs of vectors in a set of vectors. The cdist

computes the distance on all pairs of vectors in the Cartesian product of two sets of vectors. Pairwise distance matrices

are stored in condensed form; only the upper triangular is stored. squareform converts distance matrices between

square and condensed forms.

4.9.11 Reworked fftpack package

FFTW2, FFTW3, MKL and DJBFFT wrappers have been removed. Only (NETLIB) fftpack remains. By focusing on

one backend, we hope to add new features - like ﬂoat32 support - more easily.

4.9.12 New Constants package

scipy.constants provides a collection of physical constants and conversion factors. These constants are

taken from CODATA Recommended Values of the Fundamental Physical Constants: 2002. They may be found at

physics.nist.gov/constants. The values are stored in the dictionary physical_constants as a tuple containing the value,

the units, and the relative precision - in that order. All constants are in SI units, unless otherwise stated. Several helper

functions are provided.

4.9.13 New Radial Basis Function module

scipy.interpolate now contains a Radial Basis Function module. Radial basis functions can be used for

smoothing/interpolating scattered data in n-dimensions, but should be used with caution for extrapolation outside

of the observed data range.

4.9.14 New complex ODE integrator

scipy.integrate.ode now contains a wrapper for the ZVODE complex-valued ordinary differential equation

solver (by Peter N. Brown, Alan C. Hindmarsh, and George D. Byrne).

4.9.15 New generalized symmetric and hermitian eigenvalue problem solver

scipy.linalg.eigh now contains wrappers for more LAPACK symmetric and hermitian eigenvalue problem

solvers. Users can now solve generalized problems, select a range of eigenvalues only, and choose to use a faster algo-

rithm at the expense of increased memory usage. The signature of the scipy.linalg.eigh changed accordingly.

4.9.16 Bug ﬁxes in the interpolation package

The shape of return values from scipy.interpolate.interp1d used to be incorrect, if interpolated data

had more than 2 dimensions and the axis keyword was set to a non-default value. This has been ﬁxed. Moreover,

interp1d returns now a scalar (0D-array) if the input is a scalar. Users of scipy.interpolate.interp1d

may need to revise their code if it relies on the previous behavior.

4.9.17 Weave clean up

There were numerous improvements to scipy.weave.blitz++ was relicensed by the author to be compatible

with the SciPy license. wx_spec.py was removed.

4.9. SciPy 0.7.0 Release Notes 205

SciPy Reference Guide, Release 0.13.0

4.9.18 Known problems

Here are known problems with scipy 0.7.0:

• weave test failures on windows: those are known, and are being revised.

• weave test failure with gcc 4.3 (std::labs): this is a gcc 4.3 bug. A workaround is to add #include <cstdlib> in

scipy/weave/blitz/blitz/funcs.h (line 27). You can make the change in the installed scipy (in site-packages).

206 Chapter 4. Release Notes

CHAPTER

FIVE

REFERENCE

5.1 Clustering package (scipy.cluster)

scipy.cluster.vq

Clustering algorithms are useful in information theory, target detection, communications, compression, and other

areas. The vq module only supports vector quantization and the k-means algorithms.

scipy.cluster.hierarchy

The hierarchy module provides functions for hierarchical and agglomerative clustering. Its features include gener-

ating hierarchical clusters from distance matrices, computing distance matrices from observation vectors, calculating

statistics on clusters, cutting linkages to generate ﬂat clusters, and visualizing clusters with dendrograms.

5.2 K-means clustering and vector quantization

(scipy.cluster.vq)

Provides routines for k-means clustering, generating code books from k-means models, and quantizing vectors by

comparing them with centroids in a code book.

whiten(obs) Normalize a group of observations on a per feature basis.

vq(obs, code_book) Assign codes from a code book to observations.

kmeans(obs, k_or_guess[, iter, thresh]) Performs k-means on a set of observation vectors forming k clusters.

kmeans2(data, k[, iter, thresh, minit, missing]) Classify a set of observations into k clusters using the k-means algorithm.

scipy.cluster.vq.whiten(obs)

Normalize a group of observations on a per feature basis.

Before running k-means, it is beneﬁcial to rescale each feature dimension of the observation set with whitening.

Each feature is divided by its standard deviation across all observations to give it unit variance.

Parameters obs : ndarray

Each row of the array is an observation. The columns are the features seen during each

observation.

>>> # f0 f1 f2

>>> obs =[[ 1.,1.,1.], #o0

... [2.,2.,2.], #o1

... [3.,3.,3.], #o2

... [4.,4.,4.]]) #o3

207

SciPy Reference Guide, Release 0.13.0

Returns result : ndarray

Contains the values in obs scaled by the standard deviation of each column.

Examples

>>> from scipy.cluster.vq import whiten

>>> features =np.array([[1.9,2.3,1.7],

... [1.5,2.5,2.2],

... [0.8,0.6,1.7,]])

>>> whiten(features)

array([[ 4.17944278, 2.69811351, 7.21248917],

[ 3.29956009, 2.93273208, 9.33380951],

[ 1.75976538, 0.7038557 , 7.21248917]])

scipy.cluster.vq.vq(obs,code_book)

Assign codes from a code book to observations.

Assigns a code from a code book to each observation. Each observation vector in the ‘M’ by ‘N’ obs array is

compared with the centroids in the code book and assigned the code of the closest centroid.

The features in obs should have unit variance, which can be acheived by passing them through the whiten

function. The code book can be created with the k-means algorithm or a different encoding algorithm.

Parameters obs : ndarray

Each row of the ‘N’ x ‘M’ array is an observation. The columns are the “features” seen

during each observation. The features must be whitened ﬁrst using the whiten function

or something equivalent.

code_book : ndarray

The code book is usually generated using the k-means algorithm. Each row of the array

holds a different code, and the columns are the features of the code.

>>> # f0 f1 f2 f3

>>> code_book =[

... [1.,2.,3.,4.], #c0

... [1.,2.,3.,4.], #c1

... [1.,2.,3.,4.]]) #c2

Returns code : ndarray

A length N array holding the code book index for each observation.

dist : ndarray

The distortion (distance) between the observation and its nearest code.

Notes

This currently forces 32-bit math precision for speed. Anyone know of a situation where this undermines the

accuracy of the algorithm?

Examples

>>> from numpy import array

>>> from scipy.cluster.vq import vq

>>> code_book =array([[1.,1.,1.],

... [2.,2.,2.]])

>>> features =array([[ 1.9,2.3,1.7],

... [1.5,2.5,2.2],

... [0.8,0.6,1.7]])

>>> vq(features,code_book)

(array([1, 1, 0],’i’), array([ 0.43588989, 0.73484692, 0.83066239]))

208 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.cluster.vq.kmeans(obs,k_or_guess,iter=20,thresh=1e-05)

Performs k-means on a set of observation vectors forming k clusters.

The k-means algorithm adjusts the centroids until sufﬁcient progress cannot be made, i.e. the change in distor-

tion since the last iteration is less than some threshold. This yields a code book mapping centroids to codes and

vice versa.

Distortion is deﬁned as the sum of the squared differences between the observations and the corresponding

centroid.

Parameters obs : ndarray

Each row of the M by N array is an observation vector. The columns are the features

seen during each observation. The features must be whitened ﬁrst with the whiten

function.

k_or_guess : int or ndarray

The number of centroids to generate. A code is assigned to each centroid, which is also

the row index of the centroid in the code_book matrix generated.

The initial k centroids are chosen by randomly selecting observations from the obser-

vation matrix. Alternatively, passing a k by N array speciﬁes the initial k centroids.

iter : int, optional

The number of times to run k-means, returning the codebook with the lowest distor-

tion. This argument is ignored if initial centroids are speciﬁed with an array for the

k_or_guess parameter. This parameter does not represent the number of iterations

of the k-means algorithm.

thresh : ﬂoat, optional

Terminates the k-means algorithm if the change in distortion since the last k-means

iteration is less than or equal to thresh.

Returns codebook : ndarray

A k by N array of k centroids. The i’th centroid codebook[i] is represented with the

code i. The centroids and codes generated represent the lowest distortion seen, not

necessarily the globally minimal distortion.

distortion : ﬂoat

The distortion between the observations passed and the centroids generated.

See Also

kmeans2 a different implementation of k-means clustering with more methods for generating initial cen-

troids but without using a distortion change threshold as a stopping criterion.

whiten must be called prior to passing an observation matrix to kmeans.

Examples

>>> from numpy import array

>>> from scipy.cluster.vq import vq, kmeans, whiten

>>> features =array([[ 1.9,2.3],

... [1.5,2.5],

... [0.8,0.6],

... [0.4,1.8],

... [0.1,0.1],

... [0.2,1.8],

... [2.0,0.5],

... [0.3,1.5],

... [1.0,1.0]])

>>> whitened =whiten(features)

>>> book =array((whitened[0],whitened[2]))

>>> kmeans(whitened,book)

5.2. K-means clustering and vector quantization (scipy.cluster.vq) 209

SciPy Reference Guide, Release 0.13.0

(array([[ 2.3110306 , 2.86287398],

[ 0.93218041, 1.24398691]]), 0.85684700941625547)

>>> from numpy import random

>>> random.seed((1000,2000))

>>> codes =3

>>> kmeans(whitened,codes)

(array([[ 2.3110306 , 2.86287398],

[ 1.32544402, 0.65607529],

[ 0.40782893, 2.02786907]]), 0.5196582527686241)

scipy.cluster.vq.kmeans2(data,k,iter=10,thresh=1e-05,minit=’random’,missing=’warn’)

Classify a set of observations into k clusters using the k-means algorithm.

The algorithm attempts to minimize the Euclidian distance between observations and centroids. Several initial-

ization methods are included.

Parameters data : ndarray

A ‘M’ by ‘N’ array of ‘M’ observations in ‘N’ dimensions or a length ‘M’ array of ‘M’

one-dimensional observations.

k: int or ndarray

The number of clusters to form as well as the number of centroids to generate. If minit

initialization string is ‘matrix’, or if a ndarray is given instead, it is interpreted as initial

cluster to use instead.

iter : int

Number of iterations of the k-means algrithm to run. Note that this differs in meaning

from the iters parameter to the kmeans function.

thresh : ﬂoat

(not used yet)

minit : string

Method for initialization. Available methods are ‘random’, ‘points’, ‘uniform’, and

‘matrix’:

‘random’: generate k centroids from a Gaussian with mean and variance estimated

from the data.

‘points’: choose k observations (rows) at random from data for the initial centroids.

‘uniform’: generate k observations from the data from a uniform distribution deﬁned

by the data set (unsupported).

‘matrix’: interpret the k parameter as a k by M (or length k array for one-dimensional

data) array of initial centroids.

Returns centroid : ndarray

A ‘k’ by ‘N’ array of centroids found at the last iteration of k-means.

label : ndarray

label[i] is the code or index of the centroid the i’th observation is closest to.

5.2.1 Background information

The k-means algorithm takes as input the number of clusters to generate, k, and a set of observation vectors to cluster.

It returns a set of centroids, one for each of the k clusters. An observation vector is classiﬁed with the cluster number

or centroid index of the centroid closest to it.

A vector v belongs to cluster i if it is closer to centroid i than any other centroids. If v belongs to i, we say centroid i is

the dominating centroid of v. The k-means algorithm tries to minimize distortion, which is deﬁned as the sum of the

squared distances between each observation vector and its dominating centroid. Each step of the k-means algorithm

reﬁnes the choices of centroids to reduce distortion. The change in distortion is used as a stopping criterion: when

210 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

the change is lower than a threshold, the k-means algorithm is not making sufﬁcient progress and terminates. One can

also deﬁne a maximum number of iterations.

Since vector quantization is a natural application for k-means, information theory terminology is often used. The

centroid index or cluster index is also referred to as a “code” and the table mapping codes to centroids and vice

versa is often referred as a “code book”. The result of k-means, a set of centroids, can be used to quantize vectors.

Quantization aims to ﬁnd an encoding of vectors that reduces the expected distortion.

All routines expect obs to be a M by N array where the rows are the observation vectors. The codebook is a k by N

array where the i’th row is the centroid of code word i. The observation vectors and centroids have the same feature

dimension.

As an example, suppose we wish to compress a 24-bit color image (each pixel is represented by one byte for red, one

for blue, and one for green) before sending it over the web. By using a smaller 8-bit encoding, we can reduce the

amount of data by two thirds. Ideally, the colors for each of the 256 possible 8-bit encoding values should be chosen

to minimize distortion of the color. Running k-means with k=256 generates a code book of 256 codes, which ﬁlls up

all possible 8-bit sequences. Instead of sending a 3-byte value for each pixel, the 8-bit centroid index (or code word)

of the dominating centroid is transmitted. The code book is also sent over the wire so each 8-bit code can be translated

back to a 24-bit pixel value representation. If the image of interest was of an ocean, we would expect many 24-bit

blues to be represented by 8-bit codes. If it was an image of a human face, more ﬂesh tone colors would be represented

in the code book.

5.3 Hierarchical clustering (scipy.cluster.hierarchy)

These functions cut hierarchical clusterings into ﬂat clusterings or ﬁnd the roots of the forest formed by a cut by

providing the ﬂat cluster ids of each observation.

fcluster(Z, t[, criterion, depth, R, monocrit]) Forms ﬂat clusters from the hierarchical clustering deﬁned by

fclusterdata(X, t[, criterion, metric, ...]) Cluster observation data using a given metric.

leaders(Z, T) Returns the root nodes in a hierarchical clustering.

scipy.cluster.hierarchy.fcluster(Z,t,criterion=’inconsistent’,depth=2,R=None,mon-

ocrit=None)

Forms ﬂat clusters from the hierarchical clustering deﬁned by the linkage matrix Z.

Parameters Z: ndarray

The hierarchical clustering encoded with the matrix returned by the linkage func-

tion.

t: ﬂoat

The threshold to apply when forming ﬂat clusters.

criterion : str, optional

The criterion to use in forming ﬂat clusters. This can be any of the following values:

inconsistent

[If a cluster node and all its] descendants have an inconsistent value

less than or equal to tthen all its leaf descendants belong to the same

ﬂat cluster. When no non-singleton cluster meets this criterion, every

node is assigned to its own cluster. (Default)

distance [Forms ﬂat clusters so that the original] observations in each ﬂat clus-

ter have no greater a cophenetic distance than t.

maxclust [Finds a minimum threshold rso that] the cophenetic distance be-

tween any two original observations in the same ﬂat cluster is no more

than rand no more than tﬂat clusters are formed.

monocrit [Forms a ﬂat cluster from a cluster node c] with index i when

monocrit[j] <= t.

5.3. Hierarchical clustering (scipy.cluster.hierarchy) 211

SciPy Reference Guide, Release 0.13.0

For example, to threshold on the maximum mean distance as com-

puted in the inconsistency matrix R with a threshold of 0.8 do:

MR = maxRstat(Z, R, 3)

cluster(Z, t=0.8, criterion=’monocrit’, monocrit=MR)

maxclust_monocrit

[Forms a ﬂat cluster from a] non-singleton cluster node cwhen

monocrit[i] <= r for all cluster indices ibelow and including

c.ris minimized such that no more than tﬂat clusters are formed.

monocrit must be monotonic. For example, to minimize the thresh-

old t on maximum inconsistency values so that no more than 3 ﬂat

clusters are formed, do:

MI = maxinconsts(Z, R)

cluster(Z, t=3, criterion=’maxclust_monocrit’, monocrit=MI)

depth : int, optional

The maximum depth to perform the inconsistency calculation. It has no meaning for

the other criteria. Default is 2.

R: ndarray, optional

The inconsistency matrix to use for the ‘inconsistent’ criterion. This matrix is com-

puted if not provided.

monocrit : ndarray, optional

An array of length n-1. monocrit[i] is the statistics upon which non-singleton i is

thresholded. The monocrit vector must be monotonic, i.e. given a node c with index i,

for all node indices j corresponding to nodes below c, monocrit[i] >= monocrit[j].

Returns fcluster : ndarray

An array of length n. T[i] is the ﬂat cluster number to which original observation i

belongs.

scipy.cluster.hierarchy.fclusterdata(X,t,criterion=’inconsistent’,metric=’euclidean’,

depth=2,method=’single’,R=None)

Cluster observation data using a given metric.

Clusters the original observations in the n-by-m data matrix X (n observations in m dimensions), using the

euclidean distance metric to calculate distances between original observations, performs hierarchical clustering

using the single linkage algorithm, and forms ﬂat clusters using the inconsistency method with tas the cut-off

threshold.

A one-dimensional array T of length n is returned. T[i] is the index of the ﬂat cluster to which the original

observation i belongs.

Parameters X: (N, M) ndarray

N by M data matrix with N observations in M dimensions.

t: ﬂoat

The threshold to apply when forming ﬂat clusters.

criterion : str, optional

Speciﬁes the criterion for forming ﬂat clusters. Valid values are ‘inconsistent’ (de-

fault), ‘distance’, or ‘maxclust’ cluster formation algorithms. See fcluster for

descriptions.

metric : str, optional

The distance metric for calculating pairwise distances. See distance.pdist for

descriptions and linkage to verify compatibility with the linkage method.

depth : int, optional

The maximum depth for the inconsistency calculation. See inconsistent for

more information.

method : str, optional

The linkage method to use (single, complete, average, weighted, median centroid,

ward). See linkage for more information. Default is “single”.

212 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

R: ndarray, optional

The inconsistency matrix. It will be computed if necessary if it is not passed.

Returns fclusterdata : ndarray

A vector of length n. T[i] is the ﬂat cluster number to which original observation i

belongs.

Notes

This function is similar to the MATLAB function clusterdata.

scipy.cluster.hierarchy.leaders(Z,T)

Returns the root nodes in a hierarchical clustering.

Returns the root nodes in a hierarchical clustering corresponding to a cut deﬁned by a ﬂat cluster assignment

vector T. See the fcluster function for more information on the format of T.

For each ﬂat cluster jof the kﬂat clusters represented in the n-sized ﬂat cluster assignment vector T, this

function ﬁnds the lowest cluster node iin the linkage tree Z such that:

•leaf descendents belong only to ﬂat cluster j (i.e. T[p]==j for all pin S(i)where S(i)is the set of leaf

ids of leaf nodes descendent with cluster node i)

•there does not exist a leaf that is not descendent with ithat also belongs to cluster j(i.e. T[q]!=j for

all qnot in S(i)). If this condition is violated, Tis not a valid cluster assignment vector, and an exception

will be thrown.

Parameters Z: ndarray

The hierarchical clustering encoded as a matrix. See linkage for more information.

T: ndarray

The ﬂat cluster assignment vector.

Returns L: ndarray

The leader linkage node id’s stored as a k-element 1-D array where kis the number

of ﬂat clusters found in T.

L[j]=i is the linkage cluster node id that is the leader of ﬂat cluster with id M[j].

If i<n,icorresponds to an original observation, otherwise it corresponds to a

non-singleton cluster.

For example: if L[3]=2 and M[3]=8, the ﬂat cluster with id 8’s leader is linkage

node 2.

M: ndarray

The leader linkage node id’s stored as a k-element 1-D array where kis the number

of ﬂat clusters found in T. This allows the set of ﬂat cluster ids to be any arbitrary set

of kintegers.

These are routines for agglomerative clustering.

linkage(y[, method, metric]) Performs hierarchical/agglomerative clustering on the condensed distance matrix y.

single(y) Performs single/min/nearest linkage on the condensed distance matrix y

complete(y) Performs complete/max/farthest point linkage on a condensed distance matrix

average(y) Performs average/UPGMA linkage on a condensed distance matrix

weighted(y) Performs weighted/WPGMA linkage on the condensed distance matrix.

centroid(y) Performs centroid/UPGMC linkage.

median(y) Performs median/WPGMC linkage.

ward(y) Performs Ward’s linkage on a condensed or redundant distance matrix.

scipy.cluster.hierarchy.linkage(y,method=’single’,metric=’euclidean’)

Performs hierarchical/agglomerative clustering on the condensed distance matrix y.

5.3. Hierarchical clustering (scipy.cluster.hierarchy) 213

SciPy Reference Guide, Release 0.13.0

y must be a n

2sized vector where n is the number of original observations paired in the distance matrix. The

behavior of this function is very similar to the MATLAB linkage function.

A 4 by (n−1) matrix Zis returned. At the i-th iteration, clusters with indices Z[i, 0] and Z[i, 1]

are combined to form cluster n+i. A cluster with an index less than ncorresponds to one of the noriginal

observations. The distance between clusters Z[i, 0] and Z[i, 1] is given by Z[i, 2]. The fourth value

Z[i, 3] represents the number of original observations in the newly formed cluster.

The following linkage methods are used to compute the distance d(s, t)between two clusters sand t. The

algorithm begins with a forest of clusters that have yet to be used in the hierarchy being formed. When two

clusters sand tfrom this forest are combined into a single cluster u,sand tare removed from the forest, and u

is added to the forest. When only one cluster remains in the forest, the algorithm stops, and this cluster becomes

the root.

A distance matrix is maintained at each iteration. The d[i,j] entry corresponds to the distance between cluster

iand jin the original forest.

At each iteration, the algorithm must update the distance matrix to reﬂect the distance of the newly formed

cluster u with the remaining clusters in the forest.

Suppose there are |u|original observations u[0], . . . , u[|u| − 1] in cluster uand |v|original objects

v[0], . . . , v[|v| − 1] in cluster v. Recall sand tare combined to form cluster u. Let vbe any remaining cluster

in the forest that is not u.

The following are methods for calculating the distance between the newly formed cluster uand each v.

•method=’single’ assigns

d(u, v) = min(dist(u[i], v[j]))

for all points iin cluster uand jin cluster v. This is also known as the Nearest Point Algorithm.

•method=’complete’ assigns

d(u, v) = max(dist(u[i], v[j]))

for all points iin cluster u and jin cluster v. This is also known by the Farthest Point Algorithm or Voor

Hees Algorithm.

•method=’average’ assigns

d(u, v) = X

d(u[i], v[j])

(|u|∗|v|)

for all points iand jwhere |u|and |v|are the cardinalities of clusters uand v, respectively. This is also

called the UPGMA algorithm. This is called UPGMA.

•method=’weighted’ assigns

d(u, v)=(dist(s, v) + dist(t, v))/2

where cluster u was formed with cluster s and t and v is a remaining cluster in the forest. (also called

WPGMA)

214 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

•method=’centroid’ assigns

dist(s, t) = ||cs−ct||2

where csand ctare the centroids of clusters sand t, respectively. When two clusters sand tare combined

into a new cluster u, the new centroid is computed over all the original objects in clusters sand t. The

distance then becomes the Euclidean distance between the centroid of uand the centroid of a remaining

cluster vin the forest. This is also known as the UPGMC algorithm.

•method=’median’ assigns math:d(s,t) like the centroid method. When two clusters sand tare com-

bined into a new cluster u, the average of centroids s and t give the new centroid u. This is also known as

the WPGMC algorithm.

•method=’ward’ uses the Ward variance minimization algorithm. The new entry d(u, v)is computed as

follows,

d(u, v) = r|v|+|s|

Td(v, s)2+|v|+|t|

Td(v, t)2+|v|

Td(s, t)2

where uis the newly joined cluster consisting of clusters sand t,vis an unused cluster in the forest,

T=|v|+|s|+|t|, and | ∗ | is the cardinality of its argument. This is also known as the incremental

algorithm.

Warning: When the minimum distance pair in the forest is chosen, there may be two or more pairs with the same

minimum distance. This implementation may chose a different minimum than the MATLAB version.

Parameters y: ndarray

A condensed or redundant distance matrix. A condensed distance matrix is a ﬂat array

containing the upper triangular of the distance matrix. This is the form that pdist

returns. Alternatively, a collection of mobservation vectors in n dimensions may be

passed as an mby narray.

method : str, optional

The linkage algorithm to use. See the Linkage Methods section below for full

descriptions.

metric : str, optional

The distance metric to use. See the distance.pdist function for a list of valid

distance metrics.

Returns Z: ndarray

The hierarchical clustering encoded as a linkage matrix.

scipy.cluster.hierarchy.single(y)

Performs single/min/nearest linkage on the condensed distance matrix y

Parameters y: ndarray

The upper triangular of the distance matrix. The result of pdist is returned in this

form.

Returns Z: ndarray

The linkage matrix.

See Also

linkage for advanced creation of hierarchical clusterings.

scipy.cluster.hierarchy.complete(y)

Performs complete/max/farthest point linkage on a condensed distance matrix

Parameters y: ndarray

5.3. Hierarchical clustering (scipy.cluster.hierarchy) 215

SciPy Reference Guide, Release 0.13.0

The upper triangular of the distance matrix. The result of pdist is returned in this

form.

Returns Z: ndarray

A linkage matrix containing the hierarchical clustering. See the linkage function

documentation for more information on its structure.

See Also

linkage

scipy.cluster.hierarchy.average(y)

Performs average/UPGMA linkage on a condensed distance matrix

Parameters y: ndarray

The upper triangular of the distance matrix. The result of pdist is returned in this

form.

Returns Z: ndarray

A linkage matrix containing the hierarchical clustering. See the linkage function

documentation for more information on its structure.

See Also

linkage for advanced creation of hierarchical clusterings.

scipy.cluster.hierarchy.weighted(y)

Performs weighted/WPGMA linkage on the condensed distance matrix.

See linkage for more information on the return structure and algorithm.

Parameters y: ndarray

The upper triangular of the distance matrix. The result of pdist is returned in this

form.

Returns Z: ndarray

A linkage matrix containing the hierarchical clustering. See the linkage function

documentation for more information on its structure.

See Also

linkage for advanced creation of hierarchical clusterings.

scipy.cluster.hierarchy.centroid(y)

Performs centroid/UPGMC linkage.

See linkage for more information on the return structure and algorithm.

The following are common calling conventions:

1.Z = centroid(y)

Performs centroid/UPGMC linkage on the condensed distance matrix y. See linkage for more infor-

mation on the return structure and algorithm.

2.Z = centroid(X)

Performs centroid/UPGMC linkage on the observation matrix Xusing Euclidean distance as the distance

metric. See linkage for more information on the return structure and algorithm.

Parameters Q: ndarray

216 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

A condensed or redundant distance matrix. A condensed distance matrix is a ﬂat array

containing the upper triangular of the distance matrix. This is the form that pdist

returns. Alternatively, a collection of m observation vectors in n dimensions may be

passed as a m by n array.

Returns Z: ndarray

A linkage matrix containing the hierarchical clustering. See the linkage function

documentation for more information on its structure.

See Also

linkage for advanced creation of hierarchical clusterings.

scipy.cluster.hierarchy.median(y)

Performs median/WPGMC linkage.

See linkage for more information on the return structure and algorithm.

The following are common calling conventions:

1.Z = median(y)

Performs median/WPGMC linkage on the condensed distance matrix y. See linkage for more

information on the return structure and algorithm.

2.Z = median(X)

Performs median/WPGMC linkage on the observation matrix Xusing Euclidean distance as the

distance metric. See linkage for more information on the return structure and algorithm.

Parameters Q: ndarray

A condensed or redundant distance matrix. A condensed distance matrix is a ﬂat array

containing the upper triangular of the distance matrix. This is the form that pdist

returns. Alternatively, a collection of m observation vectors in n dimensions may be

passed as a m by n array.

Returns Z: ndarray

The hierarchical clustering encoded as a linkage matrix.

See Also

linkage for advanced creation of hierarchical clusterings.

scipy.cluster.hierarchy.ward(y)

Performs Ward’s linkage on a condensed or redundant distance matrix.

See linkage for more information on the return structure and algorithm.

The following are common calling conventions:

1.Z = ward(y) Performs Ward’s linkage on the condensed distance matrix Z. See linkage for more

information on the return structure and algorithm.

2.Z = ward(X) Performs Ward’s linkage on the observation matrix Xusing Euclidean distance as the

distance metric. See linkage for more information on the return structure and algorithm.

Parameters Q: ndarray

A condensed or redundant distance matrix. A condensed distance matrix is a ﬂat array

containing the upper triangular of the distance matrix. This is the form that pdist

returns. Alternatively, a collection of m observation vectors in n dimensions may be

passed as a m by n array.

Returns Z: ndarray

The hierarchical clustering encoded as a linkage matrix.

5.3. Hierarchical clustering (scipy.cluster.hierarchy) 217

SciPy Reference Guide, Release 0.13.0

See Also

linkage for advanced creation of hierarchical clusterings.

These routines compute statistics on hierarchies.

cophenet(Z[, Y]) Calculates the cophenetic distances between each observation in

from_mlab_linkage(Z) Converts a linkage matrix generated by MATLAB(TM) to a new

inconsistent(Z[, d]) Calculates inconsistency statistics on a linkage.

maxinconsts(Z, R) Returns the maximum inconsistency coefﬁcient for each non-singleton cluster and its descendents.

maxdists(Z) Returns the maximum distance between any non-singleton cluster.

maxRstat(Z, R, i) Returns the maximum statistic for each non-singleton cluster and its descendents.

to_mlab_linkage(Z) Converts a linkage matrix to a MATLAB(TM) compatible one.

scipy.cluster.hierarchy.cophenet(Z,Y=None)

Calculates the cophenetic distances between each observation in the hierarchical clustering deﬁned by the

linkage Z.

Suppose pand qare original observations in disjoint clusters sand t, respectively and sand tare joined by

a direct parent cluster u. The cophenetic distance between observations iand jis simply the distance between

clusters sand t.

Parameters Z: ndarray

The hierarchical clustering encoded as an array (see linkage function).

Y: ndarray (optional)

Calculates the cophenetic correlation coefﬁcient cof a hierarchical clustering deﬁned

by the linkage matrix Zof a set of nobservations in mdimensions. Yis the condensed

distance matrix from which Zwas generated.

Returns c: ndarray

The cophentic correlation distance (if yis passed).

d: ndarray

The cophenetic distance matrix in condensed form. The ij th entry is the cophenetic

distance between original observations iand j.

scipy.cluster.hierarchy.from_mlab_linkage(Z)

Converts a linkage matrix generated by MATLAB(TM) to a new linkage matrix compatible with this module.

The conversion does two things:

•the indices are converted from 1..N to 0..(N-1) form, and

•a fourth column Z[:,3] is added where Z[i,3] is represents the number of original observations (leaves) in

the non-singleton cluster i.

This function is useful when loading in linkages from legacy data ﬁles generated by MATLAB.

Parameters Z: ndarray

A linkage matrix generated by MATLAB(TM).

Returns ZS : ndarray

A linkage matrix compatible with this library.

scipy.cluster.hierarchy.inconsistent(Z,d=2)

Calculates inconsistency statistics on a linkage.

Note: This function behaves similarly to the MATLAB(TM) inconsistent function.

Parameters Z: ndarray

The (n−1) by 4 matrix encoding the linkage (hierarchical clustering). See linkage

documentation for more information on its form.

218 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

d: int, optional

The number of links up to dlevels below each non-singleton cluster.

Returns R: ndarray

A(n−1) by 5 matrix where the i‘th row contains the link statistics for the non-

singleton cluster i. The link statistics are computed over the link heights for links d

levels below the cluster i.R[i,0] and R[i,1] are the mean and standard devia-

tion of the link heights, respectively; R[i,2] is the number of links included in the

calculation; and R[i,3] is the inconsistency coefﬁcient,

Z[i,2]−R[i,0]

R[i, 1]

scipy.cluster.hierarchy.maxinconsts(Z,R)

Returns the maximum inconsistency coefﬁcient for each non-singleton cluster and its descendents.

Parameters Z: ndarray

The hierarchical clustering encoded as a matrix. See linkage for more information.

R: ndarray

The inconsistency matrix.

Returns MI : ndarray

A monotonic (n-1)-sized numpy array of doubles.

scipy.cluster.hierarchy.maxdists(Z)

Returns the maximum distance between any non-singleton cluster.

Parameters Z: ndarray

The hierarchical clustering encoded as a matrix. See linkage for more information.

Returns maxdists : ndarray

A(n-1) sized numpy array of doubles; MD[i] represents the maximum distance

between any cluster (including singletons) below and including the node with index

i. More speciﬁcally, MD[i] = Z[Q(i)-n, 2].max() where Q(i) is the set of

all node indices below and including node i.

scipy.cluster.hierarchy.maxRstat(Z,R,i)

Returns the maximum statistic for each non-singleton cluster and its descendents.

Parameters Z: array_like

The hierarchical clustering encoded as a matrix. See linkage for more information.

R: array_like

The inconsistency matrix.

i: int

The column of Rto use as the statistic.

Returns MR : ndarray

Calculates the maximum statistic for the i’th column of the inconsistency matrix R

for each non-singleton cluster node. MR[j] is the maximum over R[Q(j)-n, i]

where Q(j) the set of all node ids corresponding to nodes below and including j.

scipy.cluster.hierarchy.to_mlab_linkage(Z)

Converts a linkage matrix to a MATLAB(TM) compatible one.

Converts a linkage matrix Zgenerated by the linkage function of this module to a MATLAB(TM) compatible

one. The return linkage matrix has the last column removed and the cluster indices are converted to 1..N

indexing.

Parameters Z: ndarray

A linkage matrix generated by this library.

Returns to_mlab_linkage : ndarray

5.3. Hierarchical clustering (scipy.cluster.hierarchy) 219

SciPy Reference Guide, Release 0.13.0

A linkage matrix compatible with MATLAB(TM)’s hierarchical clustering functions.

The return linkage matrix has the last column removed and the cluster indices are

converted to 1..N indexing.

Routines for visualizing ﬂat clusters.

dendrogram(Z[, p, truncate_mode, ...]) Plots the hierarchical clustering as a dendrogram.

scipy.cluster.hierarchy.dendrogram(Z,p=30,truncate_mode=None,color_threshold=None,

get_leaves=True,orientation=’top’,la-

bels=None,count_sort=False,distance_sort=False,

show_leaf_counts=True,no_plot=False,

no_labels=False,color_list=None,leaf_font_size=None,

leaf_rotation=None,leaf_label_func=None,

no_leaves=False,show_contracted=False,

link_color_func=None)

Plots the hierarchical clustering as a dendrogram.

The dendrogram illustrates how each cluster is composed by drawing a U-shaped link between a non-singleton

cluster and its children. The height of the top of the U-link is the distance between its children clusters. It is

also the cophenetic distance between original observations in the two children clusters. It is expected that the

distances in Z[:,2] be monotonic, otherwise crossings appear in the dendrogram.

Parameters Z: ndarray

The linkage matrix encoding the hierarchical clustering to render as a dendrogram.

See the linkage function for more information on the format of Z.

p: int, optional

The pparameter for truncate_mode.

truncate_mode : str, optional

The dendrogram can be hard to read when the original observation matrix from which

the linkage is derived is large. Truncation is used to condense the dendrogram. There

are several modes:

None/’none’: no truncation is performed (Default)

‘lastp’: the last pnon-singleton formed in the linkage

are the only non-leaf nodes in the linkage; they correspond to to rows

Z[n-p-2:end] in Z. All other non-singleton clusters are contracted

into leaf nodes.

‘mlab’: This corresponds to MATLAB(TM) behavior. (not

implemented yet)

‘level’/’mtica’: no more than plevels of the

dendrogram tree are displayed. This corresponds to Mathematica(TM)

behavior.

color_threshold : double, optional

For brevity, let tbe the color_threshold. Colors all the descendent links below

a cluster node kthe same color if kis the ﬁrst node below the cut threshold t. All links

connecting nodes with distances greater than or equal to the threshold are colored blue.

If tis less than or equal to zero, all nodes are colored blue. If color_threshold

is None or ‘default’, corresponding with MATLAB(TM) behavior, the threshold is set

to 0.7*max(Z[:,2]).

get_leaves : bool, optional

Includes a list R[’leaves’]=H in the result dictionary. For each i,H[i] == j,

cluster node jappears in position iin the left-to-right traversal of the leaves, where

j < 2n−1and i<n.

orientation : str, optional

The direction to plot the dendrogram, which can be any of the following strings:

220 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

‘top’ plots the root at the top, and plot descendent

links going downwards. (default).

‘bottom’- plots the root at the bottom, and plot descendent

links going upwards.

‘left’- plots the root at the left, and plot descendent

links going right.

‘right’- plots the root at the right, and plot descendent

links going left.

labels : ndarray, optional

By default labels is None so the index of the original observation is used to label

the leaf nodes. Otherwise, this is an n-sized list (or tuple). The labels[i] value is

the text to put under the ith leaf node only if it corresponds to an original observation

and not a non-singleton cluster.

count_sort : str or bool, optional

For each node n, the order (visually, from left-to-right) n’s two descendent links are

plotted is determined by this parameter, which can be any of the following values:

False: nothing is done.

‘ascending’/True: the child with the minimum number of

original objects in its cluster is plotted ﬁrst.

‘descendent’: the child with the maximum number of

original objects in its cluster is plotted ﬁrst.

Note distance_sort and count_sort cannot both be True.

distance_sort : str or bool, optional

For each node n, the order (visually, from left-to-right) n’s two descendent links are

plotted is determined by this parameter, which can be any of the following values:

False: nothing is done.

‘ascending’/True: the child with the minimum distance

between its direct descendents is plotted ﬁrst.

‘descending’: the child with the maximum distance

between its direct descendents is plotted ﬁrst.

Note distance_sort and count_sort cannot both be True.

show_leaf_counts : bool, optional

When True, leaf nodes representing k > 1original observation are labeled with the

number of observations they contain in parentheses.

no_plot : bool, optional

When True, the ﬁnal rendering is not performed. This is useful if only the data struc-

tures computed for the rendering are needed or if matplotlib is not available.

no_labels : bool, optional

When True, no labels appear next to the leaf nodes in the rendering of the dendrogram.

leaf_label_rotation : double, optional

Speciﬁes the angle (in degrees) to rotate the leaf labels. When unspeciﬁed, the rotation

based on the number of nodes in the dendrogram. (Default=0)

leaf_font_size : int, optional

Speciﬁes the font size (in points) of the leaf labels. When unspeciﬁed, the size based

on the number of nodes in the dendrogram.

leaf_label_func : lambda or function, optional

When leaf_label_func is a callable function, for each leaf with cluster index k <

2n−1. The function is expected to return a string with the label for the leaf.

Indices k < n correspond to original observations while indices k≥ncorrespond to

non-singleton clusters.

For example, to label singletons with their node id and non-singletons with their id,

count, and inconsistency coefﬁcient, simply do:

5.3. Hierarchical clustering (scipy.cluster.hierarchy) 221

SciPy Reference Guide, Release 0.13.0

>>> # First define the leaf label function.

>>> def llf(id):

... if id <n:

... return str(id)

... else:

>>> return ’[%d %d %1.2f]’ %(id, count, R[n-id,3])

>>>

>>> # The text for the leaf nodes is going to be big so force

>>> # a rotation of 90 degrees.

>>> dendrogram(Z, leaf_label_func=llf, leaf_rotation=90)

show_contracted : bool

When True the heights of non-singleton nodes contracted into a leaf node are plotted

as crosses along the link connecting that leaf node. This really is only useful when

truncation is used (see truncate_mode parameter).

link_color_func : lambda/function

When a callable function, link_color_function is called with each non-singleton id

corresponding to each U-shaped link it will paint. The function is expected to return

the color to paint the link, encoded as a matplotlib color string code. For example:

>>> dendrogram(Z, link_color_func=lambda k: colors[k])

colors the direct links below each untruncated non-singleton node kusing

colors[k].

Returns R: dict

A dictionary of data structures computed to render the dendrogram. Its has the fol-

lowing keys:

‘icoords’: a list of lists [I1, I2, ..., Ip] where

Ik is a list of 4 independent variable coordinates corresponding to

the line that represents the k’th link painted.

‘dcoords’: a list of lists [I2, I2, ..., Ip] where

Ik is a list of 4 independent variable coordinates corresponding to

the line that represents the k’th link painted.

‘ivl’: a list of labels corresponding to the leaf nodes.

‘leaves’: for each i, H[i] == j, cluster node

jappears in position iin the left-to-right traversal of the leaves,

where j < 2n−1and i<n. If jis less than n, the ith leaf node

corresponds to an original observation. Otherwise, it corresponds

to a non-singleton cluster.

These are data structures and routines for representing hierarchies as tree objects.

ClusterNode(id[, left, right, dist, count]) A tree node class for representing a cluster.

leaves_list(Z) Returns a list of leaf node ids

to_tree(Z[, rd]) Converts a hierarchical clustering encoded in the matrix Z(by

class scipy.cluster.hierarchy.ClusterNode(id,left=None,right=None,dist=0,count=1)

A tree node class for representing a cluster.

Leaf nodes correspond to original observations, while non-leaf nodes correspond to non-singleton clusters.

The to_tree function converts a matrix returned by the linkage function into an easy-to-use tree representation.

See Also

to_tree for converting a linkage matrix Zinto a tree object.

222 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Methods

get_count() The number of leaf nodes (original observations) belonging to the cluster node nd.

get_id() The identiﬁer of the target node.

get_left() Return a reference to the left child tree object.

get_right() Returns a reference to the right child tree object.

is_leaf() Returns True if the target node is a leaf.

pre_order([func]) Performs pre-order traversal without recursive function calls.

ClusterNode.get_count()

The number of leaf nodes (original observations) belonging to the cluster node nd. If the target node is a

leaf, 1 is returned.

Returns get_count : int

The number of leaf nodes below the target node.

ClusterNode.get_id()

The identiﬁer of the target node.

For 0<=i<n,icorresponds to original observation i. For n <= i < 2n-1,icorresponds to

non-singleton cluster formed at iteration i-n.

Returns id : int

The identiﬁer of the target node.

ClusterNode.get_left()

Return a reference to the left child tree object.

Returns left : ClusterNode

The left child of the target node. If the node is a leaf, None is returned.

ClusterNode.get_right()

Returns a reference to the right child tree object.

Returns right : ClusterNode

The left child of the target node. If the node is a leaf, None is returned.

ClusterNode.is_leaf()

Returns True if the target node is a leaf.

Returns leafness : bool

True if the target node is a leaf node.

ClusterNode.pre_order(func=<function <lambda> at 0x5336cf8>)

Performs pre-order traversal without recursive function calls.

When a leaf node is ﬁrst encountered, func is called with the leaf node as its argument, and its result is

appended to the list.

For example, the statement:

ids =root.pre_order(lambda x: x.id)

returns a list of the node ids corresponding to the leaf nodes of the tree as they appear from left to right.

Parameters func : function

Applied to each leaf ClusterNode object in the pre-order traversal. Given the

i’th leaf node in the pre-ordeR traversal n[i], the result of func(n[i]) is stored

in L[i]. If not provided, the index of the original observation to which the node

corresponds is used.

5.3. Hierarchical clustering (scipy.cluster.hierarchy) 223

SciPy Reference Guide, Release 0.13.0

Returns L: list

The pre-order traversal.

scipy.cluster.hierarchy.leaves_list(Z)

Returns a list of leaf node ids

The return corresponds to the observation vector index as it appears in the tree from left to right. Z is a linkage

matrix.

Parameters Z: ndarray

The hierarchical clustering encoded as a matrix. Zis a linkage matrix. See linkage

for more information.

Returns leaves_list : ndarray

The list of leaf node ids.

scipy.cluster.hierarchy.to_tree(Z,rd=False)

Converts a hierarchical clustering encoded in the matrix Z(by linkage) into an easy-to-use tree object.

The reference r to the root ClusterNode object is returned.

Each ClusterNode object has a left, right, dist, id, and count attribute. The left and right attributes point to

ClusterNode objects that were combined to generate the cluster. If both are None then the ClusterNode object

is a leaf node, its count must be 1, and its distance is meaningless but set to 0.

Note: This function is provided for the convenience of the library user. ClusterNodes are not used as input to

any of the functions in this library.

Parameters Z: ndarray

The linkage matrix in proper form (see the linkage function documentation).

rd : bool, optional

When False, a reference to the root ClusterNode object is returned. Otherwise, a tuple

(r,d) is returned. ris a reference to the root node while dis a dictionary mapping clus-

ter ids to ClusterNode references. If a cluster id is less than n, then it corresponds to a

singleton cluster (leaf node). See linkage for more information on the assignment

of cluster ids to clusters.

Returns L: list

The pre-order traversal.

These are predicates for checking the validity of linkage and inconsistency matrices as well as for checking isomor-

phism of two ﬂat cluster assignments.

is_valid_im(R[, warning, throw, name]) Returns True if the inconsistency matrix passed is valid.

is_valid_linkage(Z[, warning, throw, name]) Checks the validity of a linkage matrix.

is_isomorphic(T1, T2) Determines if two different cluster assignments are equivalent.

is_monotonic(Z) Returns True if the linkage passed is monotonic.

correspond(Z, Y) Checks for correspondence between linkage and condensed distance matrices

num_obs_linkage(Z) Returns the number of original observations of the linkage matrix passed.

scipy.cluster.hierarchy.is_valid_im(R,warning=False,throw=False,name=None)

Returns True if the inconsistency matrix passed is valid.

It must be a nby 4 numpy array of doubles. The standard deviations R[:,1] must be nonnegative. The link

counts R[:,2] must be positive and no greater than n−1.

Parameters R: ndarray

The inconsistency matrix to check for validity.

warning : bool, optional

When True, issues a Python warning if the linkage matrix passed is invalid.

224 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

throw : bool, optional

When True, throws a Python exception if the linkage matrix passed is invalid.

name : str, optional

This string refers to the variable name of the invalid linkage matrix.

Returns b: bool

True if the inconsistency matrix is valid.

scipy.cluster.hierarchy.is_valid_linkage(Z,warning=False,throw=False,name=None)

Checks the validity of a linkage matrix.

A linkage matrix is valid if it is a two dimensional ndarray (type double) with nrows and 4 columns. The ﬁrst

two columns must contain indices between 0 and 2n−1. For a given row i,0≤Z[i,0]≤i+n−1and

0≤Z[i, 1] ≤i+n−1(i.e. a cluster cannot join another cluster unless the cluster being joined has been

generated.)

Parameters Z: array_like

Linkage matrix.

warning : bool, optional

When True, issues a Python warning if the linkage matrix passed is invalid.

throw : bool, optional

When True, throws a Python exception if the linkage matrix passed is invalid.

name : str, optional

This string refers to the variable name of the invalid linkage matrix.

Returns b: bool

True iff the inconsistency matrix is valid.

scipy.cluster.hierarchy.is_isomorphic(T1,T2)

Determines if two different cluster assignments are equivalent.

Parameters T1 : array_like

An assignment of singleton cluster ids to ﬂat cluster ids.

T2 : array_like

An assignment of singleton cluster ids to ﬂat cluster ids.

Returns b: bool

Whether the ﬂat cluster assignments T1 and T2 are equivalent.

scipy.cluster.hierarchy.is_monotonic(Z)

Returns True if the linkage passed is monotonic.

The linkage is monotonic if for every cluster sand tjoined, the distance between them is no less than the

distance between any previously joined clusters.

Parameters Z: ndarray

The linkage matrix to check for monotonicity.

Returns b: bool

A boolean indicating whether the linkage is monotonic.

scipy.cluster.hierarchy.correspond(Z,Y)

Checks for correspondence between linkage and condensed distance matrices

They must have the same number of original observations for the check to succeed.

This function is useful as a sanity check in algorithms that make extensive use of linkage and distance matrices

that must correspond to the same set of original observations.

Parameters Z: array_like

The linkage matrix to check for correspondence.

Y: array_like

The condensed distance matrix to check for correspondence.

Returns b: bool

5.3. Hierarchical clustering (scipy.cluster.hierarchy) 225

SciPy Reference Guide, Release 0.13.0

A boolean indicating whether the linkage matrix and distance matrix could possibly

correspond to one another.

scipy.cluster.hierarchy.num_obs_linkage(Z)

Returns the number of original observations of the linkage matrix passed.

Parameters Z: ndarray

The linkage matrix on which to perform the operation.

Returns n: int

The number of original observations in the linkage.

Utility routines for plotting:

set_link_color_palette(palette) Set list of matplotlib color codes for dendrogram color_threshold.

scipy.cluster.hierarchy.set_link_color_palette(palette)

Set list of matplotlib color codes for dendrogram color_threshold.

Parameters palette : list

A list of matplotlib color codes. The order of the color codes is the order in which the

colors are cycled through when color thresholding in the dendrogram.

5.3.1 References

• MATLAB and MathWorks are registered trademarks of The MathWorks, Inc.

• Mathematica is a registered trademark of The Wolfram Research, Inc.

5.4 Constants (scipy.constants)

Physical and mathematical constants and units.

5.4.1 Mathematical constants

pi Pi

golden Golden ratio

226 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

5.4.2 Physical constants

cspeed of light in vacuum

mu_0 the magnetic constant µ0

epsilon_0 the electric constant (vacuum permittivity), 0

hthe Planck constant h

hbar ¯h=h/(2π)

GNewtonian constant of gravitation

gstandard acceleration of gravity

eelementary charge

Rmolar gas constant

alpha ﬁne-structure constant

N_A Avogadro constant

kBoltzmann constant

sigma Stefan-Boltzmann constant σ

Wien Wien displacement law constant

Rydberg Rydberg constant

m_e electron mass

m_p proton mass

m_n neutron mass

Constants database

In addition to the above variables, scipy.constants also contains the 2010 CODATA recommended values [CO-

DATA2010] database containing more physical constants.

value(key) Value in physical_constants indexed by key

unit(key) Unit in physical_constants indexed by key

precision(key) Relative precision in physical_constants indexed by key

find([sub, disp]) Return list of codata.physical_constant keys containing a given string.

ConstantWarning Accessing a constant no longer in current CODATA data set

scipy.constants.value(key)

Value in physical_constants indexed by key

Parameters key : Python string or unicode

Key in dictionary physical_constants

Returns value : ﬂoat

Value in physical_constants corresponding to key

See Also

codata Contains the description of physical_constants, which, as a dictionary literal object, does

not itself possess a docstring.

Examples

>>> from scipy.constants import codata

>>> codata.value(’elementary charge’)

1.602176487e-019

scipy.constants.unit(key)

Unit in physical_constants indexed by key

5.4. Constants (scipy.constants) 227

SciPy Reference Guide, Release 0.13.0

Parameters key : Python string or unicode

Key in dictionary physical_constants

Returns unit : Python string

Unit in physical_constants corresponding to key

See Also

codata Contains the description of physical_constants, which, as a dictionary literal object, does

not itself possess a docstring.

Examples

>>> from scipy.constants import codata

>>> codata.unit(u’proton mass’)

’kg’

scipy.constants.precision(key)

Relative precision in physical_constants indexed by key

Parameters key : Python string or unicode

Key in dictionary physical_constants

Returns prec : ﬂoat

Relative precision in physical_constants corresponding to key

See Also

codata Contains the description of physical_constants, which, as a dictionary literal object, does

not itself possess a docstring.

Examples

>>> from scipy.constants import codata

>>> codata.precision(u’proton mass’)

4.96226989798e-08

scipy.constants.find(sub=None,disp=False)

Return list of codata.physical_constant keys containing a given string.

Parameters sub : str, unicode

Sub-string to search keys for. By default, return all keys.

disp : bool

If True, print the keys that are found, and return None. Otherwise, return the list of

keys without printing anything.

Returns keys : list or None

If disp is False, the list of keys is returned. Otherwise, None is returned.

See Also

codata Contains the description of physical_constants, which, as a dictionary literal object, does

not itself possess a docstring.

exception scipy.constants.ConstantWarning

Accessing a constant no longer in current CODATA data set

scipy.constants.physical_constants

Dictionary of physical constants, of the format physical_constants[name] = (value, unit,

uncertainty).

Available constants:

228 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

alpha particle mass 6.64465675e-27 kg

alpha particle mass energy equivalent 5.97191967e-10 J

alpha particle mass energy equivalent in MeV 3727.37924 MeV

alpha particle mass in u 4.00150617913 u

alpha particle molar mass 0.00400150617912 kg mol^-1

alpha particle-electron mass ratio 7294.2995361

alpha particle-proton mass ratio 3.97259968933

Angstrom star 1.00001495e-10 m

atomic mass constant 1.660538921e-27 kg

atomic mass constant energy equivalent 1.492417954e-10 J

atomic mass constant energy equivalent in MeV 931.494061 MeV

atomic mass unit-electron volt relationship 931494061.0 eV

atomic mass unit-hartree relationship 34231776.845 E_h

atomic mass unit-hertz relationship 2.2523427168e+23 Hz

atomic mass unit-inverse meter relationship 7.5130066042e+14 m^-1

atomic mass unit-joule relationship 1.492417954e-10 J

atomic mass unit-kelvin relationship 1.08095408e+13 K

atomic mass unit-kilogram relationship 1.660538921e-27 kg

atomic unit of 1st hyperpolarizability 3.206361449e-53 C^3 m^3 J^-2

atomic unit of 2nd hyperpolarizability 6.23538054e-65 C^4 m^4 J^-3

atomic unit of action 1.054571726e-34 J s

atomic unit of charge 1.602176565e-19 C

atomic unit of charge density 1.081202338e+12 C m^-3

atomic unit of current 0.00662361795 A

atomic unit of electric dipole mom. 8.47835326e-30 C m

atomic unit of electric field 5.14220652e+11 V m^-1

atomic unit of electric field gradient 9.717362e+21 V m^-2

atomic unit of electric polarizability 1.6487772754e-41 C^2 m^2 J^-1

atomic unit of electric potential 27.21138505 V

atomic unit of electric quadrupole mom. 4.486551331e-40 C m^2

atomic unit of energy 4.35974434e-18 J

atomic unit of force 8.23872278e-08 N

atomic unit of length 5.2917721092e-11 m

atomic unit of mag. dipole mom. 1.854801936e-23 J T^-1

atomic unit of mag. flux density 235051.7464 T

atomic unit of magnetizability 7.891036607e-29 J T^-2

atomic unit of mass 9.10938291e-31 kg

atomic unit of mom.um 1.99285174e-24 kg m s^-1

atomic unit of permittivity 1.11265005605e-10 F m^-1

atomic unit of time 2.4188843265e-17 s

atomic unit of velocity 2187691.26379 m s^-1

Avogadro constant 6.02214129e+23 mol^-1

Bohr magneton 9.27400968e-24 J T^-1

Bohr magneton in eV/T 5.7883818066e-05 eV T^-1

Bohr magneton in Hz/T 13996245550.0 Hz T^-1

Bohr magneton in inverse meters per tesla 46.6864498 m^-1 T^-1

Bohr magneton in K/T 0.67171388 K T^-1

Bohr radius 5.2917721092e-11 m

Boltzmann constant 1.3806488e-23 J K^-1

Boltzmann constant in eV/K 8.6173324e-05 eV K^-1

Continued on next page

5.4. Constants (scipy.constants) 229

SciPy Reference Guide, Release 0.13.0

Table 5.11 – continued from previous page

Boltzmann constant in Hz/K 20836618000.0 Hz K^-1

Boltzmann constant in inverse meters per kelvin 69.503476 m^-1 K^-1

characteristic impedance of vacuum 376.730313462 ohm

classical electron radius 2.8179403267e-15 m

Compton wavelength 2.4263102389e-12 m

Compton wavelength over 2 pi 3.86159268e-13 m

conductance quantum 7.7480917346e-05 S

conventional value of Josephson constant 4.835979e+14 Hz V^-1

conventional value of von Klitzing constant 25812.807 ohm

Cu x unit 1.00207697e-13 m

deuteron g factor 0.8574382308

deuteron mag. mom. 4.33073489e-27 J T^-1

deuteron mag. mom. to Bohr magneton ratio 0.0004669754556

deuteron mag. mom. to nuclear magneton ratio 0.8574382308

deuteron mass 3.34358348e-27 kg

deuteron mass energy equivalent 3.00506297e-10 J

deuteron mass energy equivalent in MeV 1875.612859 MeV

deuteron mass in u 2.01355321271 u

deuteron molar mass 0.00201355321271 kg mol^-1

deuteron rms charge radius 2.1424e-15 m

deuteron-electron mag. mom. ratio -0.0004664345537

deuteron-electron mass ratio 3670.4829652

deuteron-neutron mag. mom. ratio -0.44820652

deuteron-proton mag. mom. ratio 0.307012207

deuteron-proton mass ratio 1.99900750097

electric constant 8.85418781762e-12 F m^-1

electron charge to mass quotient -1.758820088e+11 C kg^-1

electron g factor -2.00231930436

electron gyromag. ratio 1.760859708e+11 s^-1 T^-1

electron gyromag. ratio over 2 pi 28024.95266 MHz T^-1

electron mag. mom. -9.2847643e-24 J T^-1

electron mag. mom. anomaly 0.00115965218076

electron mag. mom. to Bohr magneton ratio -1.00115965218

electron mag. mom. to nuclear magneton ratio -1838.2819709

electron mass 9.10938291e-31 kg

electron mass energy equivalent 8.18710506e-14 J

electron mass energy equivalent in MeV 0.510998928 MeV

electron mass in u 0.00054857990946 u

electron molar mass 5.4857990946e-07 kg mol^-1

electron to alpha particle mass ratio 0.000137093355578

electron to shielded helion mag. mom. ratio 864.058257

electron to shielded proton mag. mom. ratio -658.2275971

electron volt 1.602176565e-19 J

electron volt-atomic mass unit relationship 1.07354415e-09 u

electron volt-hartree relationship 0.03674932379 E_h

electron volt-hertz relationship 2.417989348e+14 Hz

electron volt-inverse meter relationship 806554.429 m^-1

electron volt-joule relationship 1.602176565e-19 J

electron volt-kelvin relationship 11604.519 K

electron volt-kilogram relationship 1.782661845e-36 kg

Continued on next page

230 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Table 5.11 – continued from previous page

electron-deuteron mag. mom. ratio -2143.923498

electron-deuteron mass ratio 0.00027244371095

electron-helion mass ratio 0.00018195430761

electron-muon mag. mom. ratio 206.7669896

electron-muon mass ratio 0.00483633166

electron-neutron mag. mom. ratio 960.9205

electron-neutron mass ratio 0.00054386734461

electron-proton mag. mom. ratio -658.2106848

electron-proton mass ratio 0.00054461702178

electron-tau mass ratio 0.000287592

electron-triton mass ratio 0.00018192000653

elementary charge 1.602176565e-19 C

elementary charge over h 2.417989348e+14 A J^-1

Faraday constant 96485.3365 C mol^-1

Faraday constant for conventional electric current 96485.3321 C_90 mol^-1

Fermi coupling constant 1.166364e-05 GeV^-2

fine-structure constant 0.0072973525698

first radiation constant 3.74177153e-16 W m^2

first radiation constant for spectral radiance 1.191042869e-16 W m^2 sr^-1

Hartree energy 4.35974434e-18 J

Hartree energy in eV 27.21138505 eV

hartree-atomic mass unit relationship 2.9212623246e-08 u

hartree-electron volt relationship 27.21138505 eV

hartree-hertz relationship 6.57968392073e+15 Hz

hartree-inverse meter relationship 21947463.1371 m^-1

hartree-joule relationship 4.35974434e-18 J

hartree-kelvin relationship 315775.04 K

hartree-kilogram relationship 4.85086979e-35 kg

helion g factor -4.255250613

helion mag. mom. -1.074617486e-26 J T^-1

helion mag. mom. to Bohr magneton ratio -0.001158740958

helion mag. mom. to nuclear magneton ratio -2.127625306

helion mass 5.00641234e-27 kg

helion mass energy equivalent 4.49953902e-10 J

helion mass energy equivalent in MeV 2808.391482 MeV

helion mass in u 3.0149322468 u

helion molar mass 0.0030149322468 kg mol^-1

helion-electron mass ratio 5495.8852754

helion-proton mass ratio 2.9931526707

hertz-atomic mass unit relationship 4.4398216689e-24 u

hertz-electron volt relationship 4.135667516e-15 eV

hertz-hartree relationship 1.519829846e-16 E_h

hertz-inverse meter relationship 3.33564095198e-09 m^-1

hertz-joule relationship 6.62606957e-34 J

hertz-kelvin relationship 4.7992434e-11 K

hertz-kilogram relationship 7.37249668e-51 kg

inverse fine-structure constant 137.035999074

inverse meter-atomic mass unit relationship 1.3310250512e-15 u

inverse meter-electron volt relationship 1.23984193e-06 eV

inverse meter-hartree relationship 4.55633525276e-08 E_h

Continued on next page

5.4. Constants (scipy.constants) 231

SciPy Reference Guide, Release 0.13.0

Table 5.11 – continued from previous page

inverse meter-hertz relationship 299792458.0 Hz

inverse meter-joule relationship 1.986445684e-25 J

inverse meter-kelvin relationship 0.01438777 K

inverse meter-kilogram relationship 2.210218902e-42 kg

inverse of conductance quantum 12906.4037217 ohm

Josephson constant 4.8359787e+14 Hz V^-1

joule-atomic mass unit relationship 6700535850.0 u

joule-electron volt relationship 6.24150934e+18 eV

joule-hartree relationship 2.29371248e+17 E_h

joule-hertz relationship 1.509190311e+33 Hz

joule-inverse meter relationship 5.03411701e+24 m^-1

joule-kelvin relationship 7.2429716e+22 K

joule-kilogram relationship 1.11265005605e-17 kg

kelvin-atomic mass unit relationship 9.2510868e-14 u

kelvin-electron volt relationship 8.6173324e-05 eV

kelvin-hartree relationship 3.1668114e-06 E_h

kelvin-hertz relationship 20836618000.0 Hz

kelvin-inverse meter relationship 69.503476 m^-1

kelvin-joule relationship 1.3806488e-23 J

kelvin-kilogram relationship 1.536179e-40 kg

kilogram-atomic mass unit relationship 6.02214129e+26 u

kilogram-electron volt relationship 5.60958885e+35 eV

kilogram-hartree relationship 2.061485968e+34 E_h

kilogram-hertz relationship 1.356392608e+50 Hz

kilogram-inverse meter relationship 4.52443873e+41 m^-1

kilogram-joule relationship 8.98755178737e+16 J

kilogram-kelvin relationship 6.5096582e+39 K

lattice parameter of silicon 5.431020504e-10 m

Loschmidt constant (273.15 K, 100 kPa) 2.6516462e+25 m^-3

Loschmidt constant (273.15 K, 101.325 kPa) 2.6867805e+25 m^-3

mag. constant 1.25663706144e-06 N A^-2

mag. flux quantum 2.067833758e-15 Wb

Mo x unit 1.00209952e-13 m

molar gas constant 8.3144621 J mol^-1 K^-1

molar mass constant 0.001 kg mol^-1

molar mass of carbon-12 0.012 kg mol^-1

molar Planck constant 3.9903127176e-10 J s mol^-1

molar Planck constant times c 0.119626565779 J m mol^-1

molar volume of ideal gas (273.15 K, 100 kPa) 0.022710953 m^3 mol^-1

molar volume of ideal gas (273.15 K, 101.325 kPa) 0.022413968 m^3 mol^-1

molar volume of silicon 1.205883301e-05 m^3 mol^-1

muon Compton wavelength 1.173444103e-14 m

muon Compton wavelength over 2 pi 1.867594294e-15 m

muon g factor -2.0023318418

muon mag. mom. -4.49044807e-26 J T^-1

muon mag. mom. anomaly 0.00116592091

muon mag. mom. to Bohr magneton ratio -0.00484197044

muon mag. mom. to nuclear magneton ratio -8.89059697

muon mass 1.883531475e-28 kg

muon mass energy equivalent 1.692833667e-11 J

Continued on next page

232 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Table 5.11 – continued from previous page

muon mass energy equivalent in MeV 105.6583715 MeV

muon mass in u 0.1134289267 u

muon molar mass 0.0001134289267 kg mol^-1

muon-electron mass ratio 206.7682843

muon-neutron mass ratio 0.1124545177

muon-proton mag. mom. ratio -3.183345107

muon-proton mass ratio 0.1126095272

muon-tau mass ratio 0.0594649

natural unit of action 1.054571726e-34 J s

natural unit of action in eV s 6.58211928e-16 eV s

natural unit of energy 8.18710506e-14 J

natural unit of energy in MeV 0.510998928 MeV

natural unit of length 3.86159268e-13 m

natural unit of mass 9.10938291e-31 kg

natural unit of mom.um 2.73092429e-22 kg m s^-1

natural unit of mom.um in MeV/c 0.510998928 MeV/c

natural unit of time 1.28808866833e-21 s

natural unit of velocity 299792458.0 m s^-1

neutron Compton wavelength 1.3195909068e-15 m

neutron Compton wavelength over 2 pi 2.1001941568e-16 m

neutron g factor -3.82608545

neutron gyromag. ratio 183247179.0 s^-1 T^-1

neutron gyromag. ratio over 2 pi 29.1646943 MHz T^-1

neutron mag. mom. -9.6623647e-27 J T^-1

neutron mag. mom. to Bohr magneton ratio -0.00104187563

neutron mag. mom. to nuclear magneton ratio -1.91304272

neutron mass 1.674927351e-27 kg

neutron mass energy equivalent 1.505349631e-10 J

neutron mass energy equivalent in MeV 939.565379 MeV

neutron mass in u 1.008664916 u

neutron molar mass 0.001008664916 kg mol^-1

neutron to shielded proton mag. mom. ratio -0.68499694

neutron-electron mag. mom. ratio 0.00104066882

neutron-electron mass ratio 1838.6836605

neutron-muon mass ratio 8.892484

neutron-proton mag. mom. ratio -0.68497934

neutron-proton mass difference 2.30557392e-30

neutron-proton mass difference energy equivalent 2.0721465e-13

neutron-proton mass difference energy equivalent in MeV 1.29333217

neutron-proton mass difference in u 0.00138844919

neutron-proton mass ratio 1.00137841917

neutron-tau mass ratio 0.52879

Newtonian constant of gravitation 6.67384e-11 m^3 kg^-1 s^-2

Newtonian constant of gravitation over h-bar c 6.70837e-39 (GeV/c^2)^-2

nuclear magneton 5.05078353e-27 J T^-1

nuclear magneton in eV/T 3.1524512605e-08 eV T^-1

nuclear magneton in inverse meters per tesla 0.02542623527 m^-1 T^-1

nuclear magneton in K/T 0.00036582682 K T^-1

nuclear magneton in MHz/T 7.62259357 MHz T^-1

Planck constant 6.62606957e-34 J s

Continued on next page

5.4. Constants (scipy.constants) 233

SciPy Reference Guide, Release 0.13.0

Table 5.11 – continued from previous page

Planck constant in eV s 4.135667516e-15 eV s

Planck constant over 2 pi 1.054571726e-34 J s

Planck constant over 2 pi in eV s 6.58211928e-16 eV s

Planck constant over 2 pi times c in MeV fm 197.3269718 MeV fm

Planck length 1.616199e-35 m

Planck mass 2.17651e-08 kg

Planck mass energy equivalent in GeV 1.220932e+19 GeV

Planck temperature 1.416833e+32 K

Planck time 5.39106e-44 s

proton charge to mass quotient 95788335.8 C kg^-1

proton Compton wavelength 1.32140985623e-15 m

proton Compton wavelength over 2 pi 2.1030891047e-16 m

proton g factor 5.585694713

proton gyromag. ratio 267522200.5 s^-1 T^-1

proton gyromag. ratio over 2 pi 42.5774806 MHz T^-1

proton mag. mom. 1.410606743e-26 J T^-1

proton mag. mom. to Bohr magneton ratio 0.00152103221

proton mag. mom. to nuclear magneton ratio 2.792847356

proton mag. shielding correction 2.5694e-05

proton mass 1.672621777e-27 kg

proton mass energy equivalent 1.503277484e-10 J

proton mass energy equivalent in MeV 938.272046 MeV

proton mass in u 1.00727646681 u

proton molar mass 0.00100727646681 kg mol^-1

proton rms charge radius 8.775e-16 m

proton-electron mass ratio 1836.15267245

proton-muon mass ratio 8.88024331

proton-neutron mag. mom. ratio -1.45989806

proton-neutron mass ratio 0.99862347826

proton-tau mass ratio 0.528063

quantum of circulation 0.0003636947552 m^2 s^-1

quantum of circulation times 2 0.0007273895104 m^2 s^-1

Rydberg constant 10973731.5685 m^-1

Rydberg constant times c in Hz 3.28984196036e+15 Hz

Rydberg constant times hc in eV 13.60569253 eV

Rydberg constant times hc in J 2.179872171e-18 J

Sackur-Tetrode constant (1 K, 100 kPa) -1.1517078

Sackur-Tetrode constant (1 K, 101.325 kPa) -1.1648708

second radiation constant 0.01438777 m K

shielded helion gyromag. ratio 203789465.9 s^-1 T^-1

shielded helion gyromag. ratio over 2 pi 32.43410084 MHz T^-1

shielded helion mag. mom. -1.074553044e-26 J T^-1

shielded helion mag. mom. to Bohr magneton ratio -0.001158671471

shielded helion mag. mom. to nuclear magneton ratio -2.127497718

shielded helion to proton mag. mom. ratio -0.761766558

shielded helion to shielded proton mag. mom. ratio -0.7617861313

shielded proton gyromag. ratio 267515326.8 s^-1 T^-1

shielded proton gyromag. ratio over 2 pi 42.5763866 MHz T^-1

shielded proton mag. mom. 1.410570499e-26 J T^-1

shielded proton mag. mom. to Bohr magneton ratio 0.001520993128

Continued on next page

234 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Table 5.11 – continued from previous page

shielded proton mag. mom. to nuclear magneton ratio 2.792775598

speed of light in vacuum 299792458.0 m s^-1

standard acceleration of gravity 9.80665 m s^-2

standard atmosphere 101325.0 Pa

standard-state pressure 100000.0 Pa

Stefan-Boltzmann constant 5.670373e-08 W m^-2 K^-4

tau Compton wavelength 6.97787e-16 m

tau Compton wavelength over 2 pi 1.11056e-16 m

tau mass 3.16747e-27 kg

tau mass energy equivalent 2.84678e-10 J

tau mass energy equivalent in MeV 1776.82 MeV

tau mass in u 1.90749 u

tau molar mass 0.00190749 kg mol^-1

tau-electron mass ratio 3477.15

tau-muon mass ratio 16.8167

tau-neutron mass ratio 1.89111

tau-proton mass ratio 1.89372

Thomson cross section 6.652458734e-29 m^2

triton g factor 5.957924896

triton mag. mom. 1.504609447e-26 J T^-1

triton mag. mom. to Bohr magneton ratio 0.001622393657

triton mag. mom. to nuclear magneton ratio 2.978962448

triton mass 5.0073563e-27 kg

triton mass energy equivalent 4.50038741e-10 J

triton mass energy equivalent in MeV 2808.921005 MeV

triton mass in u 3.0155007134 u

triton molar mass 0.0030155007134 kg mol^-1

triton-electron mass ratio 5496.9215267

triton-proton mass ratio 2.9937170308

unified atomic mass unit 1.660538921e-27 kg

von Klitzing constant 25812.8074434 ohm

weak mixing angle 0.2223

Wien frequency displacement law constant 58789254000.0 Hz K^-1

Wien wavelength displacement law constant 0.0028977721 m K

{220} lattice spacing of silicon 1.920155714e-10 m

5.4. Constants (scipy.constants) 235

SciPy Reference Guide, Release 0.13.0

5.4.3 Units

SI preﬁxes

yotta 1024

zetta 1021

exa 1018

peta 1015

tera 1012

giga 109

mega 106

kilo 103

hecto 102

deka 101

deci 10−1

centi 10−2

milli 10−3

micro 10−6

nano 10−9

pico 10−12

femto 10−15

atto 10−18

zepto 10−21

Binary preﬁxes

kibi 210

mebi 220

gibi 230

tebi 240

pebi 250

exbi 260

zebi 270

yobi 280

Weight

gram 10−3kg

metric_ton 103kg

grain one grain in kg

lb one pound (avoirdupous) in kg

oz one ounce in kg

stone one stone in kg

grain one grain in kg

long_ton one long ton in kg

short_ton one short ton in kg

troy_ounce one Troy ounce in kg

troy_pound one Troy pound in kg

carat one carat in kg

m_u atomic mass constant (in kg)

236 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Angle

degree degree in radians

arcmin arc minute in radians

arcsec arc second in radians

Time

minute one minute in seconds

hour one hour in seconds

day one day in seconds

week one week in seconds

year one year (365 days) in seconds

Julian_year one Julian year (365.25 days) in seconds

Length

inch one inch in meters

foot one foot in meters

yard one yard in meters

mile one mile in meters

mil one mil in meters

pt one point in meters

survey_foot one survey foot in meters

survey_mile one survey mile in meters

nautical_mile one nautical mile in meters

fermi one Fermi in meters

angstrom one Angstrom in meters

micron one micron in meters

au one astronomical unit in meters

light_year one light year in meters

parsec one parsec in meters

Pressure

atm standard atmosphere in pascals

bar one bar in pascals

torr one torr (mmHg) in pascals

psi one psi in pascals

Area

hectare one hectare in square meters

acre one acre in square meters

5.4. Constants (scipy.constants) 237

SciPy Reference Guide, Release 0.13.0

Volume

liter one liter in cubic meters

gallon one gallon (US) in cubic meters

gallon_imp one gallon (UK) in cubic meters

fluid_ounce one ﬂuid ounce (US) in cubic meters

fluid_ounce_imp one ﬂuid ounce (UK) in cubic meters

bbl one barrel in cubic meters

Speed

kmh kilometers per hour in meters per second

mph miles per hour in meters per second

mach one Mach (approx., at 15 C, 1 atm) in meters per second

knot one knot in meters per second

Temperature

zero_Celsius zero of Celsius scale in Kelvin

degree_Fahrenheit one Fahrenheit (only differences) in Kelvins

C2K(C) Convert Celsius to Kelvin

K2C(K) Convert Kelvin to Celsius

F2C(F) Convert Fahrenheit to Celsius

C2F(C) Convert Celsius to Fahrenheit

F2K(F) Convert Fahrenheit to Kelvin

K2F(K) Convert Kelvin to Fahrenheit

scipy.constants.C2K(C)

Convert Celsius to Kelvin

Parameters C: array_like

Celsius temperature(s) to be converted.

Returns K: ﬂoat or array of ﬂoats

Equivalent Kelvin temperature(s).

Notes

Computes K = C + zero_Celsius where zero_Celsius = 273.15, i.e., (the absolute value of) temper-

ature “absolute zero” as measured in Celsius.

Examples

>>> from scipy.constants.constants import C2K

>>> C2K(_np.array([-40,40.0]))

array([ 233.15, 313.15])

scipy.constants.K2C(K)

Convert Kelvin to Celsius

Parameters K: array_like

Kelvin temperature(s) to be converted.

Returns C: ﬂoat or array of ﬂoats

238 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Equivalent Celsius temperature(s).

Notes

Computes C = K - zero_Celsius where zero_Celsius = 273.15, i.e., (the absolute value of) temper-

ature “absolute zero” as measured in Celsius.

Examples

>>> from scipy.constants.constants import K2C

>>> K2C(_np.array([233.15,313.15]))

array([-40., 40.])

scipy.constants.F2C(F)

Convert Fahrenheit to Celsius

Parameters F: array_like

Fahrenheit temperature(s) to be converted.

Returns C: ﬂoat or array of ﬂoats

Equivalent Celsius temperature(s).

Notes

Computes C = (F - 32) / 1.8.

Examples

>>> from scipy.constants.constants import F2C

>>> F2C(_np.array([-40,40.0]))

array([-40. , 4.44444444])

scipy.constants.C2F(C)

Convert Celsius to Fahrenheit

Parameters C: array_like

Celsius temperature(s) to be converted.

Returns F: ﬂoat or array of ﬂoats

Equivalent Fahrenheit temperature(s).

Notes

Computes F = 1.8 *C + 32.

Examples

>>> from scipy.constants.constants import C2F

>>> C2F(_np.array([-40,40.0]))

array([ -40., 104.])

scipy.constants.F2K(F)

Convert Fahrenheit to Kelvin

Parameters F: array_like

Fahrenheit temperature(s) to be converted.

Returns K: ﬂoat or array of ﬂoats

Equivalent Kelvin temperature(s).

Notes

Computes K = (F - 32)/1.8 + zero_Celsius where zero_Celsius = 273.15, i.e., (the absolute

value of) temperature “absolute zero” as measured in Celsius.

5.4. Constants (scipy.constants) 239

SciPy Reference Guide, Release 0.13.0

Examples

>>> from scipy.constants.constants import F2K

>>> F2K(_np.array([-40,104]))

array([ 233.15, 313.15])

scipy.constants.K2F(K)

Convert Kelvin to Fahrenheit

Parameters K: array_like

Kelvin temperature(s) to be converted.

Returns F: ﬂoat or array of ﬂoats

Equivalent Fahrenheit temperature(s).

Notes

Computes F = 1.8 *(K - zero_Celsius) + 32 where zero_Celsius = 273.15, i.e., (the abso-

lute value of) temperature “absolute zero” as measured in Celsius.

Examples

>>> from scipy.constants.constants import K2F

>>> K2F(_np.array([233.15,313.15]))

array([ -40., 104.])

Energy

eV one electron volt in Joules

calorie one calorie (thermochemical) in Joules

calorie_IT one calorie (International Steam Table calorie, 1956) in Joules

erg one erg in Joules

Btu one British thermal unit (International Steam Table) in Joules

Btu_th one British thermal unit (thermochemical) in Joules

ton_TNT one ton of TNT in Joules

Power

hp one horsepower in watts

Force

dyn one dyne in newtons

lbf one pound force in newtons

kgf one kilogram force in newtons

Optics

lambda2nu(lambda_) Convert wavelength to optical frequency

nu2lambda(nu) Convert optical frequency to wavelength.

240 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.constants.lambda2nu(lambda_)

Convert wavelength to optical frequency

Parameters lambda : array_like

Wavelength(s) to be converted.

Returns nu : ﬂoat or array of ﬂoats

Equivalent optical frequency.

Notes

Computes nu = c / lambda where c = 299792458.0, i.e., the (vacuum) speed of light in meters/second.

Examples

>>> from scipy.constants.constants import lambda2nu

>>> lambda2nu(_np.array((1, speed_of_light)))

array([ 2.99792458e+08, 1.00000000e+00])

scipy.constants.nu2lambda(nu)

Convert optical frequency to wavelength.

Parameters nu : array_like

Optical frequency to be converted.

Returns lambda : ﬂoat or array of ﬂoats

Equivalent wavelength(s).

Notes

Computes lambda = c / nu where c = 299792458.0, i.e., the (vacuum) speed of light in meters/second.

Examples

>>> from scipy.constants.constants import nu2lambda

>>> nu2lambda(_np.array((1, speed_of_light)))

array([ 2.99792458e+08, 1.00000000e+00])

5.4.4 References

5.5 Discrete Fourier transforms (scipy.fftpack)

5.5.1 Fast Fourier Transforms (FFTs)

fft(x[, n, axis, overwrite_x]) Return discrete Fourier transform of real or complex sequence.

ifft(x[, n, axis, overwrite_x]) Return discrete inverse Fourier transform of real or complex sequence.

fft2(x[, shape, axes, overwrite_x]) 2-D discrete Fourier transform.

ifft2(x[, shape, axes, overwrite_x]) 2-D discrete inverse Fourier transform of real or complex sequence.

fftn(x[, shape, axes, overwrite_x]) Return multidimensional discrete Fourier transform.

ifftn(x[, shape, axes, overwrite_x]) Return inverse multi-dimensional discrete Fourier transform of

rfft(x[, n, axis, overwrite_x]) Discrete Fourier transform of a real sequence.

irfft(x[, n, axis, overwrite_x]) Return inverse discrete Fourier transform of real sequence x.

dct(x[, type, n, axis, norm, overwrite_x]) Return the Discrete Cosine Transform of arbitrary type sequence x.

idct(x[, type, n, axis, norm, overwrite_x]) Return the Inverse Discrete Cosine Transform of an arbitrary type sequence.

5.5. Discrete Fourier transforms (scipy.fftpack) 241

SciPy Reference Guide, Release 0.13.0

scipy.fftpack.fft(x,n=None,axis=-1,overwrite_x=0)

Return discrete Fourier transform of real or complex sequence.

The returned complex array contains y(0), y(1),..., y(n-1) where

y(j) = (x *exp(-2*pi*sqrt(-1)*j*np.arange(n)/n)).sum().

Parameters x: array_like

Array to Fourier transform.

n: int, optional

Length of the Fourier transform. If n < x.shape[axis],xis truncated.

If n > x.shape[axis],xis zero-padded. The default results in n =

x.shape[axis].

axis : int, optional

Axis along which the fft’s are computed; the default is over the last axis (i.e.,

axis=-1).

overwrite_x : bool, optional

If True the contents of xcan be destroyed; the default is False.

Returns z: complex ndarray

with the elements:

[y(0),y(1),..,y(n/2),y(1-n/2),...,y(-1)] if n is even

[y(0),y(1),..,y((n-1)/2),y(-(n-1)/2),...,y(-1)] if n is odd

where:

y(j) = sum[k=0..n-1] x[k] *exp(-sqrt(-1)*j*k*2*pi/n), j = 0..n-1

Note that y(-j) = y(n-j).conjugate().

See Also

ifft Inverse FFT

rfft FFT of a real sequence

Notes

The packing of the result is “standard”: If A = fft(a, n), then A[0] contains the zero-frequency term,

A[1:n/2] contains the positive-frequency terms, and A[n/2:] contains the negative-frequency terms, in

order of decreasingly negative frequency. So for an 8-point transform, the frequencies of the result are [0, 1, 2,

3, -4, -3, -2, -1]. To rearrange the fft output so that the zero-frequency component is centered, like [-4, -3, -2, -1,

0, 1, 2, 3], use fftshift.

For neven, A[n/2] contains the sum of the positive and negative-frequency terms. For neven and xreal,

A[n/2] will always be real.

This function is most efﬁcient when nis a power of two.

Examples

>>> from scipy.fftpack import fft, ifft

>>> x=np.arange(5)

>>> np.allclose(fft(ifft(x)), x, atol=1e-15)# within numerical accuracy.

True

scipy.fftpack.ifft(x,n=None,axis=-1,overwrite_x=0)

Return discrete inverse Fourier transform of real or complex sequence.

The returned complex array contains y(0), y(1),..., y(n-1) where

y(j) = (x *exp(2*pi*sqrt(-1)*j*np.arange(n)/n)).mean().

242 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Parameters x: array_like

Transformed data to invert.

n: int, optional

Length of the inverse Fourier transform. If n < x.shape[axis],xis trun-

cated. If n > x.shape[axis],xis zero-padded. The default results in n =

x.shape[axis].

axis : int, optional

Axis along which the ifft’s are computed; the default is over the last axis (i.e.,

axis=-1).

overwrite_x : bool, optional

If True the contents of xcan be destroyed; the default is False.

scipy.fftpack.fft2(x,shape=None,axes=(-2,-1),overwrite_x=0)

2-D discrete Fourier transform.

Return the two-dimensional discrete Fourier transform of the 2-D argument x.

See Also

fftn for detailed information.

scipy.fftpack.ifft2(x,shape=None,axes=(-2,-1),overwrite_x=0)

2-D discrete inverse Fourier transform of real or complex sequence.

Return inverse two-dimensional discrete Fourier transform of arbitrary type sequence x.

See ifft for more information.

See Also

fft2,ifft

scipy.fftpack.fftn(x,shape=None,axes=None,overwrite_x=0)

Return multidimensional discrete Fourier transform.

The returned array contains:

y[j_1,..,j_d] = sum[k_1=0..n_1-1, ..., k_d=0..n_d-1]

x[k_1,..,k_d] *prod[i=1..d] exp(-sqrt(-1)*2*pi/n_i *j_i *k_i)

where d = len(x.shape) and n = x.shape. Note that y[..., -j_i, ...] = y[..., n_i-j_i,

...].conjugate().

Parameters x: array_like

The (n-dimensional) array to transform.

shape : tuple of ints, optional

The shape of the result. If both shape and axes (see below) are None,

shape is x.shape; if shape is None but axes is not None, then shape is

scipy.take(x.shape, axes, axis=0). If shape[i] > x.shape[i],

the i-th dimension is padded with zeros. If shape[i] < x.shape[i], the i-th

dimension is truncated to length shape[i].

axes : array_like of ints, optional

The axes of x(yif shape is not None) along which the transform is applied.

overwrite_x : bool, optional

If True, the contents of xcan be destroyed. Default is False.

Returns y: complex-valued n-dimensional numpy array

The (n-dimensional) DFT of the input array.

5.5. Discrete Fourier transforms (scipy.fftpack) 243

SciPy Reference Guide, Release 0.13.0

See Also

ifftn

Examples

>>> y=(-np.arange(16), 8-np.arange(16), np.arange(16))

>>> np.allclose(y, fftn(ifftn(y)))

True

scipy.fftpack.ifftn(x,shape=None,axes=None,overwrite_x=0)

Return inverse multi-dimensional discrete Fourier transform of arbitrary type sequence x.

The returned array contains:

y[j_1,..,j_d] = 1/p *sum[k_1=0..n_1-1, ..., k_d=0..n_d-1]

x[k_1,..,k_d] *prod[i=1..d] exp(sqrt(-1)*2*pi/n_i *j_i *k_i)

where d = len(x.shape),n = x.shape, and p = prod[i=1..d] n_i.

For description of parameters see fftn.

See Also

fftn for detailed information.

scipy.fftpack.rfft(x,n=None,axis=-1,overwrite_x=0)

Discrete Fourier transform of a real sequence.

The returned real arrays contains:

[y(0),Re(y(1)),Im(y(1)),...,Re(y(n/2))] if n is even

[y(0),Re(y(1)),Im(y(1)),...,Re(y(n/2)),Im(y(n/2))] if n is odd

where

y(j) = sum[k=0..n-1] x[k] *exp(-sqrt(-1)*j*k*2*pi/n)

j = 0..n-1

Note that y(-j) == y(n-j).conjugate().

Parameters x: array_like, real-valued

The data to transform.

n: int, optional

Deﬁnes the length of the Fourier transform. If nis not speciﬁed (the default)

then n = x.shape[axis]. If n < x.shape[axis],xis truncated, if n >

x.shape[axis],xis zero-padded.

axis : int, optional

The axis along which the transform is applied. The default is the last axis.

overwrite_x : bool, optional

If set to true, the contents of xcan be overwritten. Default is False.

See Also

fft,irfft,scipy.fftpack.basic

Notes

Within numerical accuracy, y == rfft(irfft(y)).

244 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.fftpack.irfft(x,n=None,axis=-1,overwrite_x=0)

Return inverse discrete Fourier transform of real sequence x.

The contents of x is interpreted as the output of the rfft(..) function.

Parameters x: array_like

Transformed data to invert.

n: int, optional

Length of the inverse Fourier transform. If n < x.shape[axis], x is truncated. If n >

x.shape[axis], x is zero-padded. The default results in n = x.shape[axis].

axis : int, optional

Axis along which the ifft’s are computed; the default is over the last axis (i.e., axis=-1).

overwrite_x : bool, optional

If True the contents of xcan be destroyed; the default is False.

Returns irfft : ndarray of ﬂoats

The inverse discrete Fourier transform.

See Also

rfft,ifft

Notes

The returned real array contains:

[y(0),y(1),...,y(n-1)]

where for n is even:

y(j) = 1/n (sum[k=1..n/2-1] (x[2*k-1]+sqrt(-1)*x[2*k])

*exp(sqrt(-1)*j*k*2*pi/n)

+ c.c. + x[0] + (-1)**(j) x[n-1])

and for n is odd:

y(j) = 1/n (sum[k=1..(n-1)/2] (x[2*k-1]+sqrt(-1)*x[2*k])

*exp(sqrt(-1)*j*k*2*pi/n)

+ c.c. + x[0])

c.c. denotes complex conjugate of preceeding expression.

For details on input parameters, see rfft.

scipy.fftpack.dct(x,type=2,n=None,axis=-1,norm=None,overwrite_x=0)

Return the Discrete Cosine Transform of arbitrary type sequence x.

Parameters x: array_like

The input array.

type : {1, 2, 3}, optional

Type of the DCT (see Notes). Default type is 2.

n: int, optional

Length of the transform.

axis : int, optional

Axis over which to compute the transform.

norm : {None, ‘ortho’}, optional

Normalization mode (see Notes). Default is None.

overwrite_x : bool, optional

If True the contents of x can be destroyed. (default=False)

Returns y: ndarray of real

The transformed input array.

5.5. Discrete Fourier transforms (scipy.fftpack) 245

SciPy Reference Guide, Release 0.13.0

See Also

idct

Notes

For a single dimension array x,dct(x, norm=’ortho’) is equal to MATLAB dct(x).

There are theoretically 8 types of the DCT, only the ﬁrst 3 types are implemented in scipy. ‘The’ DCT generally

refers to DCT type 2, and ‘the’ Inverse DCT generally refers to DCT type 3.

type I

There are several deﬁnitions of the DCT-I; we use the following (for norm=None):

N-2

y[k] = x[0] + (-1)**k x[N-1] + 2 *sum x[n]*cos(pi*k*n/(N-1))

n=1

Only None is supported as normalization mode for DCT-I. Note also that the DCT-I is only supported for input

size > 1

type II

There are several deﬁnitions of the DCT-II; we use the following (for norm=None):

N-1

y[k] = 2*sum x[n]*cos(pi*k*(2n+1)/(2*N)), 0 <= k < N.

n=0

If norm=’ortho’,y[k] is multiplied by a scaling factor f:

f = sqrt(1/(4*N)) if k = 0,

f = sqrt(1/(2*N)) otherwise.

Which makes the corresponding matrix of coefﬁcients orthonormal (OO’ = Id).

type III

There are several deﬁnitions, we use the following (for norm=None):

N-1

y[k] = x[0] + 2 *sum x[n]*cos(pi*(k+0.5)*n/N), 0 <= k < N.

n=1

or, for norm=’ortho’ and 0 <= k < N:

N-1

y[k] = x[0] / sqrt(N) + sqrt(1/N) *sum x[n]*cos(pi*(k+0.5)*n/N)

n=1

The (unnormalized) DCT-III is the inverse of the (unnormalized) DCT-II, up to a factor 2N. The orthonormalized

DCT-III is exactly the inverse of the orthonormalized DCT-II.

References

[R29],[R30]

scipy.fftpack.idct(x,type=2,n=None,axis=-1,norm=None,overwrite_x=0)

Return the Inverse Discrete Cosine Transform of an arbitrary type sequence.

Parameters x: array_like

The input array.

type : {1, 2, 3}, optional

246 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Type of the DCT (see Notes). Default type is 2.

n: int, optional

Length of the transform.

axis : int, optional

Axis over which to compute the transform.

norm : {None, ‘ortho’}, optional

Normalization mode (see Notes). Default is None.

overwrite_x : bool, optional

If True the contents of x can be destroyed. (default=False)

Returns idct : ndarray of real

The transformed input array.

See Also

dct

Notes

For a single dimension array x,idct(x, norm=’ortho’) is equal to MATLAB idct(x).

‘The’ IDCT is the IDCT of type 2, which is the same as DCT of type 3.

IDCT of type 1 is the DCT of type 1, IDCT of type 2 is the DCT of type 3, and IDCT of type 3 is the DCT of

type 2. For the deﬁnition of these types, see dct.

5.5.2 Differential and pseudo-differential operators

diff(x[, order, period, _cache]) Return k-th derivative (or integral) of a periodic sequence x.

tilbert(x, h[, period, _cache]) Return h-Tilbert transform of a periodic sequence x.

itilbert(x, h[, period, _cache]) Return inverse h-Tilbert transform of a periodic sequence x.

hilbert(x[, _cache]) Return Hilbert transform of a periodic sequence x.

ihilbert(x) Return inverse Hilbert transform of a periodic sequence x.

cs_diff(x, a, b[, period, _cache]) Return (a,b)-cosh/sinh pseudo-derivative of a periodic sequence.

sc_diff(x, a, b[, period, _cache]) Return (a,b)-sinh/cosh pseudo-derivative of a periodic sequence x.

ss_diff(x, a, b[, period, _cache]) Return (a,b)-sinh/sinh pseudo-derivative of a periodic sequence x.

cc_diff(x, a, b[, period, _cache]) Return (a,b)-cosh/cosh pseudo-derivative of a periodic sequence.

shift(x, a[, period, _cache]) Shift periodic sequence x by a: y(u) = x(u+a).

scipy.fftpack.diff(x,order=1,period=None,_cache={})

Return k-th derivative (or integral) of a periodic sequence x.

If x_j and y_j are Fourier coefﬁcients of periodic functions x and y, respectively, then:

y_j = pow(sqrt(-1)*j*2*pi/period, order) *x_j

y_0 = 0 if order is not 0.

Parameters x: array_like

Input array.

order : int, optional

The order of differentiation. Default order is 1. If order is negative, then integration is

carried out under the assumption that x_0 == 0.

period : ﬂoat, optional

The assumed period of the sequence. Default is 2*pi.

5.5. Discrete Fourier transforms (scipy.fftpack) 247

SciPy Reference Guide, Release 0.13.0

Notes

If sum(x, axis=0) = 0 then diff(diff(x, k), -k) == x (within numerical accuracy).

For odd order and even len(x), the Nyquist mode is taken zero.

scipy.fftpack.tilbert(x,h,period=None,_cache={})

Return h-Tilbert transform of a periodic sequence x.

If x_j and y_j are Fourier coefﬁcients of periodic functions x and y, respectively, then:

y_j =sqrt(-1)*coth(j*h*2*pi/period) *x_j

y_0 =0

Parameters x: array_like

The input array to transform.

h: ﬂoat

Deﬁnes the parameter of the Tilbert transform.

period : ﬂoat, optional

The assumed period of the sequence. Default period is 2*pi.

Returns tilbert : ndarray

The result of the transform.

Notes

If sum(x, axis=0) == 0 and n = len(x) is odd then tilbert(itilbert(x)) == x.

If 2*pi *h / period is approximately 10 or larger, then numerically tilbert == hilbert (the-

oretically oo-Tilbert == Hilbert).

For even len(x), the Nyquist mode of xis taken zero.

scipy.fftpack.itilbert(x,h,period=None,_cache={})

Return inverse h-Tilbert transform of a periodic sequence x.

If x_j and y_j are Fourier coefﬁcients of periodic functions x and y, respectively, then:

y_j = -sqrt(-1)*tanh(j*h*2*pi/period) *x_j

y_0 =0

For more details, see tilbert.

scipy.fftpack.hilbert(x,_cache={})

Return Hilbert transform of a periodic sequence x.

If x_j and y_j are Fourier coefﬁcients of periodic functions x and y, respectively, then:

y_j =sqrt(-1)*sign(j) *x_j

y_0 =0

Parameters x: array_like

The input array, should be periodic.

_cache : dict, optional

Dictionary that contains the kernel used to do a convolution with.

Returns y: ndarray

The transformed input.

248 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Notes

If sum(x, axis=0) == 0 then hilbert(ihilbert(x)) == x.

For even len(x), the Nyquist mode of x is taken zero.

The sign of the returned transform does not have a factor -1 that is more often than not found in the deﬁnition

of the Hilbert transform. Note also that scipy.signal.hilbert does have an extra -1 factor compared to

this function.

scipy.fftpack.ihilbert(x)

Return inverse Hilbert transform of a periodic sequence x.

If x_j and y_j are Fourier coefﬁcients of periodic functions x and y, respectively, then:

y_j = -sqrt(-1)*sign(j) *x_j

y_0 =0

scipy.fftpack.cs_diff(x,a,b,period=None,_cache={})

Return (a,b)-cosh/sinh pseudo-derivative of a periodic sequence.

If x_j and y_j are Fourier coefﬁcients of periodic functions x and y, respectively, then:

y_j = -sqrt(-1)*cosh(j*a*2*pi/period)/sinh(j*b*2*pi/period) *x_j

y_0 =0

Parameters x: array_like

The array to take the pseudo-derivative from.

a, b : ﬂoat

Deﬁnes the parameters of the cosh/sinh pseudo-differential operator.

period : ﬂoat, optional

The period of the sequence. Default period is 2*pi.

Returns cs_diff : ndarray

Pseudo-derivative of periodic sequence x.

Notes

For even len(x), the Nyquist mode of xis taken as zero.

scipy.fftpack.sc_diff(x,a,b,period=None,_cache={})

Return (a,b)-sinh/cosh pseudo-derivative of a periodic sequence x.

If x_j and y_j are Fourier coefﬁcients of periodic functions x and y, respectively, then:

y_j =sqrt(-1)*sinh(j*a*2*pi/period)/cosh(j*b*2*pi/period) *x_j

y_0 =0

Parameters x: array_like

Input array.

a,b : ﬂoat

Deﬁnes the parameters of the sinh/cosh pseudo-differential operator.

period : ﬂoat, optional

The period of the sequence x. Default is 2*pi.

Notes

sc_diff(cs_diff(x,a,b),b,a) == x For even len(x), the Nyquist mode of x is taken as zero.

5.5. Discrete Fourier transforms (scipy.fftpack) 249

SciPy Reference Guide, Release 0.13.0

scipy.fftpack.ss_diff(x,a,b,period=None,_cache={})

Return (a,b)-sinh/sinh pseudo-derivative of a periodic sequence x.

If x_j and y_j are Fourier coefﬁcients of periodic functions x and y, respectively, then:

y_j =sinh(j*a*2*pi/period)/sinh(j*b*2*pi/period) *x_j

y_0 =a/b*x_0

Parameters x: array_like

The array to take the pseudo-derivative from.

a,b

Deﬁnes the parameters of the sinh/sinh pseudo-differential operator.

period : ﬂoat, optional

The period of the sequence x. Default is 2*pi.

Notes

ss_diff(ss_diff(x,a,b),b,a) == x

scipy.fftpack.cc_diff(x,a,b,period=None,_cache={})

Return (a,b)-cosh/cosh pseudo-derivative of a periodic sequence.

If x_j and y_j are Fourier coefﬁcients of periodic functions x and y, respectively, then:

y_j =cosh(j*a*2*pi/period)/cosh(j*b*2*pi/period) *x_j

Parameters x: array_like

The array to take the pseudo-derivative from.

a,b : ﬂoat

Deﬁnes the parameters of the sinh/sinh pseudo-differential operator.

period : ﬂoat, optional

The period of the sequence x. Default is 2*pi.

Returns cc_diff : ndarray

Pseudo-derivative of periodic sequence x.

Notes

cc_diff(cc_diff(x,a,b),b,a) == x

scipy.fftpack.shift(x,a,period=None,_cache={})

Shift periodic sequence x by a: y(u) = x(u+a).

If x_j and y_j are Fourier coefﬁcients of periodic functions x and y, respectively, then:

y_j =exp(j*a*2*pi/period*sqrt(-1)) *x_f

Parameters x: array_like

The array to take the pseudo-derivative from.

a: ﬂoat

Deﬁnes the parameters of the sinh/sinh pseudo-differential

period : ﬂoat, optional

The period of the sequences x and y. Default period is 2*pi.

Continued on next page

250 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Table 5.16 – continued from previous page

5.5.3 Helper functions

fftshift(x[, axes]) Shift the zero-frequency component to the center of the spectrum.

ifftshift(x[, axes]) The inverse of fftshift.

fftfreq(n[, d]) Return the Discrete Fourier Transform sample frequencies.

rfftfreq(n[, d]) DFT sample frequencies (for usage with rfft, irfft).

scipy.fftpack.fftshift(x,axes=None)

Shift the zero-frequency component to the center of the spectrum.

This function swaps half-spaces for all axes listed (defaults to all). Note that y[0] is the Nyquist component

only if len(x) is even.

Parameters x: array_like

Input array.

axes : int or shape tuple, optional

Axes over which to shift. Default is None, which shifts all axes.

Returns y: ndarray

The shifted array.

See Also

ifftshift The inverse of fftshift.

Examples

>>> freqs =np.fft.fftfreq(10,0.1)

>>> freqs

array([ 0., 1., 2., 3., 4., -5., -4., -3., -2., -1.])

>>> np.fft.fftshift(freqs)

array([-5., -4., -3., -2., -1., 0., 1., 2., 3., 4.])

Shift the zero-frequency component only along the second axis:

>>> freqs =np.fft.fftfreq(9, d=1./9).reshape(3,3)

>>> freqs

array([[ 0., 1., 2.],

[ 3., 4., -4.],

[-3., -2., -1.]])

>>> np.fft.fftshift(freqs, axes=(1,))

array([[ 2., 0., 1.],

[-4., 3., 4.],

[-1., -3., -2.]])

scipy.fftpack.ifftshift(x,axes=None)

The inverse of fftshift.

Parameters x: array_like

Input array.

axes : int or shape tuple, optional

Axes over which to calculate. Defaults to None, which shifts all axes.

Returns y: ndarray

The shifted array.

5.5. Discrete Fourier transforms (scipy.fftpack) 251

SciPy Reference Guide, Release 0.13.0

See Also

fftshift Shift zero-frequency component to the center of the spectrum.

Examples

>>> freqs =np.fft.fftfreq(9, d=1./9).reshape(3,3)

>>> freqs

array([[ 0., 1., 2.],

[ 3., 4., -4.],

[-3., -2., -1.]])

>>> np.fft.ifftshift(np.fft.fftshift(freqs))

array([[ 0., 1., 2.],

[ 3., 4., -4.],

[-3., -2., -1.]])

scipy.fftpack.fftfreq(n,d=1.0)

Return the Discrete Fourier Transform sample frequencies.

The returned ﬂoat array contains the frequency bins in cycles/unit (with zero at the start) given a window length

nand a sample spacing d:

f = [0, 1, ..., n/2-1, -n/2, ..., -1] / (d*n) if n is even

f = [0, 1, ..., (n-1)/2, -(n-1)/2, ..., -1] / (d*n) if n is odd

Parameters n: int

Window length.

d: scalar

Sample spacing.

Returns out : ndarray

The array of length n, containing the sample frequencies.

Examples

>>> signal =np.array([-2,8,6,4,1,0,3,5], dtype=float)

>>> fourier =np.fft.fft(signal)

>>> n=signal.size

>>> timestep =0.1

>>> freq =np.fft.fftfreq(n, d=timestep)

>>> freq

array([ 0. , 1.25, 2.5 , 3.75, -5. , -3.75, -2.5 , -1.25])

scipy.fftpack.rfftfreq(n,d=1.0)

DFT sample frequencies (for usage with rfft, irfft).

The returned ﬂoat array contains the frequency bins in cycles/unit (with zero at the start) given a window length

nand a sample spacing d:

f = [0,1,1,2,2,...,n/2-1,n/2-1,n/2]/(d*n) if n is even

f = [0,1,1,2,2,...,n/2-1,n/2-1,n/2,n/2]/(d*n) if n is odd

Parameters n: int

Window length.

d: scalar, optional

Sample spacing. Default is 1.

Returns out : ndarray

The array of length n, containing the sample frequencies.

252 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Examples

>>> from scipy import fftpack

>>> sig =np.array([-2,8,6,4,1,0,3,5], dtype=float)

>>> sig_fft =fftpack.rfft(sig)

>>> n=sig_fft.size

>>> timestep =0.1

>>> freq =fftpack.rfftfreq(n, d=timestep)

>>> freq

array([ 0. , 1.25, 1.25, 2.5 , 2.5 , 3.75, 3.75, 5. ])

5.5.4 Convolutions (scipy.fftpack.convolve)

convolve convolve - Function signature:

convolve_z convolve_z - Function signature:

init_convolution_kernel init_convolution_kernel - Function signature:

destroy_convolve_cache destroy_convolve_cache - Function signature:

scipy.fftpack.convolve.convolve = <fortran object>

convolve - Function signature:

y = convolve(x,omega,[swap_real_imag,overwrite_x])

Required arguments:

x : input rank-1 array(‘d’) with bounds (n) omega : input rank-1 array(‘d’) with bounds (n)

Optional arguments:

overwrite_x := 0 input int swap_real_imag := 0 input int

Return objects:

y : rank-1 array(‘d’) with bounds (n) and x storage

scipy.fftpack.convolve.convolve_z = <fortran object>

convolve_z - Function signature:

y = convolve_z(x,omega_real,omega_imag,[overwrite_x])

Required arguments:

x : input rank-1 array(‘d’) with bounds (n) omega_real : input rank-1 array(‘d’) with bounds (n)

omega_imag : input rank-1 array(‘d’) with bounds (n)

Optional arguments:

overwrite_x := 0 input int

Return objects:

y : rank-1 array(‘d’) with bounds (n) and x storage

scipy.fftpack.convolve.init_convolution_kernel = <fortran object>

init_convolution_kernel - Function signature:

omega = init_convolution_kernel(n,kernel_func,[d,zero_nyquist,kernel_func_extra_args])

Required arguments:

n : input int kernel_func : call-back function

5.5. Discrete Fourier transforms (scipy.fftpack) 253

SciPy Reference Guide, Release 0.13.0

Optional arguments:

d := 0 input int kernel_func_extra_args := () input tuple zero_nyquist := d%2 input int

Return objects:

omega : rank-1 array(‘d’) with bounds (n)

Call-back functions:

def kernel_func(k): return kernel_func Required arguments:

k : input int

Return objects:

kernel_func : ﬂoat

scipy.fftpack.convolve.destroy_convolve_cache = <fortran object>

destroy_convolve_cache - Function signature: destroy_convolve_cache()

5.5.5 Other (scipy.fftpack._fftpack)

drfft drfft - Function signature:

zfft zfft - Function signature:

zrfft zrfft - Function signature:

zfftnd zfftnd - Function signature:

destroy_drfft_cache destroy_drfft_cache - Function signature:

destroy_zfft_cache destroy_zfft_cache - Function signature:

destroy_zfftnd_cache destroy_zfftnd_cache - Function signature:

scipy.fftpack._fftpack.drfft = <fortran object>

drfft - Function signature:

y = drfft(x,[n,direction,normalize,overwrite_x])

Required arguments:

x : input rank-1 array(‘d’) with bounds (*)

Optional arguments:

overwrite_x := 0 input int n := size(x) input int direction := 1 input int normalize := (direction<0)

input int

Return objects:

y : rank-1 array(‘d’) with bounds (*) and x storage

scipy.fftpack._fftpack.zfft = <fortran object>

zfft - Function signature:

y = zfft(x,[n,direction,normalize,overwrite_x])

Required arguments:

x : input rank-1 array(‘D’) with bounds (*)

Optional arguments:

overwrite_x := 0 input int n := size(x) input int direction := 1 input int normalize := (direction<0)

input int

Return objects:

y : rank-1 array(‘D’) with bounds (*) and x storage

254 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.fftpack._fftpack.zrfft = <fortran object>

zrfft - Function signature:

y = zrfft(x,[n,direction,normalize,overwrite_x])

Required arguments:

x : input rank-1 array(‘D’) with bounds (*)

Optional arguments:

overwrite_x := 1 input int n := size(x) input int direction := 1 input int normalize := (direction<0)

input int

Return objects:

y : rank-1 array(‘D’) with bounds (*) and x storage

scipy.fftpack._fftpack.zfftnd = <fortran object>

zfftnd - Function signature:

y = zfftnd(x,[s,direction,normalize,overwrite_x])

Required arguments:

x : input rank-1 array(‘D’) with bounds (*)

Optional arguments:

overwrite_x := 0 input int s := old_shape(x,j++) input rank-1 array(‘i’) with bounds (r) direction

:= 1 input int normalize := (direction<0) input int

Return objects:

y : rank-1 array(‘D’) with bounds (*) and x storage

scipy.fftpack._fftpack.destroy_drfft_cache = <fortran object>

destroy_drfft_cache - Function signature: destroy_drfft_cache()

scipy.fftpack._fftpack.destroy_zfft_cache = <fortran object>

destroy_zfft_cache - Function signature: destroy_zfft_cache()

scipy.fftpack._fftpack.destroy_zfftnd_cache = <fortran object>

destroy_zfftnd_cache - Function signature: destroy_zfftnd_cache()

5.6 Integration and ODEs (scipy.integrate)

5.6.1 Integrating functions, given function object

quad(func, a, b[, args, full_output, ...]) Compute a deﬁnite integral.

dblquad(func, a, b, gfun, hfun[, args, ...]) Compute a double integral.

tplquad(func, a, b, gfun, hfun, qfun, rfun) Compute a triple (deﬁnite) integral.

nquad(func, ranges[, args, opts]) Integration over multiple variables.

fixed_quad(func, a, b[, args, n]) Compute a deﬁnite integral using ﬁxed-order Gaussian quadrature.

quadrature(func, a, b[, args, tol, rtol, ...]) Compute a deﬁnite integral using ﬁxed-tolerance Gaussian quadrature.

romberg(function, a, b[, args, tol, rtol, ...]) Romberg integration of a callable function or method.

scipy.integrate.quad(func,a,b,args=(),full_output=0,epsabs=1.49e-08,epsrel=1.49e-08,limit=50,

points=None,weight=None,wvar=None,wopts=None,maxp1=50,limlst=50)

Compute a deﬁnite integral.

5.6. Integration and ODEs (scipy.integrate) 255

SciPy Reference Guide, Release 0.13.0

Integrate func from ato b(possibly inﬁnite interval) using a technique from the Fortran library QUADPACK.

Run scipy.integrate.quad_explain() for more information on the more esoteric inputs and outputs.

Parameters func : function

A Python function or method to integrate. If func takes many arguments, it is inte-

grated along the axis corresponding to the ﬁrst argument.

a: ﬂoat

Lower limit of integration (use -numpy.inf for -inﬁnity).

b: ﬂoat

Upper limit of integration (use numpy.inf for +inﬁnity).

args : tuple, optional

Extra arguments to pass to func.

full_output : int, optional

Non-zero to return a dictionary of integration information. If non-zero, warning mes-

sages are also suppressed and the message is appended to the output tuple.

Returns y: ﬂoat

The integral of func from ato b.

abserr : ﬂoat

An estimate of the absolute error in the result.

infodict : dict

A dictionary containing additional information. Run scipy.integrate.quad_explain()

for more information.

message :

A convergence message.

explain :

Appended only with ‘cos’ or ‘sin’ weighting and inﬁnite integration limits, it contains

an explanation of the codes in infodict[’ierlst’]

Other Parameters

epsabs : ﬂoat or int, optional

Absolute error tolerance.

epsrel : ﬂoat or int, optional

Relative error tolerance.

limit : ﬂoat or int, optional

An upper bound on the number of subintervals used in the adaptive algorithm.

points : (sequence of ﬂoats,ints), optional

A sequence of break points in the bounded integration interval where local difﬁculties

of the integrand may occur (e.g., singularities, discontinuities). The sequence does

not have to be sorted.

weight : ﬂoat or int, optional

String indicating weighting function.

wvar : optional

Variables for use with weighting functions.

wopts : optional

Optional input for reusing Chebyshev moments.

maxp1 : ﬂoat or int, optional

An upper bound on the number of Chebyshev moments.

limlst : int, optional

Upper bound on the number of cylces (>=3) for use with a sinusoidal weighting and

an inﬁnite end-point.

See Also

dblquad double integral

tplquad triple integral

256 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

nquad n-dimensional integrals (uses quad recursively)

fixed_quadﬁxed-order Gaussian quadrature

quadratureadaptive Gaussian quadrature

odeint ODE integrator

ode ODE integrator

simps integrator for sampled data

romb integrator for sampled data

scipy.special

for coefﬁcients and roots of orthogonal polynomials

Examples

Calculate R4

0x2dx and compare with an analytic result

>>> from scipy import integrate

>>> x2 =lambda x: x**2

>>> integrate.quad(x2, 0,4)

(21.333333333333332, 2.3684757858670003e-13)

>>> print(4**3/3.)# analytical result

21.3333333333

Calculate R∞

0e−xdx

>>> invexp =lambda x: np.exp(-x)

>>> integrate.quad(invexp, 0, np.inf)

(1.0, 5.842605999138044e-11)

>>> f=lambda x,a : a*x

>>> y, err =integrate.quad(f, 0,1, args=(1,))

>>> y

0.5

>>> y, err =integrate.quad(f, 0,1, args=(3,))

>>> y

1.5

scipy.integrate.dblquad(func,a,b,gfun,hfun,args=(),epsabs=1.49e-08,epsrel=1.49e-08)

Compute a double integral.

Return the double (deﬁnite) integral of func(y, x) from x = a..b and y = gfun(x)..hfun(x).

Parameters func : callable

A Python function or method of at least two variables: y must be the ﬁrst argument

and x the second argument.

(a,b) : tuple

The limits of integration in x: a<b

gfun : callable

The lower boundary curve in y which is a function taking a single ﬂoating point argu-

ment (x) and returning a ﬂoating point result: a lambda function can be useful here.

hfun : callable

The upper boundary curve in y (same requirements as gfun).

args : sequence, optional

Extra arguments to pass to func.

epsabs : ﬂoat, optional

5.6. Integration and ODEs (scipy.integrate) 257

SciPy Reference Guide, Release 0.13.0

Absolute tolerance passed directly to the inner 1-D quadrature integration. Default is

1.49e-8.

epsrel : ﬂoat

Relative tolerance of the inner 1-D integrals. Default is 1.49e-8.

Returns y: ﬂoat

The resultant integral.

abserr : ﬂoat

An estimate of the error.

See Also

quad single integral

tplquad triple integral

nquad N-dimensional integrals

fixed_quadﬁxed-order Gaussian quadrature

quadratureadaptive Gaussian quadrature

odeint ODE integrator

ode ODE integrator

simps integrator for sampled data

romb integrator for sampled data

scipy.special

for coefﬁcients and roots of orthogonal polynomials

scipy.integrate.tplquad(func,a,b,gfun,hfun,qfun,rfun,args=(),epsabs=1.49e-08,epsrel=1.49e-

08)

Compute a triple (deﬁnite) integral.

Return the triple integral of func(z, y, x) from x = a..b,y = gfun(x)..hfun(x), and z =

qfun(x,y)..rfun(x,y).

Parameters func : function

A Python function or method of at least three variables in the order (z, y, x).

(a,b) : tuple

The limits of integration in x: a<b

gfun : function

The lower boundary curve in y which is a function taking a single ﬂoating point argu-

ment (x) and returning a ﬂoating point result: a lambda function can be useful here.

hfun : function

The upper boundary curve in y (same requirements as gfun).

qfun : function

The lower boundary surface in z. It must be a function that takes two ﬂoats in the

order (x, y) and returns a ﬂoat.

rfun : function

The upper boundary surface in z. (Same requirements as qfun.)

args : Arguments

Extra arguments to pass to func.

epsabs : ﬂoat, optional

Absolute tolerance passed directly to the innermost 1-D quadrature integration. De-

fault is 1.49e-8.

epsrel : ﬂoat, optional

Relative tolerance of the innermost 1-D integrals. Default is 1.49e-8.

258 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Returns y: ﬂoat

The resultant integral.

abserr : ﬂoat

An estimate of the error.

See Also

quad Adaptive quadrature using QUADPACK

quadratureAdaptive Gaussian quadrature

fixed_quadFixed-order Gaussian quadrature

dblquad Double integrals

nquad N-dimensional integrals

romb Integrators for sampled data

simps Integrators for sampled data

ode ODE integrators

odeint ODE integrators

scipy.special

For coefﬁcients and roots of orthogonal polynomials

scipy.integrate.nquad(func,ranges,args=None,opts=None)

Integration over multiple variables.

Wraps quad to enable integration over multiple variables. Various options allow improved integration of dis-

continuous functions, as well as the use of weighted integration, and generally ﬁner control of the integration

process.

Parameters func : callable

The function to be integrated. Has arguments of x0, ... xn,t0, tm, where

integration is carried out over x0, ... xn, which must be ﬂoats. Function signa-

ture should be func(x0, x1, ..., xn, t0, t1, ..., tm). Integration

is carried out in order. That is, integration over x0 is the innermost integral, and xn

is the outermost.

ranges : iterable object

Each element of ranges may be either a sequence of 2 numbers, or else a callable that

returns such a sequence. ranges[0] corresponds to integration over x0, and so on.

If an element of ranges is a callable, then it will be called with all of the integration

arguments available. e.g. if func = f(x0, x1, x2), then ranges[0] may be

deﬁned as either (a, b) or else as (a, b) = range0(x1, x2).

args : iterable object, optional

Additional arguments t0, ..., tn, required by func.

opts : iterable object or dict, optional

Options to be passed to quad. May be empty, a dict, or a sequence of dicts or func-

tions that return a dict. If empty, the default options from scipy.integrate.quadare

used. If a dict, the same options are used for all levels of integraion. If a sequence,

then each element of the sequence corresponds to a particular integration. e.g. opts[0]

corresponds to integration over x0, and so on. The available options together with

their default values are:

•epsabs = 1.49e-08

•epsrel = 1.49e-08

•limit = 50

•points = None

5.6. Integration and ODEs (scipy.integrate) 259

SciPy Reference Guide, Release 0.13.0

•weight = None

•wvar = None

•wopts = None

The full_output option from quad is unavailable, due to the complexity of han-

dling the large amount of data such an option would return for this kind of nested

integration. For more information on these options, see quad and quad_explain.

Returns result : ﬂoat

The result of the integration.

abserr : ﬂoat

The maximum of the estimates of the absolute error in the various integration results.

See Also

quad 1-dimensional numerical integration

dblquad,tplquad

fixed_quadﬁxed-order Gaussian quadrature

quadratureadaptive Gaussian quadrature

Examples

>>> from scipy import integrate

>>> func =lambda x0,x1,x2,x3 : x0**2+x1*x2 -x3**3+np.sin(x0) +(

... 1if (x0-.2*x3-.5-.25*x1>0)else 0)

>>> points =[[lambda (x1,x2,x3) : 0.2*x3 +0.5 +0.25*x1], [], [], []]

>>> def opts0(*args, **kwargs):

... return {’points’:[0.2*args[2]+0.5 +0.25*args[0]]}

>>> integrate.nquad(func, [[0,1], [-1,1], [.13,.8], [-.15,1]],

... opts=[opts0,{},{},{}])

(1.5267454070738633, 2.9437360001402324e-14)

>>> scale = .1

>>> def func2(x0, x1, x2, x3, t0, t1):

... return x0*x1*x3**2+np.sin(x2) +1+(1if x0+t1*x1-t0>0else 0)

>>> def lim0(x1, x2, x3, t0, t1):

... return [scale *(x1**2+x2 +np.cos(x3)*t0*t1 +1)-1,

... scale *(x1**2+x2 +np.cos(x3)*t0*t1 +1)+1]

>>> def lim1(x2, x3, t0, t1):

... return [scale *(t0*x2 +t1*x3) -1,

... scale *(t0*x2 +t1*x3) +1]

>>> def lim2(x3, t0, t1):

... return [scale *(x3 +t0**2*t1**3)-1,

... scale *(x3 +t0**2*t1**3)+1]

>>> def lim3(t0, t1):

... return [scale *(t0+t1) -1, scale *(t0+t1) +1]

>>> def opts0(x1, x2, x3, t0, t1):

... return {’points’ : [t0 -t1*x1]}

>>> def opts1(x2, x3, t0, t1):

... return {}

>>> def opts2(x3, t0, t1):

... return {}

>>> def opts3(t0, t1):

... return {}

>>> integrate.nquad(func2, [lim0, lim1, lim2, lim3], args=(0,0),

opts=[opts0, opts1, opts2, opts3])

(25.066666666666666, 2.7829590483937256e-13)

260 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.integrate.fixed_quad(func,a,b,args=(),n=5)

Compute a deﬁnite integral using ﬁxed-order Gaussian quadrature.

Integrate func from ato busing Gaussian quadrature of order n.

Parameters func : callable

A Python function or method to integrate (must accept vector inputs).

a: ﬂoat

Lower limit of integration.

b: ﬂoat

Upper limit of integration.

args : tuple, optional

Extra arguments to pass to function, if any.

n: int, optional

Order of quadrature integration. Default is 5.

Returns val : ﬂoat

Gaussian quadrature approximation to the integral

See Also

quad adaptive quadrature using QUADPACK

dblquad double integrals

tplquad triple integrals

romberg adaptive Romberg quadrature

quadratureadaptive Gaussian quadrature

romb integrators for sampled data

simps integrators for sampled data

cumtrapz cumulative integration for sampled data

ode ODE integrator

odeint ODE integrator

scipy.integrate.quadrature(func,a,b,args=(),tol=1.49e-08,rtol=1.49e-08,maxiter=50,

vec_func=True)

Compute a deﬁnite integral using ﬁxed-tolerance Gaussian quadrature.

Integrate func from ato busing Gaussian quadrature with absolute tolerance tol.

Parameters func : function

A Python function or method to integrate.

a: ﬂoat

Lower limit of integration.

b: ﬂoat

Upper limit of integration.

args : tuple, optional

Extra arguments to pass to function.

tol, rol : ﬂoat, optional

Iteration stops when error between last two iterates is less than tol OR the relative

change is less than rtol.

maxiter : int, optional

Maximum number of iterations.

vec_func : bool, optional

5.6. Integration and ODEs (scipy.integrate) 261

SciPy Reference Guide, Release 0.13.0

True or False if func handles arrays as arguments (is a “vector” function). Default is

True.

Returns val : ﬂoat

Gaussian quadrature approximation (within tolerance) to integral.

err : ﬂoat

Difference between last two estimates of the integral.

See Also

romberg adaptive Romberg quadrature

fixed_quadﬁxed-order Gaussian quadrature

quad adaptive quadrature using QUADPACK

dblquad double integrals

tplquad triple integrals

romb integrator for sampled data

simps integrator for sampled data

cumtrapz cumulative integration for sampled data

ode ODE integrator

odeint ODE integrator

scipy.integrate.romberg(function,a,b,args=(),tol=1.48e-08,rtol=1.48e-08,show=False,div-

max=10,vec_func=False)

Romberg integration of a callable function or method.

Returns the integral of function (a function of one variable) over the interval (a,b).

If show is 1, the triangular array of the intermediate results will be printed. If vec_func is True (default is False),

then function is assumed to support vector arguments.

Parameters function : callable

Function to be integrated.

a: ﬂoat

Lower limit of integration.

b: ﬂoat

Upper limit of integration.

Returns results : ﬂoat

Result of the integration.

Other Parameters

args : tuple, optional

Extra arguments to pass to function. Each element of args will be passed as a single

argument to func. Default is to pass no extra arguments.

tol, rtol : ﬂoat, optional

The desired absolute and relative tolerances. Defaults are 1.48e-8.

show : bool, optional

Whether to print the results. Default is False.

divmax : int, optional

Maximum order of extrapolation. Default is 10.

vec_func : bool, optional

Whether func handles arrays as arguments (i.e whether it is a “vector” function). De-

fault is False.

262 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

See Also

fixed_quadFixed-order Gaussian quadrature.

quad Adaptive quadrature using QUADPACK.

dblquad Double integrals.

tplquad Triple integrals.

romb Integrators for sampled data.

simps Integrators for sampled data.

cumtrapz Cumulative integration for sampled data.

ode ODE integrator.

odeint ODE integrator.

References

[R31]

Examples

Integrate a gaussian from 0 to 1 and compare to the error function.

>>> from scipy import integrate

>>> from scipy.special import erf

>>> gaussian =lambda x: 1/np.sqrt(np.pi) *np.exp(-x**2)

>>> result =integrate.romberg(gaussian, 0,1, show=True)

Romberg integration of <function vfunc at ...> from [0, 1]

Steps StepSize Results

1 1.000000 0.385872

2 0.500000 0.412631 0.421551

4 0.250000 0.419184 0.421368 0.421356

8 0.125000 0.420810 0.421352 0.421350 0.421350

16 0.062500 0.421215 0.421350 0.421350 0.421350 0.421350

32 0.031250 0.421317 0.421350 0.421350 0.421350 0.421350 0.421350

The ﬁnal result is 0.421350396475 after 33 function evaluations.

>>> print("%g %g"%(2*result, erf(1)))

0.842701 0.842701

5.6.2 Integrating functions, given ﬁxed samples

cumtrapz(y[, x, dx, axis, initial]) Cumulatively integrate y(x) using the composite trapezoidal rule.

simps(y[, x, dx, axis, even]) Integrate y(x) using samples along the given axis and the composite

romb(y[, dx, axis, show]) Romberg integration using samples of a function.

scipy.integrate.cumtrapz(y,x=None,dx=1.0,axis=-1,initial=None)

Cumulatively integrate y(x) using the composite trapezoidal rule.

Parameters y: array_like

Values to integrate.

x: array_like, optional

5.6. Integration and ODEs (scipy.integrate) 263

SciPy Reference Guide, Release 0.13.0

The coordinate to integrate along. If None (default), use spacing dx between consec-

utive elements in y.

dx : int, optional

Spacing between elements of y. Only used if xis None.

axis : int, optional

Speciﬁes the axis to cumulate. Default is -1 (last axis).

initial : scalar, optional

If given, uses this value as the ﬁrst value in the returned result. Typically this value

should be 0. Default is None, which means no value at x[0] is returned and res has

one element less than yalong the axis of integration.

Returns res : ndarray

The result of cumulative integration of yalong axis. If initial is None, the shape is

such that the axis of integration has one less value than y. If initial is given, the shape

is equal to that of y.

See Also

numpy.cumsum,numpy.cumprod

quad adaptive quadrature using QUADPACK

romberg adaptive Romberg quadrature

quadratureadaptive Gaussian quadrature

fixed_quadﬁxed-order Gaussian quadrature

dblquad double integrals

tplquad triple integrals

romb integrators for sampled data

ode ODE integrators

odeint ODE integrators

Examples

>>> from scipy import integrate

>>> import matplotlib.pyplot as plt

>>> x=np.linspace(-2,2, num=20)

>>> y=x

>>> y_int =integrate.cumtrapz(y, x, initial=0)

>>> plt.plot(x, y_int, ’ro’, x, y[0]+0.5 *x**2,’b-’)

>>> plt.show()

264 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.integrate.simps(y,x=None,dx=1,axis=-1,even=’avg’)

Integrate y(x) using samples along the given axis and the composite Simpson’s rule. If x is None, spacing of dx

is assumed.

If there are an even number of samples, N, then there are an odd number of intervals (N-1), but Simpson’s rule

requires an even number of intervals. The parameter ‘even’ controls how this is handled.

Parameters y: array_like

Array to be integrated.

x: array_like, optional

If given, the points at which yis sampled.

dx : int, optional

Spacing of integration points along axis of y. Only used when xis None. Default is 1.

axis : int, optional

Axis along which to integrate. Default is the last axis.

even : {‘avg’, ‘ﬁrst’, ‘str’}, optional

‘avg’ [Average two results:1) use the ﬁrst N-2 intervals with] a trapezoidal

rule on the last interval and 2) use the last N-2 intervals with a trape-

zoidal rule on the ﬁrst interval.

‘ﬁrst’ [Use Simpson’s rule for the ﬁrst N-2 intervals with] a trapezoidal rule

on the last interval.

‘last’ [Use Simpson’s rule for the last N-2 intervals with a] trapezoidal rule

on the ﬁrst interval.

See Also

quad adaptive quadrature using QUADPACK

romberg adaptive Romberg quadrature

quadratureadaptive Gaussian quadrature

fixed_quadﬁxed-order Gaussian quadrature

dblquad double integrals

tplquad triple integrals

romb integrators for sampled data

5.6. Integration and ODEs (scipy.integrate) 265

SciPy Reference Guide, Release 0.13.0

cumtrapz cumulative integration for sampled data

ode ODE integrators

odeint ODE integrators

Notes

For an odd number of samples that are equally spaced the result is exact if the function is a polynomial of order

3 or less. If the samples are not equally spaced, then the result is exact only if the function is a polynomial of

order 2 or less.

scipy.integrate.romb(y,dx=1.0,axis=-1,show=False)

Romberg integration using samples of a function.

Parameters y: array_like

A vector of 2**k+1equally-spaced samples of a function.

dx : array_like, optional

The sample spacing. Default is 1.

axis : int, optional

The axis along which to integrate. Default is -1 (last axis).

show : bool, optional

When yis a single 1-D array, then if this argument is True print the table showing

Richardson extrapolation from the samples. Default is False.

Returns romb : ndarray

The integrated result for axis.

See Also

quad adaptive quadrature using QUADPACK

romberg adaptive Romberg quadrature

quadratureadaptive Gaussian quadrature

fixed_quadﬁxed-order Gaussian quadrature

dblquad double integrals

tplquad triple integrals

simps integrators for sampled data

cumtrapz cumulative integration for sampled data

ode ODE integrators

odeint ODE integrators

See Also

scipy.special for orthogonal polynomials (special) for Gaussian quadrature roots and weights for other weight-

ing factors and regions.

5.6.3 Integrators of ODE systems

odeint(func, y0, t[, args, Dfun, col_deriv, ...]) Integrate a system of ordinary differential equations.

ode(f[, jac]) A generic interface class to numeric integrators.

complex_ode(f[, jac]) A wrapper of ode for complex systems.

266 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.integrate.odeint(func,y0,t,args=(),Dfun=None,col_deriv=0,full_output=0,ml=None,

mu=None,rtol=None,atol=None,tcrit=None,h0=0.0,hmax=0.0,

hmin=0.0,ixpr=0,mxstep=0,mxhnil=0,mxordn=12,mxords=5,print-

messg=0)

Integrate a system of ordinary differential equations.

Solve a system of ordinary differential equations using lsoda from the FORTRAN library odepack.

Solves the initial value problem for stiff or non-stiff systems of ﬁrst order ode-s:

dy/dt =func(y,t0,...)

where y can be a vector.

Parameters func : callable(y, t0, ...)

Computes the derivative of y at t0.

y0 : array

Initial condition on y (can be a vector).

t: array

A sequence of time points for which to solve for y. The initial value point should be

the ﬁrst element of this sequence.

args : tuple, optional

Extra arguments to pass to function.

Dfun : callable(y, t0, ...)

Gradient (Jacobian) of func.

col_deriv : bool, optional

True if Dfun deﬁnes derivatives down columns (faster), otherwise Dfun should deﬁne

derivatives across rows.

full_output : bool, optional

True if to return a dictionary of optional outputs as the second output

printmessg : bool, optional

Whether to print the convergence message

Returns y: array, shape (len(t), len(y0))

Array containing the value of y for each desired time in t, with the initial value y0 in

the ﬁrst row.

infodict : dict, only returned if full_output == True

Dictionary containing additional output information

key meaning

‘hu’ vector of step sizes successfully used for each time step.

‘tcur’ vector with the value of t reached for each time step. (will always be at

least as large as the input times).

‘tolsf’ vector of tolerance scale factors, greater than 1.0, computed when a

request for too much accuracy was detected.

‘tsw’ value of t at the time of the last method switch (given for each time step)

‘nst’ cumulative number of time steps

‘nfe’ cumulative number of function evaluations for each time step

‘nje’ cumulative number of jacobian evaluations for each time step

‘nqu’ a vector of method orders for each successful step.

‘imxer’ index of the component of largest magnitude in the weighted local error

vector (e / ewt) on an error return, -1 otherwise.

‘lenrw’ the length of the double work array required.

‘leniw’ the length of integer work array required.

‘mused’a vector of method indicators for each successful time step: 1: adams

(nonstiff), 2: bdf (stiff)

Other Parameters

ml, mu : int, optional

5.6. Integration and ODEs (scipy.integrate) 267

SciPy Reference Guide, Release 0.13.0

If either of these are not None or non-negative, then the Jacobian is assumed to be

banded. These give the number of lower and upper non-zero diagonals in this banded

matrix. For the banded case, Dfun should return a matrix whose columns contain the

non-zero bands (starting with the lowest diagonal). Thus, the return matrix from Dfun

should have shape len(y0) *(ml + mu + 1) when ml >=0 or mu >=0.

rtol, atol : ﬂoat, optional

The input parameters rtol and atol determine the error control performed by the solver.

The solver will control the vector, e, of estimated local errors in y, according to an

inequality of the form max-norm of (e / ewt) <= 1, where ewt is a vector

of positive error weights computed as ewt = rtol *abs(y) + atol. rtol

and atol can be either vectors the same length as y or scalars. Defaults to 1.49012e-8.

tcrit : ndarray, optional

Vector of critical points (e.g. singularities) where integration care should be taken.

h0 : ﬂoat, (0: solver-determined), optional

The step size to be attempted on the ﬁrst step.

hmax : ﬂoat, (0: solver-determined), optional

The maximum absolute step size allowed.

hmin : ﬂoat, (0: solver-determined), optional

The minimum absolute step size allowed.

ixpr : bool, optional

Whether to generate extra printing at method switches.

mxstep : int, (0: solver-determined), optional

Maximum number of (internally deﬁned) steps allowed for each integration point in t.

mxhnil : int, (0: solver-determined), optional

Maximum number of messages printed.

mxordn : int, (0: solver-determined), optional

Maximum order to be allowed for the non-stiff (Adams) method.

mxords : int, (0: solver-determined), optional

Maximum order to be allowed for the stiff (BDF) method.

See Also

ode a more object-oriented integrator based on VODE.

quad for ﬁnding the area under a curve.

class scipy.integrate.ode(f,jac=None)

A generic interface class to numeric integrators.

Solve an equation system y0(t) = f(t, y)with (optional) jac = df/dy.

Parameters f: callable f(t, y, *f_args)

Rhs of the equation. t is a scalar, y.shape == (n,).f_args is set by calling

set_f_params(*args).fshould return a scalar, array or list (not a tuple).

jac : callable jac(t, y, *jac_args)

Jacobian of the rhs, jac[i,j] = d f[i] / d y[j].jac_args is set by call-

ing set_f_params(*args).

See Also

odeint an integrator with a simpler interface based on lsoda from ODEPACK

quad for ﬁnding the area under a curve

268 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Notes

Available integrators are listed below. They can be selected using the set_integrator method.

“vode”

Real-valued Variable-coefﬁcient Ordinary Differential Equation solver, with ﬁxed-leading-coefﬁcient im-

plementation. It provides implicit Adams method (for non-stiff problems) and a method based on back-

ward differentiation formulas (BDF) (for stiff problems).

Source: http://www.netlib.org/ode/vode.f

Warning: This integrator is not re-entrant. You cannot have two ode instances using the “vode”

integrator at the same time.

This integrator accepts the following parameters in set_integrator method of the ode class:

•atol : ﬂoat or sequence absolute tolerance for solution

•rtol : ﬂoat or sequence relative tolerance for solution

•lband : None or int

•rband : None or int Jacobian band width, jac[i,j] != 0 for i-lband <= j <= i+rband. Setting these

requires your jac routine to return the jacobian in packed format, jac_packed[i-j+lband, j] = jac[i,j].

•method: ‘adams’ or ‘bdf’ Which solver to use, Adams (non-stiff) or BDF (stiff)

•with_jacobian : bool Whether to use the jacobian

•nsteps : int Maximum number of (internally deﬁned) steps allowed during one call to the solver.

•ﬁrst_step : ﬂoat

•min_step : ﬂoat

•max_step : ﬂoat Limits for the step sizes used by the integrator.

•order : int Maximum order used by the integrator, order <= 12 for Adams, <= 5 for BDF.

“zvode”

Complex-valued Variable-coefﬁcient Ordinary Differential Equation solver, with ﬁxed-leading-coefﬁcient

implementation. It provides implicit Adams method (for non-stiff problems) and a method based on

backward differentiation formulas (BDF) (for stiff problems).

Source: http://www.netlib.org/ode/zvode.f

Warning: This integrator is not re-entrant. You cannot have two ode instances using the “zvode”

integrator at the same time.

This integrator accepts the same parameters in set_integrator as the “vode” solver.

Note: When using ZVODE for a stiff system, it should only be used for the case in which the function

f is analytic, that is, when each f(i) is an analytic function of each y(j). Analyticity means that the partial

derivative df(i)/dy(j) is a unique complex number, and this fact is critical in the way ZVODE solves the

dense or banded linear systems that arise in the stiff case. For a complex stiff ODE system in which f is

not analytic, ZVODE is likely to have convergence failures, and for this problem one should instead use

DVODE on the equivalent real system (in the real and imaginary parts of y).

“lsoda”

Real-valued Variable-coefﬁcient Ordinary Differential Equation solver, with ﬁxed-leading-coefﬁcient im-

plementation. It provides automatic method switching between implicit Adams method (for non-stiff

problems) and a method based on backward differentiation formulas (BDF) (for stiff problems).

Source: http://www.netlib.org/odepack

Warning: This integrator is not re-entrant. You cannot have two ode instances using the “lsoda”

integrator at the same time.

This integrator accepts the following parameters in set_integrator method of the ode class:

•atol : ﬂoat or sequence absolute tolerance for solution

5.6. Integration and ODEs (scipy.integrate) 269

SciPy Reference Guide, Release 0.13.0

•rtol : ﬂoat or sequence relative tolerance for solution

•lband : None or int

•rband : None or int Jacobian band width, jac[i,j] != 0 for i-lband <= j <= i+rband. Setting these

requires your jac routine to return the jacobian in packed format, jac_packed[i-j+lband, j] = jac[i,j].

•with_jacobian : bool Whether to use the jacobian

•nsteps : int Maximum number of (internally deﬁned) steps allowed during one call to the solver.

•ﬁrst_step : ﬂoat

•min_step : ﬂoat

•max_step : ﬂoat Limits for the step sizes used by the integrator.

•max_order_ns : int Maximum order used in the nonstiff case (default 12).

•max_order_s : int Maximum order used in the stiff case (default 5).

•max_hnil : int Maximum number of messages reporting too small step size (t + h = t) (default 0)

•ixpr : int Whether to generate extra printing at method switches (default False).

“dopri5”

This is an explicit runge-kutta method of order (4)5 due to Dormand & Prince (with stepsize control and

dense output).

Authors:

E. Hairer and G. Wanner Universite de Geneve, Dept. de Mathematiques CH-1211 Geneve 24,

Switzerland e-mail: ernst.hairer@math.unige.ch,gerhard.wanner@math.unige.ch

This code is described in [HNW93].

This integrator accepts the following parameters in set_integrator() method of the ode class:

•atol : ﬂoat or sequence absolute tolerance for solution

•rtol : ﬂoat or sequence relative tolerance for solution

•nsteps : int Maximum number of (internally deﬁned) steps allowed during one call to the solver.

•ﬁrst_step : ﬂoat

•max_step : ﬂoat

•safety : ﬂoat Safety factor on new step selection (default 0.9)

•ifactor : ﬂoat

•dfactor : ﬂoat Maximum factor to increase/decrease step size by in one step

•beta : ﬂoat Beta parameter for stabilised step size control.

•verbosity : int Switch for printing messages (< 0 for no messages).

“dop853”

This is an explicit runge-kutta method of order 8(5,3) due to Dormand & Prince (with stepsize control and

dense output).

Options and references the same as “dopri5”.

References

[HNW93]

Examples

A problem to integrate and the corresponding jacobian:

>>> from scipy.integrate import ode

>>>

>>> y0, t0 =[1.0j,2.0], 0

>>>

>>> def f(t, y, arg1):

>>> return [1j*arg1*y[0]+y[1], -arg1*y[1]**2]

>>> def jac(t, y, arg1):

>>> return [[1j*arg1, 1], [0,-arg1*2*y[1]]]

The integration:

270 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

>>> r=ode(f, jac).set_integrator(’zvode’, method=’bdf’, with_jacobian=True)

>>> r.set_initial_value(y0, t0).set_f_params(2.0).set_jac_params(2.0)

>>> t1 =10

>>> dt =1

>>> while r.successful() and r.t<t1:

>>> r.integrate(r.t+dt)

>>> print("%g %g"%(r.t, r.y))

Attributes

t (ﬂoat) Current time.

y (ndarray) Current variable values.

Methods

integrate(t[, step, relax]) Find y=y(t), set y as an initial condition, and return y.

set_f_params(*args) Set extra parameters for user-supplied function f.

set_initial_value(y[, t]) Set initial conditions y(t) = y.

set_integrator(name, **integrator_params) Set integrator by name.

set_jac_params(*args) Set extra parameters for user-supplied function jac.

set_solout(solout) Set callable to be called at every successful integration step.

successful() Check if integration was successful.

ode.integrate(t,step=0,relax=0)

Find y=y(t), set y as an initial condition, and return y.

ode.set_f_params(*args)

Set extra parameters for user-supplied function f.

ode.set_initial_value(y,t=0.0)

Set initial conditions y(t) = y.

ode.set_integrator(name,**integrator_params)

Set integrator by name.

Parameters name : str

Name of the integrator.

integrator_params :

Additional parameters for the integrator.

ode.set_jac_params(*args)

Set extra parameters for user-supplied function jac.

ode.set_solout(solout)

Set callable to be called at every successful integration step.

Parameters solout : callable

solout(t, y) is called at each internal integrator step, t is a scalar providing

the current independent position y is the current soloution y.shape == (n,)

solout should return -1 to stop integration otherwise it should return None or 0

ode.successful()

Check if integration was successful.

class scipy.integrate.complex_ode(f,jac=None)

A wrapper of ode for complex systems.

5.6. Integration and ODEs (scipy.integrate) 271

SciPy Reference Guide, Release 0.13.0

This functions similarly as ode, but re-maps a complex-valued equation system to a real-valued one before

using the integrators.

Parameters f: callable f(t, y, *f_args)

Rhs of the equation. t is a scalar, y.shape == (n,).f_args is set by calling

set_f_params(*args).

jac : callable jac(t, y, *jac_args)

Jacobian of the rhs, jac[i,j] = d f[i] / d y[j].jac_args is set by call-

ing set_f_params(*args).

Examples

For usage examples, see ode.

Attributes

t (ﬂoat) Current time.

y (ndarray) Current variable values.

Methods

integrate(t[, step, relax]) Find y=y(t), set y as an initial condition, and return y.

set_f_params(*args) Set extra parameters for user-supplied function f.

set_initial_value(y[, t]) Set initial conditions y(t) = y.

set_integrator(name, **integrator_params) Set integrator by name.

set_jac_params(*args) Set extra parameters for user-supplied function jac.

set_solout(solout) Set callable to be called at every successful integration step.

successful() Check if integration was successful.

complex_ode.integrate(t,step=0,relax=0)

Find y=y(t), set y as an initial condition, and return y.

complex_ode.set_f_params(*args)

Set extra parameters for user-supplied function f.

complex_ode.set_initial_value(y,t=0.0)

Set initial conditions y(t) = y.

complex_ode.set_integrator(name,**integrator_params)

Set integrator by name.

Parameters name : str

Name of the integrator

integrator_params :

Additional parameters for the integrator.

complex_ode.set_jac_params(*args)

Set extra parameters for user-supplied function jac.

complex_ode.set_solout(solout)

Set callable to be called at every successful integration step.

Parameters solout : callable

solout(t, y) is called at each internal integrator step, t is a scalar providing

the current independent position y is the current soloution y.shape == (n,)

solout should return -1 to stop integration otherwise it should return None or 0

complex_ode.successful()

Check if integration was successful.

272 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

5.7 Interpolation (scipy.interpolate)

Sub-package for objects used in interpolation.

As listed below, this sub-package contains spline functions and classes, one-dimensional and multi-dimensional (uni-

variate and multivariate) interpolation classes, Lagrange and Taylor polynomial interpolators, and wrappers for FIT-

PACK and DFITPACK functions.

5.7.1 Univariate interpolation

interp1d(x, y[, kind, axis, copy, ...]) Interpolate a 1-D function.

BarycentricInterpolator(xi[, yi, axis]) The interpolating polynomial for a set of points

KroghInterpolator(xi, yi[, axis]) Interpolating polynomial for a set of points.

PiecewisePolynomial(xi, yi[, orders, ...]) Piecewise polynomial curve speciﬁed by points and derivatives

PchipInterpolator(x, y[, axis]) PCHIP 1-d monotonic cubic interpolation

barycentric_interpolate(xi, yi, x[, axis]) Convenience function for polynomial interpolation.

krogh_interpolate(xi, yi, x[, der, axis]) Convenience function for polynomial interpolation.

piecewise_polynomial_interpolate(xi, yi, x) Convenience function for piecewise polynomial interpolation.

pchip_interpolate(xi, yi, x[, der, axis]) Convenience function for pchip interpolation.

class scipy.interpolate.interp1d(x,y,kind=’linear’,axis=-1,copy=True,bounds_error=True,

ﬁll_value=np.nan)

Interpolate a 1-D function.

xand yare arrays of values used to approximate some function f: y = f(x). This class returns a function

whose call method uses interpolation to ﬁnd the value of new points.

Parameters x: (N,) array_like

A 1-D array of monotonically increasing real values.

y: (...,N,...) array_like

A N-D array of real values. The length of yalong the interpolation axis must be equal

to the length of x.

kind : str or int, optional

Speciﬁes the kind of interpolation as a string (‘linear’, ‘nearest’, ‘zero’, ‘slinear’,

‘quadratic, ‘cubic’ where ‘slinear’, ‘quadratic’ and ‘cubic’ refer to a spline interpola-

tion of ﬁrst, second or third order) or as an integer specifying the order of the spline

interpolator to use. Default is ‘linear’.

axis : int, optional

Speciﬁes the axis of yalong which to interpolate. Interpolation defaults to the last axis

of y.

copy : bool, optional

If True, the class makes internal copies of x and y. If False, references to xand yare

used. The default is to copy.

bounds_error : bool, optional

If True, a ValueError is raised any time interpolation is attempted on a value outside

of the range of x (where extrapolation is necessary). If False, out of bounds values are

assigned ﬁll_value. By default, an error is raised.

ﬁll_value : ﬂoat, optional

If provided, then this value will be used to ﬁll in for requested points outside of the

data range. If not provided, then the default is NaN.

5.7. Interpolation (scipy.interpolate) 273

SciPy Reference Guide, Release 0.13.0

See Also

UnivariateSpline

A more recent wrapper of the FITPACK routines.

splrep,splev,interp2d

Examples

>>> from scipy import interpolate

>>> x=np.arange(0,10)

>>> y=np.exp(-x/3.0)

>>> f=interpolate.interp1d(x, y)

>>> xnew =np.arange(0,9,0.1)

>>> ynew =f(xnew) # use interpolation function returned by ‘interp1d‘

>>> plt.plot(x, y, ’o’, xnew, ynew, ’-’)

>>> plt.show()

Methods

__call__(x) Evaluate the interpolant

interp1d.__call__(x)

Evaluate the interpolant

Parameters x: array-like

Points to evaluate the interpolant at.

Returns y: array-like

Interpolated values. Shape is determined by replacing the interpolation axis in

the original array with the shape of x.

class scipy.interpolate.BarycentricInterpolator(xi,yi=None,axis=0)

The interpolating polynomial for a set of points

Constructs a polynomial that passes through a given set of points. Allows evaluation of the polynomial, efﬁcient

changing of the y values to be interpolated, and updating by adding more x values. For reasons of numerical

stability, this function does not compute the coefﬁcients of the polynomial.

The values yi need to be provided before the function is evaluated, but none of the preprocessing depends on

them, so rapid updates are possible.

Parameters xi : array-like

1-d array of x coordinates of the points the polynomial should pass through

yi : array-like

The y coordinates of the points the polynomial should pass through. If None, the y

values will be supplied later via the set_y method.

axis : int, optional

Axis in the yi array corresponding to the x-coordinate values.

Notes

This class uses a “barycentric interpolation” method that treats the problem as a special case of rational function

interpolation. This algorithm is quite stable, numerically, but even in a world of exact computation, unless the

x coordinates are chosen very carefully - Chebyshev zeros (e.g. cos(i*pi/n)) are a good choice - polynomial

interpolation itself is a very ill-conditioned process due to the Runge phenomenon.

Based on Berrut and Trefethen 2004, “Barycentric Lagrange Interpolation”.

274 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Methods

__call__(x) Evaluate the interpolating polynomial at the points x

add_xi(xi[, yi]) Add more x values to the set to be interpolated

set_yi(yi[, axis]) Update the y values to be interpolated

BarycentricInterpolator.__call__(x)

Evaluate the interpolating polynomial at the points x

Parameters x: array-like

Points to evaluate the interpolant at.

Returns y: array-like

Interpolated values. Shape is determined by replacing the interpolation axis in

the original array with the shape of x.

Notes

Currently the code computes an outer product between x and the weights, that is, it constructs an interme-

diate array of size N by len(x), where N is the degree of the polynomial.

BarycentricInterpolator.add_xi(xi,yi=None)

Add more x values to the set to be interpolated

The barycentric interpolation algorithm allows easy updating by adding more points for the polynomial

to pass through.

Parameters xi : array_like

The x coordinates of the points that the polynomial should pass through.

yi : array_like, optional

The y coordinates of the points the polynomial should pass through. Should have

shape (xi.size, R); if R > 1 then the polynomial is vector-valued. If yi is

not given, the y values will be supplied later. yi should be given if and only if the

interpolator has y values speciﬁed.

BarycentricInterpolator.set_yi(yi,axis=None)

Update the y values to be interpolated

The barycentric interpolation algorithm requires the calculation of weights, but these depend only on the

xi. The yi can be changed at any time.

Parameters yi : array_like

The y coordinates of the points the polynomial should pass through. If None, the

y values will be supplied later.

axis : int, optional

Axis in the yi array corresponding to the x-coordinate values.

class scipy.interpolate.KroghInterpolator(xi,yi,axis=0)

Interpolating polynomial for a set of points.

The polynomial passes through all the pairs (xi,yi). One may additionally specify a number of derivatives at

each point xi; this is done by repeating the value xi and specifying the derivatives as successive yi values.

Allows evaluation of the polynomial and all its derivatives. For reasons of numerical stability, this function does

not compute the coefﬁcients of the polynomial, although they can be obtained by evaluating all the derivatives.

Parameters xi : array-like, length N

Known x-coordinates. Must be sorted in increasing order.

yi : array-like

5.7. Interpolation (scipy.interpolate) 275

SciPy Reference Guide, Release 0.13.0

Known y-coordinates. When an xi occurs two or more times in a row, the correspond-

ing yi’s represent derivative values.

axis : int, optional

Axis in the yi array corresponding to the x-coordinate values.

Notes

Be aware that the algorithms implemented here are not necessarily the most numerically stable known. More-

over, even in a world of exact computation, unless the x coordinates are chosen very carefully - Chebyshev

zeros (e.g. cos(i*pi/n)) are a good choice - polynomial interpolation itself is a very ill-conditioned process due

to the Runge phenomenon. In general, even with well-chosen x values, degrees higher than about thirty cause

problems with numerical instability in this code.

Based on [R33].

References

[R33]

Examples

To produce a polynomial that is zero at 0 and 1 and has derivative 2 at 0, call

>>> KroghInterpolator([0,0,1],[0,2,0])

This constructs the quadratic 2*X**2-2*X. The derivative condition is indicated by the repeated zero in the xi

array; the corresponding yi values are 0, the function value, and 2, the derivative value.

For another example, given xi, yi, and a derivative ypi for each point, appropriate arrays can be constructed as:

>>> xi_k, yi_k =np.repeat(xi, 2), np.ravel(np.dstack((yi,ypi)))

>>> KroghInterpolator(xi_k, yi_k)

To produce a vector-valued polynomial, supply a higher-dimensional array for yi:

>>> KroghInterpolator([0,1],[[2,3],[4,5]])

This constructs a linear polynomial giving (2,3) at 0 and (4,5) at 1.

Methods

__call__(x) Evaluate the interpolant

derivative(x[, der]) Evaluate one derivative of the polynomial at the point x

derivatives(x[, der]) Evaluate many derivatives of the polynomial at the point x

KroghInterpolator.__call__(x)

Evaluate the interpolant

Parameters x: array-like

Points to evaluate the interpolant at.

Returns y: array-like

Interpolated values. Shape is determined by replacing the interpolation axis in

the original array with the shape of x.

KroghInterpolator.derivative(x,der=1)

Evaluate one derivative of the polynomial at the point x

Parameters x: array-like

Point or points at which to evaluate the derivatives

276 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

der : integer, optional

Which derivative to extract. This number includes the function value as 0th

derivative.

Returns d: ndarray

Derivative interpolated at the x-points. Shape of d is determined by replacing the

interpolation axis in the original array with the shape of x.

Notes

This is computed by evaluating all derivatives up to the desired one (using self.derivatives()) and then

discarding the rest.

KroghInterpolator.derivatives(x,der=None)

Evaluate many derivatives of the polynomial at the point x

Produce an array of all derivative values at the point x.

Parameters x: array-like

Point or points at which to evaluate the derivatives

der : None or integer

How many derivatives to extract; None for all potentially nonzero derivatives

(that is a number equal to the number of points). This number includes the func-

tion value as 0th derivative.

Returns d: ndarray

Array with derivatives; d[j] contains the j-th derivative. Shape of d[j] is deter-

mined by replacing the interpolation axis in the original array with the shape of

Examples

>>> KroghInterpolator([0,0,0],[1,2,3]).derivatives(0)

array([1.0,2.0,3.0])

>>> KroghInterpolator([0,0,0],[1,2,3]).derivatives([0,0])

array([[1.0,1.0],

[2.0,2.0],

[3.0,3.0]])

class scipy.interpolate.PiecewisePolynomial(xi,yi,orders=None,direction=None,axis=0)

Piecewise polynomial curve speciﬁed by points and derivatives

This class represents a curve that is a piecewise polynomial. It passes through a list of points and has speciﬁed

derivatives at each point. The degree of the polynomial may vary from segment to segment, as may the number

of derivatives available. The degree should not exceed about thirty.

Appending points to the end of the curve is efﬁcient.

Parameters xi : array-like

a sorted 1-d array of x-coordinates

yi : array-like or list of array-likes

yi[i][j] is the j-th derivative known at xi[i] (for axis=0)

orders : list of integers, or integer

a list of polynomial orders, or a single universal order

direction : {None, 1, -1}

indicates whether the xi are increasing or decreasing +1 indicates increasing -1 indi-

cates decreasing None indicates that it should be deduced from the ﬁrst two xi

axis : int, optional

Axis in the yi array corresponding to the x-coordinate values.

5.7. Interpolation (scipy.interpolate) 277

SciPy Reference Guide, Release 0.13.0

Notes

If orders is None, or orders[i] is None, then the degree of the polynomial segment is exactly the degree required

to match all i available derivatives at both endpoints. If orders[i] is not None, then some derivatives will be

ignored. The code will try to use an equal number of derivatives from each end; if the total number of derivatives

needed is odd, it will prefer the rightmost endpoint. If not enough derivatives are available, an exception is raised.

Methods

__call__(x) Evaluate the interpolant

append(xi, yi[, order]) Append a single point with derivatives to the PiecewisePolynomial

derivative(x[, der]) Evaluate one derivative of the polynomial at the point x

derivatives(x[, der]) Evaluate many derivatives of the polynomial at the point x

extend(xi, yi[, orders]) Extend the PiecewisePolynomial by a list of points

PiecewisePolynomial.__call__(x)

Evaluate the interpolant

Parameters x: array-like

Points to evaluate the interpolant at.

Returns y: array-like

Interpolated values. Shape is determined by replacing the interpolation axis in

the original array with the shape of x.

PiecewisePolynomial.append(xi,yi,order=None)

Append a single point with derivatives to the PiecewisePolynomial

Parameters xi : ﬂoat

Input

yi : array_like

yi is the list of derivatives known at xi

order : integer or None

a polynomial order, or instructions to use the highest possible order

PiecewisePolynomial.derivative(x,der=1)

Evaluate one derivative of the polynomial at the point x

Parameters x: array-like

Point or points at which to evaluate the derivatives

der : integer, optional

Which derivative to extract. This number includes the function value as 0th

derivative.

Returns d: ndarray

Derivative interpolated at the x-points. Shape of d is determined by replacing the

interpolation axis in the original array with the shape of x.

Notes

This is computed by evaluating all derivatives up to the desired one (using self.derivatives()) and then

discarding the rest.

PiecewisePolynomial.derivatives(x,der=None)

Evaluate many derivatives of the polynomial at the point x

Produce an array of all derivative values at the point x.

Parameters x: array-like

Point or points at which to evaluate the derivatives

278 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

der : None or integer

How many derivatives to extract; None for all potentially nonzero derivatives

(that is a number equal to the number of points). This number includes the func-

tion value as 0th derivative.

Returns d: ndarray

Array with derivatives; d[j] contains the j-th derivative. Shape of d[j] is deter-

mined by replacing the interpolation axis in the original array with the shape of

Examples

>>> KroghInterpolator([0,0,0],[1,2,3]).derivatives(0)

array([1.0,2.0,3.0])

>>> KroghInterpolator([0,0,0],[1,2,3]).derivatives([0,0])

array([[1.0,1.0],

[2.0,2.0],

[3.0,3.0]])

PiecewisePolynomial.extend(xi,yi,orders=None)

Extend the PiecewisePolynomial by a list of points

Parameters xi : array_like

A sorted list of x-coordinates.

yi : list of lists of length N1

yi[i] (if axis == 0) is the list of derivatives known at xi[i].

orders : int or list of ints

A list of polynomial orders, or a single universal order.

direction : {None, 1, -1}

Indicates whether the xi are increasing or decreasing.

+1 indicates increasing

-1 indicates decreasing

None indicates that it should be deduced from the ﬁrst two xi.

class scipy.interpolate.PchipInterpolator(x,y,axis=0)

PCHIP 1-d monotonic cubic interpolation

x and y are arrays of values used to approximate some function f, with y = f(x). The interpolant uses

monotonic cubic splines to ﬁnd the value of new points.

Parameters x: ndarray

A 1-D array of monotonically increasing real values. xcannot include duplicate values

(otherwise f is overspeciﬁed)

y: ndarray

A 1-D array of real values. y‘s length along the interpolation axis must be equal to the

length of x.

axis : int, optional

Axis in the yi array corresponding to the x-coordinate values.

Notes

Assumes x is sorted in monotonic order (e.g. x[1] > x[0]).

Methods

__call__(x) Evaluate the interpolant

append(xi, yi[, order]) Append a single point with derivatives to the PiecewisePolynomial

Continued on next page

5.7. Interpolation (scipy.interpolate) 279

SciPy Reference Guide, Release 0.13.0

Table 5.29 – continued from previous page

derivative(x[, der]) Evaluate one derivative of the polynomial at the point x

derivatives(x[, der]) Evaluate many derivatives of the polynomial at the point x

extend(xi, yi[, orders]) Extend the PiecewisePolynomial by a list of points

PchipInterpolator.__call__(x)

Evaluate the interpolant

Parameters x: array-like

Points to evaluate the interpolant at.

Returns y: array-like

Interpolated values. Shape is determined by replacing the interpolation axis in

the original array with the shape of x.

PchipInterpolator.append(xi,yi,order=None)

Append a single point with derivatives to the PiecewisePolynomial

Parameters xi : ﬂoat

Input

yi : array_like

yi is the list of derivatives known at xi

order : integer or None

a polynomial order, or instructions to use the highest possible order

PchipInterpolator.derivative(x,der=1)

Evaluate one derivative of the polynomial at the point x

Parameters x: array-like

Point or points at which to evaluate the derivatives

der : integer, optional

Which derivative to extract. This number includes the function value as 0th

derivative.

Returns d: ndarray

Derivative interpolated at the x-points. Shape of d is determined by replacing the

interpolation axis in the original array with the shape of x.

Notes

This is computed by evaluating all derivatives up to the desired one (using self.derivatives()) and then

discarding the rest.

PchipInterpolator.derivatives(x,der=None)

Evaluate many derivatives of the polynomial at the point x

Produce an array of all derivative values at the point x.

Parameters x: array-like

Point or points at which to evaluate the derivatives

der : None or integer

How many derivatives to extract; None for all potentially nonzero derivatives

(that is a number equal to the number of points). This number includes the func-

tion value as 0th derivative.

Returns d: ndarray

Array with derivatives; d[j] contains the j-th derivative. Shape of d[j] is deter-

mined by replacing the interpolation axis in the original array with the shape of

280 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Examples

>>> KroghInterpolator([0,0,0],[1,2,3]).derivatives(0)

array([1.0,2.0,3.0])

>>> KroghInterpolator([0,0,0],[1,2,3]).derivatives([0,0])

array([[1.0,1.0],

[2.0,2.0],

[3.0,3.0]])

PchipInterpolator.extend(xi,yi,orders=None)

Extend the PiecewisePolynomial by a list of points

Parameters xi : array_like

A sorted list of x-coordinates.

yi : list of lists of length N1

yi[i] (if axis == 0) is the list of derivatives known at xi[i].

orders : int or list of ints

A list of polynomial orders, or a single universal order.

direction : {None, 1, -1}

Indicates whether the xi are increasing or decreasing.

+1 indicates increasing

-1 indicates decreasing

None indicates that it should be deduced from the ﬁrst two xi.

scipy.interpolate.barycentric_interpolate(xi,yi,x,axis=0)

Convenience function for polynomial interpolation.

Constructs a polynomial that passes through a given set of points, then evaluates the polynomial. For reasons of

numerical stability, this function does not compute the coefﬁcients of the polynomial.

This function uses a “barycentric interpolation” method that treats the problem as a special case of rational

function interpolation. This algorithm is quite stable, numerically, but even in a world of exact computation,

unless the xcoordinates are chosen very carefully - Chebyshev zeros (e.g. cos(i*pi/n)) are a good choice -

polynomial interpolation itself is a very ill-conditioned process due to the Runge phenomenon.

Parameters xi : array_like

1-d array of x coordinates of the points the polynomial should pass through

yi : array_like

The y coordinates of the points the polynomial should pass through.

x: scalar or array_like

Points to evaluate the interpolator at.

axis : int, optional

Axis in the yi array corresponding to the x-coordinate values.

Returns y: scalar or array_like

Interpolated values. Shape is determined by replacing the interpolation axis in the

original array with the shape of x.

See Also

BarycentricInterpolator

Notes

Construction of the interpolation weights is a relatively slow process. If you want to call this many times with

the same xi (but possibly varying yi or x) you should use the class BarycentricInterpolator. This is

what this function uses internally.

scipy.interpolate.krogh_interpolate(xi,yi,x,der=0,axis=0)

Convenience function for polynomial interpolation.

5.7. Interpolation (scipy.interpolate) 281

SciPy Reference Guide, Release 0.13.0

See KroghInterpolator for more details.

Parameters xi : array_like

Known x-coordinates.

yi : array_like

Known y-coordinates, of shape (xi.size, R). Interpreted as vectors of length R,

or scalars if R=1.

x: array_like

Point or points at which to evaluate the derivatives.

der : int or list

How many derivatives to extract; None for all potentially nonzero derivatives (that is a

number equal to the number of points), or a list of derivatives to extract. This number

includes the function value as 0th derivative.

axis : int, optional

Axis in the yi array corresponding to the x-coordinate values.

Returns d: ndarray

If the interpolator’s values are R-dimensional then the returned array will be the num-

ber of derivatives by N by R. If xis a scalar, the middle dimension will be dropped; if

the yi are scalars then the last dimension will be dropped.

See Also

KroghInterpolator

Notes

Construction of the interpolating polynomial is a relatively expensive process. If you want to evaluate it repeat-

edly consider using the class KroghInterpolator (which is what this function uses).

scipy.interpolate.piecewise_polynomial_interpolate(xi,yi,x,orders=None,der=0,

axis=0)

Convenience function for piecewise polynomial interpolation.

Parameters xi : array_like

A sorted list of x-coordinates.

yi : list of lists

yi[i] is the list of derivatives known at xi[i].

x: scalar or array_like

Coordinates at which to evalualte the polynomial.

orders : int or list of ints, optional

A list of polynomial orders, or a single universal order.

der : int or list

How many derivatives to extract; None for all potentially nonzero derivatives (that is a

number equal to the number of points), or a list of derivatives to extract. This number

includes the function value as 0th derivative.

axis : int, optional

Axis in the yi array corresponding to the x-coordinate values.

Returns y: ndarray

Interpolated values or derivatives. If multiple derivatives were requested, these are

given along the ﬁrst axis.

See Also

PiecewisePolynomial

282 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Notes

If orders is None, or orders[i] is None, then the degree of the polynomial segment is exactly the degree

required to match all i available derivatives at both endpoints. If orders[i] is not None, then some derivatives

will be ignored. The code will try to use an equal number of derivatives from each end; if the total number of

derivatives needed is odd, it will prefer the rightmost endpoint. If not enough derivatives are available, an

exception is raised.

Construction of these piecewise polynomials can be an expensive process; if you repeatedly evaluate the same

polynomial, consider using the class PiecewisePolynomial (which is what this function does).

scipy.interpolate.pchip_interpolate(xi,yi,x,der=0,axis=0)

Convenience function for pchip interpolation.

See PchipInterpolator for details.

Parameters xi : array_like

A sorted list of x-coordinates, of length N.

yi : list of lists

yi[i] is the list of derivatives known at xi[i]. Of length N.

x: scalar or array_like

Of length M.

der : integer or list

How many derivatives to extract; None for all potentially nonzero derivatives (that is a

number equal to the number of points), or a list of derivatives to extract. This number

includes the function value as 0th derivative.

axis : int, optional

Axis in the yi array corresponding to the x-coordinate values.

Returns y: scalar or array_like

The result, of length R or length M or M by R,

See Also

PchipInterpolator

5.7.2 Multivariate interpolation

Unstructured data:

griddata(points, values, xi[, method, ...]) Interpolate unstructured N-dimensional data.

LinearNDInterpolator(points, values[, ...]) Piecewise linear interpolant in N dimensions.

NearestNDInterpolator(points, values) Nearest-neighbour interpolation in N dimensions.

CloughTocher2DInterpolator(points, values[, tol]) Piecewise cubic, C1 smooth, curvature-minimizing interpolant in 2D.

Rbf(*args) A class for radial basis function approximation/interpolation of n-dimensional scattered data.

interp2d(x, y, z[, kind, copy, ...]) Interpolate over a 2-D grid.

scipy.interpolate.griddata(points,values,xi,method=’linear’,ﬁll_value=nan)

Interpolate unstructured N-dimensional data. New in version 0.9.

Parameters points : ndarray of ﬂoats, shape (N, ndim)

Data point coordinates. Can either be an array of size (N, ndim), or a tuple of ndim

arrays.

values : ndarray of ﬂoat or complex, shape (N,)

Data values.

xi : ndarray of ﬂoat, shape (M, ndim)

Points at which to interpolate data.

5.7. Interpolation (scipy.interpolate) 283

SciPy Reference Guide, Release 0.13.0

method : {‘linear’, ‘nearest’, ‘cubic’}, optional

Method of interpolation. One of

nearest return the value at the data point closest to the point of interpolation.

See NearestNDInterpolator for more details.

linear tesselate the input point set to n-dimensional simplices, and interpolate

linearly on each simplex. See LinearNDInterpolator for more

details.

cubic (1-D) return the value determined from a cubic spline.

cubic (2-D) return the value determined from a piecewise cubic, continuously dif-

ferentiable (C1), and approximately curvature-minimizing polynomial

surface. See CloughTocher2DInterpolator for more details.

ﬁll_value : ﬂoat, optional

Value used to ﬁll in for requested points outside of the convex hull of the input points.

If not provided, then the default is nan. This option has no effect for the ‘nearest’

method.

Examples

Suppose we want to interpolate the 2-D function

>>> def func(x, y):

>>> return x*(1-x)*np.cos(4*np.pi*x) *np.sin(4*np.pi*y**2)**2

on a grid in [0, 1]x[0, 1]

>>> grid_x, grid_y =np.mgrid[0:1:100j,0:1:200j]

but we only know its values at 1000 data points:

>>> points =np.random.rand(1000,2)

>>> values =func(points[:,0], points[:,1])

This can be done with griddata – below we try out all of the interpolation methods:

>>> from scipy.interpolate import griddata

>>> grid_z0 =griddata(points, values, (grid_x, grid_y), method=’nearest’)

>>> grid_z1 =griddata(points, values, (grid_x, grid_y), method=’linear’)

>>> grid_z2 =griddata(points, values, (grid_x, grid_y), method=’cubic’)

One can see that the exact result is reproduced by all of the methods to some degree, but for this smooth function

the piecewise cubic interpolant gives the best results:

>>> import matplotlib.pyplot as plt

>>> plt.subplot(221)

>>> plt.imshow(func(grid_x, grid_y).T, extent=(0,1,0,1), origin=’lower’)

>>> plt.plot(points[:,0], points[:,1], ’k.’, ms=1)

>>> plt.title(’Original’)

>>> plt.subplot(222)

>>> plt.imshow(grid_z0.T, extent=(0,1,0,1), origin=’lower’)

>>> plt.title(’Nearest’)

>>> plt.subplot(223)

>>> plt.imshow(grid_z1.T, extent=(0,1,0,1), origin=’lower’)

>>> plt.title(’Linear’)

>>> plt.subplot(224)

>>> plt.imshow(grid_z2.T, extent=(0,1,0,1), origin=’lower’)

>>> plt.title(’Cubic’)

>>> plt.gcf().set_size_inches(6,6)

>>> plt.show()

284 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

class scipy.interpolate.LinearNDInterpolator(points,values,ﬁll_value=np.nan)

Piecewise linear interpolant in N dimensions. New in version 0.9.

Parameters points : ndarray of ﬂoats, shape (npoints, ndims); or Delaunay

Data point coordinates, or a precomputed Delaunay triangulation.

values : ndarray of ﬂoat or complex, shape (npoints, ...)

Data values.

ﬁll_value : ﬂoat, optional

Value used to ﬁll in for requested points outside of the convex hull of the input points.

If not provided, then the default is nan.

Notes

The interpolant is constructed by triangulating the input data with Qhull [R34], and on each triangle performing

linear barycentric interpolation.

References

[R34]

5.7. Interpolation (scipy.interpolate) 285

SciPy Reference Guide, Release 0.13.0

Methods

__call__(xi) Evaluate interpolator at given points.

LinearNDInterpolator.__call__(xi)

Evaluate interpolator at given points.

Parameters xi : ndarray of ﬂoat, shape (..., ndim)

Points where to interpolate data at.

class scipy.interpolate.NearestNDInterpolator(points,values)

Nearest-neighbour interpolation in N dimensions. New in version 0.9.

Parameters points : (Npoints, Ndims) ndarray of ﬂoats

Data point coordinates.

values : (Npoints,) ndarray of ﬂoat or complex

Data values.

Notes

Uses scipy.spatial.cKDTree

Methods

__call__(*args) Evaluate interpolator at given points.

NearestNDInterpolator.__call__(*args)

Evaluate interpolator at given points.

Parameters xi : ndarray of ﬂoat, shape (..., ndim)

Points where to interpolate data at.

class scipy.interpolate.CloughTocher2DInterpolator(points,values,tol=1e-6)

Piecewise cubic, C1 smooth, curvature-minimizing interpolant in 2D. New in version 0.9.

Parameters points : ndarray of ﬂoats, shape (npoints, ndims); or Delaunay

Data point coordinates, or a precomputed Delaunay triangulation.

values : ndarray of ﬂoat or complex, shape (npoints, ...)

Data values.

ﬁll_value : ﬂoat, optional

Value used to ﬁll in for requested points outside of the convex hull of the input points.

If not provided, then the default is nan.

tol : ﬂoat, optional

Absolute/relative tolerance for gradient estimation.

maxiter : int, optional

Maximum number of iterations in gradient estimation.

Notes

The interpolant is constructed by triangulating the input data with Qhull [R32], and constructing a piecewise

cubic interpolating Bezier polynomial on each triangle, using a Clough-Tocher scheme [CT]. The interpolant is

guaranteed to be continuously differentiable.

The gradients of the interpolant are chosen so that the curvature of the interpolating surface is approxima-

tively minimized. The gradients necessary for this are estimated using the global algorithm described in [Niel-

son83,Renka84]_.

286 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

References

[R32],[CT],[Nielson83],[Renka84]

Methods

__call__(xi) Evaluate interpolator at given points.

CloughTocher2DInterpolator.__call__(xi)

Evaluate interpolator at given points.

Parameters xi : ndarray of ﬂoat, shape (..., ndim)

Points where to interpolate data at.

class scipy.interpolate.Rbf(*args)

A class for radial basis function approximation/interpolation of n-dimensional scattered data.

Parameters *args : arrays

x, y, z, ..., d, where x, y, z, ... are the coordinates of the nodes and d is the array of

values at the nodes

function : str or callable, optional

The radial basis function, based on the radius, r, given by the norm (default is Eu-

clidean distance); the default is ‘multiquadric’:

’multiquadric’: sqrt((r/self.epsilon)**2 + 1)

’inverse’: 1.0/sqrt((r/self.epsilon)**2 + 1)

’gaussian’: exp(-(r/self.epsilon)**2)

’linear’: r

’cubic’: r**3

’quintic’: r**5

’thin_plate’: r**2*log(r)

If callable, then it must take 2 arguments (self, r). The epsilon parameter will be

available as self.epsilon. Other keyword arguments passed in will be available as

well.

epsilon : ﬂoat, optional

Adjustable constant for gaussian or multiquadrics functions - defaults to approximate

average distance between nodes (which is a good start).

smooth : ﬂoat, optional

Values greater than zero increase the smoothness of the approximation. 0 is for inter-

polation (default), the function will always go through the nodal points in this case.

norm : callable, optional

A function that returns the ‘distance’ between two points, with inputs as arrays of

positions (x, y, z, ...), and an output as an array of distance. E.g, the default:

def euclidean_norm(x1, x2):

return sqrt( ((x1 -x2)**2).sum(axis=0) )

which is called with x1=x1[ndims,newaxis,:] and x2=x2[ndims,:,newaxis] such that

the result is a matrix of the distances from each point in x1 to each point in x2.

Examples

>>> rbfi =Rbf(x, y, z, d) # radial basis function interpolator instance

>>> di =rbfi(xi, yi, zi) # interpolated values

Methods

5.7. Interpolation (scipy.interpolate) 287

SciPy Reference Guide, Release 0.13.0

__call__(*args)

Rbf.__call__(*args)

class scipy.interpolate.interp2d(x,y,z,kind=’linear’,copy=True,bounds_error=False,

ﬁll_value=nan)

Interpolate over a 2-D grid.

x,yand zare arrays of values used to approximate some function f: z = f(x, y). This class returns a

function whose call method uses spline interpolation to ﬁnd the value of new points.

If xand yrepresent a regular grid, consider using RectBivariateSpline.

Parameters x, y : array_like

Arrays deﬁning the data point coordinates.

If the points lie on a regular grid, xcan specify the column coordinates and ythe row

coordinates, for example:

>>> x=[0,1,2]; y =[0,3];z=[[1,2,3], [4,5,6]]

Otherwise, xand ymust specify the full coordinates for each point, for example:

>>> x=[0,1,2,0,1,2]; y =[0,0,0,3,3,3];z=[1,2,3,4,5,6]

If xand yare multi-dimensional, they are ﬂattened before use.

z: array_like

The values of the function to interpolate at the data points. If zis a multi-dimensional

array, it is ﬂattened before use. The length of a ﬂattened zarray is either len(x)*len(y)

if xand yspecify the column and row coordinates or len(z) == len(x) ==

len(y) if xand yspecify coordinates for each point.

kind : {‘linear’, ‘cubic’, ‘quintic’}, optional

The kind of spline interpolation to use. Default is ‘linear’.

copy : bool, optional

If True, the class makes internal copies of x, y and z. If False, references may be used.

The default is to copy.

bounds_error : bool, optional

If True, when interpolated values are requested outside of the domain of the input data

(x,y), a ValueError is raised. If False, then ﬁll_value is used.

ﬁll_value : number, optional

If provided, the value to use for points outside of the interpolation domain. If omitted

(None), values outside the domain are extrapolated.

See Also

RectBivariateSpline

Much faster 2D interpolation if your input data is on a grid

bisplrep,bisplev

BivariateSpline

a more recent wrapper of the FITPACK routines

interp1d one dimension version of this function

288 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Notes

The minimum number of data points required along the interpolation axis is (k+1)**2, with k=1 for linear,

k=3 for cubic and k=5 for quintic interpolation.

The interpolator is constructed by bisplrep, with a smoothing factor of 0. If more control over smoothing is

needed, bisplrep should be used directly.

Examples

Construct a 2-D grid and interpolate on it:

>>> from scipy import interpolate

>>> x=np.arange(-5.01,5.01,0.25)

>>> y=np.arange(-5.01,5.01,0.25)

>>> xx, yy =np.meshgrid(x, y)

>>> z=np.sin(xx**2+yy**2)

>>> f=interpolate.interp2d(x, y, z, kind=’cubic’)

Now use the obtained interpolation function and plot the result:

>>> xnew =np.arange(-5.01,5.01,1e-2)

>>> ynew =np.arange(-5.01,5.01,1e-2)

>>> znew =f(xnew, ynew)

>>> plt.plot(x, z[:, 0], ’ro-’, xnew, znew[:, 0], ’b-’)

>>> plt.show()

Methods

__call__(x, y[, dx, dy]) Interpolate the function.

interp2d.__call__(x,y,dx=0,dy=0)

Interpolate the function.

Parameters x: 1D array

x-coordinates of the mesh on which to interpolate.

y: 1D array

y-coordinates of the mesh on which to interpolate.

dx : int >= 0, < kx

Order of partial derivatives in x.

dy : int >= 0, < ky

Order of partial derivatives in y.

Returns z: 2D array with shape (len(y), len(x))

The interpolated values.

For data on a grid:

RectBivariateSpline(x, y, z[, bbox, kx, ky, s]) Bivariate spline approximation over a rectangular mesh.

See Also

scipy.ndimage.map_coordinates

5.7.3 1-D Splines

5.7. Interpolation (scipy.interpolate) 289

SciPy Reference Guide, Release 0.13.0

UnivariateSpline(x, y[, w, bbox, k, s]) One-dimensional smoothing spline ﬁt to a given set of data points.

InterpolatedUnivariateSpline(x, y[, w, bbox, k]) One-dimensional interpolating spline for a given set of data points.

LSQUnivariateSpline(x, y, t[, w, bbox, k]) One-dimensional spline with explicit internal knots.

class scipy.interpolate.UnivariateSpline(x, y, w=None, bbox=[None, None], k=3, s=None)

One-dimensional smoothing spline ﬁt to a given set of data points.

Fits a spline y=s(x) of degree kto the provided x,ydata. sspeciﬁes the number of knots by specifying a

smoothing condition.

Parameters x: (N,) array_like

1-D array of independent input data. Must be increasing.

y: (N,) array_like

1-D array of dependent input data, of the same length as x.

w: (N,) array_like, optional

Weights for spline ﬁtting. Must be positive. If None (default), weights are all equal.

bbox : (2,) array_like, optional

2-sequence specifying the boundary of the approximation interval. If None (default),

bbox=[x[0], x[-1]].

k: int, optional

Degree of the smoothing spline. Must be <= 5.

s: ﬂoat or None, optional

Positive smoothing factor used to choose the number of knots. Number of knots will

be increased until the smoothing condition is satisﬁed:

sum((w[i]*(y[i]-s(x[i])))**2,axis=0) <= s

If None (default), s=len(w) which should be a good value if 1/w[i] is an estimate of

the standard deviation of y[i]. If 0, spline will interpolate through all data points.

See Also

InterpolatedUnivariateSpline

Subclass with smoothing forced to 0

LSQUnivariateSpline

Subclass in which knots are user-selected instead of being set by smoothing condition

splrep An older, non object-oriented wrapping of FITPACK

splev,sproot,splint,spalde

BivariateSpline

A similar class for two-dimensional spline interpolation

Notes

The number of data points must be larger than the spline degree k.

Examples

>>> from numpy import linspace,exp

>>> from numpy.random import randn

>>> import matplotlib.pyplot as plt

>>> from scipy.interpolate import UnivariateSpline

>>> x=linspace(-3,3,100)

>>> y=exp(-x**2)+randn(100)/10

>>> s=UnivariateSpline(x, y, s=1)

>>> xs =linspace(-3,3,1000)

290 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

>>> ys =s(xs)

>>> plt.plot(x, y, ’.-’)

>>> plt.plot(xs, ys)

>>> plt.show()

xs,ys is now a smoothed, super-sampled version of the noisy gaussian x,y.

Methods

__call__(x[, nu]) Evaluate spline (or its nu-th derivative) at positions x.

antiderivative([n]) Construct a new spline representing the antiderivative of this spline.

derivative([n]) Construct a new spline representing the derivative of this spline.

derivatives(x) Return all derivatives of the spline at the point x.

get_coeffs() Return spline coefﬁcients.

get_knots() Return positions of (boundary and interior) knots of the spline.

get_residual() Return weighted sum of squared residuals of the spline

integral(a, b) Return deﬁnite integral of the spline between two given points.

roots() Return the zeros of the spline.

set_smoothing_factor(s) Continue spline computation with the given smoothing

UnivariateSpline.__call__(x,nu=0)

Evaluate spline (or its nu-th derivative) at positions x.

Note: x can be unordered but the evaluation is more efﬁcient if x is (partially) ordered.

UnivariateSpline.antiderivative(n=1)

Construct a new spline representing the antiderivative of this spline. New in version 0.13.0.

Parameters n: int, optional

Order of antiderivative to evaluate. Default: 1

Returns spline : UnivariateSpline

Spline of order k2=k+n representing the antiderivative of this spline.

See Also

splantider,derivative

Examples

>>> from scipy.interpolate import UnivariateSpline

>>> x=np.linspace(0, np.pi/2,70)

>>> y=1/np.sqrt(1-0.8*np.sin(x)**2)

>>> spl =UnivariateSpline(x, y, s=0)

The derivative is the inverse operation of the antiderivative, although some ﬂoating point error accumu-

lates:

>>> spl(1.7), spl.antiderivative().derivative()(1.7)

(array(2.1565429877197317), array(2.1565429877201865))

Antiderivative can be used to evaluate deﬁnite integrals:

>>> ispl =spl.antiderivative()

>>> ispl(np.pi/2)-ispl(0)

2.2572053588768486

This is indeed an approximation to the complete elliptic integral K(m) = Rπ/2

0[1 −msin2x]−1/2dx:

5.7. Interpolation (scipy.interpolate) 291

SciPy Reference Guide, Release 0.13.0

>>> from scipy.special import ellipk

>>> ellipk(0.8)

2.2572053268208538

UnivariateSpline.derivative(n=1)

Construct a new spline representing the derivative of this spline. New in version 0.13.0.

Parameters n: int, optional

Order of derivative to evaluate. Default: 1

Returns spline : UnivariateSpline

Spline of order k2=k-n representing the derivative of this spline.

See Also

splder,antiderivative

Examples

This can be used for ﬁnding maxima of a curve:

>>> from scipy.interpolate import UnivariateSpline

>>> x=np.linspace(0,10,70)

>>> y=np.sin(x)

>>> spl =UnivariateSpline(x, y, k=4, s=0)

Now, differentiate the spline and ﬁnd the zeros of the derivative. (NB: sproot only works for order 3

splines, so we ﬁt an order 4 spline):

>>> spl.derivative().roots() /np.pi

array([ 0.50000001, 1.5 , 2.49999998])

This agrees well with roots π/2 + nπ of cos(x) = sin’(x).

UnivariateSpline.derivatives(x)

Return all derivatives of the spline at the point x.

UnivariateSpline.get_coeffs()

Return spline coefﬁcients.

UnivariateSpline.get_knots()

Return positions of (boundary and interior) knots of the spline.

UnivariateSpline.get_residual()

Return weighted sum of squared residuals of the spline approximation: sum((w[i] *

(y[i]-s(x[i])))**2, axis=0).

UnivariateSpline.integral(a,b)

Return deﬁnite integral of the spline between two given points.

UnivariateSpline.roots()

Return the zeros of the spline.

Restriction: only cubic splines are supported by ﬁtpack.

UnivariateSpline.set_smoothing_factor(s)

Continue spline computation with the given smoothing factor s and with the knots found at the last call.

class scipy.interpolate.InterpolatedUnivariateSpline(x, y, w=None, bbox=[None, None],

k=3)

One-dimensional interpolating spline for a given set of data points.

Fits a spline y=s(x) of degree kto the provided x,ydata. Spline function passes through all provided points.

Equivalent to UnivariateSpline with s=0.

292 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Parameters x: (N,) array_like

Input dimension of data points – must be increasing

y: (N,) array_like

input dimension of data points

w: (N,) array_like, optional

Weights for spline ﬁtting. Must be positive. If None (default), weights are all equal.

bbox : (2,) array_like, optional

2-sequence specifying the boundary of the approximation interval. If None (default),

bbox=[x[0],x[-1]].

k: int, optional

Degree of the smoothing spline. Must be 1 <= k<= 5.

See Also

UnivariateSpline

Superclass – allows knots to be selected by a smoothing condition

LSQUnivariateSpline

spline for which knots are user-selected

splrep An older, non object-oriented wrapping of FITPACK

splev,sproot,splint,spalde

BivariateSpline

A similar class for two-dimensional spline interpolation

Notes

The number of data points must be larger than the spline degree k.

Examples

>>> from numpy import linspace,exp

>>> from numpy.random import randn

>>> from scipy.interpolate import InterpolatedUnivariateSpline

>>> import matplotlib.pyplot as plt

>>> x=linspace(-3,3,100)

>>> y=exp(-x**2)+randn(100)/10

>>> s=InterpolatedUnivariateSpline(x, y)

>>> xs =linspace(-3,3,1000)

>>> ys =s(xs)

>>> plt.plot(x, y, ’.-’)

>>> plt.plot(xs, ys)

>>> plt.show()

xs,ys is now a smoothed, super-sampled version of the noisy gaussian x,y

Methods

__call__(x[, nu]) Evaluate spline (or its nu-th derivative) at positions x.

antiderivative([n]) Construct a new spline representing the antiderivative of this spline.

derivative([n]) Construct a new spline representing the derivative of this spline.

derivatives(x) Return all derivatives of the spline at the point x.

get_coeffs() Return spline coefﬁcients.

get_knots() Return positions of (boundary and interior) knots of the spline.

get_residual() Return weighted sum of squared residuals of the spline

Continued on next page

5.7. Interpolation (scipy.interpolate) 293

SciPy Reference Guide, Release 0.13.0

Table 5.39 – continued from previous page

integral(a, b) Return deﬁnite integral of the spline between two given points.

roots() Return the zeros of the spline.

set_smoothing_factor(s) Continue spline computation with the given smoothing

InterpolatedUnivariateSpline.__call__(x,nu=0)

Evaluate spline (or its nu-th derivative) at positions x.

Note: x can be unordered but the evaluation is more efﬁcient if x is (partially) ordered.

InterpolatedUnivariateSpline.antiderivative(n=1)

Construct a new spline representing the antiderivative of this spline. New in version 0.13.0.

Parameters n: int, optional

Order of antiderivative to evaluate. Default: 1

Returns spline : UnivariateSpline

Spline of order k2=k+n representing the antiderivative of this spline.

See Also

splantider,derivative

Examples

>>> from scipy.interpolate import UnivariateSpline

>>> x=np.linspace(0, np.pi/2,70)

>>> y=1/np.sqrt(1-0.8*np.sin(x)**2)

>>> spl =UnivariateSpline(x, y, s=0)

The derivative is the inverse operation of the antiderivative, although some ﬂoating point error accumu-

lates:

>>> spl(1.7), spl.antiderivative().derivative()(1.7)

(array(2.1565429877197317), array(2.1565429877201865))

Antiderivative can be used to evaluate deﬁnite integrals:

>>> ispl =spl.antiderivative()

>>> ispl(np.pi/2)-ispl(0)

2.2572053588768486

This is indeed an approximation to the complete elliptic integral K(m) = Rπ/2

0[1 −msin2x]−1/2dx:

>>> from scipy.special import ellipk

>>> ellipk(0.8)

2.2572053268208538

InterpolatedUnivariateSpline.derivative(n=1)

Construct a new spline representing the derivative of this spline. New in version 0.13.0.

Parameters n: int, optional

Order of derivative to evaluate. Default: 1

Returns spline : UnivariateSpline

Spline of order k2=k-n representing the derivative of this spline.

See Also

splder,antiderivative

294 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Examples

This can be used for ﬁnding maxima of a curve:

>>> from scipy.interpolate import UnivariateSpline

>>> x=np.linspace(0,10,70)

>>> y=np.sin(x)

>>> spl =UnivariateSpline(x, y, k=4, s=0)

Now, differentiate the spline and ﬁnd the zeros of the derivative. (NB: sproot only works for order 3

splines, so we ﬁt an order 4 spline):

>>> spl.derivative().roots() /np.pi

array([ 0.50000001, 1.5 , 2.49999998])

This agrees well with roots π/2 + nπ of cos(x) = sin’(x).

InterpolatedUnivariateSpline.derivatives(x)

Return all derivatives of the spline at the point x.

InterpolatedUnivariateSpline.get_coeffs()

Return spline coefﬁcients.

InterpolatedUnivariateSpline.get_knots()

Return positions of (boundary and interior) knots of the spline.

InterpolatedUnivariateSpline.get_residual()

Return weighted sum of squared residuals of the spline approximation: sum((w[i] *

(y[i]-s(x[i])))**2, axis=0).

InterpolatedUnivariateSpline.integral(a,b)

Return deﬁnite integral of the spline between two given points.

InterpolatedUnivariateSpline.roots()

Return the zeros of the spline.

Restriction: only cubic splines are supported by ﬁtpack.

InterpolatedUnivariateSpline.set_smoothing_factor(s)

Continue spline computation with the given smoothing factor s and with the knots found at the last call.

class scipy.interpolate.LSQUnivariateSpline(x, y, t, w=None, bbox=[None, None], k=3)

One-dimensional spline with explicit internal knots.

Fits a spline y=s(x) of degree kto the provided x,ydata. tspeciﬁes the internal knots of the spline

Parameters x: (N,) array_like

Input dimension of data points – must be increasing

y: (N,) array_like

Input dimension of data points

t: (M,) array_like

interior knots of the spline. Must be in ascending order and bbox[0]<t[0]<...<t[-

1]<bbox[-1]

w: (N,) array_like, optional

weights for spline ﬁtting. Must be positive. If None (default), weights are all equal.

bbox : (2,) array_like, optional

2-sequence specifying the boundary of the approximation interval. If None (default),

bbox=[x[0],x[-1]].

k: int, optional

Degree of the smoothing spline. Must be 1 <= k<= 5.

Raises ValueError

5.7. Interpolation (scipy.interpolate) 295

SciPy Reference Guide, Release 0.13.0

If the interior knots do not satisfy the Schoenberg-Whitney conditions

See Also

UnivariateSpline

Superclass – knots are speciﬁed by setting a smoothing condition

InterpolatedUnivariateSpline

spline passing through all points

splrep An older, non object-oriented wrapping of FITPACK

splev,sproot,splint,spalde

BivariateSpline

A similar class for two-dimensional spline interpolation

Notes

The number of data points must be larger than the spline degree k.

Examples

>>> from numpy import linspace,exp

>>> from numpy.random import randn

>>> from scipy.interpolate import LSQUnivariateSpline

>>> import matplotlib.pyplot as plt

>>> x=linspace(-3,3,100)

>>> y=exp(-x**2)+randn(100)/10

>>> t=[-1,0,1]

>>> s=LSQUnivariateSpline(x,y,t)

>>> xs =linspace(-3,3,1000)

>>> ys =s(xs)

>>> plt.plot(x, y, ’.-’)

>>> plt.plot(xs, ys)

>>> plt.show()

xs,ys is now a smoothed, super-sampled version of the noisy gaussian x,y with knots [-3,-1,0,1,3]

Methods

__call__(x[, nu]) Evaluate spline (or its nu-th derivative) at positions x.

antiderivative([n]) Construct a new spline representing the antiderivative of this spline.

derivative([n]) Construct a new spline representing the derivative of this spline.

derivatives(x) Return all derivatives of the spline at the point x.

get_coeffs() Return spline coefﬁcients.

get_knots() Return positions of (boundary and interior) knots of the spline.

get_residual() Return weighted sum of squared residuals of the spline

integral(a, b) Return deﬁnite integral of the spline between two given points.

roots() Return the zeros of the spline.

set_smoothing_factor(s) Continue spline computation with the given smoothing

LSQUnivariateSpline.__call__(x,nu=0)

Evaluate spline (or its nu-th derivative) at positions x.

Note: x can be unordered but the evaluation is more efﬁcient if x is (partially) ordered.

LSQUnivariateSpline.antiderivative(n=1)

296 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Construct a new spline representing the antiderivative of this spline. New in version 0.13.0.

Parameters n: int, optional

Order of antiderivative to evaluate. Default: 1

Returns spline : UnivariateSpline

Spline of order k2=k+n representing the antiderivative of this spline.

See Also

splantider,derivative

Examples

>>> from scipy.interpolate import UnivariateSpline

>>> x=np.linspace(0, np.pi/2,70)

>>> y=1/np.sqrt(1-0.8*np.sin(x)**2)

>>> spl =UnivariateSpline(x, y, s=0)

The derivative is the inverse operation of the antiderivative, although some ﬂoating point error accumu-

lates:

>>> spl(1.7), spl.antiderivative().derivative()(1.7)

(array(2.1565429877197317), array(2.1565429877201865))

Antiderivative can be used to evaluate deﬁnite integrals:

>>> ispl =spl.antiderivative()

>>> ispl(np.pi/2)-ispl(0)

2.2572053588768486

This is indeed an approximation to the complete elliptic integral K(m) = Rπ/2

0[1 −msin2x]−1/2dx:

>>> from scipy.special import ellipk

>>> ellipk(0.8)

2.2572053268208538

LSQUnivariateSpline.derivative(n=1)

Construct a new spline representing the derivative of this spline. New in version 0.13.0.

Parameters n: int, optional

Order of derivative to evaluate. Default: 1

Returns spline : UnivariateSpline

Spline of order k2=k-n representing the derivative of this spline.

See Also

splder,antiderivative

Examples

This can be used for ﬁnding maxima of a curve:

>>> from scipy.interpolate import UnivariateSpline

>>> x=np.linspace(0,10,70)

>>> y=np.sin(x)

>>> spl =UnivariateSpline(x, y, k=4, s=0)

Now, differentiate the spline and ﬁnd the zeros of the derivative. (NB: sproot only works for order 3

splines, so we ﬁt an order 4 spline):

5.7. Interpolation (scipy.interpolate) 297

SciPy Reference Guide, Release 0.13.0

>>> spl.derivative().roots() /np.pi

array([ 0.50000001, 1.5 , 2.49999998])

This agrees well with roots π/2 + nπ of cos(x) = sin’(x).

LSQUnivariateSpline.derivatives(x)

Return all derivatives of the spline at the point x.

LSQUnivariateSpline.get_coeffs()

Return spline coefﬁcients.

LSQUnivariateSpline.get_knots()

Return positions of (boundary and interior) knots of the spline.

LSQUnivariateSpline.get_residual()

Return weighted sum of squared residuals of the spline approximation: sum((w[i] *

(y[i]-s(x[i])))**2, axis=0).

LSQUnivariateSpline.integral(a,b)

Return deﬁnite integral of the spline between two given points.

LSQUnivariateSpline.roots()

Return the zeros of the spline.

Restriction: only cubic splines are supported by ﬁtpack.

LSQUnivariateSpline.set_smoothing_factor(s)

Continue spline computation with the given smoothing factor s and with the knots found at the last call.

The above univariate spline classes have the following methods:

UnivariateSpline.__call__(x[, nu]) Evaluate spline (or its nu-th derivative) at positions x.

UnivariateSpline.derivatives(x) Return all derivatives of the spline at the point x.

UnivariateSpline.integral(a, b) Return deﬁnite integral of the spline between two given points.

UnivariateSpline.roots() Return the zeros of the spline.

UnivariateSpline.derivative([n]) Construct a new spline representing the derivative of this spline.

UnivariateSpline.antiderivative([n]) Construct a new spline representing the antiderivative of this spline.

UnivariateSpline.get_coeffs() Return spline coefﬁcients.

UnivariateSpline.get_knots() Return positions of (boundary and interior) knots of the spline.

UnivariateSpline.get_residual() Return weighted sum of squared residuals of the spline

UnivariateSpline.set_smoothing_factor(s) Continue spline computation with the given smoothing

Functional interface to FITPACK functions:

splrep(x, y[, w, xb, xe, k, task, s, t, ...]) Find the B-spline representation of 1-D curve.

splprep(x[, w, u, ub, ue, k, task, s, t, ...]) Find the B-spline representation of an N-dimensional curve.

splev(x, tck[, der, ext]) Evaluate a B-spline or its derivatives.

splint(a, b, tck[, full_output]) Evaluate the deﬁnite integral of a B-spline.

sproot(tck[, mest]) Find the roots of a cubic B-spline.

spalde(x, tck) Evaluate all derivatives of a B-spline.

splder(tck[, n]) Compute the spline representation of the derivative of a given spline ..

splantider(tck[, n]) Compute the spline for the antiderivative (integral) of a given spline.

bisplrep(x, y, z[, w, xb, xe, yb, ye, kx, ...]) Find a bivariate B-spline representation of a surface.

bisplev(x, y, tck[, dx, dy]) Evaluate a bivariate B-spline and its derivatives.

298 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.interpolate.splrep(x,y,w=None,xb=None,xe=None,k=3,task=0,s=None,t=None,

full_output=0,per=0,quiet=1)

Find the B-spline representation of 1-D curve.

Given the set of data points (x[i], y[i]) determine a smooth spline approximation of degree k on the

interval xb <= x <= xe.

Parameters x, y : array_like

The data points deﬁning a curve y = f(x).

w: array_like

Strictly positive rank-1 array of weights the same length as x and y. The weights

are used in computing the weighted least-squares spline ﬁt. If the errors in the y

values have standard-deviation given by the vector d, then w should be 1/d. Default is

ones(len(x)).

xb, xe : ﬂoat

The interval to ﬁt. If None, these default to x[0] and x[-1] respectively.

k: int

The order of the spline ﬁt. It is recommended to use cubic splines. Even order splines

should be avoided especially with small s values. 1 <= k <= 5

task : {1, 0, -1}

If task==0 ﬁnd t and c for a given smoothing factor, s.

If task==1 ﬁnd t and c for another value of the smoothing factor, s. There must have

been a previous call with task=0 or task=1 for the same set of data (t will be stored an

used internally)

If task=-1 ﬁnd the weighted least square spline for a given set of knots, t. These should

be interior knots as knots on the ends will be added automatically.

s: ﬂoat

A smoothing condition. The amount of smoothness is determined by satisfying the

conditions: sum((w * (y - g))**2,axis=0) <= s where g(x) is the smoothed interpola-

tion of (x,y). The user can use s to control the tradeoff between closeness and smooth-

ness of ﬁt. Larger s means more smoothing while smaller values of s indicate less

smoothing. Recommended values of s depend on the weights, w. If the weights rep-

resent the inverse of the standard-deviation of y, then a good s value should be found

in the range (m-sqrt(2*m),m+sqrt(2*m)) where m is the number of datapoints in x, y,

and w. default : s=m-sqrt(2*m) if weights are supplied. s = 0.0 (interpolating) if no

weights are supplied.

t: int

The knots needed for task=-1. If given then task is automatically set to -1.

full_output : bool

If non-zero, then return optional outputs.

per : bool

If non-zero, data points are considered periodic with period x[m-1] - x[0] and a smooth

periodic spline approximation is returned. Values of y[m-1] and w[m-1] are not used.

quiet : bool

Non-zero to suppress messages.

Returns tck : tuple

(t,c,k) a tuple containing the vector of knots, the B-spline coefﬁcients, and the degree

of the spline.

fp : array, optional

The weighted sum of squared residuals of the spline approximation.

ier : int, optional

An integer ﬂag about splrep success. Success is indicated if ier<=0. If ier in [1,2,3]

an error occurred but was not raised. Otherwise an error is raised.

msg : str, optional

A message corresponding to the integer ﬂag, ier.

5.7. Interpolation (scipy.interpolate) 299

SciPy Reference Guide, Release 0.13.0

See Also

UnivariateSpline,BivariateSpline,splprep,splev,sproot,spalde,splint,

bisplrep,bisplev

Notes

See splev for evaluation of the spline and its derivatives. Uses the FORTRAN routine curﬁt from FITPACK.

References

Based on algorithms described in [1], [2], [3], and [4]:

[R52],[R53],[R54],[R55]

Examples

>>> x=linspace(0,10,10)

>>> y=sin(x)

>>> tck =splrep(x, y)

>>> x2 =linspace(0,10,200)

>>> y2 =splev(x2, tck)

>>> plot(x, y, ’o’, x2, y2)

scipy.interpolate.splprep(x,w=None,u=None,ub=None,ue=None,k=3,task=0,s=None,

t=None,full_output=0,nest=None,per=0,quiet=1)

Find the B-spline representation of an N-dimensional curve.

Given a list of N rank-1 arrays, x, which represent a curve in N-dimensional space parametrized by u, ﬁnd a

smooth approximating spline curve g(u). Uses the FORTRAN routine parcur from FITPACK.

Parameters x: array_like

A list of sample vector arrays representing the curve.

w: array_like

Strictly positive rank-1 array of weights the same length as x[0]. The weights are

used in computing the weighted least-squares spline ﬁt. If the errors in the xval-

ues have standard-deviation given by the vector d, then wshould be 1/d. Default is

ones(len(x[0])).

u: array_like, optional

An array of parameter values. If not given, these values are calculated automatically

as M = len(x[0]), where

v[0] = 0

v[i] = v[i-1] + distance(x[i],x[i-1])

u[i] = v[i] / v[M-1]

ub, ue : int, optional

The end-points of the parameters interval. Defaults to u[0] and u[-1].

k: int, optional

Degree of the spline. Cubic splines are recommended. Even values of kshould be

avoided especially with a small s-value. 1<=k<=5, default is 3.

task : int, optional

If task==0 (default), ﬁnd t and c for a given smoothing factor, s. If task==1, ﬁnd t

and c for another value of the smoothing factor, s. There must have been a previous

call with task=0 or task=1 for the same set of data. If task=-1 ﬁnd the weighted least

square spline for a given set of knots, t.

s: ﬂoat, optional

A smoothing condition. The amount of smoothness is determined by satisfying

the conditions: sum((w *(y - g))**2,axis=0) <= s, where g(x) is the

smoothed interpolation of (x,y). The user can use sto control the trade-off between

300 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

closeness and smoothness of ﬁt. Larger smeans more smoothing while smaller val-

ues of sindicate less smoothing. Recommended values of sdepend on the weights,

w. If the weights represent the inverse of the standard-deviation of y, then a good s

value should be found in the range (m-sqrt(2*m),m+sqrt(2*m)), where m is

the number of data points in x, y, and w.

t: int, optional

The knots needed for task=-1.

full_output : int, optional

If non-zero, then return optional outputs.

nest : int, optional

An over-estimate of the total number of knots of the spline to help in determining the

storage space. By default nest=m/2. Always large enough is nest=m+k+1.

per : int, optional

If non-zero, data points are considered periodic with period x[m-1] - x[0] and a

smooth periodic spline approximation is returned. Values of y[m-1] and w[m-1]

are not used.

quiet : int, optional

Non-zero to suppress messages.

Returns tck : tuple

A tuple (t,c,k) containing the vector of knots, the B-spline coefﬁcients, and the degree

of the spline.

u: array

An array of the values of the parameter.

fp : ﬂoat

The weighted sum of squared residuals of the spline approximation.

ier : int

An integer ﬂag about splrep success. Success is indicated if ier<=0. If ier in [1,2,3]

an error occurred but was not raised. Otherwise an error is raised.

msg : str

A message corresponding to the integer ﬂag, ier.

See Also

splrep,splev,sproot,spalde,splint,bisplrep,bisplev,UnivariateSpline,

BivariateSpline

Notes

See splev for evaluation of the spline and its derivatives.

References

[R49],[R50],[R51]

scipy.interpolate.splev(x,tck,der=0,ext=0)

Evaluate a B-spline or its derivatives.

Given the knots and coefﬁcients of a B-spline representation, evaluate the value of the smoothing polynomial

and its derivatives. This is a wrapper around the FORTRAN routines splev and splder of FITPACK.

Parameters x: array_like

A 1-D array of points at which to return the value of the smoothed spline or its deriva-

tives. If tck was returned from splprep, then the parameter values, u should be

given.

tck : tuple

A sequence of length 3 returned by splrep or splprep containing the knots, co-

efﬁcients, and degree of the spline.

5.7. Interpolation (scipy.interpolate) 301

SciPy Reference Guide, Release 0.13.0

der : int

The order of derivative of the spline to compute (must be less than or equal to k).

ext : int

Controls the value returned for elements of xnot in the interval deﬁned by the knot

sequence.

•if ext=0, return the extrapolated value.

•if ext=1, return 0

•if ext=2, raise a ValueError

The default value is 0.

Returns y: ndarray or list of ndarrays

An array of values representing the spline function evaluated at the points in x. If

tck was returned from splrep, then this is a list of arrays representing the curve in

N-dimensional space.

See Also

splprep,splrep,sproot,spalde,splint,bisplrep,bisplev

References

[R44],[R45],[R46]

scipy.interpolate.splint(a,b,tck,full_output=0)

Evaluate the deﬁnite integral of a B-spline.

Given the knots and coefﬁcients of a B-spline, evaluate the deﬁnite integral of the smoothing polynomial be-

tween two given points.

Parameters a, b : ﬂoat

The end-points of the integration interval.

tck : tuple

A tuple (t,c,k) containing the vector of knots, the B-spline coefﬁcients, and the degree

of the spline (see splev).

full_output : int, optional

Non-zero to return optional output.

Returns integral : ﬂoat

The resulting integral.

wrk : ndarray

An array containing the integrals of the normalized B-splines deﬁned on the set of

knots.

See Also

splprep,splrep,sproot,spalde,splev,bisplrep,bisplev,UnivariateSpline,

BivariateSpline

References

[R47],[R48]

scipy.interpolate.sproot(tck,mest=10)

Find the roots of a cubic B-spline.

Given the knots (>=8) and coefﬁcients of a cubic B-spline return the roots of the spline.

Parameters tck : tuple

A tuple (t,c,k) containing the vector of knots, the B-spline coefﬁcients, and the degree

of the spline. The number of knots must be >= 8, and the degree must be 3. The knots

must be a montonically increasing sequence.

302 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

mest : int

An estimate of the number of zeros (Default is 10).

Returns zeros : ndarray

An array giving the roots of the spline.

See Also

splprep,splrep,splint,spalde,splev,bisplrep,bisplev,UnivariateSpline,

BivariateSpline

References

[R56],[R57],[R58]

scipy.interpolate.spalde(x,tck)

Evaluate all derivatives of a B-spline.

Given the knots and coefﬁcients of a cubic B-spline compute all derivatives up to order k at a point (or set of

points).

Parameters x: array_like

A point or a set of points at which to evaluate the derivatives. Note that t(k) <= x

<= t(n-k+1) must hold for each x.

tck : tuple

A tuple (t,c,k) containing the vector of knots, the B-spline coefﬁcients, and the degree

of the spline.

Returns results : {ndarray, list of ndarrays}

An array (or a list of arrays) containing all derivatives up to order k inclusive for each

point x.

See Also

splprep,splrep,splint,sproot,splev,bisplrep,bisplev,UnivariateSpline,

BivariateSpline

References

[R41],[R42],[R43]

scipy.interpolate.splder(tck,n=1)

Compute the spline representation of the derivative of a given spline New in version 0.13.0.

Parameters tck : tuple of (t, c, k)

Spline whose derivative to compute

n: int, optional

Order of derivative to evaluate. Default: 1

Returns tck_der : tuple of (t2, c2, k2)

Spline of order k2=k-n representing the derivative of the input spline.

See Also

splantider,splev,spalde

Examples

This can be used for ﬁnding maxima of a curve:

>>> from scipy.interpolate import splrep, splder, sproot

>>> x=np.linspace(0,10,70)

>>> y=np.sin(x)

>>> spl =splrep(x, y, k=4)

5.7. Interpolation (scipy.interpolate) 303

SciPy Reference Guide, Release 0.13.0

Now, differentiate the spline and ﬁnd the zeros of the derivative. (NB: sproot only works for order 3 splines,

so we ﬁt an order 4 spline):

>>> dspl =splder(spl)

>>> sproot(dspl) /np.pi

array([ 0.50000001, 1.5 , 2.49999998])

This agrees well with roots π/2 + nπ of cos(x) = sin0(x).

scipy.interpolate.splantider(tck,n=1)

Compute the spline for the antiderivative (integral) of a given spline. New in version 0.13.0.

Parameters tck : tuple of (t, c, k)

Spline whose antiderivative to compute

n: int, optional

Order of antiderivative to evaluate. Default: 1

Returns tck_ader : tuple of (t2, c2, k2)

Spline of order k2=k+n representing the antiderivative of the input spline.

See Also

splder,splev,spalde

Notes

The splder function is the inverse operation of this function. Namely, splder(splantider(tck)) is

identical to tck, modulo rounding error.

Examples

>>> from scipy.interpolate import splrep, splder, splantider, splev

>>> x=np.linspace(0, np.pi/2,70)

>>> y=1/np.sqrt(1-0.8*np.sin(x)**2)

>>> spl =splrep(x, y)

The derivative is the inverse operation of the antiderivative, although some ﬂoating point error accumulates:

>>> splev(1.7, spl), splev(1.7, splder(splantider(spl)))

(array(2.1565429877197317), array(2.1565429877201865))

Antiderivative can be used to evaluate deﬁnite integrals:

>>> ispl =splantider(spl)

>>> splev(np.pi/2, ispl) -splev(0, ispl)

2.2572053588768486

This is indeed an approximation to the complete elliptic integral K(m) = Rπ/2

0[1 −msin2x]−1/2dx:

>>> from scipy.special import ellipk

>>> ellipk(0.8)

2.2572053268208538

scipy.interpolate.bisplrep(x,y,z,w=None,xb=None,xe=None,yb=None,ye=None,kx=3,ky=3,

task=0,s=None,eps=1e-16,tx=None,ty=None,full_output=0,nx-

est=None,nyest=None,quiet=1)

Find a bivariate B-spline representation of a surface.

Given a set of data points (x[i], y[i], z[i]) representing a surface z=f(x,y), compute a B-spline representation of

the surface. Based on the routine SURFIT from FITPACK.

Parameters x, y, z : ndarray

304 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Rank-1 arrays of data points.

w: ndarray, optional

Rank-1 array of weights. By default w=np.ones(len(x)).

xb, xe : ﬂoat, optional

End points of approximation interval in x. By default xb = x.min(),

xe=x.max().

yb, ye : ﬂoat, optional

End points of approximation interval in y. By default yb=y.min(), ye =

y.max().

kx, ky : int, optional

The degrees of the spline (1 <= kx, ky <= 5). Third order (kx=ky=3) is recommended.

task : int, optional

If task=0, ﬁnd knots in x and y and coefﬁcients for a given smoothing factor, s. If

task=1, ﬁnd knots and coefﬁcients for another value of the smoothing factor, s. bis-

plrep must have been previously called with task=0 or task=1. If task=-1, ﬁnd coefﬁ-

cients for a given set of knots tx, ty.

s: ﬂoat, optional

A non-negative smoothing factor. If weights correspond to the inverse of the standard-

deviation of the errors in z, then a good s-value should be found in the range

(m-sqrt(2*m),m+sqrt(2*m)) where m=len(x).

eps : ﬂoat, optional

A threshold for determining the effective rank of an over-determined linear system of

equations (0 < eps < 1). eps is not likely to need changing.

tx, ty : ndarray, optional

Rank-1 arrays of the knots of the spline for task=-1

full_output : int, optional

Non-zero to return optional outputs.

nxest, nyest : int, optional

Over-estimates of the total number of knots. If None

then nxest = max(kx+sqrt(m/2),2*kx+3),nyest =

max(ky+sqrt(m/2),2*ky+3).

quiet : int, optional

Non-zero to suppress printing of messages.

Returns tck : array_like

A list [tx, ty, c, kx, ky] containing the knots (tx, ty) and coefﬁcients (c) of the bivariate

B-spline representation of the surface along with the degree of the spline.

fp : ndarray

The weighted sum of squared residuals of the spline approximation.

ier : int

An integer ﬂag about splrep success. Success is indicated if ier<=0. If ier in [1,2,3]

an error occurred but was not raised. Otherwise an error is raised.

msg : str

A message corresponding to the integer ﬂag, ier.

See Also

splprep,splrep,splint,sproot,splev,UnivariateSpline,BivariateSpline

Notes

See bisplev to evaluate the value of the B-spline given its tck representation.

References

[R38],[R39],[R40]

5.7. Interpolation (scipy.interpolate) 305

SciPy Reference Guide, Release 0.13.0

scipy.interpolate.bisplev(x,y,tck,dx=0,dy=0)

Evaluate a bivariate B-spline and its derivatives.

Return a rank-2 array of spline function values (or spline derivative values) at points given by the cross-product

of the rank-1 arrays xand y. In special cases, return an array or just a ﬂoat if either xor yor both are ﬂoats.

Based on BISPEV from FITPACK.

Parameters x, y : ndarray

Rank-1 arrays specifying the domain over which to evaluate the spline or its derivative.

tck : tuple

A sequence of length 5 returned by bisplrep containing the knot locations, the

coefﬁcients, and the degree of the spline: [tx, ty, c, kx, ky].

dx, dy : int, optional

The orders of the partial derivatives in xand yrespectively.

Returns vals : ndarray

The B-spline or its derivative evaluated over the set formed by the cross-product of x

and y.

See Also

splprep,splrep,splint,sproot,splev,UnivariateSpline,BivariateSpline

Notes

See bisplrep to generate the tck representation.

References

[R35],[R36],[R37]

5.7.4 2-D Splines

For data on a grid:

RectBivariateSpline(x, y, z[, bbox, kx, ky, s]) Bivariate spline approximation over a rectangular mesh.

RectSphereBivariateSpline(u, v, r[, s, ...]) Bivariate spline approximation over a rectangular mesh on a sphere.

class scipy.interpolate.RectBivariateSpline(x, y, z, bbox=[None, None, None, None], kx=3,

ky=3, s=0)

Bivariate spline approximation over a rectangular mesh.

Can be used for both smoothing and interpolating data.

Parameters x,y : array_like

1-D arrays of coordinates in strictly ascending order.

z: array_like

2-D array of data with shape (x.size,y.size).

bbox : array_like, optional

Sequence of length 4 specifying the boundary of the rectangular ap-

proximation domain. By default, bbox=[min(x,tx),max(x,tx),

min(y,ty),max(y,ty)].

kx, ky : ints, optional

Degrees of the bivariate spline. Default is 3.

s: ﬂoat, optional

Positive smoothing factor deﬁned for estimation condition:

sum((w[i]*(z[i]-s(x[i], y[i])))**2, axis=0) <= s Default

306 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

is s=0, which is for interpolation.

See Also

SmoothBivariateSpline

a smoothing bivariate spline for scattered data

bisplrep an older wrapping of FITPACK

bisplev an older wrapping of FITPACK

UnivariateSpline

a similar class for univariate spline interpolation

Methods

__call__(x, y[, mth]) Evaluate spline at the grid points deﬁned by the coordinate arrays

ev(xi, yi) Evaluate spline at points (x[i], y[i]), i=0,...,len(x)-1

get_coeffs() Return spline coefﬁcients.

get_knots() Return a tuple (tx,ty) where tx,ty contain knots positions of the spline with respect to x-, y-variable, respectively.

get_residual() Return weighted sum of squared residuals of the spline

integral(xa, xb, ya, yb) Evaluate the integral of the spline over area [xa,xb] x [ya,yb].

RectBivariateSpline.__call__(x,y,mth=’array’)

Evaluate spline at the grid points deﬁned by the coordinate arrays x,y.

RectBivariateSpline.ev(xi,yi)

Evaluate spline at points (x[i], y[i]), i=0,...,len(x)-1

RectBivariateSpline.get_coeffs()

Return spline coefﬁcients.

RectBivariateSpline.get_knots()

Return a tuple (tx,ty) where tx,ty contain knots positions of the spline with respect to x-, y-variable,

respectively. The position of interior and additional knots are given as

t[k+1:-k-1] and t[:k+1]=b, t[-k-1:]=e, respectively.

RectBivariateSpline.get_residual()

Return weighted sum of squared residuals of the spline approximation: sum ((w[i]*(z[i]-

s(x[i],y[i])))**2,axis=0)

RectBivariateSpline.integral(xa,xb,ya,yb)

Evaluate the integral of the spline over area [xa,xb] x [ya,yb].

Parameters xa, xb : ﬂoat

The end-points of the x integration interval.

ya, yb : ﬂoat

The end-points of the y integration interval.

Returns integ : ﬂoat

The value of the resulting integral.

class scipy.interpolate.RectSphereBivariateSpline(u,v,r,s=0.0,pole_continuity=False,

pole_values=None,pole_exact=False,

pole_ﬂat=False)

Bivariate spline approximation over a rectangular mesh on a sphere.

Can be used for smoothing data. New in version 0.11.0.

Parameters u: array_like

5.7. Interpolation (scipy.interpolate) 307

SciPy Reference Guide, Release 0.13.0

1-D array of latitude coordinates in strictly ascending order. Coordinates must be

given in radians and lie within the interval (0, pi).

v: array_like

1-D array of longitude coordinates in strictly ascending order. Coordinates must be

given in radians, and must lie within (0, 2pi).

r: array_like

2-D array of data with shape (u.size, v.size).

s: ﬂoat, optional

Positive smoothing factor deﬁned for estimation condition (s=0 is for interpolation).

pole_continuity : bool or (bool, bool), optional

Order of continuity at the poles u=0 (pole_continuity[0]) and u=pi

(pole_continuity[1]). The order of continuity at the pole will be 1 or 0 when

this is True or False, respectively. Defaults to False.

pole_values : ﬂoat or (ﬂoat, ﬂoat), optional

Data values at the poles u=0 and u=pi. Either the whole parameter or each individual

element can be None. Defaults to None.

pole_exact : bool or (bool, bool), optional

Data value exactness at the poles u=0 and u=pi. If True, the value is considered to

be the right function value, and it will be ﬁtted exactly. If False, the value will be

considered to be a data value just like the other data values. Defaults to False.

pole_ﬂat : bool or (bool, bool), optional

For the poles at u=0 and u=pi, specify whether or not the approximation has vanish-

ing derivatives. Defaults to False.

See Also

RectBivariateSpline

bivariate spline approximation over a rectangular mesh

Notes

Currently, only the smoothing spline approximation (iopt[0] = 0 and iopt[0] = 1 in the FITPACK

routine) is supported. The exact least-squares spline approximation is not implemented yet.

When actually performing the interpolation, the requested vvalues must lie within the same length 2pi interval

that the original vvalues were chosen from.

For more information, see the FITPACK site about this function.

Examples

Suppose we have global data on a coarse grid

>>> lats =np.linspace(10,170,9)*np.pi /180.

>>> lons =np.linspace(0,350,18)*np.pi /180.

>>> data =np.dot(np.atleast_2d(90. -np.linspace(-80.,80.,18)).T,

np.atleast_2d(180. - np.abs(np.linspace(0., 350., 9)))).T

We want to interpolate it to a global one-degree grid

>>> new_lats =np.linspace(1,180,180)*np.pi /180

>>> new_lons =np.linspace(1,360,360)*np.pi /180

>>> new_lats, new_lons =np.meshgrid(new_lats, new_lons)

We need to set up the interpolator object

>>> from scipy.interpolate import RectSphereBivariateSpline

>>> lut =RectSphereBivariateSpline(lats, lons, data)

308 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Finally we interpolate the data. The RectSphereBivariateSpline object only takes 1-D arrays as input,

therefore we need to do some reshaping.

>>> data_interp =lut.ev(new_lats.ravel(),

... new_lons.ravel()).reshape((360,180)).T

Looking at the original and the interpolated data, one can see that the interpolant reproduces the original data

very well:

>>> fig =plt.figure()

>>> ax1 =fig.add_subplot(211)

>>> ax1.imshow(data, interpolation=’nearest’)

>>> ax2 =fig.add_subplot(212)

>>> ax2.imshow(data_interp, interpolation=’nearest’)

>>> plt.show()

Chosing the optimal value of scan be a delicate task. Recommended values for sdepend on the accuracy of

the data values. If the user has an idea of the statistical errors on the data, she can also ﬁnd a proper estimate

for s. By assuming that, if she speciﬁes the right s, the interpolator will use a spline f(u,v) which exactly

reproduces the function underlying the data, she can evaluate sum((r(i,j)-s(u(i),v(j)))**2) to ﬁnd

a good estimate for this s. For example, if she knows that the statistical errors on her r(i,j)-values are not

greater than 0.1, she may expect that a good sshould have a value not larger than u.size *v.size *

(0.1)**2.

If nothing is known about the statistical error in r(i,j),smust be determined by trial and error. The best is

then to start with a very large value of s(to determine the least-squares polynomial and the corresponding upper

bound fp0 for s) and then to progressively decrease the value of s(say by a factor 10 in the beginning, i.e. s

= fp0 / 10, fp0 / 100, ... and more carefully as the approximation shows more detail) to obtain

closer ﬁts.

The interpolation results for different values of sgive some insight into this process:

>>> fig2 =plt.figure()

>>> s=[3e9,2e9,1e9,1e8]

>>> for ii in xrange(len(s)):

>>> lut =RectSphereBivariateSpline(lats, lons, data, s=s[ii])

>>> data_interp =lut.ev(new_lats.ravel(),

... new_lons.ravel()).reshape((360,180)).T

>>> ax =fig2.add_subplot(2,2, ii+1)

>>> ax.imshow(data_interp, interpolation=’nearest’)

>>> ax.set_title("s = %g"%s[ii])

>>> plt.show()

Methods

__call__(theta, phi) Evaluate the spline at the grid ponts deﬁned by the coordinate

ev(thetai, phii) Evaluate the spline at the points (theta[i], phi[i]),

get_coeffs() Return spline coefﬁcients.

get_knots() Return a tuple (tx,ty) where tx,ty contain knots positions of the spline with respect to x-, y-variable, respectively.

get_residual() Return weighted sum of squared residuals of the spline

RectSphereBivariateSpline.__call__(theta,phi)

Evaluate the spline at the grid ponts deﬁned by the coordinate arrays theta, phi.

RectSphereBivariateSpline.ev(thetai,phii)

Evaluate the spline at the points (theta[i], phi[i]), i=0,...,len(theta)-1

RectSphereBivariateSpline.get_coeffs()

5.7. Interpolation (scipy.interpolate) 309

SciPy Reference Guide, Release 0.13.0

Return spline coefﬁcients.

RectSphereBivariateSpline.get_knots()

Return a tuple (tx,ty) where tx,ty contain knots positions of the spline with respect to x-, y-variable,

respectively. The position of interior and additional knots are given as

t[k+1:-k-1] and t[:k+1]=b, t[-k-1:]=e, respectively.

RectSphereBivariateSpline.get_residual()

Return weighted sum of squared residuals of the spline approximation: sum ((w[i]*(z[i]-

s(x[i],y[i])))**2,axis=0)

For unstructured data:

BivariateSpline Base class for bivariate splines.

SmoothBivariateSpline(x, y, z[, w, bbox, ...]) Smooth bivariate spline approximation.

SmoothSphereBivariateSpline(theta, phi, r[, ...]) Smooth bivariate spline approximation in spherical coordinates.

LSQBivariateSpline(x, y, z, tx, ty[, w, ...]) Weighted least-squares bivariate spline approximation.

LSQSphereBivariateSpline(theta, phi, r, tt, tp) Weighted least-squares bivariate spline approximation in spherical coordinates.

class scipy.interpolate.BivariateSpline

Base class for bivariate splines.

This describes a spline s(x, y) of degrees kx and ky on the rectangle [xb, xe] *[yb, ye] calculated

from a given set of data points (x, y, z).

This class is meant to be subclassed, not instantiated directly. To construct these splines, call either

SmoothBivariateSpline or LSQBivariateSpline.

See Also

UnivariateSpline

a similar class for univariate spline interpolation

SmoothBivariateSpline

to create a BivariateSpline through the given points

LSQBivariateSpline

to create a BivariateSpline using weighted least-squares ﬁtting

SphereBivariateSpline

bivariate spline interpolation in spherical cooridinates

bisplrep older wrapping of FITPACK

bisplev older wrapping of FITPACK

Methods

__call__(x, y[, mth]) Evaluate spline at the grid points deﬁned by the coordinate arrays

ev(xi, yi) Evaluate spline at points (x[i], y[i]), i=0,...,len(x)-1

get_coeffs() Return spline coefﬁcients.

get_knots() Return a tuple (tx,ty) where tx,ty contain knots positions of the spline with respect to x-, y-variable, respectively.

get_residual() Return weighted sum of squared residuals of the spline

integral(xa, xb, ya, yb) Evaluate the integral of the spline over area [xa,xb] x [ya,yb].

BivariateSpline.__call__(x,y,mth=’array’)

Evaluate spline at the grid points deﬁned by the coordinate arrays x,y.

310 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

BivariateSpline.ev(xi,yi)

Evaluate spline at points (x[i], y[i]), i=0,...,len(x)-1

BivariateSpline.get_coeffs()

Return spline coefﬁcients.

BivariateSpline.get_knots()

Return a tuple (tx,ty) where tx,ty contain knots positions of the spline with respect to x-, y-variable,

respectively. The position of interior and additional knots are given as

t[k+1:-k-1] and t[:k+1]=b, t[-k-1:]=e, respectively.

BivariateSpline.get_residual()

Return weighted sum of squared residuals of the spline approximation: sum ((w[i]*(z[i]-

s(x[i],y[i])))**2,axis=0)

BivariateSpline.integral(xa,xb,ya,yb)

Evaluate the integral of the spline over area [xa,xb] x [ya,yb].

Parameters xa, xb : ﬂoat

The end-points of the x integration interval.

ya, yb : ﬂoat

The end-points of the y integration interval.

Returns integ : ﬂoat

The value of the resulting integral.

class scipy.interpolate.SmoothBivariateSpline(x, y, z, w=None, bbox=[None, None, None,

None], kx=3, ky=3, s=None, eps=None)

Smooth bivariate spline approximation.

Parameters x, y, z : array_like

1-D sequences of data points (order is not important).

w: array_like, optional

Positive 1-D sequence of weights, of same length as x,yand z.

bbox : array_like, optional

Sequence of length 4 specifying the boundary of the rectangular ap-

proximation domain. By default, bbox=[min(x,tx),max(x,tx),

min(y,ty),max(y,ty)].

kx, ky : ints, optional

Degrees of the bivariate spline. Default is 3.

s: ﬂoat, optional

Positive smoothing factor deﬁned for estimation condition:

sum((w[i]*(z[i]-s(x[i], y[i])))**2, axis=0) <= s Default

s=len(w) which should be a good value if 1/w[i] is an estimate of the standard

deviation of z[i].

eps : ﬂoat, optional

A threshold for determining the effective rank of an over-determined linear system of

equations. eps should have a value between 0 and 1, the default is 1e-16.

See Also

bisplrep an older wrapping of FITPACK

bisplev an older wrapping of FITPACK

UnivariateSpline

a similar class for univariate spline interpolation

LSQUnivariateSpline

to create a BivariateSpline using weighted

5.7. Interpolation (scipy.interpolate) 311

SciPy Reference Guide, Release 0.13.0

Notes

The length of x,yand zshould be at least (kx+1) *(ky+1).

Methods

__call__(x, y[, mth]) Evaluate spline at the grid points deﬁned by the coordinate arrays

ev(xi, yi) Evaluate spline at points (x[i], y[i]), i=0,...,len(x)-1

get_coeffs() Return spline coefﬁcients.

get_knots() Return a tuple (tx,ty) where tx,ty contain knots positions of the spline with respect to x-, y-variable, respectively.

get_residual() Return weighted sum of squared residuals of the spline

integral(xa, xb, ya, yb) Evaluate the integral of the spline over area [xa,xb] x [ya,yb].

SmoothBivariateSpline.__call__(x,y,mth=’array’)

Evaluate spline at the grid points deﬁned by the coordinate arrays x,y.

SmoothBivariateSpline.ev(xi,yi)

Evaluate spline at points (x[i], y[i]), i=0,...,len(x)-1

SmoothBivariateSpline.get_coeffs()

Return spline coefﬁcients.

SmoothBivariateSpline.get_knots()

Return a tuple (tx,ty) where tx,ty contain knots positions of the spline with respect to x-, y-variable,

respectively. The position of interior and additional knots are given as

t[k+1:-k-1] and t[:k+1]=b, t[-k-1:]=e, respectively.

SmoothBivariateSpline.get_residual()

Return weighted sum of squared residuals of the spline approximation: sum ((w[i]*(z[i]-

s(x[i],y[i])))**2,axis=0)

SmoothBivariateSpline.integral(xa,xb,ya,yb)

Evaluate the integral of the spline over area [xa,xb] x [ya,yb].

Parameters xa, xb : ﬂoat

The end-points of the x integration interval.

ya, yb : ﬂoat

The end-points of the y integration interval.

Returns integ : ﬂoat

The value of the resulting integral.

class scipy.interpolate.SmoothSphereBivariateSpline(theta,phi,r,w=None,s=0.0,

eps=1e-16)

Smooth bivariate spline approximation in spherical coordinates. New in version 0.11.0.

Parameters theta, phi, r : array_like

1-D sequences of data points (order is not important). Coordinates must be given in

radians. Theta must lie within the interval (0, pi), and phi must lie within the interval

(0, 2pi).

w: array_like, optional

Positive 1-D sequence of weights.

s: ﬂoat, optional

Positive smoothing factor deﬁned for estimation condition: sum((w(i)*(r(i) -

s(theta(i), phi(i))))**2, axis=0) <= s Default s=len(w) which

should be a good value if 1/w[i] is an estimate of the standard deviation of r[i].

eps : ﬂoat, optional

312 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

A threshold for determining the effective rank of an over-determined linear system of

equations. eps should have a value between 0 and 1, the default is 1e-16.

Notes

For more information, see the FITPACK site about this function.

Examples

Suppose we have global data on a coarse grid (the input data does not have to be on a grid):

>>> theta =np.linspace(0., np.pi, 7)

>>> phi =np.linspace(0.,2*np.pi, 9)

>>> data =np.empty((theta.shape[0], phi.shape[0]))

>>> data[:,0], data[0,:], data[-1,:] =0.,0.,0.

>>> data[1:-1,1], data[1:-1,-1]=1.,1.

>>> data[1,1:-1], data[-2,1:-1]=1.,1.

>>> data[2:-2,2], data[2:-2,-2]=2.,2.

>>> data[2,2:-2], data[-3,2:-2]=2.,2.

>>> data[3,3:-2]=3.

>>> data =np.roll(data, 4,1)

We need to set up the interpolator object

>>> lats, lons =np.meshgrid(theta, phi)

>>> from scipy.interpolate import SmoothSphereBivariateSpline

>>> lut =SmoothSphereBivariateSpline(lats.ravel(), lons.ravel(),

data.T.ravel(),s=3.5)

As a ﬁrst test, we’ll see what the algorithm returns when run on the input coordinates

>>> data_orig =lut(theta, phi)

Finally we interpolate the data to a ﬁner grid

>>> fine_lats =np.linspace(0., np.pi, 70)

>>> fine_lons =np.linspace(0.,2*np.pi, 90)

>>> data_smth =lut(fine_lats, fine_lons)

>>> fig =plt.figure()

>>> ax1 =fig.add_subplot(131)

>>> ax1.imshow(data, interpolation=’nearest’)

>>> ax2 =fig.add_subplot(132)

>>> ax2.imshow(data_orig, interpolation=’nearest’)

>>> ax3 =fig.add_subplot(133)

>>> ax3.imshow(data_smth, interpolation=’nearest’)

>>> plt.show()

Methods

__call__(theta, phi) Evaluate the spline at the grid ponts deﬁned by the coordinate

ev(thetai, phii) Evaluate the spline at the points (theta[i], phi[i]),

get_coeffs() Return spline coefﬁcients.

get_knots() Return a tuple (tx,ty) where tx,ty contain knots positions of the spline with respect to x-, y-variable, respectively.

get_residual() Return weighted sum of squared residuals of the spline

5.7. Interpolation (scipy.interpolate) 313

SciPy Reference Guide, Release 0.13.0

SmoothSphereBivariateSpline.__call__(theta,phi)

Evaluate the spline at the grid ponts deﬁned by the coordinate arrays theta, phi.

SmoothSphereBivariateSpline.ev(thetai,phii)

Evaluate the spline at the points (theta[i], phi[i]), i=0,...,len(theta)-1

SmoothSphereBivariateSpline.get_coeffs()

Return spline coefﬁcients.

SmoothSphereBivariateSpline.get_knots()

Return a tuple (tx,ty) where tx,ty contain knots positions of the spline with respect to x-, y-variable,

respectively. The position of interior and additional knots are given as

t[k+1:-k-1] and t[:k+1]=b, t[-k-1:]=e, respectively.

SmoothSphereBivariateSpline.get_residual()

Return weighted sum of squared residuals of the spline approximation: sum ((w[i]*(z[i]-

s(x[i],y[i])))**2,axis=0)

class scipy.interpolate.LSQBivariateSpline(x, y, z, tx, ty, w=None, bbox=[None, None, None,

None], kx=3, ky=3, eps=None)

Weighted least-squares bivariate spline approximation.

Parameters x, y, z : array_like

1-D sequences of data points (order is not important).

tx, ty : array_like

Strictly ordered 1-D sequences of knots coordinates.

w: array_like, optional

Positive 1-D array of weights, of the same length as x,yand z.

bbox : (4,) array_like, optional

Sequence of length 4 specifying the boundary of the rectangular ap-

proximation domain. By default, bbox=[min(x,tx),max(x,tx),

min(y,ty),max(y,ty)].

kx, ky : ints, optional

Degrees of the bivariate spline. Default is 3.

s: ﬂoat, optional

Positive smoothing factor deﬁned for estimation condition:

sum((w[i]*(z[i]-s(x[i], y[i])))**2, axis=0) <= s Default

s=len(w) which should be a good value if 1/w[i] is an estimate of the standard

deviation of z[i].

eps : ﬂoat, optional

A threshold for determining the effective rank of an over-determined linear system of

equations. eps should have a value between 0 and 1, the default is 1e-16.

See Also

bisplrep an older wrapping of FITPACK

bisplev an older wrapping of FITPACK

UnivariateSpline

a similar class for univariate spline interpolation

SmoothBivariateSpline

create a smoothing BivariateSpline

Notes

The length of x,yand zshould be at least (kx+1) *(ky+1).

314 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Methods

__call__(x, y[, mth]) Evaluate spline at the grid points deﬁned by the coordinate arrays

ev(xi, yi) Evaluate spline at points (x[i], y[i]), i=0,...,len(x)-1

get_coeffs() Return spline coefﬁcients.

get_knots() Return a tuple (tx,ty) where tx,ty contain knots positions of the spline with respect to x-, y-variable, respectively.

get_residual() Return weighted sum of squared residuals of the spline

integral(xa, xb, ya, yb) Evaluate the integral of the spline over area [xa,xb] x [ya,yb].

LSQBivariateSpline.__call__(x,y,mth=’array’)

Evaluate spline at the grid points deﬁned by the coordinate arrays x,y.

LSQBivariateSpline.ev(xi,yi)

Evaluate spline at points (x[i], y[i]), i=0,...,len(x)-1

LSQBivariateSpline.get_coeffs()

Return spline coefﬁcients.

LSQBivariateSpline.get_knots()

Return a tuple (tx,ty) where tx,ty contain knots positions of the spline with respect to x-, y-variable,

respectively. The position of interior and additional knots are given as

t[k+1:-k-1] and t[:k+1]=b, t[-k-1:]=e, respectively.

LSQBivariateSpline.get_residual()

Return weighted sum of squared residuals of the spline approximation: sum ((w[i]*(z[i]-

s(x[i],y[i])))**2,axis=0)

LSQBivariateSpline.integral(xa,xb,ya,yb)

Evaluate the integral of the spline over area [xa,xb] x [ya,yb].

Parameters xa, xb : ﬂoat

The end-points of the x integration interval.

ya, yb : ﬂoat

The end-points of the y integration interval.

Returns integ : ﬂoat

The value of the resulting integral.

class scipy.interpolate.LSQSphereBivariateSpline(theta,phi,r,tt,tp,w=None,eps=1e-16)

Weighted least-squares bivariate spline approximation in spherical coordinates. New in version 0.11.0.

Parameters theta, phi, r : array_like

1-D sequences of data points (order is not important). Coordinates must be given in

radians. Theta must lie within the interval (0, pi), and phi must lie within the interval

(0, 2pi).

tt, tp : array_like

Strictly ordered 1-D sequences of knots coordinates. Coordinates must satisfy 0 <

tt[i] < pi,0 < tp[i] < 2*pi.

w: array_like, optional

Positive 1-D sequence of weights, of the same length as theta,phi and r.

eps : ﬂoat, optional

A threshold for determining the effective rank of an over-determined linear system of

equations. eps should have a value between 0 and 1, the default is 1e-16.

Notes

For more information, see the FITPACK site about this function.

5.7. Interpolation (scipy.interpolate) 315

SciPy Reference Guide, Release 0.13.0

Examples

Suppose we have global data on a coarse grid (the input data does not have to be on a grid):

>>> theta =np.linspace(0., np.pi, 7)

>>> phi =np.linspace(0.,2*np.pi, 9)

>>> data =np.empty((theta.shape[0], phi.shape[0]))

>>> data[:,0], data[0,:], data[-1,:] =0.,0.,0.

>>> data[1:-1,1], data[1:-1,-1]=1.,1.

>>> data[1,1:-1], data[-2,1:-1]=1.,1.

>>> data[2:-2,2], data[2:-2,-2]=2.,2.

>>> data[2,2:-2], data[-3,2:-2]=2.,2.

>>> data[3,3:-2]=3.

>>> data =np.roll(data, 4,1)

We need to set up the interpolator object. Here, we must also specify the coordinates of the knots to use.

>>> lats, lons =np.meshgrid(theta, phi)

>>> knotst, knotsp =theta.copy(), phi.copy()

>>> knotst[0]+= .0001

>>> knotst[-1]-= .0001

>>> knotsp[0]+= .0001

>>> knotsp[-1]-= .0001

>>> from scipy.interpolate import LSQSphereBivariateSpline

>>> lut =LSQSphereBivariateSpline(lats.ravel(), lons.ravel(),

data.T.ravel(),knotst,knotsp)

As a ﬁrst test, we’ll see what the algorithm returns when run on the input coordinates

>>> data_orig =lut(theta, phi)

Finally we interpolate the data to a ﬁner grid

>>> fine_lats =np.linspace(0., np.pi, 70)

>>> fine_lons =np.linspace(0.,2*np.pi, 90)

>>> data_lsq =lut(fine_lats, fine_lons)

>>> fig =plt.figure()

>>> ax1 =fig.add_subplot(131)

>>> ax1.imshow(data, interpolation=’nearest’)

>>> ax2 =fig.add_subplot(132)

>>> ax2.imshow(data_orig, interpolation=’nearest’)

>>> ax3 =fig.add_subplot(133)

>>> ax3.imshow(data_lsq, interpolation=’nearest’)

>>> plt.show()

Methods

__call__(theta, phi) Evaluate the spline at the grid ponts deﬁned by the coordinate

ev(thetai, phii) Evaluate the spline at the points (theta[i], phi[i]),

get_coeffs() Return spline coefﬁcients.

get_knots() Return a tuple (tx,ty) where tx,ty contain knots positions of the spline with respect to x-, y-variable, respectively.

get_residual() Return weighted sum of squared residuals of the spline

LSQSphereBivariateSpline.__call__(theta,phi)

Evaluate the spline at the grid ponts deﬁned by the coordinate arrays theta, phi.

316 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

LSQSphereBivariateSpline.ev(thetai,phii)

Evaluate the spline at the points (theta[i], phi[i]), i=0,...,len(theta)-1

LSQSphereBivariateSpline.get_coeffs()

Return spline coefﬁcients.

LSQSphereBivariateSpline.get_knots()

Return a tuple (tx,ty) where tx,ty contain knots positions of the spline with respect to x-, y-variable,

respectively. The position of interior and additional knots are given as

t[k+1:-k-1] and t[:k+1]=b, t[-k-1:]=e, respectively.

LSQSphereBivariateSpline.get_residual()

Return weighted sum of squared residuals of the spline approximation: sum ((w[i]*(z[i]-

s(x[i],y[i])))**2,axis=0)

Low-level interface to FITPACK functions:

bisplrep(x, y, z[, w, xb, xe, yb, ye, kx, ...]) Find a bivariate B-spline representation of a surface.

bisplev(x, y, tck[, dx, dy]) Evaluate a bivariate B-spline and its derivatives.

scipy.interpolate.bisplrep(x,y,z,w=None,xb=None,xe=None,yb=None,ye=None,kx=3,ky=3,

task=0,s=None,eps=1e-16,tx=None,ty=None,full_output=0,nx-

est=None,nyest=None,quiet=1)

Find a bivariate B-spline representation of a surface.

Given a set of data points (x[i], y[i], z[i]) representing a surface z=f(x,y), compute a B-spline representation of

the surface. Based on the routine SURFIT from FITPACK.

Parameters x, y, z : ndarray

Rank-1 arrays of data points.

w: ndarray, optional

Rank-1 array of weights. By default w=np.ones(len(x)).

xb, xe : ﬂoat, optional

End points of approximation interval in x. By default xb = x.min(),

xe=x.max().

yb, ye : ﬂoat, optional

End points of approximation interval in y. By default yb=y.min(), ye =

y.max().

kx, ky : int, optional

The degrees of the spline (1 <= kx, ky <= 5). Third order (kx=ky=3) is recommended.

task : int, optional

If task=0, ﬁnd knots in x and y and coefﬁcients for a given smoothing factor, s. If

task=1, ﬁnd knots and coefﬁcients for another value of the smoothing factor, s. bis-

plrep must have been previously called with task=0 or task=1. If task=-1, ﬁnd coefﬁ-

cients for a given set of knots tx, ty.

s: ﬂoat, optional

A non-negative smoothing factor. If weights correspond to the inverse of the standard-

deviation of the errors in z, then a good s-value should be found in the range

(m-sqrt(2*m),m+sqrt(2*m)) where m=len(x).

eps : ﬂoat, optional

A threshold for determining the effective rank of an over-determined linear system of

equations (0 < eps < 1). eps is not likely to need changing.

tx, ty : ndarray, optional

Rank-1 arrays of the knots of the spline for task=-1

full_output : int, optional

Non-zero to return optional outputs.

5.7. Interpolation (scipy.interpolate) 317

SciPy Reference Guide, Release 0.13.0

nxest, nyest : int, optional

Over-estimates of the total number of knots. If None

then nxest = max(kx+sqrt(m/2),2*kx+3),nyest =

max(ky+sqrt(m/2),2*ky+3).

quiet : int, optional

Non-zero to suppress printing of messages.

Returns tck : array_like

A list [tx, ty, c, kx, ky] containing the knots (tx, ty) and coefﬁcients (c) of the bivariate

B-spline representation of the surface along with the degree of the spline.

fp : ndarray

The weighted sum of squared residuals of the spline approximation.

ier : int

An integer ﬂag about splrep success. Success is indicated if ier<=0. If ier in [1,2,3]

an error occurred but was not raised. Otherwise an error is raised.

msg : str

A message corresponding to the integer ﬂag, ier.

See Also

splprep,splrep,splint,sproot,splev,UnivariateSpline,BivariateSpline

Notes

See bisplev to evaluate the value of the B-spline given its tck representation.

References

[R38],[R39],[R40]

scipy.interpolate.bisplev(x,y,tck,dx=0,dy=0)

Evaluate a bivariate B-spline and its derivatives.

Return a rank-2 array of spline function values (or spline derivative values) at points given by the cross-product

of the rank-1 arrays xand y. In special cases, return an array or just a ﬂoat if either xor yor both are ﬂoats.

Based on BISPEV from FITPACK.

Parameters x, y : ndarray

Rank-1 arrays specifying the domain over which to evaluate the spline or its derivative.

tck : tuple

A sequence of length 5 returned by bisplrep containing the knot locations, the

coefﬁcients, and the degree of the spline: [tx, ty, c, kx, ky].

dx, dy : int, optional

The orders of the partial derivatives in xand yrespectively.

Returns vals : ndarray

The B-spline or its derivative evaluated over the set formed by the cross-product of x

and y.

See Also

splprep,splrep,splint,sproot,splev,UnivariateSpline,BivariateSpline

Notes

See bisplrep to generate the tck representation.

References

[R35],[R36],[R37]

318 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

5.7.5 Additional tools

lagrange(x, w) Return a Lagrange interpolating polynomial.

approximate_taylor_polynomial(f, x, degree, ...) Estimate the Taylor polynomial of f at x by polynomial ﬁtting.

scipy.interpolate.lagrange(x,w)

Return a Lagrange interpolating polynomial.

Given two 1-D arrays xand w, returns the Lagrange interpolating polynomial through the points (x, w).

Warning: This implementation is numerically unstable. Do not expect to be able to use more than about 20

points even if they are chosen optimally.

Parameters x: array_like

xrepresents the x-coordinates of a set of datapoints.

w: array_like

wrepresents the y-coordinates of a set of datapoints, i.e. f(x).

Returns lagrange : numpy.poly1d instance

The Lagrange interpolating polynomial.

scipy.interpolate.approximate_taylor_polynomial(f,x,degree,scale,order=None)

Estimate the Taylor polynomial of f at x by polynomial ﬁtting.

Parameters f: callable

The function whose Taylor polynomial is sought. Should accept a vector of xvalues.

x: scalar

The point at which the polynomial is to be evaluated.

degree : int

The degree of the Taylor polynomial

scale : scalar

The width of the interval to use to evaluate the Taylor polynomial. Function values

spread over a range this wide are used to ﬁt the polynomial. Must be chosen carefully.

order : int or None, optional

The order of the polynomial to be used in the ﬁtting; fwill be evaluated order+1

times. If None, use degree.

Returns p: poly1d instance

The Taylor polynomial (translated to the origin, so that for example p(0)=f(x)).

Notes

The appropriate choice of “scale” is a trade-off; too large and the function differs from its Taylor polynomial too

much to get a good answer, too small and round-off errors overwhelm the higher-order terms. The algorithm

used becomes numerically unstable around order 30 even under ideal circumstances.

Choosing order somewhat larger than degree may improve the higher-order terms.

See Also

scipy.ndimage.map_coordinates,scipy.ndimage.spline_filter,

scipy.signal.resample,scipy.signal.bspline,scipy.signal.gauss_spline,

scipy.signal.qspline1d,scipy.signal.cspline1d,scipy.signal.qspline1d_eval,

scipy.signal.cspline1d_eval,scipy.signal.qspline2d,scipy.signal.cspline2d.

5.7. Interpolation (scipy.interpolate) 319

SciPy Reference Guide, Release 0.13.0

5.8 Input and output (scipy.io)

SciPy has many modules, classes, and functions available to read data from and write data to a variety of ﬁle formats.

See Also

numpy-reference.routines.io (in Numpy)

5.8.1 MATLAB® ﬁles

loadmat(ﬁle_name[, mdict, appendmat]) Load MATLAB ﬁle

savemat(ﬁle_name, mdict[, appendmat, ...]) Save a dictionary of names and arrays into a MATLAB-style .mat ﬁle.

whosmat(ﬁle_name[, appendmat]) List variables inside a MATLAB ﬁle

scipy.io.loadmat(ﬁle_name,mdict=None,appendmat=True,**kwargs)

Load MATLAB ﬁle

Parameters ﬁle_name : str

Name of the mat ﬁle (do not need .mat extension if appendmat==True) Can also pass

open ﬁle-like object.

m_dict : dict, optional

Dictionary in which to insert matﬁle variables.

appendmat : bool, optional

True to append the .mat extension to the end of the given ﬁlename, if not already

present.

byte_order : str or None, optional

None by default, implying byte order guessed from mat ﬁle. Otherwise can be one of

(‘native’, ‘=’, ‘little’, ‘<’, ‘BIG’, ‘>’).

mat_dtype : bool, optional

If True, return arrays in same dtype as would be loaded into MATLAB (instead of the

dtype with which they are saved).

squeeze_me : bool, optional

Whether to squeeze unit matrix dimensions or not.

chars_as_strings : bool, optional

Whether to convert char arrays to string arrays.

matlab_compatible : bool, optional

Returns matrices as would be loaded by MATLAB (implies squeeze_me=False,

chars_as_strings=False, mat_dtype=True, struct_as_record=True).

struct_as_record : bool, optional

Whether to load MATLAB structs as numpy record arrays, or as old-style numpy

arrays with dtype=object. Setting this ﬂag to False replicates the behavior of scipy

version 0.7.x (returning numpy object arrays). The default setting is True, because it

allows easier round-trip load and save of MATLAB ﬁles.

variable_names : None or sequence

If None (the default) - read all variables in ﬁle. Otherwise variable_names should be

a sequence of strings, giving names of the matlab variables to read from the ﬁle. The

reader will skip any variable with a name not in this sequence, possibly saving some

read processing.

Returns mat_dict : dict

dictionary with variable names as keys, and loaded matrices as values

320 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Notes

v4 (Level 1.0), v6 and v7 to 7.2 matﬁles are supported.

You will need an HDF5 python library to read matlab 7.3 format mat ﬁles. Because scipy does not supply one,

we do not implement the HDF5 / 7.3 interface here.

scipy.io.savemat(ﬁle_name,mdict,appendmat=True,format=‘5’,long_ﬁeld_names=False,

do_compression=False,oned_as=’row’)

Save a dictionary of names and arrays into a MATLAB-style .mat ﬁle.

This saves the array objects in the given dictionary to a MATLAB- style .mat ﬁle.

Parameters ﬁle_name : str or ﬁle-like object

Name of the .mat ﬁle (.mat extension not needed if appendmat == True). Can

also pass open ﬁle_like object.

mdict : dict

Dictionary from which to save matﬁle variables.

appendmat : bool, optional

True (the default) to append the .mat extension to the end of the given ﬁlename, if not

already present.

format : {‘5’, ‘4’}, string, optional

‘5’ (the default) for MATLAB 5 and up (to 7.2), ‘4’ for MATLAB 4 .mat ﬁles

long_ﬁeld_names : bool, optional

False (the default) - maximum ﬁeld name length in a structure is 31 characters which

is the documented maximum length. True - maximum ﬁeld name length in a structure

is 63 characters which works for MATLAB 7.6+

do_compression : bool, optional

Whether or not to compress matrices on write. Default is False.

oned_as : {‘row’, ‘column’}, optional

If ‘column’, write 1-D numpy arrays as column vectors. If ‘row’, write 1-D numpy

arrays as row vectors.

See Also

mio4.MatFile4Writer,mio5.MatFile5Writer

scipy.io.whosmat(ﬁle_name,appendmat=True,**kwargs)

List variables inside a MATLAB ﬁle New in version 0.12.0.

Parameters ﬁle_name : str

Name of the mat ﬁle (do not need .mat extension if appendmat==True) Can also pass

open ﬁle-like object.

appendmat : bool, optional

True to append the .mat extension to the end of the given ﬁlename, if not already

present.

byte_order : str or None, optional

None by default, implying byte order guessed from mat ﬁle. Otherwise can be one of

(‘native’, ‘=’, ‘little’, ‘<’, ‘BIG’, ‘>’).

mat_dtype : bool, optional

If True, return arrays in same dtype as would be loaded into MATLAB (instead of the

dtype with which they are saved).

squeeze_me : bool, optional

Whether to squeeze unit matrix dimensions or not.

chars_as_strings : bool, optional

Whether to convert char arrays to string arrays.

matlab_compatible : bool, optional

5.8. Input and output (scipy.io) 321

SciPy Reference Guide, Release 0.13.0

Returns matrices as would be loaded by MATLAB (implies squeeze_me=False,

chars_as_strings=False, mat_dtype=True, struct_as_record=True).

struct_as_record : bool, optional

Whether to load MATLAB structs as numpy record arrays, or as old-style numpy

arrays with dtype=object. Setting this ﬂag to False replicates the behavior of scipy

version 0.7.x (returning numpy object arrays). The default setting is True, because it

allows easier round-trip load and save of MATLAB ﬁles.

Returns variables : list of tuples

A list of tuples, where each tuple holds the matrix name (a string), its shape (tuple of

ints), and its data class (a string). Possible data classes are: int8, uint8, int16, uint16,

int32, uint32, int64, uint64, single, double, cell, struct, object, char, sparse, function,

opaque, logical, unknown.

Notes

v4 (Level 1.0), v6 and v7 to 7.2 matﬁles are supported.

You will need an HDF5 python library to read matlab 7.3 format mat ﬁles. Because scipy does not supply one,

we do not implement the HDF5 / 7.3 interface here.

5.8.2 IDL® ﬁles

readsav(ﬁle_name[, idict, python_dict, ...]) Read an IDL .sav ﬁle

scipy.io.readsav(ﬁle_name,idict=None,python_dict=False,uncompressed_ﬁle_name=None,ver-

bose=False)

Read an IDL .sav ﬁle

Parameters ﬁle_name : str

Name of the IDL save ﬁle.

idict : dict, optional

Dictionary in which to insert .sav ﬁle variables

python_dict : bool, optional

By default, the object return is not a Python dictionary, but a case-insensitive dic-

tionary with item, attribute, and call access to variables. To get a standard Python

dictionary, set this option to True.

uncompressed_ﬁle_name : str, optional

This option only has an effect for .sav ﬁles written with the /compress option. If a

ﬁle name is speciﬁed, compressed .sav ﬁles are uncompressed to this ﬁle. Otherwise,

readsav will use the tempfile module to determine a temporary ﬁlename automat-

ically, and will remove the temporary ﬁle upon successfully reading it in.

verbose : bool, optional

Whether to print out information about the save ﬁle, including the records read, and

available variables.

Returns idl_dict : AttrDict or dict

If python_dict is set to False (default), this function returns a case-insensitive dictio-

nary with item, attribute, and call access to variables. If python_dict is set to True,

this function returns a Python dictionary with all variable names in lowercase. If idict

was speciﬁed, then variables are written to the dictionary speciﬁed, and the updated

dictionary is returned.

5.8.3 Matrix Market ﬁles

322 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

mminfo(source) Queries the contents of the Matrix Market ﬁle ‘ﬁlename’ to

mmread(source) Reads the contents of a Matrix Market ﬁle ‘ﬁlename’ into a matrix.

mmwrite(target, a[, comment, ﬁeld, precision]) Writes the sparse or dense matrix A to a Matrix Market formatted ﬁle.

scipy.io.mminfo(source)

Queries the contents of the Matrix Market ﬁle ‘ﬁlename’ to extract size and storage information.

Parameters source : ﬁle

Matrix Market ﬁlename (extension .mtx) or open ﬁle object

Returns rows,cols : int

Number of matrix rows and columns

entries : int

Number of non-zero entries of a sparse matrix or rows*cols for a dense matrix

format : str

Either ‘coordinate’ or ‘array’.

ﬁeld : str

Either ‘real’, ‘complex’, ‘pattern’, or ‘integer’.

symm : str

Either ‘general’, ‘symmetric’, ‘skew-symmetric’, or ‘hermitian’.

scipy.io.mmread(source)

Reads the contents of a Matrix Market ﬁle ‘ﬁlename’ into a matrix.

Parameters source : ﬁle

Matrix Market ﬁlename (extensions .mtx, .mtz.gz) or open ﬁle object.

Returns a:

Sparse or full matrix

scipy.io.mmwrite(target,a,comment=’‘,ﬁeld=None,precision=None)

Writes the sparse or dense matrix A to a Matrix Market formatted ﬁle.

Parameters target : ﬁle

Matrix Market ﬁlename (extension .mtx) or open ﬁle object

a: array like

Sparse or full matrix

comment : str, optional

comments to be prepended to the Matrix Market ﬁle

ﬁeld : str, optional

Either ‘real’, ‘complex’, ‘pattern’, or ‘integer’.

precision : int, optional

Number of digits to display for real or complex values.

5.8.4 Unformatted Fortran ﬁles

FortranFile(ﬁlename[, mode, header_dtype]) A ﬁle object for unformatted sequential ﬁles from Fortran code.

class scipy.io.FortranFile(ﬁlename,mode=’r’,header_dtype=<type ‘numpy.uint32’>)

A ﬁle object for unformatted sequential ﬁles from Fortran code.

Parameters ﬁlename: ﬁle or str

Open ﬁle object or ﬁlename.

mode : {‘r’, ‘w’}, optional

Read-write mode, default is ‘r’.

5.8. Input and output (scipy.io) 323

SciPy Reference Guide, Release 0.13.0

header_dtype : data-type

Data type of the header. Size and endiness must match the input/output ﬁle.

Notes

These ﬁles are broken up into records of unspeciﬁed types. The size of each record is given at the start (although

the size of this header is not standard) and the data is written onto disk without any formatting. Fortran compilers

supporting the BACKSPACE statement will write a second copy of the size to facilitate backwards seeking.

This class only supports ﬁles written with both sizes for the record. It also does not support the subrecords used

in Intel and gfortran compilers for records which are greater than 2GB with a 4-byte header.

An example of an unformatted sequential ﬁle in Fortran would be written as:

OPEN(1, FILE=myfilename, FORM=’unformatted’)

WRITE(1) myvariable

Since this is a non-standard ﬁle format, whose contents depend on the compiler and the endianness of the

machine, caution is advised. Files from gfortran 4.8.0 and gfortran 4.1.2 on x86_64 are known to work.

Consider using Fortran direct-access ﬁles or ﬁles from the newer Stream I/O, which can be easily read by

numpy.fromfile.

Examples

To create an unformatted sequential Fortran ﬁle:

>>> from scipy.io import FortranFile

>>> f=FortranFile(’test.unf’,’w’)

>>> f.write_record(np.array([1,2,3,4,5],dtype=np.int32))

>>> f.write_record(np.linspace(0,1,20).reshape((5,-1)))

>>> f.close()

To read this ﬁle:

>>> from scipy.io import FortranFile

>>> f=FortranFile(’test.unf’,’r’)

>>> print(f.read_ints(dtype=np.int32))

[12345]

>>> print(f.read_reals(dtype=np.float).reshape((5,-1)))

[[ 0. 0.05263158 0.10526316 0.15789474]

[ 0.21052632 0.26315789 0.31578947 0.36842105]

[ 0.42105263 0.47368421 0.52631579 0.57894737]

[ 0.63157895 0.68421053 0.73684211 0.78947368]

[ 0.84210526 0.89473684 0.94736842 1. ]]

>>> f.close()

Methods

close() Closes the ﬁle.

read_ints([dtype]) Reads a record of a given type from the ﬁle, defaulting to an integer

read_reals([dtype]) Reads a record of a given type from the ﬁle, defaulting to a ﬂoating

read_record([dtype]) Reads a record of a given type from the ﬁle.

write_record(s) Write a record (including sizes) to the ﬁle.

FortranFile.close()

Closes the ﬁle. It is unsupported to call any other methods off this object after closing it. Note that this

324 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

class supports the ‘with’ statement in modern versions of Python, to call this automatically

FortranFile.read_ints(dtype=’i4’)

Reads a record of a given type from the ﬁle, defaulting to an integer type (INTEGER*4 in Fortran)

Parameters dtype : data-type

Data type specifying the size and endiness of the data.

Returns data : ndarray

A one-dimensional array object.

See Also

read_reals,read_record

FortranFile.read_reals(dtype=’f8’)

Reads a record of a given type from the ﬁle, defaulting to a ﬂoating point number (real*8 in Fortran)

Parameters dtype : data-type

Data type specifying the size and endiness of the data.

Returns data : ndarray

A one-dimensional array object.

See Also

read_ints,read_record

FortranFile.read_record(dtype=None)

Reads a record of a given type from the ﬁle.

Parameters dtype : data-type

Data type specifying the size and endiness of the data.

Returns data : ndarray

A one-dimensional array object.

See Also

read_reals,read_ints

Notes

If the record contains a multi-dimensional array, calling reshape or resize will restructure the array to the

correct size. Since Fortran multidimensional arrays are stored in column-major format, this may have

some non-intuitive consequences. If the variable was declared as ‘INTEGER var(5,4)’, for example, var

could be read with ‘read_record(dtype=np.integer).reshape( (4,5) )’ since Python uses row-major ordering

of indices.

One can transpose to obtain the indices in the same order as in Fortran.

FortranFile.write_record(s)

Write a record (including sizes) to the ﬁle.

Parameters s: array_like

The data to write.

5.8.5 Wav sound ﬁles (scipy.io.wavfile)

read(ﬁlename[, mmap]) Return the sample rate (in samples/sec) and data from a WAV ﬁle

write(ﬁlename, rate, data) Write a numpy array as a WAV ﬁle

5.8. Input and output (scipy.io) 325

SciPy Reference Guide, Release 0.13.0

scipy.io.wavfile.read(ﬁlename,mmap=False)

Return the sample rate (in samples/sec) and data from a WAV ﬁle

Parameters ﬁlename : string or open ﬁle handle

Input wav ﬁle.

mmap : bool, optional

Whether to read data as memory mapped. Only to be used on real ﬁles (Default: False)

New in version 0.12.0.

Returns rate : int

Sample rate of wav ﬁle

data : numpy array

Data read from wav ﬁle

Notes

•The ﬁle can be an open ﬁle or a ﬁlename.

•The returned sample rate is a Python integer

•The data is returned as a numpy array with a data-type determined from the ﬁle.

scipy.io.wavfile.write(ﬁlename,rate,data)

Write a numpy array as a WAV ﬁle

Parameters ﬁlename : string or open ﬁle handle

Output wav ﬁle

rate : int

The sample rate (in samples/sec).

data : ndarray

A 1-D or 2-D numpy array of either integer or ﬂoat data-type.

Notes

•The ﬁle can be an open ﬁle or a ﬁlename.

•Writes a simple uncompressed WAV ﬁle.

•The bits-per-sample will be determined by the data-type.

•To write multiple-channels, use a 2-D array of shape (Nsamples, Nchannels).

5.8.6 Arff ﬁles (scipy.io.arff)

loadarff(f) Read an arff ﬁle.

scipy.io.arff.loadarff(f)

Read an arff ﬁle.

The data is returned as a record array, which can be accessed much like a dictionary of numpy arrays. For

example, if one of the attributes is called ‘pressure’, then its ﬁrst 10 data points can be accessed from the data

record array like so: data[’pressure’][0:10]

Parameters f: ﬁle-like or str

File-like object to read from, or ﬁlename to open.

Returns data : record array

The data of the arff ﬁle, accessible by attribute names.

meta :MetaData

326 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Contains information about the arff ﬁle such as name and type of attributes, the rela-

tion (name of the dataset), etc...

Raises ‘ParseArffError‘

This is raised if the given ﬁle is not ARFF-formatted.

NotImplementedError

The ARFF ﬁle has an attribute which is not supported yet.

Notes

This function should be able to read most arff ﬁles. Not implemented functionality include:

•date type attributes

•string type attributes

It can read ﬁles with numeric and nominal attributes. It cannot read ﬁles with sparse data ({} in the ﬁle).

However, this function can read ﬁles with missing data (? in the ﬁle), representing the data points as NaNs.

5.8.7 Netcdf (scipy.io.netcdf)

netcdf_file(ﬁlename[, mode, mmap, version]) A ﬁle object for NetCDF data.

netcdf_variable(data, typecode, size, shape, ...) A data object for the netcdf module.

class scipy.io.netcdf.netcdf_file(ﬁlename,mode=’r’,mmap=None,version=1)

A ﬁle object for NetCDF data.

Anetcdf_file object has two standard attributes: dimensions and variables. The values of both are dic-

tionaries, mapping dimension names to their associated lengths and variable names to variables, respectively.

Application programs should never modify these dictionaries.

All other attributes correspond to global attributes deﬁned in the NetCDF ﬁle. Global ﬁle attributes are created

by assigning to an attribute of the netcdf_file object.

Parameters ﬁlename : string or ﬁle-like

string -> ﬁlename

mode : {‘r’, ‘w’}, optional

read-write mode, default is ‘r’

mmap : None or bool, optional

Whether to mmap ﬁlename when reading. Default is True when ﬁlename is a ﬁle

name, False when ﬁlename is a ﬁle-like object

version : {1, 2}, optional

version of netcdf to read / write, where 1 means Classic format and 2 means 64-bit

offset format. Default is 1. See here for more info.

Notes

The major advantage of this module over other modules is that it doesn’t require the code to be linked to the

NetCDF libraries. This module is derived from pupynere.

NetCDF ﬁles are a self-describing binary data format. The ﬁle contains metadata that describes the dimensions

and variables in the ﬁle. More details about NetCDF ﬁles can be found here. There are three main sections to a

NetCDF data structure:

1.Dimensions

2.Variables

3.Attributes

5.8. Input and output (scipy.io) 327

SciPy Reference Guide, Release 0.13.0

The dimensions section records the name and length of each dimension used by the variables. The variables

would then indicate which dimensions it uses and any attributes such as data units, along with containing the

data values for the variable. It is good practice to include a variable that is the same name as a dimension to

provide the values for that axes. Lastly, the attributes section would contain additional information such as the

name of the ﬁle creator or the instrument used to collect the data.

When writing data to a NetCDF ﬁle, there is often the need to indicate the ‘record dimension’. A record

dimension is the unbounded dimension for a variable. For example, a temperature variable may have dimensions

of latitude, longitude and time. If one wants to add more temperature data to the NetCDF ﬁle as time progresses,

then the temperature variable should have the time dimension ﬂagged as the record dimension.

In addition, the NetCDF ﬁle header contains the position of the data in the ﬁle, so access can be done in an

efﬁcient manner without loading unnecessary data into memory. It uses the mmap module to create Numpy

arrays mapped to the data on disk, for the same purpose.

Examples

To create a NetCDF ﬁle:

>>> from scipy.io import netcdf

>>> f=netcdf.netcdf_file(’simple.nc’,’w’)

>>> f.history =’Created for a test’

>>> f.createDimension(’time’,10)

>>> time =f.createVariable(’time’,’i’, (’time’,))

>>> time[:] =np.arange(10)

>>> time.units =’days since 2008-01-01’

>>> f.close()

Note the assignment of range(10) to time[:]. Exposing the slice of the time variable allows for the data

to be set in the object, rather than letting range(10) overwrite the time variable.

To read the NetCDF ﬁle we just created:

>>> from scipy.io import netcdf

>>> f=netcdf.netcdf_file(’simple.nc’,’r’)

>>> print(f.history)

Created for a test

>>> time =f.variables[’time’]

>>> print(time.units)

days since 2008-01-01

>>> print(time.shape)

(10,)

>>> print(time[-1])

>>> f.close()

A NetCDF ﬁle can also be used as context manager:

>>> from scipy.io import netcdf

>>> with netcdf.netcdf_file(’simple.nc’,’r’)as f:

>>> print(f.history)

Created for a test

Methods

close() Closes the NetCDF ﬁle.

createDimension(name, length) Adds a dimension to the Dimension section of the NetCDF data structure.

createVariable(name, type, dimensions) Create an empty variable for the netcdf_file object, specifying its data type and the dimensions it uses.

Continued on next page

328 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Table 5.62 – continued from previous page

flush() Perform a sync-to-disk ﬂush if the netcdf_file object is in write mode.

sync() Perform a sync-to-disk ﬂush if the netcdf_file object is in write mode.

netcdf_file.close()

Closes the NetCDF ﬁle.

netcdf_file.createDimension(name,length)

Adds a dimension to the Dimension section of the NetCDF data structure.

Note that this function merely adds a new dimension that the variables can reference. The values for

the dimension, if desired, should be added as a variable using createVariable, referring to this

dimension.

Parameters name : str

Name of the dimension (Eg, ‘lat’ or ‘time’).

length : int

Length of the dimension.

See Also

createVariable

netcdf_file.createVariable(name,type,dimensions)

Create an empty variable for the netcdf_file object, specifying its data type and the dimensions it

uses.

Parameters name : str

Name of the new variable.

type : dtype or str

Data type of the variable.

dimensions : sequence of str

List of the dimension names used by the variable, in the desired order.

Returns variable : netcdf_variable

The newly created netcdf_variable object. This object has also been added

to the netcdf_file object as well.

See Also

createDimension

Notes

Any dimensions to be used by the variable should already exist in the NetCDF data structure or should be

created by createDimension prior to creating the NetCDF variable.

netcdf_file.flush()

Perform a sync-to-disk ﬂush if the netcdf_file object is in write mode.

See Also

sync Identical function

netcdf_file.sync()

Perform a sync-to-disk ﬂush if the netcdf_file object is in write mode.

5.8. Input and output (scipy.io) 329

SciPy Reference Guide, Release 0.13.0

See Also

sync Identical function

class scipy.io.netcdf.netcdf_variable(data,typecode,size,shape,dimensions,at-

tributes=None)

A data object for the netcdf module.

netcdf_variable objects are constructed by calling the method netcdf_file.createVariable on

the netcdf_file object. netcdf_variable objects behave much like array objects deﬁned in numpy,

except that their data resides in a ﬁle. Data is read by indexing and written by assigning to an indexed sub-

set; the entire array can be accessed by the index [:] or (for scalars) by using the methods getValue and

assignValue.netcdf_variable objects also have attribute shape with the same meaning as for arrays,

but the shape cannot be modiﬁed. There is another read-only attribute dimensions, whose value is the tuple of

dimension names.

All other attributes correspond to variable attributes deﬁned in the NetCDF ﬁle. Variable attributes are created

by assigning to an attribute of the netcdf_variable object.

Parameters data : array_like

The data array that holds the values for the variable. Typically, this is initialized as

empty, but with the proper shape.

typecode : dtype character code

Desired data-type for the data array.

size : int

Desired element size for the data array.

shape : sequence of ints

The shape of the array. This should match the lengths of the variable’s dimensions.

dimensions : sequence of strings

The names of the dimensions used by the variable. Must be in the same order of the

dimension lengths given by shape.

attributes : dict, optional

Attribute values (any type) keyed by string names. These attributes become attributes

for the netcdf_variable object.

See Also

isrec,shape

Attributes

dimensions (list of str) List of names of dimensions used by the variable object.

isrec, shape Properties

Methods

assignValue(value) Assign a scalar value to a netcdf_variable of length one.

getValue() Retrieve a scalar value from a netcdf_variable of length one.

itemsize() Return the itemsize of the variable.

typecode() Return the typecode of the variable.

netcdf_variable.assignValue(value)

Assign a scalar value to a netcdf_variable of length one.

Parameters value : scalar

Scalar value (of compatible type) to assign to a length-one netcdf variable. This

value will be written to ﬁle.

330 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Raises ValueError

If the input is not a scalar, or if the destination is not a length-one netcdf variable.

netcdf_variable.getValue()

Retrieve a scalar value from a netcdf_variable of length one.

Raises ValueError

If the netcdf variable is an array of length greater than one, this exception will be

raised.

netcdf_variable.itemsize()

Return the itemsize of the variable.

Returns itemsize : int

The element size of the variable (eg, 8 for ﬂoat64).

netcdf_variable.typecode()

Return the typecode of the variable.

Returns typecode : char

The character typecode of the variable (eg, ‘i’ for int).

5.9 Linear algebra (scipy.linalg)

Linear algebra functions.

See Also

numpy.linalg for more linear algebra functions. Note that although scipy.linalg imports most of them,

identically named functions from scipy.linalg may offer more or slightly differing functionality.

5.9.1 Basics

inv(a[, overwrite_a, check_ﬁnite]) Compute the inverse of a matrix.

solve(a, b[, sym_pos, lower, overwrite_a, ...]) Solve the equation ax=bfor x.

solve_banded(l_and_u, ab, b[, overwrite_ab, ...]) Solve the equation a x = b for x, assuming a is banded matrix.

solveh_banded(ab, b[, overwrite_ab, ...]) Solve equation a x = b.

solve_triangular(a, b[, trans, lower, ...]) Solve the equation ax=bfor x, assuming a is a triangular matrix.

det(a[, overwrite_a, check_ﬁnite]) Compute the determinant of a matrix

norm(a[, ord]) Matrix or vector norm.

lstsq(a, b[, cond, overwrite_a, ...]) Compute least-squares solution to equation Ax = b.

pinv(a[, cond, rcond, return_rank, check_ﬁnite]) Compute the (Moore-Penrose) pseudo-inverse of a matrix.

pinv2(a[, cond, rcond, return_rank, ...]) Compute the (Moore-Penrose) pseudo-inverse of a matrix.

pinvh(a[, cond, rcond, lower, return_rank, ...]) Compute the (Moore-Penrose) pseudo-inverse of a Hermitian matrix.

kron(a, b) Kronecker product.

tril(m[, k]) Make a copy of a matrix with elements above the k-th diagonal zeroed.

triu(m[, k]) Make a copy of a matrix with elements below the k-th diagonal zeroed.

scipy.linalg.inv(a,overwrite_a=False,check_ﬁnite=True)

Compute the inverse of a matrix.

Parameters a: array_like

Square matrix to be inverted.

overwrite_a : bool, optional

5.9. Linear algebra (scipy.linalg) 331

SciPy Reference Guide, Release 0.13.0

Discard data in a(may improve performance). Default is False.

check_ﬁnite : boolean, optional

Whether to check that the input matrix contains only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns ainv : ndarray

Inverse of the matrix a.

Raises LinAlgError :

If ais singular.

ValueError :

If ais not square, or not 2-dimensional.

Examples

>>> a=np.array([[1.,2.], [3.,4.]])

>>> sp.linalg.inv(a)

array([[-2. , 1. ],

[ 1.5, -0.5]])

>>> np.dot(a, sp.linalg.inv(a))

array([[ 1., 0.],

[ 0., 1.]])

scipy.linalg.solve(a,b,sym_pos=False,lower=False,overwrite_a=False,overwrite_b=False,de-

bug=False,check_ﬁnite=True)

Solve the equation ax=bfor x.

Parameters a: (M, M) array_like

A square matrix.

b: (M,) or (M, N) array_like

Right-hand side matrix in ax=b.

sym_pos : bool

Assume ais symmetric and positive deﬁnite.

lower : boolean

Use only data contained in the lower triangle of a, if sym_pos is true. Default is to use

upper triangle.

overwrite_a : bool

Allow overwriting data in a(may enhance performance). Default is False.

overwrite_b : bool

Allow overwriting data in b(may enhance performance). Default is False.

check_ﬁnite : boolean, optional

Whether to check that the input matrices contain only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns x: (M,) or (M, N) ndarray

Solution to the system ax=b. Shape of the return matches the shape of b.

Raises LinAlgError

If ais singular.

Examples

Given aand b, solve for x:

>>> a=np.array([[3,2,0],[1,-1,0],[0,5,1]])

>>> b=np.array([2,4,-1])

>>> x=linalg.solve(a,b)

>>> x

array([ 2., -2., 9.])

332 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

>>> np.dot(a, x) == b

array([ True, True, True], dtype=bool)

scipy.linalg.solve_banded(l_and_u,ab,b,overwrite_ab=False,overwrite_b=False,debug=False,

check_ﬁnite=True)

Solve the equation a x = b for x, assuming a is banded matrix.

The matrix a is stored in ab using the matrix diagonal ordered form:

ab[u +i-j, j] == a[i,j]

Example of ab (shape of a is (6,6), u=1, l=2):

*a01 a12 a23 a34 a45

a00 a11 a22 a33 a44 a55

a10 a21 a32 a43 a54 *

a20 a31 a42 a53 * *

Parameters (l, u) : (integer, integer)

Number of non-zero lower and upper diagonals

ab : (l+u+ 1, M) array_like

Banded matrix

b: (M,) or (M, K) array_like

Right-hand side

overwrite_ab : boolean, optional

Discard data in ab (may enhance performance)

overwrite_b : boolean, optional

Discard data in b(may enhance performance)

check_ﬁnite : boolean, optional

Whether to check that the input matrices contain only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns x: (M,) or (M, K) ndarray

The solution to the system a x = b. Returned shape depends on the shape of b.

scipy.linalg.solveh_banded(ab,b,overwrite_ab=False,overwrite_b=False,lower=False,

check_ﬁnite=True)

Solve equation a x = b. a is Hermitian positive-deﬁnite banded matrix.

The matrix a is stored in ab either in lower diagonal or upper diagonal ordered form:

ab[u + i - j, j] == a[i,j] (if upper form; i <= j) ab[ i - j, j] == a[i,j] (if lower form; i >= j)

Example of ab (shape of a is (6,6), u=2):

upper form:

* * a02 a13 a24 a35

*a01 a12 a23 a34 a45

a00 a11 a22 a33 a44 a55

lower form:

a00 a11 a22 a33 a44 a55

a10 a21 a32 a43 a54 *

a20 a31 a42 a53 * *

Cells marked with * are not used.

Parameters ab : (u+ 1, M) array_like

Banded matrix

5.9. Linear algebra (scipy.linalg) 333

SciPy Reference Guide, Release 0.13.0

b: (M,) or (M, K) array_like

Right-hand side

overwrite_ab : bool, optional

Discard data in ab (may enhance performance)

overwrite_b : bool, optional

Discard data in b(may enhance performance)

lower : bool, optional

Is the matrix in the lower form. (Default is upper form)

check_ﬁnite : boolean, optional

Whether to check that the input matrices contain only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns x: (M,) or (M, K) ndarray

The solution to the system a x = b. Shape of return matches shape of b.

scipy.linalg.solve_triangular(a,b,trans=0,lower=False,unit_diagonal=False,over-

write_b=False,debug=False,check_ﬁnite=True)

Solve the equation ax=bfor x, assuming a is a triangular matrix.

Parameters a: (M, M) array_like

A triangular matrix

b: (M,) or (M, N) array_like

Right-hand side matrix in ax=b

lower : boolean

Use only data contained in the lower triangle of a. Default is to use upper triangle.

trans : {0, 1, 2, ‘N’, ‘T’, ‘C’}, optional

Type of system to solve:

trans system

0 or ‘N’ a x = b

1 or ‘T’ a^T x = b

2 or ‘C’ a^H x = b

unit_diagonal : bool, optional

If True, diagonal elements of aare assumed to be 1 and will not be referenced.

overwrite_b : bool, optional

Allow overwriting data in b(may enhance performance)

check_ﬁnite : bool, optional

Whether to check that the input matrices contain only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns x: (M,) or (M, N) ndarray

Solution to the system ax=b. Shape of return matches b.

Raises LinAlgError

If ais singular

Notes

New in version 0.9.0.

scipy.linalg.det(a,overwrite_a=False,check_ﬁnite=True)

Compute the determinant of a matrix

The determinant of a square matrix is a value derived arithmetically from the coefﬁcients of the matrix.

The determinant for a 3x3 matrix, for example, is computed as follows:

334 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

abc

d e f = A

ghi

det(A) = a*e*i+b*f*g+c*d*h-c*e*g-b*d*i-a*f*h

Parameters a: (M, M) array_like

A square matrix.

overwrite_a : bool

Allow overwriting data in a (may enhance performance).

check_ﬁnite : boolean, optional

Whether to check that the input matrix contains only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns det : ﬂoat or complex

Determinant of a.

Notes

The determinant is computed via LU factorization, LAPACK routine z/dgetrf.

Examples

>>> a=np.array([[1,2,3],[4,5,6],[7,8,9]])

>>> linalg.det(a)

0.0

>>> a=np.array([[0,2,3],[4,5,6],[7,8,9]])

>>> linalg.det(a)

3.0

scipy.linalg.norm(a,ord=None)

Matrix or vector norm.

This function is able to return one of seven different matrix norms, or one of an inﬁnite number of vector norms

(described below), depending on the value of the ord parameter.

Parameters x: {(M,), (M, N)} array_like

Input array.

ord : {non-zero int, inf, -inf, ‘fro’}, optional

Order of the norm (see table under Notes). inf means numpy’s inf object.

Returns n: ﬂoat

Norm of the matrix or vector.

Notes

For values of ord <= 0, the result is, strictly speaking, not a mathematical ‘norm’, but it may still be useful

for various numerical purposes.

The following norms can be calculated:

5.9. Linear algebra (scipy.linalg) 335

SciPy Reference Guide, Release 0.13.0

ord norm for matrices norm for vectors

None Frobenius norm 2-norm

‘fro’ Frobenius norm –

inf max(sum(abs(x), axis=1)) max(abs(x))

-inf min(sum(abs(x), axis=1)) min(abs(x))

0 – sum(x != 0)

1 max(sum(abs(x), axis=0)) as below

-1 min(sum(abs(x), axis=0)) as below

2 2-norm (largest sing. value) as below

-2 smallest singular value as below

other – sum(abs(x)**ord)**(1./ord)

The Frobenius norm is given by [R63]:

||A||F= [Pi,j abs(ai,j )2]1/2

References

[R63]

Examples

>>> from numpy import linalg as LA

>>> a=np.arange(9)-4

>>> a

array([-4, -3, -2, -1, 0, 1, 2, 3, 4])

>>> b=a.reshape((3,3))

>>> b

array([[-4, -3, -2],

[-1, 0, 1],

[ 2, 3, 4]])

>>> LA.norm(a)

7.745966692414834

>>> LA.norm(b)

7.745966692414834

>>> LA.norm(b, ’fro’)

7.745966692414834

>>> LA.norm(a, np.inf)

>>> LA.norm(b, np.inf)

>>> LA.norm(a, -np.inf)

>>> LA.norm(b, -np.inf)

>>> LA.norm(a, 1)

>>> LA.norm(b, 1)

>>> LA.norm(a, -1)

-4.6566128774142013e-010

>>> LA.norm(b, -1)

>>> LA.norm(a, 2)

7.745966692414834

>>> LA.norm(b, 2)

7.3484692283495345

336 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

>>> LA.norm(a, -2)

nan

>>> LA.norm(b, -2)

1.8570331885190563e-016

>>> LA.norm(a, 3)

5.8480354764257312

>>> LA.norm(a, -3)

nan

scipy.linalg.lstsq(a,b,cond=None,overwrite_a=False,overwrite_b=False,check_ﬁnite=True)

Compute least-squares solution to equation Ax = b.

Compute a vector x such that the 2-norm |b - A x| is minimized.

Parameters a: (M, N) array_like

Left hand side matrix (2-D array).

b: (M,) or (M, K) array_like

Right hand side matrix or vector (1-D or 2-D array).

cond : ﬂoat, optional

Cutoff for ‘small’ singular values; used to determine effective rank of a. Singular

values smaller than rcond *largest_singular_value are considered zero.

overwrite_a : bool, optional

Discard data in a(may enhance performance). Default is False.

overwrite_b : bool, optional

Discard data in b(may enhance performance). Default is False.

check_ﬁnite : boolean, optional

Whether to check that the input matrices contain only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns x: (N,) or (N, K) ndarray

Least-squares solution. Return shape matches shape of b.

residues : () or (1,) or (K,) ndarray

Sums of residues, squared 2-norm for each column in b-ax. If rank of matrix a

is < N or > M this is an empty array. If b was 1-D, this is an (1,) shape array, otherwise

the shape is (K,).

rank : int

Effective rank of matrix a.

s: (min(M,N),) ndarray

Singular values of a. The condition number of a is abs(s[0]/s[-1]).

Raises LinAlgError :

If computation does not converge.

See Also

optimize.nnls

linear least squares with non-negativity constraint

scipy.linalg.pinv(a,cond=None,rcond=None,return_rank=False,check_ﬁnite=True)

Compute the (Moore-Penrose) pseudo-inverse of a matrix.

Calculate a generalized inverse of a matrix using a least-squares solver.

Parameters a: (M, N) array_like

Matrix to be pseudo-inverted.

cond, rcond : ﬂoat, optional

Cutoff for ‘small’ singular values in the least-squares solver. Singular values smaller

than rcond *largest_singular_value are considered zero.

5.9. Linear algebra (scipy.linalg) 337

SciPy Reference Guide, Release 0.13.0

return_rank : bool, optional

if True, return the effective rank of the matrix

check_ﬁnite : boolean, optional

Whether to check that the input matrix contains only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns B: (N, M) ndarray

The pseudo-inverse of matrix a.

rank : int

The effective rank of the matrix. Returned if return_rank == True

Raises LinAlgError

If computation does not converge.

Examples

>>> a=np.random.randn(9,6)

>>> B=linalg.pinv(a)

>>> np.allclose(a, dot(a, dot(B, a)))

True

>>> np.allclose(B, dot(B, dot(a, B)))

True

scipy.linalg.pinv2(a,cond=None,rcond=None,return_rank=False,check_ﬁnite=True)

Compute the (Moore-Penrose) pseudo-inverse of a matrix.

Calculate a generalized inverse of a matrix using its singular-value decomposition and including all ‘large’

singular values.

Parameters a: (M, N) array_like

Matrix to be pseudo-inverted.

cond, rcond : ﬂoat or None

Cutoff for ‘small’ singular values. Singular values smaller than

rcond*largest_singular_value are considered zero. If None or -1,

suitable machine precision is used.

return_rank : bool, optional

if True, return the effective rank of the matrix

check_ﬁnite : boolean, optional

Whether to check that the input matrix contains only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns B: (N, M) ndarray

The pseudo-inverse of matrix a.

rank : int

The effective rank of the matrix. Returned if return_rank == True

Raises LinAlgError

If SVD computation does not converge.

Examples

>>> a=np.random.randn(9,6)

>>> B=linalg.pinv2(a)

>>> np.allclose(a, dot(a, dot(B, a)))

True

>>> np.allclose(B, dot(B, dot(a, B)))

True

338 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.linalg.pinvh(a,cond=None,rcond=None,lower=True,return_rank=False,check_ﬁnite=True)

Compute the (Moore-Penrose) pseudo-inverse of a Hermitian matrix.

Calculate a generalized inverse of a Hermitian or real symmetric matrix using its eigenvalue decomposition and

including all eigenvalues with ‘large’ absolute value.

Parameters a: (N, N) array_like

Real symmetric or complex hermetian matrix to be pseudo-inverted

cond, rcond : ﬂoat or None

Cutoff for ‘small’ eigenvalues. Singular values smaller than rcond *

largest_eigenvalue are considered zero.

If None or -1, suitable machine precision is used.

lower : bool

Whether the pertinent array data is taken from the lower or upper triangle of a. (De-

fault: lower)

return_rank : bool, optional

if True, return the effective rank of the matrix

check_ﬁnite : boolean, optional

Whether to check that the input matrix contains only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns B: (N, N) ndarray

The pseudo-inverse of matrix a.

rank : int

The effective rank of the matrix. Returned if return_rank == True

Raises LinAlgError

If eigenvalue does not converge

Examples

>>> from numpy import *

>>> a=random.randn(9,6)

>>> a=np.dot(a, a.T)

>>> B=pinvh(a)

>>> allclose(a, dot(a, dot(B, a)))

True

>>> allclose(B, dot(B, dot(a, B)))

True

scipy.linalg.kron(a,b)

Kronecker product.

The result is the block matrix:

a[0,0]*b a[0,1]*b ... a[0,-1]*b

a[1,0]*b a[1,1]*b ... a[1,-1]*b

...

a[-1,0]*b a[-1,1]*b ... a[-1,-1]*b

Parameters a: (M, N) ndarray

Input array

b: (P, Q) ndarray

Input array

Returns A: (M*P, N*Q) ndarray

Kronecker product of aand b.

5.9. Linear algebra (scipy.linalg) 339

SciPy Reference Guide, Release 0.13.0

Examples

>>> from numpy import array

>>> from scipy.linalg import kron

>>> kron(array([[1,2],[3,4]]), array([[1,1,1]]))

array([[1, 1, 1, 2, 2, 2],

[3, 3, 3, 4, 4, 4]])

scipy.linalg.tril(m,k=0)

Make a copy of a matrix with elements above the k-th diagonal zeroed.

Parameters m: array_like

Matrix whose elements to return

k: integer

Diagonal above which to zero elements. k== 0 is the main diagonal, k< 0 subdiagonal

and k> 0 superdiagonal.

Returns tril : ndarray

Return is the same shape and type as m.

Examples

>>> from scipy.linalg import tril

>>> tril([[1,2,3],[4,5,6],[7,8,9],[10,11,12]], -1)

array([[ 0, 0, 0],

[ 4, 0, 0],

[ 7, 8, 0],

[10, 11, 12]])

scipy.linalg.triu(m,k=0)

Make a copy of a matrix with elements below the k-th diagonal zeroed.

Parameters m: array_like

Matrix whose elements to return

k: int, optional

Diagonal below which to zero elements. k== 0 is the main diagonal, k< 0 subdiagonal

and k> 0 superdiagonal.

Returns triu : ndarray

Return matrix with zeroed elements below the k-th diagonal and has same shape and

type as m.

Examples

>>> from scipy.linalg import triu

>>> triu([[1,2,3],[4,5,6],[7,8,9],[10,11,12]], -1)

array([[ 1, 2, 3],

[ 4, 5, 6],

[ 0, 8, 9],

[ 0, 0, 12]])

5.9.2 Eigenvalue Problems

eig(a[, b, left, right, overwrite_a, ...]) Solve an ordinary or generalized eigenvalue problem of a square matrix.

eigvals(a[, b, overwrite_a, check_ﬁnite]) Compute eigenvalues from an ordinary or generalized eigenvalue problem.

eigh(a[, b, lower, eigvals_only, ...]) Solve an ordinary or generalized eigenvalue problem for a complex

eigvalsh(a[, b, lower, overwrite_a, ...]) Solve an ordinary or generalized eigenvalue problem for a complex

Continued on next page

340 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Table 5.65 – continued from previous page

eig_banded(a_band[, lower, eigvals_only, ...]) Solve real symmetric or complex hermitian band matrix eigenvalue problem.

eigvals_banded(a_band[, lower, ...]) Solve real symmetric or complex hermitian band matrix eigenvalue problem.

scipy.linalg.eig(a,b=None,left=False,right=True,overwrite_a=False,overwrite_b=False,

check_ﬁnite=True)

Solve an ordinary or generalized eigenvalue problem of a square matrix.

Find eigenvalues w and right or left eigenvectors of a general matrix:

a vr[:,i] = w[i] b vr[:,i]

a.H vl[:,i] = w[i].conj() b.H vl[:,i]

where .H is the Hermitian conjugation.

Parameters a: (M, M) array_like

A complex or real matrix whose eigenvalues and eigenvectors will be computed.

b: (M, M) array_like, optional

Right-hand side matrix in a generalized eigenvalue problem. Default is None, identity

matrix is assumed.

left : bool, optional

Whether to calculate and return left eigenvectors. Default is False.

right : bool, optional

Whether to calculate and return right eigenvectors. Default is True.

overwrite_a : bool, optional

Whether to overwrite a; may improve performance. Default is False.

overwrite_b : bool, optional

Whether to overwrite b; may improve performance. Default is False.

check_ﬁnite : boolean, optional

Whether to check that the input matrices contain only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns w: (M,) double or complex ndarray

The eigenvalues, each repeated according to its multiplicity.

vl : (M, M) double or complex ndarray

The normalized left eigenvector corresponding to the eigenvalue w[i] is the column

v[:,i]. Only returned if left=True.

vr : (M, M) double or complex ndarray

The normalized right eigenvector corresponding to the eigenvalue w[i] is the column

vr[:,i]. Only returned if right=True.

Raises LinAlgError

If eigenvalue computation does not converge.

See Also

eigh Eigenvalues and right eigenvectors for symmetric/Hermitian arrays.

scipy.linalg.eigvals(a,b=None,overwrite_a=False,check_ﬁnite=True)

Compute eigenvalues from an ordinary or generalized eigenvalue problem.

Find eigenvalues of a general matrix:

a vr[:,i] = w[i] b vr[:,i]

Parameters a: (M, M) array_like

A complex or real matrix whose eigenvalues and eigenvectors will be computed.

5.9. Linear algebra (scipy.linalg) 341

SciPy Reference Guide, Release 0.13.0

b: (M, M) array_like, optional

Right-hand side matrix in a generalized eigenvalue problem. If omitted, identity ma-

trix is assumed.

overwrite_a : boolean, optional

Whether to overwrite data in a (may improve performance)

check_ﬁnite : boolean, optional

Whether to check that the input matrices contain only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns w: (M,) double or complex ndarray

The eigenvalues, each repeated according to its multiplicity, but not in any speciﬁc

order.

Raises LinAlgError

If eigenvalue computation does not converge

See Also

eigvalsh eigenvalues of symmetric or Hermitian arrays,

eig eigenvalues and right eigenvectors of general arrays.

eigh eigenvalues and eigenvectors of symmetric/Hermitian arrays.

scipy.linalg.eigh(a,b=None,lower=True,eigvals_only=False,overwrite_a=False,over-

write_b=False,turbo=True,eigvals=None,type=1,check_ﬁnite=True)

Solve an ordinary or generalized eigenvalue problem for a complex Hermitian or real symmetric matrix.

Find eigenvalues w and optionally eigenvectors v of matrix a, where bis positive deﬁnite:

a v[:,i] = w[i] b v[:,i]

v[i,:].conj() a v[:,i] = w[i]

v[i,:].conj() b v[:,i] = 1

Parameters a: (M, M) array_like

A complex Hermitian or real symmetric matrix whose eigenvalues and eigenvectors

will be computed.

b: (M, M) array_like, optional

A complex Hermitian or real symmetric deﬁnite positive matrix in. If omitted, identity

matrix is assumed.

lower : bool, optional

Whether the pertinent array data is taken from the lower or upper triangle of a. (De-

fault: lower)

eigvals_only : bool, optional

Whether to calculate only eigenvalues and no eigenvectors. (Default: both are calcu-

lated)

turbo : bool, optional

Use divide and conquer algorithm (faster but expensive in memory, only for general-

ized eigenvalue problem and if eigvals=None)

eigvals : tuple (lo, hi), optional

Indexes of the smallest and largest (in ascending order) eigenvalues and corresponding

eigenvectors to be returned: 0 <= lo <= hi <= M-1. If omitted, all eigenvalues and

eigenvectors are returned.

type : int, optional

Speciﬁes the problem type to be solved:

342 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

type = 1: a v[:,i] = w[i] b v[:,i]

type = 2: a b v[:,i] = w[i] v[:,i]

type = 3: b a v[:,i] = w[i] v[:,i]

overwrite_a : bool, optional

Whether to overwrite data in a(may improve performance)

overwrite_b : bool, optional

Whether to overwrite data in b(may improve performance)

check_ﬁnite : boolean, optional

Whether to check that the input matrices contain only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns w: (N,) ﬂoat ndarray

The N (1<=N<=M) selected eigenvalues, in ascending order, each repeated according

to its multiplicity.

v: (M, N) complex ndarray

(if eigvals_only == False)

The normalized selected eigenvector corresponding to the eigenvalue w[i] is the col-

umn v[:,i].

Normalization:

type 1 and 3: v.conj() a v = w

type 2: inv(v).conj() a inv(v) = w

type = 1 or 2: v.conj() b v = I

type = 3: v.conj() inv(b) v = I

Raises LinAlgError :

If eigenvalue computation does not converge, an error occurred, or b matrix is not

deﬁnite positive. Note that if input matrices are not symmetric or hermitian, no error

is reported but results will be wrong.

See Also

eig eigenvalues and right eigenvectors for non-symmetric arrays

scipy.linalg.eigvalsh(a,b=None,lower=True,overwrite_a=False,overwrite_b=False,turbo=True,

eigvals=None,type=1,check_ﬁnite=True)

Solve an ordinary or generalized eigenvalue problem for a complex Hermitian or real symmetric matrix.

Find eigenvalues w of matrix a, where b is positive deﬁnite:

a v[:,i] = w[i] b v[:,i]

v[i,:].conj() a v[:,i] = w[i]

v[i,:].conj() b v[:,i] = 1

Parameters a: (M, M) array_like

A complex Hermitian or real symmetric matrix whose eigenvalues and eigenvectors

will be computed.

b: (M, M) array_like, optional

A complex Hermitian or real symmetric deﬁnite positive matrix in. If omitted, identity

matrix is assumed.

lower : bool, optional

Whether the pertinent array data is taken from the lower or upper triangle of a. (De-

fault: lower)

turbo : bool, optional

Use divide and conquer algorithm (faster but expensive in memory, only for general-

ized eigenvalue problem and if eigvals=None)

eigvals : tuple (lo, hi), optional

5.9. Linear algebra (scipy.linalg) 343

SciPy Reference Guide, Release 0.13.0

Indexes of the smallest and largest (in ascending order) eigenvalues and corresponding

eigenvectors to be returned: 0 <= lo < hi <= M-1. If omitted, all eigenvalues and

eigenvectors are returned.

type : integer, optional

Speciﬁes the problem type to be solved:

type = 1: a v[:,i] = w[i] b v[:,i]

type = 2: a b v[:,i] = w[i] v[:,i]

type = 3: b a v[:,i] = w[i] v[:,i]

overwrite_a : bool, optional

Whether to overwrite data in a(may improve performance)

overwrite_b : bool, optional

Whether to overwrite data in b(may improve performance)

check_ﬁnite : boolean, optional

Whether to check that the input matrices contain only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns w: (N,) ﬂoat ndarray

The N (1<=N<=M) selected eigenvalues, in ascending order, each repeated according

to its multiplicity.

Raises LinAlgError :

If eigenvalue computation does not converge, an error occurred, or b matrix is not

deﬁnite positive. Note that if input matrices are not symmetric or hermitian, no error

is reported but results will be wrong.

See Also

eigvals eigenvalues of general arrays

eigh eigenvalues and right eigenvectors for symmetric/Hermitian arrays

eig eigenvalues and right eigenvectors for non-symmetric arrays

scipy.linalg.eig_banded(a_band,lower=False,eigvals_only=False,overwrite_a_band=False,se-

lect=’a’,select_range=None,max_ev=0,check_ﬁnite=True)

Solve real symmetric or complex hermitian band matrix eigenvalue problem.

Find eigenvalues w and optionally right eigenvectors v of a:

a v[:,i] = w[i] v[:,i]

v.H v = identity

The matrix a is stored in a_band either in lower diagonal or upper diagonal ordered form:

a_band[u + i - j, j] == a[i,j] (if upper form; i <= j) a_band[ i - j, j] == a[i,j] (if lower form; i >= j)

where u is the number of bands above the diagonal.

Example of a_band (shape of a is (6,6), u=2):

upper form:

* * a02 a13 a24 a35

*a01 a12 a23 a34 a45

a00 a11 a22 a33 a44 a55

lower form:

a00 a11 a22 a33 a44 a55

a10 a21 a32 a43 a54 *

a20 a31 a42 a53 * *

Cells marked with * are not used.

344 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Parameters a_band : (u+1, M) array_like

The bands of the M by M matrix a.

lower : bool, optional

Is the matrix in the lower form. (Default is upper form)

eigvals_only : bool, optional

Compute only the eigenvalues and no eigenvectors. (Default: calculate also eigenvec-

tors)

overwrite_a_band : bool, optional

Discard data in a_band (may enhance performance)

select : {‘a’, ‘v’, ‘i’}, optional

Which eigenvalues to calculate

select calculated

‘a’ All eigenvalues

‘v’ Eigenvalues in the interval (min, max]

‘i’ Eigenvalues with indices min <= i <= max

select_range : (min, max), optional

Range of selected eigenvalues

max_ev : int, optional

For select==’v’, maximum number of eigenvalues expected. For other values of select,

has no meaning.

In doubt, leave this parameter untouched.

check_ﬁnite : boolean, optional

Whether to check that the input matrix contains only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns w: (M,) ndarray

The eigenvalues, in ascending order, each repeated according to its multiplicity.

v: (M, M) ﬂoat or complex ndarray

The normalized eigenvector corresponding to the eigenvalue w[i] is the column v[:,i].

Raises LinAlgError if eigenvalue computation does not converge

scipy.linalg.eigvals_banded(a_band,lower=False,overwrite_a_band=False,select=’a’,se-

lect_range=None,check_ﬁnite=True)

Solve real symmetric or complex hermitian band matrix eigenvalue problem.

Find eigenvalues w of a:

a v[:,i] = w[i] v[:,i]

v.H v = identity

The matrix a is stored in a_band either in lower diagonal or upper diagonal ordered form:

a_band[u + i - j, j] == a[i,j] (if upper form; i <= j) a_band[ i - j, j] == a[i,j] (if lower form; i >= j)

where u is the number of bands above the diagonal.

Example of a_band (shape of a is (6,6), u=2):

upper form:

* * a02 a13 a24 a35

*a01 a12 a23 a34 a45

a00 a11 a22 a33 a44 a55

lower form:

a00 a11 a22 a33 a44 a55

a10 a21 a32 a43 a54 *

a20 a31 a42 a53 * *

Cells marked with * are not used.

5.9. Linear algebra (scipy.linalg) 345

SciPy Reference Guide, Release 0.13.0

Parameters a_band : (u+1, M) array_like

The bands of the M by M matrix a.

lower : boolean

Is the matrix in the lower form. (Default is upper form)

overwrite_a_band:

Discard data in a_band (may enhance performance)

select : {‘a’, ‘v’, ‘i’}

Which eigenvalues to calculate

select calculated

‘a’ All eigenvalues

‘v’ Eigenvalues in the interval (min, max]

‘i’ Eigenvalues with indices min <= i <= max

select_range : (min, max)

Range of selected eigenvalues

check_ﬁnite : boolean, optional

Whether to check that the input matrix contains only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns w: (M,) ndarray

The eigenvalues, in ascending order, each repeated according to its multiplicity.

Raises LinAlgError if eigenvalue computation does not converge

See Also

eig_bandedeigenvalues and right eigenvectors for symmetric/Hermitian band matrices

eigvals eigenvalues of general arrays

eigh eigenvalues and right eigenvectors for symmetric/Hermitian arrays

eig eigenvalues and right eigenvectors for non-symmetric arrays

5.9.3 Decompositions

lu(a[, permute_l, overwrite_a, check_ﬁnite]) Compute pivoted LU decompostion of a matrix.

lu_factor(a[, overwrite_a, check_ﬁnite]) Compute pivoted LU decomposition of a matrix.

lu_solve(lu_and_piv, b[, trans, ...]) Solve an equation system, a x = b, given the LU factorization of a

svd(a[, full_matrices, compute_uv, ...]) Singular Value Decomposition.

svdvals(a[, overwrite_a, check_ﬁnite]) Compute singular values of a matrix.

diagsvd(s, M, N) Construct the sigma matrix in SVD from singular values and size M, N.

orth(A) Construct an orthonormal basis for the range of A using SVD

cholesky(a[, lower, overwrite_a, check_ﬁnite]) Compute the Cholesky decomposition of a matrix.

cholesky_banded(ab[, overwrite_ab, lower, ...]) Cholesky decompose a banded Hermitian positive-deﬁnite matrix

cho_factor(a[, lower, overwrite_a, check_ﬁnite]) Compute the Cholesky decomposition of a matrix, to use in cho_solve

cho_solve(c_and_lower, b[, overwrite_b, ...]) Solve the linear equations A x = b, given the Cholesky factorization of A.

cho_solve_banded(cb_and_lower, b[, ...]) Solve the linear equations A x = b, given the Cholesky factorization of A.

polar(a[, side]) Compute the polar decomposition.

qr(a[, overwrite_a, lwork, mode, pivoting, ...]) Compute QR decomposition of a matrix.

qr_multiply(a, c[, mode, pivoting, ...]) Calculate the QR decomposition and multiply Q with a matrix.

qz(A, B[, output, lwork, sort, overwrite_a, ...]) QZ decompostion for generalized eigenvalues of a pair of matrices.

schur(a[, output, lwork, overwrite_a, sort, ...]) Compute Schur decomposition of a matrix.

rsf2csf(T, Z[, check_ﬁnite]) Convert real Schur form to complex Schur form.

hessenberg(a[, calc_q, overwrite_a, ...]) Compute Hessenberg form of a matrix.

346 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.linalg.lu(a,permute_l=False,overwrite_a=False,check_ﬁnite=True)

Compute pivoted LU decompostion of a matrix.

The decomposition is:

A=PLU

where P is a permutation matrix, L lower triangular with unit diagonal elements, and U upper triangular.

Parameters a: (M, N) array_like

Array to decompose

permute_l : bool

Perform the multiplication P*L (Default: do not permute)

overwrite_a : bool

Whether to overwrite data in a (may improve performance)

check_ﬁnite : boolean, optional

Whether to check that the input matrix contains only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns (If permute_l == False)

p: (M, M) ndarray

Permutation matrix

l: (M, K) ndarray

Lower triangular or trapezoidal matrix with unit diagonal. K = min(M, N)

u: (K, N) ndarray

Upper triangular or trapezoidal matrix

(If permute_l == True)

pl : (M, K) ndarray

Permuted L matrix. K = min(M, N)

u: (K, N) ndarray

Upper triangular or trapezoidal matrix

Notes

This is a LU factorization routine written for Scipy.

scipy.linalg.lu_factor(a,overwrite_a=False,check_ﬁnite=True)

Compute pivoted LU decomposition of a matrix.

The decomposition is:

A=PLU

where P is a permutation matrix, L lower triangular with unit diagonal elements, and U upper triangular.

Parameters a: (M, M) array_like

Matrix to decompose

overwrite_a : boolean

Whether to overwrite data in A (may increase performance)

check_ﬁnite : boolean, optional

Whether to check that the input matrix contains only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns lu : (N, N) ndarray

Matrix containing U in its upper triangle, and L in its lower triangle. The unit diagonal

elements of L are not stored.

piv : (N,) ndarray

Pivot indices representing the permutation matrix P: row i of matrix was interchanged

with row piv[i].

5.9. Linear algebra (scipy.linalg) 347

SciPy Reference Guide, Release 0.13.0

See Also

lu_solve solve an equation system using the LU factorization of a matrix

Notes

This is a wrapper to the *GETRF routines from LAPACK.

scipy.linalg.lu_solve(lu_and_piv,b,trans=0,overwrite_b=False,check_ﬁnite=True)

Solve an equation system, a x = b, given the LU factorization of a

Parameters (lu, piv)

Factorization of the coefﬁcient matrix a, as given by lu_factor

b: array

Right-hand side

trans : {0, 1, 2}

Type of system to solve:

trans system

0 ax=b

1 a^T x = b

2 a^H x = b

check_ﬁnite : boolean, optional

Whether to check that the input matrices contain only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns x: array

Solution to the system

See Also

lu_factor LU factorize a matrix

scipy.linalg.svd(a,full_matrices=True,compute_uv=True,overwrite_a=False,check_ﬁnite=True)

Singular Value Decomposition.

Factorizes the matrix a into two unitary matrices U and Vh, and a 1-D array s of singular values (real, non-

negative) such that a == U*S*Vh, where S is a suitably shaped matrix of zeros with main diagonal s.

Parameters a: (M, N) array_like

Matrix to decompose.

full_matrices : bool, optional

If True, Uand Vh are of shape (M,M),(N,N). If False, the shapes are (M,K) and

(K,N), where K = min(M,N).

compute_uv : bool, optional

Whether to compute also Uand Vh in addition to s. Default is True.

overwrite_a : bool, optional

Whether to overwrite a; may improve performance. Default is False.

check_ﬁnite : boolean, optional

Whether to check that the input matrix contains only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns U: ndarray

Unitary matrix having left singular vectors as columns. Of shape (M,M) or (M,K),

depending on full_matrices.

s: ndarray

The singular values, sorted in non-increasing order. Of shape (K,), with K =

min(M, N).

348 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Vh : ndarray

Unitary matrix having right singular vectors as rows. Of shape (N,N) or (K,N)

depending on full_matrices.

For compute_uv = False, only sis returned.

Raises LinAlgError

If SVD computation does not converge.

See Also

svdvals Compute singular values of a matrix.

diagsvd Construct the Sigma matrix, given the vector s.

Examples

>>> from scipy import linalg

>>> a=np.random.randn(9,6)+1.j*np.random.randn(9,6)

>>> U, s, Vh =linalg.svd(a)

>>> U.shape, Vh.shape, s.shape

((9, 9), (6, 6), (6,))

>>> U, s, Vh =linalg.svd(a, full_matrices=False)

>>> U.shape, Vh.shape, s.shape

((9, 6), (6, 6), (6,))

>>> S=linalg.diagsvd(s, 6,6)

>>> np.allclose(a, np.dot(U, np.dot(S, Vh)))

True

>>> s2 =linalg.svd(a, compute_uv=False)

>>> np.allclose(s, s2)

True

scipy.linalg.svdvals(a,overwrite_a=False,check_ﬁnite=True)

Compute singular values of a matrix.

Parameters a: (M, N) array_like

Matrix to decompose.

overwrite_a : bool, optional

Whether to overwrite a; may improve performance. Default is False.

check_ﬁnite : boolean, optional

Whether to check that the input matrix contains only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns s: (min(M, N),) ndarray

The singular values, sorted in decreasing order.

Raises LinAlgError

If SVD computation does not converge.

See Also

svd Compute the full singular value decomposition of a matrix.

diagsvd Construct the Sigma matrix, given the vector s.

scipy.linalg.diagsvd(s,M,N)

Construct the sigma matrix in SVD from singular values and size M, N.

Parameters s: (M,) or (N,) array_like

Singular values

5.9. Linear algebra (scipy.linalg) 349

SciPy Reference Guide, Release 0.13.0

M: int

Size of the matrix whose singular values are s.

N: int

Size of the matrix whose singular values are s.

Returns S: (M, N) ndarray

The S-matrix in the singular value decomposition

scipy.linalg.orth(A)

Construct an orthonormal basis for the range of A using SVD

Parameters A: (M, N) ndarray

Input array

Returns Q: (M, K) ndarray

Orthonormal basis for the range of A. K = effective rank of A, as determined by

automatic cutoff

See Also

svd Singular value decomposition of a matrix

scipy.linalg.cholesky(a,lower=False,overwrite_a=False,check_ﬁnite=True)

Compute the Cholesky decomposition of a matrix.

Returns the Cholesky decomposition, A=LL∗or A=U∗Uof a Hermitian positive-deﬁnite matrix A.

Parameters a: (M, M) array_like

Matrix to be decomposed

lower : bool

Whether to compute the upper or lower triangular Cholesky factorization. Default is

upper-triangular.

overwrite_a : bool

Whether to overwrite data in a(may improve performance).

check_ﬁnite : boolean, optional

Whether to check that the input matrix contains only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns c: (M, M) ndarray

Upper- or lower-triangular Cholesky factor of a.

Raises LinAlgError : if decomposition fails.

Examples

>>> from scipy import array, linalg, dot

>>> a=array([[1,-2j],[2j,5]])

>>> L=linalg.cholesky(a, lower=True)

>>> L

array([[ 1.+0.j, 0.+0.j],

[ 0.+2.j, 1.+0.j]])

>>> dot(L, L.T.conj())

array([[ 1.+0.j, 0.-2.j],

[ 0.+2.j, 5.+0.j]])

scipy.linalg.cholesky_banded(ab,overwrite_ab=False,lower=False,check_ﬁnite=True)

Cholesky decompose a banded Hermitian positive-deﬁnite matrix

The matrix a is stored in ab either in lower diagonal or upper diagonal ordered form:

ab[u + i - j, j] == a[i,j] (if upper form; i <= j) ab[ i - j, j] == a[i,j] (if lower form; i >= j)

350 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Example of ab (shape of a is (6,6), u=2):

upper form:

* * a02 a13 a24 a35

*a01 a12 a23 a34 a45

a00 a11 a22 a33 a44 a55

lower form:

a00 a11 a22 a33 a44 a55

a10 a21 a32 a43 a54 *

a20 a31 a42 a53 * *

Parameters ab : (u + 1, M) array_like

Banded matrix

overwrite_ab : boolean

Discard data in ab (may enhance performance)

lower : boolean

Is the matrix in the lower form. (Default is upper form)

check_ﬁnite : boolean, optional

Whether to check that the input matrix contains only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns c: (u + 1, M) ndarray

Cholesky factorization of a, in the same banded format as ab

scipy.linalg.cho_factor(a,lower=False,overwrite_a=False,check_ﬁnite=True)

Compute the Cholesky decomposition of a matrix, to use in cho_solve

Returns a matrix containing the Cholesky decomposition, A=LL*or A=U*Uof a Hermitian positive-

deﬁnite matrix a. The return value can be directly used as the ﬁrst parameter to cho_solve.

Warning: The returned matrix also contains random data in the entries not used by the Cholesky decom-

position. If you need to zero these entries, use the function cholesky instead.

Parameters a: (M, M) array_like

Matrix to be decomposed

lower : boolean

Whether to compute the upper or lower triangular Cholesky factorization (Default:

upper-triangular)

overwrite_a : boolean

Whether to overwrite data in a (may improve performance)

check_ﬁnite : boolean, optional

Whether to check that the input matrix contains only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns c: (M, M) ndarray

Matrix whose upper or lower triangle contains the Cholesky factor of a. Other parts

of the matrix contain random data.

lower : boolean

Flag indicating whether the factor is in the lower or upper triangle

Raises LinAlgError

Raised if decomposition fails.

5.9. Linear algebra (scipy.linalg) 351

SciPy Reference Guide, Release 0.13.0

See Also

cho_solve Solve a linear set equations using the Cholesky factorization of a matrix.

scipy.linalg.cho_solve(c_and_lower,b,overwrite_b=False,check_ﬁnite=True)

Solve the linear equations A x = b, given the Cholesky factorization of A.

Parameters (c, lower) : tuple, (array, bool)

Cholesky factorization of a, as given by cho_factor

b: array

Right-hand side

check_ﬁnite : boolean, optional

Whether to check that the input matrices contain only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns x: array

The solution to the system A x = b

See Also

cho_factorCholesky factorization of a matrix

scipy.linalg.cho_solve_banded(cb_and_lower,b,overwrite_b=False,check_ﬁnite=True)

Solve the linear equations A x = b, given the Cholesky factorization of A.

Parameters (cb, lower) : tuple, (array, bool)

cb is the Cholesky factorization of A, as given by cholesky_banded. lower must be

the same value that was given to cholesky_banded.

b: array

Right-hand side

overwrite_b : bool

If True, the function will overwrite the values in b.

check_ﬁnite : boolean, optional

Whether to check that the input matrices contain only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns x: array

The solution to the system A x = b

See Also

cholesky_banded

Cholesky factorization of a banded matrix

Notes

New in version 0.8.0.

scipy.linalg.polar(a,side=’right’)

Compute the polar decomposition.

Returns the factors of the polar decomposition [R64] uand psuch that a = up (if side is “right”) or a = pu

(if side is “left”), where pis positive semideﬁnite. Depending on the shape of a, either the rows or columns of

uare orthonormal. When ais a square array, uis a square unitary array. When ais not square, the “canonical

polar decomposition” [R65] is computed.

Parameters a: (m, n) array_like

The array to be factored.

side : string, optional

352 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Determines whether a right or left polar decomposition is computed. If side is “right”,

then a = up. If side is “left”, then a = pu. The default is “right”.

Returns u: (m, n) ndarray

If ais square, then uis unitary. If m > n, then the columns of aare orthonormal, and

if m < n, then the rows of uare orthonormal.

p: ndarray

pis Hermitian positive semideﬁnite. If ais nonsingular, pis positive deﬁnite. The

shape of pis (n, n) or (m, m), depending on whether side is “right” or “left”, respec-

tively.

References

[R64],[R65]

Examples

>>> a=np.array([[1,-1], [2,4]])

>>> u,p=polar(a)

>>> u

array([[ 0.85749293, -0.51449576],

[ 0.51449576, 0.85749293]])

>>> p

array([[ 1.88648444, 1.2004901 ],

[ 1.2004901 , 3.94446746]])

A non-square example, with m < n:

>>> b=np.array([[0.5,1,2], [1.5,3,4]])

>>> u,p=polar(b)

>>> u

array([[-0.21196618, -0.42393237, 0.88054056],

[ 0.39378971, 0.78757942, 0.4739708 ]])

>>> p

array([[ 0.48470147, 0.96940295, 1.15122648],

[ 0.96940295, 1.9388059 , 2.30245295],

[ 1.15122648, 2.30245295, 3.65696431]])

>>> u.dot(p) # Verify the decomposition.

array([[ 0.5, 1. , 2. ],

[ 1.5, 3. , 4. ]])

>>> u.dot(u.T) # The rows of u are orthonormal.

array([[ 1.00000000e+00, -2.07353665e-17],

[ -2.07353665e-17, 1.00000000e+00]])

Another non-square example, with m > n:

>>> c=b.T

>>> u,p=polar(c)

>>> u

array([[-0.21196618, 0.39378971],

[-0.42393237, 0.78757942],

[ 0.88054056, 0.4739708 ]])

>>> p

array([[ 1.23116567, 1.93241587],

[ 1.93241587, 4.84930602]])

>>> u.dot(p) # Verify the decomposition.

array([[ 0.5, 1.5],

[ 1. , 3. ],

[ 2. , 4. ]])

>>> u.T.dot(u) # The columns of u are orthonormal.

5.9. Linear algebra (scipy.linalg) 353

SciPy Reference Guide, Release 0.13.0

array([[ 1.00000000e+00, -1.26363763e-16],

[ -1.26363763e-16, 1.00000000e+00]])

scipy.linalg.qr(a,overwrite_a=False,lwork=None,mode=’full’,pivoting=False,check_ﬁnite=True)

Compute QR decomposition of a matrix.

Calculate the decomposition A=QRwhere Q is unitary/orthogonal and R upper triangular.

Parameters a: (M, N) array_like

Matrix to be decomposed

overwrite_a : bool, optional

Whether data in a is overwritten (may improve performance)

lwork : int, optional

Work array size, lwork >= a.shape[1]. If None or -1, an optimal size is computed.

mode : {‘full’, ‘r’, ‘economic’, ‘raw’}, optional

Determines what information is to be returned: either both Q and R (‘full’, default),

only R (‘r’) or both Q and R but computed in economy-size (‘economic’, see Notes).

The ﬁnal option ‘raw’ (added in Scipy 0.11) makes the function return two matrices

(Q, TAU) in the internal format used by LAPACK.

pivoting : bool, optional

Whether or not factorization should include pivoting for rank-revealing qr decompo-

sition. If pivoting, compute the decomposition AP=QRas above, but where P is

chosen such that the diagonal of R is non-increasing.

check_ﬁnite : boolean, optional

Whether to check that the input matrix contains only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns Q: ﬂoat or complex ndarray

Of shape (M, M), or (M, K) for mode=’economic’. Not returned if mode=’r’.

R: ﬂoat or complex ndarray

Of shape (M, N), or (K, N) for mode=’economic’.K = min(M, N).

P: int ndarray

Of shape (N,) for pivoting=True. Not returned if pivoting=False.

Raises LinAlgError

Raised if decomposition fails

Notes

This is an interface to the LAPACK routines dgeqrf, zgeqrf, dorgqr, zungqr, dgeqp3, and zgeqp3.

If mode=economic, the shapes of Q and R are (M, K) and (K, N) instead of (M,M) and (M,N), with

K=min(M,N).

Examples

>>> from scipy import random, linalg, dot, diag, all, allclose

>>> a=random.randn(9,6)

>>> q,r=linalg.qr(a)

>>> allclose(a, np.dot(q, r))

True

>>> q.shape, r.shape

((9, 9), (9, 6))

>>> r2 =linalg.qr(a, mode=’r’)

>>> allclose(r, r2)

True

354 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

>>> q3, r3 =linalg.qr(a, mode=’economic’)

>>> q3.shape, r3.shape

((9, 6), (6, 6))

>>> q4, r4, p4 =linalg.qr(a, pivoting=True)

>>> d=abs(diag(r4))

>>> all(d[1:] <= d[:-1])

True

>>> allclose(a[:, p4], dot(q4, r4))

True

>>> q4.shape, r4.shape, p4.shape

((9, 9), (9, 6), (6,))

>>> q5, r5, p5 =linalg.qr(a, mode=’economic’, pivoting=True)

>>> q5.shape, r5.shape, p5.shape

((9, 6), (6, 6), (6,))

scipy.linalg.qr_multiply(a,c,mode=’right’,pivoting=False,conjugate=False,overwrite_a=False,

overwrite_c=False)

Calculate the QR decomposition and multiply Q with a matrix.

Calculate the decomposition A=QRwhere Q is unitary/orthogonal and R upper triangular. Multiply Q with

a vector or a matrix c. New in version 0.11.0.

Parameters a: ndarray, shape (M, N)

Matrix to be decomposed

c: ndarray, one- or two-dimensional

calculate the product of c and q, depending on the mode:

mode : {‘left’, ‘right’}, optional

dot(Q, c) is returned if mode is ‘left’, dot(c, Q) is returned if mode is

‘right’. The shape of c must be appropriate for the matrix multiplications, if mode is

‘left’, min(a.shape) == c.shape[0], if mode is ‘right’, a.shape[0] ==

c.shape[1].

pivoting : bool, optional

Whether or not factorization should include pivoting for rank-revealing qr decompo-

sition, see the documentation of qr.

conjugate : bool, optional

Whether Q should be complex-conjugated. This might be faster than explicit conju-

gation.

overwrite_a : bool, optional

Whether data in a is overwritten (may improve performance)

overwrite_c : bool, optional

Whether data in c is overwritten (may improve performance). If this is used, c must

be big enough to keep the result, i.e. c.shape[0] = a.shape[0] if mode is ‘left’.

Returns CQ : ﬂoat or complex ndarray

the product of Q and c, as deﬁned in mode

R: ﬂoat or complex ndarray

Of shape (K, N), K = min(M, N).

P: ndarray of ints

Of shape (N,) for pivoting=True. Not returned if pivoting=False.

Raises LinAlgError

Raised if decomposition fails

Notes

This is an interface to the LAPACK routines dgeqrf, zgeqrf, dormqr, zunmqr, dgeqp3, and zgeqp3.

5.9. Linear algebra (scipy.linalg) 355

SciPy Reference Guide, Release 0.13.0

scipy.linalg.qz(A,B,output=’real’,lwork=None,sort=None,overwrite_a=False,overwrite_b=False,

check_ﬁnite=True)

QZ decompostion for generalized eigenvalues of a pair of matrices.

The QZ, or generalized Schur, decomposition for a pair of N x N nonsymmetric matrices (A,B) is:

(A,B) = (Q*AA*Z’, Q*BB*Z’)

where AA, BB is in generalized Schur form if BB is upper-triangular with non-negative diagonal and AA is

upper-triangular, or for real QZ decomposition (output=’real’) block upper triangular with 1x1 and 2x2

blocks. In this case, the 1x1 blocks correspond to real generalized eigenvalues and 2x2 blocks are ‘standardized’

by making the corresponding elements of BB have the form:

[a0]

[0b]

and the pair of corresponding 2x2 blocks in AA and BB will have a complex conjugate pair of generalized

eigenvalues. If (output=’complex’) or A and B are complex matrices, Z’ denotes the conjugate-transpose

of Z. Q and Z are unitary matrices. New in version 0.11.0.

Parameters A: (N, N) array_like

2d array to decompose

B: (N, N) array_like

2d array to decompose

output : str {‘real’,’complex’}

Construct the real or complex QZ decomposition for real matrices. Default is ‘real’.

lwork : int, optional

Work array size. If None or -1, it is automatically computed.

sort : {None, callable, ‘lhp’, ‘rhp’, ‘iuc’, ‘ouc’}, optional

NOTE: THIS INPUT IS DISABLED FOR NOW, IT DOESN’T WORK WELL ON

WINDOWS.

Speciﬁes whether the upper eigenvalues should be sorted. A callable may be passed

that, given a eigenvalue, returns a boolean denoting whether the eigenvalue should be

sorted to the top-left (True). For real matrix pairs, the sort function takes three real

arguments (alphar, alphai, beta). The eigenvalue x = (alphar + alphai*1j)/beta. For

complex matrix pairs or output=’complex’, the sort function takes two complex argu-

ments (alpha, beta). The eigenvalue x = (alpha/beta). Alternatively, string parameters

may be used:

•‘lhp’ Left-hand plane (x.real < 0.0)

•‘rhp’ Right-hand plane (x.real > 0.0)

•‘iuc’ Inside the unit circle (x*x.conjugate() <= 1.0)

•‘ouc’ Outside the unit circle (x*x.conjugate() > 1.0)

Defaults to None (no sorting).

check_ﬁnite : boolean

If true checks the elements of Aand Bare ﬁnite numbers. If false does no checking

and passes matrix through to underlying algorithm.

Returns AA : (N, N) ndarray

Generalized Schur form of A.

BB : (N, N) ndarray

Generalized Schur form of B.

Q: (N, N) ndarray

The left Schur vectors.

Z: (N, N) ndarray

The right Schur vectors.

sdim : int, optional

If sorting was requested, a ﬁfth return value will contain the number of eigenvalues

for which the sort condition was True.

356 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Notes

Q is transposed versus the equivalent function in Matlab.

Examples

>>> from scipy import linalg

>>> np.random.seed(1234)

>>> A=np.arange(9).reshape((3,3))

>>> B=np.random.randn(3,3)

>>> AA, BB, Q, Z =linalg.qz(A, B)

>>> AA

array([[-13.40928183, -4.62471562, 1.09215523],

[ 0. , 0. , 1.22805978],

[ 0. , 0. , 0.31973817]])

>>> BB

array([[ 0.33362547, -1.37393632, 0.02179805],

[ 0. , 1.68144922, 0.74683866],

[ 0. , 0. , 0.9258294 ]])

>>> Q

array([[ 0.14134727, -0.97562773, 0.16784365],

[ 0.49835904, -0.07636948, -0.86360059],

[ 0.85537081, 0.20571399, 0.47541828]])

>>> Z

array([[-0.24900855, -0.51772687, 0.81850696],

[-0.79813178, 0.58842606, 0.12938478],

[-0.54861681, -0.6210585 , -0.55973739]])

scipy.linalg.schur(a,output=’real’,lwork=None,overwrite_a=False,sort=None,

check_ﬁnite=True)

Compute Schur decomposition of a matrix.

The Schur decomposition is:

A=ZTZ^H

where Z is unitary and T is either upper-triangular, or for real Schur decomposition (output=’real’), quasi-upper

triangular. In the quasi-triangular form, 2x2 blocks describing complex-valued eigenvalue pairs may extrude

from the diagonal.

Parameters a: (M, M) array_like

Matrix to decompose

output : {‘real’, ‘complex’}, optional

Construct the real or complex Schur decomposition (for real matrices).

lwork : int, optional

Work array size. If None or -1, it is automatically computed.

overwrite_a : bool, optional

Whether to overwrite data in a (may improve performance).

sort : {None, callable, ‘lhp’, ‘rhp’, ‘iuc’, ‘ouc’}, optional

Speciﬁes whether the upper eigenvalues should be sorted. A callable may be passed

that, given a eigenvalue, returns a boolean denoting whether the eigenvalue should be

sorted to the top-left (True). Alternatively, string parameters may be used:

’lhp’ Left-hand plane (x.real < 0.0)

’rhp’ Right-hand plane (x.real > 0.0)

’iuc’ Inside the unit circle (x*x.conjugate() <= 1.0)

’ouc’ Outside the unit circle (x*x.conjugate() > 1.0)

Defaults to None (no sorting).

5.9. Linear algebra (scipy.linalg) 357

SciPy Reference Guide, Release 0.13.0

check_ﬁnite : boolean, optional

Whether to check that the input matrix contains only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns T: (M, M) ndarray

Schur form of A. It is real-valued for the real Schur decomposition.

Z: (M, M) ndarray

An unitary Schur transformation matrix for A. It is real-valued for the real Schur

decomposition.

sdim : int

If and only if sorting was requested, a third return value will contain the number of

eigenvalues satisfying the sort condition.

Raises LinAlgError

Error raised under three conditions:

1.The algorithm failed due to a failure of the QR algorithm to compute all eigenval-

ues

2.If eigenvalue sorting was requested, the eigenvalues could not be reordered due to

a failure to separate eigenvalues, usually because of poor conditioning

3.If eigenvalue sorting was requested, roundoff errors caused the leading eigenval-

ues to no longer satisfy the sorting condition

See Also

rsf2csf Convert real Schur form to complex Schur form

scipy.linalg.rsf2csf(T,Z,check_ﬁnite=True)

Convert real Schur form to complex Schur form.

Convert a quasi-diagonal real-valued Schur form to the upper triangular complex-valued Schur form.

Parameters T: (M, M) array_like

Real Schur form of the original matrix

Z: (M, M) array_like

Schur transformation matrix

check_ﬁnite : boolean, optional

Whether to check that the input matrices contain only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns T: (M, M) ndarray

Complex Schur form of the original matrix

Z: (M, M) ndarray

Schur transformation matrix corresponding to the complex form

See Also

schur Schur decompose a matrix

scipy.linalg.hessenberg(a,calc_q=False,overwrite_a=False,check_ﬁnite=True)

Compute Hessenberg form of a matrix.

The Hessenberg decomposition is:

A=QHQ^H

where Qis unitary/orthogonal and Hhas only zero elements below the ﬁrst sub-diagonal.

Parameters a: (M, M) array_like

Matrix to bring into Hessenberg form.

358 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

calc_q : bool, optional

Whether to compute the transformation matrix. Default is False.

overwrite_a : bool, optional

Whether to overwrite a; may improve performance. Default is False.

check_ﬁnite : boolean, optional

Whether to check that the input matrix contains only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns H: (M, M) ndarray

Hessenberg form of a.

Q: (M, M) ndarray

Unitary/orthogonal similarity transformation matrix A=QHQ^H. Only returned

if calc_q=True.

See Also

scipy.linalg.interpolative – Interpolative matrix decompositions

5.9.4 Matrix Functions

expm(A[, q]) Compute the matrix exponential using Pade approximation.

logm(A[, disp]) Compute matrix logarithm.

cosm(A) Compute the matrix cosine.

sinm(A) Compute the matrix sine.

tanm(A) Compute the matrix tangent.

coshm(A) Compute the hyperbolic matrix cosine.

sinhm(A) Compute the hyperbolic matrix sine.

tanhm(A) Compute the hyperbolic matrix tangent.

signm(a[, disp]) Matrix sign function.

sqrtm(A[, disp, blocksize]) Matrix square root.

funm(A, func[, disp]) Evaluate a matrix function speciﬁed by a callable.

expm_frechet(A, E[, method, compute_expm, ...]) Frechet derivative of the matrix exponential of A in the direction E.

fractional_matrix_power(A, t)

scipy.linalg.expm(A,q=None)

Compute the matrix exponential using Pade approximation.

Parameters A: (N, N) array_like

Matrix to be exponentiated

Returns expm : (N, N) ndarray

Matrix exponential of A

References

N. J. Higham, “The Scaling and Squaring Method for the Matrix Exponential Revisited”, SIAM. J. Matrix Anal.

& Appl. 26, 1179 (2005).

scipy.linalg.logm(A,disp=True)

Compute matrix logarithm.

The matrix logarithm is the inverse of expm: expm(logm(A)) == A

Parameters A: (N, N) array_like

Matrix whose logarithm to evaluate

5.9. Linear algebra (scipy.linalg) 359

SciPy Reference Guide, Release 0.13.0

disp : bool, optional

Print warning if error in the result is estimated large instead of returning estimated

error. (Default: True)

Returns logm : (N, N) ndarray

Matrix logarithm of A

errest : ﬂoat

(if disp == False)

1-norm of the estimated error, ||err||_1 / ||A||_1

scipy.linalg.cosm(A)

Compute the matrix cosine.

This routine uses expm to compute the matrix exponentials.

Parameters A: (N, N) array_like

Input array

Returns cosm : (N, N) ndarray

Matrix cosine of A

scipy.linalg.sinm(A)

Compute the matrix sine.

This routine uses expm to compute the matrix exponentials.

Parameters A: (N, N) array_like

Input array.

Returns sinm : (N, N) ndarray

Matrix cosine of A

scipy.linalg.tanm(A)

Compute the matrix tangent.

This routine uses expm to compute the matrix exponentials.

Parameters A: (N, N) array_like

Input array.

Returns tanm : (N, N) ndarray

Matrix tangent of A

scipy.linalg.coshm(A)

Compute the hyperbolic matrix cosine.

This routine uses expm to compute the matrix exponentials.

Parameters A: (N, N) array_like

Input array.

Returns coshm : (N, N) ndarray

Hyperbolic matrix cosine of A

scipy.linalg.sinhm(A)

Compute the hyperbolic matrix sine.

This routine uses expm to compute the matrix exponentials.

Parameters A: (N, N) array_like

Input array.

Returns sinhm : (N, N) ndarray

Hyperbolic matrix sine of A

360 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.linalg.tanhm(A)

Compute the hyperbolic matrix tangent.

This routine uses expm to compute the matrix exponentials.

Parameters A: (N, N) array_like

Input array

Returns tanhm : (N, N) ndarray

Hyperbolic matrix tangent of A

scipy.linalg.signm(a,disp=True)

Matrix sign function.

Extension of the scalar sign(x) to matrices.

Parameters A: (N, N) array_like

Matrix at which to evaluate the sign function

disp : bool, optional

Print warning if error in the result is estimated large instead of returning estimated

error. (Default: True)

Returns signm : (N, N) ndarray

Value of the sign function at A

errest : ﬂoat

(if disp == False)

1-norm of the estimated error, ||err||_1 / ||A||_1

Examples

>>> from scipy.linalg import signm, eigvals

>>> a=[[1,2,3], [1,2,1], [1,1,1]]

>>> eigvals(a)

array([ 4.12488542+0.j, -0.76155718+0.j, 0.63667176+0.j])

>>> eigvals(signm(a))

array([-1.+0.j, 1.+0.j, 1.+0.j])

scipy.linalg.sqrtm(A,disp=True,blocksize=64)

Matrix square root.

Parameters A: (N, N) array_like

Matrix whose square root to evaluate

disp : bool, optional

Print warning if error in the result is estimated large instead of returning estimated

error. (Default: True)

blocksize : integer, optional

If the blocksize is not degenerate with respect to the size of the input array, then use a

blocked algorithm. (Default: 64)

Returns sqrtm : (N, N) ndarray

Value of the sqrt function at A

errest : ﬂoat

(if disp == False)

Frobenius norm of the estimated error, ||err||_F / ||A||_F

References

[R66]

scipy.linalg.funm(A,func,disp=True)

Evaluate a matrix function speciﬁed by a callable.

5.9. Linear algebra (scipy.linalg) 361

SciPy Reference Guide, Release 0.13.0

Returns the value of matrix-valued function fat A. The function fis an extension of the scalar-valued function

func to matrices.

Parameters A: (N, N) array_like

Matrix at which to evaluate the function

func : callable

Callable object that evaluates a scalar function f. Must be vectorized (eg. using vec-

torize).

disp : bool, optional

Print warning if error in the result is estimated large instead of returning estimated

error. (Default: True)

Returns funm : (N, N) ndarray

Value of the matrix function speciﬁed by func evaluated at A

errest : ﬂoat

(if disp == False)

1-norm of the estimated error, ||err||_1 / ||A||_1

scipy.linalg.expm_frechet(A,E,method=’SPS’,compute_expm=True,check_ﬁnite=True)

Frechet derivative of the matrix exponential of A in the direction E. New in version 0.13.0.

Parameters A: (N, N) array_like

Matrix of which to take the matrix exponential.

E: (N, N) array_like

Matrix direction in which to take the Frechet derivative.

method : str, optional

Choice of algorithm. Should be one of

•SPS

•blockEnlarge

compute_expm : bool, optional

Whether to compute also expm_A in addition to expm_frechet_AE. Default is True.

check_ﬁnite : boolean, optional

Whether to check that the input matrix contains only ﬁnite numbers. Disabling may

give a performance gain, but may result in problems (crashes, non-termination) if the

inputs do contain inﬁnities or NaNs.

Returns expm_A : ndarray

Matrix exponential of A.

expm_frechet_AE : ndarray

Frechet derivative of the matrix exponential of A in the direction E.

For compute_expm = False, only expm_frechet_AE is returned.

See Also

expm Compute the exponential of a matrix.

Notes

This section describes the available implementations that can be selected by the method parameter. The default

method is SPS.

Method blockEnlarge is a naive algorithm.

Method SPS is Scaling-Pade-Squaring [R60]. It is a sophisticated implementation which should take only about

3/8 as much time as the naive implementation. The asymptotics are the same.

References

[R60]

362 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Examples

>>> import scipy.linalg

>>> A=np.random.randn(3,3)

>>> E=np.random.randn(3,3)

>>> expm_A, expm_frechet_AE =scipy.linalg.expm_frechet(A, E)

>>> expm_A.shape, expm_frechet_AE.shape

((3, 3), (3, 3))

>>> import scipy.linalg

>>> A=np.random.randn(3,3)

>>> E=np.random.randn(3,3)

>>> expm_A, expm_frechet_AE =scipy.linalg.expm_frechet(A, E)

>>> M=np.zeros((6,6))

>>> M[:3, :3]=A; M[:3,3:] =E; M[3:, 3:] =A

>>> expm_M =scipy.linalg.expm(M)

>>> np.allclose(expm_A, expm_M[:3, :3])

True

>>> np.allclose(expm_frechet_AE, expm_M[:3,3:])

True

scipy.linalg.fractional_matrix_power(A,t)

5.9.5 Matrix Equation Solvers

solve_sylvester(a, b, q) Computes a solution (X) to the Sylvester equation (AX + XB = Q).

solve_continuous_are(a, b, q, r) Solves the continuous algebraic Riccati equation, or CARE, deﬁned

solve_discrete_are(a, b, q, r) Solves the disctrete algebraic Riccati equation, or DARE, deﬁned as

solve_discrete_lyapunov(a, q) Solves the Discrete Lyapunov Equation (A’XA-X=-Q) directly.

solve_lyapunov(a, q) Solves the continuous Lyapunov equation (AX + XA^H = Q) given the values

scipy.linalg.solve_sylvester(a,b,q)

Computes a solution (X) to the Sylvester equation (AX + XB = Q). New in version 0.11.0.

Parameters a: (M, M) array_like

Leading matrix of the Sylvester equation

b: (N, N) array_like

Trailing matrix of the Sylvester equation

q: (M, N) array_like

Right-hand side

Returns x: (M, N) ndarray

The solution to the Sylvester equation.

Raises LinAlgError

If solution was not found

Notes

Computes a solution to the Sylvester matrix equation via the Bartels- Stewart algorithm. The A and B matri-

ces ﬁrst undergo Schur decompositions. The resulting matrices are used to construct an alternative Sylvester

equation (RY + YS^T = F) where the R and S matrices are in quasi-triangular form (or, when R, S or F are

complex, triangular form). The simpliﬁed equation is then solved using *TRSYL from LAPACK directly.

scipy.linalg.solve_continuous_are(a,b,q,r)

Solves the continuous algebraic Riccati equation, or CARE, deﬁned as (A’X + XA - XBR^-1B’X+Q=0) directly

5.9. Linear algebra (scipy.linalg) 363

SciPy Reference Guide, Release 0.13.0

using a Schur decomposition method. New in version 0.11.0.

Parameters a: (M, M) array_like

Input

b: (M, N) array_like

Input

q: (M, M) array_like

Input

r: (N, N) array_like

Non-singular, square matrix

Returns x: (M, M) ndarray

Solution to the continuous algebraic Riccati equation

See Also

solve_discrete_are

Solves the discrete algebraic Riccati equation

Notes

Method taken from: Laub, “A Schur Method for Solving Algebraic Riccati Equations.”

U.S. Energy Research and Development Agency under contract ERDA-E(49-18)-2087.

http://dspace.mit.edu/bitstream/handle/1721.1/1301/R-0859-05666488.pdf

scipy.linalg.solve_discrete_are(a,b,q,r)

Solves the disctrete algebraic Riccati equation, or DARE, deﬁned as (X = A’XA-(A’XB)(R+B’XB)^-

1(B’XA)+Q), directly using a Schur decomposition method. New in version 0.11.0.

Parameters a: (M, M) array_like

Non-singular, square matrix

b: (M, N) array_like

Input

q: (M, M) array_like

Input

r: (N, N) array_like

Non-singular, square matrix

Returns x: ndarray

Solution to the continuous Lyapunov equation

See Also

solve_continuous_are

Solves the continuous algebraic Riccati equation

Notes

Method taken from: Laub, “A Schur Method for Solving Algebraic Riccati Equations.”

U.S. Energy Research and Development Agency under contract ERDA-E(49-18)-2087.

http://dspace.mit.edu/bitstream/handle/1721.1/1301/R-0859-05666488.pdf

scipy.linalg.solve_discrete_lyapunov(a,q)

Solves the Discrete Lyapunov Equation (A’XA-X=-Q) directly. New in version 0.11.0.

Parameters a: (M, M) array_like

A square matrix

q: (M, M) array_like

Right-hand side square matrix

Returns x: ndarray

Solution to the continuous Lyapunov equation

364 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Notes

Algorithm is based on a direct analytical solution from: Hamilton, James D. Time Series Analysis, Prince-

ton: Princeton University Press, 1994. 265. Print. http://www.scribd.com/doc/20577138/Hamilton-1994-Time-

Series-Analysis

scipy.linalg.solve_lyapunov(a,q)

Solves the continuous Lyapunov equation (AX + XA^H = Q) given the values of A and Q using the Bartels-

Stewart algorithm. New in version 0.11.0.

Parameters a: array_like

A square matrix

q: array_like

Right-hand side square matrix

Returns x: array_like

Solution to the continuous Lyapunov equation

See Also

solve_sylvester

computes the solution to the Sylvester equation

Notes

Because the continuous Lyapunov equation is just a special form of the Sylvester equation, this solver relies

entirely on solve_sylvester for a solution.

5.9.6 Special Matrices

block_diag(*arrs) Create a block diagonal matrix from provided arrays.

circulant(c) Construct a circulant matrix.

companion(a) Create a companion matrix.

hadamard(n[, dtype]) Construct a Hadamard matrix.

hankel(c[, r]) Construct a Hankel matrix.

hilbert(n) Create a Hilbert matrix of order n.

invhilbert(n[, exact]) Compute the inverse of the Hilbert matrix of order n.

leslie(f, s) Create a Leslie matrix.

pascal(n[, kind, exact]) Returns the n x n Pascal matrix.

toeplitz(c[, r]) Construct a Toeplitz matrix.

tri(N[, M, k, dtype]) Construct (N, M) matrix ﬁlled with ones at and below the k-th diagonal.

scipy.linalg.block_diag(*arrs)

Create a block diagonal matrix from provided arrays.

Given the inputs A,Band C, the output will have these arrays arranged on the diagonal:

[[A, 0,0],

[0,B,0],

[0,0, C]]

Parameters A, B, C, ... : array_like, up to 2-D

Input arrays. A 1-D array or array_like sequence of length n‘is treated as a 2-D array

with shape ‘‘(1,n)‘.

Returns D: ndarray

Array with A,B,C, ... on the diagonal. Dhas the same dtype as A.

5.9. Linear algebra (scipy.linalg) 365

SciPy Reference Guide, Release 0.13.0

Notes

If all the input arrays are square, the output is known as a block diagonal matrix.

Examples

>>> from scipy.linalg import block_diag

>>> A=[[1,0],

... [0,1]]

>>> B=[[3,4,5],

... [6,7,8]]

>>> C=[[7]]

>>> block_diag(A, B, C)

[[100000]

[010000]

[003450]

[006780]

[000007]]

>>> block_diag(1.0, [2,3], [[4,5], [6,7]])

array([[ 1., 0., 0., 0., 0.],

[ 0., 2., 3., 0., 0.],

[ 0., 0., 0., 4., 5.],

[ 0., 0., 0., 6., 7.]])

scipy.linalg.circulant(c)

Construct a circulant matrix.

Parameters c: (N,) array_like

1-D array, the ﬁrst column of the matrix.

Returns A: (N, N) ndarray

A circulant matrix whose ﬁrst column is c.

See Also

toeplitz Toeplitz matrix

hankel Hankel matrix

Notes

New in version 0.8.0.

Examples

>>> from scipy.linalg import circulant

>>> circulant([1,2,3])

array([[1, 3, 2],

[2, 1, 3],

[3, 2, 1]])

scipy.linalg.companion(a)

Create a companion matrix.

Create the companion matrix [R59] associated with the polynomial whose coefﬁcients are given in a.

Parameters a: (N,) array_like

1-D array of polynomial coefﬁcients. The length of amust be at least two, and a[0]

must not be zero.

Returns c: (N-1, N-1) ndarray

The ﬁrst row of cis -a[1:]/a[0], and the ﬁrst sub-diagonal is all ones. The data-

type of the array is the same as the data-type of 1.0*a[0].

366 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Raises ValueError

If any of the following are true: a) a.ndim != 1; b) a.size < 2; c) a[0] ==

Notes

New in version 0.8.0.

References

[R59]

Examples

>>> from scipy.linalg import companion

>>> companion([1,-10,31,-30])

array([[ 10., -31., 30.],

[ 1., 0., 0.],

[ 0., 1., 0.]])

scipy.linalg.hadamard(n,dtype=<type ‘int’>)

Construct a Hadamard matrix.

Constructs an n-by-n Hadamard matrix, using Sylvester’s construction. nmust be a power of 2.

Parameters n: int

The order of the matrix. nmust be a power of 2.

dtype : numpy dtype

The data type of the array to be constructed.

Returns H: (n, n) ndarray

The Hadamard matrix.

Notes

New in version 0.8.0.

Examples

>>> from scipy.linalg import hadamard

>>> hadamard(2, dtype=complex)

array([[ 1.+0.j, 1.+0.j],

[ 1.+0.j, -1.-0.j]])

>>> hadamard(4)

array([[ 1, 1, 1, 1],

[ 1, -1, 1, -1],

[ 1, 1, -1, -1],

[ 1, -1, -1, 1]])

scipy.linalg.hankel(c,r=None)

Construct a Hankel matrix.

The Hankel matrix has constant anti-diagonals, with cas its ﬁrst column and ras its last row. If ris not given,

then r = zeros_like(c) is assumed.

Parameters c: array_like

First column of the matrix. Whatever the actual shape of c, it will be converted to a

1-D array.

r: array_like

5.9. Linear algebra (scipy.linalg) 367

SciPy Reference Guide, Release 0.13.0

Last row of the matrix. If None, r = zeros_like(c) is assumed. r[0] is ignored;

the last row of the returned matrix is [c[-1], r[1:]]. Whatever the actual shape

of r, it will be converted to a 1-D array.

Returns A: (len(c), len(r)) ndarray

The Hankel matrix. Dtype is the same as (c[0] + r[0]).dtype.

See Also

toeplitz Toeplitz matrix

circulant circulant matrix

Examples

>>> from scipy.linalg import hankel

>>> hankel([1,17,99])

array([[ 1, 17, 99],

[17, 99, 0],

[99, 0, 0]])

>>> hankel([1,2,3,4], [4,7,7,8,9])

array([[1, 2, 3, 4, 7],

[2, 3, 4, 7, 7],

[3, 4, 7, 7, 8],

[4, 7, 7, 8, 9]])

scipy.linalg.hilbert(n)

Create a Hilbert matrix of order n.

Returns the nby narray with entries h[i,j] = 1 / (i + j + 1).

Parameters n: int

The size of the array to create.

Returns h: (n, n) ndarray

The Hilbert matrix.

See Also

invhilbertCompute the inverse of a Hilbert matrix.

Notes

New in version 0.10.0.

Examples

>>> from scipy.linalg import hilbert

>>> hilbert(3)

array([[ 1. , 0.5 , 0.33333333],

[ 0.5 , 0.33333333, 0.25 ],

[ 0.33333333, 0.25 , 0.2 ]])

scipy.linalg.invhilbert(n,exact=False)

Compute the inverse of the Hilbert matrix of order n.

The entries in the inverse of a Hilbert matrix are integers. When nis greater than 14, some entries in the inverse

exceed the upper limit of 64 bit integers. The exact argument provides two options for dealing with these large

integers.

Parameters n: int

The order of the Hilbert matrix.

exact : bool

368 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

If False, the data type of the array that is returned is np.ﬂoat64, and the array is an

approximation of the inverse. If True, the array is the exact integer inverse array. To

represent the exact inverse when n > 14, the returned array is an object array of long

integers. For n <= 14, the exact inverse is returned as an array with data type np.int64.

Returns invh : (n, n) ndarray

The data type of the array is np.ﬂoat64 if exact is False. If exact is True, the data type

is either np.int64 (for n <= 14) or object (for n > 14). In the latter case, the objects in

the array will be long integers.

See Also

hilbert Create a Hilbert matrix.

Notes

New in version 0.10.0.

Examples

>>> from scipy.linalg import invhilbert

>>> invhilbert(4)

array([[ 16., -120., 240., -140.],

[ -120., 1200., -2700., 1680.],

[ 240., -2700., 6480., -4200.],

[ -140., 1680., -4200., 2800.]])

>>> invhilbert(4, exact=True)

array([[ 16, -120, 240, -140],

[ -120, 1200, -2700, 1680],

[ 240, -2700, 6480, -4200],

[ -140, 1680, -4200, 2800]], dtype=int64)

>>> invhilbert(16)[7,7]

4.2475099528537506e+19

>>> invhilbert(16, exact=True)[7,7]

42475099528537378560L

scipy.linalg.leslie(f,s)

Create a Leslie matrix.

Given the length n array of fecundity coefﬁcients fand the length n-1 array of survival coefﬁcents s, return the

associated Leslie matrix.

Parameters f: (N,) array_like

The “fecundity” coefﬁcients.

s: (N-1,) array_like

The “survival” coefﬁcients, has to be 1-D. The length of smust be one less than the

length of f, and it must be at least 1.

Returns L: (N, N) ndarray

The array is zero except for the ﬁrst row, which is f, and the ﬁrst sub-diagonal, which

is s. The data-type of the array will be the data-type of f[0]+s[0].

Notes

New in version 0.8.0. The Leslie matrix is used to model discrete-time, age-structured population growth [R61]

[R62]. In a population with nage classes, two sets of parameters deﬁne a Leslie matrix: the n“fecundity

coefﬁcients”, which give the number of offspring per-capita produced by each age class, and the n- 1 “survival

coefﬁcients”, which give the per-capita survival rate of each age class.

5.9. Linear algebra (scipy.linalg) 369

SciPy Reference Guide, Release 0.13.0

References

[R61],[R62]

Examples

>>> from scipy.linalg import leslie

>>> leslie([0.1,2.0,1.0,0.1], [0.2,0.8,0.7])

array([[ 0.1, 2. , 1. , 0.1],

[ 0.2, 0. , 0. , 0. ],

[ 0. , 0.8, 0. , 0. ],

[ 0. , 0. , 0.7, 0. ]])

scipy.linalg.pascal(n,kind=’symmetric’,exact=True)

Returns the n x n Pascal matrix.

The Pascal matrix is a matrix containing the binomial coefﬁcients as its elements. New in version 0.11.0.

Parameters n: int

The size of the matrix to create; that is, the result is an n x n matrix.

kind : str, optional

Must be one of ‘symmetric’, ‘lower’, or ‘upper’. Default is ‘symmetric’.

exact : bool, optional

If exact is True, the result is either an array of type numpy.uint64 (if n <= 35) or an

object array of Python long integers. If exact is False, the coefﬁcients in the matrix

are computed using scipy.misc.comb with exact=False. The result will be a

ﬂoating point array, and the values in the array will not be the exact coefﬁcients, but

this version is much faster than exact=True.

Returns p: (n, n) ndarray

The Pascal matrix.

Notes

See http://en.wikipedia.org/wiki/Pascal_matrix for more information about Pascal matrices.

Examples

>>> from scipy.linalg import pascal

>>> pascal(4)

array([[ 1, 1, 1, 1],

[ 1, 2, 3, 4],

[ 1, 3, 6, 10],

[ 1, 4, 10, 20]], dtype=uint64)

>>> pascal(4, kind=’lower’)

array([[1, 0, 0, 0],

[1, 1, 0, 0],

[1, 2, 1, 0],

[1, 3, 3, 1]], dtype=uint64)

>>> pascal(50)[-1,-1]

25477612258980856902730428600L

>>> from scipy.misc import comb

>>> comb(98,49, exact=True)

25477612258980856902730428600L

scipy.linalg.toeplitz(c,r=None)

Construct a Toeplitz matrix.

The Toeplitz matrix has constant diagonals, with c as its ﬁrst column and r as its ﬁrst row. If r is not given, r

== conjugate(c) is assumed.

370 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Parameters c: array_like

First column of the matrix. Whatever the actual shape of c, it will be converted to a

1-D array.

r: array_like

First row of the matrix. If None, r = conjugate(c) is assumed; in this case, if

c[0] is real, the result is a Hermitian matrix. r[0] is ignored; the ﬁrst row of the returned

matrix is [c[0], r[1:]]. Whatever the actual shape of r, it will be converted to a

1-D array.

Returns A: (len(c), len(r)) ndarray

The Toeplitz matrix. Dtype is the same as (c[0] + r[0]).dtype.

See Also

circulant circulant matrix

hankel Hankel matrix

Notes

The behavior when cor ris a scalar, or when cis complex and ris None, was changed in version 0.8.0. The

behavior in previous versions was undocumented and is no longer supported.

Examples

>>> from scipy.linalg import toeplitz

>>> toeplitz([1,2,3], [1,4,5,6])

array([[1, 4, 5, 6],

[2, 1, 4, 5],

[3, 2, 1, 4]])

>>> toeplitz([1.0,2+3j,4-1j])

array([[ 1.+0.j, 2.-3.j, 4.+1.j],

[ 2.+3.j, 1.+0.j, 2.-3.j],

[ 4.-1.j, 2.+3.j, 1.+0.j]])

scipy.linalg.tri(N,M=None,k=0,dtype=None)

Construct (N, M) matrix ﬁlled with ones at and below the k-th diagonal.

The matrix has A[i,j] == 1 for i <= j + k

Parameters N: integer

The size of the ﬁrst dimension of the matrix.

M: integer or None

The size of the second dimension of the matrix. If Mis None, M=Nis assumed.

k: integer

Number of subdiagonal below which matrix is ﬁlled with ones. k= 0 is the main

diagonal, k< 0 subdiagonal and k> 0 superdiagonal.

dtype : dtype

Data type of the matrix.

Returns tri : (N, M) ndarray

Tri matrix.

Examples

>>> from scipy.linalg import tri

>>> tri(3,5,2, dtype=int)

array([[1, 1, 1, 0, 0],

[1, 1, 1, 1, 0],

[1, 1, 1, 1, 1]])

>>> tri(3,5,-1, dtype=int)

5.9. Linear algebra (scipy.linalg) 371

SciPy Reference Guide, Release 0.13.0

array([[0, 0, 0, 0, 0],

[1, 0, 0, 0, 0],

[1, 1, 0, 0, 0]])

5.9.7 Low-level routines

get_blas_funcs(names[, arrays, dtype]) Return available BLAS function objects from names.

get_lapack_funcs(names[, arrays, dtype]) Return available LAPACK function objects from names.

find_best_blas_type([arrays, dtype]) Find best-matching BLAS/LAPACK type.

scipy.linalg.get_blas_funcs(names,arrays=(),dtype=None)

Return available BLAS function objects from names.

Arrays are used to determine the optimal preﬁx of BLAS routines.

Parameters names : str or sequence of str

Name(s) of BLAS functions withouth type preﬁx.

arrays : sequency of ndarrays, optional

Arrays can be given to determine optiomal preﬁx of BLAS routines. If not given,

double-precision routines will be used, otherwise the most generic type in arrays will

be used.

dtype : str or dtype, optional

Data-type speciﬁer. Not used if arrays is non-empty.

Returns funcs : list

List containing the found function(s).

Notes

This routines automatically chooses between Fortran/C interfaces. Fortran code is used whenever possible for

arrays with column major order. In all other cases, C code is preferred.

In BLAS, the naming convention is that all functions start with a type preﬁx, which depends on the type of

the principal matrix. These can be one of {‘s’, ‘d’, ‘c’, ‘z’} for the numpy types {ﬂoat32, ﬂoat64, complex64,

complex128} respectively. The code and the dtype are stored in attributes typecode and dtype of the returned

functions.

scipy.linalg.get_lapack_funcs(names,arrays=(),dtype=None)

Return available LAPACK function objects from names.

Arrays are used to determine the optimal preﬁx of LAPACK routines.

Parameters names : str or sequence of str

Name(s) of LAPACK functions withouth type preﬁx.

arrays : sequency of ndarrays, optional

Arrays can be given to determine optiomal preﬁx of LAPACK routines. If not given,

double-precision routines will be used, otherwise the most generic type in arrays will

be used.

dtype : str or dtype, optional

Data-type speciﬁer. Not used if arrays is non-empty.

Returns funcs : list

List containing the found function(s).

372 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Notes

This routines automatically chooses between Fortran/C interfaces. Fortran code is used whenever possible for

arrays with column major order. In all other cases, C code is preferred.

In LAPACK, the naming convention is that all functions start with a type preﬁx, which depends on the type of

the principal matrix. These can be one of {‘s’, ‘d’, ‘c’, ‘z’} for the numpy types {ﬂoat32, ﬂoat64, complex64,

complex128} respectevely, and are stored in attribute typecode of the returned functions.

scipy.linalg.find_best_blas_type(arrays=(),dtype=None)

Find best-matching BLAS/LAPACK type.

Arrays are used to determine the optimal preﬁx of BLAS routines.

Parameters arrays : sequency of ndarrays, optional

Arrays can be given to determine optiomal preﬁx of BLAS routines. If not given,

double-precision routines will be used, otherwise the most generic type in arrays will

be used.

dtype : str or dtype, optional

Data-type speciﬁer. Not used if arrays is non-empty.

Returns preﬁx : str

BLAS/LAPACK preﬁx character.

dtype : dtype

Inferred Numpy data type.

prefer_fortran : bool

Whether to prefer Fortran order routines over C order.

See Also

scipy.linalg.blas – Low-level BLAS functions

scipy.linalg.lapack – Low-level LAPACK functions

5.10 Low-level BLAS functions

This module contains low-level functions from the BLAS library. New in version 0.12.0.

Warning: These functions do little to no error checking. It is possible to cause crashes by mis-using them, so

prefer using the higher-level routines in scipy.linalg.

5.11 Finding functions

get_blas_funcs(names[, arrays, dtype]) Return available BLAS function objects from names.

find_best_blas_type([arrays, dtype]) Find best-matching BLAS/LAPACK type.

5.12 All functions

caxpy caxpy - Function signature:

ccopy ccopy - Function signature:

cdotc cdotc - Function signature:

Continued on next page

5.12. All functions 373

SciPy Reference Guide, Release 0.13.0

Table 5.72 – continued from previous page

cdotu cdotu - Function signature:

cgemm cgemm - Function signature:

cgemv cgemv - Function signature:

cgerc cgerc - Function signature:

cgeru cgeru - Function signature:

chemm chemm - Function signature:

chemv chemv - Function signature:

cherk cherk - Function signature:

cher2k cher2k - Function signature:

crotg crotg - Function signature:

cscal cscal - Function signature:

csrot csrot - Function signature:

csscal csscal - Function signature:

csymm csymm - Function signature:

csyrk csyrk - Function signature:

csyr2k csyr2k - Function signature:

cswap cswap - Function signature:

ctrmv ctrmv - Function signature:

dasum dasum - Function signature:

daxpy daxpy - Function signature:

dcopy dcopy - Function signature:

ddot ddot - Function signature:

dgemm dgemm - Function signature:

dgemv dgemv - Function signature:

dger dger - Function signature:

dnrm2 dnrm2 - Function signature:

drot drot - Function signature:

drotg drotg - Function signature:

drotm drotm - Function signature:

drotmg drotmg - Function signature:

dscal dscal - Function signature:

dswap dswap - Function signature:

dsymm dsymm - Function signature:

dsymv dsymv - Function signature:

dsyrk dsyrk - Function signature:

dsyr2k dsyr2k - Function signature:

dtrmv dtrmv - Function signature:

dzasum dzasum - Function signature:

dznrm2 dznrm2 - Function signature:

icamax icamax - Function signature:

idamax idamax - Function signature:

isamax isamax - Function signature:

izamax izamax - Function signature:

sasum sasum - Function signature:

saxpy saxpy - Function signature:

scasum scasum - Function signature:

scnrm2 scnrm2 - Function signature:

scopy scopy - Function signature:

sdot sdot - Function signature:

sgemm sgemm - Function signature:

Continued on next page

374 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Table 5.72 – continued from previous page

sgemv sgemv - Function signature:

sger sger - Function signature:

snrm2 snrm2 - Function signature:

srot srot - Function signature:

srotg srotg - Function signature:

srotm srotm - Function signature:

srotmg srotmg - Function signature:

sscal sscal - Function signature:

sswap sswap - Function signature:

ssymm ssymm - Function signature:

ssymv ssymv - Function signature:

ssyrk ssyrk - Function signature:

ssyr2k ssyr2k - Function signature:

strmv strmv - Function signature:

zaxpy zaxpy - Function signature:

zcopy zcopy - Function signature:

zdotc zdotc - Function signature:

zdotu zdotu - Function signature:

zdrot zdrot - Function signature:

zdscal zdscal - Function signature:

zgemm zgemm - Function signature:

zgemv zgemv - Function signature:

zgerc zgerc - Function signature:

zgeru zgeru - Function signature:

zhemm zhemm - Function signature:

zhemv zhemv - Function signature:

zherk zherk - Function signature:

zher2k zher2k - Function signature:

zrotg zrotg - Function signature:

zscal zscal - Function signature:

zsymm zsymm - Function signature:

zsyrk zsyrk - Function signature:

zsyr2k zsyr2k - Function signature:

zswap zswap - Function signature:

ztrmv ztrmv - Function signature:

scipy.linalg.blas.caxpy = <fortran object>

caxpy - Function signature:

z = caxpy(x,y,[n,a,offx,incx,offy,incy])

Required arguments:

x : input rank-1 array(‘F’) with bounds (*) y : input rank-1 array(‘F’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int a := (1.0, 0.0) input complex offx := 0 input int incx := 1

input int offy := 0 input int incy := 1 input int

Return objects:

z : rank-1 array(‘F’) with bounds (*) and y storage

scipy.linalg.blas.ccopy = <fortran object>

5.12. All functions 375

SciPy Reference Guide, Release 0.13.0

ccopy - Function signature:

y = ccopy(x,y,[n,offx,incx,offy,incy])

Required arguments:

x : input rank-1 array(‘F’) with bounds (*) y : input rank-1 array(‘F’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int offy := 0 input int incy

:= 1 input int

Return objects:

y : rank-1 array(‘F’) with bounds (*)

scipy.linalg.blas.cdotc = <fortran cdotc>

cdotc - Function signature:

xy = cdotc(x,y,[n,offx,incx,offy,incy])

Required arguments:

x : input rank-1 array(‘F’) with bounds (*) y : input rank-1 array(‘F’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int offy := 0 input int incy

:= 1 input int

Return objects:

xy : complex

scipy.linalg.blas.cdotu = <fortran cdotu>

cdotu - Function signature:

xy = cdotu(x,y,[n,offx,incx,offy,incy])

Required arguments:

x : input rank-1 array(‘F’) with bounds (*) y : input rank-1 array(‘F’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int offy := 0 input int incy

:= 1 input int

Return objects:

xy : complex

scipy.linalg.blas.cgemm = <fortran object>

cgemm - Function signature:

c = cgemm(alpha,a,b,[beta,c,trans_a,trans_b,overwrite_c])

Required arguments:

alpha : input complex a : input rank-2 array(‘F’) with bounds (lda,ka) b : input rank-2 array(‘F’)

with bounds (ldb,kb)

Optional arguments:

beta := (0.0, 0.0) input complex c : input rank-2 array(‘F’) with bounds (m,n) overwrite_c := 0

input int trans_a := 0 input int trans_b := 0 input int

Return objects:

c : rank-2 array(‘F’) with bounds (m,n)

376 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.linalg.blas.cgemv = <fortran object>

cgemv - Function signature:

y = cgemv(alpha,a,x,[beta,y,offx,incx,offy,incy,trans,overwrite_y])

Required arguments:

alpha : input complex a : input rank-2 array(‘F’) with bounds (m,n) x : input rank-1 array(‘F’)

with bounds (*)

Optional arguments:

beta := (0.0, 0.0) input complex y : input rank-1 array(‘F’) with bounds (ly) overwrite_y := 0

input int offx := 0 input int incx := 1 input int offy := 0 input int incy := 1 input int trans := 0

input int

Return objects:

y : rank-1 array(‘F’) with bounds (ly)

scipy.linalg.blas.cgerc = <fortran object>

cgerc - Function signature:

a = cgerc(alpha,x,y,[incx,incy,a,overwrite_x,overwrite_y,overwrite_a])

Required arguments:

alpha : input complex x : input rank-1 array(‘F’) with bounds (m) y : input rank-1 array(‘F’)

with bounds (n)

Optional arguments:

overwrite_x := 1 input int incx := 1 input int overwrite_y := 1 input int incy := 1 input int a :=

(0.0,0.0) input rank-2 array(‘F’) with bounds (m,n) overwrite_a := 0 input int

Return objects:

a : rank-2 array(‘F’) with bounds (m,n)

scipy.linalg.blas.cgeru = <fortran object>

cgeru - Function signature:

a = cgeru(alpha,x,y,[incx,incy,a,overwrite_x,overwrite_y,overwrite_a])

Required arguments:

alpha : input complex x : input rank-1 array(‘F’) with bounds (m) y : input rank-1 array(‘F’)

with bounds (n)

Optional arguments:

overwrite_x := 1 input int incx := 1 input int overwrite_y := 1 input int incy := 1 input int a :=

(0.0,0.0) input rank-2 array(‘F’) with bounds (m,n) overwrite_a := 0 input int

Return objects:

a : rank-2 array(‘F’) with bounds (m,n)

scipy.linalg.blas.chemm = <fortran object>

chemm - Function signature:

c = chemm(alpha,a,b,[beta,c,side,lower,overwrite_c])

Required arguments:

alpha : input complex a : input rank-2 array(‘F’) with bounds (lda,ka) b : input rank-2 array(‘F’)

with bounds (ldb,kb)

5.12. All functions 377

SciPy Reference Guide, Release 0.13.0

Optional arguments:

beta := (0.0, 0.0) input complex c : input rank-2 array(‘F’) with bounds (m,n) overwrite_c := 0

input int side := 0 input int lower := 0 input int

Return objects:

c : rank-2 array(‘F’) with bounds (m,n)

scipy.linalg.blas.chemv = <fortran object>

chemv - Function signature:

y = chemv(alpha,a,x,[beta,y,offx,incx,offy,incy,lower,overwrite_y])

Required arguments:

alpha : input complex a : input rank-2 array(‘F’) with bounds (n,n) x : input rank-1 array(‘F’)

with bounds (*)

Optional arguments:

beta := (0.0, 0.0) input complex y : input rank-1 array(‘F’) with bounds (ly) overwrite_y := 0

input int offx := 0 input int incx := 1 input int offy := 0 input int incy := 1 input int lower := 0

input int

Return objects:

y : rank-1 array(‘F’) with bounds (ly)

scipy.linalg.blas.cherk = <fortran object>

cherk - Function signature:

c = cherk(alpha,a,[beta,c,trans,lower,overwrite_c])

Required arguments:

alpha : input complex a : input rank-2 array(‘F’) with bounds (lda,ka)

Optional arguments:

beta := (0.0, 0.0) input complex c : input rank-2 array(‘F’) with bounds (n,n) overwrite_c := 0

input int trans := 0 input int lower := 0 input int

Return objects:

c : rank-2 array(‘F’) with bounds (n,n)

scipy.linalg.blas.cher2k = <fortran object>

cher2k - Function signature:

c = cher2k(alpha,a,b,[beta,c,trans,lower,overwrite_c])

Required arguments:

alpha : input complex a : input rank-2 array(‘F’) with bounds (lda,ka) b : input rank-2 array(‘F’)

with bounds (ldb,kb)

Optional arguments:

beta := (0.0, 0.0) input complex c : input rank-2 array(‘F’) with bounds (n,n) overwrite_c := 0

input int trans := 0 input int lower := 0 input int

Return objects:

c : rank-2 array(‘F’) with bounds (n,n)

scipy.linalg.blas.crotg = <fortran object>

crotg - Function signature:

c,s = crotg(a,b)

378 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Required arguments:

a : input complex b : input complex

Return objects:

c : complex s : complex

scipy.linalg.blas.cscal = <fortran object>

cscal - Function signature:

x = cscal(a,x,[n,offx,incx])

Required arguments:

a : input complex x : input rank-1 array(‘F’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int

Return objects:

x : rank-1 array(‘F’) with bounds (*)

scipy.linalg.blas.csrot = <fortran object>

csrot - Function signature:

x,y = csrot(x,y,c,s,[n,offx,incx,offy,incy,overwrite_x,overwrite_y])

Required arguments:

x : input rank-1 array(‘F’) with bounds (*) y : input rank-1 array(‘F’) with bounds (*) c : input

ﬂoat s : input ﬂoat

Optional arguments:

n := (len(x)-offx)/abs(incx) input int overwrite_x := 0 input int offx := 0 input int incx := 1 input

int overwrite_y := 0 input int offy := 0 input int incy := 1 input int

Return objects:

x : rank-1 array(‘F’) with bounds (*) y : rank-1 array(‘F’) with bounds (*)

scipy.linalg.blas.csscal = <fortran object>

csscal - Function signature:

x = csscal(a,x,[n,offx,incx,overwrite_x])

Required arguments:

a : input ﬂoat x : input rank-1 array(‘F’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int overwrite_x := 0 input int offx := 0 input int incx := 1 input

int

Return objects:

x : rank-1 array(‘F’) with bounds (*)

scipy.linalg.blas.csymm = <fortran object>

csymm - Function signature:

c = csymm(alpha,a,b,[beta,c,side,lower,overwrite_c])

Required arguments:

alpha : input complex a : input rank-2 array(‘F’) with bounds (lda,ka) b : input rank-2 array(‘F’)

with bounds (ldb,kb)

5.12. All functions 379

SciPy Reference Guide, Release 0.13.0

Optional arguments:

beta := (0.0, 0.0) input complex c : input rank-2 array(‘F’) with bounds (m,n) overwrite_c := 0

input int side := 0 input int lower := 0 input int

Return objects:

c : rank-2 array(‘F’) with bounds (m,n)

scipy.linalg.blas.csyrk = <fortran object>

csyrk - Function signature:

c = csyrk(alpha,a,[beta,c,trans,lower,overwrite_c])

Required arguments:

alpha : input complex a : input rank-2 array(‘F’) with bounds (lda,ka)

Optional arguments:

beta := (0.0, 0.0) input complex c : input rank-2 array(‘F’) with bounds (n,n) overwrite_c := 0

input int trans := 0 input int lower := 0 input int

Return objects:

c : rank-2 array(‘F’) with bounds (n,n)

scipy.linalg.blas.csyr2k = <fortran object>

csyr2k - Function signature:

c = csyr2k(alpha,a,b,[beta,c,trans,lower,overwrite_c])

Required arguments:

alpha : input complex a : input rank-2 array(‘F’) with bounds (lda,ka) b : input rank-2 array(‘F’)

with bounds (ldb,kb)

Optional arguments:

beta := (0.0, 0.0) input complex c : input rank-2 array(‘F’) with bounds (n,n) overwrite_c := 0

input int trans := 0 input int lower := 0 input int

Return objects:

c : rank-2 array(‘F’) with bounds (n,n)

scipy.linalg.blas.cswap = <fortran object>

cswap - Function signature:

x,y = cswap(x,y,[n,offx,incx,offy,incy])

Required arguments:

x : input rank-1 array(‘F’) with bounds (*) y : input rank-1 array(‘F’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int offy := 0 input int incy

:= 1 input int

Return objects:

x : rank-1 array(‘F’) with bounds (*) y : rank-1 array(‘F’) with bounds (*)

scipy.linalg.blas.ctrmv = <fortran object>

ctrmv - Function signature:

x = ctrmv(a,x,[offx,incx,lower,trans,unitdiag,overwrite_x])

380 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Required arguments:

a : input rank-2 array(‘F’) with bounds (n,n) x : input rank-1 array(‘F’) with bounds (*)

Optional arguments:

overwrite_x := 0 input int offx := 0 input int incx := 1 input int lower := 0 input int trans := 0

input int unitdiag := 0 input int

Return objects:

x : rank-1 array(‘F’) with bounds (*)

scipy.linalg.blas.dasum = <fortran dasum>

dasum - Function signature:

s = dasum(x,[n,offx,incx])

Required arguments:

x : input rank-1 array(‘d’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int

Return objects:

s : ﬂoat

scipy.linalg.blas.daxpy = <fortran object>

daxpy - Function signature:

z = daxpy(x,y,[n,a,offx,incx,offy,incy])

Required arguments:

x : input rank-1 array(‘d’) with bounds (*) y : input rank-1 array(‘d’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int a := 1.0 input ﬂoat offx := 0 input int incx := 1 input int offy

:= 0 input int incy := 1 input int

Return objects:

z : rank-1 array(‘d’) with bounds (*) and y storage

scipy.linalg.blas.dcopy = <fortran object>

dcopy - Function signature:

y = dcopy(x,y,[n,offx,incx,offy,incy])

Required arguments:

x : input rank-1 array(‘d’) with bounds (*) y : input rank-1 array(‘d’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int offy := 0 input int incy

:= 1 input int

Return objects:

y : rank-1 array(‘d’) with bounds (*)

scipy.linalg.blas.ddot = <fortran ddot>

ddot - Function signature:

xy = ddot(x,y,[n,offx,incx,offy,incy])

5.12. All functions 381

SciPy Reference Guide, Release 0.13.0

Required arguments:

x : input rank-1 array(‘d’) with bounds (*) y : input rank-1 array(‘d’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int offy := 0 input int incy

:= 1 input int

Return objects:

xy : ﬂoat

scipy.linalg.blas.dgemm = <fortran object>

dgemm - Function signature:

c = dgemm(alpha,a,b,[beta,c,trans_a,trans_b,overwrite_c])

Required arguments:

alpha : input ﬂoat a : input rank-2 array(‘d’) with bounds (lda,ka) b : input rank-2 array(‘d’) with

bounds (ldb,kb)

Optional arguments:

beta := 0.0 input ﬂoat c : input rank-2 array(‘d’) with bounds (m,n) overwrite_c := 0 input int

trans_a := 0 input int trans_b := 0 input int

Return objects:

c : rank-2 array(‘d’) with bounds (m,n)

scipy.linalg.blas.dgemv = <fortran object>

dgemv - Function signature:

y = dgemv(alpha,a,x,[beta,y,offx,incx,offy,incy,trans,overwrite_y])

Required arguments:

alpha : input ﬂoat a : input rank-2 array(‘d’) with bounds (m,n) x : input rank-1 array(‘d’) with

bounds (*)

Optional arguments:

beta := 0.0 input ﬂoat y : input rank-1 array(‘d’) with bounds (ly) overwrite_y := 0 input int offx

:= 0 input int incx := 1 input int offy := 0 input int incy := 1 input int trans := 0 input int

Return objects:

y : rank-1 array(‘d’) with bounds (ly)

scipy.linalg.blas.dger = <fortran object>

dger - Function signature:

a = dger(alpha,x,y,[incx,incy,a,overwrite_x,overwrite_y,overwrite_a])

Required arguments:

alpha : input ﬂoat x : input rank-1 array(‘d’) with bounds (m) y : input rank-1 array(‘d’) with

bounds (n)

Optional arguments:

overwrite_x := 1 input int incx := 1 input int overwrite_y := 1 input int incy := 1 input int a := 0.0

input rank-2 array(‘d’) with bounds (m,n) overwrite_a := 0 input int

Return objects:

a : rank-2 array(‘d’) with bounds (m,n)

scipy.linalg.blas.dnrm2 = <fortran dnrm2>

382 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

dnrm2 - Function signature:

n2 = dnrm2(x,[n,offx,incx])

Required arguments:

x : input rank-1 array(‘d’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int

Return objects:

n2 : ﬂoat

scipy.linalg.blas.drot = <fortran object>

drot - Function signature:

x,y = drot(x,y,c,s,[n,offx,incx,offy,incy,overwrite_x,overwrite_y])

Required arguments:

x : input rank-1 array(‘d’) with bounds (*) y : input rank-1 array(‘d’) with bounds (*) c : input

ﬂoat s : input ﬂoat

Optional arguments:

n := (len(x)-offx)/abs(incx) input int overwrite_x := 0 input int offx := 0 input int incx := 1 input

int overwrite_y := 0 input int offy := 0 input int incy := 1 input int

Return objects:

x : rank-1 array(‘d’) with bounds (*) y : rank-1 array(‘d’) with bounds (*)

scipy.linalg.blas.drotg = <fortran object>

drotg - Function signature:

c,s = drotg(a,b)

Required arguments:

a : input ﬂoat b : input ﬂoat

Return objects:

c : ﬂoat s : ﬂoat

scipy.linalg.blas.drotm = <fortran object>

drotm - Function signature:

x,y = drotm(x,y,param,[n,offx,incx,offy,incy,overwrite_x,overwrite_y])

Required arguments:

x : input rank-1 array(‘d’) with bounds (*) y : input rank-1 array(‘d’) with bounds (*) param :

input rank-1 array(‘d’) with bounds (5)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int overwrite_x := 0 input int offx := 0 input int incx := 1 input

int overwrite_y := 0 input int offy := 0 input int incy := 1 input int

Return objects:

x : rank-1 array(‘d’) with bounds (*) y : rank-1 array(‘d’) with bounds (*)

scipy.linalg.blas.drotmg = <fortran object>

drotmg - Function signature:

param = drotmg(d1,d2,x1,y1)

5.12. All functions 383

SciPy Reference Guide, Release 0.13.0

Required arguments:

d1 : input ﬂoat d2 : input ﬂoat x1 : input ﬂoat y1 : input ﬂoat

Return objects:

param : rank-1 array(‘d’) with bounds (5)

scipy.linalg.blas.dscal = <fortran object>

dscal - Function signature:

x = dscal(a,x,[n,offx,incx])

Required arguments:

a : input ﬂoat x : input rank-1 array(‘d’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int

Return objects:

x : rank-1 array(‘d’) with bounds (*)

scipy.linalg.blas.dswap = <fortran object>

dswap - Function signature:

x,y = dswap(x,y,[n,offx,incx,offy,incy])

Required arguments:

x : input rank-1 array(‘d’) with bounds (*) y : input rank-1 array(‘d’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int offy := 0 input int incy

:= 1 input int

Return objects:

x : rank-1 array(‘d’) with bounds (*) y : rank-1 array(‘d’) with bounds (*)

scipy.linalg.blas.dsymm = <fortran object>

dsymm - Function signature:

c = dsymm(alpha,a,b,[beta,c,side,lower,overwrite_c])

Required arguments:

alpha : input ﬂoat a : input rank-2 array(‘d’) with bounds (lda,ka) b : input rank-2 array(‘d’) with

bounds (ldb,kb)

Optional arguments:

beta := 0.0 input ﬂoat c : input rank-2 array(‘d’) with bounds (m,n) overwrite_c := 0 input int

side := 0 input int lower := 0 input int

Return objects:

c : rank-2 array(‘d’) with bounds (m,n)

scipy.linalg.blas.dsymv = <fortran object>

dsymv - Function signature:

y = dsymv(alpha,a,x,[beta,y,offx,incx,offy,incy,lower,overwrite_y])

Required arguments:

alpha : input ﬂoat a : input rank-2 array(‘d’) with bounds (n,n) x : input rank-1 array(‘d’) with

bounds (*)

384 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Optional arguments:

beta := 0.0 input ﬂoat y : input rank-1 array(‘d’) with bounds (ly) overwrite_y := 0 input int offx

:= 0 input int incx := 1 input int offy := 0 input int incy := 1 input int lower := 0 input int

Return objects:

y : rank-1 array(‘d’) with bounds (ly)

scipy.linalg.blas.dsyrk = <fortran object>

dsyrk - Function signature:

c = dsyrk(alpha,a,[beta,c,trans,lower,overwrite_c])

Required arguments:

alpha : input ﬂoat a : input rank-2 array(‘d’) with bounds (lda,ka)

Optional arguments:

beta := 0.0 input ﬂoat c : input rank-2 array(‘d’) with bounds (n,n) overwrite_c := 0 input int

trans := 0 input int lower := 0 input int

Return objects:

c : rank-2 array(‘d’) with bounds (n,n)

scipy.linalg.blas.dsyr2k = <fortran object>

dsyr2k - Function signature:

c = dsyr2k(alpha,a,b,[beta,c,trans,lower,overwrite_c])

Required arguments:

alpha : input ﬂoat a : input rank-2 array(‘d’) with bounds (lda,ka) b : input rank-2 array(‘d’) with

bounds (ldb,kb)

Optional arguments:

beta := 0.0 input ﬂoat c : input rank-2 array(‘d’) with bounds (n,n) overwrite_c := 0 input int

trans := 0 input int lower := 0 input int

Return objects:

c : rank-2 array(‘d’) with bounds (n,n)

scipy.linalg.blas.dtrmv = <fortran object>

dtrmv - Function signature:

x = dtrmv(a,x,[offx,incx,lower,trans,unitdiag,overwrite_x])

Required arguments:

a : input rank-2 array(‘d’) with bounds (n,n) x : input rank-1 array(‘d’) with bounds (*)

Optional arguments:

overwrite_x := 0 input int offx := 0 input int incx := 1 input int lower := 0 input int trans := 0

input int unitdiag := 0 input int

Return objects:

x : rank-1 array(‘d’) with bounds (*)

scipy.linalg.blas.dzasum = <fortran dzasum>

dzasum - Function signature:

s = dzasum(x,[n,offx,incx])

5.12. All functions 385

SciPy Reference Guide, Release 0.13.0

Required arguments:

x : input rank-1 array(‘D’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int

Return objects:

s : ﬂoat

scipy.linalg.blas.dznrm2 = <fortran dznrm2>

dznrm2 - Function signature:

n2 = dznrm2(x,[n,offx,incx])

Required arguments:

x : input rank-1 array(‘D’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int

Return objects:

n2 : ﬂoat

scipy.linalg.blas.icamax = <fortran object>

icamax - Function signature:

k = icamax(x,[n,offx,incx])

Required arguments:

x : input rank-1 array(‘F’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int

Return objects:

k : int

scipy.linalg.blas.idamax = <fortran object>

idamax - Function signature:

k = idamax(x,[n,offx,incx])

Required arguments:

x : input rank-1 array(‘d’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int

Return objects:

k : int

scipy.linalg.blas.isamax = <fortran object>

isamax - Function signature:

k = isamax(x,[n,offx,incx])

Required arguments:

x : input rank-1 array(‘f’) with bounds (*)

386 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int

Return objects:

k : int

scipy.linalg.blas.izamax = <fortran object>

izamax - Function signature:

k = izamax(x,[n,offx,incx])

Required arguments:

x : input rank-1 array(‘D’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int

Return objects:

k : int

scipy.linalg.blas.sasum = <fortran sasum>

sasum - Function signature:

s = sasum(x,[n,offx,incx])

Required arguments:

x : input rank-1 array(‘f’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int

Return objects:

s : ﬂoat

scipy.linalg.blas.saxpy = <fortran object>

saxpy - Function signature:

z = saxpy(x,y,[n,a,offx,incx,offy,incy])

Required arguments:

x : input rank-1 array(‘f’) with bounds (*) y : input rank-1 array(‘f’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int a := 1.0 input ﬂoat offx := 0 input int incx := 1 input int offy

:= 0 input int incy := 1 input int

Return objects:

z : rank-1 array(‘f’) with bounds (*) and y storage

scipy.linalg.blas.scasum = <fortran scasum>

scasum - Function signature:

s = scasum(x,[n,offx,incx])

Required arguments:

x : input rank-1 array(‘F’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int

5.12. All functions 387

SciPy Reference Guide, Release 0.13.0

Return objects:

s : ﬂoat

scipy.linalg.blas.scnrm2 = <fortran scnrm2>

scnrm2 - Function signature:

n2 = scnrm2(x,[n,offx,incx])

Required arguments:

x : input rank-1 array(‘F’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int

Return objects:

n2 : ﬂoat

scipy.linalg.blas.scopy = <fortran object>

scopy - Function signature:

y = scopy(x,y,[n,offx,incx,offy,incy])

Required arguments:

x : input rank-1 array(‘f’) with bounds (*) y : input rank-1 array(‘f’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int offy := 0 input int incy

:= 1 input int

Return objects:

y : rank-1 array(‘f’) with bounds (*)

scipy.linalg.blas.sdot = <fortran sdot>

sdot - Function signature:

xy = sdot(x,y,[n,offx,incx,offy,incy])

Required arguments:

x : input rank-1 array(‘f’) with bounds (*) y : input rank-1 array(‘f’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int offy := 0 input int incy

:= 1 input int

Return objects:

xy : ﬂoat

scipy.linalg.blas.sgemm = <fortran object>

sgemm - Function signature:

c = sgemm(alpha,a,b,[beta,c,trans_a,trans_b,overwrite_c])

Required arguments:

alpha : input ﬂoat a : input rank-2 array(‘f’) with bounds (lda,ka) b : input rank-2 array(‘f’) with

bounds (ldb,kb)

Optional arguments:

beta := 0.0 input ﬂoat c : input rank-2 array(‘f’) with bounds (m,n) overwrite_c := 0 input int

trans_a := 0 input int trans_b := 0 input int

388 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Return objects:

c : rank-2 array(‘f’) with bounds (m,n)

scipy.linalg.blas.sgemv = <fortran object>

sgemv - Function signature:

y = sgemv(alpha,a,x,[beta,y,offx,incx,offy,incy,trans,overwrite_y])

Required arguments:

alpha : input ﬂoat a : input rank-2 array(‘f’) with bounds (m,n) x : input rank-1 array(‘f’) with

bounds (*)

Optional arguments:

beta := 0.0 input ﬂoat y : input rank-1 array(‘f’) with bounds (ly) overwrite_y := 0 input int offx

:= 0 input int incx := 1 input int offy := 0 input int incy := 1 input int trans := 0 input int

Return objects:

y : rank-1 array(‘f’) with bounds (ly)

scipy.linalg.blas.sger = <fortran object>

sger - Function signature:

a = sger(alpha,x,y,[incx,incy,a,overwrite_x,overwrite_y,overwrite_a])

Required arguments:

alpha : input ﬂoat x : input rank-1 array(‘f’) with bounds (m) y : input rank-1 array(‘f’) with

bounds (n)

Optional arguments:

overwrite_x := 1 input int incx := 1 input int overwrite_y := 1 input int incy := 1 input int a := 0.0

input rank-2 array(‘f’) with bounds (m,n) overwrite_a := 0 input int

Return objects:

a : rank-2 array(‘f’) with bounds (m,n)

scipy.linalg.blas.snrm2 = <fortran snrm2>

snrm2 - Function signature:

n2 = snrm2(x,[n,offx,incx])

Required arguments:

x : input rank-1 array(‘f’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int

Return objects:

n2 : ﬂoat

scipy.linalg.blas.srot = <fortran object>

srot - Function signature:

x,y = srot(x,y,c,s,[n,offx,incx,offy,incy,overwrite_x,overwrite_y])

Required arguments:

x : input rank-1 array(‘f’) with bounds (*) y : input rank-1 array(‘f’) with bounds (*) c : input

ﬂoat s : input ﬂoat

5.12. All functions 389

SciPy Reference Guide, Release 0.13.0

Optional arguments:

n := (len(x)-offx)/abs(incx) input int overwrite_x := 0 input int offx := 0 input int incx := 1 input

int overwrite_y := 0 input int offy := 0 input int incy := 1 input int

Return objects:

x : rank-1 array(‘f’) with bounds (*) y : rank-1 array(‘f’) with bounds (*)

scipy.linalg.blas.srotg = <fortran object>

srotg - Function signature:

c,s = srotg(a,b)

Required arguments:

a : input ﬂoat b : input ﬂoat

Return objects:

c : ﬂoat s : ﬂoat

scipy.linalg.blas.srotm = <fortran object>

srotm - Function signature:

x,y = srotm(x,y,param,[n,offx,incx,offy,incy,overwrite_x,overwrite_y])

Required arguments:

x : input rank-1 array(‘f’) with bounds (*) y : input rank-1 array(‘f’) with bounds (*) param :

input rank-1 array(‘f’) with bounds (5)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int overwrite_x := 0 input int offx := 0 input int incx := 1 input

int overwrite_y := 0 input int offy := 0 input int incy := 1 input int

Return objects:

x : rank-1 array(‘f’) with bounds (*) y : rank-1 array(‘f’) with bounds (*)

scipy.linalg.blas.srotmg = <fortran object>

srotmg - Function signature:

param = srotmg(d1,d2,x1,y1)

Required arguments:

d1 : input ﬂoat d2 : input ﬂoat x1 : input ﬂoat y1 : input ﬂoat

Return objects:

param : rank-1 array(‘f’) with bounds (5)

scipy.linalg.blas.sscal = <fortran object>

sscal - Function signature:

x = sscal(a,x,[n,offx,incx])

Required arguments:

a : input ﬂoat x : input rank-1 array(‘f’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int

Return objects:

x : rank-1 array(‘f’) with bounds (*)

390 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.linalg.blas.sswap = <fortran object>

sswap - Function signature:

x,y = sswap(x,y,[n,offx,incx,offy,incy])

Required arguments:

x : input rank-1 array(‘f’) with bounds (*) y : input rank-1 array(‘f’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int offy := 0 input int incy

:= 1 input int

Return objects:

x : rank-1 array(‘f’) with bounds (*) y : rank-1 array(‘f’) with bounds (*)

scipy.linalg.blas.ssymm = <fortran object>

ssymm - Function signature:

c = ssymm(alpha,a,b,[beta,c,side,lower,overwrite_c])

Required arguments:

alpha : input ﬂoat a : input rank-2 array(‘f’) with bounds (lda,ka) b : input rank-2 array(‘f’) with

bounds (ldb,kb)

Optional arguments:

beta := 0.0 input ﬂoat c : input rank-2 array(‘f’) with bounds (m,n) overwrite_c := 0 input int side

:= 0 input int lower := 0 input int

Return objects:

c : rank-2 array(‘f’) with bounds (m,n)

scipy.linalg.blas.ssymv = <fortran object>

ssymv - Function signature:

y = ssymv(alpha,a,x,[beta,y,offx,incx,offy,incy,lower,overwrite_y])

Required arguments:

alpha : input ﬂoat a : input rank-2 array(‘f’) with bounds (n,n) x : input rank-1 array(‘f’) with

bounds (*)

Optional arguments:

beta := 0.0 input ﬂoat y : input rank-1 array(‘f’) with bounds (ly) overwrite_y := 0 input int offx

:= 0 input int incx := 1 input int offy := 0 input int incy := 1 input int lower := 0 input int

Return objects:

y : rank-1 array(‘f’) with bounds (ly)

scipy.linalg.blas.ssyrk = <fortran object>

ssyrk - Function signature:

c = ssyrk(alpha,a,[beta,c,trans,lower,overwrite_c])

Required arguments:

alpha : input ﬂoat a : input rank-2 array(‘f’) with bounds (lda,ka)

Optional arguments:

beta := 0.0 input ﬂoat c : input rank-2 array(‘f’) with bounds (n,n) overwrite_c := 0 input int trans

:= 0 input int lower := 0 input int

5.12. All functions 391

SciPy Reference Guide, Release 0.13.0

Return objects:

c : rank-2 array(‘f’) with bounds (n,n)

scipy.linalg.blas.ssyr2k = <fortran object>

ssyr2k - Function signature:

c = ssyr2k(alpha,a,b,[beta,c,trans,lower,overwrite_c])

Required arguments:

alpha : input ﬂoat a : input rank-2 array(‘f’) with bounds (lda,ka) b : input rank-2 array(‘f’) with

bounds (ldb,kb)

Optional arguments:

beta := 0.0 input ﬂoat c : input rank-2 array(‘f’) with bounds (n,n) overwrite_c := 0 input int trans

:= 0 input int lower := 0 input int

Return objects:

c : rank-2 array(‘f’) with bounds (n,n)

scipy.linalg.blas.strmv = <fortran object>

strmv - Function signature:

x = strmv(a,x,[offx,incx,lower,trans,unitdiag,overwrite_x])

Required arguments:

a : input rank-2 array(‘f’) with bounds (n,n) x : input rank-1 array(‘f’) with bounds (*)

Optional arguments:

overwrite_x := 0 input int offx := 0 input int incx := 1 input int lower := 0 input int trans := 0

input int unitdiag := 0 input int

Return objects:

x : rank-1 array(‘f’) with bounds (*)

scipy.linalg.blas.zaxpy = <fortran object>

zaxpy - Function signature:

z = zaxpy(x,y,[n,a,offx,incx,offy,incy])

Required arguments:

x : input rank-1 array(‘D’) with bounds (*) y : input rank-1 array(‘D’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int a := (1.0, 0.0) input complex offx := 0 input int incx := 1

input int offy := 0 input int incy := 1 input int

Return objects:

z : rank-1 array(‘D’) with bounds (*) and y storage

scipy.linalg.blas.zcopy = <fortran object>

zcopy - Function signature:

y = zcopy(x,y,[n,offx,incx,offy,incy])

Required arguments:

x : input rank-1 array(‘D’) with bounds (*) y : input rank-1 array(‘D’) with bounds (*)

392 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int offy := 0 input int incy

:= 1 input int

Return objects:

y : rank-1 array(‘D’) with bounds (*)

scipy.linalg.blas.zdotc = <fortran zdotc>

zdotc - Function signature:

xy = zdotc(x,y,[n,offx,incx,offy,incy])

Required arguments:

x : input rank-1 array(‘D’) with bounds (*) y : input rank-1 array(‘D’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int offy := 0 input int incy

:= 1 input int

Return objects:

xy : complex

scipy.linalg.blas.zdotu = <fortran zdotu>

zdotu - Function signature:

xy = zdotu(x,y,[n,offx,incx,offy,incy])

Required arguments:

x : input rank-1 array(‘D’) with bounds (*) y : input rank-1 array(‘D’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int offy := 0 input int incy

:= 1 input int

Return objects:

xy : complex

scipy.linalg.blas.zdrot = <fortran object>

zdrot - Function signature:

x,y = zdrot(x,y,c,s,[n,offx,incx,offy,incy,overwrite_x,overwrite_y])

Required arguments:

x : input rank-1 array(‘D’) with bounds (*) y : input rank-1 array(‘D’) with bounds (*) c : input

ﬂoat s : input ﬂoat

Optional arguments:

n := (len(x)-offx)/abs(incx) input int overwrite_x := 0 input int offx := 0 input int incx := 1 input

int overwrite_y := 0 input int offy := 0 input int incy := 1 input int

Return objects:

x : rank-1 array(‘D’) with bounds (*) y : rank-1 array(‘D’) with bounds (*)

scipy.linalg.blas.zdscal = <fortran object>

zdscal - Function signature:

x = zdscal(a,x,[n,offx,incx,overwrite_x])

5.12. All functions 393

SciPy Reference Guide, Release 0.13.0

Required arguments:

a : input ﬂoat x : input rank-1 array(‘D’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int overwrite_x := 0 input int offx := 0 input int incx := 1 input

int

Return objects:

x : rank-1 array(‘D’) with bounds (*)

scipy.linalg.blas.zgemm = <fortran object>

zgemm - Function signature:

c = zgemm(alpha,a,b,[beta,c,trans_a,trans_b,overwrite_c])

Required arguments:

alpha : input complex a : input rank-2 array(‘D’) with bounds (lda,ka) b : input rank-2 array(‘D’)

with bounds (ldb,kb)

Optional arguments:

beta := (0.0, 0.0) input complex c : input rank-2 array(‘D’) with bounds (m,n) overwrite_c := 0

input int trans_a := 0 input int trans_b := 0 input int

Return objects:

c : rank-2 array(‘D’) with bounds (m,n)

scipy.linalg.blas.zgemv = <fortran object>

zgemv - Function signature:

y = zgemv(alpha,a,x,[beta,y,offx,incx,offy,incy,trans,overwrite_y])

Required arguments:

alpha : input complex a : input rank-2 array(‘D’) with bounds (m,n) x : input rank-1 array(‘D’)

with bounds (*)

Optional arguments:

beta := (0.0, 0.0) input complex y : input rank-1 array(‘D’) with bounds (ly) overwrite_y := 0

input int offx := 0 input int incx := 1 input int offy := 0 input int incy := 1 input int trans := 0

input int

Return objects:

y : rank-1 array(‘D’) with bounds (ly)

scipy.linalg.blas.zgerc = <fortran object>

zgerc - Function signature:

a = zgerc(alpha,x,y,[incx,incy,a,overwrite_x,overwrite_y,overwrite_a])

Required arguments:

alpha : input complex x : input rank-1 array(‘D’) with bounds (m) y : input rank-1 array(‘D’)

with bounds (n)

Optional arguments:

overwrite_x := 1 input int incx := 1 input int overwrite_y := 1 input int incy := 1 input int a :=

(0.0,0.0) input rank-2 array(‘D’) with bounds (m,n) overwrite_a := 0 input int

Return objects:

a : rank-2 array(‘D’) with bounds (m,n)

394 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.linalg.blas.zgeru = <fortran object>

zgeru - Function signature:

a = zgeru(alpha,x,y,[incx,incy,a,overwrite_x,overwrite_y,overwrite_a])

Required arguments:

alpha : input complex x : input rank-1 array(‘D’) with bounds (m) y : input rank-1 array(‘D’)

with bounds (n)

Optional arguments:

overwrite_x := 1 input int incx := 1 input int overwrite_y := 1 input int incy := 1 input int a :=

(0.0,0.0) input rank-2 array(‘D’) with bounds (m,n) overwrite_a := 0 input int

Return objects:

a : rank-2 array(‘D’) with bounds (m,n)

scipy.linalg.blas.zhemm = <fortran object>

zhemm - Function signature:

c = zhemm(alpha,a,b,[beta,c,side,lower,overwrite_c])

Required arguments:

alpha : input complex a : input rank-2 array(‘D’) with bounds (lda,ka) b : input rank-2 array(‘D’)

with bounds (ldb,kb)

Optional arguments:

beta := (0.0, 0.0) input complex c : input rank-2 array(‘D’) with bounds (m,n) overwrite_c := 0

input int side := 0 input int lower := 0 input int

Return objects:

c : rank-2 array(‘D’) with bounds (m,n)

scipy.linalg.blas.zhemv = <fortran object>

zhemv - Function signature:

y = zhemv(alpha,a,x,[beta,y,offx,incx,offy,incy,lower,overwrite_y])

Required arguments:

alpha : input complex a : input rank-2 array(‘D’) with bounds (n,n) x : input rank-1 array(‘D’)

with bounds (*)

Optional arguments:

beta := (0.0, 0.0) input complex y : input rank-1 array(‘D’) with bounds (ly) overwrite_y := 0

input int offx := 0 input int incx := 1 input int offy := 0 input int incy := 1 input int lower := 0

input int

Return objects:

y : rank-1 array(‘D’) with bounds (ly)

scipy.linalg.blas.zherk = <fortran object>

zherk - Function signature:

c = zherk(alpha,a,[beta,c,trans,lower,overwrite_c])

Required arguments:

alpha : input complex a : input rank-2 array(‘D’) with bounds (lda,ka)

5.12. All functions 395

SciPy Reference Guide, Release 0.13.0

Optional arguments:

beta := (0.0, 0.0) input complex c : input rank-2 array(‘D’) with bounds (n,n) overwrite_c := 0

input int trans := 0 input int lower := 0 input int

Return objects:

c : rank-2 array(‘D’) with bounds (n,n)

scipy.linalg.blas.zher2k = <fortran object>

zher2k - Function signature:

c = zher2k(alpha,a,b,[beta,c,trans,lower,overwrite_c])

Required arguments:

alpha : input complex a : input rank-2 array(‘D’) with bounds (lda,ka) b : input rank-2 array(‘D’)

with bounds (ldb,kb)

Optional arguments:

beta := (0.0, 0.0) input complex c : input rank-2 array(‘D’) with bounds (n,n) overwrite_c := 0

input int trans := 0 input int lower := 0 input int

Return objects:

c : rank-2 array(‘D’) with bounds (n,n)

scipy.linalg.blas.zrotg = <fortran object>

zrotg - Function signature:

c,s = zrotg(a,b)

Required arguments:

a : input complex b : input complex

Return objects:

c : complex s : complex

scipy.linalg.blas.zscal = <fortran object>

zscal - Function signature:

x = zscal(a,x,[n,offx,incx])

Required arguments:

a : input complex x : input rank-1 array(‘D’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int

Return objects:

x : rank-1 array(‘D’) with bounds (*)

scipy.linalg.blas.zsymm = <fortran object>

zsymm - Function signature:

c = zsymm(alpha,a,b,[beta,c,side,lower,overwrite_c])

Required arguments:

alpha : input complex a : input rank-2 array(‘D’) with bounds (lda,ka) b : input rank-2 array(‘D’)

with bounds (ldb,kb)

396 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Optional arguments:

beta := (0.0, 0.0) input complex c : input rank-2 array(‘D’) with bounds (m,n) overwrite_c := 0

input int side := 0 input int lower := 0 input int

Return objects:

c : rank-2 array(‘D’) with bounds (m,n)

scipy.linalg.blas.zsyrk = <fortran object>

zsyrk - Function signature:

c = zsyrk(alpha,a,[beta,c,trans,lower,overwrite_c])

Required arguments:

alpha : input complex a : input rank-2 array(‘D’) with bounds (lda,ka)

Optional arguments:

beta := (0.0, 0.0) input complex c : input rank-2 array(‘D’) with bounds (n,n) overwrite_c := 0

input int trans := 0 input int lower := 0 input int

Return objects:

c : rank-2 array(‘D’) with bounds (n,n)

scipy.linalg.blas.zsyr2k = <fortran object>

zsyr2k - Function signature:

c = zsyr2k(alpha,a,b,[beta,c,trans,lower,overwrite_c])

Required arguments:

alpha : input complex a : input rank-2 array(‘D’) with bounds (lda,ka) b : input rank-2 array(‘D’)

with bounds (ldb,kb)

Optional arguments:

beta := (0.0, 0.0) input complex c : input rank-2 array(‘D’) with bounds (n,n) overwrite_c := 0

input int trans := 0 input int lower := 0 input int

Return objects:

c : rank-2 array(‘D’) with bounds (n,n)

scipy.linalg.blas.zswap = <fortran object>

zswap - Function signature:

x,y = zswap(x,y,[n,offx,incx,offy,incy])

Required arguments:

x : input rank-1 array(‘D’) with bounds (*) y : input rank-1 array(‘D’) with bounds (*)

Optional arguments:

n := (len(x)-offx)/abs(incx) input int offx := 0 input int incx := 1 input int offy := 0 input int incy

:= 1 input int

Return objects:

x : rank-1 array(‘D’) with bounds (*) y : rank-1 array(‘D’) with bounds (*)

scipy.linalg.blas.ztrmv = <fortran object>

ztrmv - Function signature:

x = ztrmv(a,x,[offx,incx,lower,trans,unitdiag,overwrite_x])

5.12. All functions 397

SciPy Reference Guide, Release 0.13.0

Required arguments:

a : input rank-2 array(‘D’) with bounds (n,n) x : input rank-1 array(‘D’) with bounds (*)

Optional arguments:

overwrite_x := 0 input int offx := 0 input int incx := 1 input int lower := 0 input int trans := 0

input int unitdiag := 0 input int

Return objects:

x : rank-1 array(‘D’) with bounds (*)

5.13 Low-level LAPACK functions

This module contains low-level functions from the LAPACK library. New in version 0.12.0.

Warning: These functions do little to no error checking. It is possible to cause crashes by mis-using them, so

prefer using the higher-level routines in scipy.linalg.

5.14 Finding functions

get_lapack_funcs(names[, arrays, dtype]) Return available LAPACK function objects from names.

5.15 All functions

cgbsv cgbsv - Function signature:

cgbtrf cgbtrf - Function signature:

cgbtrs cgbtrs - Function signature:

cgebal cgebal - Function signature:

cgees cgees - Function signature:

cgeev cgeev - Function signature:

cgegv cgegv - Function signature:

cgehrd cgehrd - Function signature:

cgelss cgelss - Function signature:

cgeqp3 cgeqp3 - Function signature:

cgeqrf cgeqrf - Function signature:

cgerqf cgerqf - Function signature:

cgesdd cgesdd - Function signature:

cgesv cgesv - Function signature:

cgetrf cgetrf - Function signature:

cgetri cgetri - Function signature:

cgetrs cgetrs - Function signature:

cgges cgges - Function signature:

cggev cggev - Function signature:

chbevd chbevd - Function signature:

chbevx chbevx - Function signature:

cheev cheev - Function signature:

cheevd cheevd - Function signature:

cheevr cheevr - Function signature:

Continued on next page

398 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Table 5.74 – continued from previous page

chegv chegv - Function signature:

chegvd chegvd - Function signature:

chegvx chegvx - Function signature:

claswp claswp - Function signature:

clauum clauum - Function signature:

cpbsv cpbsv - Function signature:

cpbtrf cpbtrf - Function signature:

cpbtrs cpbtrs - Function signature:

cposv cposv - Function signature:

cpotrf cpotrf - Function signature:

cpotri cpotri - Function signature:

cpotrs cpotrs - Function signature:

ctrsyl ctrsyl - Function signature:

ctrtri ctrtri - Function signature:

ctrtrs ctrtrs - Function signature:

cungqr cungqr - Function signature:

cungrq cungrq - Function signature:

cunmqr cunmqr - Function signature:

dgbsv dgbsv - Function signature:

dgbtrf dgbtrf - Function signature:

dgbtrs dgbtrs - Function signature:

dgebal dgebal - Function signature:

dgees dgees - Function signature:

dgeev dgeev - Function signature:

dgegv dgegv - Function signature:

dgehrd dgehrd - Function signature:

dgelss dgelss - Function signature:

dgeqp3 dgeqp3 - Function signature:

dgeqrf dgeqrf - Function signature:

dgerqf dgerqf - Function signature:

dgesdd dgesdd - Function signature:

dgesv dgesv - Function signature:

dgetrf dgetrf - Function signature:

dgetri dgetri - Function signature:

dgetrs dgetrs - Function signature:

dgges dgges - Function signature:

dggev dggev - Function signature:

dlamch dlamch - Function signature:

dlaswp dlaswp - Function signature:

dlauum dlauum - Function signature:

dorgqr dorgqr - Function signature:

dorgrq dorgrq - Function signature:

dormqr dormqr - Function signature:

dpbsv dpbsv - Function signature:

dpbtrf dpbtrf - Function signature:

dpbtrs dpbtrs - Function signature:

dposv dposv - Function signature:

dpotrf dpotrf - Function signature:

dpotri dpotri - Function signature:

dpotrs dpotrs - Function signature:

Continued on next page

5.15. All functions 399

SciPy Reference Guide, Release 0.13.0

Table 5.74 – continued from previous page

dsbev dsbev - Function signature:

dsbevd dsbevd - Function signature:

dsbevx dsbevx - Function signature:

dsyev dsyev - Function signature:

dsyevd dsyevd - Function signature:

dsyevr dsyevr - Function signature:

dsygv dsygv - Function signature:

dsygvd dsygvd - Function signature:

dsygvx dsygvx - Function signature:

dtrsyl dtrsyl - Function signature:

dtrtri dtrtri - Function signature:

dtrtrs dtrtrs - Function signature:

sgbsv sgbsv - Function signature:

sgbtrf sgbtrf - Function signature:

sgbtrs sgbtrs - Function signature:

sgebal sgebal - Function signature:

sgees sgees - Function signature:

sgeev sgeev - Function signature:

sgegv sgegv - Function signature:

sgehrd sgehrd - Function signature:

sgelss sgelss - Function signature:

sgeqp3 sgeqp3 - Function signature:

sgeqrf sgeqrf - Function signature:

sgerqf sgerqf - Function signature:

sgesdd sgesdd - Function signature:

sgesv sgesv - Function signature:

sgetrf sgetrf - Function signature:

sgetri sgetri - Function signature:

sgetrs sgetrs - Function signature:

sgges sgges - Function signature:

sggev sggev - Function signature:

slamch slamch - Function signature:

slaswp slaswp - Function signature:

slauum slauum - Function signature:

sorgqr sorgqr - Function signature:

sorgrq sorgrq - Function signature:

sormqr sormqr - Function signature:

spbsv spbsv - Function signature:

spbtrf spbtrf - Function signature:

spbtrs spbtrs - Function signature:

sposv sposv - Function signature:

spotrf spotrf - Function signature:

spotri spotri - Function signature:

spotrs spotrs - Function signature:

ssbev ssbev - Function signature:

ssbevd ssbevd - Function signature:

ssbevx ssbevx - Function signature:

ssyev ssyev - Function signature:

ssyevd ssyevd - Function signature:

ssyevr ssyevr - Function signature:

Continued on next page

400 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Table 5.74 – continued from previous page

ssygv ssygv - Function signature:

ssygvd ssygvd - Function signature:

ssygvx ssygvx - Function signature:

strsyl strsyl - Function signature:

strtri strtri - Function signature:

strtrs strtrs - Function signature:

zgbsv zgbsv - Function signature:

zgbtrf zgbtrf - Function signature:

zgbtrs zgbtrs - Function signature:

zgebal zgebal - Function signature:

zgees zgees - Function signature:

zgeev zgeev - Function signature:

zgegv zgegv - Function signature:

zgehrd zgehrd - Function signature:

zgelss zgelss - Function signature:

zgeqp3 zgeqp3 - Function signature:

zgeqrf zgeqrf - Function signature:

zgerqf zgerqf - Function signature:

zgesdd zgesdd - Function signature:

zgesv zgesv - Function signature:

zgetrf zgetrf - Function signature:

zgetri zgetri - Function signature:

zgetrs zgetrs - Function signature:

zgges zgges - Function signature:

zggev zggev - Function signature:

zhbevd zhbevd - Function signature:

zhbevx zhbevx - Function signature:

zheev zheev - Function signature:

zheevd zheevd - Function signature:

zheevr zheevr - Function signature:

zhegv zhegv - Function signature:

zhegvd zhegvd - Function signature:

zhegvx zhegvx - Function signature:

zlaswp zlaswp - Function signature:

zlauum zlauum - Function signature:

zpbsv zpbsv - Function signature:

zpbtrf zpbtrf - Function signature:

zpbtrs zpbtrs - Function signature:

zposv zposv - Function signature:

zpotrf zpotrf - Function signature:

zpotri zpotri - Function signature:

zpotrs zpotrs - Function signature:

ztrsyl ztrsyl - Function signature:

ztrtri ztrtri - Function signature:

ztrtrs ztrtrs - Function signature:

zungqr zungqr - Function signature:

zungrq zungrq - Function signature:

zunmqr zunmqr - Function signature:

scipy.linalg.lapack.cgbsv = <fortran object>

5.15. All functions 401

SciPy Reference Guide, Release 0.13.0

cgbsv - Function signature:

lub,piv,x,info = cgbsv(kl,ku,ab,b,[overwrite_ab,overwrite_b])

Required arguments:

kl : input int ku : input int ab : input rank-2 array(‘F’) with bounds (2*kl+ku+1,n) b : input

rank-2 array(‘F’) with bounds (n,nrhs)

Optional arguments:

overwrite_ab := 0 input int overwrite_b := 0 input int

Return objects:

lub : rank-2 array(‘F’) with bounds (2*kl+ku+1,n) and ab storage piv : rank-1 array(‘i’) with

bounds (n) x : rank-2 array(‘F’) with bounds (n,nrhs) and b storage info : int

scipy.linalg.lapack.cgbtrf = <fortran object>

cgbtrf - Function signature:

lu,ipiv,info = cgbtrf(ab,kl,ku,[m,n,ldab,overwrite_ab])

Required arguments:

ab : input rank-2 array(‘F’) with bounds (ldab,*) kl : input int ku : input int

Optional arguments:

m := shape(ab,1) input int n := shape(ab,1) input int overwrite_ab := 0 input int ldab :=

shape(ab,0) input int

Return objects:

lu : rank-2 array(‘F’) with bounds (ldab,*) and ab storage ipiv : rank-1 array(‘i’) with bounds

(MIN(m,n)) info : int

scipy.linalg.lapack.cgbtrs = <fortran object>

cgbtrs - Function signature:

x,info = cgbtrs(ab,kl,ku,b,ipiv,[trans,n,ldab,ldb,overwrite_b])

Required arguments:

ab : input rank-2 array(‘F’) with bounds (ldab,*) kl : input int ku : input int b : input rank-2

array(‘F’) with bounds (ldb,*) ipiv : input rank-1 array(‘i’) with bounds (n)

Optional arguments:

overwrite_b := 0 input int trans := 0 input int n := shape(ab,1) input int ldab := shape(ab,0) input

int ldb := shape(b,0) input int

Return objects:

x : rank-2 array(‘F’) with bounds (ldb,*) and b storage info : int

scipy.linalg.lapack.cgebal = <fortran object>

cgebal - Function signature:

ba,lo,hi,pivscale,info = cgebal(a,[scale,permute,overwrite_a])

Required arguments:

a : input rank-2 array(‘F’) with bounds (m,n)

Optional arguments:

scale := 0 input int permute := 0 input int overwrite_a := 0 input int

Return objects:

ba : rank-2 array(‘F’) with bounds (m,n) and a storage lo : int hi : int pivscale : rank-1 array(‘f’)

with bounds (n) info : int

402 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.linalg.lapack.cgees = <fortran object>

cgees - Function signature:

t,sdim,w,vs,work,info = cgees(cselect,a,[compute_v,sort_t,lwork,cselect_extra_args,overwrite_a])

Required arguments:

cselect : call-back function a : input rank-2 array(‘F’) with bounds (n,n)

Optional arguments:

compute_v := 1 input int sort_t := 0 input int cselect_extra_args := () input tuple overwrite_a :=

0 input int lwork := 3*n input int

Return objects:

t : rank-2 array(‘F’) with bounds (n,n) and a storage sdim : int w : rank-1 array(‘F’) with

bounds (n) vs : rank-2 array(‘F’) with bounds (ldvs,n) work : rank-1 array(‘F’) with bounds

(MAX(lwork,1)) info : int

Call-back functions:

def cselect(arg): return cselect Required arguments:

arg : input complex

Return objects:

cselect : int

scipy.linalg.lapack.cgeev = <fortran object>

cgeev - Function signature:

w,vl,vr,info = cgeev(a,[compute_vl,compute_vr,lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘F’) with bounds (n,n)

Optional arguments:

compute_vl := 1 input int compute_vr := 1 input int overwrite_a := 0 input int lwork := 2*n input

int

Return objects:

w : rank-1 array(‘F’) with bounds (n) vl : rank-2 array(‘F’) with bounds (ldvl,n) vr : rank-2

array(‘F’) with bounds (ldvr,n) info : int

scipy.linalg.lapack.cgegv = <fortran object>

cgegv - Function signature:

alpha,beta,vl,vr,info = cgegv(a,b,[compute_vl,compute_vr,lwork,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘F’) with bounds (n,n) b : input rank-2 array(‘F’) with bounds (n,n)

Optional arguments:

compute_vl := 1 input int compute_vr := 1 input int overwrite_a := 0 input int overwrite_b := 0

input int lwork := 2*n input int

Return objects:

alpha : rank-1 array(‘F’) with bounds (n) beta : rank-1 array(‘F’) with bounds (n) vl : rank-2

array(‘F’) with bounds (ldvl,n) vr : rank-2 array(‘F’) with bounds (ldvr,n) info : int

scipy.linalg.lapack.cgehrd = <fortran object>

5.15. All functions 403

SciPy Reference Guide, Release 0.13.0

cgehrd - Function signature:

ht,tau,info = cgehrd(a,[lo,hi,lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘F’) with bounds (n,n)

Optional arguments:

lo := 0 input int hi := n-1 input int overwrite_a := 0 input int lwork := MAX(n,1) input int

Return objects:

ht : rank-2 array(‘F’) with bounds (n,n) and a storage tau : rank-1 array(‘F’) with bounds (n - 1)

info : int

scipy.linalg.lapack.cgelss = <fortran object>

cgelss - Function signature:

v,x,s,rank,work,info = cgelss(a,b,[cond,lwork,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘F’) with bounds (m,n) b : input rank-2 array(‘F’) with bounds

(maxmn,nrhs)

Optional arguments:

overwrite_a := 0 input int overwrite_b := 0 input int cond := -1.0 input ﬂoat lwork :=

2*minmn+MAX(maxmn,nrhs) input int

Return objects:

v : rank-2 array(‘F’) with bounds (m,n) and a storage x : rank-2 array(‘F’) with bounds

(maxmn,nrhs) and b storage s : rank-1 array(‘f’) with bounds (minmn) rank : int work : rank-1

array(‘F’) with bounds (MAX(lwork,1)) info : int

scipy.linalg.lapack.cgeqp3 = <fortran object>

cgeqp3 - Function signature:

qr,jpvt,tau,work,info = cgeqp3(a,[lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘F’) with bounds (m,n)

Optional arguments:

overwrite_a := 0 input int lwork := 3*(n+1) input int

Return objects:

qr : rank-2 array(‘F’) with bounds (m,n) and a storage jpvt : rank-1 array(‘i’) with bounds (n) tau :

rank-1 array(‘F’) with bounds (MIN(m,n)) work : rank-1 array(‘F’) with bounds (MAX(lwork,1))

info : int

scipy.linalg.lapack.cgeqrf = <fortran object>

cgeqrf - Function signature:

qr,tau,work,info = cgeqrf(a,[lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘F’) with bounds (m,n)

Optional arguments:

overwrite_a := 0 input int lwork := 3*n input int

404 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Return objects:

qr : rank-2 array(‘F’) with bounds (m,n) and a storage tau : rank-1 array(‘F’) with bounds

(MIN(m,n)) work : rank-1 array(‘F’) with bounds (MAX(lwork,1)) info : int

scipy.linalg.lapack.cgerqf = <fortran object>

cgerqf - Function signature:

qr,tau,work,info = cgerqf(a,[lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘F’) with bounds (m,n)

Optional arguments:

overwrite_a := 0 input int lwork := 3*m input int

Return objects:

qr : rank-2 array(‘F’) with bounds (m,n) and a storage tau : rank-1 array(‘F’) with bounds

(MIN(m,n)) work : rank-1 array(‘F’) with bounds (MAX(lwork,1)) info : int

scipy.linalg.lapack.cgesdd = <fortran object>

cgesdd - Function signature:

u,s,vt,info = cgesdd(a,[compute_uv,full_matrices,lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘F’) with bounds (m,n)

Optional arguments:

overwrite_a := 0 input int compute_uv := 1 input int full_matrices := 1 input int lwork := (com-

pute_uv?2*minmn*minmn+MAX(m,n)+2*minmn:2*minmn+MAX(m,n)) input int

Return objects:

u : rank-2 array(‘F’) with bounds (u0,u1) s : rank-1 array(‘f’) with bounds (minmn) vt : rank-2

array(‘F’) with bounds (vt0,vt1) info : int

scipy.linalg.lapack.cgesv = <fortran object>

cgesv - Function signature:

lu,piv,x,info = cgesv(a,b,[overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘F’) with bounds (n,n) b : input rank-2 array(‘F’) with bounds (n,nrhs)

Optional arguments:

overwrite_a := 0 input int overwrite_b := 0 input int

Return objects:

lu : rank-2 array(‘F’) with bounds (n,n) and a storage piv : rank-1 array(‘i’) with bounds (n) x :

rank-2 array(‘F’) with bounds (n,nrhs) and b storage info : int

scipy.linalg.lapack.cgetrf = <fortran object>

cgetrf - Function signature:

lu,piv,info = cgetrf(a,[overwrite_a])

Required arguments:

a : input rank-2 array(‘F’) with bounds (m,n)

5.15. All functions 405

SciPy Reference Guide, Release 0.13.0

Optional arguments:

overwrite_a := 0 input int

Return objects:

lu : rank-2 array(‘F’) with bounds (m,n) and a storage piv : rank-1 array(‘i’) with bounds

(MIN(m,n)) info : int

scipy.linalg.lapack.cgetri = <fortran object>

cgetri - Function signature:

inv_a,info = cgetri(lu,piv,[lwork,overwrite_lu])

Required arguments:

lu : input rank-2 array(‘F’) with bounds (n,n) piv : input rank-1 array(‘i’) with bounds (n)

Optional arguments:

overwrite_lu := 0 input int lwork := 3*n input int

Return objects:

inv_a : rank-2 array(‘F’) with bounds (n,n) and lu storage info : int

scipy.linalg.lapack.cgetrs = <fortran object>

cgetrs - Function signature:

x,info = cgetrs(lu,piv,b,[trans,overwrite_b])

Required arguments:

lu : input rank-2 array(‘F’) with bounds (n,n) piv : input rank-1 array(‘i’) with bounds (n) b :

input rank-2 array(‘F’) with bounds (n,nrhs)

Optional arguments:

overwrite_b := 0 input int trans := 0 input int

Return objects:

x : rank-2 array(‘F’) with bounds (n,nrhs) and b storage info : int

scipy.linalg.lapack.cgges = <fortran object>

cgges - Function signature:

a,b,sdim,alpha,beta,vsl,vsr,work,info = cgges(cselect,a,b,[jobvsl,jobvsr,sort_t,ldvsl,ldvsr,lwork,cselect_extra_args,overwrite_a,overwrite_b])

Required arguments:

cselect : call-back function a : input rank-2 array(‘F’) with bounds (lda,*) b : input rank-2

array(‘F’) with bounds (ldb,*)

Optional arguments:

jobvsl := 1 input int jobvsr := 1 input int sort_t := 0 input int cselect_extra_args := () input tuple

overwrite_a := 0 input int overwrite_b := 0 input int ldvsl := ((jobvsl==1)?n:1) input int ldvsr :=

((jobvsr==1)?n:1) input int lwork := 2*n input int

Return objects:

a : rank-2 array(‘F’) with bounds (lda,*) b : rank-2 array(‘F’) with bounds (ldb,*) sdim : int

alpha : rank-1 array(‘F’) with bounds (n) beta : rank-1 array(‘F’) with bounds (n) vsl : rank-

2 array(‘F’) with bounds (ldvsl,n) vsr : rank-2 array(‘F’) with bounds (ldvsr,n) work : rank-1

array(‘F’) with bounds (MAX(lwork,1)) info : int

Call-back functions:

def cselect(alpha,beta): return cselect Required arguments:

alpha : input complex beta : input complex

406 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Return objects:

cselect : int

scipy.linalg.lapack.cggev = <fortran object>

cggev - Function signature:

alpha,beta,vl,vr,work,info = cggev(a,b,[compute_vl,compute_vr,lwork,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘F’) with bounds (n,n) b : input rank-2 array(‘F’) with bounds (n,n)

Optional arguments:

compute_vl := 1 input int compute_vr := 1 input int overwrite_a := 0 input int overwrite_b := 0

input int lwork := 2*n input int

Return objects:

alpha : rank-1 array(‘F’) with bounds (n) beta : rank-1 array(‘F’) with bounds (n) vl : rank-2 ar-

ray(‘F’) with bounds (ldvl,n) vr : rank-2 array(‘F’) with bounds (ldvr,n) work : rank-1 array(‘F’)

with bounds (MAX(lwork,1)) info : int

scipy.linalg.lapack.chbevd = <fortran object>

chbevd - Function signature:

w,z,info = chbevd(ab,[compute_v,lower,ldab,lrwork,liwork,overwrite_ab])

Required arguments:

ab : input rank-2 array(‘F’) with bounds (ldab,*)

Optional arguments:

overwrite_ab := 1 input int compute_v := 1 input int lower := 0 input int ldab := shape(ab,0) input

int lrwork := (compute_v?1+5*n+2*n*n:n) input int liwork := (compute_v?3+5*n:1) input int

Return objects:

w : rank-1 array(‘f’) with bounds (n) z : rank-2 array(‘F’) with bounds (ldz,ldz) info : int

scipy.linalg.lapack.chbevx = <fortran object>

chbevx - Function signature:

w,z,m,ifail,info = chbevx(ab,vl,vu,il,iu,[ldab,compute_v,range,lower,abstol,mmax,overwrite_ab])

Required arguments:

ab : input rank-2 array(‘F’) with bounds (ldab,*) vl : input ﬂoat vu : input ﬂoat il : input int iu :

input int

Optional arguments:

overwrite_ab := 1 input int ldab := shape(ab,0) input int compute_v := 1 input int range := 0 input

int lower := 0 input int abstol := 0.0 input ﬂoat mmax := (compute_v?(range==2?(iu-il+1):n):1)

input int

Return objects:

w : rank-1 array(‘f’) with bounds (n) z : rank-2 array(‘F’) with bounds (ldz,mmax) m : int ifail :

rank-1 array(‘i’) with bounds ((compute_v?n:1)) info : int

scipy.linalg.lapack.cheev = <fortran object>

cheev - Function signature:

w,v,info = cheev(a,[compute_v,lower,lwork,overwrite_a])

5.15. All functions 407

SciPy Reference Guide, Release 0.13.0

Required arguments:

a : input rank-2 array(‘F’) with bounds (n,n)

Optional arguments:

compute_v := 1 input int lower := 0 input int overwrite_a := 0 input int lwork := 2*n-1 input int

Return objects:

w : rank-1 array(‘f’) with bounds (n) v : rank-2 array(‘F’) with bounds (n,n) and a storage info :

int

scipy.linalg.lapack.cheevd = <fortran object>

cheevd - Function signature:

w,v,info = cheevd(a,[compute_v,lower,lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘F’) with bounds (n,n)

Optional arguments:

compute_v := 1 input int lower := 0 input int overwrite_a := 0 input int lwork := (com-

pute_v?2*n+n*n:n+1) input int

Return objects:

w : rank-1 array(‘f’) with bounds (n) v : rank-2 array(‘F’) with bounds (n,n) and a storage info :

int

scipy.linalg.lapack.cheevr = <fortran object>

cheevr - Function signature:

w,z,info = cheevr(a,[jobz,range,uplo,il,iu,lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘F’) with bounds (n,n)

Optional arguments:

jobz := ‘V’ input string(len=1) range := ‘A’ input string(len=1) uplo := ‘L’ input string(len=1)

overwrite_a := 0 input int il := 1 input int iu := n input int lwork := 18*n input int

Return objects:

w : rank-1 array(‘f’) with bounds (n) z : rank-2 array(‘F’) with bounds (n,m) info : int

scipy.linalg.lapack.chegv = <fortran object>

chegv - Function signature:

a,w,info = chegv(a,b,[itype,jobz,uplo,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘F’) with bounds (n,n) b : input rank-2 array(‘F’) with bounds (n,n)

Optional arguments:

itype := 1 input int jobz := ‘V’ input string(len=1) uplo := ‘L’ input string(len=1) overwrite_a :=

0 input int overwrite_b := 0 input int

Return objects:

a : rank-2 array(‘F’) with bounds (n,n) w : rank-1 array(‘f’) with bounds (n) info : int

scipy.linalg.lapack.chegvd = <fortran object>

408 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

chegvd - Function signature:

a,w,info = chegvd(a,b,[itype,jobz,uplo,lwork,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘F’) with bounds (n,n) b : input rank-2 array(‘F’) with bounds (n,n)

Optional arguments:

itype := 1 input int jobz := ‘V’ input string(len=1) uplo := ‘L’ input string(len=1) overwrite_a :=

0 input int overwrite_b := 0 input int lwork := 2*n+n*n input int

Return objects:

a : rank-2 array(‘F’) with bounds (n,n) w : rank-1 array(‘f’) with bounds (n) info : int

scipy.linalg.lapack.chegvx = <fortran object>

chegvx - Function signature:

w,z,ifail,info = chegvx(a,b,iu,[itype,jobz,uplo,il,lwork,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘F’) with bounds (n,n) b : input rank-2 array(‘F’) with bounds (n,n) iu :

input int

Optional arguments:

itype := 1 input int jobz := ‘V’ input string(len=1) uplo := ‘L’ input string(len=1) overwrite_a :=

0 input int overwrite_b := 0 input int il := 1 input int lwork := 18*n-1 input int

Return objects:

w : rank-1 array(‘f’) with bounds (n) z : rank-2 array(‘F’) with bounds (n,m) ifail : rank-1

array(‘i’) with bounds (n) info : int

scipy.linalg.lapack.claswp = <fortran object>

claswp - Function signature:

a = claswp(a,piv,[k1,k2,off,inc,overwrite_a])

Required arguments:

a : input rank-2 array(‘F’) with bounds (nrows,n) piv : input rank-1 array(‘i’) with bounds (*)

Optional arguments:

overwrite_a := 0 input int k1 := 0 input int k2 := len(piv)-1 input int off := 0 input int inc := 1

input int

Return objects:

a : rank-2 array(‘F’) with bounds (nrows,n)

scipy.linalg.lapack.clauum = <fortran object>

clauum - Function signature:

a,info = clauum(c,[lower,overwrite_c])

Required arguments:

c : input rank-2 array(‘F’) with bounds (n,n)

Optional arguments:

overwrite_c := 0 input int lower := 0 input int

Return objects:

a : rank-2 array(‘F’) with bounds (n,n) and c storage info : int

5.15. All functions 409

SciPy Reference Guide, Release 0.13.0

scipy.linalg.lapack.cpbsv = <fortran object>

cpbsv - Function signature:

c,x,info = cpbsv(ab,b,[lower,ldab,overwrite_ab,overwrite_b])

Required arguments:

ab : input rank-2 array(‘F’) with bounds (ldab,n) b : input rank-2 array(‘F’) with bounds

(ldb,nrhs)

Optional arguments:

lower := 0 input int overwrite_ab := 0 input int ldab := shape(ab,0) input int overwrite_b := 0

input int

Return objects:

c : rank-2 array(‘F’) with bounds (ldab,n) and ab storage x : rank-2 array(‘F’) with bounds

(ldb,nrhs) and b storage info : int

scipy.linalg.lapack.cpbtrf = <fortran object>

cpbtrf - Function signature:

c,info = cpbtrf(ab,[lower,ldab,overwrite_ab])

Required arguments:

ab : input rank-2 array(‘F’) with bounds (ldab,n)

Optional arguments:

lower := 0 input int overwrite_ab := 0 input int ldab := shape(ab,0) input int

Return objects:

c : rank-2 array(‘F’) with bounds (ldab,n) and ab storage info : int

scipy.linalg.lapack.cpbtrs = <fortran object>

cpbtrs - Function signature:

x,info = cpbtrs(ab,b,[lower,ldab,overwrite_b])

Required arguments:

ab : input rank-2 array(‘F’) with bounds (ldab,n) b : input rank-2 array(‘F’) with bounds

(ldb,nrhs)

Optional arguments:

lower := 0 input int ldab := shape(ab,0) input int overwrite_b := 0 input int

Return objects:

x : rank-2 array(‘F’) with bounds (ldb,nrhs) and b storage info : int

scipy.linalg.lapack.cposv = <fortran object>

cposv - Function signature:

c,x,info = cposv(a,b,[lower,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘F’) with bounds (n,n) b : input rank-2 array(‘F’) with bounds (n,nrhs)

Optional arguments:

overwrite_a := 0 input int overwrite_b := 0 input int lower := 0 input int

410 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Return objects:

c : rank-2 array(‘F’) with bounds (n,n) and a storage x : rank-2 array(‘F’) with bounds (n,nrhs)

and b storage info : int

scipy.linalg.lapack.cpotrf = <fortran object>

cpotrf - Function signature:

c,info = cpotrf(a,[lower,clean,overwrite_a])

Required arguments:

a : input rank-2 array(‘F’) with bounds (n,n)

Optional arguments:

overwrite_a := 0 input int lower := 0 input int clean := 1 input int

Return objects:

c : rank-2 array(‘F’) with bounds (n,n) and a storage info : int

scipy.linalg.lapack.cpotri = <fortran object>

cpotri - Function signature:

inv_a,info = cpotri(c,[lower,overwrite_c])

Required arguments:

c : input rank-2 array(‘F’) with bounds (n,n)

Optional arguments:

overwrite_c := 0 input int lower := 0 input int

Return objects:

inv_a : rank-2 array(‘F’) with bounds (n,n) and c storage info : int

scipy.linalg.lapack.cpotrs = <fortran object>

cpotrs - Function signature:

x,info = cpotrs(c,b,[lower,overwrite_b])

Required arguments:

c : input rank-2 array(‘F’) with bounds (n,n) b : input rank-2 array(‘F’) with bounds (n,nrhs)

Optional arguments:

overwrite_b := 0 input int lower := 0 input int

Return objects:

x : rank-2 array(‘F’) with bounds (n,nrhs) and b storage info : int

scipy.linalg.lapack.ctrsyl = <fortran object>

ctrsyl - Function signature:

x,scale,info = ctrsyl(a,b,c,[trana,tranb,isgn,overwrite_c])

Required arguments:

a : input rank-2 array(‘F’) with bounds (m,m) b : input rank-2 array(‘F’) with bounds (n,n) c :

input rank-2 array(‘F’) with bounds (m,n)

Optional arguments:

trana := ‘N’ input string(len=1) tranb := ‘N’ input string(len=1) isgn := 1 input int overwrite_c :=

0 input int

5.15. All functions 411

SciPy Reference Guide, Release 0.13.0

Return objects:

x : rank-2 array(‘F’) with bounds (m,n) and c storage scale : ﬂoat info : int

scipy.linalg.lapack.ctrtri = <fortran object>

ctrtri - Function signature:

inv_c,info = ctrtri(c,[lower,unitdiag,overwrite_c])

Required arguments:

c : input rank-2 array(‘F’) with bounds (n,n)

Optional arguments:

overwrite_c := 0 input int lower := 0 input int unitdiag := 0 input int

Return objects:

inv_c : rank-2 array(‘F’) with bounds (n,n) and c storage info : int

scipy.linalg.lapack.ctrtrs = <fortran object>

ctrtrs - Function signature:

x,info = ctrtrs(a,b,[lower,trans,unitdiag,lda,overwrite_b])

Required arguments:

a : input rank-2 array(‘F’) with bounds (lda,n) b : input rank-2 array(‘F’) with bounds (ldb,nrhs)

Optional arguments:

lower := 0 input int trans := 0 input int unitdiag := 0 input int lda := shape(a,0) input int over-

write_b := 0 input int

Return objects:

x : rank-2 array(‘F’) with bounds (ldb,nrhs) and b storage info : int

scipy.linalg.lapack.cungqr = <fortran object>

cungqr - Function signature:

q,work,info = cungqr(a,tau,[lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘F’) with bounds (m,n) tau : input rank-1 array(‘F’) with bounds (k)

Optional arguments:

overwrite_a := 0 input int lwork := 3*n input int

Return objects:

q : rank-2 array(‘F’) with bounds (m,n) and a storage work : rank-1 array(‘F’) with bounds

(MAX(lwork,1)) info : int

scipy.linalg.lapack.cungrq = <fortran object>

cungrq - Function signature:

q,work,info = cungrq(a,tau,[lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘F’) with bounds (m,n) tau : input rank-1 array(‘F’) with bounds (k)

Optional arguments:

overwrite_a := 0 input int lwork := 3*m input int

412 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Return objects:

q : rank-2 array(‘F’) with bounds (m,n) and a storage work : rank-1 array(‘F’) with bounds

(MAX(lwork,1)) info : int

scipy.linalg.lapack.cunmqr = <fortran object>

cunmqr - Function signature:

cq,work,info = cunmqr(side,trans,a,tau,c,lwork,[overwrite_c])

Required arguments:

side : input string(len=1) trans : input string(len=1) a : input rank-2 array(‘F’) with bounds

(lda,k) tau : input rank-1 array(‘F’) with bounds (k) c : input rank-2 array(‘F’) with bounds

(ldc,n) lwork : input int

Optional arguments:

overwrite_c := 0 input int

Return objects:

cq : rank-2 array(‘F’) with bounds (ldc,n) and c storage work : rank-1 array(‘F’) with bounds

(MAX(lwork,1)) info : int

scipy.linalg.lapack.dgbsv = <fortran object>

dgbsv - Function signature:

lub,piv,x,info = dgbsv(kl,ku,ab,b,[overwrite_ab,overwrite_b])

Required arguments:

kl : input int ku : input int ab : input rank-2 array(‘d’) with bounds (2*kl+ku+1,n) b : input

rank-2 array(‘d’) with bounds (n,nrhs)

Optional arguments:

overwrite_ab := 0 input int overwrite_b := 0 input int

Return objects:

lub : rank-2 array(‘d’) with bounds (2*kl+ku+1,n) and ab storage piv : rank-1 array(‘i’) with

bounds (n) x : rank-2 array(‘d’) with bounds (n,nrhs) and b storage info : int

scipy.linalg.lapack.dgbtrf = <fortran object>

dgbtrf - Function signature:

lu,ipiv,info = dgbtrf(ab,kl,ku,[m,n,ldab,overwrite_ab])

Required arguments:

ab : input rank-2 array(‘d’) with bounds (ldab,*) kl : input int ku : input int

Optional arguments:

m := shape(ab,1) input int n := shape(ab,1) input int overwrite_ab := 0 input int ldab :=

shape(ab,0) input int

Return objects:

lu : rank-2 array(‘d’) with bounds (ldab,*) and ab storage ipiv : rank-1 array(‘i’) with bounds

(MIN(m,n)) info : int

scipy.linalg.lapack.dgbtrs = <fortran object>

dgbtrs - Function signature:

x,info = dgbtrs(ab,kl,ku,b,ipiv,[trans,n,ldab,ldb,overwrite_b])

5.15. All functions 413

SciPy Reference Guide, Release 0.13.0

Required arguments:

ab : input rank-2 array(‘d’) with bounds (ldab,*) kl : input int ku : input int b : input rank-2

array(‘d’) with bounds (ldb,*) ipiv : input rank-1 array(‘i’) with bounds (n)

Optional arguments:

overwrite_b := 0 input int trans := 0 input int n := shape(ab,1) input int ldab := shape(ab,0) input

int ldb := shape(b,0) input int

Return objects:

x : rank-2 array(‘d’) with bounds (ldb,*) and b storage info : int

scipy.linalg.lapack.dgebal = <fortran object>

dgebal - Function signature:

ba,lo,hi,pivscale,info = dgebal(a,[scale,permute,overwrite_a])

Required arguments:

a : input rank-2 array(‘d’) with bounds (m,n)

Optional arguments:

scale := 0 input int permute := 0 input int overwrite_a := 0 input int

Return objects:

ba : rank-2 array(‘d’) with bounds (m,n) and a storage lo : int hi : int pivscale : rank-1 array(‘d’)

with bounds (n) info : int

scipy.linalg.lapack.dgees = <fortran object>

dgees - Function signature:

t,sdim,wr,wi,vs,work,info = dgees(dselect,a,[compute_v,sort_t,lwork,dselect_extra_args,overwrite_a])

Required arguments:

dselect : call-back function a : input rank-2 array(‘d’) with bounds (n,n)

Optional arguments:

compute_v := 1 input int sort_t := 0 input int dselect_extra_args := () input tuple overwrite_a :=

0 input int lwork := 3*n input int

Return objects:

t : rank-2 array(‘d’) with bounds (n,n) and a storage sdim : int wr : rank-1 array(‘d’) with bounds

(n) wi : rank-1 array(‘d’) with bounds (n) vs : rank-2 array(‘d’) with bounds (ldvs,n) work :

rank-1 array(‘d’) with bounds (MAX(lwork,1)) info : int

Call-back functions:

def dselect(arg1,arg2): return dselect Required arguments:

arg1 : input ﬂoat arg2 : input ﬂoat

Return objects:

dselect : int

scipy.linalg.lapack.dgeev = <fortran object>

dgeev - Function signature:

wr,wi,vl,vr,info = dgeev(a,[compute_vl,compute_vr,lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘d’) with bounds (n,n)

414 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Optional arguments:

compute_vl := 1 input int compute_vr := 1 input int overwrite_a := 0 input int lwork := 4*n input

int

Return objects:

wr : rank-1 array(‘d’) with bounds (n) wi : rank-1 array(‘d’) with bounds (n) vl : rank-2 array(‘d’)

with bounds (ldvl,n) vr : rank-2 array(‘d’) with bounds (ldvr,n) info : int

scipy.linalg.lapack.dgegv = <fortran object>

dgegv - Function signature:

alphar,alphai,beta,vl,vr,info = dgegv(a,b,[compute_vl,compute_vr,lwork,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘d’) with bounds (n,n) b : input rank-2 array(‘d’) with bounds (n,n)

Optional arguments:

compute_vl := 1 input int compute_vr := 1 input int overwrite_a := 0 input int overwrite_b := 0

input int lwork := 8*n input int

Return objects:

alphar : rank-1 array(‘d’) with bounds (n) alphai : rank-1 array(‘d’) with bounds (n) beta : rank-1

array(‘d’) with bounds (n) vl : rank-2 array(‘d’) with bounds (ldvl,n) vr : rank-2 array(‘d’) with

bounds (ldvr,n) info : int

scipy.linalg.lapack.dgehrd = <fortran object>

dgehrd - Function signature:

ht,tau,info = dgehrd(a,[lo,hi,lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘d’) with bounds (n,n)

Optional arguments:

lo := 0 input int hi := n-1 input int overwrite_a := 0 input int lwork := MAX(n,1) input int

Return objects:

ht : rank-2 array(‘d’) with bounds (n,n) and a storage tau : rank-1 array(‘d’) with bounds (n - 1)

info : int

scipy.linalg.lapack.dgelss = <fortran object>

dgelss - Function signature:

v,x,s,rank,work,info = dgelss(a,b,[cond,lwork,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘d’) with bounds (m,n) b : input rank-2 array(‘d’) with bounds

(maxmn,nrhs)

Optional arguments:

overwrite_a := 0 input int overwrite_b := 0 input int cond := -1.0 input ﬂoat lwork :=

3*minmn+MAX(2*minmn,MAX(maxmn,nrhs)) input int

Return objects:

v : rank-2 array(‘d’) with bounds (m,n) and a storage x : rank-2 array(‘d’) with bounds

(maxmn,nrhs) and b storage s : rank-1 array(‘d’) with bounds (minmn) rank : int work : rank-1

array(‘d’) with bounds (MAX(lwork,1)) info : int

5.15. All functions 415

SciPy Reference Guide, Release 0.13.0

scipy.linalg.lapack.dgeqp3 = <fortran object>

dgeqp3 - Function signature:

qr,jpvt,tau,work,info = dgeqp3(a,[lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘d’) with bounds (m,n)

Optional arguments:

overwrite_a := 0 input int lwork := 3*(n+1) input int

Return objects:

qr : rank-2 array(‘d’) with bounds (m,n) and a storage jpvt : rank-1 array(‘i’) with bounds (n) tau :

rank-1 array(‘d’) with bounds (MIN(m,n)) work : rank-1 array(‘d’) with bounds (MAX(lwork,1))

info : int

scipy.linalg.lapack.dgeqrf = <fortran object>

dgeqrf - Function signature:

qr,tau,work,info = dgeqrf(a,[lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘d’) with bounds (m,n)

Optional arguments:

overwrite_a := 0 input int lwork := 3*n input int

Return objects:

qr : rank-2 array(‘d’) with bounds (m,n) and a storage tau : rank-1 array(‘d’) with bounds

(MIN(m,n)) work : rank-1 array(‘d’) with bounds (MAX(lwork,1)) info : int

scipy.linalg.lapack.dgerqf = <fortran object>

dgerqf - Function signature:

qr,tau,work,info = dgerqf(a,[lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘d’) with bounds (m,n)

Optional arguments:

overwrite_a := 0 input int lwork := 3*m input int

Return objects:

qr : rank-2 array(‘d’) with bounds (m,n) and a storage tau : rank-1 array(‘d’) with bounds

(MIN(m,n)) work : rank-1 array(‘d’) with bounds (MAX(lwork,1)) info : int

scipy.linalg.lapack.dgesdd = <fortran object>

dgesdd - Function signature:

u,s,vt,info = dgesdd(a,[compute_uv,full_matrices,lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘d’) with bounds (m,n)

Optional arguments:

overwrite_a := 0 input int compute_uv := 1 input int full_matrices := 1 input int lwork := (com-

pute_uv?4*minmn*minmn+MAX(m,n)+9*minmn:MAX(14*minmn+4,10*minmn+2+25*(25+8))+MAX(m,n))

input int

416 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Return objects:

u : rank-2 array(‘d’) with bounds (u0,u1) s : rank-1 array(‘d’) with bounds (minmn) vt : rank-2

array(‘d’) with bounds (vt0,vt1) info : int

scipy.linalg.lapack.dgesv = <fortran object>

dgesv - Function signature:

lu,piv,x,info = dgesv(a,b,[overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘d’) with bounds (n,n) b : input rank-2 array(‘d’) with bounds (n,nrhs)

Optional arguments:

overwrite_a := 0 input int overwrite_b := 0 input int

Return objects:

lu : rank-2 array(‘d’) with bounds (n,n) and a storage piv : rank-1 array(‘i’) with bounds (n) x :

rank-2 array(‘d’) with bounds (n,nrhs) and b storage info : int

scipy.linalg.lapack.dgetrf = <fortran object>

dgetrf - Function signature:

lu,piv,info = dgetrf(a,[overwrite_a])

Required arguments:

a : input rank-2 array(‘d’) with bounds (m,n)

Optional arguments:

overwrite_a := 0 input int

Return objects:

lu : rank-2 array(‘d’) with bounds (m,n) and a storage piv : rank-1 array(‘i’) with bounds

(MIN(m,n)) info : int

scipy.linalg.lapack.dgetri = <fortran object>

dgetri - Function signature:

inv_a,info = dgetri(lu,piv,[lwork,overwrite_lu])

Required arguments:

lu : input rank-2 array(‘d’) with bounds (n,n) piv : input rank-1 array(‘i’) with bounds (n)

Optional arguments:

overwrite_lu := 0 input int lwork := 3*n input int

Return objects:

inv_a : rank-2 array(‘d’) with bounds (n,n) and lu storage info : int

scipy.linalg.lapack.dgetrs = <fortran object>

dgetrs - Function signature:

x,info = dgetrs(lu,piv,b,[trans,overwrite_b])

Required arguments:

lu : input rank-2 array(‘d’) with bounds (n,n) piv : input rank-1 array(‘i’) with bounds (n) b :

input rank-2 array(‘d’) with bounds (n,nrhs)

Optional arguments:

overwrite_b := 0 input int trans := 0 input int

5.15. All functions 417

SciPy Reference Guide, Release 0.13.0

Return objects:

x : rank-2 array(‘d’) with bounds (n,nrhs) and b storage info : int

scipy.linalg.lapack.dgges = <fortran object>

dgges - Function signature:

a,b,sdim,alphar,alphai,beta,vsl,vsr,work,info = dgges(dselect,a,b,[jobvsl,jobvsr,sort_t,ldvsl,ldvsr,lwork,dselect_extra_args,overwrite_a,overwrite_b])

Required arguments:

dselect : call-back function a : input rank-2 array(‘d’) with bounds (lda,*) b : input rank-2

array(‘d’) with bounds (ldb,*)

Optional arguments:

jobvsl := 1 input int jobvsr := 1 input int sort_t := 0 input int dselect_extra_args := () input tuple

overwrite_a := 0 input int overwrite_b := 0 input int ldvsl := ((jobvsl==1)?n:1) input int ldvsr :=

((jobvsr==1)?n:1) input int lwork := 8*n+16 input int

Return objects:

a : rank-2 array(‘d’) with bounds (lda,*) b : rank-2 array(‘d’) with bounds (ldb,*) sdim : int

alphar : rank-1 array(‘d’) with bounds (n) alphai : rank-1 array(‘d’) with bounds (n) beta : rank-1

array(‘d’) with bounds (n) vsl : rank-2 array(‘d’) with bounds (ldvsl,n) vsr : rank-2 array(‘d’)

with bounds (ldvsr,n) work : rank-1 array(‘d’) with bounds (MAX(lwork,1)) info : int

Call-back functions:

def dselect(alphar,alphai,beta): return dselect Required arguments:

alphar : input ﬂoat alphai : input ﬂoat beta : input ﬂoat

Return objects:

dselect : int

scipy.linalg.lapack.dggev = <fortran object>

dggev - Function signature:

alphar,alphai,beta,vl,vr,work,info = dggev(a,b,[compute_vl,compute_vr,lwork,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘d’) with bounds (n,n) b : input rank-2 array(‘d’) with bounds (n,n)

Optional arguments:

compute_vl := 1 input int compute_vr := 1 input int overwrite_a := 0 input int overwrite_b := 0

input int lwork := 8*n input int

Return objects:

alphar : rank-1 array(‘d’) with bounds (n) alphai : rank-1 array(‘d’) with bounds (n) beta : rank-1

array(‘d’) with bounds (n) vl : rank-2 array(‘d’) with bounds (ldvl,n) vr : rank-2 array(‘d’) with

bounds (ldvr,n) work : rank-1 array(‘d’) with bounds (MAX(lwork,1)) info : int

scipy.linalg.lapack.dlamch = <fortran dlamch>

dlamch - Function signature:

dlamch = dlamch(cmach)

Required arguments:

cmach : input string(len=1)

Return objects:

dlamch : ﬂoat

418 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.linalg.lapack.dlaswp = <fortran object>

dlaswp - Function signature:

a = dlaswp(a,piv,[k1,k2,off,inc,overwrite_a])

Required arguments:

a : input rank-2 array(‘d’) with bounds (nrows,n) piv : input rank-1 array(‘i’) with bounds (*)

Optional arguments:

overwrite_a := 0 input int k1 := 0 input int k2 := len(piv)-1 input int off := 0 input int inc := 1

input int

Return objects:

a : rank-2 array(‘d’) with bounds (nrows,n)

scipy.linalg.lapack.dlauum = <fortran object>

dlauum - Function signature:

a,info = dlauum(c,[lower,overwrite_c])

Required arguments:

c : input rank-2 array(‘d’) with bounds (n,n)

Optional arguments:

overwrite_c := 0 input int lower := 0 input int

Return objects:

a : rank-2 array(‘d’) with bounds (n,n) and c storage info : int

scipy.linalg.lapack.dorgqr = <fortran object>

dorgqr - Function signature:

q,work,info = dorgqr(a,tau,[lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘d’) with bounds (m,n) tau : input rank-1 array(‘d’) with bounds (k)

Optional arguments:

overwrite_a := 0 input int lwork := 3*n input int

Return objects:

q : rank-2 array(‘d’) with bounds (m,n) and a storage work : rank-1 array(‘d’) with bounds

(MAX(lwork,1)) info : int

scipy.linalg.lapack.dorgrq = <fortran object>

dorgrq - Function signature:

q,work,info = dorgrq(a,tau,[lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘d’) with bounds (m,n) tau : input rank-1 array(‘d’) with bounds (k)

Optional arguments:

overwrite_a := 0 input int lwork := 3*m input int

Return objects:

q : rank-2 array(‘d’) with bounds (m,n) and a storage work : rank-1 array(‘d’) with bounds

(MAX(lwork,1)) info : int

5.15. All functions 419

SciPy Reference Guide, Release 0.13.0

scipy.linalg.lapack.dormqr = <fortran object>

dormqr - Function signature:

cq,work,info = dormqr(side,trans,a,tau,c,lwork,[overwrite_c])

Required arguments:

side : input string(len=1) trans : input string(len=1) a : input rank-2 array(‘d’) with bounds (lda,k)

tau : input rank-1 array(‘d’) with bounds (k) c : input rank-2 array(‘d’) with bounds (ldc,n) lwork

: input int

Optional arguments:

overwrite_c := 0 input int

Return objects:

cq : rank-2 array(‘d’) with bounds (ldc,n) and c storage work : rank-1 array(‘d’) with bounds

(MAX(lwork,1)) info : int

scipy.linalg.lapack.dpbsv = <fortran object>

dpbsv - Function signature:

c,x,info = dpbsv(ab,b,[lower,ldab,overwrite_ab,overwrite_b])

Required arguments:

ab : input rank-2 array(‘d’) with bounds (ldab,n) b : input rank-2 array(‘d’) with bounds

(ldb,nrhs)

Optional arguments:

lower := 0 input int overwrite_ab := 0 input int ldab := shape(ab,0) input int overwrite_b := 0

input int

Return objects:

c : rank-2 array(‘d’) with bounds (ldab,n) and ab storage x : rank-2 array(‘d’) with bounds

(ldb,nrhs) and b storage info : int

scipy.linalg.lapack.dpbtrf = <fortran object>

dpbtrf - Function signature:

c,info = dpbtrf(ab,[lower,ldab,overwrite_ab])

Required arguments:

ab : input rank-2 array(‘d’) with bounds (ldab,n)

Optional arguments:

lower := 0 input int overwrite_ab := 0 input int ldab := shape(ab,0) input int

Return objects:

c : rank-2 array(‘d’) with bounds (ldab,n) and ab storage info : int

scipy.linalg.lapack.dpbtrs = <fortran object>

dpbtrs - Function signature:

x,info = dpbtrs(ab,b,[lower,ldab,overwrite_b])

Required arguments:

ab : input rank-2 array(‘d’) with bounds (ldab,n) b : input rank-2 array(‘d’) with bounds

(ldb,nrhs)

Optional arguments:

lower := 0 input int ldab := shape(ab,0) input int overwrite_b := 0 input int

420 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Return objects:

x : rank-2 array(‘d’) with bounds (ldb,nrhs) and b storage info : int

scipy.linalg.lapack.dposv = <fortran object>

dposv - Function signature:

c,x,info = dposv(a,b,[lower,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘d’) with bounds (n,n) b : input rank-2 array(‘d’) with bounds (n,nrhs)

Optional arguments:

overwrite_a := 0 input int overwrite_b := 0 input int lower := 0 input int

Return objects:

c : rank-2 array(‘d’) with bounds (n,n) and a storage x : rank-2 array(‘d’) with bounds (n,nrhs)

and b storage info : int

scipy.linalg.lapack.dpotrf = <fortran object>

dpotrf - Function signature:

c,info = dpotrf(a,[lower,clean,overwrite_a])

Required arguments:

a : input rank-2 array(‘d’) with bounds (n,n)

Optional arguments:

overwrite_a := 0 input int lower := 0 input int clean := 1 input int

Return objects:

c : rank-2 array(‘d’) with bounds (n,n) and a storage info : int

scipy.linalg.lapack.dpotri = <fortran object>

dpotri - Function signature:

inv_a,info = dpotri(c,[lower,overwrite_c])

Required arguments:

c : input rank-2 array(‘d’) with bounds (n,n)

Optional arguments:

overwrite_c := 0 input int lower := 0 input int

Return objects:

inv_a : rank-2 array(‘d’) with bounds (n,n) and c storage info : int

scipy.linalg.lapack.dpotrs = <fortran object>

dpotrs - Function signature:

x,info = dpotrs(c,b,[lower,overwrite_b])

Required arguments:

c : input rank-2 array(‘d’) with bounds (n,n) b : input rank-2 array(‘d’) with bounds (n,nrhs)

Optional arguments:

overwrite_b := 0 input int lower := 0 input int

Return objects:

x : rank-2 array(‘d’) with bounds (n,nrhs) and b storage info : int

5.15. All functions 421

SciPy Reference Guide, Release 0.13.0

scipy.linalg.lapack.dsbev = <fortran object>

dsbev - Function signature:

w,z,info = dsbev(ab,[compute_v,lower,ldab,overwrite_ab])

Required arguments:

ab : input rank-2 array(‘d’) with bounds (ldab,*)

Optional arguments:

overwrite_ab := 1 input int compute_v := 1 input int lower := 0 input int ldab := shape(ab,0) input

int

Return objects:

w : rank-1 array(‘d’) with bounds (n) z : rank-2 array(‘d’) with bounds (ldz,ldz) info : int

scipy.linalg.lapack.dsbevd = <fortran object>

dsbevd - Function signature:

w,z,info = dsbevd(ab,[compute_v,lower,ldab,liwork,overwrite_ab])

Required arguments:

ab : input rank-2 array(‘d’) with bounds (ldab,*)

Optional arguments:

overwrite_ab := 1 input int compute_v := 1 input int lower := 0 input int ldab := shape(ab,0) input

int liwork := (compute_v?3+5*n:1) input int

Return objects:

w : rank-1 array(‘d’) with bounds (n) z : rank-2 array(‘d’) with bounds (ldz,ldz) info : int

scipy.linalg.lapack.dsbevx = <fortran object>

dsbevx - Function signature:

w,z,m,ifail,info = dsbevx(ab,vl,vu,il,iu,[ldab,compute_v,range,lower,abstol,mmax,overwrite_ab])

Required arguments:

ab : input rank-2 array(‘d’) with bounds (ldab,*) vl : input ﬂoat vu : input ﬂoat il : input int iu :

input int

Optional arguments:

overwrite_ab := 1 input int ldab := shape(ab,0) input int compute_v := 1 input int range := 0 input

int lower := 0 input int abstol := 0.0 input ﬂoat mmax := (compute_v?(range==2?(iu-il+1):n):1)

input int

Return objects:

w : rank-1 array(‘d’) with bounds (n) z : rank-2 array(‘d’) with bounds (ldz,mmax) m : int ifail :

rank-1 array(‘i’) with bounds ((compute_v?n:1)) info : int

scipy.linalg.lapack.dsyev = <fortran object>

dsyev - Function signature:

w,v,info = dsyev(a,[compute_v,lower,lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘d’) with bounds (n,n)

Optional arguments:

compute_v := 1 input int lower := 0 input int overwrite_a := 0 input int lwork := 3*n-1 input int

422 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Return objects:

w : rank-1 array(‘d’) with bounds (n) v : rank-2 array(‘d’) with bounds (n,n) and a storage info :

int

scipy.linalg.lapack.dsyevd = <fortran object>

dsyevd - Function signature:

w,v,info = dsyevd(a,[compute_v,lower,lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘d’) with bounds (n,n)

Optional arguments:

compute_v := 1 input int lower := 0 input int overwrite_a := 0 input int lwork := (com-

pute_v?1+6*n+2*n*n:2*n+1) input int

Return objects:

w : rank-1 array(‘d’) with bounds (n) v : rank-2 array(‘d’) with bounds (n,n) and a storage info :

int

scipy.linalg.lapack.dsyevr = <fortran object>

dsyevr - Function signature:

w,z,info = dsyevr(a,[jobz,range,uplo,il,iu,lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘d’) with bounds (n,n)

Optional arguments:

jobz := ‘V’ input string(len=1) range := ‘A’ input string(len=1) uplo := ‘L’ input string(len=1)

overwrite_a := 0 input int il := 1 input int iu := n input int lwork := 26*n input int

Return objects:

w : rank-1 array(‘d’) with bounds (n) z : rank-2 array(‘d’) with bounds (n,m) info : int

scipy.linalg.lapack.dsygv = <fortran object>

dsygv - Function signature:

a,w,info = dsygv(a,b,[itype,jobz,uplo,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘d’) with bounds (n,n) b : input rank-2 array(‘d’) with bounds (n,n)

Optional arguments:

itype := 1 input int jobz := ‘V’ input string(len=1) uplo := ‘L’ input string(len=1) overwrite_a :=

0 input int overwrite_b := 0 input int

Return objects:

a : rank-2 array(‘d’) with bounds (n,n) w : rank-1 array(‘d’) with bounds (n) info : int

scipy.linalg.lapack.dsygvd = <fortran object>

dsygvd - Function signature:

a,w,info = dsygvd(a,b,[itype,jobz,uplo,lwork,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘d’) with bounds (n,n) b : input rank-2 array(‘d’) with bounds (n,n)

5.15. All functions 423

SciPy Reference Guide, Release 0.13.0

Optional arguments:

itype := 1 input int jobz := ‘V’ input string(len=1) uplo := ‘L’ input string(len=1) overwrite_a :=

0 input int overwrite_b := 0 input int lwork := 1+6*n+2*n*n input int

Return objects:

a : rank-2 array(‘d’) with bounds (n,n) w : rank-1 array(‘d’) with bounds (n) info : int

scipy.linalg.lapack.dsygvx = <fortran object>

dsygvx - Function signature:

w,z,ifail,info = dsygvx(a,b,iu,[itype,jobz,uplo,il,lwork,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘d’) with bounds (n,n) b : input rank-2 array(‘d’) with bounds (n,n) iu :

input int

Optional arguments:

itype := 1 input int jobz := ‘V’ input string(len=1) uplo := ‘L’ input string(len=1) overwrite_a :=

0 input int overwrite_b := 0 input int il := 1 input int lwork := 8*n input int

Return objects:

w : rank-1 array(‘d’) with bounds (n) z : rank-2 array(‘d’) with bounds (n,m) ifail : rank-1

array(‘i’) with bounds (n) info : int

scipy.linalg.lapack.dtrsyl = <fortran object>

dtrsyl - Function signature:

x,scale,info = dtrsyl(a,b,c,[trana,tranb,isgn,overwrite_c])

Required arguments:

a : input rank-2 array(‘d’) with bounds (m,m) b : input rank-2 array(‘d’) with bounds (n,n) c :

input rank-2 array(‘d’) with bounds (m,n)

Optional arguments:

trana := ‘N’ input string(len=1) tranb := ‘N’ input string(len=1) isgn := 1 input int overwrite_c :=

0 input int

Return objects:

x : rank-2 array(‘d’) with bounds (m,n) and c storage scale : ﬂoat info : int

scipy.linalg.lapack.dtrtri = <fortran object>

dtrtri - Function signature:

inv_c,info = dtrtri(c,[lower,unitdiag,overwrite_c])

Required arguments:

c : input rank-2 array(‘d’) with bounds (n,n)

Optional arguments:

overwrite_c := 0 input int lower := 0 input int unitdiag := 0 input int

Return objects:

inv_c : rank-2 array(‘d’) with bounds (n,n) and c storage info : int

scipy.linalg.lapack.dtrtrs = <fortran object>

dtrtrs - Function signature:

x,info = dtrtrs(a,b,[lower,trans,unitdiag,lda,overwrite_b])

424 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Required arguments:

a : input rank-2 array(‘d’) with bounds (lda,n) b : input rank-2 array(‘d’) with bounds (ldb,nrhs)

Optional arguments:

lower := 0 input int trans := 0 input int unitdiag := 0 input int lda := shape(a,0) input int over-

write_b := 0 input int

Return objects:

x : rank-2 array(‘d’) with bounds (ldb,nrhs) and b storage info : int

scipy.linalg.lapack.sgbsv = <fortran object>

sgbsv - Function signature:

lub,piv,x,info = sgbsv(kl,ku,ab,b,[overwrite_ab,overwrite_b])

Required arguments:

kl : input int ku : input int ab : input rank-2 array(‘f’) with bounds (2*kl+ku+1,n) b : input rank-2

array(‘f’) with bounds (n,nrhs)

Optional arguments:

overwrite_ab := 0 input int overwrite_b := 0 input int

Return objects:

lub : rank-2 array(‘f’) with bounds (2*kl+ku+1,n) and ab storage piv : rank-1 array(‘i’) with

bounds (n) x : rank-2 array(‘f’) with bounds (n,nrhs) and b storage info : int

scipy.linalg.lapack.sgbtrf = <fortran object>

sgbtrf - Function signature:

lu,ipiv,info = sgbtrf(ab,kl,ku,[m,n,ldab,overwrite_ab])

Required arguments:

ab : input rank-2 array(‘f’) with bounds (ldab,*) kl : input int ku : input int

Optional arguments:

m := shape(ab,1) input int n := shape(ab,1) input int overwrite_ab := 0 input int ldab :=

shape(ab,0) input int

Return objects:

lu : rank-2 array(‘f’) with bounds (ldab,*) and ab storage ipiv : rank-1 array(‘i’) with bounds

(MIN(m,n)) info : int

scipy.linalg.lapack.sgbtrs = <fortran object>

sgbtrs - Function signature:

x,info = sgbtrs(ab,kl,ku,b,ipiv,[trans,n,ldab,ldb,overwrite_b])

Required arguments:

ab : input rank-2 array(‘f’) with bounds (ldab,*) kl : input int ku : input int b : input rank-2

array(‘f’) with bounds (ldb,*) ipiv : input rank-1 array(‘i’) with bounds (n)

Optional arguments:

overwrite_b := 0 input int trans := 0 input int n := shape(ab,1) input int ldab := shape(ab,0) input

int ldb := shape(b,0) input int

Return objects:

x : rank-2 array(‘f’) with bounds (ldb,*) and b storage info : int

scipy.linalg.lapack.sgebal = <fortran object>

5.15. All functions 425

SciPy Reference Guide, Release 0.13.0

sgebal - Function signature:

ba,lo,hi,pivscale,info = sgebal(a,[scale,permute,overwrite_a])

Required arguments:

a : input rank-2 array(‘f’) with bounds (m,n)

Optional arguments:

scale := 0 input int permute := 0 input int overwrite_a := 0 input int

Return objects:

ba : rank-2 array(‘f’) with bounds (m,n) and a storage lo : int hi : int pivscale : rank-1 array(‘f’)

with bounds (n) info : int

scipy.linalg.lapack.sgees = <fortran object>

sgees - Function signature:

t,sdim,wr,wi,vs,work,info = sgees(sselect,a,[compute_v,sort_t,lwork,sselect_extra_args,overwrite_a])

Required arguments:

sselect : call-back function a : input rank-2 array(‘f’) with bounds (n,n)

Optional arguments:

compute_v := 1 input int sort_t := 0 input int sselect_extra_args := () input tuple overwrite_a :=

0 input int lwork := 3*n input int

Return objects:

t : rank-2 array(‘f’) with bounds (n,n) and a storage sdim : int wr : rank-1 array(‘f’) with bounds

(n) wi : rank-1 array(‘f’) with bounds (n) vs : rank-2 array(‘f’) with bounds (ldvs,n) work :

rank-1 array(‘f’) with bounds (MAX(lwork,1)) info : int

Call-back functions:

def sselect(arg1,arg2): return sselect Required arguments:

arg1 : input ﬂoat arg2 : input ﬂoat

Return objects:

sselect : int

scipy.linalg.lapack.sgeev = <fortran object>

sgeev - Function signature:

wr,wi,vl,vr,info = sgeev(a,[compute_vl,compute_vr,lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘f’) with bounds (n,n)

Optional arguments:

compute_vl := 1 input int compute_vr := 1 input int overwrite_a := 0 input int lwork := 4*n input

int

Return objects:

wr : rank-1 array(‘f’) with bounds (n) wi : rank-1 array(‘f’) with bounds (n) vl : rank-2 array(‘f’)

with bounds (ldvl,n) vr : rank-2 array(‘f’) with bounds (ldvr,n) info : int

scipy.linalg.lapack.sgegv = <fortran object>

sgegv - Function signature:

alphar,alphai,beta,vl,vr,info = sgegv(a,b,[compute_vl,compute_vr,lwork,overwrite_a,overwrite_b])

426 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Required arguments:

a : input rank-2 array(‘f’) with bounds (n,n) b : input rank-2 array(‘f’) with bounds (n,n)

Optional arguments:

compute_vl := 1 input int compute_vr := 1 input int overwrite_a := 0 input int overwrite_b := 0

input int lwork := 8*n input int

Return objects:

alphar : rank-1 array(‘f’) with bounds (n) alphai : rank-1 array(‘f’) with bounds (n) beta : rank-1

array(‘f’) with bounds (n) vl : rank-2 array(‘f’) with bounds (ldvl,n) vr : rank-2 array(‘f’) with

bounds (ldvr,n) info : int

scipy.linalg.lapack.sgehrd = <fortran object>

sgehrd - Function signature:

ht,tau,info = sgehrd(a,[lo,hi,lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘f’) with bounds (n,n)

Optional arguments:

lo := 0 input int hi := n-1 input int overwrite_a := 0 input int lwork := MAX(n,1) input int

Return objects:

ht : rank-2 array(‘f’) with bounds (n,n) and a storage tau : rank-1 array(‘f’) with bounds (n - 1)

info : int

scipy.linalg.lapack.sgelss = <fortran object>

sgelss - Function signature:

v,x,s,rank,work,info = sgelss(a,b,[cond,lwork,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘f’) with bounds (m,n) b : input rank-2 array(‘f’) with bounds

(maxmn,nrhs)

Optional arguments:

overwrite_a := 0 input int overwrite_b := 0 input int cond := -1.0 input ﬂoat lwork :=

3*minmn+MAX(2*minmn,MAX(maxmn,nrhs)) input int

Return objects:

v : rank-2 array(‘f’) with bounds (m,n) and a storage x : rank-2 array(‘f’) with bounds

(maxmn,nrhs) and b storage s : rank-1 array(‘f’) with bounds (minmn) rank : int work : rank-1

array(‘f’) with bounds (MAX(lwork,1)) info : int

scipy.linalg.lapack.sgeqp3 = <fortran object>

sgeqp3 - Function signature:

qr,jpvt,tau,work,info = sgeqp3(a,[lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘f’) with bounds (m,n)

Optional arguments:

overwrite_a := 0 input int lwork := 3*(n+1) input int

Return objects:

qr : rank-2 array(‘f’) with bounds (m,n) and a storage jpvt : rank-1 array(‘i’) with bounds (n) tau :

5.15. All functions 427

SciPy Reference Guide, Release 0.13.0

rank-1 array(‘f’) with bounds (MIN(m,n)) work : rank-1 array(‘f’) with bounds (MAX(lwork,1))

info : int

scipy.linalg.lapack.sgeqrf = <fortran object>

sgeqrf - Function signature:

qr,tau,work,info = sgeqrf(a,[lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘f’) with bounds (m,n)

Optional arguments:

overwrite_a := 0 input int lwork := 3*n input int

Return objects:

qr : rank-2 array(‘f’) with bounds (m,n) and a storage tau : rank-1 array(‘f’) with bounds

(MIN(m,n)) work : rank-1 array(‘f’) with bounds (MAX(lwork,1)) info : int

scipy.linalg.lapack.sgerqf = <fortran object>

sgerqf - Function signature:

qr,tau,work,info = sgerqf(a,[lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘f’) with bounds (m,n)

Optional arguments:

overwrite_a := 0 input int lwork := 3*m input int

Return objects:

qr : rank-2 array(‘f’) with bounds (m,n) and a storage tau : rank-1 array(‘f’) with bounds

(MIN(m,n)) work : rank-1 array(‘f’) with bounds (MAX(lwork,1)) info : int

scipy.linalg.lapack.sgesdd = <fortran object>

sgesdd - Function signature:

u,s,vt,info = sgesdd(a,[compute_uv,full_matrices,lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘f’) with bounds (m,n)

Optional arguments:

overwrite_a := 0 input int compute_uv := 1 input int full_matrices := 1 input int lwork := (com-

pute_uv?4*minmn*minmn+MAX(m,n)+9*minmn:MAX(14*minmn+4,10*minmn+2+25*(25+8))+MAX(m,n))

input int

Return objects:

u : rank-2 array(‘f’) with bounds (u0,u1) s : rank-1 array(‘f’) with bounds (minmn) vt : rank-2

array(‘f’) with bounds (vt0,vt1) info : int

scipy.linalg.lapack.sgesv = <fortran object>

sgesv - Function signature:

lu,piv,x,info = sgesv(a,b,[overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘f’) with bounds (n,n) b : input rank-2 array(‘f’) with bounds (n,nrhs)

428 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Optional arguments:

overwrite_a := 0 input int overwrite_b := 0 input int

Return objects:

lu : rank-2 array(‘f’) with bounds (n,n) and a storage piv : rank-1 array(‘i’) with bounds (n) x :

rank-2 array(‘f’) with bounds (n,nrhs) and b storage info : int

scipy.linalg.lapack.sgetrf = <fortran object>

sgetrf - Function signature:

lu,piv,info = sgetrf(a,[overwrite_a])

Required arguments:

a : input rank-2 array(‘f’) with bounds (m,n)

Optional arguments:

overwrite_a := 0 input int

Return objects:

lu : rank-2 array(‘f’) with bounds (m,n) and a storage piv : rank-1 array(‘i’) with bounds

(MIN(m,n)) info : int

scipy.linalg.lapack.sgetri = <fortran object>

sgetri - Function signature:

inv_a,info = sgetri(lu,piv,[lwork,overwrite_lu])

Required arguments:

lu : input rank-2 array(‘f’) with bounds (n,n) piv : input rank-1 array(‘i’) with bounds (n)

Optional arguments:

overwrite_lu := 0 input int lwork := 3*n input int

Return objects:

inv_a : rank-2 array(‘f’) with bounds (n,n) and lu storage info : int

scipy.linalg.lapack.sgetrs = <fortran object>

sgetrs - Function signature:

x,info = sgetrs(lu,piv,b,[trans,overwrite_b])

Required arguments:

lu : input rank-2 array(‘f’) with bounds (n,n) piv : input rank-1 array(‘i’) with bounds (n) b :

input rank-2 array(‘f’) with bounds (n,nrhs)

Optional arguments:

overwrite_b := 0 input int trans := 0 input int

Return objects:

x : rank-2 array(‘f’) with bounds (n,nrhs) and b storage info : int

scipy.linalg.lapack.sgges = <fortran object>

sgges - Function signature:

a,b,sdim,alphar,alphai,beta,vsl,vsr,work,info = sgges(sselect,a,b,[jobvsl,jobvsr,sort_t,ldvsl,ldvsr,lwork,sselect_extra_args,overwrite_a,overwrite_b])

Required arguments:

sselect : call-back function a : input rank-2 array(‘f’) with bounds (lda,*) b : input rank-2

array(‘f’) with bounds (ldb,*)

5.15. All functions 429

SciPy Reference Guide, Release 0.13.0

Optional arguments:

jobvsl := 1 input int jobvsr := 1 input int sort_t := 0 input int sselect_extra_args := () input tuple

overwrite_a := 0 input int overwrite_b := 0 input int ldvsl := ((jobvsl==1)?n:1) input int ldvsr :=

((jobvsr==1)?n:1) input int lwork := 8*n+16 input int

Return objects:

a : rank-2 array(‘f’) with bounds (lda,*) b : rank-2 array(‘f’) with bounds (ldb,*) sdim : int

alphar : rank-1 array(‘f’) with bounds (n) alphai : rank-1 array(‘f’) with bounds (n) beta : rank-1

array(‘f’) with bounds (n) vsl : rank-2 array(‘f’) with bounds (ldvsl,n) vsr : rank-2 array(‘f’) with

bounds (ldvsr,n) work : rank-1 array(‘f’) with bounds (MAX(lwork,1)) info : int

Call-back functions:

def sselect(alphar,alphai,beta): return sselect Required arguments:

alphar : input ﬂoat alphai : input ﬂoat beta : input ﬂoat

Return objects:

sselect : int

scipy.linalg.lapack.sggev = <fortran object>

sggev - Function signature:

alphar,alphai,beta,vl,vr,work,info = sggev(a,b,[compute_vl,compute_vr,lwork,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘f’) with bounds (n,n) b : input rank-2 array(‘f’) with bounds (n,n)

Optional arguments:

compute_vl := 1 input int compute_vr := 1 input int overwrite_a := 0 input int overwrite_b := 0

input int lwork := 8*n input int

Return objects:

alphar : rank-1 array(‘f’) with bounds (n) alphai : rank-1 array(‘f’) with bounds (n) beta : rank-1

array(‘f’) with bounds (n) vl : rank-2 array(‘f’) with bounds (ldvl,n) vr : rank-2 array(‘f’) with

bounds (ldvr,n) work : rank-1 array(‘f’) with bounds (MAX(lwork,1)) info : int

scipy.linalg.lapack.slamch = <fortran slamch>

slamch - Function signature:

slamch = slamch(cmach)

Required arguments:

cmach : input string(len=1)

Return objects:

slamch : ﬂoat

scipy.linalg.lapack.slaswp = <fortran object>

slaswp - Function signature:

a = slaswp(a,piv,[k1,k2,off,inc,overwrite_a])

Required arguments:

a : input rank-2 array(‘f’) with bounds (nrows,n) piv : input rank-1 array(‘i’) with bounds (*)

Optional arguments:

overwrite_a := 0 input int k1 := 0 input int k2 := len(piv)-1 input int off := 0 input int inc := 1

input int

430 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Return objects:

a : rank-2 array(‘f’) with bounds (nrows,n)

scipy.linalg.lapack.slauum = <fortran object>

slauum - Function signature:

a,info = slauum(c,[lower,overwrite_c])

Required arguments:

c : input rank-2 array(‘f’) with bounds (n,n)

Optional arguments:

overwrite_c := 0 input int lower := 0 input int

Return objects:

a : rank-2 array(‘f’) with bounds (n,n) and c storage info : int

scipy.linalg.lapack.sorgqr = <fortran object>

sorgqr - Function signature:

q,work,info = sorgqr(a,tau,[lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘f’) with bounds (m,n) tau : input rank-1 array(‘f’) with bounds (k)

Optional arguments:

overwrite_a := 0 input int lwork := 3*n input int

Return objects:

q : rank-2 array(‘f’) with bounds (m,n) and a storage work : rank-1 array(‘f’) with bounds

(MAX(lwork,1)) info : int

scipy.linalg.lapack.sorgrq = <fortran object>

sorgrq - Function signature:

q,work,info = sorgrq(a,tau,[lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘f’) with bounds (m,n) tau : input rank-1 array(‘f’) with bounds (k)

Optional arguments:

overwrite_a := 0 input int lwork := 3*m input int

Return objects:

q : rank-2 array(‘f’) with bounds (m,n) and a storage work : rank-1 array(‘f’) with bounds

(MAX(lwork,1)) info : int

scipy.linalg.lapack.sormqr = <fortran object>

sormqr - Function signature:

cq,work,info = sormqr(side,trans,a,tau,c,lwork,[overwrite_c])

Required arguments:

side : input string(len=1) trans : input string(len=1) a : input rank-2 array(‘f’) with bounds (lda,k)

tau : input rank-1 array(‘f’) with bounds (k) c : input rank-2 array(‘f’) with bounds (ldc,n) lwork

: input int

Optional arguments:

overwrite_c := 0 input int

5.15. All functions 431

SciPy Reference Guide, Release 0.13.0

Return objects:

cq : rank-2 array(‘f’) with bounds (ldc,n) and c storage work : rank-1 array(‘f’) with bounds

(MAX(lwork,1)) info : int

scipy.linalg.lapack.spbsv = <fortran object>

spbsv - Function signature:

c,x,info = spbsv(ab,b,[lower,ldab,overwrite_ab,overwrite_b])

Required arguments:

ab : input rank-2 array(‘f’) with bounds (ldab,n) b : input rank-2 array(‘f’) with bounds (ldb,nrhs)

Optional arguments:

lower := 0 input int overwrite_ab := 0 input int ldab := shape(ab,0) input int overwrite_b := 0

input int

Return objects:

c : rank-2 array(‘f’) with bounds (ldab,n) and ab storage x : rank-2 array(‘f’) with bounds

(ldb,nrhs) and b storage info : int

scipy.linalg.lapack.spbtrf = <fortran object>

spbtrf - Function signature:

c,info = spbtrf(ab,[lower,ldab,overwrite_ab])

Required arguments:

ab : input rank-2 array(‘f’) with bounds (ldab,n)

Optional arguments:

lower := 0 input int overwrite_ab := 0 input int ldab := shape(ab,0) input int

Return objects:

c : rank-2 array(‘f’) with bounds (ldab,n) and ab storage info : int

scipy.linalg.lapack.spbtrs = <fortran object>

spbtrs - Function signature:

x,info = spbtrs(ab,b,[lower,ldab,overwrite_b])

Required arguments:

ab : input rank-2 array(‘f’) with bounds (ldab,n) b : input rank-2 array(‘f’) with bounds (ldb,nrhs)

Optional arguments:

lower := 0 input int ldab := shape(ab,0) input int overwrite_b := 0 input int

Return objects:

x : rank-2 array(‘f’) with bounds (ldb,nrhs) and b storage info : int

scipy.linalg.lapack.sposv = <fortran object>

sposv - Function signature:

c,x,info = sposv(a,b,[lower,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘f’) with bounds (n,n) b : input rank-2 array(‘f’) with bounds (n,nrhs)

Optional arguments:

overwrite_a := 0 input int overwrite_b := 0 input int lower := 0 input int

432 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Return objects:

c : rank-2 array(‘f’) with bounds (n,n) and a storage x : rank-2 array(‘f’) with bounds (n,nrhs)

and b storage info : int

scipy.linalg.lapack.spotrf = <fortran object>

spotrf - Function signature:

c,info = spotrf(a,[lower,clean,overwrite_a])

Required arguments:

a : input rank-2 array(‘f’) with bounds (n,n)

Optional arguments:

overwrite_a := 0 input int lower := 0 input int clean := 1 input int

Return objects:

c : rank-2 array(‘f’) with bounds (n,n) and a storage info : int

scipy.linalg.lapack.spotri = <fortran object>

spotri - Function signature:

inv_a,info = spotri(c,[lower,overwrite_c])

Required arguments:

c : input rank-2 array(‘f’) with bounds (n,n)

Optional arguments:

overwrite_c := 0 input int lower := 0 input int

Return objects:

inv_a : rank-2 array(‘f’) with bounds (n,n) and c storage info : int

scipy.linalg.lapack.spotrs = <fortran object>

spotrs - Function signature:

x,info = spotrs(c,b,[lower,overwrite_b])

Required arguments:

c : input rank-2 array(‘f’) with bounds (n,n) b : input rank-2 array(‘f’) with bounds (n,nrhs)

Optional arguments:

overwrite_b := 0 input int lower := 0 input int

Return objects:

x : rank-2 array(‘f’) with bounds (n,nrhs) and b storage info : int

scipy.linalg.lapack.ssbev = <fortran object>

ssbev - Function signature:

w,z,info = ssbev(ab,[compute_v,lower,ldab,overwrite_ab])

Required arguments:

ab : input rank-2 array(‘f’) with bounds (ldab,*)

Optional arguments:

overwrite_ab := 1 input int compute_v := 1 input int lower := 0 input int ldab := shape(ab,0) input

int

Return objects:

w : rank-1 array(‘f’) with bounds (n) z : rank-2 array(‘f’) with bounds (ldz,ldz) info : int

5.15. All functions 433

SciPy Reference Guide, Release 0.13.0

scipy.linalg.lapack.ssbevd = <fortran object>

ssbevd - Function signature:

w,z,info = ssbevd(ab,[compute_v,lower,ldab,liwork,overwrite_ab])

Required arguments:

ab : input rank-2 array(‘f’) with bounds (ldab,*)

Optional arguments:

overwrite_ab := 1 input int compute_v := 1 input int lower := 0 input int ldab := shape(ab,0) input

int liwork := (compute_v?3+5*n:1) input int

Return objects:

w : rank-1 array(‘f’) with bounds (n) z : rank-2 array(‘f’) with bounds (ldz,ldz) info : int

scipy.linalg.lapack.ssbevx = <fortran object>

ssbevx - Function signature:

w,z,m,ifail,info = ssbevx(ab,vl,vu,il,iu,[ldab,compute_v,range,lower,abstol,mmax,overwrite_ab])

Required arguments:

ab : input rank-2 array(‘f’) with bounds (ldab,*) vl : input ﬂoat vu : input ﬂoat il : input int iu :

input int

Optional arguments:

overwrite_ab := 1 input int ldab := shape(ab,0) input int compute_v := 1 input int range := 0 input

int lower := 0 input int abstol := 0.0 input ﬂoat mmax := (compute_v?(range==2?(iu-il+1):n):1)

input int

Return objects:

w : rank-1 array(‘f’) with bounds (n) z : rank-2 array(‘f’) with bounds (ldz,mmax) m : int ifail :

rank-1 array(‘i’) with bounds ((compute_v?n:1)) info : int

scipy.linalg.lapack.ssyev = <fortran object>

ssyev - Function signature:

w,v,info = ssyev(a,[compute_v,lower,lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘f’) with bounds (n,n)

Optional arguments:

compute_v := 1 input int lower := 0 input int overwrite_a := 0 input int lwork := 3*n-1 input int

Return objects:

w : rank-1 array(‘f’) with bounds (n) v : rank-2 array(‘f’) with bounds (n,n) and a storage info :

int

scipy.linalg.lapack.ssyevd = <fortran object>

ssyevd - Function signature:

w,v,info = ssyevd(a,[compute_v,lower,lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘f’) with bounds (n,n)

Optional arguments:

compute_v := 1 input int lower := 0 input int overwrite_a := 0 input int lwork := (com-

pute_v?1+6*n+2*n*n:2*n+1) input int

434 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Return objects:

w : rank-1 array(‘f’) with bounds (n) v : rank-2 array(‘f’) with bounds (n,n) and a storage info :

int

scipy.linalg.lapack.ssyevr = <fortran object>

ssyevr - Function signature:

w,z,info = ssyevr(a,[jobz,range,uplo,il,iu,lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘f’) with bounds (n,n)

Optional arguments:

jobz := ‘V’ input string(len=1) range := ‘A’ input string(len=1) uplo := ‘L’ input string(len=1)

overwrite_a := 0 input int il := 1 input int iu := n input int lwork := 26*n input int

Return objects:

w : rank-1 array(‘f’) with bounds (n) z : rank-2 array(‘f’) with bounds (n,m) info : int

scipy.linalg.lapack.ssygv = <fortran object>

ssygv - Function signature:

a,w,info = ssygv(a,b,[itype,jobz,uplo,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘f’) with bounds (n,n) b : input rank-2 array(‘f’) with bounds (n,n)

Optional arguments:

itype := 1 input int jobz := ‘V’ input string(len=1) uplo := ‘L’ input string(len=1) overwrite_a :=

0 input int overwrite_b := 0 input int

Return objects:

a : rank-2 array(‘f’) with bounds (n,n) w : rank-1 array(‘f’) with bounds (n) info : int

scipy.linalg.lapack.ssygvd = <fortran object>

ssygvd - Function signature:

a,w,info = ssygvd(a,b,[itype,jobz,uplo,lwork,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘f’) with bounds (n,n) b : input rank-2 array(‘f’) with bounds (n,n)

Optional arguments:

itype := 1 input int jobz := ‘V’ input string(len=1) uplo := ‘L’ input string(len=1) overwrite_a :=

0 input int overwrite_b := 0 input int lwork := 1+6*n+2*n*n input int

Return objects:

a : rank-2 array(‘f’) with bounds (n,n) w : rank-1 array(‘f’) with bounds (n) info : int

scipy.linalg.lapack.ssygvx = <fortran object>

ssygvx - Function signature:

w,z,ifail,info = ssygvx(a,b,iu,[itype,jobz,uplo,il,lwork,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘f’) with bounds (n,n) b : input rank-2 array(‘f’) with bounds (n,n) iu :

input int

5.15. All functions 435

SciPy Reference Guide, Release 0.13.0

Optional arguments:

itype := 1 input int jobz := ‘V’ input string(len=1) uplo := ‘L’ input string(len=1) overwrite_a :=

0 input int overwrite_b := 0 input int il := 1 input int lwork := 8*n input int

Return objects:

w : rank-1 array(‘f’) with bounds (n) z : rank-2 array(‘f’) with bounds (n,m) ifail : rank-1

array(‘i’) with bounds (n) info : int

scipy.linalg.lapack.strsyl = <fortran object>

strsyl - Function signature:

x,scale,info = strsyl(a,b,c,[trana,tranb,isgn,overwrite_c])

Required arguments:

a : input rank-2 array(‘f’) with bounds (m,m) b : input rank-2 array(‘f’) with bounds (n,n) c :

input rank-2 array(‘f’) with bounds (m,n)

Optional arguments:

trana := ‘N’ input string(len=1) tranb := ‘N’ input string(len=1) isgn := 1 input int overwrite_c :=

0 input int

Return objects:

x : rank-2 array(‘f’) with bounds (m,n) and c storage scale : ﬂoat info : int

scipy.linalg.lapack.strtri = <fortran object>

strtri - Function signature:

inv_c,info = strtri(c,[lower,unitdiag,overwrite_c])

Required arguments:

c : input rank-2 array(‘f’) with bounds (n,n)

Optional arguments:

overwrite_c := 0 input int lower := 0 input int unitdiag := 0 input int

Return objects:

inv_c : rank-2 array(‘f’) with bounds (n,n) and c storage info : int

scipy.linalg.lapack.strtrs = <fortran object>

strtrs - Function signature:

x,info = strtrs(a,b,[lower,trans,unitdiag,lda,overwrite_b])

Required arguments:

a : input rank-2 array(‘f’) with bounds (lda,n) b : input rank-2 array(‘f’) with bounds (ldb,nrhs)

Optional arguments:

lower := 0 input int trans := 0 input int unitdiag := 0 input int lda := shape(a,0) input int over-

write_b := 0 input int

Return objects:

x : rank-2 array(‘f’) with bounds (ldb,nrhs) and b storage info : int

scipy.linalg.lapack.zgbsv = <fortran object>

zgbsv - Function signature:

lub,piv,x,info = zgbsv(kl,ku,ab,b,[overwrite_ab,overwrite_b])

436 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Required arguments:

kl : input int ku : input int ab : input rank-2 array(‘D’) with bounds (2*kl+ku+1,n) b : input

rank-2 array(‘D’) with bounds (n,nrhs)

Optional arguments:

overwrite_ab := 0 input int overwrite_b := 0 input int

Return objects:

lub : rank-2 array(‘D’) with bounds (2*kl+ku+1,n) and ab storage piv : rank-1 array(‘i’) with

bounds (n) x : rank-2 array(‘D’) with bounds (n,nrhs) and b storage info : int

scipy.linalg.lapack.zgbtrf = <fortran object>

zgbtrf - Function signature:

lu,ipiv,info = zgbtrf(ab,kl,ku,[m,n,ldab,overwrite_ab])

Required arguments:

ab : input rank-2 array(‘D’) with bounds (ldab,*) kl : input int ku : input int

Optional arguments:

m := shape(ab,1) input int n := shape(ab,1) input int overwrite_ab := 0 input int ldab :=

shape(ab,0) input int

Return objects:

lu : rank-2 array(‘D’) with bounds (ldab,*) and ab storage ipiv : rank-1 array(‘i’) with bounds

(MIN(m,n)) info : int

scipy.linalg.lapack.zgbtrs = <fortran object>

zgbtrs - Function signature:

x,info = zgbtrs(ab,kl,ku,b,ipiv,[trans,n,ldab,ldb,overwrite_b])

Required arguments:

ab : input rank-2 array(‘D’) with bounds (ldab,*) kl : input int ku : input int b : input rank-2

array(‘D’) with bounds (ldb,*) ipiv : input rank-1 array(‘i’) with bounds (n)

Optional arguments:

overwrite_b := 0 input int trans := 0 input int n := shape(ab,1) input int ldab := shape(ab,0) input

int ldb := shape(b,0) input int

Return objects:

x : rank-2 array(‘D’) with bounds (ldb,*) and b storage info : int

scipy.linalg.lapack.zgebal = <fortran object>

zgebal - Function signature:

ba,lo,hi,pivscale,info = zgebal(a,[scale,permute,overwrite_a])

Required arguments:

a : input rank-2 array(‘D’) with bounds (m,n)

Optional arguments:

scale := 0 input int permute := 0 input int overwrite_a := 0 input int

Return objects:

ba : rank-2 array(‘D’) with bounds (m,n) and a storage lo : int hi : int pivscale : rank-1 array(‘d’)

with bounds (n) info : int

scipy.linalg.lapack.zgees = <fortran object>

5.15. All functions 437

SciPy Reference Guide, Release 0.13.0

zgees - Function signature:

t,sdim,w,vs,work,info = zgees(zselect,a,[compute_v,sort_t,lwork,zselect_extra_args,overwrite_a])

Required arguments:

zselect : call-back function a : input rank-2 array(‘D’) with bounds (n,n)

Optional arguments:

compute_v := 1 input int sort_t := 0 input int zselect_extra_args := () input tuple overwrite_a :=

0 input int lwork := 3*n input int

Return objects:

t : rank-2 array(‘D’) with bounds (n,n) and a storage sdim : int w : rank-1 array(‘D’) with

bounds (n) vs : rank-2 array(‘D’) with bounds (ldvs,n) work : rank-1 array(‘D’) with bounds

(MAX(lwork,1)) info : int

Call-back functions:

def zselect(arg): return zselect Required arguments:

arg : input complex

Return objects:

zselect : int

scipy.linalg.lapack.zgeev = <fortran object>

zgeev - Function signature:

w,vl,vr,info = zgeev(a,[compute_vl,compute_vr,lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘D’) with bounds (n,n)

Optional arguments:

compute_vl := 1 input int compute_vr := 1 input int overwrite_a := 0 input int lwork := 2*n input

int

Return objects:

w : rank-1 array(‘D’) with bounds (n) vl : rank-2 array(‘D’) with bounds (ldvl,n) vr : rank-2

array(‘D’) with bounds (ldvr,n) info : int

scipy.linalg.lapack.zgegv = <fortran object>

zgegv - Function signature:

alpha,beta,vl,vr,info = zgegv(a,b,[compute_vl,compute_vr,lwork,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘D’) with bounds (n,n) b : input rank-2 array(‘D’) with bounds (n,n)

Optional arguments:

compute_vl := 1 input int compute_vr := 1 input int overwrite_a := 0 input int overwrite_b := 0

input int lwork := 2*n input int

Return objects:

alpha : rank-1 array(‘D’) with bounds (n) beta : rank-1 array(‘D’) with bounds (n) vl : rank-2

array(‘D’) with bounds (ldvl,n) vr : rank-2 array(‘D’) with bounds (ldvr,n) info : int

scipy.linalg.lapack.zgehrd = <fortran object>

zgehrd - Function signature:

ht,tau,info = zgehrd(a,[lo,hi,lwork,overwrite_a])

438 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Required arguments:

a : input rank-2 array(‘D’) with bounds (n,n)

Optional arguments:

lo := 0 input int hi := n-1 input int overwrite_a := 0 input int lwork := MAX(n,1) input int

Return objects:

ht : rank-2 array(‘D’) with bounds (n,n) and a storage tau : rank-1 array(‘D’) with bounds (n - 1)

info : int

scipy.linalg.lapack.zgelss = <fortran object>

zgelss - Function signature:

v,x,s,rank,work,info = zgelss(a,b,[cond,lwork,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘D’) with bounds (m,n) b : input rank-2 array(‘D’) with bounds

(maxmn,nrhs)

Optional arguments:

overwrite_a := 0 input int overwrite_b := 0 input int cond := -1.0 input ﬂoat lwork :=

2*minmn+MAX(maxmn,nrhs) input int

Return objects:

v : rank-2 array(‘D’) with bounds (m,n) and a storage x : rank-2 array(‘D’) with bounds

(maxmn,nrhs) and b storage s : rank-1 array(‘d’) with bounds (minmn) rank : int work : rank-1

array(‘D’) with bounds (MAX(lwork,1)) info : int

scipy.linalg.lapack.zgeqp3 = <fortran object>

zgeqp3 - Function signature:

qr,jpvt,tau,work,info = zgeqp3(a,[lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘D’) with bounds (m,n)

Optional arguments:

overwrite_a := 0 input int lwork := 3*(n+1) input int

Return objects:

qr : rank-2 array(‘D’) with bounds (m,n) and a storage jpvt : rank-1 array(‘i’) with bounds

(n) tau : rank-1 array(‘D’) with bounds (MIN(m,n)) work : rank-1 array(‘D’) with bounds

(MAX(lwork,1)) info : int

scipy.linalg.lapack.zgeqrf = <fortran object>

zgeqrf - Function signature:

qr,tau,work,info = zgeqrf(a,[lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘D’) with bounds (m,n)

Optional arguments:

overwrite_a := 0 input int lwork := 3*n input int

Return objects:

qr : rank-2 array(‘D’) with bounds (m,n) and a storage tau : rank-1 array(‘D’) with bounds

(MIN(m,n)) work : rank-1 array(‘D’) with bounds (MAX(lwork,1)) info : int

5.15. All functions 439

SciPy Reference Guide, Release 0.13.0

scipy.linalg.lapack.zgerqf = <fortran object>

zgerqf - Function signature:

qr,tau,work,info = zgerqf(a,[lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘D’) with bounds (m,n)

Optional arguments:

overwrite_a := 0 input int lwork := 3*m input int

Return objects:

qr : rank-2 array(‘D’) with bounds (m,n) and a storage tau : rank-1 array(‘D’) with bounds

(MIN(m,n)) work : rank-1 array(‘D’) with bounds (MAX(lwork,1)) info : int

scipy.linalg.lapack.zgesdd = <fortran object>

zgesdd - Function signature:

u,s,vt,info = zgesdd(a,[compute_uv,full_matrices,lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘D’) with bounds (m,n)

Optional arguments:

overwrite_a := 0 input int compute_uv := 1 input int full_matrices := 1 input int lwork := (com-

pute_uv?2*minmn*minmn+MAX(m,n)+2*minmn:2*minmn+MAX(m,n)) input int

Return objects:

u : rank-2 array(‘D’) with bounds (u0,u1) s : rank-1 array(‘d’) with bounds (minmn) vt : rank-2

array(‘D’) with bounds (vt0,vt1) info : int

scipy.linalg.lapack.zgesv = <fortran object>

zgesv - Function signature:

lu,piv,x,info = zgesv(a,b,[overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘D’) with bounds (n,n) b : input rank-2 array(‘D’) with bounds (n,nrhs)

Optional arguments:

overwrite_a := 0 input int overwrite_b := 0 input int

Return objects:

lu : rank-2 array(‘D’) with bounds (n,n) and a storage piv : rank-1 array(‘i’) with bounds (n) x :

rank-2 array(‘D’) with bounds (n,nrhs) and b storage info : int

scipy.linalg.lapack.zgetrf = <fortran object>

zgetrf - Function signature:

lu,piv,info = zgetrf(a,[overwrite_a])

Required arguments:

a : input rank-2 array(‘D’) with bounds (m,n)

Optional arguments:

overwrite_a := 0 input int

440 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Return objects:

lu : rank-2 array(‘D’) with bounds (m,n) and a storage piv : rank-1 array(‘i’) with bounds

(MIN(m,n)) info : int

scipy.linalg.lapack.zgetri = <fortran object>

zgetri - Function signature:

inv_a,info = zgetri(lu,piv,[lwork,overwrite_lu])

Required arguments:

lu : input rank-2 array(‘D’) with bounds (n,n) piv : input rank-1 array(‘i’) with bounds (n)

Optional arguments:

overwrite_lu := 0 input int lwork := 3*n input int

Return objects:

inv_a : rank-2 array(‘D’) with bounds (n,n) and lu storage info : int

scipy.linalg.lapack.zgetrs = <fortran object>

zgetrs - Function signature:

x,info = zgetrs(lu,piv,b,[trans,overwrite_b])

Required arguments:

lu : input rank-2 array(‘D’) with bounds (n,n) piv : input rank-1 array(‘i’) with bounds (n) b :

input rank-2 array(‘D’) with bounds (n,nrhs)

Optional arguments:

overwrite_b := 0 input int trans := 0 input int

Return objects:

x : rank-2 array(‘D’) with bounds (n,nrhs) and b storage info : int

scipy.linalg.lapack.zgges = <fortran object>

zgges - Function signature:

a,b,sdim,alpha,beta,vsl,vsr,work,info = zgges(zselect,a,b,[jobvsl,jobvsr,sort_t,ldvsl,ldvsr,lwork,zselect_extra_args,overwrite_a,overwrite_b])

Required arguments:

zselect : call-back function a : input rank-2 array(‘D’) with bounds (lda,*) b : input rank-2

array(‘D’) with bounds (ldb,*)

Optional arguments:

jobvsl := 1 input int jobvsr := 1 input int sort_t := 0 input int zselect_extra_args := () input tuple

overwrite_a := 0 input int overwrite_b := 0 input int ldvsl := ((jobvsl==1)?n:1) input int ldvsr :=

((jobvsr==1)?n:1) input int lwork := 2*n input int

Return objects:

a : rank-2 array(‘D’) with bounds (lda,*) b : rank-2 array(‘D’) with bounds (ldb,*) sdim : int

alpha : rank-1 array(‘D’) with bounds (n) beta : rank-1 array(‘D’) with bounds (n) vsl : rank-

2 array(‘D’) with bounds (ldvsl,n) vsr : rank-2 array(‘D’) with bounds (ldvsr,n) work : rank-1

array(‘D’) with bounds (MAX(lwork,1)) info : int

Call-back functions:

def zselect(alpha,beta): return zselect Required arguments:

alpha : input complex beta : input complex

5.15. All functions 441

SciPy Reference Guide, Release 0.13.0

Return objects:

zselect : int

scipy.linalg.lapack.zggev = <fortran object>

zggev - Function signature:

alpha,beta,vl,vr,work,info = zggev(a,b,[compute_vl,compute_vr,lwork,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘D’) with bounds (n,n) b : input rank-2 array(‘D’) with bounds (n,n)

Optional arguments:

compute_vl := 1 input int compute_vr := 1 input int overwrite_a := 0 input int overwrite_b := 0

input int lwork := 2*n input int

Return objects:

alpha : rank-1 array(‘D’) with bounds (n) beta : rank-1 array(‘D’) with bounds (n) vl : rank-

2 array(‘D’) with bounds (ldvl,n) vr : rank-2 array(‘D’) with bounds (ldvr,n) work : rank-1

array(‘D’) with bounds (MAX(lwork,1)) info : int

scipy.linalg.lapack.zhbevd = <fortran object>

zhbevd - Function signature:

w,z,info = zhbevd(ab,[compute_v,lower,ldab,lrwork,liwork,overwrite_ab])

Required arguments:

ab : input rank-2 array(‘D’) with bounds (ldab,*)

Optional arguments:

overwrite_ab := 1 input int compute_v := 1 input int lower := 0 input int ldab := shape(ab,0) input

int lrwork := (compute_v?1+5*n+2*n*n:n) input int liwork := (compute_v?3+5*n:1) input int

Return objects:

w : rank-1 array(‘d’) with bounds (n) z : rank-2 array(‘D’) with bounds (ldz,ldz) info : int

scipy.linalg.lapack.zhbevx = <fortran object>

zhbevx - Function signature:

w,z,m,ifail,info = zhbevx(ab,vl,vu,il,iu,[ldab,compute_v,range,lower,abstol,mmax,overwrite_ab])

Required arguments:

ab : input rank-2 array(‘D’) with bounds (ldab,*) vl : input ﬂoat vu : input ﬂoat il : input int iu :

input int

Optional arguments:

overwrite_ab := 1 input int ldab := shape(ab,0) input int compute_v := 1 input int range := 0 input

int lower := 0 input int abstol := 0.0 input ﬂoat mmax := (compute_v?(range==2?(iu-il+1):n):1)

input int

Return objects:

w : rank-1 array(‘d’) with bounds (n) z : rank-2 array(‘D’) with bounds (ldz,mmax) m : int ifail

: rank-1 array(‘i’) with bounds ((compute_v?n:1)) info : int

scipy.linalg.lapack.zheev = <fortran object>

zheev - Function signature:

w,v,info = zheev(a,[compute_v,lower,lwork,overwrite_a])

442 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Required arguments:

a : input rank-2 array(‘D’) with bounds (n,n)

Optional arguments:

compute_v := 1 input int lower := 0 input int overwrite_a := 0 input int lwork := 2*n-1 input int

Return objects:

w : rank-1 array(‘d’) with bounds (n) v : rank-2 array(‘D’) with bounds (n,n) and a storage info

: int

scipy.linalg.lapack.zheevd = <fortran object>

zheevd - Function signature:

w,v,info = zheevd(a,[compute_v,lower,lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘D’) with bounds (n,n)

Optional arguments:

compute_v := 1 input int lower := 0 input int overwrite_a := 0 input int lwork := (com-

pute_v?2*n+n*n:n+1) input int

Return objects:

w : rank-1 array(‘d’) with bounds (n) v : rank-2 array(‘D’) with bounds (n,n) and a storage info

: int

scipy.linalg.lapack.zheevr = <fortran object>

zheevr - Function signature:

w,z,info = zheevr(a,[jobz,range,uplo,il,iu,lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘D’) with bounds (n,n)

Optional arguments:

jobz := ‘V’ input string(len=1) range := ‘A’ input string(len=1) uplo := ‘L’ input string(len=1)

overwrite_a := 0 input int il := 1 input int iu := n input int lwork := 18*n input int

Return objects:

w : rank-1 array(‘d’) with bounds (n) z : rank-2 array(‘D’) with bounds (n,m) info : int

scipy.linalg.lapack.zhegv = <fortran object>

zhegv - Function signature:

a,w,info = zhegv(a,b,[itype,jobz,uplo,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘D’) with bounds (n,n) b : input rank-2 array(‘D’) with bounds (n,n)

Optional arguments:

itype := 1 input int jobz := ‘V’ input string(len=1) uplo := ‘L’ input string(len=1) overwrite_a :=

0 input int overwrite_b := 0 input int

Return objects:

a : rank-2 array(‘D’) with bounds (n,n) w : rank-1 array(‘d’) with bounds (n) info : int

scipy.linalg.lapack.zhegvd = <fortran object>

5.15. All functions 443

SciPy Reference Guide, Release 0.13.0

zhegvd - Function signature:

a,w,info = zhegvd(a,b,[itype,jobz,uplo,lwork,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘D’) with bounds (n,n) b : input rank-2 array(‘D’) with bounds (n,n)

Optional arguments:

itype := 1 input int jobz := ‘V’ input string(len=1) uplo := ‘L’ input string(len=1) overwrite_a :=

0 input int overwrite_b := 0 input int lwork := 2*n+n*n input int

Return objects:

a : rank-2 array(‘D’) with bounds (n,n) w : rank-1 array(‘d’) with bounds (n) info : int

scipy.linalg.lapack.zhegvx = <fortran object>

zhegvx - Function signature:

w,z,ifail,info = zhegvx(a,b,iu,[itype,jobz,uplo,il,lwork,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘D’) with bounds (n,n) b : input rank-2 array(‘D’) with bounds (n,n) iu :

input int

Optional arguments:

itype := 1 input int jobz := ‘V’ input string(len=1) uplo := ‘L’ input string(len=1) overwrite_a :=

0 input int overwrite_b := 0 input int il := 1 input int lwork := 18*n-1 input int

Return objects:

w : rank-1 array(‘d’) with bounds (n) z : rank-2 array(‘D’) with bounds (n,m) ifail : rank-1

array(‘i’) with bounds (n) info : int

scipy.linalg.lapack.zlaswp = <fortran object>

zlaswp - Function signature:

a = zlaswp(a,piv,[k1,k2,off,inc,overwrite_a])

Required arguments:

a : input rank-2 array(‘D’) with bounds (nrows,n) piv : input rank-1 array(‘i’) with bounds (*)

Optional arguments:

overwrite_a := 0 input int k1 := 0 input int k2 := len(piv)-1 input int off := 0 input int inc := 1

input int

Return objects:

a : rank-2 array(‘D’) with bounds (nrows,n)

scipy.linalg.lapack.zlauum = <fortran object>

zlauum - Function signature:

a,info = zlauum(c,[lower,overwrite_c])

Required arguments:

c : input rank-2 array(‘D’) with bounds (n,n)

Optional arguments:

overwrite_c := 0 input int lower := 0 input int

Return objects:

a : rank-2 array(‘D’) with bounds (n,n) and c storage info : int

444 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.linalg.lapack.zpbsv = <fortran object>

zpbsv - Function signature:

c,x,info = zpbsv(ab,b,[lower,ldab,overwrite_ab,overwrite_b])

Required arguments:

ab : input rank-2 array(‘D’) with bounds (ldab,n) b : input rank-2 array(‘D’) with bounds

(ldb,nrhs)

Optional arguments:

lower := 0 input int overwrite_ab := 0 input int ldab := shape(ab,0) input int overwrite_b := 0

input int

Return objects:

c : rank-2 array(‘D’) with bounds (ldab,n) and ab storage x : rank-2 array(‘D’) with bounds

(ldb,nrhs) and b storage info : int

scipy.linalg.lapack.zpbtrf = <fortran object>

zpbtrf - Function signature:

c,info = zpbtrf(ab,[lower,ldab,overwrite_ab])

Required arguments:

ab : input rank-2 array(‘D’) with bounds (ldab,n)

Optional arguments:

lower := 0 input int overwrite_ab := 0 input int ldab := shape(ab,0) input int

Return objects:

c : rank-2 array(‘D’) with bounds (ldab,n) and ab storage info : int

scipy.linalg.lapack.zpbtrs = <fortran object>

zpbtrs - Function signature:

x,info = zpbtrs(ab,b,[lower,ldab,overwrite_b])

Required arguments:

ab : input rank-2 array(‘D’) with bounds (ldab,n) b : input rank-2 array(‘D’) with bounds

(ldb,nrhs)

Optional arguments:

lower := 0 input int ldab := shape(ab,0) input int overwrite_b := 0 input int

Return objects:

x : rank-2 array(‘D’) with bounds (ldb,nrhs) and b storage info : int

scipy.linalg.lapack.zposv = <fortran object>

zposv - Function signature:

c,x,info = zposv(a,b,[lower,overwrite_a,overwrite_b])

Required arguments:

a : input rank-2 array(‘D’) with bounds (n,n) b : input rank-2 array(‘D’) with bounds (n,nrhs)

Optional arguments:

overwrite_a := 0 input int overwrite_b := 0 input int lower := 0 input int

5.15. All functions 445

SciPy Reference Guide, Release 0.13.0

Return objects:

c : rank-2 array(‘D’) with bounds (n,n) and a storage x : rank-2 array(‘D’) with bounds (n,nrhs)

and b storage info : int

scipy.linalg.lapack.zpotrf = <fortran object>

zpotrf - Function signature:

c,info = zpotrf(a,[lower,clean,overwrite_a])

Required arguments:

a : input rank-2 array(‘D’) with bounds (n,n)

Optional arguments:

overwrite_a := 0 input int lower := 0 input int clean := 1 input int

Return objects:

c : rank-2 array(‘D’) with bounds (n,n) and a storage info : int

scipy.linalg.lapack.zpotri = <fortran object>

zpotri - Function signature:

inv_a,info = zpotri(c,[lower,overwrite_c])

Required arguments:

c : input rank-2 array(‘D’) with bounds (n,n)

Optional arguments:

overwrite_c := 0 input int lower := 0 input int

Return objects:

inv_a : rank-2 array(‘D’) with bounds (n,n) and c storage info : int

scipy.linalg.lapack.zpotrs = <fortran object>

zpotrs - Function signature:

x,info = zpotrs(c,b,[lower,overwrite_b])

Required arguments:

c : input rank-2 array(‘D’) with bounds (n,n) b : input rank-2 array(‘D’) with bounds (n,nrhs)

Optional arguments:

overwrite_b := 0 input int lower := 0 input int

Return objects:

x : rank-2 array(‘D’) with bounds (n,nrhs) and b storage info : int

scipy.linalg.lapack.ztrsyl = <fortran object>

ztrsyl - Function signature:

x,scale,info = ztrsyl(a,b,c,[trana,tranb,isgn,overwrite_c])

Required arguments:

a : input rank-2 array(‘D’) with bounds (m,m) b : input rank-2 array(‘D’) with bounds (n,n) c :

input rank-2 array(‘D’) with bounds (m,n)

Optional arguments:

trana := ‘N’ input string(len=1) tranb := ‘N’ input string(len=1) isgn := 1 input int overwrite_c :=

0 input int

446 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Return objects:

x : rank-2 array(‘D’) with bounds (m,n) and c storage scale : ﬂoat info : int

scipy.linalg.lapack.ztrtri = <fortran object>

ztrtri - Function signature:

inv_c,info = ztrtri(c,[lower,unitdiag,overwrite_c])

Required arguments:

c : input rank-2 array(‘D’) with bounds (n,n)

Optional arguments:

overwrite_c := 0 input int lower := 0 input int unitdiag := 0 input int

Return objects:

inv_c : rank-2 array(‘D’) with bounds (n,n) and c storage info : int

scipy.linalg.lapack.ztrtrs = <fortran object>

ztrtrs - Function signature:

x,info = ztrtrs(a,b,[lower,trans,unitdiag,lda,overwrite_b])

Required arguments:

a : input rank-2 array(‘D’) with bounds (lda,n) b : input rank-2 array(‘D’) with bounds (ldb,nrhs)

Optional arguments:

lower := 0 input int trans := 0 input int unitdiag := 0 input int lda := shape(a,0) input int over-

write_b := 0 input int

Return objects:

x : rank-2 array(‘D’) with bounds (ldb,nrhs) and b storage info : int

scipy.linalg.lapack.zungqr = <fortran object>

zungqr - Function signature:

q,work,info = zungqr(a,tau,[lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘D’) with bounds (m,n) tau : input rank-1 array(‘D’) with bounds (k)

Optional arguments:

overwrite_a := 0 input int lwork := 3*n input int

Return objects:

q : rank-2 array(‘D’) with bounds (m,n) and a storage work : rank-1 array(‘D’) with bounds

(MAX(lwork,1)) info : int

scipy.linalg.lapack.zungrq = <fortran object>

zungrq - Function signature:

q,work,info = zungrq(a,tau,[lwork,overwrite_a])

Required arguments:

a : input rank-2 array(‘D’) with bounds (m,n) tau : input rank-1 array(‘D’) with bounds (k)

Optional arguments:

overwrite_a := 0 input int lwork := 3*m input int

5.15. All functions 447

SciPy Reference Guide, Release 0.13.0

Return objects:

q : rank-2 array(‘D’) with bounds (m,n) and a storage work : rank-1 array(‘D’) with bounds

(MAX(lwork,1)) info : int

scipy.linalg.lapack.zunmqr = <fortran object>

zunmqr - Function signature:

cq,work,info = zunmqr(side,trans,a,tau,c,lwork,[overwrite_c])

Required arguments:

side : input string(len=1) trans : input string(len=1) a : input rank-2 array(‘D’) with bounds

(lda,k) tau : input rank-1 array(‘D’) with bounds (k) c : input rank-2 array(‘D’) with bounds

(ldc,n) lwork : input int

Optional arguments:

overwrite_c := 0 input int

Return objects:

cq : rank-2 array(‘D’) with bounds (ldc,n) and c storage work : rank-1 array(‘D’) with bounds

(MAX(lwork,1)) info : int

5.16 Interpolative matrix decomposition (scipy.linalg.interpolative)

New in version 0.13. An interpolative decomposition (ID) of a matrix A∈Cm×nof rank k≤min{m, n}is a

factorization

AΠ = AΠ1AΠ2=AΠ1I T ,

where Π = [Π1,Π2]is a permutation matrix with Π1∈ {0,1}n×k, i.e., AΠ2=AΠ1T. This can equivalently be

written as A=BP , where B=AΠ1and P= [I, T ]ΠTare the skeleton and interpolation matrices, respectively.

If Adoes not have exact rank k, then there exists an approximation in the form of an ID such that A=BP +E,

where kEk ∼ σk+1 is on the order of the (k+ 1)-th largest singular value of A. Note that σk+1 is the best possible

error for a rank-kapproximation and, in fact, is achieved by the singular value decomposition (SVD) A≈U SV ∗,

where U∈Cm×kand V∈Cn×khave orthonormal columns and S= diag(σi)∈Ck×kis diagonal with nonnegative

entries. The principal advantages of using an ID over an SVD are that:

• it is cheaper to construct;

• it preserves the structure of A; and

• it is more efﬁcient to compute with in light of the identity submatrix of P.

5.16.1 Routines

Main functionality:

interp_decomp(A, eps_or_k[, rand]) Compute ID of a matrix.

reconstruct_matrix_from_id(B, idx, proj) Reconstruct matrix from its ID.

reconstruct_interp_matrix(idx, proj) Reconstruct interpolation matrix from ID.

reconstruct_skel_matrix(A, k, idx) Reconstruct skeleton matrix from ID.

id_to_svd(B, idx, proj) Convert ID to SVD.

svd(A, eps_or_k[, rand]) Compute SVD of a matrix via an ID.

Continued on next page

448 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Table 5.75 – continued from previous page

estimate_spectral_norm(A[, its]) Estimate spectral norm of a matrix by the randomized power method.

estimate_spectral_norm_diff(A, B[, its]) Estimate spectral norm of the difference of two matrices by the randomized power method.

estimate_rank(A, eps) Estimate matrix rank to a speciﬁed relative precision using randomized methods.

scipy.linalg.interpolative.interp_decomp(A,eps_or_k,rand=True)

Compute ID of a matrix.

An ID of a matrix Ais a factorization deﬁned by a rank k, a column index array idx, and interpolation coefﬁcients

proj such that:

numpy.dot(A[:,idx[:k]], proj) =A[:,idx[k:]]

The original matrix can then be reconstructed as:

numpy.hstack([A[:,idx[:k]],

numpy.dot(A[:,idx[:k]], proj)]

)[:,numpy.argsort(idx)]

or via the routine reconstruct_matrix_from_id. This can equivalently be written as:

numpy.dot(A[:,idx[:k]],

numpy.hstack([numpy.eye(k), proj])

)[:,np.argsort(idx)]

in terms of the skeleton and interpolation matrices:

B=A[:,idx[:k]]

and:

P=numpy.hstack([numpy.eye(k), proj])[:,np.argsort(idx)]

respectively. See also reconstruct_interp_matrix and reconstruct_skel_matrix.

The ID can be computed to any relative precision or rank (depending on the value of eps_or_k). If a precision

is speciﬁed (eps_or_k < 1), then this function has the output signature:

k, idx, proj =interp_decomp(A, eps_or_k)

Otherwise, if a rank is speciﬁed (eps_or_k >= 1), then the output signature is:

idx, proj =interp_decomp(A, eps_or_k)

Parameters A:numpy.ndarray or scipy.sparse.linalg.LinearOperator with

rmatvec

Matrix to be factored

eps_or_k : ﬂoat or int

Relative error (if eps_or_k < 1) or rank (if eps_or_k >= 1) of approximation.

rand : bool, optional

Whether to use random sampling if Ais of type numpy.ndarray

(randomized algorithms are always used if Ais of type

scipy.sparse.linalg.LinearOperator).

Returns k: int

Rank required to achieve speciﬁed relative precision if eps_or_k < 1.

idx :numpy.ndarray

Column index array.

proj :numpy.ndarray

5.16. Interpolative matrix decomposition (scipy.linalg.interpolative) 449

SciPy Reference Guide, Release 0.13.0

Interpolation coefﬁcients.

scipy.linalg.interpolative.reconstruct_matrix_from_id(B,idx,proj)

Reconstruct matrix from its ID.

A matrix Awith skeleton matrix Band ID indices and coefﬁcients idx and proj, respectively, can be reconstructed

as:

numpy.hstack([B, numpy.dot(B, proj)])[:,numpy.argsort(idx)]

See also reconstruct_interp_matrix and reconstruct_skel_matrix.

Parameters B:numpy.ndarray

Skeleton matrix.

idx :numpy.ndarray

Column index array.

proj :numpy.ndarray

Interpolation coefﬁcients.

Returns numpy.ndarray

Reconstructed matrix.

scipy.linalg.interpolative.reconstruct_interp_matrix(idx,proj)

Reconstruct interpolation matrix from ID.

The interpolation matrix can be reconstructed from the ID indices and coefﬁcients idx and proj, respectively, as:

P=numpy.hstack([numpy.eye(proj.shape[0]), proj])[:,numpy.argsort(idx)]

The original matrix can then be reconstructed from its skeleton matrix Bvia:

numpy.dot(B, P)

See also reconstruct_matrix_from_id and reconstruct_skel_matrix.

Parameters idx :numpy.ndarray

Column index array.

proj :numpy.ndarray

Interpolation coefﬁcients.

Returns numpy.ndarray

Interpolation matrix.

scipy.linalg.interpolative.reconstruct_skel_matrix(A,k,idx)

Reconstruct skeleton matrix from ID.

The skeleton matrix can be reconstructed from the original matrix Aand its ID rank and indices kand idx,

respectively, as:

B=A[:,idx[:k]]

The original matrix can then be reconstructed via:

numpy.hstack([B, numpy.dot(B, proj)])[:,numpy.argsort(idx)]

See also reconstruct_matrix_from_id and reconstruct_interp_matrix.

Parameters A:numpy.ndarray

Original matrix.

k: int

Rank of ID.

idx :numpy.ndarray

Column index array.

450 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Returns numpy.ndarray

Skeleton matrix.

scipy.linalg.interpolative.id_to_svd(B,idx,proj)

Convert ID to SVD.

The SVD reconstruction of a matrix with skeleton matrix Band ID indices and coefﬁcients idx and proj, respec-

tively, is:

U,S,V=id_to_svd(B, idx, proj)

A=numpy.dot(U, numpy.dot(numpy.diag(S), V.conj().T))

See Also

source,lookfor

Notes

When used interactively with an object, np.info(obj) is equivalent to help(obj) on the Python prompt

or obj? on the IPython prompt.

Examples

>>> np.info(np.polyval)

polyval(p, x)

Evaluate the polynomial p at x.

...

When using a string for object it is possible to get multiple results.

>>> np.info(’fft’)

*** Found in numpy ***

Core FFT routines

...

*** Found in numpy.fft ***

fft(a, n=None, axis=-1)

...

*** Repeat reference found in numpy.fft.fftpack ***

*** Total of 3 references found. ***

scipy.misc.lena()

Get classic image processing example image, Lena, at 8-bit grayscale bit-depth, 512 x 512 size.

Parameters None

Returns lena : ndarray

Lena image

Examples

>>> import scipy.misc

>>> lena =scipy.misc.lena()

>>> lena.shape

(512, 512)

>>> lena.max()

245

>>> lena.dtype

dtype(’int32’)

>>> import matplotlib.pyplot as plt

>>> plt.gray()

5.17. Miscellaneous routines (scipy.misc) 463

SciPy Reference Guide, Release 0.13.0

>>> plt.imshow(lena)

>>> plt.show()

scipy.misc.logsumexp(a,axis=None,b=None)

Compute the log of the sum of exponentials of input elements.

Parameters a: array_like

Input array.

axis : int, optional

Axis over which the sum is taken. By default axis is None, and all elements are

summed. New in version 0.11.0.

b: array-like, optional

Scaling factor for exp(a) must be of the same shape as aor broadcastable to a. New

in version 0.12.0.

Returns res : ndarray

The result, np.log(np.sum(np.exp(a))) calculated in a numerically more

stable way. If bis given then np.log(np.sum(b*np.exp(a))) is returned.

See Also

numpy.logaddexp,numpy.logaddexp2

Notes

Numpy has a logaddexp function which is very similar to logsumexp, but only handles two arguments. lo-

gaddexp.reduce is similar to this function, but may be less stable.

Examples

>>> from scipy.misc import logsumexp

>>> a=np.arange(10)

>>> np.log(np.sum(np.exp(a)))

9.4586297444267107

>>> logsumexp(a)

9.4586297444267107

With weights

464 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

>>> a=np.arange(10)

>>> b=np.arange(10,0,-1)

>>> logsumexp(a, b=b)

9.9170178533034665

>>> np.log(np.sum(b*np.exp(a)))

9.9170178533034647

scipy.misc.pade(an,m)

Return Pade approximation to a polynomial as the ratio of two polynomials.

Parameters an : (N,) array_like

Taylor series coefﬁcients.

m: int

The order of the returned approximating polynomials.

Returns p, q : Polynomial class

The pade approximation of the polynomial deﬁned by an is p(x)/q(x).

Examples

>>> from scipy import misc

>>> e_exp =[1.0,1.0,1.0/2.0,1.0/6.0,1.0/24.0,1.0/120.0]

>>> p,q=misc.pade(e_exp, 2)

>>> e_exp.reverse()

>>> e_poly =np.poly1d(e_exp)

Compare e_poly(x) and the pade approximation p(x)/q(x)

>>> e_poly(1)

2.7166666666666668

>>> p(1)/q(1)

2.7179487179487181

scipy.misc.toimage(arr,high=255,low=0,cmin=None,cmax=None,pal=None,mode=None,chan-

nel_axis=None)

Takes a numpy array and returns a PIL image.

The mode of the PIL image depends on the array shape and the pal and mode keywords.

For 2-D arrays, if pal is a valid (N,3) byte-array giving the RGB values (from 0 to 255) then mode=’P’,

otherwise mode=’L’, unless mode is given as ‘F’ or ‘I’ in which case a ﬂoat and/or integer array is made.

Notes

For 3-D arrays, the channel_axis argument tells which dimension of the array holds the channel data.

For 3-D arrays if one of the dimensions is 3, the mode is ‘RGB’ by default or ‘YCbCr’ if selected.

The numpy array must be either 2 dimensional or 3 dimensional.

scipy.misc.who(vardict=None)

Print the Numpy arrays in the given dictionary.

If there is no dictionary passed in or vardict is None then returns Numpy arrays in the globals() dictionary (all

Numpy arrays in the namespace).

Parameters vardict : dict, optional

A dictionary possibly containing ndarrays. Default is globals().

Returns out : None

Returns ‘None’.

5.17. Miscellaneous routines (scipy.misc) 465

SciPy Reference Guide, Release 0.13.0

Notes

Prints out the name, shape, bytes and type of all of the ndarrays present in vardict.

Examples

>>> a=np.arange(10)

>>> b=np.ones(20)

>>> np.who()

Name Shape Bytes Type

===========================================================

a 10 40 int32

b 20 160 float64

Upper bound on total bytes = 200

>>> d={’x’: np.arange(2.0), ’y’: np.arange(3.0), ’txt’:’Some str’,

... ’idx’:5}

>>> np.who(d)

Name Shape Bytes Type

===========================================================

y 3 24 float64

x 2 16 float64

Upper bound on total bytes = 40

5.18 Multi-dimensional image processing (scipy.ndimage)

This package contains various functions for multi-dimensional image processing.

5.18.1 Filters scipy.ndimage.filters

convolve(input, weights[, output, mode, ...]) Multidimensional convolution.

convolve1d(input, weights[, axis, output, ...]) Calculate a one-dimensional convolution along the given axis.

correlate(input, weights[, output, mode, ...]) Multi-dimensional correlation.

correlate1d(input, weights[, axis, output, ...]) Calculate a one-dimensional correlation along the given axis.

gaussian_filter(input, sigma[, order, ...]) Multidimensional Gaussian ﬁlter.

gaussian_filter1d(input, sigma[, axis, ...]) One-dimensional Gaussian ﬁlter.

gaussian_gradient_magnitude(input, sigma[, ...]) Multidimensional gradient magnitude using Gaussian derivatives.

gaussian_laplace(input, sigma[, output, ...]) Multidimensional Laplace ﬁlter using gaussian second derivatives.

generic_filter(input, function[, size, ...]) Calculates a multi-dimensional ﬁlter using the given function.

generic_filter1d(input, function, ﬁlter_size) Calculate a one-dimensional ﬁlter along the given axis.

generic_gradient_magnitude(input, derivative) Gradient magnitude using a provided gradient function.

generic_laplace(input, derivative2[, ...]) N-dimensional Laplace ﬁlter using a provided second derivative function

laplace(input[, output, mode, cval]) N-dimensional Laplace ﬁlter based on approximate second derivatives.

maximum_filter(input[, size, footprint, ...]) Calculates a multi-dimensional maximum ﬁlter.

maximum_filter1d(input, size[, axis, ...]) Calculate a one-dimensional maximum ﬁlter along the given axis.

median_filter(input[, size, footprint, ...]) Calculates a multidimensional median ﬁlter.

minimum_filter(input[, size, footprint, ...]) Calculates a multi-dimensional minimum ﬁlter.

minimum_filter1d(input, size[, axis, ...]) Calculate a one-dimensional minimum ﬁlter along the given axis.

percentile_filter(input, percentile[, size, ...]) Calculates a multi-dimensional percentile ﬁlter.

prewitt(input[, axis, output, mode, cval]) Calculate a Prewitt ﬁlter.

rank_filter(input, rank[, size, footprint, ...]) Calculates a multi-dimensional rank ﬁlter.

Continued on next page

466 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Table 5.78 – continued from previous page

sobel(input[, axis, output, mode, cval]) Calculate a Sobel ﬁlter.

uniform_filter(input[, size, output, mode, ...]) Multi-dimensional uniform ﬁlter.

uniform_filter1d(input, size[, axis, ...]) Calculate a one-dimensional uniform ﬁlter along the given axis.

scipy.ndimage.filters.convolve(input,weights,output=None,mode=’reﬂect’,cval=0.0,ori-

gin=0)

Multidimensional convolution.

The array is convolved with the given kernel.

Parameters input : array_like

Input array to ﬁlter.

weights : array_like

Array of weights, same number of dimensions as input

output : ndarray, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’,’constant’,’nearest’,’mirror’, ‘wrap’}, optional

the mode parameter determines how the array borders are handled. For ‘constant’

mode, values beyond borders are set to be cval. Default is ‘reﬂect’.

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

origin : array_like, optional

The origin parameter controls the placement of the ﬁlter. Default is 0.

Returns result : ndarray

The result of convolution of input with weights.

See Also

correlate Correlate an image with a kernel.

Notes

Each value in result is Ci=PjIi+j−kWj, where W is the weights kernel, j is the n-D spatial index over W, I

is the input and k is the coordinate of the center of W, speciﬁed by origin in the input parameters.

Examples

Perhaps the simplest case to understand is mode=’constant’, cval=0.0, because in this case borders

(i.e. where the weights kernel, centered on any one value, extends beyond an edge of input.

>>> a=np.array([[1,2,0,0],

.... [5, 3, 0, 4],

.... [0, 0, 0, 7],

.... [9, 3, 0, 0]])

>>> k=np.array([[1,1,1],[1,1,0],[1,0,0]])

>>> from scipy import ndimage

>>> ndimage.convolve(a, k, mode=’constant’, cval=0.0)

array([[11, 10, 7, 4],

[10, 3, 11, 11],

[15, 12, 14, 7],

[12, 3, 7, 0]])

Setting cval=1.0 is equivalent to padding the outer edge of input with 1.0’s (and then extracting only the

original region of the result).

5.18. Multi-dimensional image processing (scipy.ndimage) 467

SciPy Reference Guide, Release 0.13.0

>>> ndimage.convolve(a, k, mode=’constant’, cval=1.0)

array([[13, 11, 8, 7],

[11, 3, 11, 14],

[16, 12, 14, 10],

[15, 6, 10, 5]])

With mode=’reflect’ (the default), outer values are reﬂected at the edge of input to ﬁll in missing values.

>>> b=np.array([[2,0,0],

[1, 0, 0],

[0, 0, 0]])

>>> k=np.array([[0,1,0],[0,1,0],[0,1,0]])

>>> ndimage.convolve(b, k, mode=’reflect’)

array([[5, 0, 0],

[3, 0, 0],

[1, 0, 0]])

This includes diagonally at the corners.

>>> k=np.array([[1,0,0],[0,1,0],[0,0,1]])

>>> ndimage.convolve(b, k)

array([[4, 2, 0],

[3, 2, 0],

[1, 1, 0]])

With mode=’nearest’, the single nearest value in to an edge in input is repeated as many times as needed

to match the overlapping weights.

>>> c=np.array([[2,0,1],

[1, 0, 0],

[0, 0, 0]])

>>> k=np.array([[0,1,0],

[0, 1, 0],

[0, 1, 0]])

>>> ndimage.convolve(c, k, mode=’nearest’)

array([[7, 0, 3],

[5, 0, 2],

[3, 0, 1]])

scipy.ndimage.filters.convolve1d(input,weights,axis=-1,output=None,mode=’reﬂect’,

cval=0.0,origin=0)

Calculate a one-dimensional convolution along the given axis.

The lines of the array along the given axis are convolved with the given weights.

Parameters input : array_like

Input array to ﬁlter.

weights : ndarray

One-dimensional sequence of numbers.

axis : int, optional

The axis of input along which to calculate. Default is -1.

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

468 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

origin : scalar, optional

The origin parameter controls the placement of the ﬁlter. Default 0.0.

Returns convolve1d : ndarray

Convolved array with same shape as input

scipy.ndimage.filters.correlate(input,weights,output=None,mode=’reﬂect’,cval=0.0,ori-

gin=0)

Multi-dimensional correlation.

The array is correlated with the given kernel.

Parameters input : array-like

input array to ﬁlter

weights : ndarray

array of weights, same number of dimensions as input

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’,’constant’,’nearest’,’mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is

the value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

origin : scalar, optional

The origin parameter controls the placement of the ﬁlter. Default 0

See Also

convolve Convolve an image with a kernel.

scipy.ndimage.filters.correlate1d(input,weights,axis=-1,output=None,mode=’reﬂect’,

cval=0.0,origin=0)

Calculate a one-dimensional correlation along the given axis.

The lines of the array along the given axis are correlated with the given weights.

Parameters input : array_like

Input array to ﬁlter.

weights : array

One-dimensional sequence of numbers.

axis : int, optional

The axis of input along which to calculate. Default is -1.

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

origin : scalar, optional

The origin parameter controls the placement of the ﬁlter. Default 0.0.

scipy.ndimage.filters.gaussian_filter(input,sigma,order=0,output=None,mode=’reﬂect’,

cval=0.0)

Multidimensional Gaussian ﬁlter.

Parameters input : array_like

Input array to ﬁlter.

sigma : scalar or sequence of scalars

5.18. Multi-dimensional image processing (scipy.ndimage) 469

SciPy Reference Guide, Release 0.13.0

Standard deviation for Gaussian kernel. The standard deviations of the Gaussian ﬁlter

are given for each axis as a sequence, or as a single number, in which case it is equal

for all axes.

order : {0, 1, 2, 3} or sequence from same set, optional

The order of the ﬁlter along each axis is given as a sequence of integers, or as a single

number. An order of 0 corresponds to convolution with a Gaussian kernel. An order

of 1, 2, or 3 corresponds to convolution with the ﬁrst, second or third derivatives of a

Gaussian. Higher order derivatives are not implemented

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

Returns gaussian_ﬁlter : ndarray

Returned array of same shape as input.

Notes

The multidimensional ﬁlter is implemented as a sequence of one-dimensional convolution ﬁlters. The interme-

diate arrays are stored in the same data type as the output. Therefore, for output types with a limited precision,

the results may be imprecise because intermediate results may be stored with insufﬁcient precision.

scipy.ndimage.filters.gaussian_filter1d(input,sigma,axis=-1,order=0,output=None,

mode=’reﬂect’,cval=0.0)

One-dimensional Gaussian ﬁlter.

Parameters input : array_like

Input array to ﬁlter.

sigma : scalar

standard deviation for Gaussian kernel

axis : int, optional

The axis of input along which to calculate. Default is -1.

order : {0, 1, 2, 3}, optional

An order of 0 corresponds to convolution with a Gaussian kernel. An order of 1, 2, or

3 corresponds to convolution with the ﬁrst, second or third derivatives of a Gaussian.

Higher order derivatives are not implemented

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

Returns gaussian_ﬁlter1d : ndarray

scipy.ndimage.filters.gaussian_gradient_magnitude(input,sigma,output=None,

mode=’reﬂect’,cval=0.0)

Multidimensional gradient magnitude using Gaussian derivatives.

Parameters input : array_like

Input array to ﬁlter.

sigma : scalar or sequence of scalars

The standard deviations of the Gaussian ﬁlter are given for each axis as a sequence, or

as a single number, in which case it is equal for all axes..

output : array, optional

470 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

scipy.ndimage.filters.gaussian_laplace(input,sigma,output=None,mode=’reﬂect’,

cval=0.0)

Multidimensional Laplace ﬁlter using gaussian second derivatives.

Parameters input : array_like

Input array to ﬁlter.

sigma : scalar or sequence of scalars

The standard deviations of the Gaussian ﬁlter are given for each axis as a sequence, or

as a single number, in which case it is equal for all axes.

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

scipy.ndimage.filters.generic_filter(input,function,size=None,footprint=None,out-

put=None,mode=’reﬂect’,cval=0.0,origin=0,ex-

tra_arguments=(),extra_keywords=None)

Calculates a multi-dimensional ﬁlter using the given function.

At each element the provided function is called. The input values within the ﬁlter footprint at that element are

passed to the function as a 1D array of double values.

Parameters input : array_like

Input array to ﬁlter.

function : callable

Function to apply at each element.

size : scalar or tuple, optional

See footprint, below

footprint : array, optional

Either size or footprint must be deﬁned. size gives the shape that is taken from the

input array, at every element position, to deﬁne the input to the ﬁlter function. foot-

print is a boolean array that speciﬁes (implicitly) a shape, but also which of the el-

ements within this shape will get passed to the ﬁlter function. Thus size=(n,m)

is equivalent to footprint=np.ones((n,m)). We adjust size to the number of

dimensions of the input array, so that, if the input array is shape (10,10,10), and size

is 2, then the actual size used is (2,2,2).

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

origin : scalar, optional

The origin parameter controls the placement of the ﬁlter. Default 0.0.

extra_arguments : sequence, optional

Sequence of extra positional arguments to pass to passed function

5.18. Multi-dimensional image processing (scipy.ndimage) 471

SciPy Reference Guide, Release 0.13.0

extra_keywords : dict, optional

dict of extra keyword arguments to pass to passed function

scipy.ndimage.filters.generic_filter1d(input,function,ﬁlter_size,axis=-1,out-

put=None,mode=’reﬂect’,cval=0.0,origin=0,

extra_arguments=(),extra_keywords=None)

Calculate a one-dimensional ﬁlter along the given axis.

generic_filter1d iterates over the lines of the array, calling the given function at each line. The arguments

of the line are the input line, and the output line. The input and output lines are 1D double arrays. The input line

is extended appropriately according to the ﬁlter size and origin. The output line must be modiﬁed in-place with

the result.

Parameters input : array_like

Input array to ﬁlter.

function : callable

Function to apply along given axis.

ﬁlter_size : scalar

Length of the ﬁlter.

axis : int, optional

The axis of input along which to calculate. Default is -1.

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

origin : scalar, optional

The origin parameter controls the placement of the ﬁlter. Default 0.0.

extra_arguments : sequence, optional

Sequence of extra positional arguments to pass to passed function

extra_keywords : dict, optional

dict of extra keyword arguments to pass to passed function

scipy.ndimage.filters.generic_gradient_magnitude(input,derivative,output=None,

mode=’reﬂect’,cval=0.0,

extra_arguments=(),ex-

tra_keywords=None)

Gradient magnitude using a provided gradient function.

Parameters input : array_like

Input array to ﬁlter.

derivative : callable

Callable with the following signature:

derivative(input, axis, output, mode, cval,

*extra_arguments, **extra_keywords)

See extra_arguments,extra_keywords below. derivative can assume that input and

output are ndarrays. Note that the output from derivative is modiﬁed inplace; be

careful to copy important inputs before returning them.

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

472 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

extra_keywords : dict, optional

dict of extra keyword arguments to pass to passed function

extra_arguments : sequence, optional

Sequence of extra positional arguments to pass to passed function

scipy.ndimage.filters.generic_laplace(input,derivative2,output=None,mode=’reﬂect’,

cval=0.0,extra_arguments=(),ex-

tra_keywords=None)

N-dimensional Laplace ﬁlter using a provided second derivative function

Parameters input : array_like

Input array to ﬁlter.

derivative2 : callable

Callable with the following signature:

derivative2(input, axis, output, mode, cval,

*extra_arguments, **extra_keywords)

See extra_arguments,extra_keywords below.

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

extra_keywords : dict, optional

dict of extra keyword arguments to pass to passed function

extra_arguments : sequence, optional

Sequence of extra positional arguments to pass to passed function

scipy.ndimage.filters.laplace(input,output=None,mode=’reﬂect’,cval=0.0)

N-dimensional Laplace ﬁlter based on approximate second derivatives.

Parameters input : array_like

Input array to ﬁlter.

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

scipy.ndimage.filters.maximum_filter(input,size=None,footprint=None,output=None,

mode=’reﬂect’,cval=0.0,origin=0)

Calculates a multi-dimensional maximum ﬁlter.

Parameters input : array_like

Input array to ﬁlter.

size : scalar or tuple, optional

See footprint, below

footprint : array, optional

Either size or footprint must be deﬁned. size gives the shape that is taken from the

input array, at every element position, to deﬁne the input to the ﬁlter function. foot-

print is a boolean array that speciﬁes (implicitly) a shape, but also which of the el-

ements within this shape will get passed to the ﬁlter function. Thus size=(n,m)

5.18. Multi-dimensional image processing (scipy.ndimage) 473

SciPy Reference Guide, Release 0.13.0

is equivalent to footprint=np.ones((n,m)). We adjust size to the number of

dimensions of the input array, so that, if the input array is shape (10,10,10), and size

is 2, then the actual size used is (2,2,2).

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

origin : scalar, optional

The origin parameter controls the placement of the ﬁlter. Default 0.0.

scipy.ndimage.filters.maximum_filter1d(input,size,axis=-1,output=None,mode=’reﬂect’,

cval=0.0,origin=0)

Calculate a one-dimensional maximum ﬁlter along the given axis.

The lines of the array along the given axis are ﬁltered with a maximum ﬁlter of given size.

Parameters input : array_like

Input array to ﬁlter.

size : int

Length along which to calculate the 1-D maximum.

axis : int, optional

The axis of input along which to calculate. Default is -1.

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

origin : scalar, optional

The origin parameter controls the placement of the ﬁlter. Default 0.0.

scipy.ndimage.filters.median_filter(input,size=None,footprint=None,output=None,

mode=’reﬂect’,cval=0.0,origin=0)

Calculates a multidimensional median ﬁlter.

Parameters input : array_like

Input array to ﬁlter.

size : scalar or tuple, optional

See footprint, below

footprint : array, optional

Either size or footprint must be deﬁned. size gives the shape that is taken from the

input array, at every element position, to deﬁne the input to the ﬁlter function. foot-

print is a boolean array that speciﬁes (implicitly) a shape, but also which of the el-

ements within this shape will get passed to the ﬁlter function. Thus size=(n,m)

is equivalent to footprint=np.ones((n,m)). We adjust size to the number of

dimensions of the input array, so that, if the input array is shape (10,10,10), and size

is 2, then the actual size used is (2,2,2).

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

474 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

origin : scalar, optional

The origin parameter controls the placement of the ﬁlter. Default 0.0.

Returns median_ﬁlter : ndarray

Return of same shape as input.

scipy.ndimage.filters.minimum_filter(input,size=None,footprint=None,output=None,

mode=’reﬂect’,cval=0.0,origin=0)

Calculates a multi-dimensional minimum ﬁlter.

Parameters input : array_like

Input array to ﬁlter.

size : scalar or tuple, optional

See footprint, below

footprint : array, optional

Either size or footprint must be deﬁned. size gives the shape that is taken from the

input array, at every element position, to deﬁne the input to the ﬁlter function. foot-

print is a boolean array that speciﬁes (implicitly) a shape, but also which of the el-

ements within this shape will get passed to the ﬁlter function. Thus size=(n,m)

is equivalent to footprint=np.ones((n,m)). We adjust size to the number of

dimensions of the input array, so that, if the input array is shape (10,10,10), and size

is 2, then the actual size used is (2,2,2).

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

origin : scalar, optional

The origin parameter controls the placement of the ﬁlter. Default 0.0.

scipy.ndimage.filters.minimum_filter1d(input,size,axis=-1,output=None,mode=’reﬂect’,

cval=0.0,origin=0)

Calculate a one-dimensional minimum ﬁlter along the given axis.

The lines of the array along the given axis are ﬁltered with a minimum ﬁlter of given size.

Parameters input : array_like

Input array to ﬁlter.

size : int

length along which to calculate 1D minimum

axis : int, optional

The axis of input along which to calculate. Default is -1.

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

origin : scalar, optional

The origin parameter controls the placement of the ﬁlter. Default 0.0.

scipy.ndimage.filters.percentile_filter(input,percentile,size=None,footprint=None,out-

put=None,mode=’reﬂect’,cval=0.0,origin=0)

Calculates a multi-dimensional percentile ﬁlter.

5.18. Multi-dimensional image processing (scipy.ndimage) 475

SciPy Reference Guide, Release 0.13.0

Parameters input : array_like

Input array to ﬁlter.

percentile : scalar

The percentile parameter may be less then zero, i.e., percentile = -20 equals percentile

= 80

size : scalar or tuple, optional

See footprint, below

footprint : array, optional

Either size or footprint must be deﬁned. size gives the shape that is taken from the

input array, at every element position, to deﬁne the input to the ﬁlter function. foot-

print is a boolean array that speciﬁes (implicitly) a shape, but also which of the el-

ements within this shape will get passed to the ﬁlter function. Thus size=(n,m)

is equivalent to footprint=np.ones((n,m)). We adjust size to the number of

dimensions of the input array, so that, if the input array is shape (10,10,10), and size

is 2, then the actual size used is (2,2,2).

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

origin : scalar, optional

The origin parameter controls the placement of the ﬁlter. Default 0.0.

scipy.ndimage.filters.prewitt(input,axis=-1,output=None,mode=’reﬂect’,cval=0.0)

Calculate a Prewitt ﬁlter.

Parameters input : array_like

Input array to ﬁlter.

axis : int, optional

The axis of input along which to calculate. Default is -1.

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

scipy.ndimage.filters.rank_filter(input,rank,size=None,footprint=None,output=None,

mode=’reﬂect’,cval=0.0,origin=0)

Calculates a multi-dimensional rank ﬁlter.

Parameters input : array_like

Input array to ﬁlter.

rank : integer

The rank parameter may be less then zero, i.e., rank = -1 indicates the largest element.

size : scalar or tuple, optional

See footprint, below

footprint : array, optional

Either size or footprint must be deﬁned. size gives the shape that is taken from the

input array, at every element position, to deﬁne the input to the ﬁlter function. foot-

print is a boolean array that speciﬁes (implicitly) a shape, but also which of the el-

ements within this shape will get passed to the ﬁlter function. Thus size=(n,m)

is equivalent to footprint=np.ones((n,m)). We adjust size to the number of

476 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

dimensions of the input array, so that, if the input array is shape (10,10,10), and size

is 2, then the actual size used is (2,2,2).

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

origin : scalar, optional

The origin parameter controls the placement of the ﬁlter. Default 0.0.

scipy.ndimage.filters.sobel(input,axis=-1,output=None,mode=’reﬂect’,cval=0.0)

Calculate a Sobel ﬁlter.

Parameters input : array_like

Input array to ﬁlter.

axis : int, optional

The axis of input along which to calculate. Default is -1.

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

scipy.ndimage.filters.uniform_filter(input,size=3,output=None,mode=’reﬂect’,cval=0.0,

origin=0)

Multi-dimensional uniform ﬁlter.

Parameters input : array_like

Input array to ﬁlter.

size : int or sequence of ints

The sizes of the uniform ﬁlter are given for each axis as a sequence, or as a single

number, in which case the size is equal for all axes.

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

origin : scalar, optional

The origin parameter controls the placement of the ﬁlter. Default 0.0.

Notes

The multi-dimensional ﬁlter is implemented as a sequence of one-dimensional uniform ﬁlters. The intermediate

arrays are stored in the same data type as the output. Therefore, for output types with a limited precision, the

results may be imprecise because intermediate results may be stored with insufﬁcient precision.

scipy.ndimage.filters.uniform_filter1d(input,size,axis=-1,output=None,mode=’reﬂect’,

cval=0.0,origin=0)

Calculate a one-dimensional uniform ﬁlter along the given axis.

The lines of the array along the given axis are ﬁltered with a uniform ﬁlter of given size.

5.18. Multi-dimensional image processing (scipy.ndimage) 477

SciPy Reference Guide, Release 0.13.0

Parameters input : array_like

Input array to ﬁlter.

size : integer

length of uniform ﬁlter

axis : int, optional

The axis of input along which to calculate. Default is -1.

output : array, optional

The output parameter passes an array in which to store the ﬁlter output.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

origin : scalar, optional

The origin parameter controls the placement of the ﬁlter. Default 0.0.

5.18.2 Fourier ﬁlters scipy.ndimage.fourier

fourier_ellipsoid(input, size[, n, axis, output]) Multi-dimensional ellipsoid fourier ﬁlter.

fourier_gaussian(input, sigma[, n, axis, output]) Multi-dimensional Gaussian fourier ﬁlter.

fourier_shift(input, shift[, n, axis, output]) Multi-dimensional fourier shift ﬁlter.

fourier_uniform(input, size[, n, axis, output]) Multi-dimensional uniform fourier ﬁlter.

scipy.ndimage.fourier.fourier_ellipsoid(input,size,n=-1,axis=-1,output=None)

Multi-dimensional ellipsoid fourier ﬁlter.

The array is multiplied with the fourier transform of a ellipsoid of given sizes.

Parameters input : array_like

The input array.

size : ﬂoat or sequence

The size of the box used for ﬁltering. If a ﬂoat, size is the same for all axes. If a

sequence, size has to contain one value for each axis.

n: int, optional

If nis negative (default), then the input is assumed to be the result of a complex fft. If

nis larger than or equal to zero, the input is assumed to be the result of a real fft, and n

gives the length of the array before transformation along the real transform direction.

axis : int, optional

The axis of the real transform.

output : ndarray, optional

If given, the result of ﬁltering the input is placed in this array. None is returned in this

case.

Returns fourier_ellipsoid : ndarray or None

The ﬁltered input. If output is given as a parameter, None is returned.

Notes

This function is implemented for arrays of rank 1, 2, or 3.

scipy.ndimage.fourier.fourier_gaussian(input,sigma,n=-1,axis=-1,output=None)

Multi-dimensional Gaussian fourier ﬁlter.

The array is multiplied with the fourier transform of a Gaussian kernel.

Parameters input : array_like

478 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

The input array.

sigma : ﬂoat or sequence

The sigma of the Gaussian kernel. If a ﬂoat, sigma is the same for all axes. If a

sequence, sigma has to contain one value for each axis.

n: int, optional

If nis negative (default), then the input is assumed to be the result of a complex fft. If

nis larger than or equal to zero, the input is assumed to be the result of a real fft, and n

gives the length of the array before transformation along the real transform direction.

axis : int, optional

The axis of the real transform.

output : ndarray, optional

If given, the result of ﬁltering the input is placed in this array. None is returned in this

case.

Returns fourier_gaussian : ndarray or None

The ﬁltered input. If output is given as a parameter, None is returned.

scipy.ndimage.fourier.fourier_shift(input,shift,n=-1,axis=-1,output=None)

Multi-dimensional fourier shift ﬁlter.

The array is multiplied with the fourier transform of a shift operation.

Parameters input : array_like

The input array.

shift : ﬂoat or sequence

The size of the box used for ﬁltering. If a ﬂoat, shift is the same for all axes. If a

sequence, shift has to contain one value for each axis.

n: int, optional

If nis negative (default), then the input is assumed to be the result of a complex fft. If

nis larger than or equal to zero, the input is assumed to be the result of a real fft, and n

gives the length of the array before transformation along the real transform direction.

axis : int, optional

The axis of the real transform.

output : ndarray, optional

If given, the result of shifting the input is placed in this array. None is returned in this

case.

Returns fourier_shift : ndarray or None

The shifted input. If output is given as a parameter, None is returned.

scipy.ndimage.fourier.fourier_uniform(input,size,n=-1,axis=-1,output=None)

Multi-dimensional uniform fourier ﬁlter.

The array is multiplied with the fourier transform of a box of given size.

Parameters input : array_like

The input array.

size : ﬂoat or sequence

The size of the box used for ﬁltering. If a ﬂoat, size is the same for all axes. If a

sequence, size has to contain one value for each axis.

n: int, optional

If nis negative (default), then the input is assumed to be the result of a complex fft. If

nis larger than or equal to zero, the input is assumed to be the result of a real fft, and n

gives the length of the array before transformation along the real transform direction.

axis : int, optional

The axis of the real transform.

output : ndarray, optional

If given, the result of ﬁltering the input is placed in this array. None is returned in this

case.

5.18. Multi-dimensional image processing (scipy.ndimage) 479

SciPy Reference Guide, Release 0.13.0

Returns fourier_uniform : ndarray or None

The ﬁltered input. If output is given as a parameter, None is returned.

5.18.3 Interpolation scipy.ndimage.interpolation

affine_transform(input, matrix[, offset, ...]) Apply an afﬁne transformation.

geometric_transform(input, mapping[, ...]) Apply an arbritrary geometric transform.

map_coordinates(input, coordinates[, ...]) Map the input array to new coordinates by interpolation.

rotate(input, angle[, axes, reshape, ...]) Rotate an array.

shift(input, shift[, output, order, mode, ...]) Shift an array.

spline_filter(input[, order, output]) Multi-dimensional spline ﬁlter.

spline_filter1d(input[, order, axis, output]) Calculates a one-dimensional spline ﬁlter along the given axis.

zoom(input, zoom[, output, order, mode, ...]) Zoom an array.

scipy.ndimage.interpolation.affine_transform(input,matrix,offset=0.0,out-

put_shape=None,output=None,or-

der=3,mode=’constant’,cval=0.0,

preﬁlter=True)

Apply an afﬁne transformation.

The given matrix and offset are used to ﬁnd for each point in the output the corresponding coordinates in the input

by an afﬁne transformation. The value of the input at those coordinates is determined by spline interpolation of

the requested order. Points outside the boundaries of the input are ﬁlled according to the given mode.

Parameters input : ndarray

The input array.

matrix : ndarray

The matrix must be two-dimensional or can also be given as a one-dimensional se-

quence or array. In the latter case, it is assumed that the matrix is diagonal. A more

efﬁcient algorithms is then applied that exploits the separability of the problem.

offset : ﬂoat or sequence, optional

The offset into the array where the transform is applied. If a ﬂoat, offset is the same

for each axis. If a sequence, offset should contain one value for each axis.

output_shape : tuple of ints, optional

Shape tuple.

output : ndarray or dtype, optional

The array in which to place the output, or the dtype of the returned array.

order : int, optional

The order of the spline interpolation, default is 3. The order has to be in the range 0-5.

mode : str, optional

Points outside the boundaries of the input are ﬁlled according to the given mode (‘con-

stant’, ‘nearest’, ‘reﬂect’ or ‘wrap’). Default is ‘constant’.

cval : scalar, optional

Value used for points outside the boundaries of the input if mode=’constant’.

Default is 0.0

preﬁlter : bool, optional

The parameter preﬁlter determines if the input is pre-ﬁltered with spline_filter

before interpolation (necessary for spline interpolation of order > 1). If False, it is

assumed that the input is already ﬁltered. Default is True.

Returns afﬁne_transform : ndarray or None

The transformed input. If output is given as a parameter, None is returned.

480 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.ndimage.interpolation.geometric_transform(input,mapping,output_shape=None,

output=None,order=3,

mode=’constant’,cval=0.0,pre-

ﬁlter=True,extra_arguments=(),

extra_keywords={})

Apply an arbritrary geometric transform.

The given mapping function is used to ﬁnd, for each point in the output, the corresponding coordinates in the

input. The value of the input at those coordinates is determined by spline interpolation of the requested order.

Parameters input : array_like

The input array.

mapping : callable

A callable object that accepts a tuple of length equal to the output array rank, and

returns the corresponding input coordinates as a tuple of length equal to the input

array rank.

output_shape : tuple of ints

Shape tuple.

output : ndarray or dtype, optional

The array in which to place the output, or the dtype of the returned array.

order : int, optional

The order of the spline interpolation, default is 3. The order has to be in the range 0-5.

mode : str, optional

Points outside the boundaries of the input are ﬁlled according to the given mode (‘con-

stant’, ‘nearest’, ‘reﬂect’ or ‘wrap’). Default is ‘constant’.

cval : scalar, optional

Value used for points outside the boundaries of the input if mode=’constant’.

Default is 0.0

preﬁlter : bool, optional

The parameter preﬁlter determines if the input is pre-ﬁltered with spline_filter

before interpolation (necessary for spline interpolation of order > 1). If False, it is

assumed that the input is already ﬁltered. Default is True.

extra_arguments : tuple, optional

Extra arguments passed to mapping.

extra_keywords : dict, optional

Extra keywords passed to mapping.

Returns return_value : ndarray or None

The ﬁltered input. If output is given as a parameter, None is returned.

See Also

map_coordinates,affine_transform,spline_filter1d

Examples

>>> a=np.arange(12.).reshape((4,3))

>>> def shift_func(output_coords):

... return (output_coords[0]-0.5, output_coords[1]-0.5)

...

>>> sp.ndimage.geometric_transform(a, shift_func)

array([[ 0. , 0. , 0. ],

[ 0. , 1.362, 2.738],

[ 0. , 4.812, 6.187],

[ 0. , 8.263, 9.637]])

5.18. Multi-dimensional image processing (scipy.ndimage) 481

SciPy Reference Guide, Release 0.13.0

scipy.ndimage.interpolation.map_coordinates(input,coordinates,output=None,order=3,

mode=’constant’,cval=0.0,preﬁlter=True)

Map the input array to new coordinates by interpolation.

The array of coordinates is used to ﬁnd, for each point in the output, the corresponding coordinates in the input.

The value of the input at those coordinates is determined by spline interpolation of the requested order.

The shape of the output is derived from that of the coordinate array by dropping the ﬁrst axis. The values of the

array along the ﬁrst axis are the coordinates in the input array at which the output value is found.

Parameters input : ndarray

The input array.

coordinates : array_like

The coordinates at which input is evaluated.

output : ndarray or dtype, optional

The array in which to place the output, or the dtype of the returned array.

order : int, optional

The order of the spline interpolation, default is 3. The order has to be in the range 0-5.

mode : str, optional

Points outside the boundaries of the input are ﬁlled according to the given mode (‘con-

stant’, ‘nearest’, ‘reﬂect’ or ‘wrap’). Default is ‘constant’.

cval : scalar, optional

Value used for points outside the boundaries of the input if mode=’constant’.

Default is 0.0

preﬁlter : bool, optional

The parameter preﬁlter determines if the input is pre-ﬁltered with spline_filter

before interpolation (necessary for spline interpolation of order > 1). If False, it is

assumed that the input is already ﬁltered. Default is True.

Returns map_coordinates : ndarray

The result of transforming the input. The shape of the output is derived from that of

coordinates by dropping the ﬁrst axis.

See Also

spline_filter,geometric_transform,scipy.interpolate

Examples

>>> from scipy import ndimage

>>> a=np.arange(12.).reshape((4,3))

>>> a

array([[ 0., 1., 2.],

[ 3., 4., 5.],

[ 6., 7., 8.],

[ 9., 10., 11.]])

>>> ndimage.map_coordinates(a, [[0.5,2], [0.5,1]], order=1)

[ 2. 7.]

Above, the interpolated value of a[0.5, 0.5] gives output[0], while a[2, 1] is output[1].

>>> inds =np.array([[0.5,2], [0.5,4]])

>>> ndimage.map_coordinates(a, inds, order=1, cval=-33.3)

array([ 2. , -33.3])

>>> ndimage.map_coordinates(a, inds, order=1, mode=’nearest’)

array([ 2., 8.])

>>> ndimage.map_coordinates(a, inds, order=1, cval=0, output=bool)

array([ True, False], dtype=bool

482 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.ndimage.interpolation.rotate(input,angle,axes=(1,0),reshape=True,output=None,or-

der=3,mode=’constant’,cval=0.0,preﬁlter=True)

Rotate an array.

The array is rotated in the plane deﬁned by the two axes given by the axes parameter using spline interpolation

of the requested order.

Parameters input : ndarray

The input array.

angle : ﬂoat

The rotation angle in degrees.

axes : tuple of 2 ints, optional

The two axes that deﬁne the plane of rotation. Default is the ﬁrst two axes.

reshape : bool, optional

If reshape is true, the output shape is adapted so that the input array is contained

completely in the output. Default is True.

output : ndarray or dtype, optional

The array in which to place the output, or the dtype of the returned array.

order : int, optional

The order of the spline interpolation, default is 3. The order has to be in the range 0-5.

mode : str, optional

Points outside the boundaries of the input are ﬁlled according to the given mode (‘con-

stant’, ‘nearest’, ‘reﬂect’ or ‘wrap’). Default is ‘constant’.

cval : scalar, optional

Value used for points outside the boundaries of the input if mode=’constant’.

Default is 0.0

preﬁlter : bool, optional

The parameter preﬁlter determines if the input is pre-ﬁltered with spline_filter

before interpolation (necessary for spline interpolation of order > 1). If False, it is

assumed that the input is already ﬁltered. Default is True.

Returns rotate : ndarray or None

The rotated input. If output is given as a parameter, None is returned.

scipy.ndimage.interpolation.shift(input,shift,output=None,order=3,mode=’constant’,

cval=0.0,preﬁlter=True)

Shift an array.

The array is shifted using spline interpolation of the requested order. Points outside the boundaries of the input

are ﬁlled according to the given mode.

Parameters input : ndarray

The input array.

shift : ﬂoat or sequence, optional

The shift along the axes. If a ﬂoat, shift is the same for each axis. If a sequence,

shift should contain one value for each axis.

output : ndarray or dtype, optional

The array in which to place the output, or the dtype of the returned array.

order : int, optional

The order of the spline interpolation, default is 3. The order has to be in the range 0-5.

mode : str, optional

Points outside the boundaries of the input are ﬁlled according to the given mode (‘con-

stant’, ‘nearest’, ‘reﬂect’ or ‘wrap’). Default is ‘constant’.

cval : scalar, optional

Value used for points outside the boundaries of the input if mode=’constant’.

Default is 0.0

preﬁlter : bool, optional

5.18. Multi-dimensional image processing (scipy.ndimage) 483

SciPy Reference Guide, Release 0.13.0

The parameter preﬁlter determines if the input is pre-ﬁltered with spline_filter

before interpolation (necessary for spline interpolation of order > 1). If False, it is

assumed that the input is already ﬁltered. Default is True.

Returns shift : ndarray or None

The shifted input. If output is given as a parameter, None is returned.

scipy.ndimage.interpolation.spline_filter(input,order=3,output=<type

‘numpy.ﬂoat64’>)

Multi-dimensional spline ﬁlter.

For more details, see spline_filter1d.

See Also

spline_filter1d

Notes

The multi-dimensional ﬁlter is implemented as a sequence of one-dimensional spline ﬁlters. The intermediate

arrays are stored in the same data type as the output. Therefore, for output types with a limited precision, the

results may be imprecise because intermediate results may be stored with insufﬁcient precision.

scipy.ndimage.interpolation.spline_filter1d(input,order=3,axis=-1,output=<type

‘numpy.ﬂoat64’>)

Calculates a one-dimensional spline ﬁlter along the given axis.

The lines of the array along the given axis are ﬁltered by a spline ﬁlter. The order of the spline must be >= 2 and

<= 5.

Parameters input : array_like

The input array.

order : int, optional

The order of the spline, default is 3.

axis : int, optional

The axis along which the spline ﬁlter is applied. Default is the last axis.

output : ndarray or dtype, optional

The array in which to place the output, or the dtype of the returned array. Default is

numpy.float64.

Returns spline_ﬁlter1d : ndarray or None

The ﬁltered input. If output is given as a parameter, None is returned.

scipy.ndimage.interpolation.zoom(input,zoom,output=None,order=3,mode=’constant’,

cval=0.0,preﬁlter=True)

Zoom an array.

The array is zoomed using spline interpolation of the requested order.

Parameters input : ndarray

The input array.

zoom : ﬂoat or sequence, optional

The zoom factor along the axes. If a ﬂoat, zoom is the same for each axis. If a

sequence, zoom should contain one value for each axis.

output : ndarray or dtype, optional

The array in which to place the output, or the dtype of the returned array.

order : int, optional

The order of the spline interpolation, default is 3. The order has to be in the range 0-5.

mode : str, optional

Points outside the boundaries of the input are ﬁlled according to the given mode (‘con-

stant’, ‘nearest’, ‘reﬂect’ or ‘wrap’). Default is ‘constant’.

cval : scalar, optional

484 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Value used for points outside the boundaries of the input if mode=’constant’.

Default is 0.0

preﬁlter : bool, optional

The parameter preﬁlter determines if the input is pre-ﬁltered with spline_filter

before interpolation (necessary for spline interpolation of order > 1). If False, it is

assumed that the input is already ﬁltered. Default is True.

Returns zoom : ndarray or None

The zoomed input. If output is given as a parameter, None is returned.

5.18.4 Measurements scipy.ndimage.measurements

center_of_mass(input[, labels, index]) Calculate the center of mass of the values of an array at labels.

extrema(input[, labels, index]) Calculate the minimums and maximums of the values of an array at labels, along with their positions.

find_objects(input[, max_label]) Find objects in a labeled array.

histogram(input, min, max, bins[, labels, index]) Calculate the histogram of the values of an array, optionally at labels.

label(input[, structure, output]) Label features in an array.

labeled_comprehension(input, labels, index, ...) Roughly equivalent to [func(input[labels == i]) for i in index].

maximum(input[, labels, index]) Calculate the maximum of the values of an array over labeled regions.

maximum_position(input[, labels, index]) Find the positions of the maximums of the values of an array at labels.

mean(input[, labels, index]) Calculate the mean of the values of an array at labels.

minimum(input[, labels, index]) Calculate the minimum of the values of an array over labeled regions.

minimum_position(input[, labels, index]) Find the positions of the minimums of the values of an array at labels.

standard_deviation(input[, labels, index]) Calculate the standard deviation of the values of an n-D image array,

sum(input[, labels, index]) Calculate the sum of the values of the array.

variance(input[, labels, index]) Calculate the variance of the values of an n-D image array, optionally at

watershed_ift(input, markers[, structure, ...]) Apply watershed from markers using an iterative forest transform algorithm.

scipy.ndimage.measurements.center_of_mass(input,labels=None,index=None)

Calculate the center of mass of the values of an array at labels.

Parameters input : ndarray

Data from which to calculate center-of-mass.

labels : ndarray, optional

Labels for objects in input, as generated by ndimage.label. Only used with index.

Dimensions must be the same as input.

index : int or sequence of ints, optional

Labels for which to calculate centers-of-mass. If not speciﬁed, all labels greater than

zero are used. Only used with labels.

Returns center_of_mass : tuple, or list of tuples

Coordinates of centers-of-mass.

Examples

>>> a=np.array(([0,0,0,0],

[0,1,1,0],

[0,1,1,0]))

>>> from scipy import ndimage

>>> ndimage.measurements.center_of_mass(a)

(2.0, 1.5)

Calculation of multiple objects in an image

5.18. Multi-dimensional image processing (scipy.ndimage) 485

SciPy Reference Guide, Release 0.13.0

>>> b=np.array(([0,1,1,0],

[0,1,0,0],

[0,0,0,0],

[0,0,1,1],

[0,0,1,1]))

>>> lbl =ndimage.label(b)[0]

>>> ndimage.measurements.center_of_mass(b, lbl, [1,2])

[(0.33333333333333331, 1.3333333333333333), (3.5, 2.5)]

scipy.ndimage.measurements.extrema(input,labels=None,index=None)

Calculate the minimums and maximums of the values of an array at labels, along with their positions.

Parameters input : ndarray

Nd-image data to process.

labels : ndarray, optional

Labels of features in input. If not None, must be same shape as input.

index : int or sequence of ints, optional

Labels to include in output. If None (default), all values where non-zero labels are

used.

Returns minimums, maximums : int or ndarray

Values of minimums and maximums in each feature.

min_positions, max_positions : tuple or list of tuples

Each tuple gives the n-D coordinates of the corresponding minimum or maximum.

See Also

maximum,minimum,maximum_position,minimum_position,center_of_mass

Examples

>>> a=np.array([[1,2,0,0],

[5, 3, 0, 4],

[0, 0, 0, 7],

[9, 3, 0, 0]])

>>> from scipy import ndimage

>>> ndimage.extrema(a)

(0, 9, (0, 2), (3, 0))

Features to process can be speciﬁed using labels and index:

>>> lbl, nlbl =ndimage.label(a)

>>> ndimage.extrema(a, lbl, index=np.arange(1, nlbl+1))

(array([1, 4, 3]),

array([5, 7, 9]),

[(0.0, 0.0), (1.0, 3.0), (3.0, 1.0)],

[(1.0, 0.0), (2.0, 3.0), (3.0, 0.0)])

If no index is given, non-zero labels are processed:

>>> ndimage.extrema(a, lbl)

(1, 9, (0, 0), (3, 0))

scipy.ndimage.measurements.find_objects(input,max_label=0)

Find objects in a labeled array.

Parameters input : ndarray of ints

Array containing objects deﬁned by different labels.

max_label : int, optional

486 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Maximum label to be searched for in input. If max_label is not given, the positions of

all objects are returned.

Returns object_slices : list of tuples

A list of tuples, with each tuple containing N slices (with N the dimension of the input

array). Slices correspond to the minimal parallelepiped that contains the object. If a

number is missing, None is returned instead of a slice.

See Also

label,center_of_mass

Notes

This function is very useful for isolating a volume of interest inside a 3-D array, that cannot be “seen through”.

Examples

>>> a=np.zeros((6,6), dtype=np.int)

>>> a[2:4,2:4]=1

>>> a[4,4]=1

>>> a[:2, :3]=2

>>> a[0,5]=3

>>> a

array([[2, 2, 2, 0, 0, 3],

[2, 2, 2, 0, 0, 0],

[0, 0, 1, 1, 0, 0],

[0, 0, 0, 0, 1, 0],

[0, 0, 0, 0, 0, 0]])

>>> ndimage.find_objects(a)

[(slice(2, 5, None), slice(2, 5, None)), (slice(0, 2, None), slice(0, 3, None)), (slice(0, 1, None), slice(5, 6, None))]

>>> ndimage.find_objects(a, max_label=2)

[(slice(2, 5, None), slice(2, 5, None)), (slice(0, 2, None), slice(0, 3, None))]

>>> ndimage.find_objects(a == 1, max_label=2)

[(slice(2, 5, None), slice(2, 5, None)), None]

>>> loc =ndimage.find_objects(a)[0]

>>> a[loc]

array([[1, 1, 0]

[1, 1, 0]

[0, 0, 1]])

scipy.ndimage.measurements.histogram(input,min,max,bins,labels=None,index=None)

Calculate the histogram of the values of an array, optionally at labels.

Histogram calculates the frequency of values in an array within bins determined by min,max, and bins. The

labels and index keywords can limit the scope of the histogram to speciﬁed sub-regions within the array.

Parameters input : array_like

Data for which to calculate histogram.

min, max : int

Minimum and maximum values of range of histogram bins.

bins : int

Number of bins.

labels : array_like, optional

Labels for objects in input. If not None, must be same shape as input.

index : int or sequence of ints, optional

Label or labels for which to calculate histogram. If None, all values where label is

greater than zero are used

5.18. Multi-dimensional image processing (scipy.ndimage) 487

SciPy Reference Guide, Release 0.13.0

Returns hist : ndarray

Histogram counts.

Examples

>>> a=np.array([[ 0. ,0.2146,0.5962,0. ],

[ 0. , 0.7778, 0. , 0. ],

[ 0. , 0. , 0. , 0. ],

[ 0. , 0. , 0.7181, 0.2787],

[ 0. , 0. , 0.6573, 0.3094]])

>>> from scipy import ndimage

>>> ndimage.measurements.histogram(a, 0,1,10)

array([13, 0, 2, 1, 0, 1, 1, 2, 0, 0])

With labels and no indices, non-zero elements are counted:

>>> lbl, nlbl =ndimage.label(a)

>>> ndimage.measurements.histogram(a, 0,1,10, lbl)

array([0, 0, 2, 1, 0, 1, 1, 2, 0, 0])

Indices can be used to count only certain objects:

>>> ndimage.measurements.histogram(a, 0,1,10, lbl, 2)

array([0, 0, 1, 1, 0, 0, 1, 1, 0, 0])

scipy.ndimage.measurements.label(input,structure=None,output=None)

Label features in an array.

Parameters input : array_like

An array-like object to be labeled. Any non-zero values in input are counted as fea-

tures and zero values are considered the background.

structure : array_like, optional

A structuring element that deﬁnes feature connections. structure must be symmetric.

If no structuring element is provided, one is automatically generated with a squared

connectivity equal to one. That is, for a 2-D input array, the default structuring element

is:

[[0,1,0],

[1,1,1],

[0,1,0]]

output : (None, data-type, array_like), optional

If output is a data type, it speciﬁes the type of the resulting labeled feature array If

output is an array-like object, then output will be updated with the labeled features

from this function. This function can operate in-place, by passing output=input. Note

that the output must be able to store the largest label, or this function will raise an

Exception.

Returns label : ndarray or int

An integer ndarray where each unique feature in input has a unique label in the re-

turned array.

num_features : int

How many objects were found.

If output is None, this function returns a tuple of (labeled_array,num_features).

If output is a ndarray, then it will be updated with values in labeled_array and only

num_features will be returned by this function.

488 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

See Also

find_objects

generate a list of slices for the labeled features (or objects); useful for ﬁnding features’ position

or dimensions

Examples

Create an image with some features, then label it using the default (cross-shaped) structuring element:

>>> a=np.array([[0,0,1,1,0,0],

... [0,0,0,1,0,0],

... [1,1,0,0,1,0],

... [0,0,0,1,0,0]])

>>> labeled_array, num_features =label(a)

Each of the 4 features are labeled with a different integer:

>>> print(num_features)

>>> print(labeled_array)

array([[0, 0, 1, 1, 0, 0],

[0, 0, 0, 1, 0, 0],

[2, 2, 0, 0, 3, 0],

[0, 0, 0, 4, 0, 0]])

Generate a structuring element that will consider features connected even if they touch diagonally:

>>> s=generate_binary_structure(2,2)

or,

>>> s=[[1,1,1],

[1,1,1],

[1,1,1]]

Label the image using the new structuring element:

>>> labeled_array, num_features =label(a, structure=s)

Show the 2 labeled features (note that features 1, 3, and 4 from above are now considered a single feature):

>>> print(num_features)

>>> print(labeled_array)

array([[0, 0, 1, 1, 0, 0],

[0, 0, 0, 1, 0, 0],

[2, 2, 0, 0, 1, 0],

[0, 0, 0, 1, 0, 0]])

scipy.ndimage.measurements.labeled_comprehension(input,labels,index,func,out_dtype,

default,pass_positions=False)

Roughly equivalent to [func(input[labels == i]) for i in index].

Sequentially applies an arbitrary function (that works on array_like input) to subsets of an n-D image array

speciﬁed by labels and index. The option exists to provide the function with positional parameters as the second

argument.

Parameters input : array_like

Data from which to select labels to process.

labels : array_like or None

5.18. Multi-dimensional image processing (scipy.ndimage) 489

SciPy Reference Guide, Release 0.13.0

Labels to objects in input. If not None, array must be same shape as input. If None,

func is applied to raveled input.

index : int, sequence of ints or None

Subset of labels to which to apply func. If a scalar, a single value is returned. If None,

func is applied to all non-zero values of labels.

func : callable

Python function to apply to labels from input.

out_dtype : dtype

Dtype to use for result.

default : int, ﬂoat or None

Default return value when a element of index does not exist in labels.

pass_positions : bool, optional

If True, pass linear indices to func as a second argument. Default is False.

Returns result : ndarray

Result of applying func to each of labels to input in index.

Examples

>>> a=np.array([[1,2,0,0],

[5, 3, 0, 4],

[0, 0, 0, 7],

[9, 3, 0, 0]])

>>> from scipy import ndimage

>>> lbl, nlbl =ndimage.label(a)

>>> lbls =np.arange(1, nlbl+1)

>>> ndimage.labeled_comprehension(a, lbl, lbls, np.mean, float,0)

array([ 2.75, 5.5 , 6. ])

Falling back to default:

>>> lbls =np.arange(1, nlbl+2)

>>> ndimage.labeled_comprehension(a, lbl, lbls, np.mean, float,-1)

array([ 2.75, 5.5 , 6. , -1. ])

Passing positions:

>>> def fn(val, pos):

... print("fn says: %s :%s"%(val, pos))

... return (val.sum()) if (pos.sum() %2== 0)else (-val.sum())

...

>>> ndimage.labeled_comprehension(a, lbl, lbls, fn, float,0,True)

fn says: [1 2 5 3] : [0 1 4 5]

fn says: [4 7] : [7 11]

fn says: [9 3] : [12 13]

array([ 11., 11., -12.])

scipy.ndimage.measurements.maximum(input,labels=None,index=None)

Calculate the maximum of the values of an array over labeled regions.

Parameters input : array_like

Array_like of values. For each region speciﬁed by labels, the maximal values of input

over the region is computed.

labels : array_like, optional

An array of integers marking different regions over which the maximum value of input

is to be computed. labels must have the same shape as input. If labels is not speciﬁed,

the maximum over the whole array is returned.

index : array_like, optional

490 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

A list of region labels that are taken into account for computing the maxima. If index

is None, the maximum over all elements where labels is non-zero is returned.

Returns output : ﬂoat or list of ﬂoats

List of maxima of input over the regions determined by labels and whose index is in

index. If index or labels are not speciﬁed, a ﬂoat is returned: the maximal value of

input if labels is None, and the maximal value of elements where labels is greater than

zero if index is None.

See Also

label,minimum,median,maximum_position,extrema,sum,mean,variance,

standard_deviation

Notes

The function returns a Python list and not a Numpy array, use np.array to convert the list to an array.

Examples

>>> a=np.arange(16).reshape((4,4))

>>> a

array([[ 0, 1, 2, 3],

[ 4, 5, 6, 7],

[ 8, 9, 10, 11],

[12, 13, 14, 15]])

>>> labels =np.zeros_like(a)

>>> labels[:2,:2]=1

>>> labels[2:, 1:3]=2

>>> labels

array([[1, 1, 0, 0],

[1, 1, 0, 0],

[0, 2, 2, 0],

[0, 2, 2, 0]])

>>> from scipy import ndimage

>>> ndimage.maximum(a)

15.0

>>> ndimage.maximum(a, labels=labels, index=[1,2])

[5.0, 14.0]

>>> ndimage.maximum(a, labels=labels)

14.0

>>> b=np.array([[1,2,0,0],

[5, 3, 0, 4],

[0, 0, 0, 7],

[9, 3, 0, 0]])

>>> labels, labels_nb =ndimage.label(b)

>>> labels

array([[1, 1, 0, 0],

[1, 1, 0, 2],

[0, 0, 0, 2],

[3, 3, 0, 0]])

>>> ndimage.maximum(b, labels=labels, index=np.arange(1, labels_nb +1))

[5.0, 7.0, 9.0]

scipy.ndimage.measurements.maximum_position(input,labels=None,index=None)

Find the positions of the maximums of the values of an array at labels.

For each region speciﬁed by labels, the position of the maximum value of input within the region is returned.

Parameters input : array_like

5.18. Multi-dimensional image processing (scipy.ndimage) 491

SciPy Reference Guide, Release 0.13.0

Array_like of values.

labels : array_like, optional

An array of integers marking different regions over which the position of the max-

imum value of input is to be computed. labels must have the same shape as input.

If labels is not speciﬁed, the location of the ﬁrst maximum over the whole array is

returned.

The labels argument only works when index is speciﬁed.

index : array_like, optional

A list of region labels that are taken into account for ﬁnding the location of the max-

ima. If index is None, the ﬁrst maximum over all elements where labels is non-zero is

returned.

The index argument only works when labels is speciﬁed.

Returns output : list of tuples of ﬂoats

List of tuples of ﬂoats that specify the location of maxima of input over the regions

determined by labels and whose index is in index.

If index or labels are not speciﬁed, a tuple of ﬂoats is returned specifying the location

of the first maximal value of input.

See Also

label,minimum,median,maximum_position,extrema,sum,mean,variance,

standard_deviation

scipy.ndimage.measurements.mean(input,labels=None,index=None)

Calculate the mean of the values of an array at labels.

Parameters input : array_like

Array on which to compute the mean of elements over distinct regions.

labels : array_like, optional

Array of labels of same shape, or broadcastable to the same shape as input. All ele-

ments sharing the same label form one region over which the mean of the elements is

computed.

index : int or sequence of ints, optional

Labels of the objects over which the mean is to be computed. Default is None, in

which case the mean for all values where label is greater than 0 is calculated.

Returns out : list

Sequence of same length as index, with the mean of the different regions labeled by

the labels in index.

See Also

ndimage.variance,ndimage.standard_deviation,ndimage.minimum,

ndimage.maximum,ndimage.sum,ndimage.label

Examples

>>> a=np.arange(25).reshape((5,5))

>>> labels =np.zeros_like(a)

>>> labels[3:5,3:5]=1

>>> index =np.unique(labels)

>>> labels

array([[0, 0, 0, 0, 0],

[0, 0, 0, 0, 0],

[0, 0, 0, 1, 1],

[0, 0, 0, 1, 1]])

>>> index

492 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

array([0, 1])

>>> ndimage.mean(a, labels=labels, index=index)

[10.285714285714286, 21.0]

scipy.ndimage.measurements.minimum(input,labels=None,index=None)

Calculate the minimum of the values of an array over labeled regions.

Parameters input : array_like

Array_like of values. For each region speciﬁed by labels, the minimal values of input

over the region is computed.

labels : array_like, optional

An array_like of integers marking different regions over which the minimum value of

input is to be computed. labels must have the same shape as input. If labels is not

speciﬁed, the minimum over the whole array is returned.

index : array_like, optional

A list of region labels that are taken into account for computing the minima. If index

is None, the minimum over all elements where labels is non-zero is returned.

Returns minimum : ﬂoat or list of ﬂoats

List of minima of input over the regions determined by labels and whose index is in

index. If index or labels are not speciﬁed, a ﬂoat is returned: the minimal value of

input if labels is None, and the minimal value of elements where labels is greater than

zero if index is None.

See Also

label,maximum,median,minimum_position,extrema,sum,mean,variance,

standard_deviation

Notes

The function returns a Python list and not a Numpy array, use np.array to convert the list to an array.

Examples

>>> a=np.array([[1,2,0,0],

... [5,3,0,4],

... [0,0,0,7],

... [9,3,0,0]])

>>> labels, labels_nb =ndimage.label(a)

>>> labels

array([[1, 1, 0, 0],

[1, 1, 0, 2],

[0, 0, 0, 2],

[3, 3, 0, 0]])

>>> ndimage.minimum(a, labels=labels, index=np.arange(1, labels_nb +1))

[1.0, 4.0, 3.0]

>>> ndimage.minimum(a)

0.0

>>> ndimage.minimum(a, labels=labels)

1.0

scipy.ndimage.measurements.minimum_position(input,labels=None,index=None)

Find the positions of the minimums of the values of an array at labels.

Parameters input : array_like

Array_like of values.

labels : array_like, optional

5.18. Multi-dimensional image processing (scipy.ndimage) 493

SciPy Reference Guide, Release 0.13.0

An array of integers marking different regions over which the position of the minimum

value of input is to be computed. labels must have the same shape as input. If labels

is not speciﬁed, the location of the ﬁrst minimum over the whole array is returned.

The labels argument only works when index is speciﬁed.

index : array_like, optional

A list of region labels that are taken into account for ﬁnding the location of the minima.

If index is None, the first minimum over all elements where labels is non-zero is

returned.

The index argument only works when labels is speciﬁed.

Returns output : list of tuples of ﬂoats

Tuple of ﬂoats or list of tuples of ﬂoats that specify the location of minima of input

over the regions determined by labels and whose index is in index.

If index or labels are not speciﬁed, a tuple of ﬂoats is returned specifying the location

of the ﬁrst minimal value of input.

See Also

label,minimum,median,maximum_position,extrema,sum,mean,variance,

standard_deviation

scipy.ndimage.measurements.standard_deviation(input,labels=None,index=None)

Calculate the standard deviation of the values of an n-D image array, optionally at speciﬁed sub-regions.

Parameters input : array_like

Nd-image data to process.

labels : array_like, optional

Labels to identify sub-regions in input. If not None, must be same shape as input.

index : int or sequence of ints, optional

labels to include in output. If None (default), all values where labels is non-zero are

used.

Returns standard_deviation : ﬂoat or ndarray

Values of standard deviation, for each sub-region if labels and index are speciﬁed.

See Also

label,variance,maximum,minimum,extrema

Examples

>>> a=np.array([[1,2,0,0],

[5, 3, 0, 4],

[0, 0, 0, 7],

[9, 3, 0, 0]])

>>> from scipy import ndimage

>>> ndimage.standard_deviation(a)

2.7585095613392387

Features to process can be speciﬁed using labels and index:

>>> lbl, nlbl =ndimage.label(a)

>>> ndimage.standard_deviation(a, lbl, index=np.arange(1, nlbl+1))

array([ 1.479, 1.5 , 3. ])

If no index is given, non-zero labels are processed:

>>> ndimage.standard_deviation(a, lbl)

2.4874685927665499

494 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.ndimage.measurements.sum(input,labels=None,index=None)

Calculate the sum of the values of the array.

Parameters input : array_like

Values of input inside the regions deﬁned by labels are summed together.

labels : array_like of ints, optional

Assign labels to the values of the array. Has to have the same shape as input.

index : array_like, optional

A single label number or a sequence of label numbers of the objects to be measured.

Returns sum : list

A list of the sums of the values of input inside the regions deﬁned by labels.

See Also

mean,median

Examples

>>> input =[0,1,2,3]

>>> labels =[1,1,2,2]

>>> sum(input, labels, index=[1,2])

[1.0, 5.0]

scipy.ndimage.measurements.variance(input,labels=None,index=None)

Calculate the variance of the values of an n-D image array, optionally at speciﬁed sub-regions.

Parameters input : array_like

Nd-image data to process.

labels : array_like, optional

Labels deﬁning sub-regions in input. If not None, must be same shape as input.

index : int or sequence of ints, optional

labels to include in output. If None (default), all values where labels is non-zero are

used.

Returns variance : ﬂoat or ndarray

Values of variance, for each sub-region if labels and index are speciﬁed.

See Also

label,standard_deviation,maximum,minimum,extrema

Examples

>>> a=np.array([[1,2,0,0],

[5, 3, 0, 4],

[0, 0, 0, 7],

[9, 3, 0, 0]])

>>> from scipy import ndimage

>>> ndimage.variance(a)

7.609375

Features to process can be speciﬁed using labels and index:

>>> lbl, nlbl =ndimage.label(a)

>>> ndimage.variance(a, lbl, index=np.arange(1, nlbl+1))

array([ 2.1875, 2.25 , 9. ])

If no index is given, all non-zero labels are processed:

5.18. Multi-dimensional image processing (scipy.ndimage) 495

SciPy Reference Guide, Release 0.13.0

>>> ndimage.variance(a, lbl)

6.1875

scipy.ndimage.measurements.watershed_ift(input,markers,structure=None,output=None)

Apply watershed from markers using an iterative forest transform algorithm.

Parameters input : array_like

Input.

markers : array_like

Markers are points within each watershed that form the beginning of the process.

Negative markers are considered background markers which are processed after the

other markers.

structure : structure element, optional

A structuring element deﬁning the connectivity of the object can be provided. If None,

an element is generated with a squared connectivity equal to one.

out : ndarray

An output array can optionally be provided. The same shape as input.

Returns watershed_ift : ndarray

Output. Same shape as input.

5.18.5 Morphology scipy.ndimage.morphology

binary_closing(input[, structure, ...]) Multi-dimensional binary closing with the given structuring element.

binary_dilation(input[, structure, ...]) Multi-dimensional binary dilation with the given structuring element.

binary_erosion(input[, structure, ...]) Multi-dimensional binary erosion with a given structuring element.

binary_fill_holes(input[, structure, ...]) Fill the holes in binary objects.

binary_hit_or_miss(input[, structure1, ...]) Multi-dimensional binary hit-or-miss transform.

binary_opening(input[, structure, ...]) Multi-dimensional binary opening with the given structuring element.

binary_propagation(input[, structure, mask, ...]) Multi-dimensional binary propagation with the given structuring element.

black_tophat(input[, size, footprint, ...]) Multi-dimensional black tophat ﬁlter.

distance_transform_bf(input[, metric, ...]) Distance transform function by a brute force algorithm.

distance_transform_cdt(input[, metric, ...]) Distance transform for chamfer type of transforms.

distance_transform_edt(input[, sampling, ...]) Exact euclidean distance transform.

generate_binary_structure(rank, connectivity) Generate a binary structure for binary morphological operations.

grey_closing(input[, size, footprint, ...]) Multi-dimensional greyscale closing.

grey_dilation(input[, size, footprint, ...]) Calculate a greyscale dilation, using either a structuring element, or a footprint corresponding to a ﬂat structuring element.

grey_erosion(input[, size, footprint, ...]) Calculate a greyscale erosion, using either a structuring element, or a footprint corresponding to a ﬂat structuring element.

grey_opening(input[, size, footprint, ...]) Multi-dimensional greyscale opening.

iterate_structure(structure, iterations[, ...]) Iterate a structure by dilating it with itself.

morphological_gradient(input[, size, ...]) Multi-dimensional morphological gradient.

morphological_laplace(input[, size, ...]) Multi-dimensional morphological laplace.

white_tophat(input[, size, footprint, ...]) Multi-dimensional white tophat ﬁlter.

scipy.ndimage.morphology.binary_closing(input,structure=None,iterations=1,output=None,

origin=0)

Multi-dimensional binary closing with the given structuring element.

The closing of an input image by a structuring element is the erosion of the dilation of the image by the struc-

turing element.

Parameters input : array_like

Binary array_like to be closed. Non-zero (True) elements form the subset to be closed.

structure : array_like, optional

496 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Structuring element used for the closing. Non-zero elements are considered True. If

no structuring element is provided an element is generated with a square connectivity

equal to one (i.e., only nearest neighbors are connected to the center, diagonally-

connected elements are not considered neighbors).

iterations : {int, ﬂoat}, optional

The dilation step of the closing, then the erosion step are each repeated iterations

times (one, by default). If iterations is less than 1, each operations is repeated until

the result does not change anymore.

output : ndarray, optional

Array of the same shape as input, into which the output is placed. By default, a new

array is created.

origin : int or tuple of ints, optional

Placement of the ﬁlter, by default 0.

Returns binary_closing : ndarray of bools

Closing of the input by the structuring element.

See Also

grey_closing,binary_opening,binary_dilation,binary_erosion,

generate_binary_structure

Notes

Closing [R67] is a mathematical morphology operation [R68] that consists in the succession of a dilation and an

erosion of the input with the same structuring element. Closing therefore ﬁlls holes smaller than the structuring

element.

Together with opening (binary_opening), closing can be used for noise removal.

References

[R67],[R68]

Examples

>>> a=np.zeros((5,5), dtype=np.int)

>>> a[1:-1,1:-1]=1; a[2,2]=0

>>> a

array([[0, 0, 0, 0, 0],

[0, 1, 1, 1, 0],

[0, 1, 0, 1, 0],

[0, 1, 1, 1, 0],

[0, 0, 0, 0, 0]])

>>> # Closing removes small holes

>>> ndimage.binary_closing(a).astype(np.int)

array([[0, 0, 0, 0, 0],

[0, 1, 1, 1, 0],

[0, 0, 0, 0, 0]])

>>> # Closing is the erosion of the dilation of the input

>>> ndimage.binary_dilation(a).astype(np.int)

array([[0, 1, 1, 1, 0],

[1, 1, 1, 1, 1],

[0, 1, 1, 1, 0]])

>>> ndimage.binary_erosion(ndimage.binary_dilation(a)).astype(np.int)

array([[0, 0, 0, 0, 0],

5.18. Multi-dimensional image processing (scipy.ndimage) 497

SciPy Reference Guide, Release 0.13.0

[0, 1, 1, 1, 0],

[0, 0, 0, 0, 0]])

>>> a=np.zeros((7,7), dtype=np.int)

>>> a[1:6,2:5]=1; a[1:3,3]=0

>>> a

array([[0, 0, 0, 0, 0, 0, 0],

[0, 0, 1, 0, 1, 0, 0],

[0, 0, 1, 1, 1, 0, 0],

[0, 0, 0, 0, 0, 0, 0]])

>>> # In addition to removing holes, closing can also

>>> # coarsen boundaries with fine hollows.

>>> ndimage.binary_closing(a).astype(np.int)

array([[0, 0, 0, 0, 0, 0, 0],

[0, 0, 1, 0, 1, 0, 0],

[0, 0, 1, 1, 1, 0, 0],

[0, 0, 0, 0, 0, 0, 0]])

>>> ndimage.binary_closing(a, structure=np.ones((2,2))).astype(np.int)

array([[0, 0, 0, 0, 0, 0, 0],

[0, 0, 1, 1, 1, 0, 0],

[0, 0, 0, 0, 0, 0, 0]])

scipy.ndimage.morphology.binary_dilation(input,structure=None,iterations=1,mask=None,

output=None,border_value=0,origin=0,

brute_force=False)

Multi-dimensional binary dilation with the given structuring element.

Parameters input : array_like

Binary array_like to be dilated. Non-zero (True) elements form the subset to be di-

lated.

structure : array_like, optional

Structuring element used for the dilation. Non-zero elements are considered True. If

no structuring element is provided an element is generated with a square connectivity

equal to one.

iterations : {int, ﬂoat}, optional

The dilation is repeated iterations times (one, by default). If iterations is less than 1,

the dilation is repeated until the result does not change anymore.

mask : array_like, optional

If a mask is given, only those elements with a True value at the corresponding mask

element are modiﬁed at each iteration.

output : ndarray, optional

Array of the same shape as input, into which the output is placed. By default, a new

array is created.

origin : int or tuple of ints, optional

Placement of the ﬁlter, by default 0.

498 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

border_value : int (cast to 0 or 1)

Value at the border in the output array.

Returns binary_dilation : ndarray of bools

Dilation of the input by the structuring element.

See Also

grey_dilation,binary_erosion,binary_closing,binary_opening,

generate_binary_structure

Notes

Dilation [R69] is a mathematical morphology operation [R70] that uses a structuring element for expanding the

shapes in an image. The binary dilation of an image by a structuring element is the locus of the points covered

by the structuring element, when its center lies within the non-zero points of the image.

References

[R69],[R70]

Examples

>>> a=np.zeros((5,5))

>>> a[2,2]=1

>>> a

array([[ 0., 0., 0., 0., 0.],

[ 0., 0., 0., 0., 0.],

[ 0., 0., 1., 0., 0.],

[ 0., 0., 0., 0., 0.],

[ 0., 0., 0., 0., 0.]])

>>> ndimage.binary_dilation(a)

array([[False, False, False, False, False],

[False, False, True, False, False],

[False, True, True, True, False],

[False, False, True, False, False],

[False, False, False, False, False]], dtype=bool)

>>> ndimage.binary_dilation(a).astype(a.dtype)

array([[ 0., 0., 0., 0., 0.],

[ 0., 0., 1., 0., 0.],

[ 0., 1., 1., 1., 0.],

[ 0., 0., 1., 0., 0.],

[ 0., 0., 0., 0., 0.]])

>>> # 3x3 structuring element with connectivity 1, used by default

>>> struct1 =ndimage.generate_binary_structure(2,1)

>>> struct1

array([[False, True, False],

[ True, True, True],

[False, True, False]], dtype=bool)

>>> # 3x3 structuring element with connectivity 2

>>> struct2 =ndimage.generate_binary_structure(2,2)

>>> struct2

array([[ True, True, True],

[ True, True, True],

[ True, True, True]], dtype=bool)

>>> ndimage.binary_dilation(a, structure=struct1).astype(a.dtype)

array([[ 0., 0., 0., 0., 0.],

[ 0., 0., 1., 0., 0.],

[ 0., 1., 1., 1., 0.],

[ 0., 0., 1., 0., 0.],

5.18. Multi-dimensional image processing (scipy.ndimage) 499

SciPy Reference Guide, Release 0.13.0

[ 0., 0., 0., 0., 0.]])

>>> ndimage.binary_dilation(a, structure=struct2).astype(a.dtype)

array([[ 0., 0., 0., 0., 0.],

[ 0., 1., 1., 1., 0.],

[ 0., 0., 0., 0., 0.]])

>>> ndimage.binary_dilation(a, structure=struct1,\

... iterations=2).astype(a.dtype)

array([[ 0., 0., 1., 0., 0.],

[ 0., 1., 1., 1., 0.],

[ 1., 1., 1., 1., 1.],

[ 0., 1., 1., 1., 0.],

[ 0., 0., 1., 0., 0.]])

scipy.ndimage.morphology.binary_erosion(input,structure=None,iterations=1,mask=None,

output=None,border_value=0,origin=0,

brute_force=False)

Multi-dimensional binary erosion with a given structuring element.

Binary erosion is a mathematical morphology operation used for image processing.

Parameters input : array_like

Binary image to be eroded. Non-zero (True) elements form the subset to be eroded.

structure : array_like, optional

Structuring element used for the erosion. Non-zero elements are considered True. If

no structuring element is provided, an element is generated with a square connectivity

equal to one.

iterations : {int, ﬂoat}, optional

The erosion is repeated iterations times (one, by default). If iterations is less than 1,

the erosion is repeated until the result does not change anymore.

mask : array_like, optional

If a mask is given, only those elements with a True value at the corresponding mask

element are modiﬁed at each iteration.

output : ndarray, optional

Array of the same shape as input, into which the output is placed. By default, a new

array is created.

origin : int or tuple of ints, optional

Placement of the ﬁlter, by default 0.

border_value : int (cast to 0 or 1)

Value at the border in the output array.

Returns binary_erosion : ndarray of bools

Erosion of the input by the structuring element.

See Also

grey_erosion,binary_dilation,binary_closing,binary_opening,

generate_binary_structure

Notes

Erosion [R71] is a mathematical morphology operation [R72] that uses a structuring element for shrinking the

shapes in an image. The binary erosion of an image by a structuring element is the locus of the points where

a superimposition of the structuring element centered on the point is entirely contained in the set of non-zero

elements of the image.

500 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

References

[R71],[R72]

Examples

>>> a=np.zeros((7,7), dtype=np.int)

>>> a[1:6,2:5]=1

>>> a

array([[0, 0, 0, 0, 0, 0, 0],

[0, 0, 1, 1, 1, 0, 0],

[0, 0, 0, 0, 0, 0, 0]])

>>> ndimage.binary_erosion(a).astype(a.dtype)

array([[0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 1, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0]])

>>> #Erosion removes objects smaller than the structure

>>> ndimage.binary_erosion(a, structure=np.ones((5,5))).astype(a.dtype)

array([[0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0]])

scipy.ndimage.morphology.binary_fill_holes(input,structure=None,output=None,ori-

gin=0)

Fill the holes in binary objects.

Parameters input : array_like

n-dimensional binary array with holes to be ﬁlled

structure : array_like, optional

Structuring element used in the computation; large-size elements make computations

faster but may miss holes separated from the background by thin regions. The default

element (with a square connectivity equal to one) yields the intuitive result where all

holes in the input have been ﬁlled.

output : ndarray, optional

Array of the same shape as input, into which the output is placed. By default, a new

array is created.

origin : int, tuple of ints, optional

Position of the structuring element.

Returns out : ndarray

Transformation of the initial image input where holes have been ﬁlled.

See Also

binary_dilation,binary_propagation,label

5.18. Multi-dimensional image processing (scipy.ndimage) 501

SciPy Reference Guide, Release 0.13.0

Notes

The algorithm used in this function consists in invading the complementary of the shapes in input from the outer

boundary of the image, using binary dilations. Holes are not connected to the boundary and are therefore not

invaded. The result is the complementary subset of the invaded region.

References

[R73]

Examples

>>> a=np.zeros((5,5), dtype=int)

>>> a[1:4,1:4]=1

>>> a[2,2]=0

>>> a

array([[0, 0, 0, 0, 0],

[0, 1, 1, 1, 0],

[0, 1, 0, 1, 0],

[0, 1, 1, 1, 0],

[0, 0, 0, 0, 0]])

>>> ndimage.binary_fill_holes(a).astype(int)

array([[0, 0, 0, 0, 0],

[0, 1, 1, 1, 0],

[0, 0, 0, 0, 0]])

>>> # Too big structuring element

>>> ndimage.binary_fill_holes(a, structure=np.ones((5,5))).astype(int)

array([[0, 0, 0, 0, 0],

[0, 1, 1, 1, 0],

[0, 1, 0, 1, 0],

[0, 1, 1, 1, 0],

[0, 0, 0, 0, 0]])

scipy.ndimage.morphology.binary_hit_or_miss(input,structure1=None,structure2=None,

output=None,origin1=0,origin2=None)

Multi-dimensional binary hit-or-miss transform.

The hit-or-miss transform ﬁnds the locations of a given pattern inside the input image.

Parameters input : array_like (cast to booleans)

Binary image where a pattern is to be detected.

structure1 : array_like (cast to booleans), optional

Part of the structuring element to be ﬁtted to the foreground (non-zero elements) of

input. If no value is provided, a structure of square connectivity 1 is chosen.

structure2 : array_like (cast to booleans), optional

Second part of the structuring element that has to miss completely the foreground. If

no value is provided, the complementary of structure1 is taken.

output : ndarray, optional

Array of the same shape as input, into which the output is placed. By default, a new

array is created.

origin1 : int or tuple of ints, optional

Placement of the ﬁrst part of the structuring element structure1, by default 0 for a

centered structure.

origin2 : int or tuple of ints, optional

Placement of the second part of the structuring element structure2, by default 0 for a

centered structure. If a value is provided for origin1 and not for origin2, then origin2

is set to origin1.

502 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Returns binary_hit_or_miss : ndarray

Hit-or-miss transform of input with the given structuring element (structure1,struc-

ture2).

See Also

ndimage.morphology,binary_erosion

References

[R74]

Examples

>>> a=np.zeros((7,7), dtype=np.int)

>>> a[1,1]=1; a[2:4,2:4]=1; a[4:6,4:6]=1

>>> a

array([[0, 0, 0, 0, 0, 0, 0],

[0, 1, 0, 0, 0, 0, 0],

[0, 0, 1, 1, 0, 0, 0],

[0, 0, 0, 0, 1, 1, 0],

[0, 0, 0, 0, 0, 0, 0]])

>>> structure1 =np.array([[1,0,0], [0,1,1], [0,1,1]])

>>> structure1

array([[1, 0, 0],

[0, 1, 1],

[0, 1, 1]])

>>> # Find the matches of structure1 in the array a

>>> ndimage.binary_hit_or_miss(a, structure1=structure1).astype(np.int)

array([[0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0],

[0, 0, 1, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 1, 0, 0],

[0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0]])

>>> # Change the origin of the filter

>>> # origin1=1 is equivalent to origin1=(1,1) here

>>> ndimage.binary_hit_or_miss(a, structure1=structure1,\

... origin1=1).astype(np.int)

array([[0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 1, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 1, 0],

[0, 0, 0, 0, 0, 0, 0]])

scipy.ndimage.morphology.binary_opening(input,structure=None,iterations=1,output=None,

origin=0)

Multi-dimensional binary opening with the given structuring element.

The opening of an input image by a structuring element is the dilation of the erosion of the image by the

structuring element.

Parameters input : array_like

Binary array_like to be opened. Non-zero (True) elements form the subset to be

opened.

5.18. Multi-dimensional image processing (scipy.ndimage) 503

SciPy Reference Guide, Release 0.13.0

structure : array_like, optional

Structuring element used for the opening. Non-zero elements are considered True. If

no structuring element is provided an element is generated with a square connectivity

equal to one (i.e., only nearest neighbors are connected to the center, diagonally-

connected elements are not considered neighbors).

iterations : {int, ﬂoat}, optional

The erosion step of the opening, then the dilation step are each repeated iterations

times (one, by default). If iterations is less than 1, each operation is repeated until the

result does not change anymore.

output : ndarray, optional

Array of the same shape as input, into which the output is placed. By default, a new

array is created.

origin : int or tuple of ints, optional

Placement of the ﬁlter, by default 0.

Returns binary_opening : ndarray of bools

Opening of the input by the structuring element.

See Also

grey_opening,binary_closing,binary_erosion,binary_dilation,

generate_binary_structure

Notes

Opening [R75] is a mathematical morphology operation [R76] that consists in the succession of an erosion and

a dilation of the input with the same structuring element. Opening therefore removes objects smaller than the

structuring element.

Together with closing (binary_closing), opening can be used for noise removal.

References

[R75],[R76]

Examples

>>> a=np.zeros((5,5), dtype=np.int)

>>> a[1:4,1:4]=1; a[4,4]=1

>>> a

array([[0, 0, 0, 0, 0],

[0, 1, 1, 1, 0],

[0, 0, 0, 0, 1]])

>>> # Opening removes small objects

>>> ndimage.binary_opening(a, structure=np.ones((3,3))).astype(np.int)

array([[0, 0, 0, 0, 0],

[0, 1, 1, 1, 0],

[0, 0, 0, 0, 0]])

>>> # Opening can also smooth corners

>>> ndimage.binary_opening(a).astype(np.int)

array([[0, 0, 0, 0, 0],

[0, 0, 1, 0, 0],

[0, 1, 1, 1, 0],

[0, 0, 1, 0, 0],

[0, 0, 0, 0, 0]])

504 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

>>> # Opening is the dilation of the erosion of the input

>>> ndimage.binary_erosion(a).astype(np.int)

array([[0, 0, 0, 0, 0],

[0, 0, 0, 0, 0],

[0, 0, 1, 0, 0],

[0, 0, 0, 0, 0],

[0, 0, 0, 0, 0]])

>>> ndimage.binary_dilation(ndimage.binary_erosion(a)).astype(np.int)

array([[0, 0, 0, 0, 0],

[0, 0, 1, 0, 0],

[0, 1, 1, 1, 0],

[0, 0, 1, 0, 0],

[0, 0, 0, 0, 0]])

scipy.ndimage.morphology.binary_propagation(input,structure=None,mask=None,out-

put=None,border_value=0,origin=0)

Multi-dimensional binary propagation with the given structuring element.

Parameters input : array_like

Binary image to be propagated inside mask.

structure : array_like

Structuring element used in the successive dilations. The output may depend on the

structuring element, especially if mask has several connex components. If no structur-

ing element is provided, an element is generated with a squared connectivity equal to

one.

mask : array_like

Binary mask deﬁning the region into which input is allowed to propagate.

output : ndarray, optional

Array of the same shape as input, into which the output is placed. By default, a new

array is created.

origin : int or tuple of ints, optional

Placement of the ﬁlter, by default 0.

Returns binary_propagation : ndarray

Binary propagation of input inside mask.

Notes

This function is functionally equivalent to calling binary_dilation with the number of iterations less then one:

iterative dilation until the result does not change anymore.

The succession of an erosion and propagation inside the original image can be used instead of an opening for

deleting small objects while keeping the contours of larger objects untouched.

References

[R77],[R78]

Examples

>>> input =np.zeros((8,8), dtype=np.int)

>>> input[2,2]=1

>>> mask =np.zeros((8,8), dtype=np.int)

>>> mask[1:4,1:4]=mask[4,4]=mask[6:8,6:8]=1

>>> input

array([[0, 0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0, 0],

[0, 0, 1, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0, 0],

5.18. Multi-dimensional image processing (scipy.ndimage) 505

SciPy Reference Guide, Release 0.13.0

[0, 0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0, 0]])

>>> mask

array([[0, 0, 0, 0, 0, 0, 0, 0],

[0, 1, 1, 1, 0, 0, 0, 0],

[0, 0, 0, 0, 1, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 1, 1],

[0, 0, 0, 0, 0, 0, 1, 1]])

>>> ndimage.binary_propagation(input, mask=mask).astype(np.int)

array([[0, 0, 0, 0, 0, 0, 0, 0],

[0, 1, 1, 1, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0, 0]])

>>> ndimage.binary_propagation(input, mask=mask,\

... structure=np.ones((3,3))).astype(np.int)

array([[0, 0, 0, 0, 0, 0, 0, 0],

[0, 1, 1, 1, 0, 0, 0, 0],

[0, 0, 0, 0, 1, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0, 0]])

>>> # Comparison between opening and erosion+propagation

>>> a=np.zeros((6,6), dtype=np.int)

>>> a[2:5,2:5]=1; a[0,0]=1; a[5,5]=1

>>> a

array([[1, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0],

[0, 0, 1, 1, 1, 0],

[0, 0, 0, 0, 0, 1]])

>>> ndimage.binary_opening(a).astype(np.int)

array([[0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0],

[0, 0, 0, 1, 0, 0],

[0, 0, 1, 1, 1, 0],

[0, 0, 0, 1, 0, 0],

[0, 0, 0, 0, 0, 0]])

>>> b=ndimage.binary_erosion(a)

>>> b.astype(int)

array([[0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0],

[0, 0, 0, 1, 0, 0],

[0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0]])

506 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

>>> ndimage.binary_propagation(b, mask=a).astype(np.int)

array([[0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0],

[0, 0, 1, 1, 1, 0],

[0, 0, 0, 0, 0, 0]])

scipy.ndimage.morphology.black_tophat(input,size=None,footprint=None,structure=None,

output=None,mode=’reﬂect’,cval=0.0,origin=0)

Multi-dimensional black tophat ﬁlter.

Parameters input : array_like

Input.

size : tuple of ints

Shape of a ﬂat and full structuring element used for the ﬁlter. Optional if footprint or

structure is provided.

footprint : array of ints, optional

Positions of non-inﬁnite elements of a ﬂat structuring element used for the black

tophat ﬁlter.

structure : array of ints, optional

Structuring element used for the ﬁlter. structure may be a non-ﬂat structuring element.

output : array, optional

An array used for storing the output of the ﬁlter may be provided.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0.

origin : scalar, optional

The origin parameter controls the placement of the ﬁlter. Default 0

Returns black_tophat : ndarray

Result of the ﬁlter of input with structure.

See Also

white_tophat,grey_opening,grey_closing

scipy.ndimage.morphology.distance_transform_bf(input,metric=’euclidean’,sam-

pling=None,return_distances=True,

return_indices=False,distances=None,

indices=None)

Distance transform function by a brute force algorithm.

This function calculates the distance transform of the input, by replacing each background element (zero values),

with its shortest distance to the foreground (any element non-zero).

In addition to the distance transform, the feature transform can be calculated. In this case the index of the closest

background element is returned along the ﬁrst axis of the result.

Parameters input : array_like

Input

metric : str, optional

Three types of distance metric are supported: ‘euclidean’, ‘taxicab’ and ‘chessboard’.

sampling : {int, sequence of ints}, optional

This parameter is only used in the case of the euclidean metric distance transform.

5.18. Multi-dimensional image processing (scipy.ndimage) 507

SciPy Reference Guide, Release 0.13.0

The sampling along each axis can be given by the sampling parameter which should be

a sequence of length equal to the input rank, or a single number in which the sampling

is assumed to be equal along all axes.

return_distances : bool, optional

The return_distances ﬂag can be used to indicate if the distance transform is returned.

The default is True.

return_indices : bool, optional

The return_indices ﬂags can be used to indicate if the feature transform is returned.

The default is False.

distances : ﬂoat64 ndarray, optional

Optional output array to hold distances (if return_distances is True).

indices : int64 ndarray, optional

Optional output array to hold indices (if return_indices is True).

Returns distances : ndarray

Distance array if return_distances is True.

indices : ndarray

Indices array if return_indices is True.

Notes

This function employs a slow brute force algorithm, see also the function distance_transform_cdt for more

efﬁcient taxicab and chessboard algorithms.

scipy.ndimage.morphology.distance_transform_cdt(input,metric=’chessboard’,

return_distances=True,re-

turn_indices=False,distances=None,

indices=None)

Distance transform for chamfer type of transforms.

Parameters input : array_like

Input

metric : {‘chessboard’, ‘taxicab’}, optional

The metric determines the type of chamfering that is done. If the metric is equal

to ‘taxicab’ a structure is generated using generate_binary_structure with a squared

distance equal to 1. If the metric is equal to ‘chessboard’, a metric is generated us-

ing generate_binary_structure with a squared distance equal to the rank of the ar-

ray. These choices correspond to the common interpretations of the ‘taxicab’ and the

‘chessboard’ distance metrics in two dimensions.

The default for metric is ‘chessboard’.

return_distances, return_indices : bool, optional

The return_distances, and return_indices ﬂags can be used to indicate if the distance

transform, the feature transform, or both must be returned.

If the feature transform is returned (return_indices=True), the index of the

closest background element is returned along the ﬁrst axis of the result.

The return_distances default is True, and the return_indices default is False.

distances, indices : ndarrays of int32, optional

The distances and indices arguments can be used to give optional output arrays that

must be the same shape as input.

scipy.ndimage.morphology.distance_transform_edt(input,sampling=None,re-

turn_distances=True,re-

turn_indices=False,distances=None,

indices=None)

Exact euclidean distance transform.

In addition to the distance transform, the feature transform can be calculated. In this case the index of the closest

background element is returned along the ﬁrst axis of the result.

508 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Parameters input : array_like

Input data to transform. Can be any type but will be converted into binary: 1 wherever

input equates to True, 0 elsewhere.

sampling : ﬂoat or int, or sequence of same, optional

Spacing of elements along each dimension. If a sequence, must be of length equal

to the input rank; if a single number, this is used for all axes. If not speciﬁed, a grid

spacing of unity is implied.

return_distances : bool, optional

Whether to return distance matrix. At least one of return_distances/return_indices

must be True. Default is True.

return_indices : bool, optional

Whether to return indices matrix. Default is False.

distance : ndarray, optional

Used for output of distance array, must be of type ﬂoat64.

indices : ndarray, optional

Used for output of indices, must be of type int32.

Returns distance_transform_edt : ndarray or list of ndarrays

Either distance matrix, index matrix, or a list of the two, depending on return_x ﬂags

and distance and indices input parameters.

Notes

The euclidean distance transform gives values of the euclidean distance:

y_i = sqrt(sum (x[i]-b[i])**2)

where b[i] is the background point (value 0) with the smallest Euclidean distance to input points x[i], and n is

the number of dimensions.

Examples

>>> a=np.array(([0,1,1,1,1],

[0,0,1,1,1],

[0,1,1,1,1],

[0,1,1,1,0],

[0,1,1,0,0]))

>>> from scipy import ndimage

>>> ndimage.distance_transform_edt(a)

array([[ 0. , 1. , 1.4142, 2.2361, 3. ],

[ 0. , 0. , 1. , 2. , 2. ],

[ 0. , 1. , 1.4142, 1.4142, 1. ],

[ 0. , 1. , 1.4142, 1. , 0. ],

[ 0. , 1. , 1. , 0. , 0. ]])

With a sampling of 2 units along x, 1 along y:

>>> ndimage.distance_transform_edt(a, sampling=[2,1])

array([[ 0. , 1. , 2. , 2.8284, 3.6056],

[ 0. , 0. , 1. , 2. , 3. ],

[ 0. , 1. , 2. , 2.2361, 2. ],

[ 0. , 1. , 2. , 1. , 0. ],

[ 0. , 1. , 1. , 0. , 0. ]])

Asking for indices as well:

5.18. Multi-dimensional image processing (scipy.ndimage) 509

SciPy Reference Guide, Release 0.13.0

>>> edt, inds =ndimage.distance_transform_edt(a, return_indices=True)

>>> inds

array([[[0, 0, 1, 1, 3],

[1, 1, 1, 1, 3],

[2, 2, 1, 3, 3],

[3, 3, 4, 4, 3],

[4, 4, 4, 4, 4]],

[[0, 0, 1, 1, 4],

[0, 1, 1, 1, 4],

[0, 0, 1, 4, 4],

[0, 0, 3, 3, 4],

[0, 0, 3, 3, 4]]])

With arrays provided for inplace outputs:

>>> indices =np.zeros(((np.rank(a),) +a.shape), dtype=np.int32)

>>> ndimage.distance_transform_edt(a, return_indices=True, indices=indices)

array([[ 0. , 1. , 1.4142, 2.2361, 3. ],

[ 0. , 0. , 1. , 2. , 2. ],

[ 0. , 1. , 1.4142, 1.4142, 1. ],

[ 0. , 1. , 1.4142, 1. , 0. ],

[ 0. , 1. , 1. , 0. , 0. ]])

>>> indices

array([[[0, 0, 1, 1, 3],

[1, 1, 1, 1, 3],

[2, 2, 1, 3, 3],

[3, 3, 4, 4, 3],

[4, 4, 4, 4, 4]],

[[0, 0, 1, 1, 4],

[0, 1, 1, 1, 4],

[0, 0, 1, 4, 4],

[0, 0, 3, 3, 4],

[0, 0, 3, 3, 4]]])

scipy.ndimage.morphology.generate_binary_structure(rank,connectivity)

Generate a binary structure for binary morphological operations.

Parameters rank : int

Number of dimensions of the array to which the structuring element will be applied,

as returned by np.ndim.

connectivity : int

connectivity determines which elements of the output array belong to the structure, i.e.

are considered as neighbors of the central element. Elements up to a squared distance

of connectivity from the center are considered neighbors. connectivity may range from

1 (no diagonal elements are neighbors) to rank (all elements are neighbors).

Returns output : ndarray of bools

Structuring element which may be used for binary morphological operations, with

rank dimensions and all dimensions equal to 3.

See Also

iterate_structure,binary_dilation,binary_erosion

Notes

generate_binary_structure can only create structuring elements with dimensions equal to 3, i.e. min-

imal dimensions. For larger structuring elements, that are useful e.g. for eroding large objects, one may either

use iterate_structure, or create directly custom arrays with numpy functions such as numpy.ones.

510 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Examples

>>> struct =ndimage.generate_binary_structure(2,1)

>>> struct

array([[False, True, False],

[ True, True, True],

[False, True, False]], dtype=bool)

>>> a=np.zeros((5,5))

>>> a[2,2]=1

>>> a

array([[ 0., 0., 0., 0., 0.],

[ 0., 0., 0., 0., 0.],

[ 0., 0., 1., 0., 0.],

[ 0., 0., 0., 0., 0.],

[ 0., 0., 0., 0., 0.]])

>>> b=ndimage.binary_dilation(a, structure=struct).astype(a.dtype)

>>> b

array([[ 0., 0., 0., 0., 0.],

[ 0., 0., 1., 0., 0.],

[ 0., 1., 1., 1., 0.],

[ 0., 0., 1., 0., 0.],

[ 0., 0., 0., 0., 0.]])

>>> ndimage.binary_dilation(b, structure=struct).astype(a.dtype)

array([[ 0., 0., 1., 0., 0.],

[ 0., 1., 1., 1., 0.],

[ 1., 1., 1., 1., 1.],

[ 0., 1., 1., 1., 0.],

[ 0., 0., 1., 0., 0.]])

>>> struct =ndimage.generate_binary_structure(2,2)

>>> struct

array([[ True, True, True],

[ True, True, True],

[ True, True, True]], dtype=bool)

>>> struct =ndimage.generate_binary_structure(3,1)

>>> struct # no diagonal elements

array([[[False, False, False],

[False, True, False],

[False, False, False]],

[[False, True, False],

[ True, True, True],

[False, True, False]],

[[False, False, False],

[False, True, False],

[False, False, False]]], dtype=bool)

scipy.ndimage.morphology.grey_closing(input,size=None,footprint=None,structure=None,

output=None,mode=’reﬂect’,cval=0.0,origin=0)

Multi-dimensional greyscale closing.

A greyscale closing consists in the succession of a greyscale dilation, and a greyscale erosion.

Parameters input : array_like

Array over which the grayscale closing is to be computed.

size : tuple of ints

Shape of a ﬂat and full structuring element used for the grayscale closing. Optional if

footprint or structure is provided.

footprint : array of ints, optional

Positions of non-inﬁnite elements of a ﬂat structuring element used for the grayscale

closing.

5.18. Multi-dimensional image processing (scipy.ndimage) 511

SciPy Reference Guide, Release 0.13.0

structure : array of ints, optional

Structuring element used for the grayscale closing. structure may be a non-ﬂat struc-

turing element.

output : array, optional

An array used for storing the ouput of the closing may be provided.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0.

origin : scalar, optional

The origin parameter controls the placement of the ﬁlter. Default 0

Returns grey_closing : ndarray

Result of the grayscale closing of input with structure.

See Also

binary_closing,grey_dilation,grey_erosion,grey_opening,

generate_binary_structure

Notes

The action of a grayscale closing with a ﬂat structuring element amounts to smoothen deep local minima,

whereas binary closing ﬁlls small holes.

References

[R79]

Examples

>>> a=np.arange(36).reshape((6,6))

>>> a[3,3]=0

>>> a

array([[ 0, 1, 2, 3, 4, 5],

[ 6, 7, 8, 9, 10, 11],

[12, 13, 14, 15, 16, 17],

[18, 19, 20, 0, 22, 23],

[24, 25, 26, 27, 28, 29],

[30, 31, 32, 33, 34, 35]])

>>> ndimage.grey_closing(a, size=(3,3))

array([[ 7, 7, 8, 9, 10, 11],

[ 7, 7, 8, 9, 10, 11],

[13, 13, 14, 15, 16, 17],

[19, 19, 20, 20, 22, 23],

[25, 25, 26, 27, 28, 29],

[31, 31, 32, 33, 34, 35]])

>>> # Note that the local minimum a[3,3] has disappeared

scipy.ndimage.morphology.grey_dilation(input,size=None,footprint=None,structure=None,

output=None,mode=’reﬂect’,cval=0.0,origin=0)

Calculate a greyscale dilation, using either a structuring element, or a footprint corresponding to a ﬂat structuring

element.

Grayscale dilation is a mathematical morphology operation. For the simple case of a full and ﬂat structuring

element, it can be viewed as a maximum ﬁlter over a sliding window.

Parameters input : array_like

Array over which the grayscale dilation is to be computed.

512 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

size : tuple of ints

Shape of a ﬂat and full structuring element used for the grayscale dilation. Optional if

footprint or structure is provided.

footprint : array of ints, optional

Positions of non-inﬁnite elements of a ﬂat structuring element used for the grayscale

dilation. Non-zero values give the set of neighbors of the center over which the maxi-

mum is chosen.

structure : array of ints, optional

Structuring element used for the grayscale dilation. structure may be a non-ﬂat struc-

turing element.

output : array, optional

An array used for storing the ouput of the dilation may be provided.

mode : {‘reﬂect’,’constant’,’nearest’,’mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0.

origin : scalar, optional

The origin parameter controls the placement of the ﬁlter. Default 0

Returns grey_dilation : ndarray

Grayscale dilation of input.

See Also

binary_dilation,grey_erosion,grey_closing,grey_opening,

generate_binary_structure,ndimage.maximum_filter

Notes

The grayscale dilation of an image input by a structuring element s deﬁned over a domain E is given by:

(input+s)(x) = max {input(y) + s(x-y), for y in E}

In particular, for structuring elements deﬁned as s(y) = 0 for y in E, the grayscale dilation computes the maximum

of the input image inside a sliding window deﬁned by E.

Grayscale dilation [R80] is a mathematical morphology operation [R81].

References

[R80],[R81]

Examples

>>> a=np.zeros((7,7), dtype=np.int)

>>> a[2:5,2:5]=1

>>> a[4,4]=2; a[2,3]=3

>>> a

array([[0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0],

[0, 0, 1, 3, 1, 0, 0],

[0, 0, 1, 1, 1, 0, 0],

[0, 0, 1, 1, 2, 0, 0],

[0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0]])

>>> ndimage.grey_dilation(a, size=(3,3))

array([[0, 0, 0, 0, 0, 0, 0],

[0, 1, 3, 3, 3, 1, 0],

5.18. Multi-dimensional image processing (scipy.ndimage) 513

SciPy Reference Guide, Release 0.13.0

[0, 1, 3, 3, 3, 2, 0],

[0, 1, 1, 2, 2, 2, 0],

[0, 0, 0, 0, 0, 0, 0]])

>>> ndimage.grey_dilation(a, footprint=np.ones((3,3)))

array([[0, 0, 0, 0, 0, 0, 0],

[0, 1, 3, 3, 3, 1, 0],

[0, 1, 3, 3, 3, 2, 0],

[0, 1, 1, 2, 2, 2, 0],

[0, 0, 0, 0, 0, 0, 0]])

>>> s=ndimage.generate_binary_structure(2,1)

>>> s

array([[False, True, False],

[ True, True, True],

[False, True, False]], dtype=bool)

>>> ndimage.grey_dilation(a, footprint=s)

array([[0, 0, 0, 0, 0, 0, 0],

[0, 0, 1, 3, 1, 0, 0],

[0, 1, 3, 3, 3, 1, 0],

[0, 1, 1, 3, 2, 1, 0],

[0, 1, 1, 2, 2, 2, 0],

[0, 0, 1, 1, 2, 0, 0],

[0, 0, 0, 0, 0, 0, 0]])

>>> ndimage.grey_dilation(a, size=(3,3), structure=np.ones((3,3)))

array([[1, 1, 1, 1, 1, 1, 1],

[1, 2, 4, 4, 4, 2, 1],

[1, 2, 4, 4, 4, 3, 1],

[1, 2, 2, 3, 3, 3, 1],

[1, 1, 1, 1, 1, 1, 1]])

scipy.ndimage.morphology.grey_erosion(input,size=None,footprint=None,structure=None,

output=None,mode=’reﬂect’,cval=0.0,origin=0)

Calculate a greyscale erosion, using either a structuring element, or a footprint corresponding to a ﬂat structuring

element.

Grayscale erosion is a mathematical morphology operation. For the simple case of a full and ﬂat structuring

element, it can be viewed as a minimum ﬁlter over a sliding window.

Parameters input : array_like

Array over which the grayscale erosion is to be computed.

size : tuple of ints

Shape of a ﬂat and full structuring element used for the grayscale erosion. Optional if

footprint or structure is provided.

footprint : array of ints, optional

Positions of non-inﬁnite elements of a ﬂat structuring element used for the grayscale

erosion. Non-zero values give the set of neighbors of the center over which the mini-

mum is chosen.

structure : array of ints, optional

Structuring element used for the grayscale erosion. structure may be a non-ﬂat struc-

turing element.

output : array, optional

An array used for storing the ouput of the erosion may be provided.

mode : {‘reﬂect’,’constant’,’nearest’,’mirror’, ‘wrap’}, optional

514 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0.

origin : scalar, optional

The origin parameter controls the placement of the ﬁlter. Default 0

Returns output : ndarray

Grayscale erosion of input.

See Also

binary_erosion,grey_dilation,grey_opening,grey_closing,

generate_binary_structure,ndimage.minimum_filter

Notes

The grayscale erosion of an image input by a structuring element s deﬁned over a domain E is given by:

(input+s)(x) = min {input(y) - s(x-y), for y in E}

In particular, for structuring elements deﬁned as s(y) = 0 for y in E, the grayscale erosion computes the minimum

of the input image inside a sliding window deﬁned by E.

Grayscale erosion [R82] is a mathematical morphology operation [R83].

References

[R82],[R83]

Examples

>>> a=np.zeros((7,7), dtype=np.int)

>>> a[1:6,1:6]=3

>>> a[4,4]=2; a[2,3]=1

>>> a

array([[0, 0, 0, 0, 0, 0, 0],

[0, 3, 3, 3, 3, 3, 0],

[0, 3, 3, 1, 3, 3, 0],

[0, 3, 3, 3, 3, 3, 0],

[0, 3, 3, 3, 2, 3, 0],

[0, 3, 3, 3, 3, 3, 0],

[0, 0, 0, 0, 0, 0, 0]])

>>> ndimage.grey_erosion(a, size=(3,3))

array([[0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0],

[0, 0, 1, 1, 1, 0, 0],

[0, 0, 3, 2, 2, 0, 0],

[0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0]])

>>> footprint =ndimage.generate_binary_structure(2,1)

>>> footprint

array([[False, True, False],

[ True, True, True],

[False, True, False]], dtype=bool)

>>> # Diagonally-connected elements are not considered neighbors

>>> ndimage.grey_erosion(a, size=(3,3), footprint=footprint)

array([[0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0],

[0, 0, 1, 1, 1, 0, 0],

5.18. Multi-dimensional image processing (scipy.ndimage) 515

SciPy Reference Guide, Release 0.13.0

[0, 0, 3, 1, 2, 0, 0],

[0, 0, 3, 2, 2, 0, 0],

[0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0]])

scipy.ndimage.morphology.grey_opening(input,size=None,footprint=None,structure=None,

output=None,mode=’reﬂect’,cval=0.0,origin=0)

Multi-dimensional greyscale opening.

A greyscale opening consists in the succession of a greyscale erosion, and a greyscale dilation.

Parameters input : array_like

Array over which the grayscale opening is to be computed.

size : tuple of ints

Shape of a ﬂat and full structuring element used for the grayscale opening. Optional

if footprint or structure is provided.

footprint : array of ints, optional

Positions of non-inﬁnite elements of a ﬂat structuring element used for the grayscale

opening.

structure : array of ints, optional

Structuring element used for the grayscale opening. structure may be a non-ﬂat struc-

turing element.

output : array, optional

An array used for storing the ouput of the opening may be provided.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0.

origin : scalar, optional

The origin parameter controls the placement of the ﬁlter. Default 0

Returns grey_opening : ndarray

Result of the grayscale opening of input with structure.

See Also

binary_opening,grey_dilation,grey_erosion,grey_closing,

generate_binary_structure

Notes

The action of a grayscale opening with a ﬂat structuring element amounts to smoothen high local maxima,

whereas binary opening erases small objects.

References

[R84]

Examples

>>> a=np.arange(36).reshape((6,6))

>>> a[3,3]=50

>>> a

array([[ 0, 1, 2, 3, 4, 5],

[ 6, 7, 8, 9, 10, 11],

[12, 13, 14, 15, 16, 17],

[18, 19, 20, 50, 22, 23],

[24, 25, 26, 27, 28, 29],

516 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

[30, 31, 32, 33, 34, 35]])

>>> ndimage.grey_opening(a, size=(3,3))

array([[ 0, 1, 2, 3, 4, 4],

[ 6, 7, 8, 9, 10, 10],

[12, 13, 14, 15, 16, 16],

[18, 19, 20, 22, 22, 22],

[24, 25, 26, 27, 28, 28],

[24, 25, 26, 27, 28, 28]])

>>> # Note that the local maximum a[3,3] has disappeared

scipy.ndimage.morphology.iterate_structure(structure,iterations,origin=None)

Iterate a structure by dilating it with itself.

Parameters structure : array_like

Structuring element (an array of bools, for example), to be dilated with itself.

iterations : int

number of dilations performed on the structure with itself

origin : optional

If origin is None, only the iterated structure is returned. If not, a tuple of the iterated

structure and the modiﬁed origin is returned.

Returns iterate_structure : ndarray of bools

A new structuring element obtained by dilating structure (iterations - 1) times with

itself.

See Also

generate_binary_structure

Examples

>>> struct =ndimage.generate_binary_structure(2,1)

>>> struct.astype(int)

array([[0, 1, 0],

[1, 1, 1],

[0, 1, 0]])

>>> ndimage.iterate_structure(struct, 2).astype(int)

array([[0, 0, 1, 0, 0],

[0, 1, 1, 1, 0],

[1, 1, 1, 1, 1],

[0, 1, 1, 1, 0],

[0, 0, 1, 0, 0]])

>>> ndimage.iterate_structure(struct, 3).astype(int)

array([[0, 0, 0, 1, 0, 0, 0],

[0, 0, 1, 1, 1, 0, 0],

[0, 1, 1, 1, 1, 1, 0],

[1, 1, 1, 1, 1, 1, 1],

[0, 1, 1, 1, 1, 1, 0],

[0, 0, 1, 1, 1, 0, 0],

[0, 0, 0, 1, 0, 0, 0]])

scipy.ndimage.morphology.morphological_gradient(input,size=None,footprint=None,

structure=None,output=None,

mode=’reﬂect’,cval=0.0,origin=0)

Multi-dimensional morphological gradient.

The morphological gradient is calculated as the difference between a dilation and an erosion of the input with a

given structuring element.

Parameters input : array_like

5.18. Multi-dimensional image processing (scipy.ndimage) 517

SciPy Reference Guide, Release 0.13.0

Array over which to compute the morphlogical gradient.

size : tuple of ints

Shape of a ﬂat and full structuring element used for the mathematical morphology

operations. Optional if footprint or structure is provided. A larger size yields a more

blurred gradient.

footprint : array of ints, optional

Positions of non-inﬁnite elements of a ﬂat structuring element used for the morphol-

ogy operations. Larger footprints give a more blurred morphological gradient.

structure : array of ints, optional

Structuring element used for the morphology operations. structure may be a non-ﬂat

structuring element.

output : array, optional

An array used for storing the ouput of the morphological gradient may be provided.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0.

origin : scalar, optional

The origin parameter controls the placement of the ﬁlter. Default 0

Returns morphological_gradient : ndarray

Morphological gradient of input.

See Also

grey_dilation,grey_erosion,ndimage.gaussian_gradient_magnitude

Notes

For a ﬂat structuring element, the morphological gradient computed at a given point corresponds to the maximal

difference between elements of the input among the elements covered by the structuring element centered on

the point.

References

[R85]

Examples

>>> a=np.zeros((7,7), dtype=np.int)

>>> a[2:5,2:5]=1

>>> ndimage.morphological_gradient(a, size=(3,3))

array([[0, 0, 0, 0, 0, 0, 0],

[0, 1, 1, 1, 1, 1, 0],

[0, 1, 1, 0, 1, 1, 0],

[0, 1, 1, 1, 1, 1, 0],

[0, 0, 0, 0, 0, 0, 0]])

>>> # The morphological gradient is computed as the difference

>>> # between a dilation and an erosion

>>> ndimage.grey_dilation(a, size=(3,3)) -\

... ndimage.grey_erosion(a, size=(3,3))

array([[0, 0, 0, 0, 0, 0, 0],

[0, 1, 1, 1, 1, 1, 0],

[0, 1, 1, 0, 1, 1, 0],

[0, 1, 1, 1, 1, 1, 0],

518 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

[0, 1, 1, 1, 1, 1, 0],

[0, 0, 0, 0, 0, 0, 0]])

>>> a=np.zeros((7,7), dtype=np.int)

>>> a[2:5,2:5]=1

>>> a[4,4]=2; a[2,3]=3

>>> a

array([[0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0],

[0, 0, 1, 3, 1, 0, 0],

[0, 0, 1, 1, 1, 0, 0],

[0, 0, 1, 1, 2, 0, 0],

[0, 0, 0, 0, 0, 0, 0],

[0, 0, 0, 0, 0, 0, 0]])

>>> ndimage.morphological_gradient(a, size=(3,3))

array([[0, 0, 0, 0, 0, 0, 0],

[0, 1, 3, 3, 3, 1, 0],

[0, 1, 3, 2, 3, 2, 0],

[0, 1, 1, 2, 2, 2, 0],

[0, 0, 0, 0, 0, 0, 0]])

scipy.ndimage.morphology.morphological_laplace(input,size=None,footprint=None,

structure=None,output=None,

mode=’reﬂect’,cval=0.0,origin=0)

Multi-dimensional morphological laplace.

Parameters input : array_like

Input.

size : int or sequence of ints, optional

See structure.

footprint : bool or ndarray, optional

See structure.

structure : structure

Either size,footprint, or the structure must be provided.

output : ndarray

An output array can optionally be provided.

mode : {‘reﬂect’,’constant’,’nearest’,’mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled. For ‘constant’

mode, values beyond borders are set to be cval. Default is ‘reﬂect’.

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0

origin : origin

The origin parameter controls the placement of the ﬁlter.

Returns morphological_laplace : ndarray

Output

scipy.ndimage.morphology.white_tophat(input,size=None,footprint=None,structure=None,

output=None,mode=’reﬂect’,cval=0.0,origin=0)

Multi-dimensional white tophat ﬁlter.

Parameters input : array_like

Input.

size : tuple of ints

Shape of a ﬂat and full structuring element used for the ﬁlter. Optional if footprint or

structure is provided.

footprint : array of ints, optional

5.18. Multi-dimensional image processing (scipy.ndimage) 519

SciPy Reference Guide, Release 0.13.0

Positions of elements of a ﬂat structuring element used for the white tophat ﬁlter.

structure : array of ints, optional

Structuring element used for the ﬁlter. structure may be a non-ﬂat structuring element.

output : array, optional

An array used for storing the output of the ﬁlter may be provided.

mode : {‘reﬂect’, ‘constant’, ‘nearest’, ‘mirror’, ‘wrap’}, optional

The mode parameter determines how the array borders are handled, where cval is the

value when mode is equal to ‘constant’. Default is ‘reﬂect’

cval : scalar, optional

Value to ﬁll past edges of input if mode is ‘constant’. Default is 0.0.

origin : scalar, optional

The origin parameter controls the placement of the ﬁlter. Default is 0.

Returns output : ndarray

Result of the ﬁlter of input with structure.

See Also

black_tophat

5.18.6 Utility

imread(fname[, ﬂatten, mode]) Load an image from ﬁle.

scipy.ndimage.imread(fname,ﬂatten=False,mode=None)

Load an image from ﬁle.

Parameters fname : str

Image ﬁle name, e.g. test.jpg.

ﬂatten : bool, optional

If true, convert the output to grey-scale. Default is False.

mode : str, optional

mode to convert image to, e.g. RGB.

Returns img_array : ndarray

The different colour bands/channels are stored in the third dimension, such that a

grey-image is MxN, an RGB-image MxNx3 and an RGBA-image MxNx4.

Raises ImportError

If the Python Imaging Library (PIL) can not be imported.

5.19 Orthogonal distance regression (scipy.odr)

5.19.1 Package Content

Data(x[, y, we, wd, ﬁx, meta]) The data to ﬁt.

RealData(x[, y, sx, sy, covx, covy, ﬁx, meta]) The data, with weightings as actual standard deviations and/or covariances.

Model(fcn[, fjacb, fjacd, extra_args, ...]) The Model class stores information about the function you wish to ﬁt.

ODR(data, model[, beta0, delta0, iﬁxb, ...]) The ODR class gathers all information and coordinates the running of the

Output(output) The Output class stores the output of an ODR run.

odr(fcn, beta0, y, x[, we, wd, fjacb, ...]) Low-level function for ODR.

odr_error Exception indicating an error in ﬁtting.

Continued on next page

520 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Table 5.84 – continued from previous page

odr_stop Exception stopping ﬁtting.

class scipy.odr.Data(x,y=None,we=None,wd=None,ﬁx=None,meta={})

The data to ﬁt.

Parameters x: array_like

Input data for regression.

y: array_like, optional

Input data for regression.

we : array_like, optional

If we is a scalar, then that value is used for all data points (and all dimensions of

the response variable). If we is a rank-1 array of length q (the dimensionality of the

response variable), then this vector is the diagonal of the covariant weighting matrix

for all data points. If we is a rank-1 array of length n (the number of data points),

then the i’th element is the weight for the i’th response variable observation (single-

dimensional only). If we is a rank-2 array of shape (q, q), then this is the full covariant

weighting matrix broadcast to each observation. If we is a rank-2 array of shape (q, n),

then we[:,i] is the diagonal of the covariant weighting matrix for the i’th observation.

If we is a rank-3 array of shape (q, q, n), then we[:,:,i] is the full speciﬁcation of

the covariant weighting matrix for each observation. If the ﬁt is implicit, then only a

positive scalar value is used.

wd : array_like, optional

If wd is a scalar, then that value is used for all data points (and all dimensions of the

input variable). If wd = 0, then the covariant weighting matrix for each observation

is set to the identity matrix (so each dimension of each observation has the same

weight). If wd is a rank-1 array of length m (the dimensionality of the input variable),

then this vector is the diagonal of the covariant weighting matrix for all data points.

If wd is a rank-1 array of length n (the number of data points), then the i’th element

is the weight for the i’th input variable observation (single-dimensional only). If wd

is a rank-2 array of shape (m, m), then this is the full covariant weighting matrix

broadcast to each observation. If wd is a rank-2 array of shape (m, n), then wd[:,i]

is the diagonal of the covariant weighting matrix for the i’th observation. If wd is a

rank-3 array of shape (m, m, n), then wd[:,:,i] is the full speciﬁcation of the covariant

weighting matrix for each observation.

ﬁx : array_like of ints, optional

The ﬁx argument is the same as iﬁxx in the class ODR. It is an array of integers with

the same shape as data.x that determines which input observations are treated as ﬁxed.

One can use a sequence of length m (the dimensionality of the input observations) to

ﬁx some dimensions for all observations. A value of 0 ﬁxes the observation, a value >

0 makes it free.

meta : dict, optional

Free-form dictionary for metadata.

Notes

Each argument is attached to the member of the instance of the same name. The structures of xand yare

described in the Model class docstring. If yis an integer, then the Data instance can only be used to ﬁt with

implicit models where the dimensionality of the response is equal to the speciﬁed value of y.

The we argument weights the effect a deviation in the response variable has on the ﬁt. The wd argument weights

the effect a deviation in the input variable has on the ﬁt. To handle multidimensional inputs and responses easily,

the structure of these arguments has the n’th dimensional axis ﬁrst. These arguments heavily use the structured

arguments feature of ODRPACK to conveniently and ﬂexibly support all options. See the ODRPACK User’s

5.19. Orthogonal distance regression (scipy.odr) 521

SciPy Reference Guide, Release 0.13.0

Guide for a full explanation of how these weights are used in the algorithm. Basically, a higher value of the

weight for a particular data point makes a deviation at that point more detrimental to the ﬁt.

Methods

set_meta(**kwds) Update the metadata dictionary with the keywords and data provided by keywords.

Data.set_meta(**kwds)

Update the metadata dictionary with the keywords and data provided by keywords.

Examples

>>> data.set_meta(lab="Ph 7; Lab 26", title="Ag110 + Ag108 Decay")

class scipy.odr.RealData(x,y=None,sx=None,sy=None,covx=None,covy=None,ﬁx=None,

meta={})

The data, with weightings as actual standard deviations and/or covariances.

Parameters x: array_like

y: array_like, optional

sx, sy : array_like, optional

Standard deviations of x.sx are standard deviations of xand are converted to weights

dividing 1.0 by their squares.

sy : array_like, optional

Standard deviations of y.sy are standard deviations of yand are converted to weights

by dividing 1.0 by their squares.

covx : array_like, optional

Covariance of x covx is an array of covariance matrices of xand are converted to

weights by performing a matrix inversion on each observation’s covariance matrix.

covy : array_like, optional

Covariance of y covy is an array of covariance matrices and are converted to weights

by performing a matrix inversion on each observation’s covariance matrix.

ﬁx : array_like, optional

The argument and member ﬁx is the same as Data.ﬁx and ODR.iﬁxx: It is an array

of integers with the same shape as xthat determines which input observations are

treated as ﬁxed. One can use a sequence of length m (the dimensionality of the input

observations) to ﬁx some dimensions for all observations. A value of 0 ﬁxes the

observation, a value > 0 makes it free.

meta : dict, optional

Free-form dictionary for metadata.

Notes

The weights wd and we are computed from provided values as follows:

sx and sy are converted to weights by dividing 1.0 by their squares. For example, wd =

1./numpy.power(‘sx‘, 2).

covx and covy are arrays of covariance matrices and are converted to weights by performing a matrix inversion

on each observation’s covariance matrix. For example, we[i] = numpy.linalg.inv(covy[i]).

These arguments follow the same structured argument conventions as wd and we only restricted by their natures:

sx and sy can’t be rank-3, but covx and covy can be.

522 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Only set either sx or covx (not both). Setting both will raise an exception. Same with sy and covy.

Methods

set_meta(**kwds) Update the metadata dictionary with the keywords and data provided by keywords.

RealData.set_meta(**kwds)

Update the metadata dictionary with the keywords and data provided by keywords.

Examples

>>> data.set_meta(lab="Ph 7; Lab 26", title="Ag110 + Ag108 Decay")

class scipy.odr.Model(fcn,fjacb=None,fjacd=None,extra_args=None,estimate=None,implicit=0,

meta=None)

The Model class stores information about the function you wish to ﬁt.

It stores the function itself, at the least, and optionally stores functions which compute the Jacobians used

during ﬁtting. Also, one can provide a function that will provide reasonable starting values for the ﬁt parameters

possibly given the set of data.

Parameters fcn : function

fcn(beta, x) –> y

fjacb : function

Jacobian of fcn wrt the ﬁt parameters beta.

fjacb(beta, x) –> @f_i(x,B)/@B_j

fjacd : function

Jacobian of fcn wrt the (possibly multidimensional) input variable.

fjacd(beta, x) –> @f_i(x,B)/@x_j

extra_args : tuple, optional

If speciﬁed, extra_args should be a tuple of extra arguments to pass to fcn,fjacb, and

fjacd. Each will be called by apply(fcn, (beta, x) + extra_args)

estimate : array_like of rank-1

Provides estimates of the ﬁt parameters from the data

estimate(data) –> estbeta

implicit : boolean

If TRUE, speciﬁes that the model is implicit; i.e fcn(beta, x) ~= 0 and there is no y

data to ﬁt against

meta : dict, optional

freeform dictionary of metadata for the model

Notes

Note that the fcn,fjacb, and fjacd operate on NumPy arrays and return a NumPy array. The estimate object takes

an instance of the Data class.

Here are the rules for the shapes of the argument and return arrays of the callback functions:

xif the input data is single-dimensional, then xis rank-1 array; i.e. x = array([1, 2, 3,

...]); x.shape = (n,) If the input data is multi-dimensional, then xis a rank-2 ar-

ray; i.e., x = array([[1, 2, ...], [2, 4, ...]]); x.shape = (m, n). In

all cases, it has the same shape as the input data array passed to odr.mis the dimensionality of

the input data, nis the number of observations.

yif the response variable is single-dimensional, then yis a rank-1 array, i.e., y = array([2,

4, ...]); y.shape = (n,). If the response variable is multi-dimensional, then yis a

5.19. Orthogonal distance regression (scipy.odr) 523

SciPy Reference Guide, Release 0.13.0

rank-2 array, i.e., y = array([[2, 4, ...], [3, 6, ...]]); y.shape = (q,

n) where qis the dimensionality of the response variable.

beta rank-1 array of length pwhere pis the number of parameters; i.e. beta = array([B_1,

B_2, ..., B_p])

fjacb if the response variable is multi-dimensional, then the return array’s shape is (q, p, n) such that

fjacb(x,beta)[l,k,i] = d f_l(X,B)/d B_k evaluated at the i’th data point. If q

== 1, then the return array is only rank-2 and with shape (p, n).

fjacd as with fjacb, only the return array’s shape is (q, m, n) such that fjacd(x,beta)[l,j,i]

= d f_l(X,B)/d X_j at the i’th data point. If q == 1, then the return array’s shape is (m,

n). If m == 1, the shape is (q, n). If m==q==1, the shape is (n,).

Methods

set_meta(**kwds) Update the metadata dictionary with the keywords and data provided here.

Model.set_meta(**kwds)

Update the metadata dictionary with the keywords and data provided here.

Examples

set_meta(name=”Exponential”, equation=”y = a exp(b x) + c”)

class scipy.odr.ODR(data,model,beta0=None,delta0=None,iﬁxb=None,iﬁxx=None,job=None,

iprint=None,errﬁle=None,rptﬁle=None,ndigit=None,taufac=None,sstol=None,

partol=None,maxit=None,stpb=None,stpd=None,sclb=None,scld=None,

work=None,iwork=None)

The ODR class gathers all information and coordinates the running of the main ﬁtting routine.

Members of instances of the ODR class have the same names as the arguments to the initialization routine.

Parameters data : Data class instance

instance of the Data class

model : Model class instance

instance of the Model class

Other Parameters

beta0 : array_like of rank-1

a rank-1 sequence of initial parameter values. Optional if model provides an “esti-

mate” function to estimate these values.

delta0 : array_like of ﬂoats of rank-1, optional

a (double-precision) ﬂoat array to hold the initial values of the errors in the input

variables. Must be same shape as data.x

iﬁxb : array_like of ints of rank-1, optional

sequence of integers with the same length as beta0 that determines which parameters

are held ﬁxed. A value of 0 ﬁxes the parameter, a value > 0 makes the parameter free.

iﬁxx : array_like of ints with same shape as data.x, optional

an array of integers with the same shape as data.x that determines which input obser-

vations are treated as ﬁxed. One can use a sequence of length m (the dimensionality

of the input observations) to ﬁx some dimensions for all observations. A value of 0

ﬁxes the observation, a value > 0 makes it free.

job : int, optional

an integer telling ODRPACK what tasks to perform. See p. 31 of the ODRPACK

User’s Guide if you absolutely must set the value here. Use the method set_job post-

initialization for a more readable interface.

iprint : int, optional

524 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

an integer telling ODRPACK what to print. See pp. 33-34 of the ODRPACK User’s

Guide if you absolutely must set the value here. Use the method set_iprint post-

initialization for a more readable interface.

errﬁle : str, optional

string with the ﬁlename to print ODRPACK errors to. Do Not Open This File Yourself!

rptﬁle : str, optional

string with the ﬁlename to print ODRPACK summaries to. Do Not Open This File

Yourself!

ndigit : int, optional

integer specifying the number of reliable digits in the computation of the function.

taufac : ﬂoat, optional

ﬂoat specifying the initial trust region. The default value is 1. The initial trust region

is equal to taufac times the length of the ﬁrst computed Gauss-Newton step. taufac

must be less than 1.

sstol : ﬂoat, optional

ﬂoat specifying the tolerance for convergence based on the relative change in the sum-

of-squares. The default value is eps**(1/2) where eps is the smallest value such that 1

+ eps > 1 for double precision computation on the machine. sstol must be less than 1.

partol : ﬂoat, optional

ﬂoat specifying the tolerance for convergence based on the relative change in the

estimated parameters. The default value is eps**(2/3) for explicit models and

eps**(1/3) for implicit models. partol must be less than 1.

maxit : int, optional

integer specifying the maximum number of iterations to perform. For ﬁrst runs, maxit

is the total number of iterations performed and defaults to 50. For restarts, maxit is

the number of additional iterations to perform and defaults to 10.

stpb : array_like, optional

sequence (len(stpb) == len(beta0)) of relative step sizes to compute ﬁnite

difference derivatives wrt the parameters.

stpd : optional

array (stpd.shape == data.x.shape or stpd.shape == (m,)) of rela-

tive step sizes to compute ﬁnite difference derivatives wrt the input variable errors. If

stpd is a rank-1 array with length m (the dimensionality of the input variable), then

the values are broadcast to all observations.

sclb : array_like, optional

sequence (len(stpb) == len(beta0)) of scaling factors for the parameters.

The purpose of these scaling factors are to scale all of the parameters to around unity.

Normally appropriate scaling factors are computed if this argument is not speciﬁed.

Specify them yourself if the automatic procedure goes awry.

scld : array_like, optional

array (scld.shape == data.x.shape or scld.shape == (m,)) of scaling factors for the

errors in the input variables. Again, these factors are automatically computed if you

do not provide them. If scld.shape == (m,), then the scaling factors are broadcast to

all observations.

work : ndarray, optional

array to hold the double-valued working data for ODRPACK. When restarting, takes

the value of self.output.work.

iwork : ndarray, optional

array to hold the integer-valued working data for ODRPACK. When restarting, takes

the value of self.output.iwork.

5.19. Orthogonal distance regression (scipy.odr) 525

SciPy Reference Guide, Release 0.13.0

Attributes

data (Data) The data for this ﬁt

model (Model) The model used in ﬁt

out-

put

(Output) An instance if the Output class containing all of the returned data from an invocation of

ODR.run() or ODR.restart()

Methods

restart([iter]) Restarts the run with iter more iterations.

run() Run the ﬁtting routine with all of the information given.

set_iprint([init, so_init, iter, so_iter, ...]) Set the iprint parameter for the printing of computation reports.

set_job([ﬁt_type, deriv, var_calc, ...]) Sets the “job” parameter is a hopefully comprehensible way.

ODR.restart(iter=None)

Restarts the run with iter more iterations.

Parameters iter : int, optional

ODRPACK’s default for the number of new iterations is 10.

Returns output : Output instance

This object is also assigned to the attribute .output .

ODR.run()

Run the ﬁtting routine with all of the information given.

Returns output : Output instance

This object is also assigned to the attribute .output .

ODR.set_iprint(init=None,so_init=None,iter=None,so_iter=None,iter_step=None,ﬁnal=None,

so_ﬁnal=None)

Set the iprint parameter for the printing of computation reports.

If any of the arguments are speciﬁed here, then they are set in the iprint member. If iprint is not set

manually or with this method, then ODRPACK defaults to no printing. If no ﬁlename is speciﬁed with the

member rptﬁle, then ODRPACK prints to stdout. One can tell ODRPACK to print to stdout in addition

to the speciﬁed ﬁlename by setting the so_* arguments to this function, but one cannot specify to print to

stdout but not a ﬁle since one can do that by not specifying a rptﬁle ﬁlename.

There are three reports: initialization, iteration, and ﬁnal reports. They are represented by the arguments

init, iter, and ﬁnal respectively. The permissible values are 0, 1, and 2 representing “no report”, “short

report”, and “long report” respectively.

The argument iter_step (0 <= iter_step <= 9) speciﬁes how often to make the iteration report; the report

will be made for every iter_step’th iteration starting with iteration one. If iter_step == 0, then no iteration

report is made, regardless of the other arguments.

If the rptﬁle is None, then any so_* arguments supplied will raise an exception.

ODR.set_job(ﬁt_type=None,deriv=None,var_calc=None,del_init=None,restart=None)

Sets the “job” parameter is a hopefully comprehensible way.

If an argument is not speciﬁed, then the value is left as is. The default value from class initialization is for

all of these options set to 0.

Parameters ﬁt_type : {0, 1, 2} int

0 -> explicit ODR

1 -> implicit ODR

2 -> ordinary least-squares

deriv : {0, 1, 2, 3} int

526 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

0 -> forward ﬁnite differences

1 -> central ﬁnite differences

2 -> user-supplied derivatives (Jacobians) with results

checked by ODRPACK

3 -> user-supplied derivatives, no checking

var_calc : {0, 1, 2} int

0 -> calculate asymptotic covariance matrix and ﬁt

parameter uncertainties (V_B, s_B) using derivatives recomputed

at the ﬁnal solution

1 -> calculate V_B and s_B using derivatives from last iteration

2 -> do not calculate V_B and s_B

del_init : {0, 1} int

0 -> initial input variable offsets set to 0

1 -> initial offsets provided by user in variable “work”

restart : {0, 1} int

0 -> ﬁt is not a restart

1 -> ﬁt is a restart

Notes

The permissible values are different from those given on pg. 31 of the ODRPACK User’s Guide only in

that one cannot specify numbers greater than the last value for each variable.

If one does not supply functions to compute the Jacobians, the ﬁtting procedure will change deriv to 0,

ﬁnite differences, as a default. To initialize the input variable offsets by yourself, set del_init to 1 and put

the offsets into the “work” variable correctly.

class scipy.odr.Output(output)

The Output class stores the output of an ODR run.

Notes

Takes one argument for initialization, the return value from the function odr. The attributes listed as “optional”

above are only present if odr was run with full_output=1.

Attributes

beta (ndarray) Estimated parameter values, of shape (q,).

sd_beta (ndarray) Standard errors of the estimated parameters, of shape (p,).

cov_beta (ndarray) Covariance matrix of the estimated parameters, of shape (p,p).

delta (ndarray, optional) Array of estimated errors in input variables, of same shape as x.

eps (ndarray, optional) Array of estimated errors in response variables, of same shape as y.

xplus (ndarray, optional) Array of x + delta.

y (ndarray, optional) Array y = fcn(x + delta).

res_var (ﬂoat, optional) Residual variance.

sum_sqare (ﬂoat, optional) Sum of squares error.

sum_square_delta (ﬂoat, optional) Sum of squares of delta error.

sum_square_eps (ﬂoat, optional) Sum of squares of eps error.

inv_condnum (ﬂoat, optional) Inverse condition number (cf. ODRPACK UG p. 77).

rel_error (ﬂoat, optional) Relative error in function values computed within fcn.

work (ndarray, optional) Final work array.

work_ind (dict, optional) Indices into work for drawing out values (cf. ODRPACK UG p. 83).

info (int, optional) Reason for returning, as output by ODRPACK (cf. ODRPACK UG p. 38).

stopreason (list of str, optional) info interpreted into English.

5.19. Orthogonal distance regression (scipy.odr) 527

SciPy Reference Guide, Release 0.13.0

Methods

528 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

pprint() Pretty-print important results.

Output.pprint()

Pretty-print important results.

scipy.odr.odr(fcn,beta0,y,x,we=None,wd=None,fjacb=None,fjacd=None,extra_args=None,

iﬁxx=None,iﬁxb=None,job=0,iprint=0,errﬁle=None,rptﬁle=None,ndigit=0,tau-

fac=0.0,sstol=-1.0,partol=-1.0,maxit=-1,stpb=None,stpd=None,sclb=None,

scld=None,work=None,iwork=None,full_output=0)

Low-level function for ODR.

See Also

ODR,Model,Data,RealData

Notes

This is a function performing the same operation as the ODR,Model and Data classes together. The parameters

of this function are explained in the class documentation.

exception scipy.odr.odr_error

Exception indicating an error in ﬁtting.

This is raised by scipy.odr if an error occurs during ﬁtting.

exception scipy.odr.odr_stop

Exception stopping ﬁtting.

You can raise this exception in your objective function to tell scipy.odr to stop ﬁtting.

5.19.2 Usage information

Introduction

Why Orthogonal Distance Regression (ODR)? Sometimes one has measurement errors in the explanatory (a.k.a.,

“independent”) variable(s), not just the response (a.k.a., “dependent”) variable(s). Ordinary Least Squares (OLS)

ﬁtting procedures treat the data for explanatory variables as ﬁxed, i.e., not subject to error of any kind. Furthermore,

OLS procedures require that the response variables be an explicit function of the explanatory variables; sometimes

making the equation explicit is impractical and/or introduces errors. ODR can handle both of these cases with ease,

and can even reduce to the OLS case if that is sufﬁcient for the problem.

ODRPACK is a FORTRAN-77 library for performing ODR with possibly non-linear ﬁtting functions. It uses a mod-

iﬁed trust-region Levenberg-Marquardt-type algorithm [R318] to estimate the function parameters. The ﬁtting func-

tions are provided by Python functions operating on NumPy arrays. The required derivatives may be provided by

Python functions as well, or may be estimated numerically. ODRPACK can do explicit or implicit ODR ﬁts, or it

can do OLS. Input and output variables may be multi-dimensional. Weights can be provided to account for different

variances of the observations, and even covariances between dimensions of the variables.

The scipy.odr package offers an object-oriented interface to ODRPACK, in addition to the low-level odr function.

Additional background information about ODRPACK can be found in the ODRPACK User’s Guide, reading which is

recommended.

Basic usage

1. Deﬁne the function you want to ﬁt against.:

5.19. Orthogonal distance regression (scipy.odr) 529

SciPy Reference Guide, Release 0.13.0

def f(B, x):

’’’Linear function y = m*x + b’’’

# B is a vector of the parameters.

# x is an array of the current x values.

# x is in the same format as the x passed to Data or RealData.

# Return an array in the same format as y passed to Data or RealData.

return B[0]*x+B[1]

2. Create a Model.:

linear =Model(f)

3. Create a Data or RealData instance.:

mydata =Data(x, y, wd=1./power(sx,2), we=1./power(sy,2))

or, when the actual covariances are known:

mydata =RealData(x, y, sx=sx, sy=sy)

4. Instantiate ODR with your data, model and initial parameter estimate.:

myodr =ODR(mydata, linear, beta0=[1.,2.])

5. Run the ﬁt.:

myoutput =myodr.run()

6. Examine output.:

myoutput.pprint()

References

5.20 Optimization and root ﬁnding (scipy.optimize)

5.20.1 Optimization

General-purpose

minimize(fun, x0[, args, method, jac, hess, ...]) Minimization of scalar function of one or more variables.

fmin(func, x0[, args, xtol, ftol, maxiter, ...]) Minimize a function using the downhill simplex algorithm.

fmin_powell(func, x0[, args, xtol, ftol, ...]) Minimize a function using modiﬁed Powell’s method. This method

fmin_cg(f, x0[, fprime, args, gtol, norm, ...]) Minimize a function using a nonlinear conjugate gradient algorithm.

fmin_bfgs(f, x0[, fprime, args, gtol, norm, ...]) Minimize a function using the BFGS algorithm.

fmin_ncg(f, x0, fprime[, fhess_p, fhess, ...]) Unconstrained minimization of a function using the Newton-CG method.

leastsq(func, x0[, args, Dfun, full_output, ...]) Minimize the sum of squares of a set of equations.

scipy.optimize.minimize(fun,x0,args=(),method=’BFGS’,jac=None,hess=None,hessp=None,

bounds=None,constraints=(),tol=None,callback=None,options=None)

Minimization of scalar function of one or more variables. New in version 0.11.0.

Parameters fun : callable

Objective function.

530 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

x0 : ndarray

Initial guess.

args : tuple, optional

Extra arguments passed to the objective function and its derivatives (Jacobian, Hes-

sian).

method : str, optional

Type of solver. Should be one of

•‘Nelder-Mead’

•‘Powell’

•‘CG’

•‘BFGS’

•‘Newton-CG’

•‘Anneal’

•‘L-BFGS-B’

•‘TNC’

•‘COBYLA’

•‘SLSQP’

•‘dogleg’

•‘trust-ncg’

jac : bool or callable, optional

Jacobian of objective function. Only for CG, BFGS, Newton-CG, dogleg, trust-ncg.

If jac is a Boolean and is True, fun is assumed to return the value of Jacobian along

with the objective function. If False, the Jacobian will be estimated numerically. jac

can also be a callable returning the Jacobian of the objective. In this case, it must

accept the same arguments as fun.

hess, hessp : callable, optional

Hessian of objective function or Hessian of objective function times an arbitrary vector

p. Only for Newton-CG, dogleg, trust-ncg. Only one of hessp or hess needs to be

given. If hess is provided, then hessp will be ignored. If neither hess nor hessp is

provided, then the hessian product will be approximated using ﬁnite differences on

jac.hessp must compute the Hessian times an arbitrary vector.

bounds : sequence, optional

Bounds for variables (only for L-BFGS-B, TNC and SLSQP). (min, max) pairs

for each element in x, deﬁning the bounds on that parameter. Use None for one of

min or max when there is no bound in that direction.

constraints : dict or sequence of dict, optional

Constraints deﬁnition (only for COBYLA and SLSQP). Each constraint is deﬁned in

a dictionary with ﬁelds:

type [str] Constraint type: ‘eq’ for equality, ‘ineq’ for inequality.

fun [callable] The function deﬁning the constraint.

jac [callable, optional] The Jacobian of fun (only for SLSQP).

args [sequence, optional] Extra arguments to be passed to the function

and Jacobian.

Equality constraint means that the constraint function result is to be zero whereas

inequality means that it is to be non-negative. Note that COBYLA only supports

inequality constraints.

tol : ﬂoat, optional

Tolerance for termination. For detailed control, use solver-speciﬁc options.

options : dict, optional

A dictionary of solver options. All methods accept the following generic options:

maxiter [int] Maximum number of iterations to perform.

disp [bool] Set to True to print convergence messages.

For method-speciﬁc options, see show_options(‘minimize’, method).

callback : callable, optional

5.20. Optimization and root ﬁnding (scipy.optimize) 531

SciPy Reference Guide, Release 0.13.0

Called after each iteration, as callback(xk), where xk is the current parameter

vector.

Returns res : Result

The optimization result represented as a Result object. Important attributes are: x

the solution array, success a Boolean ﬂag indicating if the optimizer exited suc-

cessfully and message which describes the cause of the termination. See Result

for a description of other attributes.

See Also

minimize_scalar

Interface to minimization algorithms for scalar univariate functions.

Notes

This section describes the available solvers that can be selected by the ‘method’ parameter. The default method

is BFGS.

Unconstrained minimization

Method Nelder-Mead uses the Simplex algorithm [R93],[R94]. This algorithm has been successful in many

applications but other algorithms using the ﬁrst and/or second derivatives information might be preferred for

their better performances and robustness in general.

Method Powell is a modiﬁcation of Powell’s method [R95],[R96] which is a conjugate direction method. It

performs sequential one-dimensional minimizations along each vector of the directions set (direc ﬁeld in op-

tions and info), which is updated at each iteration of the main minimization loop. The function need not be

differentiable, and no derivatives are taken.

Method CG uses a nonlinear conjugate gradient algorithm by Polak and Ribiere, a variant of the Fletcher-Reeves

method described in [R97] pp. 120-122. Only the ﬁrst derivatives are used.

Method BFGS uses the quasi-Newton method of Broyden, Fletcher, Goldfarb, and Shanno (BFGS) [R97] pp.

136. It uses the ﬁrst derivatives only. BFGS has proven good performance even for non-smooth optimizations.

This method also returns an approximation of the Hessian inverse, stored as hess_inv in the Result object.

Method Newton-CG uses a Newton-CG algorithm [R97] pp. 168 (also known as the truncated Newton method).

It uses a CG method to the compute the search direction. See also TNC method for a box-constrained minimiza-

tion with a similar algorithm.

Method Anneal uses simulated annealing, which is a probabilistic metaheuristic algorithm for global optimiza-

tion. It uses no derivative information from the function being optimized.

Method dogleg uses the dog-leg trust-region algorithm [R97] for unconstrained minimization. This algorithm

requires the gradient and Hessian; furthermore the Hessian is required to be positive deﬁnite.

Method trust-ncg uses the Newton conjugate gradient trust-region algorithm [R97] for unconstrained minimiza-

tion. This algorithm requires the gradient and either the Hessian or a function that computes the product of the

Hessian with a given vector.

Constrained minimization

Method L-BFGS-B uses the L-BFGS-B algorithm [R98],[R99] for bound constrained minimization.

Method TNC uses a truncated Newton algorithm [R97],[R100] to minimize a function with variables subject to

bounds. This algorithm uses gradient information; it is also called Newton Conjugate-Gradient. It differs from

the Newton-CG method described above as it wraps a C implementation and allows each variable to be given

upper and lower bounds.

Method COBYLA uses the Constrained Optimization BY Linear Approximation (COBYLA) method [R101],1,

1Powell M J D. Direct search algorithms for optimization calculations. 1998. Acta Numerica 7: 287-336.

532 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

2. The algorithm is based on linear approximations to the objective function and each constraint. The method

wraps a FORTRAN implementation of the algorithm.

Method SLSQP uses Sequential Least SQuares Programming to minimize a function of several variables with

any combination of bounds, equality and inequality constraints. The method wraps the SLSQP Optimization

subroutine originally implemented by Dieter Kraft 3.

References

[R93],[R94],[R95],[R96],[R97],[R98],[R99],[R100],[R101],10,11,12

Examples

Let us consider the problem of minimizing the Rosenbrock function. This function (and its respective deriva-

tives) is implemented in rosen (resp. rosen_der,rosen_hess) in the scipy.optimize.

>>> from scipy.optimize import minimize, rosen, rosen_der

A simple application of the Nelder-Mead method is:

>>> x0 =[1.3,0.7,0.8,1.9,1.2]

>>> res =minimize(rosen, x0, method=’Nelder-Mead’)

>>> res.x

[ 1. 1. 1. 1. 1.]

Now using the BFGS algorithm, using the ﬁrst derivative and a few options:

>>> res =minimize(rosen, x0, method=’BFGS’, jac=rosen_der,

... options={’gtol’:1e-6,’disp’:True})

Optimization terminated successfully.

Current function value: 0.000000

Iterations: 52

Function evaluations: 64

Gradient evaluations: 64

>>> res.x

[ 1. 1. 1. 1. 1.]

>>> print res.message

Optimization terminated successfully.

>>> res.hess

[[ 0.00749589 0.01255155 0.02396251 0.04750988 0.09495377]

[ 0.01255155 0.02510441 0.04794055 0.09502834 0.18996269]

[ 0.02396251 0.04794055 0.09631614 0.19092151 0.38165151]

[ 0.04750988 0.09502834 0.19092151 0.38341252 0.7664427 ]

[ 0.09495377 0.18996269 0.38165151 0.7664427 1.53713523]]

Next, consider a minimization problem with several constraints (namely Example 16.4 from [R97]). The objec-

tive function is:

>>> fun =lambda x: (x[0]-1)**2+(x[1]-2.5)**2

There are three constraints deﬁned as:

>>> cons =({’type’:’ineq’,’fun’:lambda x: x[0]-2*x[1]+2},

... {’type’:’ineq’,’fun’:lambda x: -x[0]-2*x[1]+6},

... {’type’:’ineq’,’fun’:lambda x: -x[0]+2*x[1]+2})

And variables must be positive, hence the following bounds:

2Powell M J D. A view of algorithms for optimization without derivatives. 2007.Cambridge University Technical Report DAMTP 2007/NA03

3Kraft, D. A software package for sequential quadratic programming. 1988. Tech. Rep. DFVLR-FB 88-28, DLR German Aerospace Center –

Institute for Flight Mechanics, Koln, Germany.

5.20. Optimization and root ﬁnding (scipy.optimize) 533

SciPy Reference Guide, Release 0.13.0

>>> bnds =((0,None), (0,None))

The optimization problem is solved using the SLSQP method as:

>>> res =minimize(fun, (2,0), method=’SLSQP’, bounds=bnds,

... constraints=cons)

It should converge to the theoretical solution (1.4 ,1.7).

scipy.optimize.fmin(func,x0,args=(),xtol=0.0001,ftol=0.0001,maxiter=None,maxfun=None,

full_output=0,disp=1,retall=0,callback=None)

Minimize a function using the downhill simplex algorithm.

This algorithm only uses function values, not derivatives or second derivatives.

Parameters func : callable func(x,*args)

The objective function to be minimized.

x0 : ndarray

Initial guess.

args : tuple, optional

Extra arguments passed to func, i.e. f(x,*args).

callback : callable, optional

Called after each iteration, as callback(xk), where xk is the current parameter vector.

xtol : ﬂoat, optional

Relative error in xopt acceptable for convergence.

ftol : number, optional

Relative error in func(xopt) acceptable for convergence.

maxiter : int, optional

Maximum number of iterations to perform.

maxfun : number, optional

Maximum number of function evaluations to make.

full_output : bool, optional

Set to True if fopt and warnﬂag outputs are desired.

disp : bool, optional

Set to True to print convergence messages.

retall : bool, optional

Set to True to return list of solutions at each iteration.

Returns xopt : ndarray

Parameter that minimizes function.

fopt : ﬂoat

Value of function at minimum: fopt = func(xopt).

iter : int

Number of iterations performed.

funcalls : int

Number of function calls made.

warnﬂag : int

1 : Maximum number of function evaluations made. 2 : Maximum number of itera-

tions reached.

allvecs : list

Solution at each iteration.

See Also

minimize Interface to minimization algorithms for multivariate functions. See the ‘Nelder-Mead’ method

in particular.

534 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Notes

Uses a Nelder-Mead simplex algorithm to ﬁnd the minimum of function of one or more variables.

This algorithm has a long history of successful use in applications. But it will usually be slower than an algorithm

that uses ﬁrst or second derivative information. In practice it can have poor performance in high-dimensional

problems and is not robust to minimizing complicated functions. Additionally, there currently is no complete

theory describing when the algorithm will successfully converge to the minimum, or how fast it will if it does.

References

[R90],[R91]

scipy.optimize.fmin_powell(func,x0,args=(),xtol=0.0001,ftol=0.0001,maxiter=None,max-

fun=None,full_output=0,disp=1,retall=0,callback=None,di-

rec=None)

Minimize a function using modiﬁed Powell’s method. This method only uses function values, not derivatives.

Parameters func : callable f(x,*args)

Objective function to be minimized.

x0 : ndarray

Initial guess.

args : tuple, optional

Extra arguments passed to func.

callback : callable, optional

An optional user-supplied function, called after each iteration. Called as

callback(xk), where xk is the current parameter vector.

direc : ndarray, optional

Initial direction set.

xtol : ﬂoat, optional

Line-search error tolerance.

ftol : ﬂoat, optional

Relative error in func(xopt) acceptable for convergence.

maxiter : int, optional

Maximum number of iterations to perform.

maxfun : int, optional

Maximum number of function evaluations to make.

full_output : bool, optional

If True, fopt, xi, direc, iter, funcalls, and warnﬂag are returned.

disp : bool, optional

If True, print convergence messages.

retall : bool, optional

If True, return a list of the solution at each iteration.

Returns xopt : ndarray

Parameter which minimizes func.

fopt : number

Value of function at minimum: fopt = func(xopt).

direc : ndarray

Current direction set.

iter : int

Number of iterations.

funcalls : int

Number of function calls made.

warnﬂag : int

Integer warning ﬂag:

1 : Maximum number of function evaluations. 2 : Maximum number

of iterations.

5.20. Optimization and root ﬁnding (scipy.optimize) 535

SciPy Reference Guide, Release 0.13.0

allvecs : list

List of solutions at each iteration.

See Also

minimize Interface to unconstrained minimization algorithms for multivariate functions. See the ‘Powell’

method in particular.

Notes

Uses a modiﬁcation of Powell’s method to ﬁnd the minimum of a function of N variables. Powell’s method is a

conjugate direction method.

The algorithm has two loops. The outer loop merely iterates over the inner loop. The inner loop minimizes over

each current direction in the direction set. At the end of the inner loop, if certain conditions are met, the direction

that gave the largest decrease is dropped and replaced with the difference between the current estiamted x and

the estimated x from the beginning of the inner-loop.

The technical conditions for replacing the direction of greatest increase amount to checking that

1.No further gain can be made along the direction of greatest increase from that iteration.

2.The direction of greatest increase accounted for a large sufﬁcient fraction of the decrease in the function

value from that iteration of the inner loop.

References

Powell M.J.D. (1964) An efﬁcient method for ﬁnding the minimum of a function of several variables without

calculating derivatives, Computer Journal, 7 (2):155-162.

Press W., Teukolsky S.A., Vetterling W.T., and Flannery B.P.: Numerical Recipes (any edition), Cambridge

University Press

scipy.optimize.fmin_cg(f,x0,fprime=None,args=(),gtol=1e-05,norm=inf,

epsilon=1.4901161193847656e-08,maxiter=None,full_output=0,disp=1,

retall=0,callback=None)

Minimize a function using a nonlinear conjugate gradient algorithm.

Parameters f: callable, f(x, *args)

Objective function to be minimized. Here xmust be a 1-D array of the variables

that are to be changed in the search for a minimum, and args are the other (ﬁxed)

parameters of f.

x0 : ndarray

A user-supplied initial estimate of xopt, the optimal value of x. It must be a 1-D array

of values.

fprime : callable, fprime(x, *args), optional

A function that returns the gradient of fat x. Here xand args are as described above

for f. The returned value must be a 1-D array. Defaults to None, in which case the

gradient is approximated numerically (see epsilon, below).

args : tuple, optional

Parameter values passed to fand fprime. Must be supplied whenever additional ﬁxed

parameters are needed to completely specify the functions fand fprime.

gtol : ﬂoat, optional

Stop when the norm of the gradient is less than gtol.

norm : ﬂoat, optional

Order to use for the norm of the gradient (-np.Inf is min, np.Inf is max).

epsilon : ﬂoat or ndarray, optional

536 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Step size(s) to use when fprime is approximated numerically. Can be a scalar or a

1-D array. Defaults to sqrt(eps), with eps the ﬂoating point machine precision.

Usually sqrt(eps) is about 1.5e-8.

maxiter : int, optional

Maximum number of iterations to perform. Default is 200 *len(x0).

full_output : bool, optional

If True, return fopt,func_calls,grad_calls, and warnﬂag in addition to xopt. See the

Returns section below for additional information on optional return values.

disp : bool, optional

If True, return a convergence message, followed by xopt.

retall : bool, optional

If True, add to the returned values the results of each iteration.

callback : callable, optional

An optional user-supplied function, called after each iteration. Called as

callback(xk), where xk is the current value of x0.

Returns xopt : ndarray

Parameters which minimize f, i.e. f(xopt) == fopt.

fopt : ﬂoat, optional

Minimum value found, f(xopt). Only returned if full_output is True.

func_calls : int, optional

The number of function_calls made. Only returned if full_output is True.

grad_calls : int, optional

The number of gradient calls made. Only returned if full_output is True.

warnﬂag : int, optional

Integer value with warning status, only returned if full_output is True.

0 : Success.

1 : The maximum number of iterations was exceeded.

2[Gradient and/or function calls were not changing. May indicate] that

precision was lost, i.e., the routine did not converge.

allvecs : list of ndarray, optional

List of arrays, containing the results at each iteration. Only returned if retall is True.

See Also

minimize common interface to all scipy.optimize algorithms for unconstrained and constrained mini-

mization of multivariate functions. It provides an alternative way to call fmin_cg, by specifying

method=’CG’.

Notes

This conjugate gradient algorithm is based on that of Polak and Ribiere [R92].

Conjugate gradient methods tend to work better when:

1.fhas a unique global minimizing point, and no local minima or other stationary points,

2.fis, at least locally, reasonably well approximated by a quadratic function of the variables,

3.fis continuous and has a continuous gradient,

4.fprime is not too large, e.g., has a norm less than 1000,

5.The initial guess, x0, is reasonably close to f‘s global minimizing point, xopt.

References

[R92]

5.20. Optimization and root ﬁnding (scipy.optimize) 537

SciPy Reference Guide, Release 0.13.0

Examples

Example 1: seek the minimum value of the expression a*u**2+b*u*v+c*v**2+d*u+e*v +

ffor given values of the parameters and an initial guess (u, v) = (0, 0).

>>> args =(2,3,7,8,9,10)# parameter values

>>> def f(x, *args):

... u,v=x

... a,b,c,d,e,f=args

... return a*u**2+b*u*v+c*v**2+d*u+e*v+f

>>> def gradf(x, *args):

... u,v=x

... a,b,c,d,e,f=args

... gu =2*a*u+b*v+d# u-component of the gradient

... gv =b*u+2*c*v+e# v-component of the gradient

... return np.asarray((gu, gv))

>>> x0 =np.asarray((0,0)) # Initial guess.

>>> from scipy import optimize

>>> res1 =optimize.fmin_cg(f, x0, fprime=gradf, args=args)

>>> print ’res1 = ’, res1

Optimization terminated successfully.

Current function value: 1.617021

Iterations: 2

Function evaluations: 5

Gradient evaluations: 5

res1 = [-1.80851064 -0.25531915]

Example 2: solve the same problem using the minimize function. (This myopts dictionary shows all of the

available options, although in practice only non-default values would be needed. The returned value will be a

dictionary.)

>>> opts ={’maxiter’ :None,# default value.

... ’disp’ :True,# non-default value.

... ’gtol’ :1e-5,# default value.

... ’norm’ : np.inf, # default value.

... ’eps’ :1.4901161193847656e-08}# default value.

>>> res2 =optimize.minimize(f, x0, jac=gradf, args=args,

... method=’CG’, options=opts)

Optimization terminated successfully.

Current function value: 1.617021

Iterations: 2

Function evaluations: 5

Gradient evaluations: 5

>>> res2.x# minimum found

array([-1.80851064 -0.25531915])

scipy.optimize.fmin_bfgs(f,x0,fprime=None,args=(),gtol=1e-05,norm=inf,

epsilon=1.4901161193847656e-08,maxiter=None,full_output=0,

disp=1,retall=0,callback=None)

Minimize a function using the BFGS algorithm.

Parameters f: callable f(x,*args)

Objective function to be minimized.

x0 : ndarray

Initial guess.

fprime : callable f’(x,*args), optional

Gradient of f.

args : tuple, optional

Extra arguments passed to f and fprime.

538 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

gtol : ﬂoat, optional

Gradient norm must be less than gtol before succesful termination.

norm : ﬂoat, optional

Order of norm (Inf is max, -Inf is min)

epsilon : int or ndarray, optional

If fprime is approximated, use this value for the step size.

callback : callable, optional

An optional user-supplied function to call after each iteration. Called as callback(xk),

where xk is the current parameter vector.

maxiter : int, optional

Maximum number of iterations to perform.

full_output : bool, optional

If True,return fopt, func_calls, grad_calls, and warnﬂag in addition to xopt.

disp : bool, optional

Print convergence message if True.

retall : bool, optional

Return a list of results at each iteration if True.

Returns xopt : ndarray

Parameters which minimize f, i.e. f(xopt) == fopt.

fopt : ﬂoat

Minimum value.

gopt : ndarray

Value of gradient at minimum, f’(xopt), which should be near 0.

Bopt : ndarray

Value of 1/f’‘(xopt), i.e. the inverse hessian matrix.

func_calls : int

Number of function_calls made.

grad_calls : int

Number of gradient calls made.

warnﬂag : integer

1 : Maximum number of iterations exceeded. 2 : Gradient and/or function calls not

changing.

allvecs : list

Results at each iteration. Only returned if retall is True.

See Also

minimize Interface to minimization algorithms for multivariate functions. See the ‘BFGS’ method in par-

ticular.

Notes

Optimize the function, f, whose gradient is given by fprime using the quasi-Newton method of Broyden,

Fletcher, Goldfarb, and Shanno (BFGS)

References

Wright, and Nocedal ‘Numerical Optimization’, 1999, pg. 198.

scipy.optimize.fmin_ncg(f,x0,fprime,fhess_p=None,fhess=None,args=(),avextol=1e-05,

epsilon=1.4901161193847656e-08,maxiter=None,full_output=0,disp=1,

retall=0,callback=None)

Unconstrained minimization of a function using the Newton-CG method.

Parameters f: callable f(x, *args)

Objective function to be minimized.

x0 : ndarray

5.20. Optimization and root ﬁnding (scipy.optimize) 539

SciPy Reference Guide, Release 0.13.0

Initial guess.

fprime : callable f’(x, *args)

Gradient of f.

fhess_p : callable fhess_p(x, p, *args), optional

Function which computes the Hessian of f times an arbitrary vector, p.

fhess : callable fhess(x, *args), optional

Function to compute the Hessian matrix of f.

args : tuple, optional

Extra arguments passed to f, fprime, fhess_p, and fhess (the same set of extra argu-

ments is supplied to all of these functions).

epsilon : ﬂoat or ndarray, optional

If fhess is approximated, use this value for the step size.

callback : callable, optional

An optional user-supplied function which is called after each iteration. Called as

callback(xk), where xk is the current parameter vector.

avextol : ﬂoat, optional

Convergence is assumed when the average relative error in the minimizer falls below

this amount.

maxiter : int, optional

Maximum number of iterations to perform.

full_output : bool, optional

If True, return the optional outputs.

disp : bool, optional

If True, print convergence message.

retall : bool, optional

If True, return a list of results at each iteration.

Returns xopt : ndarray

Parameters which minimize f, i.e. f(xopt) == fopt.

fopt : ﬂoat

Value of the function at xopt, i.e. fopt = f(xopt).

fcalls : int

Number of function calls made.

gcalls : int

Number of gradient calls made.

hcalls : int

Number of hessian calls made.

warnﬂag : int

Warnings generated by the algorithm. 1 : Maximum number of iterations exceeded.

allvecs : list

The result at each iteration, if retall is True (see below).

See Also

minimize Interface to minimization algorithms for multivariate functions. See the ‘Newton-CG’ method in

particular.

Notes

Only one of fhess_p or fhess need to be given. If fhess is provided, then fhess_p will be ignored. If neither

fhess nor fhess_p is provided, then the hessian product will be approximated using ﬁnite differences on fprime.

fhess_p must compute the hessian times an arbitrary vector. If it is not given, ﬁnite-differences on fprime are

used to compute it.

Newton-CG methods are also called truncated Newton methods. This function differs from

scipy.optimize.fmin_tnc because

540 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

1.scipy.optimize.fmin_ncg is written purely in python using numpy

and scipy while scipy.optimize.fmin_tnc calls a C function.

2.scipy.optimize.fmin_ncg is only for unconstrained minimization

while scipy.optimize.fmin_tnc is for unconstrained minimization or box constrained mini-

mization. (Box constraints give lower and upper bounds for each variable seperately.)

References

Wright & Nocedal, ‘Numerical Optimization’, 1999, pg. 140.

scipy.optimize.leastsq(func,x0,args=(),Dfun=None,full_output=0,col_deriv=0,ftol=1.49012e-

08,xtol=1.49012e-08,gtol=0.0,maxfev=0,epsfcn=None,factor=100,

diag=None)

Minimize the sum of squares of a set of equations.

x = arg min(sum(func(y)**2,axis=0))

Parameters func : callable

should take at least one (possibly length N vector) argument and returns M ﬂoating

point numbers.

x0 : ndarray

The starting estimate for the minimization.

args : tuple

Any extra arguments to func are placed in this tuple.

Dfun : callable

A function or method to compute the Jacobian of func with derivatives across the

rows. If this is None, the Jacobian will be estimated.

full_output : bool

non-zero to return all optional outputs.

col_deriv : bool

non-zero to specify that the Jacobian function computes derivatives down the columns

(faster, because there is no transpose operation).

ftol : ﬂoat

Relative error desired in the sum of squares.

xtol : ﬂoat

Relative error desired in the approximate solution.

gtol : ﬂoat

Orthogonality desired between the function vector and the columns of the Jacobian.

maxfev : int

The maximum number of calls to the function. If zero, then 100*(N+1) is the maxi-

mum where N is the number of elements in x0.

epsfcn : ﬂoat

A suitable step length for the forward-difference approximation of the Jacobian (for

Dfun=None). If epsfcn is less than the machine precision, it is assumed that the rela-

tive errors in the functions are of the order of the machine precision.

factor : ﬂoat

A parameter determining the initial step bound (factor *|| diag *x||).

Should be in interval (0.1, 100).

diag : sequence

N positive entries that serve as a scale factors for the variables.

Returns x: ndarray

The solution (or the result of the last iteration for an unsuccessful call).

cov_x : ndarray

5.20. Optimization and root ﬁnding (scipy.optimize) 541

SciPy Reference Guide, Release 0.13.0

Uses the fjac and ipvt optional outputs to construct an estimate of the jacobian around

the solution. None if a singular matrix encountered (indicates very ﬂat curvature in

some direction). This matrix must be multiplied by the residual variance to get the

covariance of the parameter estimates – see curve_ﬁt.

infodict : dict

a dictionary of optional outputs with the key s:

nfev The number of function calls

fvec The function evaluated at the output

fjac A permutation of the R matrix of a QR factorization of the ﬁnal ap-

proximate Jacobian matrix, stored column wise. Together with ipvt,

the covariance of the estimate can be approximated.

ipvt An integer array of length N which deﬁnes a permutation matrix, p,

such that fjac*p = q*r, where r is upper triangular with diagonal ele-

ments of nonincreasing magnitude. Column j of p is column ipvt(j) of

the identity matrix.

qtf The vector (transpose(q) * fvec).

mesg : str

A string message giving information about the cause of failure.

ier : int

An integer ﬂag. If it is equal to 1, 2, 3 or 4, the solution was found. Otherwise, the

solution was not found. In either case, the optional output variable ‘mesg’ gives more

information.

Notes

“leastsq” is a wrapper around MINPACK’s lmdif and lmder algorithms.

cov_x is a Jacobian approximation to the Hessian of the least squares objective function. This approximation

assumes that the objective function is based on the difference between some observed target data (ydata) and a

(non-linear) function of the parameters f(xdata, params)

func(params) =ydata -f(xdata, params)

so that the objective function is

min sum((ydata - f(xdata, params))**2, axis=0)

params

Constrained (multivariate)

fmin_l_bfgs_b(func, x0[, fprime, args, ...]) Minimize a function func using the L-BFGS-B algorithm.

fmin_tnc(func, x0[, fprime, args, ...]) Minimize a function with variables subject to bounds, using

fmin_cobyla(func, x0, cons[, args, ...]) Minimize a function using the Constrained Optimization BY Linear

fmin_slsqp(func, x0[, eqcons, f_eqcons, ...]) Minimize a function using Sequential Least SQuares Programming

nnls(A, b) Solve argmin_x || Ax - b ||_2 for x>=0. This is a wrapper

scipy.optimize.fmin_l_bfgs_b(func,x0,fprime=None,args=(),approx_grad=0,bounds=None,

m=10,factr=10000000.0,pgtol=1e-05,epsilon=1e-08,iprint=-1,

maxfun=15000,maxiter=15000,disp=None,callback=None)

Minimize a function func using the L-BFGS-B algorithm.

Parameters func : callable f(x,*args)

Function to minimise.

x0 : ndarray

542 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Initial guess.

fprime : callable fprime(x,*args)

The gradient of func. If None, then func returns the function value and the gradient (f,

g = func(x, *args)), unless approx_grad is True in which case func returns

only f.

args : sequence

Arguments to pass to func and fprime.

approx_grad : bool

Whether to approximate the gradient numerically (in which case func returns only the

function value).

bounds : list

(min, max) pairs for each element in x, deﬁning the bounds on that parameter.

Use None for one of min or max when there is no bound in that direction.

m: int

The maximum number of variable metric corrections used to deﬁne the limited mem-

ory matrix. (The limited memory BFGS method does not store the full hessian but

uses this many terms in an approximation to it.)

factr : ﬂoat

The iteration stops when (f^k - f^{k+1})/max{|f^k|,|f^{k+1}|,1}

<= factr *eps, where eps is the machine precision, which is automatically

generated by the code. Typical values for factr are: 1e12 for low accuracy; 1e7 for

moderate accuracy; 10.0 for extremely high accuracy.

pgtol : ﬂoat

The iteration will stop when max{|proj g_i | i = 1, ..., n} <=

pgtol where pg_i is the i-th component of the projected gradient.

epsilon : ﬂoat

Step size used when approx_grad is True, for numerically calculating the gradient

iprint : int

Controls the frequency of output. iprint < 0 means no output; iprint ==

0means write messages to stdout; iprint > 1 in addition means write logging

information to a ﬁle named iterate.dat in the current working directory.

disp : int, optional

If zero, then no output. If a positive number, then this over-rides iprint (i.e., iprint gets

the value of disp).

maxfun : int

Maximum number of function evaluations.

maxiter : int

Maximum number of iterations.

callback : callable, optional

Called after each iteration, as callback(xk), where xk is the current parameter

vector.

Returns x: array_like

Estimated position of the minimum.

f: ﬂoat

Value of func at the minimum.

d: dict

Information dictionary.

•d[’warnﬂag’] is

–0 if converged,

–1 if too many function evaluations or too many iterations,

–2 if stopped for another reason, given in d[’task’]

•d[’grad’] is the gradient at the minimum (should be 0 ish)

•d[’funcalls’] is the number of function calls made.

•d[’nit’] is the number of iterations.

5.20. Optimization and root ﬁnding (scipy.optimize) 543

SciPy Reference Guide, Release 0.13.0

See Also

minimize Interface to minimization algorithms for multivariate functions. See the ‘L-BFGS-B’ method in

particular.

Notes

License of L-BFGS-B (FORTRAN code):

The version included here (in fortran code) is 3.0 (released April 25, 2011). It was written by Ciyou Zhu,

Richard Byrd, and Jorge Nocedal <nocedal@ece.nwu.edu>. It carries the following condition for use:

This software is freely available, but we expect that all publications describing work using this software, or all

commercial products using it, quote at least one of the references given below. This software is released under

the BSD License.

References

•R. H. Byrd, P. Lu and J. Nocedal. A Limited Memory Algorithm for Bound Constrained Optimization,

(1995), SIAM Journal on Scientiﬁc and Statistical Computing, 16, 5, pp. 1190-1208.

•C. Zhu, R. H. Byrd and J. Nocedal. L-BFGS-B: Algorithm 778: L-BFGS-B, FORTRAN routines for

large scale bound constrained optimization (1997), ACM Transactions on Mathematical Software, 23, 4,

pp. 550 - 560.

•J.L. Morales and J. Nocedal. L-BFGS-B: Remark on Algorithm 778: L-BFGS-B, FORTRAN routines

for large scale bound constrained optimization (2011), ACM Transactions on Mathematical Software, 38,

scipy.optimize.fmin_tnc(func,x0,fprime=None,args=(),approx_grad=0,bounds=None,

epsilon=1e-08,scale=None,offset=None,messages=15,maxCGit=-

1,maxfun=None,eta=-1,stepmx=0,accuracy=0,fmin=0,ftol=-1,

xtol=-1,pgtol=-1,rescale=-1,disp=None,callback=None)

Minimize a function with variables subject to bounds, using gradient information in a truncated Newton algo-

rithm. This method wraps a C implementation of the algorithm.

Parameters func : callable func(x, *args)

Function to minimize. Must do one of:

1.Return f and g, where f is the value of the function and g its gradient (a list of

ﬂoats).

2.Return the function value but supply gradient function seperately as fprime.

3.Return the function value and set approx_grad=True.

If the function returns None, the minimization is aborted.

x0 : array_like

Initial estimate of minimum.

fprime : callable fprime(x, *args)

Gradient of func. If None, then either func must return the function value and the

gradient (f,g = func(x, *args)) or approx_grad must be True.

args : tuple

Arguments to pass to function.

approx_grad : bool

If true, approximate the gradient numerically.

bounds : list

(min, max) pairs for each element in x0, deﬁning the bounds on that parameter. Use

None or +/-inf for one of min or max when there is no bound in that direction.

epsilon : ﬂoat

Used if approx_grad is True. The stepsize in a ﬁnite difference approximation for

fprime.

544 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scale : array_like

Scaling factors to apply to each variable. If None, the factors are up-low for interval

bounded variables and 1+|x| for the others. Defaults to None.

offset : array_like

Value to substract from each variable. If None, the offsets are (up+low)/2 for interval

bounded variables and x for the others.

messages :

Bit mask used to select messages display during minimization values deﬁned in the

MSGS dict. Defaults to MGS_ALL.

disp : int

Integer interface to messages. 0 = no message, 5 = all messages

maxCGit : int

Maximum number of hessian*vector evaluations per main iteration. If max-

CGit == 0, the direction chosen is -gradient if maxCGit < 0, maxCGit is set to

max(1,min(50,n/2)). Defaults to -1.

maxfun : int

Maximum number of function evaluation. if None, maxfun is set to max(100,

10*len(x0)). Defaults to None.

eta : ﬂoat

Severity of the line search. if < 0 or > 1, set to 0.25. Defaults to -1.

stepmx : ﬂoat

Maximum step for the line search. May be increased during call. If too small, it will

be set to 10.0. Defaults to 0.

accuracy : ﬂoat

Relative precision for ﬁnite difference calculations. If <= machine_precision, set to

sqrt(machine_precision). Defaults to 0.

fmin : ﬂoat

Minimum function value estimate. Defaults to 0.

ftol : ﬂoat

Precision goal for the value of f in the stoping criterion. If ftol < 0.0, ftol is set to 0.0

defaults to -1.

xtol : ﬂoat

Precision goal for the value of x in the stopping criterion (after applying x scaling

factors). If xtol < 0.0, xtol is set to sqrt(machine_precision). Defaults to -1.

pgtol : ﬂoat

Precision goal for the value of the projected gradient in the stopping criterion (after

applying x scaling factors). If pgtol < 0.0, pgtol is set to 1e-2 * sqrt(accuracy). Setting

it to 0.0 is not recommended. Defaults to -1.

rescale : ﬂoat

Scaling factor (in log10) used to trigger f value rescaling. If 0, rescale at each iteration.

If a large value, never rescale. If < 0, rescale is set to 1.3.

callback : callable, optional

Called after each iteration, as callback(xk), where xk is the current parameter vector.

Returns x: ndarray

The solution.

nfeval : int

The number of function evaluations.

rc : int

Return code as deﬁned in the RCSTRINGS dict.

See Also

minimize Interface to minimization algorithms for multivariate functions. See the ‘TNC’ method in partic-

ular.

5.20. Optimization and root ﬁnding (scipy.optimize) 545

SciPy Reference Guide, Release 0.13.0

Notes

The underlying algorithm is truncated Newton, also called Newton Conjugate-Gradient. This method differs

from scipy.optimize.fmin_ncg in that

1.It wraps a C implementation of the algorithm

2.It allows each variable to be given an upper and lower bound.

The algorithm incoporates the bound constraints by determining the descent direction as in an unconstrained

truncated Newton, but never taking a step-size large enough to leave the space of feasible x’s. The algorithm

keeps track of a set of currently active constraints, and ignores them when computing the minimum allowable

step size. (The x’s associated with the active constraint are kept ﬁxed.) If the maximum allowable step size is

zero then a new constraint is added. At the end of each iteration one of the constraints may be deemed no longer

active and removed. A constraint is considered no longer active is if it is currently active but the gradient for

that variable points inward from the constraint. The speciﬁc constraint removed is the one associated with the

variable of largest index whose constraint is no longer active.

References

Wright S., Nocedal J. (2006), ‘Numerical Optimization’

Nash S.G. (1984), “Newton-Type Minimization Via the Lanczos Method”, SIAM Journal of Numerical Analysis

21, pp. 770-778

scipy.optimize.fmin_cobyla(func,x0,cons,args=(),consargs=None,rhobeg=1.0,rhoend=0.0001,

iprint=1,maxfun=1000,disp=None)

Minimize a function using the Constrained Optimization BY Linear Approximation (COBYLA) method. This

method wraps a FORTRAN implentation of the algorithm.

Parameters func : callable

Function to minimize. In the form func(x, *args).

x0 : ndarray

Initial guess.

cons : sequence

Constraint functions; must all be >=0 (a single function if only 1 constraint). Each

function takes the parameters xas its ﬁrst argument.

args : tuple

Extra arguments to pass to function.

consargs : tuple

Extra arguments to pass to constraint functions (default of None means use same extra

arguments as those passed to func). Use () for no extra arguments.

rhobeg :

Reasonable initial changes to the variables.

rhoend :

Final accuracy in the optimization (not precisely guaranteed). This is a lower bound

on the size of the trust region.

iprint : {0, 1, 2, 3}

Controls the frequency of output; 0 implies no output. Deprecated.

disp : {0, 1, 2, 3}

Over-rides the iprint interface. Preferred.

maxfun : int

Maximum number of function evaluations.

Returns x: ndarray

The argument that minimises f.

546 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

See Also

minimize Interface to minimization algorithms for multivariate functions. See the ‘COBYLA’ method in

particular.

Notes

This algorithm is based on linear approximations to the objective function and each constraint. We brieﬂy

describe the algorithm.

Suppose the function is being minimized over k variables. At the jth iteration the algorithm has k+1 points v_1,

..., v_(k+1), an approximate solution x_j, and a radius RHO_j. (i.e. linear plus a constant) approximations to the

objective function and constraint functions such that their function values agree with the linear approximation

on the k+1 points v_1,.., v_(k+1). This gives a linear program to solve (where the linear approximations of the

constraint functions are constrained to be non-negative).

However the linear approximations are likely only good approximations near the current simplex, so the linear

program is given the further requirement that the solution, which will become x_(j+1), must be within RHO_j

from x_j. RHO_j only decreases, never increases. The initial RHO_j is rhobeg and the ﬁnal RHO_j is rhoend.

In this way COBYLA’s iterations behave like a trust region algorithm.

Additionally, the linear program may be inconsistent, or the approximation may give poor improvement. For

details about how these issues are resolved, as well as how the points v_i are updated, refer to the source code

or the references below.

References

Powell M.J.D. (1994), “A direct search optimization method that models the objective and constraint functions

by linear interpolation.”, in Advances in Optimization and Numerical Analysis, eds. S. Gomez and J-P Hennart,

Kluwer Academic (Dordrecht), pp. 51-67

Powell M.J.D. (1998), “Direct search algorithms for optimization calculations”, Acta Numerica 7, 287-336

Powell M.J.D. (2007), “A view of algorithms for optimization without derivatives”, Cambridge University Tech-

nical Report DAMTP 2007/NA03

Examples

Minimize the objective function f(x,y) = x*y subject to the constraints x**2 + y**2 < 1 and y > 0:

>>> def objective(x):

... return x[0]*x[1]

...

>>> def constr1(x):

... return 1-(x[0]**2+x[1]**2)

...

>>> def constr2(x):

... return x[1]

...

>>> fmin_cobyla(objective, [0.0,0.1], [constr1, constr2], rhoend=1e-7)

Normal return from subroutine COBYLA

NFVALS = 64 F =-5.000000E-01 MAXCV = 1.998401E-14

X =-7.071069E-01 7.071067E-01

array([-0.70710685, 0.70710671])

The exact solution is (-sqrt(2)/2, sqrt(2)/2).

5.20. Optimization and root ﬁnding (scipy.optimize) 547

SciPy Reference Guide, Release 0.13.0

scipy.optimize.fmin_slsqp(func,x0,eqcons=[],f_eqcons=None,ieqcons=[],

f_ieqcons=None,bounds=[],fprime=None,fprime_eqcons=None,

fprime_ieqcons=None,args=(),iter=100,acc=1e-06,iprint=1,

disp=None,full_output=0,epsilon=1.4901161193847656e-08)

Minimize a function using Sequential Least SQuares Programming

Python interface function for the SLSQP Optimization subroutine originally implemented by Dieter Kraft.

Parameters func : callable f(x,*args)

Objective function.

x0 : 1-D ndarray of ﬂoat

Initial guess for the independent variable(s).

eqcons : list

A list of functions of length n such that eqcons[j](x,*args) == 0.0 in a successfully

optimized problem.

f_eqcons : callable f(x,*args)

Returns a 1-D array in which each element must equal 0.0 in a successfully optimized

problem. If f_eqcons is speciﬁed, eqcons is ignored.

ieqcons : list

A list of functions of length n such that ieqcons[j](x,*args) >= 0.0 in a successfully

optimized problem.

f_ieqcons : callable f(x,*args)

Returns a 1-D ndarray in which each element must be greater or equal to 0.0 in a

successfully optimized problem. If f_ieqcons is speciﬁed, ieqcons is ignored.

bounds : list

A list of tuples specifying the lower and upper bound for each independent variable

[(xl0, xu0),(xl1, xu1),...]

fprime : callable f(x,*args)

A function that evaluates the partial derivatives of func.

fprime_eqcons : callable f(x,*args)

A function of the form f(x, *args) that returns the m by n array of equality constraint

normals. If not provided, the normals will be approximated. The array returned by

fprime_eqcons should be sized as ( len(eqcons), len(x0) ).

fprime_ieqcons : callable f(x,*args)

A function of the form f(x, *args) that returns the m by n array of inequality constraint

normals. If not provided, the normals will be approximated. The array returned by

fprime_ieqcons should be sized as ( len(ieqcons), len(x0) ).

args : sequence

Additional arguments passed to func and fprime.

iter : int

The maximum number of iterations.

acc : ﬂoat

Requested accuracy.

iprint : int

The verbosity of fmin_slsqp :

•iprint <= 0 : Silent operation

•iprint == 1 : Print summary upon completion (default)

•iprint >= 2 : Print status of each iterate and summary

disp : int

Over-rides the iprint interface (preferred).

full_output : bool

If False, return only the minimizer of func (default). Otherwise, output ﬁnal objective

function and summary information.

epsilon : ﬂoat

The step size for ﬁnite-difference derivative estimates.

548 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Returns out : ndarray of ﬂoat

The ﬁnal minimizer of func.

fx : ndarray of ﬂoat, if full_output is true

The ﬁnal value of the objective function.

its : int, if full_output is true

The number of iterations.

imode : int, if full_output is true

The exit mode from the optimizer (see below).

smode : string, if full_output is true

Message describing the exit mode from the optimizer.

See Also

minimize Interface to minimization algorithms for multivariate functions. See the ‘SLSQP’ method in

particular.

Notes

Exit modes are deﬁned as follows

-1 : Gradient evaluation required (g & a)

0 : Optimization terminated successfully.

1 : Function evaluation required (f & c)

2 : More equality constraints than independent variables

3 : More than 3*n iterations in LSQ subproblem

4 : Inequality constraints incompatible

5 : Singular matrix E in LSQ subproblem

6 : Singular matrix C in LSQ subproblem

7 : Rank-deficient equality constraint subproblem HFTI

8 : Positive directional derivative for linesearch

9 : Iteration limit exceeded

Examples

Examples are given in the tutorial.

scipy.optimize.nnls(A,b)

Solve argmin_x || Ax - b ||_2 for x>=0. This is a wrapper for a FORTAN non-negative least squares

solver.

Parameters A: ndarray

Matrix Aas shown above.

b: ndarray

Right-hand side vector.

Returns x: ndarray

Solution vector.

rnorm : ﬂoat

The residual, || Ax-b ||_2.

Notes

The FORTRAN code was published in the book below. The algorithm is an active set method. It solves the

KKT (Karush-Kuhn-Tucker) conditions for the non-negative least squares problem.

References

Lawson C., Hanson R.J., (1987) Solving Least Squares Problems, SIAM

5.20. Optimization and root ﬁnding (scipy.optimize) 549

SciPy Reference Guide, Release 0.13.0

Global

anneal(func, x0[, args, schedule, ...]) Minimize a function using simulated annealing.

basinhopping(func, x0[, niter, T, stepsize, ...]) Find the global minimum of a function using the basin-hopping algorithm ..

brute(func, ranges[, args, Ns, full_output, ...]) Minimize a function over a given range by brute force.

scipy.optimize.anneal(func,x0,args=(),schedule=’fast’,full_output=0,T0=None,Tf=1e-12,max-

eval=None,maxaccept=None,maxiter=400,boltzmann=1.0,learn_rate=0.5,

feps=1e-06,quench=1.0,m=1.0,n=1.0,lower=-100,upper=100,dwell=50,

disp=True)

Minimize a function using simulated annealing.

Uses simulated annealing, a random algorithm that uses no derivative information from the function being

optimized. Other names for this family of approaches include: “Monte Carlo”, “Metropolis”, “Metropolis-

Hastings”, etc. They all involve (a) evaluating the objective function on a random set of points, (b) keeping

those that pass their randomized evaluation critera, (c) cooling (i.e., tightening) the evaluation critera, and (d)

repeating until their termination critera are met. In practice they have been used mainly in discrete rather than

in continuous optimization.

Available annealing schedules are ‘fast’, ‘cauchy’ and ‘boltzmann’.

Parameters func : callable

The objective function to be minimized. Must be in the form f(x, *args), where xis

the argument in the form of a 1-D array and args is a tuple of any additional ﬁxed

parameters needed to completely specify the function.

x0: 1-D array

An initial guess at the optimizing argument of func.

args : tuple, optional

Any additional ﬁxed parameters needed to completely specify the objective function.

schedule : str, optional

The annealing schedule to use. Must be one of ‘fast’, ‘cauchy’ or ‘boltzmann’. See

Notes.

full_output : bool, optional

If full_output, then return all values listed in the Returns section. Otherwise, return

just the xmin and status values.

T0 : ﬂoat, optional

The initial “temperature”. If None, then estimate it as 1.2 times the largest cost-

function deviation over random points in the box-shaped region speciﬁed by the lower,

upper input parameters.

Tf : ﬂoat, optional

Final goal temperature. Cease iterations if the temperature falls below Tf.

maxeval : int, optional

Cease iterations if the number of function evaluations exceeds maxeval.

maxaccept : int, optional

Cease iterations if the number of points accepted exceeds maxaccept. See Notes for

the probabilistic acceptance criteria used.

maxiter : int, optional

Cease iterations if the number of cooling iterations exceeds maxiter.

learn_rate : ﬂoat, optional

Scale constant for tuning the probabilistc acceptance criteria.

boltzmann : ﬂoat, optional

Boltzmann constant in the probabilistic acceptance criteria (increase for less stringent

criteria at each temperature).

feps : ﬂoat, optional

550 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Cease iterations if the relative errors in the function value over the last four coolings

is below feps.

quench, m, n : ﬂoats, optional

Parameters to alter the fast simulated annealing schedule. See Notes.

lower, upper : ﬂoats or 1-D arrays, optional

Lower and upper bounds on the argument x. If ﬂoats are provided, they apply to all

components of x.

dwell : int, optional

The number of times to execute the inner loop at each value of the temperature. See

Notes.

disp : bool, optional

Print a descriptive convergence message if True.

Returns xmin : ndarray

The point where the lowest function value was found.

Jmin : ﬂoat

The objective function value at xmin.

T: ﬂoat

The temperature at termination of the iterations.

feval : int

Number of function evaluations used.

iters : int

Number of cooling iterations used.

accept : int

Number of tests accepted.

status : int

A code indicating the reason for termination:

•0 : Points no longer changing.

•1 : Cooled to ﬁnal temperature.

•2 : Maximum function evaluations reached.

•3 : Maximum cooling iterations reached.

•4 : Maximum accepted query locations reached.

•5 : Final point not the minimum amongst encountered points.

See Also

basinhopping

another (more performant) global optimizer

brute brute-force global optimizer

Notes

Simulated annealing is a random algorithm which uses no derivative information from the function being opti-

mized. In practice it has been more useful in discrete optimization than continuous optimization, as there are

usually better algorithms for continuous optimization problems.

Some experimentation by trying the different temperature schedules and altering their parameters is likely re-

quired to obtain good performance.

The randomness in the algorithm comes from random sampling in numpy. To obtain the same results you can

call numpy.random.seed with the same seed immediately before calling anneal.

We give a brief description of how the three temperature schedules generate new points and vary their tem-

perature. Temperatures are only updated with iterations in the outer loop. The inner loop is over loop over

xrange(dwell), and new points are generated for every iteration in the inner loop. Whether the proposed

new points are accepted is probabilistic.

5.20. Optimization and root ﬁnding (scipy.optimize) 551

SciPy Reference Guide, Release 0.13.0

For readability, let ddenote the dimension of the inputs to func. Also, let x_old denote the previous state,

and kdenote the iteration number of the outer loop. All other variables not deﬁned below are input variables to

anneal itself.

In the ‘fast’ schedule the updates are:

u ~ Uniform(0, 1, size = d)

y = sgn(u - 0.5) *T*((1 + 1/T)**abs(2*u - 1) - 1.0)

xc = y *(upper - lower)

x_new = x_old + xc

c=n*exp(-n *quench)

T_new = T0 *exp(-c *k**quench)

In the ‘cauchy’ schedule the updates are:

u ~ Uniform(-pi/2, pi/2, size=d)

xc = learn_rate *T*tan(u)

x_new = x_old + xc

T_new = T0 / (1 + k)

In the ‘boltzmann’ schedule the updates are:

std = minimum(sqrt(T) *ones(d), (upper - lower) / (3*learn_rate))

y ~ Normal(0, std, size = d)

x_new = x_old + learn_rate *y

T_new = T0 / log(1 + k)

References

[1] P. J. M. van Laarhoven and E. H. L. Aarts, “Simulated Annealing: Theory

and Applications”, Kluwer Academic Publishers, 1987.

[2] W.H. Press et al., “Numerical Recipies: The Art of Scientiﬁc Computing”,

Cambridge U. Press, 1987.

Examples

Example 1. We illustrate the use of anneal to seek the global minimum of a function of two variables that is

equal to the sum of a positive- deﬁnite quadratic and two deep “Gaussian-shaped” craters. Speciﬁcally, deﬁne

the objective function fas the sum of three other functions, f=f1+f2+f3. We suppose each of these

has a signature (z, *params), where z = (x, y),params, and the functions are as deﬁned below.

>>> params =(2,3,7,8,9,10,44,-1,2,26,1,-2,0.5)

>>> def f1(z, *params):

... x,y=z

... a, b, c, d, e, f, g, h, i, j, k, l, scale =params

... return (a *x**2+b*x*y+c*y**2+d*x+e*y+f)

>>> def f2(z, *params):

... x,y=z

... a, b, c, d, e, f, g, h, i, j, k, l, scale =params

... return (-g*np.exp(-((x-h)**2+(y-i)**2)/scale))

>>> def f3(z, *params):

... x,y=z

552 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

... a, b, c, d, e, f, g, h, i, j, k, l, scale =params

... return (-j*np.exp(-((x-k)**2+(y-l)**2)/scale))

>>> def f(z, *params):

... x,y=z

... a, b, c, d, e, f, g, h, i, j, k, l, scale =params

... return f1(z, *params) +f2(z, *params) +f3(z, *params)

>>> x0 =np.array([2.,2.]) # Initial guess.

>>> from scipy import optimize

>>> np.random.seed(555)# Seeded to allow replication.

>>> res =optimize.anneal(f, x0, args=params, schedule=’boltzmann’,

full_output=True, maxiter=500, lower=-10,

upper=10, dwell=250, disp=True)

Warning: Maximum number of iterations exceeded.

>>> res[0]# obtained minimum

array([-1.03914194, 1.81330654])

>>> res[1]# function value at minimum

-3.3817...

So this run settled on the point [-1.039, 1.813] with a minimum function value of about -3.382. The ﬁnal

temperature was about 212. The run used 125301 function evaluations, 501 iterations (including the initial

guess as a iteration), and accepted 61162 points. The status ﬂag of 3 also indicates that maxiter was reached.

This problem’s true global minimum lies near the point [-1.057, 1.808] and has a value of about -3.409. So these

anneal results are pretty good and could be used as the starting guess in a local optimizer to seek a more exact

local minimum.

Example 2. To minimize the same objective function using the minimize approach, we need to (a) convert the

options to an “options dictionary” using the keys prescribed for this method, (b) call the minimize function

with the name of the method (which in this case is ‘Anneal’), and (c) take account of the fact that the returned

value will be a Result object (i.e., a dictionary, as deﬁned in optimize.py).

All of the allowable options for ‘Anneal’ when using the minimize approach are listed in the myopts dictio-

nary given below, although in practice only the non-default values would be needed. Some of their names differ

from those used in the anneal approach. We can proceed as follows:

>>> myopts ={

’schedule’ : ’boltzmann’, # Non-default value.

’maxfev’ : None, # Default, formerly ‘maxeval‘.

’maxiter’ : 500, # Non-default value.

’maxaccept’ : None, # Default value.

’ftol’ : 1e-6, # Default, formerly ‘feps‘.

’T0’ : None, # Default value.

’Tf’ : 1e-12, # Default value.

’boltzmann’ : 1.0, # Default value.

’learn_rate’ : 0.5, # Default value.

’quench’ : 1.0, # Default value.

’m’ : 1.0, # Default value.

’n’ : 1.0, # Default value.

’lower’ : -10, # Non-default value.

’upper’ : +10, # Non-default value.

’dwell’ : 250, # Non-default value.

’disp’ : True # Default value.

}

>>> from scipy import optimize

>>> np.random.seed(777)# Seeded to allow replication.

>>> res2 =optimize.minimize(f, x0, args=params, method=’Anneal’,

options=myopts)

5.20. Optimization and root ﬁnding (scipy.optimize) 553

SciPy Reference Guide, Release 0.13.0

Warning: Maximum number of iterations exceeded.

>>> res2

status: 3

success: False

accept: 61742

nfev: 125301

T: 214.20624873839623

fun: -3.4084065576676053

x: array([-1.05757366, 1.8071427 ])

message: ’Maximum cooling iterations reached’

nit: 501

scipy.optimize.basinhopping(func,x0,niter=100,T=1.0,stepsize=0.5,minimizer_kwargs=None,

take_step=None,accept_test=None,callback=None,interval=50,

disp=False,niter_success=None)

Find the global minimum of a function using the basin-hopping algorithm New in version 0.12.0.

Parameters func : callable f(x, *args)

Function to be optimized. args can be passed as an optional item in the dict

minimizer_kwargs

x0 : ndarray

Initial guess.

niter : integer, optional

The number of basin hopping iterations

T: ﬂoat, optional

The “temperature” parameter for the accept or reject criterion. Higher “temperatures”

mean that larger jumps in function value will be accepted. For best results Tshould

be comparable to the separation (in function value) between local minima.

stepsize : ﬂoat, optional

initial step size for use in the random displacement.

minimizer_kwargs : dict, optional

Extra keyword arguments to be passed to the minimizer

scipy.optimize.minimize() Some important options could be:

method [str] The minimization method (e.g. "L-BFGS-B")

args [tuple] Extra arguments passed to the objective function (func)

and its derivatives (Jacobian, Hessian).

take_step : callable take_step(x), optional

Replace the default step taking routine with this routine. The default step taking rou-

tine is a random displacement of the coordinates, but other step taking algorithms

may be better for some systems. take_step can optionally have the attribute

take_step.stepsize. If this attribute exists, then basinhopping will adjust

take_step.stepsize in order to try to optimize the global minimum search.

accept_test : callable, accept_test(f_new=f_new, x_new=x_new,

f_old=fold, x_old=x_old), optional

Deﬁne a test which will be used to judge whether or not to accept the step. This will

be used in addition to the Metropolis test based on “temperature” T. The acceptable

return values are True, False, or "force accept". If the latter, then this will

override any other tests in order to accept the step. This can be used, for example, to

forcefully escape from a local minimum that basinhopping is trapped in.

callback : callable, callback(x, f, accept), optional

A callback function which will be called for all minimum found. xand fare the

coordinates and function value of the trial minima, and accept is whether or not

that minima was accepted. This can be used, for example, to save the lowest N min-

ima found. Also, callback can be used to specify a user deﬁned stop criterion by

optionally returning True to stop the basinhopping routine.

554 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

interval : integer, optional

interval for how often to update the stepsize

disp : bool, optional

Set to True to print status messages

niter_success : integer, optional

Stop the run if the global minimum candidate remains the same for this number of

iterations.

Returns res : Result

The optimization result represented as a Result object. Important attributes are:

xthe solution array, fun the value of the function at the solution, and message

which describes the cause of the termination. See Result for a description of other

attributes.

See Also

minimize The local minimization function called once for each basinhopping step. minimizer_kwargs

is passed to this routine.

Notes

Basin-hopping is a stochastic algorithm which attempts to ﬁnd the global minimum of a smooth scalar function

of one or more variables [R86] [R87] [R88] [R89]. The algorithm in its current form was described by David

Wales and Jonathan Doye [R87] http://www-wales.ch.cam.ac.uk/.

The algorithm is iterative with each cycle composed of the following features

1.random perturbation of the coordinates

2.local minimization

3.accept or reject the new coordinates based on the minimized function value

The acceptance test used here is the Metropolis criterion of standard Monte Carlo algorithms, although there are

many other possibilities [R88].

This global minimization method has been shown to be extremely efﬁcient for a wide variety of problems in

physics and chemistry. It is particularly useful when the function has many minima separated by large barriers.

See the Cambridge Cluster Database http://www-wales.ch.cam.ac.uk/CCD.html for databases of molecular sys-

tems that have been optimized primarily using basin-hopping. This database includes minimization problems

exceeding 300 degrees of freedom.

See the free software program GMIN (http://www-wales.ch.cam.ac.uk/GMIN) for a Fortran implementation of

basin-hopping. This implementation has many different variations of the procedure described above, including

more advanced step taking algorithms and alternate acceptance criterion.

For stochastic global optimization there is no way to determine if the true global minimum has actually been

found. Instead, as a consistency check, the algorithm can be run from a number of different random starting

points to ensure the lowest minimum found in each example has converged to the global minimum. For this

reason basinhopping will by default simply run for the number of iterations niter and return the lowest

minimum found. It is left to the user to ensure that this is in fact the global minimum.

Choosing stepsize: This is a crucial parameter in basinhopping and depends on the problem being

solved. Ideally it should be comparable to the typical separation between local minima of the function being

optimized. basinhopping will, by default, adjust stepsize to ﬁnd an optimal value, but this may take

many iterations. You will get quicker results if you set a sensible value for stepsize.

Choosing T: The parameter Tis the temperature used in the metropolis criterion. Basinhopping steps are ac-

cepted with probability 1if func(xnew) < func(xold), or otherwise with probability:

5.20. Optimization and root ﬁnding (scipy.optimize) 555

SciPy Reference Guide, Release 0.13.0

exp( -(func(xnew) -func(xold)) /T )

So, for best results, Tshould to be comparable to the typical difference in function value between between local

minima

References

[R86],[R87],[R88],[R89]

Examples

The following example is a one-dimensional minimization problem, with many local minima superimposed on

a parabola.

>>> func =lambda x: cos(14.5 *x-0.3)+(x +0.2)*x

>>> x0=[1.]

Basinhopping, internally, uses a local minimization algorithm. We will use the parameter

minimizer_kwargs to tell basinhopping which algorithm to use and how to set up that minimizer.

This parameter will be passed to scipy.optimize.minimize().

>>> minimizer_kwargs ={"method":"BFGS"}

>>> ret =basinhopping(func, x0, minimizer_kwargs=minimizer_kwargs,

... niter=200)

>>> print("global minimum: x = %.4f, f(x0) = %.4f"%(ret.x, ret.fun))

global minimum: x = -0.1951, f(x0) = -1.0009

Next consider a two-dimensional minimization problem. Also, this time we will use gradient information to

signiﬁcantly speed up the search.

>>> def func2d(x):

... f=cos(14.5 *x[0]-0.3)+(x[1]+0.2)*x[1]+(x[0]+

... 0.2)*x[0]

... df =np.zeros(2)

... df[0]= -14.5 *sin(14.5 *x[0]-0.3)+2. *x[0]+0.2

... df[1]=2. *x[1]+0.2

... return f, df

We’ll also use a different local minimization algorithm. Also we must tell the minimizer that our function returns

both energy and gradient (jacobian)

>>> minimizer_kwargs ={"method":"L-BFGS-B","jac":True}

>>> x0 =[1.0,1.0]

>>> ret =basinhopping(func2d, x0, minimizer_kwargs=minimizer_kwargs,

... niter=200)

>>> print("global minimum: x = [%.4f,%.4f], f(x0) = %.4f"%(ret.x[0],

... ret.x[1],

... ret.fun))

global minimum: x = [-0.1951, -0.1000], f(x0) = -1.0109

Here is an example using a custom step taking routine. Imagine you want the ﬁrst coordinate to take larger steps

then the rest of the coordinates. This can be implemented like so:

>>> class MyTakeStep(object):

... def __init__(self, stepsize=0.5):

... self.stepsize =stepsize

... def __call__(self, x):

... s=self.stepsize

... x[0]+= np.random.uniform(-2.*s, 2.*s)

556 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

... x[1:] += np.random.uniform(-s, s, x[1:].shape)

... return x

Since MyTakeStep.stepsize exists basinhopping will adjust the magnitude of stepsize to optimize the

search. We’ll use the same 2-D function as before

>>> mytakestep =MyTakeStep()

>>> ret =basinhopping(func2d, x0, minimizer_kwargs=minimizer_kwargs,

... niter=200, take_step=mytakestep)

>>> print("global minimum: x = [%.4f,%.4f], f(x0) = %.4f"%(ret.x[0],

... ret.x[1],

... ret.fun))

global minimum: x = [-0.1951, -0.1000], f(x0) = -1.0109

Now let’s do an example using a custom callback function which prints the value of every minimum found

>>> def print_fun(x, f, accepted):

... print("at minima %.4f accepted %d"%(f, int(accepted)))

We’ll run it for only 10 basinhopping steps this time.

>>> np.random.seed(1)

>>> ret =basinhopping(func2d, x0, minimizer_kwargs=minimizer_kwargs,

... niter=10, callback=print_fun)

at minima 0.4159 accepted 1

at minima -0.9073 accepted 1

at minima -0.1021 accepted 1

at minima 0.9102 accepted 1

at minima 2.2945 accepted 0

at minima -0.1021 accepted 1

at minima -1.0109 accepted 1

The minima at -1.0109 is actually the global minimum, found already on the 8th iteration.

Now let’s implement bounds on the problem using a custom accept_test:

>>> class MyBounds(object):

... def __init__(self, xmax=[1.1,1.1], xmin=[-1.1,-1.1] ):

... self.xmax =np.array(xmax)

... self.xmin =np.array(xmin)

... def __call__(self,**kwargs):

... x=kwargs["x_new"]

... tmax =bool(np.all(x <= self.xmax))

... tmin =bool(np.all(x >= self.xmin))

... return tmax and tmin

>>> mybounds =MyBounds()

>>> ret =basinhopping(func2d, x0, minimizer_kwargs=minimizer_kwargs,

... niter=10, accept_test=mybounds)

scipy.optimize.brute(func,ranges,args=(),Ns=20,full_output=0,ﬁnish=<function fmin at

0x45df5f0>,disp=False)

Minimize a function over a given range by brute force.

Uses the “brute force” method, i.e. computes the function’s value at each point of a multidimensional grid of

points, to ﬁnd the global minimum of the function.

Parameters func : callable

5.20. Optimization and root ﬁnding (scipy.optimize) 557

SciPy Reference Guide, Release 0.13.0

The objective function to be minimized. Must be in the form f(x, *args), where

xis the argument in the form of a 1-D array and args is a tuple of any additional

ﬁxed parameters needed to completely specify the function.

ranges : tuple

Each component of the ranges tuple must be either a “slice object” or a range tuple

of the form (low, high). The program uses these to create the grid of points on

which the objective function will be computed. See Note 2 for more detail.

args : tuple, optional

Any additional ﬁxed parameters needed to completely specify the function.

Ns : int, optional

Number of grid points along the axes, if not otherwise speciﬁed. See Note2.

full_output : bool, optional

If True, return the evaluation grid and the objective function’s values on it.

ﬁnish : callable, optional

An optimization function that is called with the result of brute force minimization as

initial guess. ﬁnish should take the initial guess as positional argument, and take args,

full_output and disp as keyword arguments. Use None if no “polishing” function is to

be used. See Notes for more details.

disp : bool, optional

Set to True to print convergence messages.

Returns x0 : ndarray

A 1-D array containing the coordinates of a point at which the objective function had

its minimum value. (See Note 1 for which point is returned.)

fval : ﬂoat

Function value at the point x0.

grid : tuple

Representation of the evaluation grid. It has the same length as x0. (Returned when

full_output is True.)

Jout : ndarray

Function values at each point of the evaluation grid, i.e.,Jout = func(*grid).

(Returned when full_output is True.)

See Also

anneal Another approach to seeking the global minimum of

multivariate,multimodal

Notes

Note 1: The program ﬁnds the gridpoint at which the lowest value of the objective function occurs. If ﬁnish is

None, that is the point returned. When the global minimum occurs within (or not very far outside) the grid’s

boundaries, and the grid is ﬁne enough, that point will be in the neighborhood of the gobal minimum.

However, users often employ some other optimization program to “polish” the gridpoint values, i.e., to seek

a more precise (local) minimum near brute’s best gridpoint. The brute function’s ﬁnish option provides

a convenient way to do that. Any polishing program used must take brute’s output as its initial guess as a

positional argument, and take brute’s input values for args and full_output as keyword arguments, otherwise an

error will be raised.

brute assumes that the ﬁnish function returns a tuple in the form: (xmin, Jmin, ... ,

statuscode), where xmin is the minimizing value of the argument, Jmin is the minimum value of the

objective function, ”...” may be some other returned values (which are not used by brute), and statuscode

is the status code of the ﬁnish program.

558 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Note that when ﬁnish is not None, the values returned are those of the ﬁnish program, not the gridpoint ones.

Consequently, while brute conﬁnes its search to the input grid points, the ﬁnish program’s results usually will

not coincide with any gridpoint, and may fall outside the grid’s boundary.

Note 2: The grid of points is a numpy.mgrid object. For brute the ranges and Ns inputs have the following

effect. Each component of the ranges tuple can be either a slice object or a two-tuple giving a range of values,

such as (0, 5). If the component is a slice object, brute uses it directly. If the component is a two-tuple range,

brute internally converts it to a slice object that interpolates Ns points from its low-value to its high-value,

inclusive.

Examples

We illustrate the use of brute to seek the global minimum of a function of two variables that is given as the

sum of a positive-deﬁnite quadratic and two deep “Gaussian-shaped” craters. Speciﬁcally, deﬁne the objective

function fas the sum of three other functions, f=f1+f2+f3. We suppose each of these has a signature

(z, *params), where z = (x, y), and params and the functions are as deﬁned below.

>>> params =(2,3,7,8,9,10,44,-1,2,26,1,-2,0.5)

>>> def f1(z, *params):

... x,y=z

... a, b, c, d, e, f, g, h, i, j, k, l, scale =params

... return (a *x**2+b*x*y+c*y**2+d*x+e*y+f)

>>> def f2(z, *params):

... x,y=z

... a, b, c, d, e, f, g, h, i, j, k, l, scale =params

... return (-g*np.exp(-((x-h)**2+(y-i)**2)/scale))

>>> def f3(z, *params):

... x,y=z

... a, b, c, d, e, f, g, h, i, j, k, l, scale =params

... return (-j*np.exp(-((x-k)**2+(y-l)**2)/scale))

>>> def f(z, *params):

... x,y=z

... a, b, c, d, e, f, g, h, i, j, k, l, scale =params

... return f1(z, *params) +f2(z, *params) +f3(z, *params)

Thus, the objective function may have local minima near the minimum of each of the three functions of which

it is composed. To use fmin to polish its gridpoint result, we may then continue as follows:

>>> rranges =(slice(-4,4,0.25), slice(-4,4,0.25))

>>> from scipy import optimize

>>> resbrute =optimize.brute(f, rranges, args=params, full_output=True,

finish=optimize.fmin)

>>> resbrute[0]# global minimum

array([-1.05665192, 1.80834843])

>>> resbrute[1]# function value at global minimum

-3.4085818767

Note that if ﬁnish had been set to None, we would have gotten the gridpoint [-1.0 1.75] where the rounded

function value is -2.892.

Scalar function minimizers

minimize_scalar(fun[, bracket, bounds, ...]) Minimization of scalar function of one variable.

Continued on next page

5.20. Optimization and root ﬁnding (scipy.optimize) 559

SciPy Reference Guide, Release 0.13.0

Table 5.93 – continued from previous page

fminbound(func, x1, x2[, args, xtol, ...]) Bounded minimization for scalar functions.

brent(func[, args, brack, tol, full_output, ...]) Given a function of one-variable and a possible bracketing interval, return the minimum of the function isolated to a fractional precision of tol.

golden(func[, args, brack, tol, full_output]) Return the minimum of a function of one variable.

bracket(func[, xa, xb, args, grow_limit, ...]) Bracket the minimum of the function.

scipy.optimize.minimize_scalar(fun,bracket=None,bounds=None,args=(),method=’brent’,

tol=None,options=None)

Minimization of scalar function of one variable. New in version 0.11.0.

Parameters fun : callable

Objective function. Scalar function, must return a scalar.

bracket : sequence, optional

For methods ‘brent’ and ‘golden’, bracket deﬁnes the bracketing interval and can

either have three items (a, b, c) so that a < b < c and fun(b) < fun(a), fun(c) or two

items aand cwhich are assumed to be a starting interval for a downhill bracket search

(see bracket); it doesn’t always mean that the obtained solution will satisfy a <= x

<= c.

bounds : sequence, optional

For method ‘bounded’, bounds is mandatory and must have two items corresponding

to the optimization bounds.

args : tuple, optional

Extra arguments passed to the objective function.

method : str, optional

Type of solver. Should be one of

•‘Brent’

•‘Bounded’

•‘Golden’

tol : ﬂoat, optional

Tolerance for termination. For detailed control, use solver-speciﬁc options.

options : dict, optional

A dictionary of solver options.

xtol [ﬂoat] Relative error in solution xopt acceptable for con-

vergence.

maxiter [int] Maximum number of iterations to perform.

disp [bool] Set to True to print convergence messages.

Returns res : Result

The optimization result represented as a Result object. Important attributes are: x

the solution array, success a Boolean ﬂag indicating if the optimizer exited suc-

cessfully and message which describes the cause of the termination. See Result

for a description of other attributes.

See Also

minimize Interface to minimization algorithms for scalar multivariate functions.

Notes

This section describes the available solvers that can be selected by the ‘method’ parameter. The default method

is Brent.

Method Brent uses Brent’s algorithm to ﬁnd a local minimum. The algorithm uses inverse parabolic interpolation

when possible to speed up convergence of the golden section method.

560 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Method Golden uses the golden section search technique. It uses analog of the bisection method to decrease the

bracketed interval. It is usually preferable to use the Brent method.

Method Bounded can perform bounded minimization. It uses the Brent method to ﬁnd a local minimum in the

interval x1 < xopt < x2.

Examples

Consider the problem of minimizing the following function.

>>> def f(x):

... return (x -2)*x*(x +2)**2

Using the Brent method, we ﬁnd the local minimum as:

>>> from scipy.optimize import minimize_scalar

>>> res =minimize_scalar(f)

>>> res.x

1.28077640403

Using the Bounded method, we ﬁnd a local minimum with speciﬁed bounds as:

>>> res =minimize_scalar(f, bounds=(-3,-1), method=’bounded’)

>>> res.x

-2.0000002026

scipy.optimize.fminbound(func,x1,x2,args=(),xtol=1e-05,maxfun=500,full_output=0,disp=1)

Bounded minimization for scalar functions.

Parameters func : callable f(x,*args)

Objective function to be minimized (must accept and return scalars).

x1, x2 : ﬂoat or array scalar

The optimization bounds.

args : tuple, optional

Extra arguments passed to function.

xtol : ﬂoat, optional

The convergence tolerance.

maxfun : int, optional

Maximum number of function evaluations allowed.

full_output : bool, optional

If True, return optional outputs.

disp : int, optional

If non-zero, print messages.

0 : no message printing. 1 : non-convergence notiﬁcation messages

only. 2 : print a message on convergence too. 3 : print iteration results.

Returns xopt : ndarray

Parameters (over given interval) which minimize the objective function.

fval : number

The function value at the minimum point.

ierr : int

An error ﬂag (0 if converged, 1 if maximum number of function calls reached).

numfunc : int

The number of function calls made.

See Also

minimize_scalar

Interface to minimization algorithms for scalar univariate functions. See the ‘Bounded’ method

in particular.

5.20. Optimization and root ﬁnding (scipy.optimize) 561

SciPy Reference Guide, Release 0.13.0

Notes

Finds a local minimizer of the scalar function func in the interval x1 < xopt < x2 using Brent’s method. (See

brent for auto-bracketing).

scipy.optimize.brent(func,args=(),brack=None,tol=1.48e-08,full_output=0,maxiter=500)

Given a function of one-variable and a possible bracketing interval, return the minimum of the function isolated

to a fractional precision of tol.

Parameters func : callable f(x,*args)

Objective function.

args

Additional arguments (if present).

brack : tuple

Triple (a,b,c) where (a<b<c) and func(b) < func(a),func(c). If bracket consists of two

numbers (a,c) then they are assumed to be a starting interval for a downhill bracket

search (see bracket); it doesn’t always mean that the obtained solution will satisfy

a<=x<=c.

tol : ﬂoat

Stop if between iteration change is less than tol.

full_output : bool

If True, return all output args (xmin, fval, iter, funcalls).

maxiter : int

Maximum number of iterations in solution.

Returns xmin : ndarray

Optimum point.

fval : ﬂoat

Optimum value.

iter : int

Number of iterations.

funcalls : int

Number of objective function evaluations made.

See Also

minimize_scalar

Interface to minimization algorithms for scalar univariate functions. See the ‘Brent’ method in

particular.

Notes

Uses inverse parabolic interpolation when possible to speed up convergence of golden section method.

scipy.optimize.golden(func,args=(),brack=None,tol=1.4901161193847656e-08,full_output=0)

Return the minimum of a function of one variable.

Given a function of one variable and a possible bracketing interval, return the minimum of the function isolated

to a fractional precision of tol.

Parameters func : callable func(x,*args)

Objective function to minimize.

args : tuple

Additional arguments (if present), passed to func.

brack : tuple

Triple (a,b,c), where (a<b<c) and func(b) < func(a),func(c). If bracket consists of two

numbers (a, c), then they are assumed to be a starting interval for a downhill bracket

search (see bracket); it doesn’t always mean that obtained solution will satisfy

a<=x<=c.

562 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

tol : ﬂoat

x tolerance stop criterion

full_output : bool

If True, return optional outputs.

See Also

minimize_scalar

Interface to minimization algorithms for scalar univariate functions. See the ‘Golden’ method in

particular.

Notes

Uses analog of bisection method to decrease the bracketed interval.

scipy.optimize.bracket(func,xa=0.0,xb=1.0,args=(),grow_limit=110.0,maxiter=1000)

Bracket the minimum of the function.

Given a function and distinct initial points, search in the downhill direction (as deﬁned by the initital points) and

return new points xa, xb, xc that bracket the minimum of the function f(xa) > f(xb) < f(xc). It doesn’t always

mean that obtained solution will satisfy xa<=x<=xb

Parameters func : callable f(x,*args)

Objective function to minimize.

xa, xb : ﬂoat, optional

Bracketing interval. Defaults xa to 0.0, and xb to 1.0.

args : tuple, optional

Additional arguments (if present), passed to func.

grow_limit : ﬂoat, optional

Maximum grow limit. Defaults to 110.0

maxiter : int, optional

Maximum number of iterations to perform. Defaults to 1000.

Returns xa, xb, xc : ﬂoat

Bracket.

fa, fb, fc : ﬂoat

Objective function values in bracket.

funcalls : int

Number of function evaluations made.

Rosenbrock function

rosen(x) The Rosenbrock function.

rosen_der(x) The derivative (i.e.

rosen_hess(x) The Hessian matrix of the Rosenbrock function.

rosen_hess_prod(x, p) Product of the Hessian matrix of the Rosenbrock function with a vector.

scipy.optimize.rosen(x)

The Rosenbrock function.

The function computed is:

sum(100.0*(x[1:] - x[:-1]**2.0)**2.0 + (1 - x[:-1])**2.0

Parameters x: array_like

1-D array of points at which the Rosenbrock function is to be computed.

5.20. Optimization and root ﬁnding (scipy.optimize) 563

SciPy Reference Guide, Release 0.13.0

Returns f: ﬂoat

The value of the Rosenbrock function.

See Also

rosen_der,rosen_hess,rosen_hess_prod

scipy.optimize.rosen_der(x)

The derivative (i.e. gradient) of the Rosenbrock function.

Parameters x: array_like

1-D array of points at which the derivative is to be computed.

Returns rosen_der : (N,) ndarray

The gradient of the Rosenbrock function at x.

See Also

rosen,rosen_hess,rosen_hess_prod

scipy.optimize.rosen_hess(x)

The Hessian matrix of the Rosenbrock function.

Parameters x: array_like

1-D array of points at which the Hessian matrix is to be computed.

Returns rosen_hess : ndarray

The Hessian matrix of the Rosenbrock function at x.

See Also

rosen,rosen_der,rosen_hess_prod

scipy.optimize.rosen_hess_prod(x,p)

Product of the Hessian matrix of the Rosenbrock function with a vector.

Parameters x: array_like

1-D array of points at which the Hessian matrix is to be computed.

p: array_like

1-D array, the vector to be multiplied by the Hessian matrix.

Returns rosen_hess_prod : ndarray

The Hessian matrix of the Rosenbrock function at xmultiplied by the vector p.

See Also

rosen,rosen_der,rosen_hess

5.20.2 Fitting

curve_fit(f, xdata, ydata[, p0, sigma]) Use non-linear least squares to ﬁt a function, f, to data.

scipy.optimize.curve_fit(f,xdata,ydata,p0=None,sigma=None,**kw)

Use non-linear least squares to ﬁt a function, f, to data.

Assumes ydata = f(xdata, *params) + eps

Parameters f: callable

The model function, f(x, ...). It must take the independent variable as the ﬁrst argument

and the parameters to ﬁt as separate remaining arguments.

xdata : An N-length sequence or an (k,N)-shaped array

564 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

for functions with k predictors. The independent variable where the data is measured.

ydata : N-length sequence

The dependent data — nominally f(xdata, ...)

p0 : None, scalar, or M-length sequence

Initial guess for the parameters. If None, then the initial values will all be 1 (if the

number of parameters for the function can be determined using introspection, other-

wise a ValueError is raised).

sigma : None or N-length sequence

If not None, this vector will be used as relative weights in the least-squares problem.

Returns popt : array

Optimal values for the parameters so that the sum of the squared error of f(xdata,

*popt) - ydata is minimized

pcov : 2d array

The estimated covariance of popt. The diagonals provide the variance of the parameter

estimate.

See Also

leastsq

Notes

The algorithm uses the Levenberg-Marquardt algorithm through leastsq. Additional keyword arguments are

passed directly to that algorithm.

Examples

>>> import numpy as np

>>> from scipy.optimize import curve_fit

>>> def func(x, a, b, c):

... return a*np.exp(-b*x) +c

>>> x=np.linspace(0,4,50)

>>> y=func(x, 2.5,1.3,0.5)

>>> yn =y+0.2*np.random.normal(size=len(x))

>>> popt, pcov =curve_fit(func, x, yn)

5.20.3 Root ﬁnding

Scalar functions

brentq(f, a, b[, args, xtol, rtol, maxiter, ...]) Find a root of a function in given interval.

brenth(f, a, b[, args, xtol, rtol, maxiter, ...]) Find root of f in [a,b].

ridder(f, a, b[, args, xtol, rtol, maxiter, ...]) Find a root of a function in an interval.

bisect(f, a, b[, args, xtol, rtol, maxiter, ...]) Find root of a function within an interval.

newton(func, x0[, fprime, args, tol, ...]) Find a zero using the Newton-Raphson or secant method.

scipy.optimize.brentq(f,a,b,args=(),xtol=1e-12,rtol=4.4408920985006262e-16,maxiter=100,

full_output=False,disp=True)

Find a root of a function in given interval.

Return ﬂoat, a zero of fbetween aand b.fmust be a continuous function, and [a,b] must be a sign changing

interval.

5.20. Optimization and root ﬁnding (scipy.optimize) 565

SciPy Reference Guide, Release 0.13.0

Description: Uses the classic Brent (1973) method to ﬁnd a zero of the function fon the sign changing interval [a

, b]. Generally considered the best of the rootﬁnding routines here. It is a safe version of the secant method that

uses inverse quadratic extrapolation. Brent’s method combines root bracketing, interval bisection, and inverse

quadratic interpolation. It is sometimes known as the van Wijngaarden-Deker-Brent method. Brent (1973)

claims convergence is guaranteed for functions computable within [a,b].

[Brent1973] provides the classic description of the algorithm. Another description can be found

in a recent edition of Numerical Recipes, including [PressEtal1992]. Another description is at

http://mathworld.wolfram.com/BrentsMethod.html. It should be easy to understand the algorithm just by read-

ing our code. Our code diverges a bit from standard presentations: we choose a different formula for the

extrapolation step.

Parameters f: function

Python function returning a number. f must be continuous, and f(a) and f(b) must have

opposite signs.

a: number

One end of the bracketing interval [a,b].

b: number

The other end of the bracketing interval [a,b].

xtol : number, optional

The routine converges when a root is known to lie within xtol of the value return.

Should be >= 0. The routine modiﬁes this to take into account the relative precision

of doubles.

rtol : number, optional

The routine converges when a root is known to lie within rtol times the value returned

of the value returned. Should be >= 0. Defaults to np.finfo(float).eps *2.

maxiter : number, optional

if convergence is not achieved in maxiter iterations, and error is raised. Must be >= 0.

args : tuple, optional

containing extra arguments for the function f.fis called by apply(f,

(x)+args).

full_output : bool, optional

If full_output is False, the root is returned. If full_output is True, the return value is

(x, r), where xis the root, and ris a RootResults object.

disp : bool, optional

If True, raise RuntimeError if the algorithm didn’t converge.

Returns x0 : ﬂoat

Zero of fbetween aand b.

r: RootResults (present if full_output = True)

Object containing information about the convergence. In particular, r.converged

is True if the routine converged.

See Also

multivariate

fmin,fmin_powell,fmin_cg,fmin_bfgs,fmin_ncg

nonlinear leastsq

constrained

fmin_l_bfgs_b,fmin_tnc,fmin_cobyla

global anneal,basinhopping,brute

local fminbound,brent,golden,bracket

n-dimensional

fsolve

566 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

one-dimensional

brentq,brenth,ridder,bisect,newton

scalar fixed_point

Notes

fmust be continuous. f(a) and f(b) must have opposite signs.

References

[Brent1973],[PressEtal1992]

scipy.optimize.brenth(f,a,b,args=(),xtol=1e-12,rtol=4.4408920985006262e-16,maxiter=100,

full_output=False,disp=True)

Find root of f in [a,b].

A variation on the classic Brent routine to ﬁnd a zero of the function f between the arguments a and b that uses

hyperbolic extrapolation instead of inverse quadratic extrapolation. There was a paper back in the 1980’s ... f(a)

and f(b) can not have the same signs. Generally on a par with the brent routine, but not as heavily tested. It is a

safe version of the secant method that uses hyperbolic extrapolation. The version here is by Chuck Harris.

Parameters f: function

Python function returning a number. f must be continuous, and f(a) and f(b) must have

opposite signs.

a: number

One end of the bracketing interval [a,b].

b: number

The other end of the bracketing interval [a,b].

xtol : number, optional

The routine converges when a root is known to lie within xtol of the value return.

Should be >= 0. The routine modiﬁes this to take into account the relative precision

of doubles.

rtol : number, optional

The routine converges when a root is known to lie within rtol times the value returned

of the value returned. Should be >= 0. Defaults to np.finfo(float).eps *2.

maxiter : number, optional

if convergence is not achieved in maxiter iterations, and error is raised. Must be >= 0.

args : tuple, optional

containing extra arguments for the function f.fis called by apply(f,

(x)+args).

full_output : bool, optional

If full_output is False, the root is returned. If full_output is True, the return value is

(x, r), where xis the root, and ris a RootResults object.

disp : bool, optional

If True, raise RuntimeError if the algorithm didn’t converge.

Returns x0 : ﬂoat

Zero of fbetween aand b.

r: RootResults (present if full_output = True)

Object containing information about the convergence. In particular, r.converged

is True if the routine converged.

See Also

fmin,fmin_powell,fmin_cg

leastsq nonlinear least squares minimizer

5.20. Optimization and root ﬁnding (scipy.optimize) 567

SciPy Reference Guide, Release 0.13.0

fmin_l_bfgs_b,fmin_tnc,fmin_cobyla,anneal,brute,fminbound,brent,golden,

bracket

fsolve n-dimensional root-ﬁnding

brentq,brenth,ridder,bisect,newton

fixed_point

scalar ﬁxed-point ﬁnder

scipy.optimize.ridder(f,a,b,args=(),xtol=1e-12,rtol=4.4408920985006262e-16,maxiter=100,

full_output=False,disp=True)

Find a root of a function in an interval.

Parameters f: function

Python function returning a number. f must be continuous, and f(a) and f(b) must have

opposite signs.

a: number

One end of the bracketing interval [a,b].

b: number

The other end of the bracketing interval [a,b].

xtol : number, optional

The routine converges when a root is known to lie within xtol of the value return.

Should be >= 0. The routine modiﬁes this to take into account the relative precision

of doubles.

rtol : number, optional

The routine converges when a root is known to lie within rtol times the value returned

of the value returned. Should be >= 0. Defaults to np.finfo(float).eps *2.

maxiter : number, optional

if convergence is not achieved in maxiter iterations, and error is raised. Must be >= 0.

args : tuple, optional

containing extra arguments for the function f.fis called by apply(f,

(x)+args).

full_output : bool, optional

If full_output is False, the root is returned. If full_output is True, the return value is

(x, r), where xis the root, and ris a RootResults object.

disp : bool, optional

If True, raise RuntimeError if the algorithm didn’t converge.

Returns x0 : ﬂoat

Zero of fbetween aand b.

r: RootResults (present if full_output = True)

Object containing information about the convergence. In particular, r.converged

is True if the routine converged.

See Also

brentq,brenth,bisect,newton

fixed_point

scalar ﬁxed-point ﬁnder

Notes

Uses [Ridders1979] method to ﬁnd a zero of the function fbetween the arguments aand b. Ridders’ method

is faster than bisection, but not generally as fast as the Brent rountines. [Ridders1979] provides the classic

description and source of the algorithm. A description can also be found in any recent edition of Numerical

Recipes.

The routine used here diverges slightly from standard presentations in order to be a bit more careful of tolerance.

568 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

References

[Ridders1979]

scipy.optimize.bisect(f,a,b,args=(),xtol=1e-12,rtol=4.4408920985006262e-16,maxiter=100,

full_output=False,disp=True)

Find root of a function within an interval.

Basic bisection routine to ﬁnd a zero of the function fbetween the arguments aand b.f(a) and f(b) can not have

the same signs. Slow but sure.

Parameters f: function

Python function returning a number. fmust be continuous, and f(a) and f(b) must

have opposite signs.

a: number

One end of the bracketing interval [a,b].

b: number

The other end of the bracketing interval [a,b].

xtol : number, optional

The routine converges when a root is known to lie within xtol of the value return.

Should be >= 0. The routine modiﬁes this to take into account the relative precision

of doubles.

rtol : number, optional

The routine converges when a root is known to lie within rtol times the value returned

of the value returned. Should be >= 0. Defaults to np.finfo(float).eps *2.

maxiter : number, optional

if convergence is not achieved in maxiter iterations, and error is raised. Must be >= 0.

args : tuple, optional

containing extra arguments for the function f.fis called by apply(f,

(x)+args).

full_output : bool, optional

If full_output is False, the root is returned. If full_output is True, the return value is

(x, r), where x is the root, and r is a RootResults object.

disp : bool, optional

If True, raise RuntimeError if the algorithm didn’t converge.

Returns x0 : ﬂoat

Zero of fbetween aand b.

r: RootResults (present if full_output = True)

Object containing information about the convergence. In particular, r.converged

is True if the routine converged.

See Also

brentq,brenth,bisect,newton

fixed_point

scalar ﬁxed-point ﬁnder

fsolve n-dimensional root-ﬁnding

scipy.optimize.newton(func,x0,fprime=None,args=(),tol=1.48e-08,maxiter=50,fprime2=None)

Find a zero using the Newton-Raphson or secant method.

Find a zero of the function func given a nearby starting point x0. The Newton-Raphson method is used if the

derivative fprime of func is provided, otherwise the secant method is used. If the second order derivate fprime2

of func is provided, parabolic Halley’s method is used.

Parameters func : function

5.20. Optimization and root ﬁnding (scipy.optimize) 569

SciPy Reference Guide, Release 0.13.0

The function whose zero is wanted. It must be a function of a single variable of the

form f(x,a,b,c...), where a,b,c... are extra arguments that can be passed in the args

parameter.

x0 : ﬂoat

An initial estimate of the zero that should be somewhere near the actual zero.

fprime : function, optional

The derivative of the function when available and convenient. If it is None (default),

then the secant method is used.

args : tuple, optional

Extra arguments to be used in the function call.

tol : ﬂoat, optional

The allowable error of the zero value.

maxiter : int, optional

Maximum number of iterations.

fprime2 : function, optional

The second order derivative of the function when available and convenient. If it is

None (default), then the normal Newton-Raphson or the secant method is used. If it

is given, parabolic Halley’s method is used.

Returns zero : ﬂoat

Estimated location where function is zero.

See Also

brentq,brenth,ridder,bisect

fsolve ﬁnd zeroes in n dimensions.

Notes

The convergence rate of the Newton-Raphson method is quadratic, the Halley method is cubic, and the secant

method is sub-quadratic. This means that if the function is well behaved the actual error in the estimated zero

is approximately the square (cube for Halley) of the requested tolerance up to roundoff error. However, the

stopping criterion used here is the step size and there is no guarantee that a zero has been found. Consequently

the result should be veriﬁed. Safer algorithms are brentq, brenth, ridder, and bisect, but they all require that the

root ﬁrst be bracketed in an interval where the function changes sign. The brentq algorithm is recommended for

general use in one dimensional problems when such an interval has been found.

Fixed point ﬁnding:

fixed_point(func, x0[, args, xtol, maxiter]) Find a ﬁxed point of the function.

scipy.optimize.fixed_point(func,x0,args=(),xtol=1e-08,maxiter=500)

Find a ﬁxed point of the function.

Given a function of one or more variables and a starting point, ﬁnd a ﬁxed-point of the function: i.e. where

func(x0) == x0.

Parameters func : function

Function to evaluate.

x0 : array_like

Fixed point of function.

args : tuple, optional

Extra arguments to func.

xtol : ﬂoat, optional

Convergence tolerance, defaults to 1e-08.

maxiter : int, optional

Maximum number of iterations, defaults to 500.

570 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Notes

Uses Steffensen’s Method using Aitken’s Del^2 convergence acceleration. See Burden, Faires, “Numerical

Analysis”, 5th edition, pg. 80

Examples

>>> from scipy import optimize

>>> def func(x, c1, c2):

.... return np.sqrt(c1/(x+c2))

>>> c1 =np.array([10,12.])

>>> c2 =np.array([3,5.])

>>> optimize.fixed_point(func, [1.2,1.3], args=(c1,c2))

array([ 1.4920333 , 1.37228132])

Multidimensional

General nonlinear solvers:

root(fun, x0[, args, method, jac, tol, ...]) Find a root of a vector function.

fsolve(func, x0[, args, fprime, ...]) Find the roots of a function.

broyden1(F, xin[, iter, alpha, ...]) Find a root of a function, using Broyden’s ﬁrst Jacobian approximation.

broyden2(F, xin[, iter, alpha, ...]) Find a root of a function, using Broyden’s second Jacobian approximation.

scipy.optimize.root(fun,x0,args=(),method=’hybr’,jac=None,tol=None,callback=None,op-

tions=None)

Find a root of a vector function. New in version 0.11.0.

Parameters fun : callable

A vector function to ﬁnd a root of.

x0 : ndarray

Initial guess.

args : tuple, optional

Extra arguments passed to the objective function and its Jacobian.

method : str, optional

Type of solver. Should be one of

•‘hybr’

•‘lm’

•‘broyden1’

•‘broyden2’

•‘anderson’

•‘linearmixing’

•‘diagbroyden’

•‘excitingmixing’

•‘krylov’

jac : bool or callable, optional

If jac is a Boolean and is True, fun is assumed to return the value of Jacobian along

with the objective function. If False, the Jacobian will be estimated numerically. jac

can also be a callable returning the Jacobian of fun. In this case, it must accept the

same arguments as fun.

tol : ﬂoat, optional

Tolerance for termination. For detailed control, use solver-speciﬁc options.

callback : function, optional

5.20. Optimization and root ﬁnding (scipy.optimize) 571

SciPy Reference Guide, Release 0.13.0

Optional callback function. It is called on every iteration as callback(x, f)

where xis the current solution and fthe corresponding residual. For all methods but

‘hybr’ and ‘lm’.

options : dict, optional

A dictionary of solver options. E.g. xtol or maxiter, see show_options(’root’,

method) for details.

Returns sol : Result

The solution represented as a Result object. Important attributes are: xthe solution

array, success a Boolean ﬂag indicating if the algorithm exited successfully and

message which describes the cause of the termination. See Result for a descrip-

tion of other attributes.

Notes

This section describes the available solvers that can be selected by the ‘method’ parameter. The default method

is hybr.

Method hybr uses a modiﬁcation of the Powell hybrid method as implemented in MINPACK [R102].

Method lm solves the system of nonlinear equations in a least squares sense using a modiﬁcation of the

Levenberg-Marquardt algorithm as implemented in MINPACK [R102].

Methods broyden1,broyden2,anderson,linearmixing,diagbroyden,excitingmixing,krylov are inexact Newton

methods, with backtracking or full line searches [R103]. Each method corresponds to a particular Jacobian

approximations. See nonlin for details.

•Method broyden1 uses Broyden’s ﬁrst Jacobian approximation, it is known as Broyden’s good method.

•Method broyden2 uses Broyden’s second Jacobian approximation, it is known as Broyden’s bad method.

•Method anderson uses (extended) Anderson mixing.

•Method Krylov uses Krylov approximation for inverse Jacobian. It is suitable for large-scale problem.

•Method diagbroyden uses diagonal Broyden Jacobian approximation.

•Method linearmixing uses a scalar Jacobian approximation.

•Method excitingmixing uses a tuned diagonal Jacobian approximation.

Warning: The algorithms implemented for methods diagbroyden,linearmixing and excitingmixing may be

useful for speciﬁc problems, but whether they will work may depend strongly on the problem.

References

[R102],[R103]

Examples

The following functions deﬁne a system of nonlinear equations and its jacobian.

>>> def fun(x):

... return [x[0]+0.5 *(x[0]-x[1])**3-1.0,

... 0.5 *(x[1]-x[0])**3+x[1]]

>>> def jac(x):

... return np.array([[1+1.5 *(x[0]-x[1])**2,

... -1.5 *(x[0]-x[1])**2],

... [-1.5 *(x[1]-x[0])**2,

... 1+1.5 *(x[1]-x[0])**2]])

572 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

A solution can be obtained as follows.

>>> from scipy import optimize

>>> sol =optimize.root(fun, [0,0], jac=jac, method=’hybr’)

>>> sol.x

array([ 0.8411639, 0.1588361])

scipy.optimize.fsolve(func,x0,args=(),fprime=None,full_output=0,col_deriv=0,xtol=1.49012e-

08,maxfev=0,band=None,epsfcn=None,factor=100,diag=None)

Find the roots of a function.

Return the roots of the (non-linear) equations deﬁned by func(x) = 0 given a starting estimate.

Parameters func : callable f(x, *args)

A function that takes at least one (possibly vector) argument.

x0 : ndarray

The starting estimate for the roots of func(x) = 0.

args : tuple, optional

Any extra arguments to func.

fprime : callable(x), optional

A function to compute the Jacobian of func with derivatives across the rows. By

default, the Jacobian will be estimated.

full_output : bool, optional

If True, return optional outputs.

col_deriv : bool, optional

Specify whether the Jacobian function computes derivatives down the columns (faster,

because there is no transpose operation).

xtol : ﬂoat

The calculation will terminate if the relative error between two consecutive iterates is

at most xtol.

maxfev : int, optional

The maximum number of calls to the function. If zero, then 100*(N+1) is the

maximum where N is the number of elements in x0.

band : tuple, optional

If set to a two-sequence containing the number of sub- and super-diagonals within

the band of the Jacobi matrix, the Jacobi matrix is considered banded (only for

fprime=None).

epsfcn : ﬂoat, optional

A suitable step length for the forward-difference approximation of the Jacobian (for

fprime=None). If epsfcn is less than the machine precision, it is assumed that the

relative errors in the functions are of the order of the machine precision.

factor : ﬂoat, optional

A parameter determining the initial step bound (factor *|| diag *x||).

Should be in the interval (0.1, 100).

diag : sequence, optional

N positive entries that serve as a scale factors for the variables.

Returns x: ndarray

The solution (or the result of the last iteration for an unsuccessful call).

infodict : dict

A dictionary of optional outputs with the keys:

nfev number of function calls

njev number of Jacobian calls

fvec function evaluated at the output

fjac the orthogonal matrix, q, produced by the QR factorization of the ﬁnal

approximate Jacobian matrix, stored column wise

5.20. Optimization and root ﬁnding (scipy.optimize) 573

SciPy Reference Guide, Release 0.13.0

rupper triangular matrix produced by QR factorization of the same ma-

trix

qtf the vector (transpose(q) *fvec)

ier : int

An integer ﬂag. Set to 1 if a solution was found, otherwise refer to mesg for more

information.

mesg : str

If no solution is found, mesg details the cause of failure.

See Also

root Interface to root ﬁnding algorithms for multivariate

functions.

Notes

fsolve is a wrapper around MINPACK’s hybrd and hybrj algorithms.

scipy.optimize.broyden1(F,xin,iter=None,alpha=None,reduction_method=’restart’,

max_rank=None,verbose=False,maxiter=None,f_tol=None,

f_rtol=None,x_tol=None,x_rtol=None,tol_norm=None,

line_search=’armijo’,callback=None,**kw)

Find a root of a function, using Broyden’s ﬁrst Jacobian approximation.

This method is also known as “Broyden’s good method”.

Parameters F: function(x) -> f

Function whose root to ﬁnd; should take and return an array-like object.

x0 : array_like

Initial guess for the solution

alpha : ﬂoat, optional

Initial guess for the Jacobian is (-1/alpha).

reduction_method : str or tuple, optional

Method used in ensuring that the rank of the Broyden matrix stays low. Can either be

a string giving the name of the method, or a tuple of the form (method, param1,

param2, ...) that gives the name of the method and values for additional param-

eters.

Methods available:

•restart: drop all matrix columns. Has no extra parameters.

•simple: drop oldest matrix column. Has no extra parameters.

•svd: keep only the most signiﬁcant SVD components. Takes an extra

parameter, to_retain‘, which determines the number of

SVD components to retain when rank reduction is done.

Default is ‘‘max_rank - 2.

max_rank : int, optional

Maximum rank for the Broyden matrix. Default is inﬁnity (ie., no rank reduction).

iter : int, optional

Number of iterations to make. If omitted (default), make as many as required to meet

tolerances.

verbose : bool, optional

Print status to stdout on every iteration.

maxiter : int, optional

Maximum number of iterations to make. If more are needed to meet convergence,

NoConvergence is raised.

f_tol : ﬂoat, optional

Absolute tolerance (in max-norm) for the residual. If omitted, default is 6e-6.

574 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

f_rtol : ﬂoat, optional

Relative tolerance for the residual. If omitted, not used.

x_tol : ﬂoat, optional

Absolute minimum step size, as determined from the Jacobian approximation. If the

step size is smaller than this, optimization is terminated as successful. If omitted, not

used.

x_rtol : ﬂoat, optional

Relative minimum step size. If omitted, not used.

tol_norm : function(vector) -> scalar, optional

Norm to use in convergence check. Default is the maximum norm.

line_search : {None, ‘armijo’ (default), ‘wolfe’}, optional

Which type of a line search to use to determine the step size in the direction given by

the Jacobian approximation. Defaults to ‘armijo’.

callback : function, optional

Optional callback function. It is called on every iteration as callback(x, f)

where xis the current solution and fthe corresponding residual.

Returns sol : ndarray

An array (of similar array type as x0) containing the ﬁnal solution.

Raises NoConvergence

When a solution was not found.

Notes

This algorithm implements the inverse Jacobian Quasi-Newton update

H+=H+ (dx −Hdf)dx†H/(dx†Hdf)

which corresponds to Broyden’s ﬁrst Jacobian update

J+=J+ (df −Jdx)dx†/dx†dx

References

[vR]

scipy.optimize.broyden2(F,xin,iter=None,alpha=None,reduction_method=’restart’,

max_rank=None,verbose=False,maxiter=None,f_tol=None,

f_rtol=None,x_tol=None,x_rtol=None,tol_norm=None,

line_search=’armijo’,callback=None,**kw)

Find a root of a function, using Broyden’s second Jacobian approximation.

This method is also known as “Broyden’s bad method”.

Parameters F: function(x) -> f

Function whose root to ﬁnd; should take and return an array-like object.

x0 : array_like

Initial guess for the solution

alpha : ﬂoat, optional

Initial guess for the Jacobian is (-1/alpha).

reduction_method : str or tuple, optional

Method used in ensuring that the rank of the Broyden matrix stays low. Can either be

a string giving the name of the method, or a tuple of the form (method, param1,

param2, ...) that gives the name of the method and values for additional param-

eters.

Methods available:

5.20. Optimization and root ﬁnding (scipy.optimize) 575

SciPy Reference Guide, Release 0.13.0

•restart: drop all matrix columns. Has no extra parameters.

•simple: drop oldest matrix column. Has no extra parameters.

•svd: keep only the most signiﬁcant SVD components. Takes an extra

parameter, to_retain‘, which determines the number of

SVD components to retain when rank reduction is done.

Default is ‘‘max_rank - 2.

max_rank : int, optional

Maximum rank for the Broyden matrix. Default is inﬁnity (ie., no rank reduction).

iter : int, optional

Number of iterations to make. If omitted (default), make as many as required to meet

tolerances.

verbose : bool, optional

Print status to stdout on every iteration.

maxiter : int, optional

Maximum number of iterations to make. If more are needed to meet convergence,

NoConvergence is raised.

f_tol : ﬂoat, optional

Absolute tolerance (in max-norm) for the residual. If omitted, default is 6e-6.

f_rtol : ﬂoat, optional

Relative tolerance for the residual. If omitted, not used.

x_tol : ﬂoat, optional

Absolute minimum step size, as determined from the Jacobian approximation. If the

step size is smaller than this, optimization is terminated as successful. If omitted, not

used.

x_rtol : ﬂoat, optional

Relative minimum step size. If omitted, not used.

tol_norm : function(vector) -> scalar, optional

Norm to use in convergence check. Default is the maximum norm.

line_search : {None, ‘armijo’ (default), ‘wolfe’}, optional

Which type of a line search to use to determine the step size in the direction given by

the Jacobian approximation. Defaults to ‘armijo’.

callback : function, optional

Optional callback function. It is called on every iteration as callback(x, f)

where xis the current solution and fthe corresponding residual.

Returns sol : ndarray

An array (of similar array type as x0) containing the ﬁnal solution.

Raises NoConvergence

When a solution was not found.

Notes

This algorithm implements the inverse Jacobian Quasi-Newton update

H+=H+ (dx −Hdf)df†/(df†df)

corresponding to Broyden’s second method.

References

[vR]

Large-scale nonlinear solvers:

newton_krylov(F, xin[, iter, rdiff, method, ...]) Find a root of a function, using Krylov approximation for inverse Jacobian.

Continued on next page

576 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Table 5.99 – continued from previous page

anderson(F, xin[, iter, alpha, w0, M, ...]) Find a root of a function, using (extended) Anderson mixing.

scipy.optimize.newton_krylov(F,xin,iter=None,rdiff=None,method=’lgmres’,in-

ner_maxiter=20,inner_M=None,outer_k=10,verbose=False,

maxiter=None,f_tol=None,f_rtol=None,x_tol=None,

x_rtol=None,tol_norm=None,line_search=’armijo’,call-

back=None,**kw)

Find a root of a function, using Krylov approximation for inverse Jacobian.

This method is suitable for solving large-scale problems.

Parameters F: function(x) -> f

Function whose root to ﬁnd; should take and return an array-like object.

x0 : array_like

Initial guess for the solution

rdiff : ﬂoat, optional

Relative step size to use in numerical differentiation.

method : {‘lgmres’, ‘gmres’, ‘bicgstab’, ‘cgs’, ‘minres’} or function

Krylov method to use to approximate the Jacobian. Can be a string, or a function im-

plementing the same interface as the iterative solvers in scipy.sparse.linalg.

The default is scipy.sparse.linalg.lgmres.

inner_M : LinearOperator or InverseJacobian

Preconditioner for the inner Krylov iteration. Note that you can use also inverse Jaco-

bians as (adaptive) preconditioners. For example,

>>> jac =BroydenFirst()

>>> kjac =KrylovJacobian(inner_M=jac.inverse).

If the preconditioner has a method named ‘update’, it will be called as update(x,

f) after each nonlinear step, with xgiving the current point, and fthe current function

value.

inner_tol, inner_maxiter, ...

Parameters to pass on to the “inner” Krylov solver. See

scipy.sparse.linalg.gmres for details.

outer_k : int, optional

Size of the subspace kept across LGMRES nonlinear iterations. See

scipy.sparse.linalg.lgmres for details.

iter : int, optional

Number of iterations to make. If omitted (default), make as many as required to meet

tolerances.

verbose : bool, optional

Print status to stdout on every iteration.

maxiter : int, optional

Maximum number of iterations to make. If more are needed to meet convergence,

NoConvergence is raised.

f_tol : ﬂoat, optional

Absolute tolerance (in max-norm) for the residual. If omitted, default is 6e-6.

f_rtol : ﬂoat, optional

Relative tolerance for the residual. If omitted, not used.

x_tol : ﬂoat, optional

Absolute minimum step size, as determined from the Jacobian approximation. If the

step size is smaller than this, optimization is terminated as successful. If omitted, not

used.

x_rtol : ﬂoat, optional

Relative minimum step size. If omitted, not used.

5.20. Optimization and root ﬁnding (scipy.optimize) 577

SciPy Reference Guide, Release 0.13.0

tol_norm : function(vector) -> scalar, optional

Norm to use in convergence check. Default is the maximum norm.

line_search : {None, ‘armijo’ (default), ‘wolfe’}, optional

Which type of a line search to use to determine the step size in the direction given by

the Jacobian approximation. Defaults to ‘armijo’.

callback : function, optional

Optional callback function. It is called on every iteration as callback(x, f)

where xis the current solution and fthe corresponding residual.

Returns sol : ndarray

An array (of similar array type as x0) containing the ﬁnal solution.

Raises NoConvergence

When a solution was not found.

See Also

scipy.sparse.linalg.gmres,scipy.sparse.linalg.lgmres

Notes

This function implements a Newton-Krylov solver. The basic idea is to compute the inverse of the Jacobian

with an iterative Krylov method. These methods require only evaluating the Jacobian-vector products, which

are conveniently approximated by numerical differentiation:

Jv ≈(f(x+ω∗v/|v|)−f(x))/ω

Due to the use of iterative matrix inverses, these methods can deal with large nonlinear problems.

Scipy’s scipy.sparse.linalg module offers a selection of Krylov solvers to choose from. The default

here is lgmres, which is a variant of restarted GMRES iteration that reuses some of the information obtained in

the previous Newton steps to invert Jacobians in subsequent steps.

For a review on Newton-Krylov methods, see for example [KK], and for the LGMRES sparse inverse method,

see [BJM].

References

[KK],[BJM]

scipy.optimize.anderson(F,xin,iter=None,alpha=None,w0=0.01,M=5,verbose=False,

maxiter=None,f_tol=None,f_rtol=None,x_tol=None,x_rtol=None,

tol_norm=None,line_search=’armijo’,callback=None,**kw)

Find a root of a function, using (extended) Anderson mixing.

The Jacobian is formed by for a ‘best’ solution in the space spanned by last Mvectors. As a result, only a MxM

matrix inversions and MxN multiplications are required. [Ey]

Parameters F: function(x) -> f

Function whose root to ﬁnd; should take and return an array-like object.

x0 : array_like

Initial guess for the solution

alpha : ﬂoat, optional

Initial guess for the Jacobian is (-1/alpha).

M: ﬂoat, optional

Number of previous vectors to retain. Defaults to 5.

w0 : ﬂoat, optional

Regularization parameter for numerical stability. Compared to unity, good values of

the order of 0.01.

iter : int, optional

578 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Number of iterations to make. If omitted (default), make as many as required to meet

tolerances.

verbose : bool, optional

Print status to stdout on every iteration.

maxiter : int, optional

Maximum number of iterations to make. If more are needed to meet convergence,

NoConvergence is raised.

f_tol : ﬂoat, optional

Absolute tolerance (in max-norm) for the residual. If omitted, default is 6e-6.

f_rtol : ﬂoat, optional

Relative tolerance for the residual. If omitted, not used.

x_tol : ﬂoat, optional

Absolute minimum step size, as determined from the Jacobian approximation. If the

step size is smaller than this, optimization is terminated as successful. If omitted, not

used.

x_rtol : ﬂoat, optional

Relative minimum step size. If omitted, not used.

tol_norm : function(vector) -> scalar, optional

Norm to use in convergence check. Default is the maximum norm.

line_search : {None, ‘armijo’ (default), ‘wolfe’}, optional

Which type of a line search to use to determine the step size in the direction given by

the Jacobian approximation. Defaults to ‘armijo’.

callback : function, optional

Optional callback function. It is called on every iteration as callback(x, f)

where xis the current solution and fthe corresponding residual.

Returns sol : ndarray

An array (of similar array type as x0) containing the ﬁnal solution.

Raises NoConvergence

When a solution was not found.

References

[Ey]

Simple iterations:

excitingmixing(F, xin[, iter, alpha, ...]) Find a root of a function, using a tuned diagonal Jacobian approximation.

linearmixing(F, xin[, iter, alpha, verbose, ...]) Find a root of a function, using a scalar Jacobian approximation.

diagbroyden(F, xin[, iter, alpha, verbose, ...]) Find a root of a function, using diagonal Broyden Jacobian approximation.

scipy.optimize.excitingmixing(F,xin,iter=None,alpha=None,alphamax=1.0,verbose=False,

maxiter=None,f_tol=None,f_rtol=None,x_tol=None,

x_rtol=None,tol_norm=None,line_search=’armijo’,call-

back=None,**kw)

Find a root of a function, using a tuned diagonal Jacobian approximation.

The Jacobian matrix is diagonal and is tuned on each iteration.

Warning: This algorithm may be useful for speciﬁc problems, but whether it will work may depend strongly

on the problem.

Parameters F: function(x) -> f

Function whose root to ﬁnd; should take and return an array-like object.

x0 : array_like

Initial guess for the solution

5.20. Optimization and root ﬁnding (scipy.optimize) 579

SciPy Reference Guide, Release 0.13.0

alpha : ﬂoat, optional

Initial Jacobian approximation is (-1/alpha).

alphamax : ﬂoat, optional

The entries of the diagonal Jacobian are kept in the range [alpha, alphamax].

iter : int, optional

Number of iterations to make. If omitted (default), make as many as required to meet

tolerances.

verbose : bool, optional

Print status to stdout on every iteration.

maxiter : int, optional

Maximum number of iterations to make. If more are needed to meet convergence,

NoConvergence is raised.

f_tol : ﬂoat, optional

Absolute tolerance (in max-norm) for the residual. If omitted, default is 6e-6.

f_rtol : ﬂoat, optional

Relative tolerance for the residual. If omitted, not used.

x_tol : ﬂoat, optional

Absolute minimum step size, as determined from the Jacobian approximation. If the

step size is smaller than this, optimization is terminated as successful. If omitted, not

used.

x_rtol : ﬂoat, optional

Relative minimum step size. If omitted, not used.

tol_norm : function(vector) -> scalar, optional

Norm to use in convergence check. Default is the maximum norm.

line_search : {None, ‘armijo’ (default), ‘wolfe’}, optional

Which type of a line search to use to determine the step size in the direction given by

the Jacobian approximation. Defaults to ‘armijo’.

callback : function, optional

Optional callback function. It is called on every iteration as callback(x, f)

where xis the current solution and fthe corresponding residual.

Returns sol : ndarray

An array (of similar array type as x0) containing the ﬁnal solution.

Raises NoConvergence

When a solution was not found.

scipy.optimize.linearmixing(F,xin,iter=None,alpha=None,verbose=False,max-

iter=None,f_tol=None,f_rtol=None,x_tol=None,x_rtol=None,

tol_norm=None,line_search=’armijo’,callback=None,**kw)

Find a root of a function, using a scalar Jacobian approximation.

Warning: This algorithm may be useful for speciﬁc problems, but whether it will work may depend strongly

on the problem.

Parameters F: function(x) -> f

Function whose root to ﬁnd; should take and return an array-like object.

x0 : array_like

Initial guess for the solution

alpha : ﬂoat, optional

The Jacobian approximation is (-1/alpha).

iter : int, optional

Number of iterations to make. If omitted (default), make as many as required to meet

tolerances.

verbose : bool, optional

Print status to stdout on every iteration.

maxiter : int, optional

580 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Maximum number of iterations to make. If more are needed to meet convergence,

NoConvergence is raised.

f_tol : ﬂoat, optional

Absolute tolerance (in max-norm) for the residual. If omitted, default is 6e-6.

f_rtol : ﬂoat, optional

Relative tolerance for the residual. If omitted, not used.

x_tol : ﬂoat, optional

Absolute minimum step size, as determined from the Jacobian approximation. If the

step size is smaller than this, optimization is terminated as successful. If omitted, not

used.

x_rtol : ﬂoat, optional

Relative minimum step size. If omitted, not used.

tol_norm : function(vector) -> scalar, optional

Norm to use in convergence check. Default is the maximum norm.

line_search : {None, ‘armijo’ (default), ‘wolfe’}, optional

Which type of a line search to use to determine the step size in the direction given by

the Jacobian approximation. Defaults to ‘armijo’.

callback : function, optional

Optional callback function. It is called on every iteration as callback(x, f)

where xis the current solution and fthe corresponding residual.

Returns sol : ndarray

An array (of similar array type as x0) containing the ﬁnal solution.

Raises NoConvergence

When a solution was not found.

scipy.optimize.diagbroyden(F,xin,iter=None,alpha=None,verbose=False,maxiter=None,

f_tol=None,f_rtol=None,x_tol=None,x_rtol=None,tol_norm=None,

line_search=’armijo’,callback=None,**kw)

Find a root of a function, using diagonal Broyden Jacobian approximation.

The Jacobian approximation is derived from previous iterations, by retaining only the diagonal of Broyden

matrices.

Warning: This algorithm may be useful for speciﬁc problems, but whether it will work may depend strongly

on the problem.

Parameters F: function(x) -> f

Function whose root to ﬁnd; should take and return an array-like object.

x0 : array_like

Initial guess for the solution

alpha : ﬂoat, optional

Initial guess for the Jacobian is (-1/alpha).

iter : int, optional

Number of iterations to make. If omitted (default), make as many as required to meet

tolerances.

verbose : bool, optional

Print status to stdout on every iteration.

maxiter : int, optional

Maximum number of iterations to make. If more are needed to meet convergence,

NoConvergence is raised.

f_tol : ﬂoat, optional

Absolute tolerance (in max-norm) for the residual. If omitted, default is 6e-6.

f_rtol : ﬂoat, optional

Relative tolerance for the residual. If omitted, not used.

x_tol : ﬂoat, optional

5.20. Optimization and root ﬁnding (scipy.optimize) 581

SciPy Reference Guide, Release 0.13.0

Absolute minimum step size, as determined from the Jacobian approximation. If the

step size is smaller than this, optimization is terminated as successful. If omitted, not

used.

x_rtol : ﬂoat, optional

Relative minimum step size. If omitted, not used.

tol_norm : function(vector) -> scalar, optional

Norm to use in convergence check. Default is the maximum norm.

line_search : {None, ‘armijo’ (default), ‘wolfe’}, optional

Which type of a line search to use to determine the step size in the direction given by

the Jacobian approximation. Defaults to ‘armijo’.

callback : function, optional

Optional callback function. It is called on every iteration as callback(x, f)

where xis the current solution and fthe corresponding residual.

Returns sol : ndarray

An array (of similar array type as x0) containing the ﬁnal solution.

Raises NoConvergence

When a solution was not found.

Additional information on the nonlinear solvers

5.20.4 Utility Functions

line_search(f, myfprime, xk, pk[, gfk, ...]) Find alpha that satisﬁes strong Wolfe conditions.

check_grad(func, grad, x0, *args) Check the correctness of a gradient function by comparing it against a (forward) ﬁnite-difference approximation of the gradient.

show_options(solver[, method]) Show documentation for additional options of optimization solvers.

scipy.optimize.line_search(f,myfprime,xk,pk,gfk=None,old_fval=None,old_old_fval=None,

args=(),c1=0.0001,c2=0.9,amax=50)

Find alpha that satisﬁes strong Wolfe conditions.

Parameters f: callable f(x,*args)

Objective function.

myfprime : callable f’(x,*args)

Objective function gradient.

xk : ndarray

Starting point.

pk : ndarray

Search direction.

gfk : ndarray, optional

Gradient value for x=xk (xk being the current parameter estimate). Will be recom-

puted if omitted.

old_fval : ﬂoat, optional

Function value for x=xk. Will be recomputed if omitted.

old_old_fval : ﬂoat, optional

Function value for the point preceding x=xk

args : tuple, optional

Additional arguments passed to objective function.

c1 : ﬂoat, optional

Parameter for Armijo condition rule.

c2 : ﬂoat, optional

Parameter for curvature condition rule.

Returns alpha0 : ﬂoat

Alpha for which x_new = x0 + alpha *pk.

582 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

fc : int

Number of function evaluations made.

gc : int

Number of gradient evaluations made.

Notes

Uses the line search algorithm to enforce strong Wolfe conditions. See Wright and Nocedal, ‘Numerical Opti-

mization’, 1999, pg. 59-60.

For the zoom phase it uses an algorithm by [...].

scipy.optimize.check_grad(func,grad,x0,*args)

Check the correctness of a gradient function by comparing it against a (forward) ﬁnite-difference approximation

of the gradient.

Parameters func : callable func(x0,*args)

Function whose derivative is to be checked.

grad : callable grad(x0, *args)

Gradient of func.

x0 : ndarray

Points to check grad against forward difference approximation of grad using func.

args : *args, optional

Extra arguments passed to func and grad.

Returns err : ﬂoat

The square root of the sum of squares (i.e. the 2-norm) of the difference between

grad(x0, *args) and the ﬁnite difference approximation of grad using func at

the points x0.

See Also

approx_fprime

Notes

The step size used for the ﬁnite difference approximation is sqrt(numpy.ﬁnfo(ﬂoat).eps), which is approximately

1.49e-08.

Examples

>>> def func(x): return x[0]**2-0.5 *x[1]**3

>>> def grad(x): return [2*x[0], -1.5 *x[1]**2]

>>> check_grad(func, grad, [1.5,-1.5])

2.9802322387695312e-08

scipy.optimize.show_options(solver,method=None)

Show documentation for additional options of optimization solvers.

These are method-speciﬁc options that can be supplied through the options dict.

Parameters solver : str

Type of optimization solver. One of {minimize,root}.

method : str, optional

If not given, shows all methods of the speciﬁed solver. Otherwise, show only the

options for the speciﬁed method. Valid values corresponds to methods’ names of

respective solver (e.g. ‘BFGS’ for ‘minimize’).

5.20. Optimization and root ﬁnding (scipy.optimize) 583

SciPy Reference Guide, Release 0.13.0

Notes

** minimize options

•BFGS options:

gtol [ﬂoat] Gradient norm must be less than gtol before successful termination.

norm [ﬂoat] Order of norm (Inf is max, -Inf is min).

eps [ﬂoat or ndarray] If jac is approximated, use this value for the step size.

•Nelder-Mead options:

xtol [ﬂoat] Relative error in solution xopt acceptable for convergence.

ftol [ﬂoat] Relative error in fun(xopt) acceptable for convergence.

maxfev [int] Maximum number of function evaluations to make.

•Newton-CG options:

xtol [ﬂoat] Average relative error in solution xopt acceptable for convergence.

eps [ﬂoat or ndarray] If jac is approximated, use this value for the step size.

•CG options:

gtol [ﬂoat] Gradient norm must be less than gtol before successful termination.

norm [ﬂoat] Order of norm (Inf is max, -Inf is min).

eps [ﬂoat or ndarray] If jac is approximated, use this value for the step size.

•Powell options:

xtol [ﬂoat] Relative error in solution xopt acceptable for convergence.

ftol [ﬂoat] Relative error in fun(xopt) acceptable for convergence.

maxfev [int] Maximum number of function evaluations to make.

direc [ndarray] Initial set of direction vectors for the Powell method.

•Anneal options:

ftol [ﬂoat] Relative error in fun(x) acceptable for convergence.

schedule [str] Annealing schedule to use. One of: ‘fast’, ‘cauchy’ or ‘boltzmann’.

T0 [ﬂoat] Initial Temperature (estimated as 1.2 times the largest cost-function

deviation over random points in the range).

Tf [ﬂoat] Final goal temperature.

maxfev [int] Maximum number of function evaluations to make.

maxaccept [int] Maximum changes to accept.

boltzmann [ﬂoat] Boltzmann constant in acceptance test (increase for less stringent test

at each temperature).

learn_rate [ﬂoat] Scale constant for adjusting guesses.

584 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

quench, m, n [ﬂoat] Parameters to alter fast_sa schedule.

lower, upper [ﬂoat or ndarray] Lower and upper bounds on x.

dwell [int] The number of times to search the space at each temperature.

•L-BFGS-B options:

ftol [ﬂoat] The iteration stops when (f^k -

f^{k+1})/max{|f^k|,|f^{k+1}|,1} <= ftol.

gtol [ﬂoat] The iteration will stop when max{|proj g_i | i = 1,

..., n} <= gtol where pg_i is the i-th component of the projected

gradient.

maxcor [int] The maximum number of variable metric corrections used to deﬁne

the limited memory matrix. (The limited memory BFGS method does not

store the full hessian but uses this many terms in an approximation to it.)

maxiter [int] Maximum number of function evaluations.

•TNC options:

ftol [ﬂoat] Precision goal for the value of f in the stoping criterion. If ftol < 0.0,

ftol is set to 0.0 defaults to -1.

xtol [ﬂoat] Precision goal for the value of x in the stopping criterion (after apply-

ing x scaling factors). If xtol < 0.0, xtol is set to sqrt(machine_precision).

Defaults to -1.

gtol [ﬂoat] Precision goal for the value of the projected gradient in the stopping

criterion (after applying x scaling factors). If gtol < 0.0, gtol is set to 1e-2 *

sqrt(accuracy). Setting it to 0.0 is not recommended. Defaults to -1.

scale [list of ﬂoats] Scaling factors to apply to each variable. If None, the factors

are up-low for interval bounded variables and 1+|x] fo the others. Defaults

to None

offset [ﬂoat] Value to substract from each variable. If None, the offsets are

(up+low)/2 for interval bounded variables and x for the others.

maxCGit [int] Maximum number of hessian*vector evaluations per main iteration. If

maxCGit == 0, the direction chosen is -gradient if maxCGit < 0, maxCGit

is set to max(1,min(50,n/2)). Defaults to -1.

maxiter [int] Maximum number of function evaluation. if None, maxiter is set to

max(100, 10*len(x0)). Defaults to None.

eta [ﬂoat] Severity of the line search. if < 0 or > 1, set to 0.25. Defaults to -1.

stepmx [ﬂoat] Maximum step for the line search. May be increased during call. If

too small, it will be set to 10.0. Defaults to 0.

accuracy [ﬂoat] Relative precision for ﬁnite difference calculations. If <= ma-

chine_precision, set to sqrt(machine_precision). Defaults to 0.

minfev [ﬂoat] Minimum function value estimate. Defaults to 0.

rescale [ﬂoat] Scaling factor (in log10) used to trigger f value rescaling. If 0, rescale

at each iteration. If a large value, never rescale. If < 0, rescale is set to 1.3.

•COBYLA options:

5.20. Optimization and root ﬁnding (scipy.optimize) 585

SciPy Reference Guide, Release 0.13.0

tol [ﬂoat] Final accuracy in the optimization (not precisely guaranteed). This

is a lower bound on the size of the trust region.

rhobeg [ﬂoat] Reasonable initial changes to the variables.

maxfev [int] Maximum number of function evaluations.

•SLSQP options:

ftol [ﬂoat] Precision goal for the value of f in the stopping criterion.

eps [ﬂoat] Step size used for numerical approximation of the jacobian.

maxiter [int] Maximum number of iterations.

•dogleg options:

initial_trust_radius

[ﬂoat] Initial trust-region radius.

max_trust_radius

[ﬂoat] Maximum value of the trust-region radius. No steps that are longer

than this value will be proposed.

eta [ﬂoat] Trust region related acceptance stringency for proposed steps.

gtol [ﬂoat] Gradient norm must be less than gtol before successful termination.

•trust-ncg options:

see dogleg options.

** root options

•hybrd options:

col_deriv [bool] Specify whether the Jacobian function computes derivatives down

the columns (faster, because there is no transpose operation).

xtol [ﬂoat] The calculation will terminate if the relative error between two con-

secutive iterates is at most xtol.

maxfev [int] The maximum number of calls to the function. If zero, then

100*(N+1) is the maximum where N is the number of elements in x0.

band [sequence] If set to a two-sequence containing the number of sub- and

super-diagonals within the band of the Jacobi matrix, the Jacobi matrix is

considered banded (only for fprime=None).

epsfcn [ﬂoat] A suitable step length for the forward-difference approximation of

the Jacobian (for fprime=None). If epsfcn is less than the machine pre-

cision, it is assumed that the relative errors in the functions are of the order

of the machine precision.

factor [ﬂoat] A parameter determining the initial step bound (factor *||

diag *x||). Should be in the interval (0.1, 100).

diag [sequence] N positive entries that serve as a scale factors for the variables.

•LM options:

col_deriv [bool] non-zero to specify that the Jacobian function computes derivatives

down the columns (faster, because there is no transpose operation).

ftol [ﬂoat] Relative error desired in the sum of squares.

586 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

xtol [ﬂoat] Relative error desired in the approximate solution.

gtol [ﬂoat] Orthogonality desired between the function vector and the columns

of the Jacobian.

maxiter [int] The maximum number of calls to the function. If zero, then 100*(N+1)

is the maximum where N is the number of elements in x0.

epsfcn [ﬂoat] A suitable step length for the forward-difference approximation of

the Jacobian (for Dfun=None). If epsfcn is less than the machine precision,

it is assumed that the relative errors in the functions are of the order of the

machine precision.

factor [ﬂoat] A parameter determining the initial step bound (factor *||

diag *x||). Should be in interval (0.1, 100).

diag [sequence] N positive entries that serve as a scale factors for the variables.

•Broyden1 options:

nit [int, optional] Number of iterations to make. If omitted (default), make as

many as required to meet tolerances.

disp [bool, optional] Print status to stdout on every iteration.

maxiter [int, optional] Maximum number of iterations to make. If more are needed

to meet convergence, NoConvergence is raised.

ftol [ﬂoat, optional] Relative tolerance for the residual. If omitted, not used.

fatol [ﬂoat, optional] Absolute tolerance (in max-norm) for the residual. If omit-

ted, default is 6e-6.

xtol [ﬂoat, optional] Relative minimum step size. If omitted, not used.

xatol [ﬂoat, optional] Absolute minimum step size, as determined from the Ja-

cobian approximation. If the step size is smaller than this, optimization is

terminated as successful. If omitted, not used.

tol_norm [function(vector) -> scalar, optional] Norm to use in convergence check.

Default is the maximum norm.

line_search [{None, ‘armijo’ (default), ‘wolfe’}, optional] Which type of a line search

to use to determine the step size in the direction given by the Jacobian ap-

proximation. Defaults to ‘armijo’.

jac_options [dict, optional]

Options for the respective Jacobian approximation.

alpha [ﬂoat, optional] Initial guess for the Jacobian

is (-1/alpha).

reduction_method

[str or tuple, optional] Method used in en-

suring that the rank of the Broyden ma-

trix stays low. Can either be a string giv-

ing the name of the method, or a tuple of

the form (method, param1, param2,

...) that gives the name of the method and

values for additional parameters.

5.20. Optimization and root ﬁnding (scipy.optimize) 587

SciPy Reference Guide, Release 0.13.0

Methods available:

–restart: drop all matrix columns. Has no

extra pa-

rameters.

–simple: drop oldest matrix column. Has no

extra pa-

rameters.

–svd: keep only the most signiﬁcant SVD

components.

Extra parameters:

*‘‘to_retain‘: number of SVD components to

retain

when

rank

re-

duc-

tion

done.

De-

fault

max_rank

max_rank [int, optional] Maximum rank for the Broy-

den matrix. Default is inﬁnity (ie., no rank

reduction).

•Broyden2 options:

nit [int, optional] Number of iterations to make. If omitted (default), make as

many as required to meet tolerances.

disp [bool, optional] Print status to stdout on every iteration.

maxiter [int, optional] Maximum number of iterations to make. If more are needed

to meet convergence, NoConvergence is raised.

ftol [ﬂoat, optional] Relative tolerance for the residual. If omitted, not used.

fatol [ﬂoat, optional] Absolute tolerance (in max-norm) for the residual. If omit-

ted, default is 6e-6.

xtol [ﬂoat, optional] Relative minimum step size. If omitted, not used.

xatol [ﬂoat, optional] Absolute minimum step size, as determined from the Ja-

cobian approximation. If the step size is smaller than this, optimization is

588 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

terminated as successful. If omitted, not used.

tol_norm [function(vector) -> scalar, optional] Norm to use in convergence check.

Default is the maximum norm.

line_search [{None, ‘armijo’ (default), ‘wolfe’}, optional] Which type of a line search

to use to determine the step size in the direction given by the Jacobian ap-

proximation. Defaults to ‘armijo’.

jac_options [dict, optional]

Options for the respective Jacobian approximation.

alpha [ﬂoat, optional] Initial guess for the Jacobian

is (-1/alpha).

reduction_method

[str or tuple, optional] Method used in en-

suring that the rank of the Broyden ma-

trix stays low. Can either be a string giv-

ing the name of the method, or a tuple of

the form (method, param1, param2,

...) that gives the name of the method and

values for additional parameters.

Methods available:

–restart: drop all matrix columns. Has no

extra pa-

rameters.

–simple: drop oldest matrix column. Has no

extra pa-

rameters.

–svd: keep only the most signiﬁcant SVD

components.

Extra parameters:

*‘‘to_retain‘: number of SVD components to

retain

when

rank

re-

duc-

tion

done.

De-

fault

max_rank

5.20. Optimization and root ﬁnding (scipy.optimize) 589

SciPy Reference Guide, Release 0.13.0

max_rank [int, optional] Maximum rank for the Broy-

den matrix. Default is inﬁnity (ie., no rank

reduction).

•Anderson options:

nit [int, optional] Number of iterations to make. If omitted (default), make as

many as required to meet tolerances.

disp [bool, optional] Print status to stdout on every iteration.

maxiter [int, optional] Maximum number of iterations to make. If more are needed

to meet convergence, NoConvergence is raised.

ftol [ﬂoat, optional] Relative tolerance for the residual. If omitted, not used.

fatol [ﬂoat, optional] Absolute tolerance (in max-norm) for the residual. If omit-

ted, default is 6e-6.

xtol [ﬂoat, optional] Relative minimum step size. If omitted, not used.

xatol [ﬂoat, optional] Absolute minimum step size, as determined from the Ja-

cobian approximation. If the step size is smaller than this, optimization is

terminated as successful. If omitted, not used.

tol_norm [function(vector) -> scalar, optional] Norm to use in convergence check.

Default is the maximum norm.

line_search [{None, ‘armijo’ (default), ‘wolfe’}, optional] Which type of a line search

to use to determine the step size in the direction given by the Jacobian ap-

proximation. Defaults to ‘armijo’.

jac_options [dict, optional]

Options for the respective Jacobian approximation.

alpha [ﬂoat, optional] Initial guess for the Jacobian

is (-1/alpha).

M[ﬂoat, optional] Number of previous vectors

to retain. Defaults to 5.

w0 [ﬂoat, optional] Regularization parameter for

numerical stability. Compared to unity, good

values of the order of 0.01.

•LinearMixing options:

nit [int, optional] Number of iterations to make. If omitted (default), make as

many as required to meet tolerances.

disp [bool, optional] Print status to stdout on every iteration.

maxiter [int, optional] Maximum number of iterations to make. If more are needed

to meet convergence, NoConvergence is raised.

ftol [ﬂoat, optional] Relative tolerance for the residual. If omitted, not used.

590 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

fatol [ﬂoat, optional] Absolute tolerance (in max-norm) for the residual. If omit-

ted, default is 6e-6.

xtol [ﬂoat, optional] Relative minimum step size. If omitted, not used.

xatol [ﬂoat, optional] Absolute minimum step size, as determined from the Ja-

cobian approximation. If the step size is smaller than this, optimization is

terminated as successful. If omitted, not used.

tol_norm [function(vector) -> scalar, optional] Norm to use in convergence check.

Default is the maximum norm.

line_search [{None, ‘armijo’ (default), ‘wolfe’}, optional] Which type of a line search

to use to determine the step size in the direction given by the Jacobian ap-

proximation. Defaults to ‘armijo’.

jac_options [dict, optional]

Options for the respective Jacobian approximation.

alpha [ﬂoat, optional] initial guess for the jacobian

is (-1/alpha).

•DiagBroyden options:

nit [int, optional] Number of iterations to make. If omitted (default), make as

many as required to meet tolerances.

disp [bool, optional] Print status to stdout on every iteration.

maxiter [int, optional] Maximum number of iterations to make. If more are needed

to meet convergence, NoConvergence is raised.

ftol [ﬂoat, optional] Relative tolerance for the residual. If omitted, not used.

fatol [ﬂoat, optional] Absolute tolerance (in max-norm) for the residual. If omit-

ted, default is 6e-6.

xtol [ﬂoat, optional] Relative minimum step size. If omitted, not used.

xatol [ﬂoat, optional] Absolute minimum step size, as determined from the Ja-

cobian approximation. If the step size is smaller than this, optimization is

terminated as successful. If omitted, not used.

tol_norm [function(vector) -> scalar, optional] Norm to use in convergence check.

Default is the maximum norm.

line_search [{None, ‘armijo’ (default), ‘wolfe’}, optional] Which type of a line search

to use to determine the step size in the direction given by the Jacobian ap-

proximation. Defaults to ‘armijo’.

jac_options [dict, optional]

Options for the respective Jacobian approximation.

alpha [ﬂoat, optional] initial guess for the jacobian

is (-1/alpha).

•ExcitingMixing options:

5.20. Optimization and root ﬁnding (scipy.optimize) 591

SciPy Reference Guide, Release 0.13.0

nit [int, optional] Number of iterations to make. If omitted (default), make as

many as required to meet tolerances.

disp [bool, optional] Print status to stdout on every iteration.

maxiter [int, optional] Maximum number of iterations to make. If more are needed

to meet convergence, NoConvergence is raised.

ftol [ﬂoat, optional] Relative tolerance for the residual. If omitted, not used.

fatol [ﬂoat, optional] Absolute tolerance (in max-norm) for the residual. If omit-

ted, default is 6e-6.

xtol [ﬂoat, optional] Relative minimum step size. If omitted, not used.

xatol [ﬂoat, optional] Absolute minimum step size, as determined from the Ja-

cobian approximation. If the step size is smaller than this, optimization is

terminated as successful. If omitted, not used.

tol_norm [function(vector) -> scalar, optional] Norm to use in convergence check.

Default is the maximum norm.

line_search [{None, ‘armijo’ (default), ‘wolfe’}, optional] Which type of a line search

to use to determine the step size in the direction given by the Jacobian ap-

proximation. Defaults to ‘armijo’.

jac_options [dict, optional]

Options for the respective Jacobian approximation.

alpha [ﬂoat, optional] Initial Jacobian approxima-

tion is (-1/alpha).

alphamax [ﬂoat, optional] The entries of the diago-

nal Jacobian are kept in the range [alpha,

alphamax].

•Krylov options:

nit [int, optional] Number of iterations to make. If omitted (default), make as

many as required to meet tolerances.

disp [bool, optional] Print status to stdout on every iteration.

maxiter [int, optional] Maximum number of iterations to make. If more are needed

to meet convergence, NoConvergence is raised.

ftol [ﬂoat, optional] Relative tolerance for the residual. If omitted, not used.

fatol [ﬂoat, optional] Absolute tolerance (in max-norm) for the residual. If omit-

ted, default is 6e-6.

xtol [ﬂoat, optional] Relative minimum step size. If omitted, not used.

xatol [ﬂoat, optional] Absolute minimum step size, as determined from the Ja-

cobian approximation. If the step size is smaller than this, optimization is

terminated as successful. If omitted, not used.

tol_norm [function(vector) -> scalar, optional] Norm to use in convergence check.

Default is the maximum norm.

592 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

line_search [{None, ‘armijo’ (default), ‘wolfe’}, optional] Which type of a line search

to use to determine the step size in the direction given by the Jacobian ap-

proximation. Defaults to ‘armijo’.

jac_options [dict, optional]

Options for the respective Jacobian approximation.

rdiff [ﬂoat, optional] Relative step size to use in

numerical differentiation.

method [{‘lgmres’, ‘gmres’, ‘bicgstab’, ‘cgs’, ‘min-

res’} or] function Krylov method to use

to approximate the Jacobian. Can be

a string, or a function implementing the

same interface as the iterative solvers in

scipy.sparse.linalg.

The default is

scipy.sparse.linalg.lgmres.

inner_M [LinearOperator or InverseJacobian] Precon-

ditioner for the inner Krylov iteration. Note

that you can use also inverse Jacobians as

(adaptive) preconditioners. For example,

>>> jac =BroydenFirst()

>>> kjac =KrylovJacobian(inner_M=jac.inverse).

If the preconditioner has a method named

‘update’, it will be called as update(x,

f) after each nonlinear step, with xgiving

the current point, and fthe current function

value.

inner_tol, inner_maxiter, ...

Parameters to pass on to the

“inner” Krylov solver. See

scipy.sparse.linalg.gmres for

details.

outer_k [int, optional] Size of the subspace kept

across LGMRES nonlinear iterations.

See scipy.sparse.linalg.lgmres

for details.

5.21 Nonlinear solvers

This is a collection of general-purpose nonlinear multidimensional solvers. These solvers ﬁnd xfor which F(x) = 0.

Both xand Fcan be multidimensional.

5.21.1 Routines

Large-scale nonlinear solvers:

5.21. Nonlinear solvers 593

SciPy Reference Guide, Release 0.13.0

newton_krylov(F, xin[, iter, rdiff, method, ...]) Find a root of a function, using Krylov approximation for inverse Jacobian.

anderson(F, xin[, iter, alpha, w0, M, ...]) Find a root of a function, using (extended) Anderson mixing.

General nonlinear solvers:

broyden1(F, xin[, iter, alpha, ...]) Find a root of a function, using Broyden’s ﬁrst Jacobian approximation.

broyden2(F, xin[, iter, alpha, ...]) Find a root of a function, using Broyden’s second Jacobian approximation.

Simple iterations:

excitingmixing(F, xin[, iter, alpha, ...]) Find a root of a function, using a tuned diagonal Jacobian approximation.

linearmixing(F, xin[, iter, alpha, verbose, ...]) Find a root of a function, using a scalar Jacobian approximation.

diagbroyden(F, xin[, iter, alpha, verbose, ...]) Find a root of a function, using diagonal Broyden Jacobian approximation.

5.21.2 Examples

Small problem

>>> def F(x):

... return np.cos(x) +x[::-1]-[1,2,3,4]

>>> import scipy.optimize

>>> x=scipy.optimize.broyden1(F, [1,1,1,1], f_tol=1e-14)

>>> x

array([ 4.04674914, 3.91158389, 2.71791677, 1.61756251])

>>> np.cos(x) +x[::-1]

array([ 1., 2., 3., 4.])

Large problem

Suppose that we needed to solve the following integrodifferential equation on the square [0,1] ×[0,1]:

∇2P= 10 Z1

0Z1

cosh(P)dx dy2

with P(x, 1) = 1 and P= 0 elsewhere on the boundary of the square.

The solution can be found using the newton_krylov solver:

import numpy as np

from scipy.optimize import newton_krylov

from numpy import cosh, zeros_like, mgrid, zeros

# parameters

nx, ny =75,75

hx, hy =1./(nx-1), 1./(ny-1)

P_left, P_right =0,0

P_top, P_bottom =1,0

def residual(P):

d2x =zeros_like(P)

594 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

d2y =zeros_like(P)

d2x[1:-1]=(P[2:] -2*P[1:-1]+P[:-2]) /hx/hx

d2x[0]=(P[1]-2*P[0]+P_left)/hx/hx

d2x[-1]=(P_right -2*P[-1]+P[-2])/hx/hx

d2y[:,1:-1]=(P[:,2:] -2*P[:,1:-1]+P[:,:-2])/hy/hy

d2y[:,0]=(P[:,1]-2*P[:,0]+P_bottom)/hy/hy

d2y[:,-1]=(P_top -2*P[:,-1]+P[:,-2])/hy/hy

return d2x +d2y -10*cosh(P).mean()**2

# solve

guess =zeros((nx, ny), float)

sol =newton_krylov(residual, guess, method=’lgmres’, verbose=1)

print(’Residual: %g’%abs(residual(sol)).max())

# visualize

import matplotlib.pyplot as plt

x,y=mgrid[0:1:(nx*1j), 0:1:(ny*1j)]

plt.pcolor(x, y, sol)

plt.colorbar()

plt.show()

5.22 Signal processing (scipy.signal)

5.22.1 Convolution

convolve(in1, in2[, mode]) Convolve two N-dimensional arrays.

correlate(in1, in2[, mode]) Cross-correlate two N-dimensional arrays.

fftconvolve(in1, in2[, mode]) Convolve two N-dimensional arrays using FFT.

convolve2d(in1, in2[, mode, boundary, ﬁllvalue]) Convolve two 2-dimensional arrays.

correlate2d(in1, in2[, mode, boundary, ...]) Cross-correlate two 2-dimensional arrays.

Continued on next page

5.22. Signal processing (scipy.signal) 595

SciPy Reference Guide, Release 0.13.0

Table 5.105 – continued from previous page

sepfir2d((input, hrow, hcol) -> output) Description:

scipy.signal.convolve(in1,in2,mode=’full’)

Convolve two N-dimensional arrays.

Convolve in1 and in2, with the output size determined by the mode argument.

Parameters in1 : array_like

First input.

in2 : array_like

Second input. Should have the same number of dimensions as in1; if sizes

of in1 and in2 are not equal then in1 has to be the larger array.

mode : str {‘full’, ‘valid’, ‘same’}, optional

A string indicating the size of the output:

full The output is the full discrete linear convolution of the in-

puts. (Default)

valid The output consists only of those elements that do not rely

on the zero-padding.

same The output is the same size as in1, centered with respect to

the ‘full’ output.

Returns convolve : array

An N-dimensional array containing a subset of the discrete linear convolu-

tion of in1 with in2.

scipy.signal.correlate(in1,in2,mode=’full’)

Cross-correlate two N-dimensional arrays.

Cross-correlate in1 and in2, with the output size determined by the mode argument.

Parameters in1 : array_like

First input.

in2 : array_like

Second input. Should have the same number of dimensions as in1; if sizes

of in1 and in2 are not equal then in1 has to be the larger array.

mode : str {‘full’, ‘valid’, ‘same’}, optional

A string indicating the size of the output:

full The output is the full discrete linear cross-correlation of the

inputs. (Default)

valid The output consists only of those elements that do not rely

on the zero-padding.

same The output is the same size as in1, centered with respect to

the ‘full’ output.

Returns correlate : array

An N-dimensional array containing a subset of the discrete linear cross-

correlation of in1 with in2.

Notes

The correlation z of two arrays x and y of rank d is deﬁned as:

z[...,k,...] = sum[..., i_l, ...]

x[..., i_l,...] * conj(y[..., i_l + k,...])

scipy.signal.fftconvolve(in1,in2,mode=’full’)

Convolve two N-dimensional arrays using FFT.

Convolve in1 and in2 using the fast Fourier transform method, with the output size determined by the mode

argument.

This is generally much faster than convolve for large arrays (n > ~500), but can be slower when only a few

output values are needed, and can only output ﬂoat arrays (int or object array inputs will be cast to ﬂoat).

596 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Parameters in1 : array_like

First input.

in2 : array_like

Second input. Should have the same number of dimensions as in1; if sizes

of in1 and in2 are not equal then in1 has to be the larger array.

mode : str {‘full’, ‘valid’, ‘same’}, optional

A string indicating the size of the output:

full The output is the full discrete linear convolution of the in-

puts. (Default)

valid The output consists only of those elements that do not rely

on the zero-padding.

same The output is the same size as in1, centered with respect to

the ‘full’ output.

Returns out : array

An N-dimensional array containing a subset of the discrete linear convolu-

tion of in1 with in2.

scipy.signal.convolve2d(in1,in2,mode=’full’,boundary=’ﬁll’,ﬁllvalue=0)

Convolve two 2-dimensional arrays.

Convolve in1 and in2 with output size determined by mode, and boundary conditions determined by boundary

and ﬁllvalue.

Parameters in1, in2 : array_like

Two-dimensional input arrays to be convolved.

mode : str {‘full’, ‘valid’, ‘same’}, optional

A string indicating the size of the output:

full The output is the full discrete linear convolution of the in-

puts. (Default)

valid The output consists only of those elements that do not rely

on the zero-padding.

same The output is the same size as in1, centered with respect to

the ‘full’ output.

boundary : str {‘ﬁll’, ‘wrap’, ‘symm’}, optional

A ﬂag indicating how to handle boundaries:

fill pad input arrays with ﬁllvalue. (default)

wrap circular boundary conditions.

symm symmetrical boundary conditions.

ﬁllvalue : scalar, optional

Value to ﬁll pad input arrays with. Default is 0.

Returns out : ndarray

A 2-dimensional array containing a subset of the discrete linear convolution

of in1 with in2.

scipy.signal.correlate2d(in1,in2,mode=’full’,boundary=’ﬁll’,ﬁllvalue=0)

Cross-correlate two 2-dimensional arrays.

Cross correlate in1 and in2 with output size determined by mode, and boundary conditions determined by

boundary and ﬁllvalue.

Parameters in1, in2 : array_like

Two-dimensional input arrays to be convolved.

mode : str {‘full’, ‘valid’, ‘same’}, optional

A string indicating the size of the output:

full The output is the full discrete linear cross-correlation of the

inputs. (Default)

valid The output consists only of those elements that do not rely

on the zero-padding.

same The output is the same size as in1, centered with respect to

the ‘full’ output.

boundary : str {‘ﬁll’, ‘wrap’, ‘symm’}, optional

A ﬂag indicating how to handle boundaries:

5.22. Signal processing (scipy.signal) 597

SciPy Reference Guide, Release 0.13.0

fill pad input arrays with ﬁllvalue. (default)

wrap circular boundary conditions.

symm symmetrical boundary conditions.

ﬁllvalue : scalar, optional

Value to ﬁll pad input arrays with. Default is 0.

Returns correlate2d : ndarray

A 2-dimensional array containing a subset of the discrete linear cross-

correlation of in1 with in2.

scipy.signal.sepfir2d(input,hrow,hcol)→output

Description:

Convolve the rank-2 input array with the separable ﬁlter deﬁned by the rank-1 arrays hrow, and hcol.

Mirror symmetric boundary conditions are assumed. This function can be used to ﬁnd an image given its

B-spline representation.

5.22.2 B-splines

bspline(x, n) B-spline basis function of order n.

cubic(x) A cubic B-spline.

quadratic(x) A quadratic B-spline.

gauss_spline(x, n) Gaussian approximation to B-spline basis function of order n.

cspline1d(signal[, lamb]) Compute cubic spline coefﬁcients for rank-1 array.

qspline1d(signal[, lamb]) Compute quadratic spline coefﬁcients for rank-1 array.

cspline2d((input {, lambda, precision}) -> ck) Description:

qspline2d((input {, lambda, precision}) -> qk) Description:

cspline1d_eval(cj, newx[, dx, x0]) Evaluate a spline at the new set of points.

spline_filter(Iin[, lmbda]) Smoothing spline (cubic) ﬁltering of a rank-2 array.

scipy.signal.bspline(x,n)

B-spline basis function of order n.

Notes

Uses numpy.piecewise and automatic function-generator.

scipy.signal.cubic(x)

A cubic B-spline.

This is a special case of bspline, and equivalent to bspline(x, 3).

scipy.signal.quadratic(x)

A quadratic B-spline.

This is a special case of bspline, and equivalent to bspline(x, 2).

scipy.signal.gauss_spline(x,n)

Gaussian approximation to B-spline basis function of order n.

scipy.signal.cspline1d(signal,lamb=0.0)

Compute cubic spline coefﬁcients for rank-1 array.

Find the cubic spline coefﬁcients for a 1-D signal assuming mirror-symmetric boundary conditions. To obtain

the signal back from the spline representation mirror-symmetric-convolve these coefﬁcients with a length 3 FIR

window [1.0, 4.0, 1.0]/ 6.0 .

Parameters signal : ndarray

598 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

A rank-1 array representing samples of a signal.

lamb : ﬂoat, optional

Smoothing coefﬁcient, default is 0.0.

Returns c: ndarray

Cubic spline coefﬁcients.

scipy.signal.qspline1d(signal,lamb=0.0)

Compute quadratic spline coefﬁcients for rank-1 array.

Find the quadratic spline coefﬁcients for a 1-D signal assuming mirror-symmetric boundary conditions. To

obtain the signal back from the spline representation mirror-symmetric-convolve these coefﬁcients with a length

3 FIR window [1.0, 6.0, 1.0]/ 8.0 .

Parameters signal : ndarray

A rank-1 array representing samples of a signal.

lamb : ﬂoat, optional

Smoothing coefﬁcient (must be zero for now).

Returns c: ndarray

Cubic spline coefﬁcients.

scipy.signal.cspline2d(input {,lambda,precision})→ck

Description:

Return the third-order B-spline coefﬁcients over a regularly spacedi input grid for the two-dimensional

input image. The lambda argument speciﬁes the amount of smoothing. The precision argument allows

specifying the precision used when computing the inﬁnite sum needed to apply mirror- symmetric bound-

ary conditions.

scipy.signal.qspline2d(input {,lambda,precision})→qk

Description:

Return the second-order B-spline coefﬁcients over a regularly spaced input grid for the two-dimensional

input image. The lambda argument speciﬁes the amount of smoothing. The precision argument allows

specifying the precision used when computing the inﬁnite sum needed to apply mirror- symmetric bound-

ary conditions.

scipy.signal.cspline1d_eval(cj,newx,dx=1.0,x0=0)

Evaluate a spline at the new set of points.

dx is the old sample-spacing while x0 was the old origin. In other-words the old-sample points (knot-points) for

which the cj represent spline coefﬁcients were at equally-spaced points of:

oldx = x0 + j*dx j=0...N-1, with N=len(cj)

Edges are handled using mirror-symmetric boundary conditions.

scipy.signal.cspline1d_eval(cj,newx,dx=1.0,x0=0)

Evaluate a spline at the new set of points.

dx is the old sample-spacing while x0 was the old origin. In other-words the old-sample points (knot-points) for

which the cj represent spline coefﬁcients were at equally-spaced points of:

oldx = x0 + j*dx j=0...N-1, with N=len(cj)

Edges are handled using mirror-symmetric boundary conditions.

scipy.signal.spline_filter(Iin,lmbda=5.0)

Smoothing spline (cubic) ﬁltering of a rank-2 array.

Filter an input data set, Iin, using a (cubic) smoothing spline of fall-off lmbda.

5.22. Signal processing (scipy.signal) 599

SciPy Reference Guide, Release 0.13.0

5.22.3 Filtering

600 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

order_filter(a, domain, rank) Perform an order ﬁlter on an N-dimensional array.

medfilt(volume[, kernel_size]) Perform a median ﬁlter on an N-dimensional array.

medfilt2d(input[, kernel_size]) Median ﬁlter a 2-dimensional array.

wiener(im[, mysize, noise]) Perform a Wiener ﬁlter on an N-dimensional array.

symiirorder1((input, c0, z1 {, ...) Implement a smoothing IIR ﬁlter with mirror-symmetric boundary conditions

symiirorder2((input, r, omega {, ...) Implement a smoothing IIR ﬁlter with mirror-symmetric boundary conditions

lfilter(b, a, x[, axis, zi]) Filter data along one-dimension with an IIR or FIR ﬁlter.

lfiltic(b, a, y[, x]) Construct initial conditions for lﬁlter.

lfilter_zi(b, a) Compute an initial state zi for the lﬁlter function that corresponds to the steady state of the step response.

filtfilt(b, a, x[, axis, padtype, padlen]) A forward-backward ﬁlter.

deconvolve(signal, divisor) Deconvolves divisor out of signal.

hilbert(x[, N, axis]) Compute the analytic signal, using the Hilbert transform.

get_window(window, Nx[, fftbins]) Return a window.

decimate(x, q[, n, ftype, axis]) Downsample the signal by using a ﬁlter.

detrend(data[, axis, type, bp]) Remove linear trend along axis from data.

resample(x, num[, t, axis, window]) Resample xto num samples using Fourier method along the given axis.

scipy.signal.order_filter(a,domain,rank)

Perform an order ﬁlter on an N-dimensional array.

Perform an order ﬁlter on the array in. The domain argument acts as a mask centered over each pixel. The

non-zero elements of domain are used to select elements surrounding each input pixel which are placed in a list.

The list is sorted, and the output for that pixel is the element corresponding to rank in the sorted list.

Parameters a: ndarray

The N-dimensional input array.

domain : array_like

A mask array with the same number of dimensions as in. Each dimension

should have an odd number of elements.

rank : int

A non-negative integer which selects the element from the sorted list (0

corresponds to the smallest element, 1 is the next smallest element, etc.).

Returns out : ndarray

The results of the order ﬁlter in an array with the same shape as in.

Examples

>>> from scipy import signal

>>> x=np.arange(25).reshape(5,5)

>>> domain =np.identity(3)

>>> x

array([[ 0, 1, 2, 3, 4],

[ 5, 6, 7, 8, 9],

[10, 11, 12, 13, 14],

[15, 16, 17, 18, 19],

[20, 21, 22, 23, 24]])

>>> signal.order_filter(x, domain, 0)

array([[ 0., 0., 0., 0., 0.],

[ 0., 0., 1., 2., 0.],

[ 0., 5., 6., 7., 0.],

[ 0., 10., 11., 12., 0.],

[ 0., 0., 0., 0., 0.]])

>>> signal.order_filter(x, domain, 2)

array([[ 6., 7., 8., 9., 4.],

[ 11., 12., 13., 14., 9.],

5.22. Signal processing (scipy.signal) 601

SciPy Reference Guide, Release 0.13.0

[ 16., 17., 18., 19., 14.],

[ 21., 22., 23., 24., 19.],

[ 20., 21., 22., 23., 24.]])

scipy.signal.medfilt(volume,kernel_size=None)

Perform a median ﬁlter on an N-dimensional array.

Apply a median ﬁlter to the input array using a local window-size given by kernel_size.

Parameters volume : array_like

An N-dimensional input array.

kernel_size : array_like, optional

A scalar or an N-length list giving the size of the median ﬁlter window in

each dimension. Elements of kernel_size should be odd. If kernel_size is a

scalar, then this scalar is used as the size in each dimension. Default size is

3 for each dimension.

Returns out : ndarray

An array the same size as input containing the median ﬁltered result.

scipy.signal.medfilt2d(input,kernel_size=3)

Median ﬁlter a 2-dimensional array.

Apply a median ﬁlter to the input array using a local window-size given by kernel_size (must be odd).

Parameters input : array_like

A 2-dimensional input array.

kernel_size : array_like, optional

A scalar or a list of length 2, giving the size of the median ﬁlter window in

each dimension. Elements of kernel_size should be odd. If kernel_size is

a scalar, then this scalar is used as the size in each dimension. Default is a

kernel of size (3, 3).

Returns out : ndarray

An array the same size as input containing the median ﬁltered result.

scipy.signal.wiener(im,mysize=None,noise=None)

Perform a Wiener ﬁlter on an N-dimensional array.

Apply a Wiener ﬁlter to the N-dimensional array im.

Parameters im : ndarray

An N-dimensional array.

mysize : int or arraylike, optional

A scalar or an N-length list giving the size of the Wiener ﬁlter window in

each dimension. Elements of mysize should be odd. If mysize is a scalar,

then this scalar is used as the size in each dimension.

noise : ﬂoat, optional

The noise-power to use. If None, then noise is estimated as the average of

the local variance of the input.

Returns out : ndarray

Wiener ﬁltered result with the same shape as im.

scipy.signal.symiirorder1(input,c0,z1 {,precision})→output

Implement a smoothing IIR ﬁlter with mirror-symmetric boundary conditions using a cascade of ﬁrst-order

sections. The second section uses a reversed sequence. This implements a system with the following transfer

function and mirror-symmetric boundary conditions:

H(z) = ---------------------

(1-z1/z) (1 - z1 z)

602 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

The resulting signal will have mirror symmetric boundary conditions as well.

Parameters input : ndarray

The input signal.

c0, z1 : scalar

Parameters in the transfer function.

precision :

Speciﬁes the precision for calculating initial conditions of the recursive ﬁl-

ter based on mirror-symmetric input.

Returns output : ndarray

The ﬁltered signal.

scipy.signal.symiirorder2(input,r,omega {,precision})→output

Implement a smoothing IIR ﬁlter with mirror-symmetric boundary conditions using a cascade of second-order

sections. The second section uses a reversed sequence. This implements the following transfer function:

cs^2

H(z) = ---------------------------------------

(1 - a2/z - a3/z^2) (1 - a2 z - a3 z^2 )

where:

a2 = (2 r cos omega)

a3 = - r^2

cs = 1 - 2 r cos omega + r^2

Parameters input : ndarray

The input signal.

r, omega : scalar

Parameters in the transfer function.

precision :

Speciﬁes the precision for calculating initial conditions of the recursive ﬁl-

ter based on mirror-symmetric input.

Returns output : ndarray

The ﬁltered signal.

scipy.signal.lfilter(b,a,x,axis=-1,zi=None)

Filter data along one-dimension with an IIR or FIR ﬁlter.

Filter a data sequence, x, using a digital ﬁlter. This works for many fundamental data types (including Object

type). The ﬁlter is a direct form II transposed implementation of the standard difference equation (see Notes).

Parameters b: array_like

The numerator coefﬁcient vector in a 1-D sequence.

a: array_like

The denominator coefﬁcient vector in a 1-D sequence. If a[0] is not 1,

then both aand bare normalized by a[0].

x: array_like

An N-dimensional input array.

axis : int

The axis of the input data array along which to apply the linear ﬁlter. The

ﬁlter is applied to each subarray along this axis. Default is -1.

zi : array_like, optional

Initial conditions for the ﬁlter delays. It is a vector (or array of vectors for

an N-dimensional input) of length max(len(a),len(b))-1. If zi is

None or is not given then initial rest is assumed. See lfiltic for more

information.

5.22. Signal processing (scipy.signal) 603

SciPy Reference Guide, Release 0.13.0

Returns y: array

The output of the digital ﬁlter.

zf : array, optional

If zi is None, this is not returned, otherwise, zf holds the ﬁnal ﬁlter delay

values.

Notes

The ﬁlter function is implemented as a direct II transposed structure. This means that the ﬁlter implements:

a[0]*y[n] = b[0]*x[n] + b[1]*x[n-1] + ... + b[nb]*x[n-nb]

- a[1]*y[n-1] - ... - a[na]*y[n-na]

using the following difference equations:

y[m] =b[0]*x[m] +z[0,m-1]

z[0,m] =b[1]*x[m] +z[1,m-1]-a[1]*y[m]

...

z[n-3,m] =b[n-2]*x[m] +z[n-2,m-1]-a[n-2]*y[m]

z[n-2,m] =b[n-1]*x[m] -a[n-1]*y[m]

where m is the output sample number and n=max(len(a),len(b)) is the model order.

The rational transfer function describing this ﬁlter in the z-transform domain is:

-1 -nb

b[0] + b[1]z + ... + b[nb] z

Y(z) = ---------------------------------- X(z)

-1 -na

a[0] + a[1]z + ... + a[na] z

scipy.signal.lfiltic(b,a,y,x=None)

Construct initial conditions for lﬁlter.

Given a linear ﬁlter (b, a) and initial conditions on the output yand the input x, return the inital conditions on

the state vector zi which is used by lfilter to generate the output given the input.

Parameters b: array_like

Linear ﬁlter term.

a: array_like

Linear ﬁlter term.

y: array_like

Initial conditions.

If N=len(a) - 1, then y = {y[-1], y[-2], ..., y[-N]}.

If yis too short, it is padded with zeros.

x: array_like, optional

Initial conditions.

If M=len(b) - 1, then x = {x[-1], x[-2], ..., x[-M]}.

If xis not given, its initial conditions are assumed zero.

If xis too short, it is padded with zeros.

Returns zi : ndarray

The state vector zi.zi = {z_0[-1], z_1[-1], ...,

z_K-1[-1]}, where K = max(M,N).

604 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

See Also

lfilter

scipy.signal.lfilter_zi(b,a)

Compute an initial state zi for the lﬁlter function that corresponds to the steady state of the step response.

A typical use of this function is to set the initial state so that the output of the ﬁlter starts at the same value as

the ﬁrst element of the signal to be ﬁltered.

Parameters b, a : array_like (1-D)

The IIR ﬁlter coefﬁcients. See lfilter for more information.

Returns zi : 1-D ndarray

The initial state for the ﬁlter.

Notes

A linear ﬁlter with order m has a state space representation (A, B, C, D), for which the output y of the ﬁlter can

be expressed as:

z(n+1)=A*z(n) +B*x(n)

y(n) =C*z(n) +D*x(n)

where z(n) is a vector of length m, A has shape (m, m), B has shape (m, 1), C has shape (1, m) and D has shape

(1, 1) (assuming x(n) is a scalar). lﬁlter_zi solves:

zi =A*zi +B

In other words, it ﬁnds the initial condition for which the response to an input of all ones is a constant.

Given the ﬁlter coefﬁcients aand b, the state space matrices for the transposed direct form II implementation of

the linear ﬁlter, which is the implementation used by scipy.signal.lﬁlter, are:

A=scipy.linalg.companion(a).T

B=b[1:] -a[1:]*b[0]

assuming a[0] is 1.0; if a[0] is not 1, aand bare ﬁrst divided by a[0].

Examples

The following code creates a lowpass Butterworth ﬁlter. Then it applies that ﬁlter to an array whose values are

all 1.0; the output is also all 1.0, as expected for a lowpass ﬁlter. If the zi argument of lfilter had not been

given, the output would have shown the transient signal.

>>> from numpy import array, ones

>>> from scipy.signal import lfilter, lfilter_zi, butter

>>> b,a=butter(5,0.25)

>>> zi =lfilter_zi(b, a)

>>> y, zo =lfilter(b, a, ones(10), zi=zi)

>>> y

array([1., 1., 1., 1., 1., 1., 1., 1., 1., 1.])

Another example:

>>> x=array([0.5,0.5,0.5,0.0,0.0,0.0,0.0])

>>> y, zf =lfilter(b, a, x, zi=zi*x[0])

>>> y

array([ 0.5 , 0.5 , 0.5 , 0.49836039, 0.48610528,

0.44399389, 0.35505241])

5.22. Signal processing (scipy.signal) 605

SciPy Reference Guide, Release 0.13.0

Note that the zi argument to lfilter was computed using lfilter_zi and scaled by x[0]. Then the output

yhas no transient until the input drops from 0.5 to 0.0.

scipy.signal.filtfilt(b,a,x,axis=-1,padtype=’odd’,padlen=None)

A forward-backward ﬁlter.

This function applies a linear ﬁlter twice, once forward and once backwards. The combined ﬁlter has linear

phase.

Before applying the ﬁlter, the function can pad the data along the given axis in one of three ways: odd, even or

constant. The odd and even extensions have the corresponding symmetry about the end point of the data. The

constant extension extends the data with the values at end points. On both the forward and backwards passes,

the initial condition of the ﬁlter is found by using lfilter_zi and scaling it by the end point of the extended

data.

Parameters b: (N,) array_like

The numerator coefﬁcient vector of the ﬁlter.

a: (N,) array_like

The denominator coefﬁcient vector of the ﬁlter. If a[0] is not 1, then both a

and b are normalized by a[0].

x: array_like

The array of data to be ﬁltered.

axis : int, optional

The axis of xto which the ﬁlter is applied. Default is -1.

padtype : str or None, optional

Must be ‘odd’, ‘even’, ‘constant’, or None. This determines the type of

extension to use for the padded signal to which the ﬁlter is applied. If

padtype is None, no padding is used. The default is ‘odd’.

padlen : int or None, optional

The number of elements by which to extend xat both ends of axis before

applying the ﬁlter. This value must be less than x.shape[axis]-1.padlen=0

implies no padding. The default value is 3*max(len(a),len(b)).

Returns y: ndarray

The ﬁltered output, an array of type numpy.ﬂoat64 with the same shape as

See Also

lfilter_zi,lfilter

Examples

First we create a one second signal that is the sum of two pure sine waves, with frequencies 5 Hz and 250 Hz,

sampled at 2000 Hz.

>>> t=np.linspace(0,1.0,2001)

>>> xlow =np.sin(2*np.pi *5*t)

>>> xhigh =np.sin(2*np.pi *250 *t)

>>> x=xlow +xhigh

Now create a lowpass Butterworth ﬁlter with a cutoff of 0.125 times the Nyquist rate, or 125 Hz, and apply it to

x with ﬁltﬁlt. The result should be approximately xlow, with no phase shift.

>>> from scipy import signal

>>> b,a=signal.butter(8,0.125)

>>> y=signal.filtfilt(b, a, x, padlen=150)

>>> np.abs(y -xlow).max()

9.1086182074789912e-06

606 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

We get a fairly clean result for this artiﬁcial example because the odd extension is exact, and with the moderately

long padding, the ﬁlter’s transients have dissipated by the time the actual data is reached. In general, transient

effects at the edges are unavoidable.

scipy.signal.deconvolve(signal,divisor)

Deconvolves divisor out of signal.

Parameters signal : array

Signal input

divisor : array

Divisor input

Returns q: array

Quotient of the division

r: array

Remainder

Examples

>>> from scipy import signal

>>> sig =np.array([0,0,0,0,0,1,1,1,1,])

>>> filter =np.array([1,1,0])

>>> res =signal.convolve(sig, filter)

>>> signal.deconvolve(res, filter)

(array([ 0., 0., 0., 0., 0., 1., 1., 1., 1.]),

array([ 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.]))

scipy.signal.hilbert(x,N=None,axis=-1)

Compute the analytic signal, using the Hilbert transform.

The transformation is done along the last axis by default.

Parameters x: array_like

Signal data. Must be real.

N: int, optional

Number of Fourier components. Default: x.shape[axis]

axis : int, optional

Axis along which to do the transformation. Default: -1.

Returns xa : ndarray

Analytic signal of x, of each 1-D array along axis

Notes

The analytic signal x_a(t) of signal x(t) is:

xa=F−1(F(x)2U) = x+iy

where Fis the Fourier transform, Uthe unit step function, and ythe Hilbert transform of x.[R125]

In other words, the negative half of the frequency spectrum is zeroed out, turning the real-valued signal into

a complex signal. The Hilbert transformed signal can be obtained from np.imag(hilbert(x)), and the

original signal from np.real(hilbert(x)).

References

[R125]

scipy.signal.get_window(window,Nx,fftbins=True)

Return a window.

Parameters window : string, ﬂoat, or tuple

5.22. Signal processing (scipy.signal) 607

SciPy Reference Guide, Release 0.13.0

The type of window to create. See below for more details.

Nx : int

The number of samples in the window.

fftbins : bool, optional

If True, create a “periodic” window ready to use with ifftshift and be multi-

plied by the result of an fft (SEE ALSO fftfreq).

Returns get_window : ndarray

Returns a window of length Nx and type window

Notes

Window types:

boxcar, triang, blackman, hamming, hann, bartlett, ﬂattop, parzen, bohman, blackmanharris, nuttall,

barthann, kaiser (needs beta), gaussian (needs std), general_gaussian (needs power, width), slepian (needs

width), chebwin (needs attenuation)

If the window requires no parameters, then window can be a string.

If the window requires parameters, then window must be a tuple with the ﬁrst argument the string name of the

window, and the next arguments the needed parameters.

If window is a ﬂoating point number, it is interpreted as the beta parameter of the kaiser window.

Each of the window types listed above is also the name of a function that can be called directly to create a

window of that type.

Examples

>>> from scipy import signal

>>> signal.get_window(’triang’,7)

array([ 0.25, 0.5 , 0.75, 1. , 0.75, 0.5 , 0.25])

>>> signal.get_window((’kaiser’,4.0), 9)

array([ 0.08848053, 0.32578323, 0.63343178, 0.89640418, 1. ,

0.89640418, 0.63343178, 0.32578323, 0.08848053])

>>> signal.get_window(4.0,9)

array([ 0.08848053, 0.32578323, 0.63343178, 0.89640418, 1. ,

0.89640418, 0.63343178, 0.32578323, 0.08848053])

scipy.signal.decimate(x,q,n=None,ftype=’iir’,axis=-1)

Downsample the signal by using a ﬁlter.

By default, an order 8 Chebyshev type I ﬁlter is used. A 30 point FIR ﬁlter with hamming window is used if

ftype is ‘ﬁr’.

Parameters x: ndarray

The signal to be downsampled, as an N-dimensional array.

q: int

The downsampling factor.

n: int, optional

The order of the ﬁlter (1 less than the length for ‘ﬁr’).

ftype : str {‘iir’, ‘ﬁr’}, optional

The type of the lowpass ﬁlter.

axis : int, optional

The axis along which to decimate.

Returns y: ndarray

The down-sampled signal.

608 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

See Also

resample

scipy.signal.detrend(data,axis=-1,type=’linear’,bp=0)

Remove linear trend along axis from data.

Parameters data : array_like

The input data.

axis : int, optional

The axis along which to detrend the data. By default this is the last axis

(-1).

type : {‘linear’, ‘constant’}, optional

The type of detrending. If type == ’linear’ (default), the result

of a linear least-squares ﬁt to data is subtracted from data. If type ==

’constant’, only the mean of data is subtracted.

bp : array_like of ints, optional

A sequence of break points. If given, an individual linear ﬁt is performed

for each part of data between two break points. Break points are speciﬁed

as indices into data.

Returns ret : ndarray

The detrended input data.

Examples

>>> from scipy import signal

>>> randgen =np.random.RandomState(9)

>>> npoints =1e3

>>> noise =randgen.randn(npoints)

>>> x=3+2*np.linspace(0,1, npoints) +noise

>>> (signal.detrend(x) -noise).max() <0.01

True

scipy.signal.resample(x,num,t=None,axis=0,window=None)

Resample xto num samples using Fourier method along the given axis.

The resampled signal starts at the same value as xbut is sampled with a spacing of len(x) / num *

(spacing of x). Because a Fourier method is used, the signal is assumed to be periodic.

Parameters x: array_like

The data to be resampled.

num : int

The number of samples in the resampled signal.

t: array_like, optional

If tis given, it is assumed to be the sample positions associated with the

signal data in x.

axis : int, optional

The axis of xthat is resampled. Default is 0.

window : array_like, callable, string, ﬂoat, or tuple, optional

Speciﬁes the window applied to the signal in the Fourier domain. See below

for details.

Returns resampled_x or (resampled_x, resampled_t)

Either the resampled array, or, if twas given, a tuple containing the resam-

pled array and the corresponding resampled positions.

5.22. Signal processing (scipy.signal) 609

SciPy Reference Guide, Release 0.13.0

Notes

The argument window controls a Fourier-domain window that tapers the Fourier spectrum before zero-padding

to alleviate ringing in the resampled values for sampled signals you didn’t intend to be interpreted as band-

limited.

If window is a function, then it is called with a vector of inputs indicating the frequency bins (i.e. fft-

freq(x.shape[axis]) ).

If window is an array of the same length as x.shape[axis] it is assumed to be the window to be applied directly

in the Fourier domain (with dc and low-frequency ﬁrst).

For any other type of window, the function scipy.signal.get_window is called to generate the window.

The ﬁrst sample of the returned vector is the same as the ﬁrst sample of the input vector. The spacing between

samples is changed from dx to:

dx * len(x) / num

If tis not None, then it represents the old sample positions, and the new sample positions will be returned as

well as the new samples.

5.22.4 Filter design

bilinear(b, a[, fs]) Return a digital ﬁlter from an analog one using a bilinear transform.

firwin(numtaps, cutoff[, width, window, ...]) FIR ﬁlter design using the window method.

firwin2(numtaps, freq, gain[, nfreqs, ...]) FIR ﬁlter design using the window method.

freqs(b, a[, worN, plot]) Compute frequency response of analog ﬁlter.

freqz(b[, a, worN, whole, plot]) Compute the frequency response of a digital ﬁlter.

iirdesign(wp, ws, gpass, gstop[, analog, ...]) Complete IIR digital and analog ﬁlter design.

iirfilter(N, Wn[, rp, rs, btype, analog, ...]) IIR digital and analog ﬁlter design given order and critical points.

kaiser_atten(numtaps, width) Compute the attenuation of a Kaiser FIR ﬁlter.

kaiser_beta(a) Compute the Kaiser parameter beta, given the attenuation a.

kaiserord(ripple, width) Design a Kaiser window to limit ripple and width of transition region.

remez(numtaps, bands, desired[, weight, Hz, ...]) Calculate the minimax optimal ﬁlter using the Remez exchange algorithm.

unique_roots(p[, tol, rtype]) Determine unique roots and their multiplicities from a list of roots.

residue(b, a[, tol, rtype]) Compute partial-fraction expansion of b(s) / a(s).

residuez(b, a[, tol, rtype]) Compute partial-fraction expansion of b(z) / a(z).

invres(r, p, k[, tol, rtype]) Compute b(s) and a(s) from partial fraction expansion: r,p,k

scipy.signal.bilinear(b,a,fs=1.0)

Return a digital ﬁlter from an analog one using a bilinear transform.

The bilinear transform substitutes (z-1) / (z+1) for s.

scipy.signal.firwin(numtaps,cutoff,width=None,window=’hamming’,pass_zero=True,scale=True,

nyq=1.0)

FIR ﬁlter design using the window method.

This function computes the coefﬁcients of a ﬁnite impulse response ﬁlter. The ﬁlter will have linear phase; it

will be Type I if numtaps is odd and Type II if numtaps is even.

Type II ﬁlters always have zero response at the Nyquist rate, so a ValueError exception is raised if ﬁrwin is

called with numtaps even and having a passband whose right end is at the Nyquist rate.

Parameters numtaps : int

610 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Length of the ﬁlter (number of coefﬁcients, i.e. the ﬁlter order + 1). num-

taps must be even if a passband includes the Nyquist frequency.

cutoff : ﬂoat or 1D array_like

Cutoff frequency of ﬁlter (expressed in the same units as nyq) OR an array

of cutoff frequencies (that is, band edges). In the latter case, the frequencies

in cutoff should be positive and monotonically increasing between 0 and

nyq. The values 0 and nyq must not be included in cutoff.

width : ﬂoat or None

If width is not None, then assume it is the approximate width of the transi-

tion region (expressed in the same units as nyq) for use in Kaiser FIR ﬁlter

design. In this case, the window argument is ignored.

window : string or tuple of string and parameter values

Desired window to use. See scipy.signal.get_window for a list of

windows and required parameters.

pass_zero : bool

If True, the gain at the frequency 0 (i.e. the “DC gain”) is 1. Otherwise the

DC gain is 0.

scale : bool

Set to True to scale the coefﬁcients so that the frequency response is exactly

unity at a certain frequency. That frequency is either:

•0 (DC) if the ﬁrst passband starts at 0 (i.e. pass_zero is True)

•nyq (the Nyquist rate) if the ﬁrst passband ends at nyq (i.e the

ﬁlter is a single band highpass ﬁlter); center of ﬁrst passband

otherwise

nyq : ﬂoat

Nyquist frequency. Each frequency in cutoff must be between 0 and nyq.

Returns h: (numtaps,) ndarray

Coefﬁcients of length numtaps FIR ﬁlter.

Raises ValueError

If any value in cutoff is less than or equal to 0 or greater than or equal to

nyq, if the values in cutoff are not strictly monotonically increasing, or if

numtaps is even but a passband includes the Nyquist frequency.

See Also

scipy.signal.firwin2

Examples

Low-pass from 0 to f:

>>> from scipy import signal

>>> signal.firwin(numtaps, f)

Use a speciﬁc window function:

>>> signal.firwin(numtaps, f, window=’nuttall’)

High-pass (‘stop’ from 0 to f):

>>> signal.firwin(numtaps, f, pass_zero=False)

Band-pass:

>>> signal.firwin(numtaps, [f1, f2], pass_zero=False)

Band-stop:

5.22. Signal processing (scipy.signal) 611

SciPy Reference Guide, Release 0.13.0

>>> signal.firwin(numtaps, [f1, f2])

Multi-band (passbands are [0, f1], [f2, f3] and [f4, 1]):

>>> signal.firwin(numtaps, [f1, f2, f3, f4])

Multi-band (passbands are [f1, f2] and [f3,f4]):

>>> signal.firwin(numtaps, [f1, f2, f3, f4], pass_zero=False)

scipy.signal.firwin2(numtaps,freq,gain,nfreqs=None,window=’hamming’,nyq=1.0,antisymmet-

ric=False)

FIR ﬁlter design using the window method.

From the given frequencies freq and corresponding gains gain, this function constructs an FIR ﬁlter with linear

phase and (approximately) the given frequency response.

Parameters numtaps : int

The number of taps in the FIR ﬁlter. numtaps must be less than nfreqs.

freq : array_like, 1D

The frequency sampling points. Typically 0.0 to 1.0 with 1.0 being Nyquist.

The Nyquist frequency can be redeﬁned with the argument nyq. The values

in freq must be nondecreasing. A value can be repeated once to implement

a discontinuity. The ﬁrst value in freq must be 0, and the last value must be

nyq.

gain : array_like

The ﬁlter gains at the frequency sampling points. Certain constraints to gain

values, depending on the ﬁlter type, are applied, see Notes for details.

nfreqs : int, optional

The size of the interpolation mesh used to construct the ﬁlter. For most

efﬁcient behavior, this should be a power of 2 plus 1 (e.g, 129, 257, etc).

The default is one more than the smallest power of 2 that is not less than

numtaps.nfreqs must be greater than numtaps.

window : string or (string, ﬂoat) or ﬂoat, or None, optional

Window function to use. Default is “hamming”. See

scipy.signal.get_window for the complete list of possible

values. If None, no window function is applied.

nyq : ﬂoat

Nyquist frequency. Each frequency in freq must be between 0 and nyq

(inclusive).

antisymmetric : bool

Whether resulting impulse response is symmetric/antisymmetric. See Notes

for more details.

Returns taps : ndarray

The ﬁlter coefﬁcients of the FIR ﬁlter, as a 1-D array of length numtaps.

See Also

scipy.signal.firwin

Notes

From the given set of frequencies and gains, the desired response is constructed in the frequency domain. The

inverse FFT is applied to the desired response to create the associated convolution kernel, and the ﬁrst numtaps

coefﬁcients of this kernel, scaled by window, are returned.

612 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

The FIR ﬁlter will have linear phase. The type of ﬁlter is determined by the value of ‘numtaps‘ and antisymmetric

ﬂag. There are four possible combinations:

•odd numtaps,antisymmetric is False, type I ﬁlter is produced

•even numtaps,antisymmetric is False, type II ﬁlter is produced

•odd numtaps,antisymmetric is True, type III ﬁlter is produced

•even numtaps,antisymmetric is True, type IV ﬁlter is produced

Magnitude response of all but type I ﬁlters are subjects to following constraints:

•type II – zero at the Nyquist frequency

•type III – zero at zero and Nyquist frequencies

•type IV – zero at zero frequency

New in version 0.9.0.

References

[R115],[R116]

Examples

A lowpass FIR ﬁlter with a response that is 1 on [0.0, 0.5], and that decreases linearly on [0.5, 1.0] from 1 to 0:

>>> from scipy import signal

>>> taps =signal.firwin2(150, [0.0,0.5,1.0], [1.0,1.0,0.0])

>>> print(taps[72:78])

[-0.02286961 -0.06362756 0.57310236 0.57310236 -0.06362756 -0.02286961]

scipy.signal.freqs(b,a,worN=None,plot=None)

Compute frequency response of analog ﬁlter.

Given the numerator band denominator aof a ﬁlter, compute its frequency response:

b[0]*(jw)**(nb-1) + b[1]*(jw)**(nb-2) + ... + b[nb-1]

H(w) = -------------------------------------------------------

a[0]*(jw)**(na-1) + a[1]*(jw)**(na-2) + ... + a[na-1]

Parameters b: ndarray

Numerator of a linear ﬁlter.

a: ndarray

Denominator of a linear ﬁlter.

worN : {None, int}, optional

If None, then compute at 200 frequencies around the interesting parts of the

response curve (determined by pole-zero locations). If a single integer, then

compute at that many frequencies. Otherwise, compute the response at the

angular frequencies (e.g. rad/s) given in worN.

plot : callable

A callable that takes two arguments. If given, the return parameters wand h

are passed to plot. Useful for plotting the frequency response inside freqs.

Returns w: ndarray

The angular frequencies at which h was computed.

h: ndarray

The frequency response.

See Also

freqz Compute the frequency response of a digital ﬁlter.

5.22. Signal processing (scipy.signal) 613

SciPy Reference Guide, Release 0.13.0

Notes

Using Matplotlib’s “plot” function as the callable for plot produces unexpected results, this plots the real part of

the complex transfer function, not the magnitude.

Examples

>>> from scipy.signal import freqs, iirfilter

>>> b,a=iirfilter(4, [1,10], 1,60, analog=True, ftype=’cheby1’)

>>> w,h=freqs(b, a, worN=np.logspace(-1,2,1000))

>>> import matplotlib.pyplot as plt

>>> plt.semilogx(w, abs(h))

>>> plt.xlabel(’Frequency’)

>>> plt.ylabel(’Amplitude response’)

>>> plt.grid()

>>> plt.show()

scipy.signal.freqz(b,a=1,worN=None,whole=0,plot=None)

Compute the frequency response of a digital ﬁlter.

Given the numerator band denominator aof a digital ﬁlter, compute its frequency response:

jw -jw -jmw

jw B(e) b[0] + b[1]e + .... + b[m]e

H(e) = ---- = ------------------------------------

jw -jw -jnw

A(e) a[0] + a[1]e + .... + a[n]e

Parameters b: ndarray

numerator of a linear ﬁlter

a: ndarray

denominator of a linear ﬁlter

worN : {None, int, array_like}, optional

614 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

If None (default), then compute at 512 frequencies equally spaced around

the unit circle. If a single integer, then compute at that many frequencies.

If an array_like, compute the response at the frequencies given (in radi-

ans/sample).

whole : bool, optional

Normally, frequencies are computed from 0 to the Nyquist frequency, pi

radians/sample (upper-half of unit-circle). If whole is True, compute fre-

quencies from 0 to 2*pi radians/sample.

plot : callable

A callable that takes two arguments. If given, the return parameters wand h

are passed to plot. Useful for plotting the frequency response inside freqz.

Returns w: ndarray

The normalized frequencies at which h was computed, in radians/sample.

h: ndarray

The frequency response.

Notes

Using Matplotlib’s “plot” function as the callable for plot produces unexpected results, this plots the real part of

the complex transfer function, not the magnitude.

Examples

>>> from scipy import signal

>>> b=signal.firwin(80,0.5, window=(’kaiser’,8))

>>> w,h=signal.freqz(b)

>>> import matplotlib.pyplot as plt

>>> fig =plt.figure()

>>> plt.title(’Digital filter frequency response’)

>>> ax1 =fig.add_subplot(111)

>>> plt.semilogy(w, np.abs(h), ’b’)

>>> plt.ylabel(’Amplitude (dB)’, color=’b’)

>>> plt.xlabel(’Frequency (rad/sample)’)

>>> ax2 =ax1.twinx()

>>> angles =np.unwrap(np.angle(h))

>>> plt.plot(w, angles, ’g’)

>>> plt.ylabel(’Angle (radians)’, color=’g’)

>>> plt.grid()

>>> plt.axis(’tight’)

>>> plt.show()

5.22. Signal processing (scipy.signal) 615

SciPy Reference Guide, Release 0.13.0

scipy.signal.iirdesign(wp,ws,gpass,gstop,analog=False,ftype=’ellip’,output=’ba’)

Complete IIR digital and analog ﬁlter design.

Given passband and stopband frequencies and gains, construct an analog or digital IIR ﬁlter of minimum order

for a given basic type. Return the output in numerator, denominator (‘ba’) or pole-zero (‘zpk’) form.

Parameters wp, ws : ﬂoat

Passband and stopband edge frequencies. For digital ﬁlters, these are nor-

malized from 0 to 1, where 1 is the Nyquist frequency, pi radians/sample.

(wp and ws are thus in half-cycles / sample.) For example:

•Lowpass: wp = 0.2, ws = 0.3

•Highpass: wp = 0.3, ws = 0.2

•Bandpass: wp = [0.2, 0.5], ws = [0.1, 0.6]

•Bandstop: wp = [0.1, 0.6], ws = [0.2, 0.5]

For analog ﬁlters, wp and ws are angular frequencies (e.g. rad/s).

gpass : ﬂoat

The maximum loss in the passband (dB).

gstop : ﬂoat

The minimum attenuation in the stopband (dB).

analog : bool, optional

When True, return an analog ﬁlter, otherwise a digital ﬁlter is returned.

ftype : str, optional

The type of IIR ﬁlter to design:

•Butterworth : ‘butter’

•Chebyshev I : ‘cheby1’

•Chebyshev II : ‘cheby2’

•Cauer/elliptic: ‘ellip’

•Bessel/Thomson: ‘bessel’

output : {‘ba’, ‘zpk’}, optional

Type of output: numerator/denominator (‘ba’) or pole-zero (‘zpk’). Default

is ‘ba’.

Returns b, a : ndarray, ndarray

Numerator (b) and denominator (a) polynomials of the IIR ﬁlter. Only re-

turned if output=’ba’.

z, p, k : ndarray, ndarray, ﬂoat

Zeros, poles, and system gain of the IIR ﬁlter transfer function. Only re-

turned if output=’zpk’.

scipy.signal.iirfilter(N,Wn,rp=None,rs=None,btype=’band’,analog=False,ftype=’butter’,

output=’ba’)

IIR digital and analog ﬁlter design given order and critical points.

616 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Design an Nth order digital or analog ﬁlter and return the ﬁlter coefﬁcients in (B,A) (numerator, denominator)

or (Z,P,K) form.

Parameters N: int

The order of the ﬁlter.

Wn : array_like

A scalar or length-2 sequence giving the critical frequencies. For digital

ﬁlters, Wn is normalized from 0 to 1, where 1 is the Nyquist frequency, pi

radians/sample. (Wn is thus in half-cycles / sample.) For analog ﬁlters, Wn

is an angular frequency (e.g. rad/s).

rp : ﬂoat, optional

For Chebyshev and elliptic ﬁlters, provides the maximum ripple in the pass-

band. (dB)

rs : ﬂoat, optional

For Chebyshev and elliptic ﬁlters, provides the minimum attenuation in the

stop band. (dB)

btype : {‘bandpass’, ‘lowpass’, ‘highpass’, ‘bandstop’}, optional

The type of ﬁlter. Default is ‘bandpass’.

analog : bool, optional

When True, return an analog ﬁlter, otherwise a digital ﬁlter is returned.

ftype : str, optional

The type of IIR ﬁlter to design:

•Butterworth : ‘butter’

•Chebyshev I : ‘cheby1’

•Chebyshev II : ‘cheby2’

•Cauer/elliptic: ‘ellip’

•Bessel/Thomson: ‘bessel’

output : {‘ba’, ‘zpk’}, optional

Type of output: numerator/denominator (‘ba’) or pole-zero (‘zpk’). Default

is ‘ba’.

See Also

buttord,cheb1ord,cheb2ord,ellipord

scipy.signal.kaiser_atten(numtaps,width)

Compute the attenuation of a Kaiser FIR ﬁlter.

Given the number of taps Nand the transition width width, compute the attenuation ain dB, given by Kaiser’s

formula:

a = 2.285 * (N - 1) * pi * width + 7.95

Parameters N: int

The number of taps in the FIR ﬁlter.

width : ﬂoat

The desired width of the transition region between passband and stopband

(or, in general, at any discontinuity) for the ﬁlter.

Returns a: ﬂoat

The attenuation of the ripple, in dB.

See Also

kaiserord,kaiser_beta

scipy.signal.kaiser_beta(a)

Compute the Kaiser parameter beta, given the attenuation a.

Parameters a: ﬂoat

5.22. Signal processing (scipy.signal) 617

SciPy Reference Guide, Release 0.13.0

The desired attenuation in the stopband and maximum ripple in the pass-

band, in dB. This should be a positive number.

Returns beta : ﬂoat

The beta parameter to be used in the formula for a Kaiser window.

References

Oppenheim, Schafer, “Discrete-Time Signal Processing”, p.475-476.

scipy.signal.kaiserord(ripple,width)

Design a Kaiser window to limit ripple and width of transition region.

Parameters ripple : ﬂoat

Positive number specifying maximum ripple in passband (dB) and mini-

mum ripple in stopband.

width : ﬂoat

Width of transition region (normalized so that 1 corresponds to pi radians /

sample).

Returns numtaps : int

The length of the kaiser window.

beta : ﬂoat

The beta parameter for the kaiser window.

See Also

kaiser_beta,kaiser_atten

Notes

There are several ways to obtain the Kaiser window:

•signal.kaiser(numtaps, beta, sym=0)

•signal.get_window(beta, numtaps)

•signal.get_window((’kaiser’, beta), numtaps)

The empirical equations discovered by Kaiser are used.

References

Oppenheim, Schafer, “Discrete-Time Signal Processing”, p.475-476.

scipy.signal.remez(numtaps,bands,desired,weight=None,Hz=1,type=’bandpass’,maxiter=25,

grid_density=16)

Calculate the minimax optimal ﬁlter using the Remez exchange algorithm.

Calculate the ﬁlter-coefﬁcients for the ﬁnite impulse response (FIR) ﬁlter whose transfer function minimizes the

maximum error between the desired gain and the realized gain in the speciﬁed frequency bands using the Remez

exchange algorithm.

Parameters numtaps : int

The desired number of taps in the ﬁlter. The number of taps is the number

of terms in the ﬁlter, or the ﬁlter order plus one.

bands : array_like

A monotonic sequence containing the band edges in Hz. All elements must

be non-negative and less than half the sampling frequency as given by Hz.

desired : array_like

A sequence half the size of bands containing the desired gain in each of the

speciﬁed bands.

weight : array_like, optional

A relative weighting to give to each band region. The length of weight has

to be half the length of bands.

Hz : scalar, optional

618 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

The sampling frequency in Hz. Default is 1.

type : {‘bandpass’, ‘differentiator’, ‘hilbert’}, optional

The type of ﬁlter:

‘bandpass’ : ﬂat response in bands. This is the default.

‘differentiator’ : frequency proportional response in bands.

‘hilbert’ [ﬁlter with odd symmetry, that is, type III]

(for even order) or type IV (for odd order)

linear phase ﬁlters.

maxiter : int, optional

Maximum number of iterations of the algorithm. Default is 25.

grid_density : int, optional

Grid density. The dense grid used in remez is of size (numtaps + 1)

*grid_density. Default is 16.

Returns out : ndarray

A rank-1 array containing the coefﬁcients of the optimal (in a minimax

sense) ﬁlter.

See Also

freqz Compute the frequency response of a digital ﬁlter.

References

[R132],[R133]

Examples

We want to construct a ﬁlter with a passband at 0.2-0.4 Hz, and stop bands at 0-0.1 Hz and 0.45-0.5 Hz. Note

that this means that the behavior in the frequency ranges between those bands is unspeciﬁed and may overshoot.

>>> from scipy import signal

>>> bpass =signal.remez(72, [0,0.1,0.2,0.4,0.45,0.5], [0,1,0])

>>> freq, response =signal.freqz(bpass)

>>> ampl =np.abs(response)

>>> import matplotlib.pyplot as plt

>>> fig =plt.figure()

>>> ax1 =fig.add_subplot(111)

>>> ax1.semilogy(freq/(2*np.pi), ampl, ’b-’)# freq in Hz

>>> plt.show()

5.22. Signal processing (scipy.signal) 619

SciPy Reference Guide, Release 0.13.0

scipy.signal.unique_roots(p,tol=0.001,rtype=’min’)

Determine unique roots and their multiplicities from a list of roots.

Parameters p: array_like

The list of roots.

tol : ﬂoat, optional

The tolerance for two roots to be considered equal. Default is 1e-3.

rtype : {‘max’, ‘min, ‘avg’}, optional

How to determine the returned root if multiple roots are within tol of each

other.

•‘max’: pick the maximum of those roots.

•‘min’: pick the minimum of those roots.

•‘avg’: take the average of those roots.

Returns pout : ndarray

The list of unique roots, sorted from low to high.

mult : ndarray

The multiplicity of each root.

Notes

This utility function is not speciﬁc to roots but can be used for any sequence of values for which uniqueness and

multiplicity has to be determined. For a more general routine, see numpy.unique.

Examples

>>> from scipy import signal

>>> vals =[0,1.3,1.31,2.8,1.25,2.2,10.3]

>>> uniq, mult =signal.unique_roots(vals, tol=2e-2, rtype=’avg’)

Check which roots have multiplicity larger than 1:

>>> uniq[mult >1]

array([ 1.305])

scipy.signal.residue(b,a,tol=0.001,rtype=’avg’)

Compute partial-fraction expansion of b(s) / a(s).

If M = len(b) and N = len(a), then the partial-fraction expansion H(s) is deﬁned as:

620 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

b(s) b[0] s**(M-1) + b[1] s**(M-2) + ... + b[M-1]

H(s) = ------ = ----------------------------------------------

a(s) a[0] s**(N-1) + a[1] s**(N-2) + ... + a[N-1]

r[0] r[1] r[-1]

= -------- + -------- + ... + --------- + k(s)

(s-p[0]) (s-p[1]) (s-p[-1])

If there are any repeated roots (closer together than tol), then H(s) has terms like:

r[i] r[i+1] r[i+n-1]

-------- + ----------- + ... + -----------

(s-p[i]) (s-p[i])**2 (s-p[i])**n

Returns r: ndarray

Residues.

p: ndarray

Poles.

k: ndarray

Coefﬁcients of the direct polynomial term.

See Also

invres,numpy.poly,unique_roots

scipy.signal.residuez(b,a,tol=0.001,rtype=’avg’)

Compute partial-fraction expansion of b(z) / a(z).

If M = len(b) and N = len(a):

b(z) b[0] + b[1] z**(-1) + ... + b[M-1] z**(-M+1)

H(z) = ------ = ----------------------------------------------

a(z) a[0] + a[1] z**(-1) + ... + a[N-1] z**(-N+1)

r[0] r[-1]

= --------------- + ... + ---------------- + k[0] + k[1]z**(-1) ...

(1-p[0]z**(-1)) (1-p[-1]z**(-1))

If there are any repeated roots (closer than tol), then the partial fraction expansion has terms like:

r[i] r[i+1] r[i+n-1]

-------------- + ------------------ + ... + ------------------

(1-p[i]z**(-1)) (1-p[i]z**(-1))**2 (1-p[i]z**(-1))**n

See Also

invresz,unique_roots

scipy.signal.invres(r,p,k,tol=0.001,rtype=’avg’)

Compute b(s) and a(s) from partial fraction expansion: r,p,k

If M = len(b) and N = len(a):

b(s) b[0] x**(M-1) + b[1] x**(M-2) + ... + b[M-1]

H(s) = ------ = ----------------------------------------------

a(s) a[0] x**(N-1) + a[1] x**(N-2) + ... + a[N-1]

5.22. Signal processing (scipy.signal) 621

SciPy Reference Guide, Release 0.13.0

r[0] r[1] r[-1]

= -------- + -------- + ... + --------- + k(s)

(s-p[0]) (s-p[1]) (s-p[-1])

If there are any repeated roots (closer than tol), then the partial fraction expansion has terms like:

r[i] r[i+1] r[i+n-1]

-------- + ----------- + ... + -----------

(s-p[i]) (s-p[i])**2 (s-p[i])**n

Parameters r: ndarray

Residues.

p: ndarray

Poles.

k: ndarray

Coefﬁcients of the direct polynomial term.

tol : ﬂoat, optional

The tolerance for two roots to be considered equal. Default is 1e-3.

rtype : {‘max’, ‘min, ‘avg’}, optional

How to determine the returned root if multiple roots are within tol of each

other.

‘max’: pick the maximum of those roots.

‘min’: pick the minimum of those roots.

‘avg’: take the average of those roots.

See Also

residue,unique_roots

5.22.5 Matlab-style IIR ﬁlter design

butter(N, Wn[, btype, analog, output]) Butterworth digital and analog ﬁlter design.

buttord(wp, ws, gpass, gstop[, analog]) Butterworth ﬁlter order selection.

cheby1(N, rp, Wn[, btype, analog, output]) Chebyshev type I digital and analog ﬁlter design.

cheb1ord(wp, ws, gpass, gstop[, analog]) Chebyshev type I ﬁlter order selection.

cheby2(N, rs, Wn[, btype, analog, output]) Chebyshev type II digital and analog ﬁlter design.

cheb2ord(wp, ws, gpass, gstop[, analog]) Chebyshev type II ﬁlter order selection.

ellip(N, rp, rs, Wn[, btype, analog, output]) Elliptic (Cauer) digital and analog ﬁlter design.

ellipord(wp, ws, gpass, gstop[, analog]) Elliptic (Cauer) ﬁlter order selection.

bessel(N, Wn[, btype, analog, output]) Bessel/Thomson digital and analog ﬁlter design.

scipy.signal.butter(N,Wn,btype=’low’,analog=False,output=’ba’)

Butterworth digital and analog ﬁlter design.

Design an Nth order digital or analog Butterworth ﬁlter and return the ﬁlter coefﬁcients in (B,A) or (Z,P,K)

form.

Parameters N: int

The order of the ﬁlter.

Wn : array_like

A scalar or length-2 sequence giving the critical frequencies. For a Butter-

worth ﬁlter, this is the point at which the gain drops to 1/sqrt(2) that of the

passband (the “-3 dB point”). For digital ﬁlters, Wn is normalized from 0

622 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

to 1, where 1 is the Nyquist frequency, pi radians/sample. (Wn is thus in

half-cycles / sample.) For analog ﬁlters, Wn is an angular frequency (e.g.

rad/s).

btype : {‘lowpass’, ‘highpass’, ‘bandpass’, ‘bandstop’}, optional

The type of ﬁlter. Default is ‘lowpass’.

analog : bool, optional

When True, return an analog ﬁlter, otherwise a digital ﬁlter is returned.

output : {‘ba’, ‘zpk’}, optional

Type of output: numerator/denominator (‘ba’) or pole-zero (‘zpk’). Default

is ‘ba’.

Returns b, a : ndarray, ndarray

Numerator (b) and denominator (a) polynomials of the IIR ﬁlter. Only re-

turned if output=’ba’.

z, p, k : ndarray, ndarray, ﬂoat

Zeros, poles, and system gain of the IIR ﬁlter transfer function. Only re-

turned if output=’zpk’.

See Also

buttord

Notes

The Butterworth ﬁlter has maximally ﬂat frequency response in the passband.

Examples

Plot the ﬁlter’s frequency response, showing the critical points:

>>> from scipy import signal

>>> import matplotlib.pyplot as plt

>>> b,a=signal.butter(4,100,’low’, analog=True)

>>> w,h=signal.freqs(b, a)

>>> plt.plot(w, 20 *np.log10(abs(h)))

>>> plt.xscale(’log’)

>>> plt.title(’Butterworth filter frequency response’)

>>> plt.xlabel(’Frequency [radians / second]’)

>>> plt.ylabel(’Amplitude [dB]’)

>>> plt.margins(0,0.1)

>>> plt.grid(which=’both’, axis=’both’)

>>> plt.axvline(100, color=’green’)# cutoff frequency

>>> plt.show()

5.22. Signal processing (scipy.signal) 623

SciPy Reference Guide, Release 0.13.0

scipy.signal.buttord(wp,ws,gpass,gstop,analog=False)

Butterworth ﬁlter order selection.

Return the order of the lowest order digital or analog Butterworth ﬁlter that loses no more than gpass dB in the

passband and has at least gstop dB attenuation in the stopband.

Parameters wp, ws : ﬂoat

Passband and stopband edge frequencies. For digital ﬁlters, these are nor-

malized from 0 to 1, where 1 is the Nyquist frequency, pi radians/sample.

(wp and ws are thus in half-cycles / sample.) For example:

•Lowpass: wp = 0.2, ws = 0.3

•Highpass: wp = 0.3, ws = 0.2

•Bandpass: wp = [0.2, 0.5], ws = [0.1, 0.6]

•Bandstop: wp = [0.1, 0.6], ws = [0.2, 0.5]

For analog ﬁlters, wp and ws are angular frequencies (e.g. rad/s).

gpass : ﬂoat

The maximum loss in the passband (dB).

gstop : ﬂoat

The minimum attenuation in the stopband (dB).

analog : bool, optional

When True, return an analog ﬁlter, otherwise a digital ﬁlter is returned.

Returns ord : int

The lowest order for a Butterworth ﬁlter which meets specs.

wn : ndarray or ﬂoat

The Butterworth natural frequency (i.e. the “3dB frequency”). Should be

used with butter to give ﬁlter results.

scipy.signal.cheby1(N,rp,Wn,btype=’low’,analog=False,output=’ba’)

Chebyshev type I digital and analog ﬁlter design.

Design an Nth order digital or analog Chebyshev type I ﬁlter and return the ﬁlter coefﬁcients in (B,A) or (Z,P,K)

form.

Parameters N: int

The order of the ﬁlter.

rp : ﬂoat

The maximum ripple allowed below unity gain in the passband. Speciﬁed

in decibels, as a positive number.

Wn : array_like

624 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

A scalar or length-2 sequence giving the critical frequencies. For Type I

ﬁlters, this is the point in the transition band at which the gain ﬁrst drops

below -rp. For digital ﬁlters, Wn is normalized from 0 to 1, where 1 is the

Nyquist frequency, pi radians/sample. (Wn is thus in half-cycles / sample.)

For analog ﬁlters, Wn is an angular frequency (e.g. rad/s).

btype : {‘lowpass’, ‘highpass’, ‘bandpass’, ‘bandstop’}, optional

The type of ﬁlter. Default is ‘lowpass’.

analog : bool, optional

When True, return an analog ﬁlter, otherwise a digital ﬁlter is returned.

output : {‘ba’, ‘zpk’}, optional

Type of output: numerator/denominator (‘ba’) or pole-zero (‘zpk’). Default

is ‘ba’.

Returns b, a : ndarray, ndarray

Numerator (b) and denominator (a) polynomials of the IIR ﬁlter. Only re-

turned if output=’ba’.

z, p, k : ndarray, ndarray, ﬂoat

Zeros, poles, and system gain of the IIR ﬁlter transfer function. Only re-

turned if output=’zpk’.

See Also

cheb1ord

Notes

The Chebyshev type I ﬁlter maximizes the rate of cutoff between the frequency response’s passband and stop-

band, at the expense of ripple in the passband and increased ringing in the step response.

Type I ﬁlters roll off faster than Type II (cheby2), but Type II ﬁlters do not have any ripple in the passband.

Examples

Plot the ﬁlter’s frequency response, showing the critical points:

>>> from scipy import signal

>>> import matplotlib.pyplot as plt

>>> b,a=signal.cheby1(4,5,100,’low’, analog=True)

>>> w,h=signal.freqs(b, a)

>>> plt.plot(w, 20 *np.log10(abs(h)))

>>> plt.xscale(’log’)

>>> plt.title(’Chebyshev Type I frequency response (rp=5)’)

>>> plt.xlabel(’Frequency [radians / second]’)

>>> plt.ylabel(’Amplitude [dB]’)

>>> plt.margins(0,0.1)

>>> plt.grid(which=’both’, axis=’both’)

>>> plt.axvline(100, color=’green’)# cutoff frequency

>>> plt.axhline(-5, color=’green’)# rp

>>> plt.show()

5.22. Signal processing (scipy.signal) 625

SciPy Reference Guide, Release 0.13.0

scipy.signal.cheb1ord(wp,ws,gpass,gstop,analog=False)

Chebyshev type I ﬁlter order selection.

Return the order of the lowest order digital or analog Chebyshev Type I ﬁlter that loses no more than gpass dB

in the passband and has at least gstop dB attenuation in the stopband.

Parameters wp, ws : ﬂoat

Passband and stopband edge frequencies. For digital ﬁlters, these are nor-

malized from 0 to 1, where 1 is the Nyquist frequency, pi radians/sample.

(wp and ws are thus in half-cycles / sample.) For example:

•Lowpass: wp = 0.2, ws = 0.3

•Highpass: wp = 0.3, ws = 0.2

•Bandpass: wp = [0.2, 0.5], ws = [0.1, 0.6]

•Bandstop: wp = [0.1, 0.6], ws = [0.2, 0.5]

For analog ﬁlters, wp and ws are angular frequencies (e.g. rad/s).

gpass : ﬂoat

The maximum loss in the passband (dB).

gstop : ﬂoat

The minimum attenuation in the stopband (dB).

analog : bool, optional

When True, return an analog ﬁlter, otherwise a digital ﬁlter is returned.

Returns ord : int

The lowest order for a Chebyshev type I ﬁlter that meets specs.

wn : ndarray or ﬂoat

The Chebyshev natural frequency (the “3dB frequency”) for use with

cheby1 to give ﬁlter results.

scipy.signal.cheby2(N,rs,Wn,btype=’low’,analog=False,output=’ba’)

Chebyshev type II digital and analog ﬁlter design.

Design an Nth order digital or analog Chebyshev type II ﬁlter and return the ﬁlter coefﬁcients in (B,A) or (Z,P,K)

form.

Parameters N: int

The order of the ﬁlter.

rs : ﬂoat

The minimum attenuation required in the stop band. Speciﬁed in decibels,

as a positive number.

Wn : array_like

626 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

A scalar or length-2 sequence giving the critical frequencies. For Type II

ﬁlters, this is the point in the transition band at which the gain ﬁrst reaches

-rs. For digital ﬁlters, Wn is normalized from 0 to 1, where 1 is the Nyquist

frequency, pi radians/sample. (Wn is thus in half-cycles / sample.) For

analog ﬁlters, Wn is an angular frequency (e.g. rad/s).

btype : {‘lowpass’, ‘highpass’, ‘bandpass’, ‘bandstop’}, optional

The type of ﬁlter. Default is ‘lowpass’.

analog : bool, optional

When True, return an analog ﬁlter, otherwise a digital ﬁlter is returned.

output : {‘ba’, ‘zpk’}, optional

Type of output: numerator/denominator (‘ba’) or pole-zero (‘zpk’). Default

is ‘ba’.

Returns b, a : ndarray, ndarray

Numerator (b) and denominator (a) polynomials of the IIR ﬁlter. Only re-

turned if output=’ba’.

z, p, k : ndarray, ndarray, ﬂoat

Zeros, poles, and system gain of the IIR ﬁlter transfer function. Only re-

turned if output=’zpk’.

See Also

cheb2ord

Notes

The Chebyshev type II ﬁlter maximizes the rate of cutoff between the frequency response’s passband and stop-

band, at the expense of ripple in the stopband and increased ringing in the step response.

Type II ﬁlters do not roll off as fast as Type I (cheby1).

Examples

Plot the ﬁlter’s frequency response, showing the critical points:

>>> from scipy import signal

>>> import matplotlib.pyplot as plt

>>> b,a=signal.cheby2(4,40,100,’low’, analog=True)

>>> w,h=signal.freqs(b, a)

>>> plt.plot(w, 20 *np.log10(abs(h)))

>>> plt.xscale(’log’)

>>> plt.title(’Chebyshev Type II frequency response (rs=40)’)

>>> plt.xlabel(’Frequency [radians / second]’)

>>> plt.ylabel(’Amplitude [dB]’)

>>> plt.margins(0,0.1)

>>> plt.grid(which=’both’, axis=’both’)

>>> plt.axvline(100, color=’green’)# cutoff frequency

>>> plt.axhline(-40, color=’green’)# rs

>>> plt.show()

5.22. Signal processing (scipy.signal) 627

SciPy Reference Guide, Release 0.13.0

scipy.signal.cheb2ord(wp,ws,gpass,gstop,analog=False)

Chebyshev type II ﬁlter order selection.

Return the order of the lowest order digital or analog Chebyshev Type II ﬁlter that loses no more than gpass dB

in the passband and has at least gstop dB attenuation in the stopband.

Parameters wp, ws : ﬂoat

Passband and stopband edge frequencies. For digital ﬁlters, these are nor-

malized from 0 to 1, where 1 is the Nyquist frequency, pi radians/sample.

(wp and ws are thus in half-cycles / sample.) For example:

•Lowpass: wp = 0.2, ws = 0.3

•Highpass: wp = 0.3, ws = 0.2

•Bandpass: wp = [0.2, 0.5], ws = [0.1, 0.6]

•Bandstop: wp = [0.1, 0.6], ws = [0.2, 0.5]

For analog ﬁlters, wp and ws are angular frequencies (e.g. rad/s).

gpass : ﬂoat

The maximum loss in the passband (dB).

gstop : ﬂoat

The minimum attenuation in the stopband (dB).

analog : bool, optional

When True, return an analog ﬁlter, otherwise a digital ﬁlter is returned.

Returns ord : int

The lowest order for a Chebyshev type II ﬁlter that meets specs.

wn : ndarray or ﬂoat

The Chebyshev natural frequency (the “3dB frequency”) for use with

cheby2 to give ﬁlter results.

scipy.signal.ellip(N,rp,rs,Wn,btype=’low’,analog=False,output=’ba’)

Elliptic (Cauer) digital and analog ﬁlter design.

Design an Nth order digital or analog elliptic ﬁlter and return the ﬁlter coefﬁcients in (B,A) or (Z,P,K) form.

Parameters N: int

The order of the ﬁlter.

rp : ﬂoat

The maximum ripple allowed below unity gain in the passband. Speciﬁed

in decibels, as a positive number.

rs : ﬂoat

The minimum attenuation required in the stop band. Speciﬁed in decibels,

as a positive number.

628 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Wn : array_like

A scalar or length-2 sequence giving the critical frequencies. For elliptic

ﬁlters, this is the point in the transition band at which the gain ﬁrst drops

below -rp. For digital ﬁlters, Wn is normalized from 0 to 1, where 1 is the

Nyquist frequency, pi radians/sample. (Wn is thus in half-cycles / sample.)

For analog ﬁlters, Wn is an angular frequency (e.g. rad/s).

btype : {‘lowpass’, ‘highpass’, ‘bandpass’, ‘bandstop’}, optional

The type of ﬁlter. Default is ‘lowpass’.

analog : bool, optional

When True, return an analog ﬁlter, otherwise a digital ﬁlter is returned.

output : {‘ba’, ‘zpk’}, optional

Type of output: numerator/denominator (‘ba’) or pole-zero (‘zpk’). Default

is ‘ba’.

Returns b, a : ndarray, ndarray

Numerator (b) and denominator (a) polynomials of the IIR ﬁlter. Only re-

turned if output=’ba’.

z, p, k : ndarray, ndarray, ﬂoat

Zeros, poles, and system gain of the IIR ﬁlter transfer function. Only re-

turned if output=’zpk’.

See Also

ellipord

Notes

Also known as Cauer or Zolotarev ﬁlters, the elliptical ﬁlter maximizes the rate of transition between the fre-

quency response’s passband and stopband, at the expense of ripple in both, and increased ringing in the step

response.

As rp approaches 0, the elliptical ﬁlter becomes a Chebyshev type II ﬁlter (cheby2). As rs approaches 0, it

becomes a Chebyshev type I ﬁlter (cheby1). As both approach 0, it becomes a Butterworth ﬁlter (butter).

Examples

Plot the ﬁlter’s frequency response, showing the critical points:

>>> from scipy import signal

>>> import matplotlib.pyplot as plt

>>> b,a=signal.ellip(4,5,40,100,’low’, analog=True)

>>> w,h=signal.freqs(b, a)

>>> plt.plot(w, 20 *np.log10(abs(h)))

>>> plt.xscale(’log’)

>>> plt.title(’Elliptic filter frequency response (rp=5, rs=40)’)

>>> plt.xlabel(’Frequency [radians / second]’)

>>> plt.ylabel(’Amplitude [dB]’)

>>> plt.margins(0,0.1)

>>> plt.grid(which=’both’, axis=’both’)

>>> plt.axvline(100, color=’green’)# cutoff frequency

>>> plt.axhline(-40, color=’green’)# rs

>>> plt.axhline(-5, color=’green’)# rp

>>> plt.show()

5.22. Signal processing (scipy.signal) 629

SciPy Reference Guide, Release 0.13.0

scipy.signal.ellipord(wp,ws,gpass,gstop,analog=False)

Elliptic (Cauer) ﬁlter order selection.

Return the order of the lowest order digital or analog elliptic ﬁlter that loses no more than gpass dB in the

passband and has at least gstop dB attenuation in the stopband.

Parameters wp, ws : ﬂoat

Passband and stopband edge frequencies. For digital ﬁlters, these are nor-

malized from 0 to 1, where 1 is the Nyquist frequency, pi radians/sample.

(wp and ws are thus in half-cycles / sample.) For example:

•Lowpass: wp = 0.2, ws = 0.3

•Highpass: wp = 0.3, ws = 0.2

•Bandpass: wp = [0.2, 0.5], ws = [0.1, 0.6]

•Bandstop: wp = [0.1, 0.6], ws = [0.2, 0.5]

For analog ﬁlters, wp and ws are angular frequencies (e.g. rad/s).

gpass : ﬂoat

The maximum loss in the passband (dB).

gstop : ﬂoat

The minimum attenuation in the stopband (dB).

analog : bool, optional

When True, return an analog ﬁlter, otherwise a digital ﬁlter is returned.

Returns ord : int

The lowest order for an Elliptic (Cauer) ﬁlter that meets specs.

wn : ndarray or ﬂoat

The Chebyshev natural frequency (the “3dB frequency”) for use with

ellip to give ﬁlter results.

scipy.signal.bessel(N,Wn,btype=’low’,analog=False,output=’ba’)

Bessel/Thomson digital and analog ﬁlter design.

Design an Nth order digital or analog Bessel ﬁlter and return the ﬁlter coefﬁcients in (B,A) or (Z,P,K) form.

Parameters N: int

The order of the ﬁlter.

Wn : array_like

A scalar or length-2 sequence giving the critical frequencies. For a Bessel

ﬁlter, this is deﬁned as the point at which the asymptotes of the response are

the same as a Butterworth ﬁlter of the same order. For digital ﬁlters, Wn is

normalized from 0 to 1, where 1 is the Nyquist frequency, pi radians/sample.

630 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

(Wn is thus in half-cycles / sample.) For analog ﬁlters, Wn is an angular

frequency (e.g. rad/s).

btype : {‘lowpass’, ‘highpass’, ‘bandpass’, ‘bandstop’}, optional

The type of ﬁlter. Default is ‘lowpass’.

analog : bool, optional

When True, return an analog ﬁlter, otherwise a digital ﬁlter is returned.

output : {‘ba’, ‘zpk’}, optional

Type of output: numerator/denominator (‘ba’) or pole-zero (‘zpk’). Default

is ‘ba’.

Returns b, a : ndarray, ndarray

Numerator (b) and denominator (a) polynomials of the IIR ﬁlter. Only re-

turned if output=’ba’.

z, p, k : ndarray, ndarray, ﬂoat

Zeros, poles, and system gain of the IIR ﬁlter transfer function. Only re-

turned if output=’zpk’.

Notes

Also known as a Thomson ﬁlter, the analog Bessel ﬁlter has maximally ﬂat group delay and maximally linear

phase response, with very little ringing in the step response.

As order increases, the Bessel ﬁlter approaches a Gaussian ﬁlter.

The digital Bessel ﬁlter is generated using the bilinear transform, which does not preserve the phase response

of the analog ﬁlter. As such, it is only approximately correct at frequencies below about fs/4. To get maximally

ﬂat group delay at higher frequencies, the analog Bessel ﬁlter must be transformed using phase-preserving

techniques.

For a given Wn, the lowpass and highpass ﬁlter have the same phase vs frequency curves; they are “phase-

matched”.

Examples

Plot the ﬁlter’s frequency response, showing the ﬂat group delay and the relationship to the Butterworth’s cutoff

frequency:

>>> from scipy import signal

>>> import matplotlib.pyplot as plt

>>> b,a=signal.butter(4,100,’low’, analog=True)

>>> w,h=signal.freqs(b, a)

>>> plt.plot(w, 20 *np.log10(np.abs(h)), color=’silver’, ls=’dashed’)

>>> b,a=signal.bessel(4,100,’low’, analog=True)

>>> w,h=signal.freqs(b, a)

>>> plt.plot(w, 20 *np.log10(np.abs(h)))

>>> plt.xscale(’log’)

>>> plt.title(’Bessel filter frequency response (with Butterworth)’)

>>> plt.xlabel(’Frequency [radians / second]’)

>>> plt.ylabel(’Amplitude [dB]’)

>>> plt.margins(0,0.1)

>>> plt.grid(which=’both’, axis=’both’)

>>> plt.axvline(100, color=’green’)# cutoff frequency

>>> plt.show()

5.22. Signal processing (scipy.signal) 631

SciPy Reference Guide, Release 0.13.0

>>> plt.figure()

>>> plt.plot(w[1:], -np.diff(np.unwrap(np.angle(h)))/np.diff(w))

>>> plt.xscale(’log’)

>>> plt.title(’Bessel filter group delay’)

>>> plt.xlabel(’Frequency [radians / second]’)

>>> plt.ylabel(’Group delay [seconds]’)

>>> plt.margins(0,0.1)

>>> plt.grid(which=’both’, axis=’both’)

>>> plt.show()

5.22.6 Continuous-Time Linear Systems

freqresp(system[, w, n]) Calculate the frequency response of a continuous-time system.

lti(*args, **kwords) Linear Time Invariant class which simpliﬁes representation.

Continued on next page

632 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Table 5.110 – continued from previous page

lsim(system, U, T[, X0, interp]) Simulate output of a continuous-time linear system.

lsim2(system[, U, T, X0]) Simulate output of a continuous-time linear system, by using

impulse(system[, X0, T, N]) Impulse response of continuous-time system.

impulse2(system[, X0, T, N]) Impulse response of a single-input, continuous-time linear system.

step(system[, X0, T, N]) Step response of continuous-time system.

step2(system[, X0, T, N]) Step response of continuous-time system.

bode(system[, w, n]) Calculate Bode magnitude and phase data of a continuous-time system.

scipy.signal.freqresp(system,w=None,n=10000)

Calculate the frequency response of a continuous-time system.

Parameters system : an instance of the LTI class or a tuple describing the system.

The following gives the number of elements in the tuple and the interpreta-

tion:

•2 (num, den)

•3 (zeros, poles, gain)

•4 (A, B, C, D)

w: array_like, optional

Array of frequencies (in rad/s). Magnitude and phase data is calculated for

every value in this array. If not given a reasonable set will be calculated.

n: int, optional

Number of frequency points to compute if wis not given. The nfrequencies

are logarithmically spaced in an interval chosen to include the inﬂuence of

the poles and zeros of the system.

Returns w: 1D ndarray

Frequency array [rad/s]

H: 1D ndarray

Array of complex magnitude values

Examples

# Generating the Nyquist plot of a transfer function

>>> from scipy import signal

>>> import matplotlib.pyplot as plt

>>> s1 =signal.lti([], [1,1,1], [5])

# transfer function: H(s) = 5 / (s-1)^3

>>> w,H=signal.freqresp(s1)

>>> plt.figure()

>>> plt.plot(H.real, H.imag, "b")

>>> plt.plot(H.real, -H.imag, "r")

>>> plt.show()

5.22. Signal processing (scipy.signal) 633

SciPy Reference Guide, Release 0.13.0

class scipy.signal.lti(*args,**kwords)

Linear Time Invariant class which simpliﬁes representation.

Parameters args : arguments

The lti class can be instantiated with either 2, 3 or 4 arguments. The

following gives the number of elements in the tuple and the interpretation:

•2: (numerator, denominator)

•3: (zeros, poles, gain)

•4: (A, B, C, D)

Each argument can be an array or sequence.

Notes

lti instances have all types of representations available; for example after creating an instance s with (zeros,

poles, gain) the transfer function representation (numerator, denominator) can be accessed as s.num and

s.den.

Attributes

den

gain

num

poles

zeros

lti.A

lti.B

lti.C

634 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

lti.D

lti.den

lti.gain

lti.num

lti.poles

lti.zeros

Methods

bode([w, n]) Calculate Bode magnitude and phase data.

freqresp([w, n]) Calculate the frequency response of a continuous-time system.

impulse([X0, T, N])

output(U, T[, X0])

step([X0, T, N])

lti.bode(w=None,n=100)

Calculate Bode magnitude and phase data.

Returns a 3-tuple containing arrays of frequencies [rad/s], magnitude [dB] and phase [deg]. See

scipy.signal.bode for details. New in version 0.11.0.

Examples

>>> from scipy import signal

>>> import matplotlib.pyplot as plt

>>> s1 =signal.lti([1], [1,1])

>>> w, mag, phase =s1.bode()

>>> plt.figure()

>>> plt.semilogx(w, mag) # Bode magnitude plot

>>> plt.figure()

>>> plt.semilogx(w, phase) # Bode phase plot

>>> plt.show()

5.22. Signal processing (scipy.signal) 635

SciPy Reference Guide, Release 0.13.0

lti.freqresp(w=None,n=10000)

Calculate the frequency response of a continuous-time system.

Returns a 2-tuple containing arrays of frequencies [rad/s] and complex magnitude. See

scipy.signal.freqresp for details.

lti.impulse(X0=None,T=None,N=None)

lti.output(U,T,X0=None)

lti.step(X0=None,T=None,N=None)

scipy.signal.lsim(system,U,T,X0=None,interp=1)

Simulate output of a continuous-time linear system.

Parameters system : an instance of the LTI class or a tuple describing the system.

636 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

The following gives the number of elements in the tuple and the interpreta-

tion:

•2: (num, den)

•3: (zeros, poles, gain)

•4: (A, B, C, D)

U: array_like

An input array describing the input at each time T(interpolation is assumed

between given times). If there are multiple inputs, then each column of the

rank-2 array represents an input.

T: array_like

The time steps at which the input is deﬁned and at which the output is

desired.

X0 :

The initial conditions on the state vector (zero by default).

interp : {1, 0}

Whether to use linear (1) or zero-order hold (0) interpolation.

Returns T: 1D ndarray

Time values for the output.

yout : 1D ndarray

System response.

xout : ndarray

Time-evolution of the state-vector.

scipy.signal.lsim2(system,U=None,T=None,X0=None,**kwargs)

Simulate output of a continuous-time linear system, by using the ODE solver scipy.integrate.odeint.

Parameters system : an instance of the LTI class or a tuple describing the system.

The following gives the number of elements in the tuple and the interpreta-

tion:

•2: (num, den)

•3: (zeros, poles, gain)

•4: (A, B, C, D)

U: array_like (1D or 2D), optional

An input array describing the input at each time T. Linear interpolation is

used between given times. If there are multiple inputs, then each column of

the rank-2 array represents an input. If U is not given, the input is assumed

to be zero.

T: array_like (1D or 2D), optional

The time steps at which the input is deﬁned and at which the output is

desired. The default is 101 evenly spaced points on the interval [0,10.0].

X0 : array_like (1D), optional

The initial condition of the state vector. If X0 is not given, the initial condi-

tions are assumed to be 0.

kwargs : dict

Additional keyword arguments are passed on to the function odeint. See the

notes below for more details.

Returns T: 1D ndarray

The time values for the output.

yout : ndarray

The response of the system.

xout : ndarray

The time-evolution of the state-vector.

Notes

This function uses scipy.integrate.odeint to solve the system’s differential equations. Ad-

ditional keyword arguments given to lsim2 are passed on to odeint. See the documentation for

scipy.integrate.odeint for the full list of arguments.

5.22. Signal processing (scipy.signal) 637

SciPy Reference Guide, Release 0.13.0

scipy.signal.impulse(system,X0=None,T=None,N=None)

Impulse response of continuous-time system.

Parameters system : an instance of the LTI class or a tuple of array_like

describing the system. The following gives the number of elements in the

tuple and the interpretation:

•2 (num, den)

•3 (zeros, poles, gain)

•4 (A, B, C, D)

X0 : array_like, optional

Initial state-vector. Defaults to zero.

T: array_like, optional

Time points. Computed if not given.

N: int, optional

The number of time points to compute (if Tis not given).

Returns T: ndarray

A 1-D array of time points.

yout : ndarray

A 1-D array containing the impulse response of the system (except for sin-

gularities at zero).

scipy.signal.impulse2(system,X0=None,T=None,N=None,**kwargs)

Impulse response of a single-input, continuous-time linear system.

Parameters system : an instance of the LTI class or a tuple of array_like

describing the system. The following gives the number of elements in the

tuple and the interpretation:

•2 (num, den)

•3 (zeros, poles, gain)

•4 (A, B, C, D)

X0 : 1-D array_like, optional

The initial condition of the state vector. Default: 0 (the zero vector).

T: 1-D array_like, optional

The time steps at which the input is deﬁned and at which the output is

desired. If Tis not given, the function will generate a set of time samples

automatically.

N: int, optional

Number of time points to compute. Default: 100.

kwargs : various types

Additional keyword arguments are passed on to the function

scipy.signal.lsim2, which in turn passes them on to

scipy.integrate.odeint; see the latter’s documentation for

information about these arguments.

Returns T: ndarray

The time values for the output.

yout : ndarray

The output response of the system.

See Also

impulse,lsim2,integrate.odeint

Notes

The solution is generated by calling scipy.signal.lsim2, which uses the differential equation solver

scipy.integrate.odeint. New in version 0.8.0.

Examples

Second order system with a repeated root: x’‘(t) + 2*x(t) + x(t) = u(t)

638 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

>>> from scipy import signal

>>> system =([1.0], [1.0,2.0,1.0])

>>> t,y=signal.impulse2(system)

>>> import matplotlib.pyplot as plt

>>> plt.plot(t, y)

scipy.signal.step(system,X0=None,T=None,N=None)

Step response of continuous-time system.

Parameters system : an instance of the LTI class or a tuple of array_like

describing the system. The following gives the number of elements in the

tuple and the interpretation:

•2 (num, den)

•3 (zeros, poles, gain)

•4 (A, B, C, D)

X0 : array_like, optional

Initial state-vector (default is zero).

T: array_like, optional

Time points (computed if not given).

N: int

Number of time points to compute if Tis not given.

Returns T: 1D ndarray

Output time points.

yout : 1D ndarray

Step response of system.

See Also

scipy.signal.step2

scipy.signal.step2(system,X0=None,T=None,N=None,**kwargs)

Step response of continuous-time system.

This function is functionally the same as scipy.signal.step, but it uses the function

scipy.signal.lsim2 to compute the step response.

Parameters system : an instance of the LTI class or a tuple of array_like

describing the system. The following gives the number of elements in the

tuple and the interpretation:

5.22. Signal processing (scipy.signal) 639

SciPy Reference Guide, Release 0.13.0

•2 (num, den)

•3 (zeros, poles, gain)

•4 (A, B, C, D)

X0 : array_like, optional

Initial state-vector (default is zero).

T: array_like, optional

Time points (computed if not given).

N: int

Number of time points to compute if Tis not given.

kwargs : various types

Additional keyword arguments are passed on the function

scipy.signal.lsim2, which in turn passes them on to

scipy.integrate.odeint. See the documentation for

scipy.integrate.odeint for information about these arguments.

Returns T: 1D ndarray

Output time points.

yout : 1D ndarray

Step response of system.

See Also

scipy.signal.step

Notes

New in version 0.8.0.

scipy.signal.bode(system,w=None,n=100)

Calculate Bode magnitude and phase data of a continuous-time system. New in version 0.11.0.

Parameters system : an instance of the LTI class or a tuple describing the system.

The following gives the number of elements in the tuple and the interpreta-

tion:

•2 (num, den)

•3 (zeros, poles, gain)

•4 (A, B, C, D)

w: array_like, optional

Array of frequencies (in rad/s). Magnitude and phase data is calculated for

every value in this array. If not given a reasonable set will be calculated.

n: int, optional

Number of frequency points to compute if wis not given. The nfrequencies

are logarithmically spaced in an interval chosen to include the inﬂuence of

the poles and zeros of the system.

Returns w: 1D ndarray

Frequency array [rad/s]

mag : 1D ndarray

Magnitude array [dB]

phase : 1D ndarray

Phase array [deg]

Examples

>>> from scipy import signal

>>> import matplotlib.pyplot as plt

>>> s1 =signal.lti([1], [1,1])

>>> w, mag, phase =signal.bode(s1)

640 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

>>> plt.figure()

>>> plt.semilogx(w, mag) # Bode magnitude plot

>>> plt.figure()

>>> plt.semilogx(w, phase) # Bode phase plot

>>> plt.show()

5.22.7 Discrete-Time Linear Systems

dlsim(system, u[, t, x0]) Simulate output of a discrete-time linear system.

dimpulse(system[, x0, t, n]) Impulse response of discrete-time system.

dstep(system[, x0, t, n]) Step response of discrete-time system.

scipy.signal.dlsim(system,u,t=None,x0=None)

Simulate output of a discrete-time linear system.

5.22. Signal processing (scipy.signal) 641

SciPy Reference Guide, Release 0.13.0

Parameters system : class instance or tuple

An instance of the LTI class, or a tuple describing the system. The following

gives the number of elements in the tuple and the interpretation:

•3: (num, den, dt)

•4: (zeros, poles, gain, dt)

•5: (A, B, C, D, dt)

u: array_like

An input array describing the input at each time t(interpolation is assumed

between given times). If there are multiple inputs, then each column of the

rank-2 array represents an input.

t: array_like, optional

The time steps at which the input is deﬁned. If tis given, the ﬁnal value in

tdetermines the number of steps returned in the output.

x0 : arry_like, optional

The initial conditions on the state vector (zero by default).

Returns tout : ndarray

Time values for the output, as a 1-D array.

yout : ndarray

System response, as a 1-D array.

xout : ndarray, optional

Time-evolution of the state-vector. Only generated if the input is a state-

space systems.

See Also

lsim,dstep,dimpulse,cont2discrete

Examples

A simple integrator transfer function with a discrete time step of 1.0 could be implemented as:

>>> from scipy import signal

>>> tf =([1.0,], [1.0,-1.0], 1.0)

>>> t_in =[0.0,1.0,2.0,3.0]

>>> u=np.asarray([0.0,0.0,1.0,1.0])

>>> t_out, y =signal.dlsim(tf, u, t=t_in)

>>> y

array([ 0., 0., 0., 1.])

scipy.signal.dimpulse(system,x0=None,t=None,n=None)

Impulse response of discrete-time system.

Parameters system : tuple

The following gives the number of elements in the tuple and the interpreta-

tion:

•3: (num, den, dt)

•4: (zeros, poles, gain, dt)

•5: (A, B, C, D, dt)

x0 : array_like, optional

Initial state-vector. Defaults to zero.

t: array_like, optional

Time points. Computed if not given.

n: int, optional

The number of time points to compute (if tis not given).

Returns t: ndarray

A 1-D array of time points.

yout : tuple of array_like

Impulse response of system. Each element of the tuple represents the output

of the system based on an impulse in each input.

642 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

See Also

impulse,dstep,dlsim,cont2discrete

scipy.signal.dstep(system,x0=None,t=None,n=None)

Step response of discrete-time system.

Parameters system : a tuple describing the system.

The following gives the number of elements in the tuple and the interpreta-

tion:

•3: (num, den, dt)

•4: (zeros, poles, gain, dt)

•5: (A, B, C, D, dt)

x0 : array_like, optional

Initial state-vector (default is zero).

t: array_like, optional

Time points (computed if not given).

n: int, optional

Number of time points to compute if tis not given.

Returns t: ndarray

Output time points, as a 1-D array.

yout : tuple of array_like

Step response of system. Each element of the tuple represents the output of

the system based on a step response to each input.

See Also

step,dimpulse,dlsim,cont2discrete

5.22.8 LTI Representations

tf2zpk(b, a) Return zero, pole, gain (z,p,k) representation from a numerator, denominator representation of a linear ﬁlter.

zpk2tf(z, p, k) Return polynomial transfer function representation from zeros

tf2ss(num, den) Transfer function to state-space representation.

ss2tf(A, B, C, D[, input]) State-space to transfer function.

zpk2ss(z, p, k) Zero-pole-gain representation to state-space representation

ss2zpk(A, B, C, D[, input]) State-space representation to zero-pole-gain representation.

cont2discrete(sys, dt[, method, alpha]) Transform a continuous to a discrete state-space system.

scipy.signal.tf2zpk(b,a)

Return zero, pole, gain (z,p,k) representation from a numerator, denominator representation of a linear ﬁlter.

Parameters b: ndarray

Numerator polynomial.

a: ndarray

Denominator polynomial.

Returns z: ndarray

Zeros of the transfer function.

p: ndarray

Poles of the transfer function.

k: ﬂoat

System gain.

Notes

If some values of bare too close to 0, they are removed. In that case, a BadCoefﬁcients warning is emitted.

5.22. Signal processing (scipy.signal) 643

SciPy Reference Guide, Release 0.13.0

scipy.signal.zpk2tf(z,p,k)

Return polynomial transfer function representation from zeros and poles

Parameters z: ndarray

Zeros of the transfer function.

p: ndarray

Poles of the transfer function.

k: ﬂoat

System gain.

Returns b: ndarray

Numerator polynomial.

a: ndarray

Denominator polynomial.

scipy.signal.tf2ss(num,den)

Transfer function to state-space representation.

Parameters num, den : array_like

Sequences representing the numerator and denominator polynomials. The

denominator needs to be at least as long as the numerator.

Returns A, B, C, D : ndarray

State space representation of the system.

scipy.signal.ss2tf(A,B,C,D,input=0)

State-space to transfer function.

Parameters A, B, C, D : ndarray

State-space representation of linear system.

input : int, optional

For multiple-input systems, the input to use.

Returns num, den : 1D ndarray

Numerator and denominator polynomials (as sequences) respectively.

scipy.signal.zpk2ss(z,p,k)

Zero-pole-gain representation to state-space representation

Parameters z, p : sequence

Zeros and poles.

k: ﬂoat

System gain.

Returns A, B, C, D : ndarray

State-space matrices.

scipy.signal.ss2zpk(A,B,C,D,input=0)

State-space representation to zero-pole-gain representation.

Parameters A, B, C, D : ndarray

State-space representation of linear system.

input : int, optional

For multiple-input systems, the input to use.

Returns z, p : sequence

Zeros and poles.

k: ﬂoat

System gain.

scipy.signal.cont2discrete(sys,dt,method=’zoh’,alpha=None)

Transform a continuous to a discrete state-space system.

Parameters sys : a tuple describing the system.

The following gives the number of elements in the tuple and the interpreta-

tion:

•2: (num, den)

644 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

•3: (zeros, poles, gain)

•4: (A, B, C, D)

dt : ﬂoat

The discretization time step.

method : {“gbt”, “bilinear”, “euler”, “backward_diff”, “zoh”}

Which method to use:

•gbt: generalized bilinear transformation

•bilinear: Tustin’s approximation (“gbt” with alpha=0.5)

•euler: Euler (or forward differencing) method (“gbt” with

alpha=0)

•backward_diff: Backwards differencing (“gbt” with al-

pha=1.0)

•zoh: zero-order hold (default)

alpha : ﬂoat within [0, 1]

The generalized bilinear transformation weighting parameter, which should

only be speciﬁed with method=”gbt”, and is ignored otherwise

Returns sysd : tuple containing the discrete system

Based on the input type, the output will be of the form

•(num, den, dt) for transfer function input

•(zeros, poles, gain, dt) for zeros-poles-gain input

•(A, B, C, D, dt) for state-space system input

Notes

By default, the routine uses a Zero-Order Hold (zoh) method to perform the transformation. Alternatively, a

generalized bilinear transformation may be used, which includes the common Tustin’s bilinear approximation,

an Euler’s method technique, or a backwards differencing technique.

The Zero-Order Hold (zoh) method is based on [R111], the generalized bilinear approximation is based on

[R112] and [R113].

References

[R111],[R112],[R113]

5.22.9 Waveforms

chirp(t, f0, t1, f1[, method, phi, vertex_zero]) Frequency-swept cosine generator.

gausspulse(t[, fc, bw, bwr, tpr, retquad, ...]) Return a Gaussian modulated sinusoid:

sawtooth(t[, width]) Return a periodic sawtooth or triangle waveform.

square(t[, duty]) Return a periodic square-wave waveform.

sweep_poly(t, poly[, phi]) Frequency-swept cosine generator, with a time-dependent frequency.

scipy.signal.chirp(t,f0,t1,f1,method=’linear’,phi=0,vertex_zero=True)

Frequency-swept cosine generator.

In the following, ‘Hz’ should be interpreted as ‘cycles per unit’; there is no requirement here that the unit is

one second. The important distinction is that the units of rotation are cycles, not radians. Likewise, tcould be a

measurement of space instead of time.

Parameters t: ndarray

Times at which to evaluate the waveform.

f0 : ﬂoat

Frequency (e.g. Hz) at time t=0.

t1 : ﬂoat

Time at which f1 is speciﬁed.

f1 : ﬂoat

Frequency (e.g. Hz) of the waveform at time t1.

5.22. Signal processing (scipy.signal) 645

SciPy Reference Guide, Release 0.13.0

method : {‘linear’, ‘quadratic’, ‘logarithmic’, ‘hyperbolic’}, optional

Kind of frequency sweep. If not given, linear is assumed. See Notes below

for more details.

phi : ﬂoat, optional

Phase offset, in degrees. Default is 0.

vertex_zero : bool, optional

This parameter is only used when method is ‘quadratic’. It determines

whether the vertex of the parabola that is the graph of the frequency is at

t=0 or t=t1.

Returns y: ndarray

A numpy array containing the signal evaluated at twith the requested

time-varying frequency. More precisely, the function returns cos(phase

+ (pi/180)*phi) where phase is the integral (from 0 to t) of

2*pi*f(t).f(t) is deﬁned below.

See Also

sweep_poly

Notes

There are four options for the method. The following formulas give the instantaneous frequency (in Hz) of the

signal generated by chirp(). For convenience, the shorter names shown below may also be used.

linear, lin, li:

f(t) = f0 + (f1 - f0) *t / t1

quadratic, quad, q:

The graph of the frequency f(t) is a parabola through (0, f0) and (t1, f1). By default, the vertex of the

parabola is at (0, f0). If vertex_zero is False, then the vertex is at (t1, f1). The formula is:

if vertex_zero is True:

f(t) = f0 + (f1 - f0) *t**2 / t1**2

else:

f(t) = f1 - (f1 - f0) *(t1 - t)**2 / t1**2

To use a more general quadratic function, or an arbitrary polynomial, use the function

scipy.signal.waveforms.sweep_poly.

logarithmic, log, lo:

f(t) = f0 *(f1/f0)**(t/t1)

f0 and f1 must be nonzero and have the same sign.

This signal is also known as a geometric or exponential chirp.

hyperbolic, hyp:

f(t) = f0*f1*t1 / ((f0 - f1)*t + f1*t1)

f1 must be positive, and f0 must be greater than f1.

scipy.signal.gausspulse(t,fc=1000,bw=0.5,bwr=-6,tpr=-60,retquad=False,retenv=False)

Return a Gaussian modulated sinusoid:

exp(-a t^2) exp(1j*2*pi*fc*t).

If retquad is True, then return the real and imaginary parts (in-phase and quadrature). If retenv is True, then

return the envelope (unmodulated signal). Otherwise, return the real part of the modulated sinusoid.

Parameters t: ndarray or the string ‘cutoff’

Input array.

fc : int, optional

Center frequency (e.g. Hz). Default is 1000.

646 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

bw : ﬂoat, optional

Fractional bandwidth in frequency domain of pulse (e.g. Hz). Default is

0.5.

bwr : ﬂoat, optional

Reference level at which fractional bandwidth is calculated (dB). Default is

-6.

tpr : ﬂoat, optional

If tis ‘cutoff’, then the function returns the cutoff time for when the pulse

amplitude falls below tpr (in dB). Default is -60.

retquad : bool, optional

If True, return the quadrature (imaginary) as well as the real part of the

signal. Default is False.

retenv : bool, optional

If True, return the envelope of the signal. Default is False.

Returns yI : ndarray

Real part of signal. Always returned.

yQ : ndarray

Imaginary part of signal. Only returned if retquad is True.

yenv : ndarray

Envelope of signal. Only returned if retenv is True.

See Also

scipy.signal.morlet

Examples

Plot real component, imaginary component, and envelope for a 5 Hz pulse, sampled at 100 Hz for 2 seconds:

>>> from scipy import signal

>>> import matplotlib.pyplot as plt

>>> t=np.linspace(-1,1,2*100, endpoint=False)

>>> i,q,e=signal.gausspulse(t, fc=5, retquad=True, retenv=True)

>>> plt.plot(t, i, t, q, t, e, ’--’)

scipy.signal.sawtooth(t,width=1)

Return a periodic sawtooth or triangle waveform.

5.22. Signal processing (scipy.signal) 647

SciPy Reference Guide, Release 0.13.0

The sawtooth waveform has a period 2*pi, rises from -1 to 1 on the interval 0 to width*2*pi, then drops

from 1 to -1 on the interval width*2*pi to 2*pi.width must be in the interval [0, 1].

Note that this is not band-limited. It produces an inﬁnite number of harmonics, which are aliased back and forth

across the frequency spectrum.

Parameters t: array_like

Time.

width : array_like, optional

Width of the rising ramp as a proportion of the total cycle. Default is 1,

producing a rising ramp, while 0 produces a falling ramp. t= 0.5 produces

a triangle wave. If an array, causes wave shape to change over time, and

must be the same length as t.

Returns y: ndarray

Output array containing the sawtooth waveform.

Examples

A 5 Hz waveform sampled at 500 Hz for 1 second:

>>> from scipy import signal

>>> import matplotlib.pyplot as plt

>>> t=np.linspace(0,1,500)

>>> plt.plot(t, signal.sawtooth(2*np.pi *5*t))

scipy.signal.square(t,duty=0.5)

Return a periodic square-wave waveform.

The square wave has a period 2*pi, has value +1 from 0 to 2*pi*duty and -1 from 2*pi*duty to 2*pi.

duty must be in the interval [0,1].

Note that this is not band-limited. It produces an inﬁnite number of harmonics, which are aliased back and forth

across the frequency spectrum.

Parameters t: array_like

The input time array.

duty : array_like, optional

Duty cycle. Default is 0.5 (50% duty cycle). If an array, causes wave shape

to change over time, and must be the same length as t.

Returns y: ndarray

648 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Output array containing the square waveform.

Examples

A 5 Hz waveform sampled at 500 Hz for 1 second:

>>> from scipy import signal

>>> import matplotlib.pyplot as plt

>>> t=np.linspace(0,1,500, endpoint=False)

>>> plt.plot(t, signal.square(2*np.pi *5*t))

>>> plt.ylim(-2,2)

A pulse-width modulated sine wave:

>>> plt.figure()

>>> sig =np.sin(2*np.pi *t)

>>> pwm =signal.square(2*np.pi *30 *t, duty=(sig +1)/2)

>>> plt.subplot(2,1,1)

>>> plt.plot(t, sig)

>>> plt.subplot(2,1,2)

>>> plt.plot(t, pwm)

>>> plt.ylim(-1.5,1.5)

5.22. Signal processing (scipy.signal) 649

SciPy Reference Guide, Release 0.13.0

scipy.signal.sweep_poly(t,poly,phi=0)

Frequency-swept cosine generator, with a time-dependent frequency.

This function generates a sinusoidal function whose instantaneous frequency varies with time. The frequency at

time tis given by the polynomial poly.

Parameters t: ndarray

Times at which to evaluate the waveform.

poly : 1-D array-like or instance of numpy.poly1d

The desired frequency expressed as a polynomial. If poly is a list or ndarray

of length n, then the elements of poly are the coefﬁcients of the polynomial,

and the instantaneous frequency is

f(t) = poly[0]*t**(n-1) +

poly[1]*t**(n-2) + ... + poly[n-1]

If poly is an instance of numpy.poly1d, then the instantaneous frequency is

f(t) = poly(t)

phi : ﬂoat, optional

Phase offset, in degrees, Default: 0.

Returns sweep_poly : ndarray

A numpy array containing the signal evaluated at twith the requested time-

varying frequency. More precisely, the function returns cos(phase +

(pi/180)*phi), where phase is the integral (from 0 to t) of 2*pi

*f(t);f(t) is deﬁned above.

See Also

chirp

Notes

New in version 0.8.0. If poly is a list or ndarray of length n, then the elements of poly are the coefﬁcients of the

polynomial, and the instantaneous frequency is:

f(t) = poly[0]*t**(n-1) + poly[1]*t**(n-2) + ... + poly[n-1]

If poly is an instance of numpy.poly1d, then the instantaneous frequency is:

f(t) = poly(t)

Finally, the output sis:

650 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

cos(phase + (pi/180)*phi)

where phase is the integral from 0 to tof 2*pi *f(t),f(t) as deﬁned above.

5.22.10 Window functions

get_window(window, Nx[, fftbins]) Return a window.

barthann(M[, sym]) Return a modiﬁed Bartlett-Hann window.

bartlett(M[, sym]) Return a Bartlett window.

blackman(M[, sym]) Return a Blackman window.

blackmanharris(M[, sym]) Return a minimum 4-term Blackman-Harris window.

bohman(M[, sym]) Return a Bohman window.

boxcar(M[, sym]) Return a boxcar or rectangular window.

chebwin(M, at[, sym]) Return a Dolph-Chebyshev window.

cosine(M[, sym]) Return a window with a simple cosine shape.

flattop(M[, sym]) Return a ﬂat top window.

gaussian(M, std[, sym]) Return a Gaussian window.

general_gaussian(M, p, sig[, sym]) Return a window with a generalized Gaussian shape.

hamming(M[, sym]) Return a Hamming window.

hann(M[, sym]) Return a Hann window.

kaiser(M, beta[, sym]) Return a Kaiser window.

nuttall(M[, sym]) Return a minimum 4-term Blackman-Harris window according to Nuttall.

parzen(M[, sym]) Return a Parzen window.

slepian(M, width[, sym]) Return a digital Slepian (DPSS) window.

triang(M[, sym]) Return a triangular window.

scipy.signal.get_window(window,Nx,fftbins=True)

Return a window.

Parameters window : string, ﬂoat, or tuple

The type of window to create. See below for more details.

Nx : int

The number of samples in the window.

fftbins : bool, optional

If True, create a “periodic” window ready to use with ifftshift and be multi-

plied by the result of an fft (SEE ALSO fftfreq).

Returns get_window : ndarray

Returns a window of length Nx and type window

Notes

Window types:

boxcar, triang, blackman, hamming, hann, bartlett, ﬂattop, parzen, bohman, blackmanharris, nuttall,

barthann, kaiser (needs beta), gaussian (needs std), general_gaussian (needs power, width), slepian (needs

width), chebwin (needs attenuation)

If the window requires no parameters, then window can be a string.

If the window requires parameters, then window must be a tuple with the ﬁrst argument the string name of the

window, and the next arguments the needed parameters.

If window is a ﬂoating point number, it is interpreted as the beta parameter of the kaiser window.

Each of the window types listed above is also the name of a function that can be called directly to create a

window of that type.

5.22. Signal processing (scipy.signal) 651

SciPy Reference Guide, Release 0.13.0

Examples

>>> from scipy import signal

>>> signal.get_window(’triang’,7)

array([ 0.25, 0.5 , 0.75, 1. , 0.75, 0.5 , 0.25])

>>> signal.get_window((’kaiser’,4.0), 9)

array([ 0.08848053, 0.32578323, 0.63343178, 0.89640418, 1. ,

0.89640418, 0.63343178, 0.32578323, 0.08848053])

>>> signal.get_window(4.0,9)

array([ 0.08848053, 0.32578323, 0.63343178, 0.89640418, 1. ,

0.89640418, 0.63343178, 0.32578323, 0.08848053])

scipy.signal.barthann(M,sym=True)

Return a modiﬁed Bartlett-Hann window.

Parameters M: int

Number of points in the output window. If zero or less, an empty array is

returned.

sym : bool, optional

When True (default), generates a symmetric window, for use in ﬁlter design.

When False, generates a periodic window, for use in spectral analysis.

Returns w: ndarray

The window, with the maximum value normalized to 1 (though the value 1

does not appear if the number of samples is even and sym is True).

Examples

Plot the window and its frequency response:

>>> from scipy import signal

>>> from scipy.fftpack import fft, fftshift

>>> import matplotlib.pyplot as plt

>>> window =signal.barthann(51)

>>> plt.plot(window)

>>> plt.title("Bartlett-Hann window")

>>> plt.ylabel("Amplitude")

>>> plt.xlabel("Sample")

>>> plt.figure()

>>> A=fft(window, 2048)/(len(window)/2.0)

>>> freq =np.linspace(-0.5,0.5,len(A))

>>> response =20 *np.log10(np.abs(fftshift(A /abs(A).max())))

>>> plt.plot(freq, response)

>>> plt.axis([-0.5,0.5,-120,0])

>>> plt.title("Frequency response of the Bartlett-Hann window")

>>> plt.ylabel("Normalized magnitude [dB]")

>>> plt.xlabel("Normalized frequency [cycles per sample]")

652 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.signal.bartlett(M,sym=True)

Return a Bartlett window.

The Bartlett window is very similar to a triangular window, except that the end points are at zero. It is often

used in signal processing for tapering a signal, without generating too much ripple in the frequency domain.

Parameters M: int

Number of points in the output window. If zero or less, an empty array is

returned.

sym : bool, optional

When True (default), generates a symmetric window, for use in ﬁlter design.

When False, generates a periodic window, for use in spectral analysis.

Returns w: ndarray

The triangular window, with the maximum value normalized to 1 (though

the value 1 does not appear if the number of samples is even and sym is

True), with the ﬁrst and last samples equal to zero.

5.22. Signal processing (scipy.signal) 653

SciPy Reference Guide, Release 0.13.0

Notes

The Bartlett window is deﬁned as

w(n) = 2

M−1M−1

2−

n−M−1

2

Most references to the Bartlett window come from the signal processing literature, where it is used as one of

many windowing functions for smoothing values. Note that convolution with this window produces linear in-

terpolation. It is also known as an apodization (which means”removing the foot”, i.e. smoothing discontinuities

at the beginning and end of the sampled signal) or tapering function. The fourier transform of the Bartlett is the

product of two sinc functions. Note the excellent discussion in Kanasewich.

References

[R104],[R105],[R106],[R107],[R108]

Examples

Plot the window and its frequency response:

>>> from scipy import signal

>>> from scipy.fftpack import fft, fftshift

>>> import matplotlib.pyplot as plt

>>> window =signal.bartlett(51)

>>> plt.plot(window)

>>> plt.title("Bartlett window")

>>> plt.ylabel("Amplitude")

>>> plt.xlabel("Sample")

>>> plt.figure()

>>> A=fft(window, 2048)/(len(window)/2.0)

>>> freq =np.linspace(-0.5,0.5,len(A))

>>> response =20 *np.log10(np.abs(fftshift(A /abs(A).max())))

>>> plt.plot(freq, response)

>>> plt.axis([-0.5,0.5,-120,0])

>>> plt.title("Frequency response of the Bartlett window")

>>> plt.ylabel("Normalized magnitude [dB]")

>>> plt.xlabel("Normalized frequency [cycles per sample]")

654 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.signal.blackman(M,sym=True)

Return a Blackman window.

The Blackman window is a taper formed by using the the ﬁrst three terms of a summation of cosines. It was

designed to have close to the minimal leakage possible. It is close to optimal, only slightly worse than a Kaiser

window.

Parameters M: int

Number of points in the output window. If zero or less, an empty array is

returned.

sym : bool, optional

When True (default), generates a symmetric window, for use in ﬁlter design.

When False, generates a periodic window, for use in spectral analysis.

Returns w: ndarray

The window, with the maximum value normalized to 1 (though the value 1

does not appear if the number of samples is even and sym is True).

5.22. Signal processing (scipy.signal) 655

SciPy Reference Guide, Release 0.13.0

Notes

The Blackman window is deﬁned as

w(n)=0.42 −0.5 cos(2πn/M)+0.08 cos(4πn/M)

Most references to the Blackman window come from the signal processing literature, where it is used as one of

many windowing functions for smoothing values. It is also known as an apodization (which means “removing

the foot”, i.e. smoothing discontinuities at the beginning and end of the sampled signal) or tapering function. It

is known as a “near optimal” tapering function, almost as good (by some measures) as the Kaiser window.

References

[R109],[R110]

Examples

Plot the window and its frequency response:

>>> from scipy import signal

>>> from scipy.fftpack import fft, fftshift

>>> import matplotlib.pyplot as plt

>>> window =signal.blackman(51)

>>> plt.plot(window)

>>> plt.title("Blackman window")

>>> plt.ylabel("Amplitude")

>>> plt.xlabel("Sample")

>>> plt.figure()

>>> A=fft(window, 2048)/(len(window)/2.0)

>>> freq =np.linspace(-0.5,0.5,len(A))

>>> response =20 *np.log10(np.abs(fftshift(A /abs(A).max())))

>>> plt.plot(freq, response)

>>> plt.axis([-0.5,0.5,-120,0])

>>> plt.title("Frequency response of the Blackman window")

>>> plt.ylabel("Normalized magnitude [dB]")

>>> plt.xlabel("Normalized frequency [cycles per sample]")

656 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.signal.blackmanharris(M,sym=True)

Return a minimum 4-term Blackman-Harris window.

Parameters M: int

Number of points in the output window. If zero or less, an empty array is

returned.

sym : bool, optional

When True (default), generates a symmetric window, for use in ﬁlter design.

When False, generates a periodic window, for use in spectral analysis.

Returns w: ndarray

The window, with the maximum value normalized to 1 (though the value 1

does not appear if the number of samples is even and sym is True).

Examples

Plot the window and its frequency response:

5.22. Signal processing (scipy.signal) 657

SciPy Reference Guide, Release 0.13.0

>>> from scipy import signal

>>> from scipy.fftpack import fft, fftshift

>>> import matplotlib.pyplot as plt

>>> window =signal.blackmanharris(51)

>>> plt.plot(window)

>>> plt.title("Blackman-Harris window")

>>> plt.ylabel("Amplitude")

>>> plt.xlabel("Sample")

>>> plt.figure()

>>> A=fft(window, 2048)/(len(window)/2.0)

>>> freq =np.linspace(-0.5,0.5,len(A))

>>> response =20 *np.log10(np.abs(fftshift(A /abs(A).max())))

>>> plt.plot(freq, response)

>>> plt.axis([-0.5,0.5,-120,0])

>>> plt.title("Frequency response of the Blackman-Harris window")

>>> plt.ylabel("Normalized magnitude [dB]")

>>> plt.xlabel("Normalized frequency [cycles per sample]")

658 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.signal.bohman(M,sym=True)

Return a Bohman window.

Parameters M: int

Number of points in the output window. If zero or less, an empty array is

returned.

sym : bool, optional

When True (default), generates a symmetric window, for use in ﬁlter design.

When False, generates a periodic window, for use in spectral analysis.

Returns w: ndarray

The window, with the maximum value normalized to 1 (though the value 1

does not appear if the number of samples is even and sym is True).

Examples

Plot the window and its frequency response:

>>> from scipy import signal

>>> from scipy.fftpack import fft, fftshift

>>> import matplotlib.pyplot as plt

>>> window =signal.bohman(51)

>>> plt.plot(window)

>>> plt.title("Bohman window")

>>> plt.ylabel("Amplitude")

>>> plt.xlabel("Sample")

>>> plt.figure()

>>> A=fft(window, 2048)/(len(window)/2.0)

>>> freq =np.linspace(-0.5,0.5,len(A))

>>> response =20 *np.log10(np.abs(fftshift(A /abs(A).max())))

>>> plt.plot(freq, response)

>>> plt.axis([-0.5,0.5,-120,0])

>>> plt.title("Frequency response of the Bohman window")

>>> plt.ylabel("Normalized magnitude [dB]")

>>> plt.xlabel("Normalized frequency [cycles per sample]")

5.22. Signal processing (scipy.signal) 659

SciPy Reference Guide, Release 0.13.0

scipy.signal.boxcar(M,sym=True)

Return a boxcar or rectangular window.

Included for completeness, this is equivalent to no window at all.

Parameters M: int

Number of points in the output window. If zero or less, an empty array is

returned.

sym : bool, optional

Whether the window is symmetric. (Has no effect for boxcar.)

Returns w: ndarray

The window, with the maximum value normalized to 1

Examples

Plot the window and its frequency response:

660 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

>>> from scipy import signal

>>> from scipy.fftpack import fft, fftshift

>>> import matplotlib.pyplot as plt

>>> window =signal.boxcar(51)

>>> plt.plot(window)

>>> plt.title("Boxcar window")

>>> plt.ylabel("Amplitude")

>>> plt.xlabel("Sample")

>>> plt.figure()

>>> A=fft(window, 2048)/(len(window)/2.0)

>>> freq =np.linspace(-0.5,0.5,len(A))

>>> response =20 *np.log10(np.abs(fftshift(A /abs(A).max())))

>>> plt.plot(freq, response)

>>> plt.axis([-0.5,0.5,-120,0])

>>> plt.title("Frequency response of the boxcar window")

>>> plt.ylabel("Normalized magnitude [dB]")

>>> plt.xlabel("Normalized frequency [cycles per sample]")

5.22. Signal processing (scipy.signal) 661

SciPy Reference Guide, Release 0.13.0

scipy.signal.chebwin(M,at,sym=True)

Return a Dolph-Chebyshev window.

Parameters M: int

Number of points in the output window. If zero or less, an empty array is

returned.

at : ﬂoat

Attenuation (in dB).

sym : bool, optional

When True (default), generates a symmetric window, for use in ﬁlter design.

When False, generates a periodic window, for use in spectral analysis.

Returns w: ndarray

The window, with the maximum value always normalized to 1

Examples

Plot the window and its frequency response:

>>> from scipy import signal

>>> from scipy.fftpack import fft, fftshift

>>> import matplotlib.pyplot as plt

>>> window =signal.chebwin(51, at=100)

>>> plt.plot(window)

>>> plt.title("Dolph-Chebyshev window (100 dB)")

>>> plt.ylabel("Amplitude")

>>> plt.xlabel("Sample")

>>> plt.figure()

>>> A=fft(window, 2048)/(len(window)/2.0)

>>> freq =np.linspace(-0.5,0.5,len(A))

>>> response =20 *np.log10(np.abs(fftshift(A /abs(A).max())))

>>> plt.plot(freq, response)

>>> plt.axis([-0.5,0.5,-120,0])

>>> plt.title("Frequency response of the Dolph-Chebyshev window (100 dB)")

>>> plt.ylabel("Normalized magnitude [dB]")

>>> plt.xlabel("Normalized frequency [cycles per sample]")

662 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.signal.cosine(M,sym=True)

Return a window with a simple cosine shape. New in version 0.13.0.

Parameters M: int

Number of points in the output window. If zero or less, an empty array is

returned.

sym : bool, optional

When True, generates a symmetric window, for use in ﬁlter design. When

False, generates a periodic window, for use in spectral analysis.

Returns w: ndarray

The window, with the maximum value normalized to 1.

scipy.signal.flattop(M,sym=True)

Return a ﬂat top window.

Parameters M: int

Number of points in the output window. If zero or less, an empty array is

returned.

5.22. Signal processing (scipy.signal) 663

SciPy Reference Guide, Release 0.13.0

sym : bool, optional

When True (default), generates a symmetric window, for use in ﬁlter design.

When False, generates a periodic window, for use in spectral analysis.

Returns w: ndarray

The window, with the maximum value normalized to 1 (though the value 1

does not appear if the number of samples is even and sym is True).

Examples

Plot the window and its frequency response:

>>> from scipy import signal

>>> from scipy.fftpack import fft, fftshift

>>> import matplotlib.pyplot as plt

>>> window =signal.flattop(51)

>>> plt.plot(window)

>>> plt.title("Flat top window")

>>> plt.ylabel("Amplitude")

>>> plt.xlabel("Sample")

>>> plt.figure()

>>> A=fft(window, 2048)/(len(window)/2.0)

>>> freq =np.linspace(-0.5,0.5,len(A))

>>> response =20 *np.log10(np.abs(fftshift(A /abs(A).max())))

>>> plt.plot(freq, response)

>>> plt.axis([-0.5,0.5,-120,0])

>>> plt.title("Frequency response of the flat top window")

>>> plt.ylabel("Normalized magnitude [dB]")

>>> plt.xlabel("Normalized frequency [cycles per sample]")

664 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.signal.gaussian(M,std,sym=True)

Return a Gaussian window.

Parameters M: int

Number of points in the output window. If zero or less, an empty array is

returned.

std : ﬂoat

The standard deviation, sigma.

sym : bool, optional

When True (default), generates a symmetric window, for use in ﬁlter design.

When False, generates a periodic window, for use in spectral analysis.

Returns w: ndarray

The window, with the maximum value normalized to 1 (though the value 1

does not appear if the number of samples is even and sym is True).

Notes

The Gaussian window is deﬁned as

w(n) = e−1

2(n

σ)2

Examples

Plot the window and its frequency response:

>>> from scipy import signal

>>> from scipy.fftpack import fft, fftshift

>>> import matplotlib.pyplot as plt

>>> window =signal.gaussian(51, std=7)

>>> plt.plot(window)

>>> plt.title(r"Gaussian window ($\sigma$=7)")

>>> plt.ylabel("Amplitude")

>>> plt.xlabel("Sample")

5.22. Signal processing (scipy.signal) 665

SciPy Reference Guide, Release 0.13.0

>>> plt.figure()

>>> A=fft(window, 2048)/(len(window)/2.0)

>>> freq =np.linspace(-0.5,0.5,len(A))

>>> response =20 *np.log10(np.abs(fftshift(A /abs(A).max())))

>>> plt.plot(freq, response)

>>> plt.axis([-0.5,0.5,-120,0])

>>> plt.title(r"Frequency response of the Gaussian window ($\sigma$=7)")

>>> plt.ylabel("Normalized magnitude [dB]")

>>> plt.xlabel("Normalized frequency [cycles per sample]")

scipy.signal.general_gaussian(M,p,sig,sym=True)

Return a window with a generalized Gaussian shape.

Parameters M: int

Number of points in the output window. If zero or less, an empty array is

returned.

p: ﬂoat

666 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Shape parameter. p = 1 is identical to gaussian, p = 0.5 is the same shape

as the Laplace distribution.

sig : ﬂoat

The standard deviation, sigma.

sym : bool, optional

When True (default), generates a symmetric window, for use in ﬁlter design.

When False, generates a periodic window, for use in spectral analysis.

Returns w: ndarray

The window, with the maximum value normalized to 1 (though the value 1

does not appear if the number of samples is even and sym is True).

Notes

The generalized Gaussian window is deﬁned as

w(n) = e−1

2|n

σ|2p

the half-power point is at

(2 log(2))1/(2p)σ

Examples

Plot the window and its frequency response:

>>> from scipy import signal

>>> from scipy.fftpack import fft, fftshift

>>> import matplotlib.pyplot as plt

>>> window =signal.general_gaussian(51, p=1.5, sig=7)

>>> plt.plot(window)

>>> plt.title(r"Generalized Gaussian window (p=1.5, $\sigma$=7)")

>>> plt.ylabel("Amplitude")

>>> plt.xlabel("Sample")

>>> plt.figure()

>>> A=fft(window, 2048)/(len(window)/2.0)

>>> freq =np.linspace(-0.5,0.5,len(A))

>>> response =20 *np.log10(np.abs(fftshift(A /abs(A).max())))

>>> plt.plot(freq, response)

>>> plt.axis([-0.5,0.5,-120,0])

>>> plt.title(r"Freq. resp. of the gen. Gaussian window (p=1.5, $\sigma$=7)")

>>> plt.ylabel("Normalized magnitude [dB]")

>>> plt.xlabel("Normalized frequency [cycles per sample]")

5.22. Signal processing (scipy.signal) 667

SciPy Reference Guide, Release 0.13.0

scipy.signal.hamming(M,sym=True)

Return a Hamming window.

The Hamming window is a taper formed by using a raised cosine with non-zero endpoints, optimized to mini-

mize the nearest side lobe.

Parameters M: int

Number of points in the output window. If zero or less, an empty array is

returned.

sym : bool, optional

When True (default), generates a symmetric window, for use in ﬁlter design.

When False, generates a periodic window, for use in spectral analysis.

Returns w: ndarray

The window, with the maximum value normalized to 1 (though the value 1

does not appear if the number of samples is even and sym is True).

668 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Notes

The Hamming window is deﬁned as

w(n)=0.54 −0.46 cos 2πn

M−10≤n≤M−1

The Hamming was named for R. W. Hamming, an associate of J. W. Tukey and is described in Blackman and

Tukey. It was recommended for smoothing the truncated autocovariance function in the time domain. Most

references to the Hamming window come from the signal processing literature, where it is used as one of many

windowing functions for smoothing values. It is also known as an apodization (which means “removing the

foot”, i.e. smoothing discontinuities at the beginning and end of the sampled signal) or tapering function.

References

[R117],[R118],[R119],[R120]

Examples

Plot the window and its frequency response:

>>> from scipy import signal

>>> from scipy.fftpack import fft, fftshift

>>> import matplotlib.pyplot as plt

>>> window =signal.hamming(51)

>>> plt.plot(window)

>>> plt.title("Hamming window")

>>> plt.ylabel("Amplitude")

>>> plt.xlabel("Sample")

>>> plt.figure()

>>> A=fft(window, 2048)/(len(window)/2.0)

>>> freq =np.linspace(-0.5,0.5,len(A))

>>> response =20 *np.log10(np.abs(fftshift(A /abs(A).max())))

>>> plt.plot(freq, response)

>>> plt.axis([-0.5,0.5,-120,0])

>>> plt.title("Frequency response of the Hamming window")

>>> plt.ylabel("Normalized magnitude [dB]")

>>> plt.xlabel("Normalized frequency [cycles per sample]")

5.22. Signal processing (scipy.signal) 669

SciPy Reference Guide, Release 0.13.0

scipy.signal.hann(M,sym=True)

Return a Hann window.

The Hann window is a taper formed by using a raised cosine or sine-squared with ends that touch zero.

Parameters M: int

Number of points in the output window. If zero or less, an empty array is

returned.

sym : bool, optional

When True (default), generates a symmetric window, for use in ﬁlter design.

When False, generates a periodic window, for use in spectral analysis.

Returns w: ndarray

The window, with the maximum value normalized to 1 (though the value 1

does not appear if Mis even and sym is True).

670 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Notes

The Hann window is deﬁned as

w(n)=0.5−0.5 cos 2πn

M−10≤n≤M−1

The window was named for Julius van Hann, an Austrian meterologist. It is also known as the Cosine Bell. It

is sometimes erroneously referred to as the “Hanning” window, from the use of “hann” as a verb in the original

paper and confusion with the very similar Hamming window.

Most references to the Hann window come from the signal processing literature, where it is used as one of many

windowing functions for smoothing values. It is also known as an apodization (which means “removing the

foot”, i.e. smoothing discontinuities at the beginning and end of the sampled signal) or tapering function.

References

[R121],[R122],[R123],[R124]

Examples

Plot the window and its frequency response:

>>> from scipy import signal

>>> from scipy.fftpack import fft, fftshift

>>> import matplotlib.pyplot as plt

>>> window =signal.hann(51)

>>> plt.plot(window)

>>> plt.title("Hann window")

>>> plt.ylabel("Amplitude")

>>> plt.xlabel("Sample")

>>> plt.figure()

>>> A=fft(window, 2048)/(len(window)/2.0)

>>> freq =np.linspace(-0.5,0.5,len(A))

>>> response =20 *np.log10(np.abs(fftshift(A /abs(A).max())))

>>> plt.plot(freq, response)

>>> plt.axis([-0.5,0.5,-120,0])

>>> plt.title("Frequency response of the Hann window")

>>> plt.ylabel("Normalized magnitude [dB]")

>>> plt.xlabel("Normalized frequency [cycles per sample]")

5.22. Signal processing (scipy.signal) 671

SciPy Reference Guide, Release 0.13.0

scipy.signal.kaiser(M,beta,sym=True)

Return a Kaiser window.

The Kaiser window is a taper formed by using a Bessel function.

Parameters M: int

Number of points in the output window. If zero or less, an empty array is

returned.

beta : ﬂoat

Shape parameter, determines trade-off between main-lobe width and side

lobe level. As beta gets large, the window narrows.

sym : bool, optional

When True (default), generates a symmetric window, for use in ﬁlter design.

When False, generates a periodic window, for use in spectral analysis.

Returns w: ndarray

The window, with the maximum value normalized to 1 (though the value 1

does not appear if the number of samples is even and sym is True).

672 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Notes

The Kaiser window is deﬁned as

w(n) = I0 βs1−4n2

(M−1)2!/I0(β)

with

−M−1

2≤n≤M−1

where I0is the modiﬁed zeroth-order Bessel function.

The Kaiser was named for Jim Kaiser, who discovered a simple approximation to the DPSS window based on

Bessel functions. The Kaiser window is a very good approximation to the Digital Prolate Spheroidal Sequence,

or Slepian window, which is the transform which maximizes the energy in the main lobe of the window relative

to total energy.

The Kaiser can approximate many other windows by varying the beta parameter.

beta Window shape

0 Rectangular

5 Similar to a Hamming

6 Similar to a Hann

8.6 Similar to a Blackman

A beta value of 14 is probably a good starting point. Note that as beta gets large, the window narrows, and so

the number of samples needs to be large enough to sample the increasingly narrow spike, otherwise NaNs will

get returned.

Most references to the Kaiser window come from the signal processing literature, where it is used as one of

many windowing functions for smoothing values. It is also known as an apodization (which means “removing

the foot”, i.e. smoothing discontinuities at the beginning and end of the sampled signal) or tapering function.

References

[R126],[R127],[R128]

Examples

Plot the window and its frequency response:

>>> from scipy import signal

>>> from scipy.fftpack import fft, fftshift

>>> import matplotlib.pyplot as plt

>>> window =signal.kaiser(51, beta=14)

>>> plt.plot(window)

>>> plt.title(r"Kaiser window ($\beta$=14)")

>>> plt.ylabel("Amplitude")

>>> plt.xlabel("Sample")

>>> plt.figure()

>>> A=fft(window, 2048)/(len(window)/2.0)

>>> freq =np.linspace(-0.5,0.5,len(A))

5.22. Signal processing (scipy.signal) 673

SciPy Reference Guide, Release 0.13.0

>>> response =20 *np.log10(np.abs(fftshift(A /abs(A).max())))

>>> plt.plot(freq, response)

>>> plt.axis([-0.5,0.5,-120,0])

>>> plt.title(r"Frequency response of the Kaiser window ($\beta$=14)")

>>> plt.ylabel("Normalized magnitude [dB]")

>>> plt.xlabel("Normalized frequency [cycles per sample]")

scipy.signal.nuttall(M,sym=True)

Return a minimum 4-term Blackman-Harris window according to Nuttall.

Parameters M: int

Number of points in the output window. If zero or less, an empty array is

returned.

sym : bool, optional

When True (default), generates a symmetric window, for use in ﬁlter design.

When False, generates a periodic window, for use in spectral analysis.

Returns w: ndarray

674 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

The window, with the maximum value normalized to 1 (though the value 1

does not appear if the number of samples is even and sym is True).

Examples

Plot the window and its frequency response:

>>> from scipy import signal

>>> from scipy.fftpack import fft, fftshift

>>> import matplotlib.pyplot as plt

>>> window =signal.nuttall(51)

>>> plt.plot(window)

>>> plt.title("Nuttall window")

>>> plt.ylabel("Amplitude")

>>> plt.xlabel("Sample")

>>> plt.figure()

>>> A=fft(window, 2048)/(len(window)/2.0)

>>> freq =np.linspace(-0.5,0.5,len(A))

>>> response =20 *np.log10(np.abs(fftshift(A /abs(A).max())))

>>> plt.plot(freq, response)

>>> plt.axis([-0.5,0.5,-120,0])

>>> plt.title("Frequency response of the Nuttall window")

>>> plt.ylabel("Normalized magnitude [dB]")

>>> plt.xlabel("Normalized frequency [cycles per sample]")

5.22. Signal processing (scipy.signal) 675

SciPy Reference Guide, Release 0.13.0

scipy.signal.parzen(M,sym=True)

Return a Parzen window.

Parameters M: int

Number of points in the output window. If zero or less, an empty array is

returned.

sym : bool, optional

When True (default), generates a symmetric window, for use in ﬁlter design.

When False, generates a periodic window, for use in spectral analysis.

Returns w: ndarray

The window, with the maximum value normalized to 1 (though the value 1

does not appear if the number of samples is even and sym is True).

Examples

Plot the window and its frequency response:

>>> from scipy import signal

>>> from scipy.fftpack import fft, fftshift

>>> import matplotlib.pyplot as plt

>>> window =signal.parzen(51)

>>> plt.plot(window)

>>> plt.title("Parzen window")

>>> plt.ylabel("Amplitude")

>>> plt.xlabel("Sample")

>>> plt.figure()

>>> A=fft(window, 2048)/(len(window)/2.0)

>>> freq =np.linspace(-0.5,0.5,len(A))

>>> response =20 *np.log10(np.abs(fftshift(A /abs(A).max())))

>>> plt.plot(freq, response)

>>> plt.axis([-0.5,0.5,-120,0])

>>> plt.title("Frequency response of the Parzen window")

>>> plt.ylabel("Normalized magnitude [dB]")

>>> plt.xlabel("Normalized frequency [cycles per sample]")

676 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.signal.slepian(M,width,sym=True)

Return a digital Slepian (DPSS) window.

Used to maximize the energy concentration in the main lobe. Also called the digital prolate spheroidal sequence

(DPSS).

Parameters M: int

Number of points in the output window. If zero or less, an empty array is

returned.

width : ﬂoat

Bandwidth

sym : bool, optional

When True (default), generates a symmetric window, for use in ﬁlter design.

When False, generates a periodic window, for use in spectral analysis.

Returns w: ndarray

The window, with the maximum value always normalized to 1

5.22. Signal processing (scipy.signal) 677

SciPy Reference Guide, Release 0.13.0

Examples

Plot the window and its frequency response:

>>> from scipy import signal

>>> from scipy.fftpack import fft, fftshift

>>> import matplotlib.pyplot as plt

>>> window =signal.slepian(51, width=0.3)

>>> plt.plot(window)

>>> plt.title("Slepian (DPSS) window (BW=0.3)")

>>> plt.ylabel("Amplitude")

>>> plt.xlabel("Sample")

>>> plt.figure()

>>> A=fft(window, 2048)/(len(window)/2.0)

>>> freq =np.linspace(-0.5,0.5,len(A))

>>> response =20 *np.log10(np.abs(fftshift(A /abs(A).max())))

>>> plt.plot(freq, response)

>>> plt.axis([-0.5,0.5,-120,0])

>>> plt.title("Frequency response of the Slepian window (BW=0.3)")

>>> plt.ylabel("Normalized magnitude [dB]")

>>> plt.xlabel("Normalized frequency [cycles per sample]")

678 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.signal.triang(M,sym=True)

Return a triangular window.

Parameters M: int

Number of points in the output window. If zero or less, an empty array is

returned.

sym : bool, optional

When True (default), generates a symmetric window, for use in ﬁlter design.

When False, generates a periodic window, for use in spectral analysis.

Returns w: ndarray

The window, with the maximum value normalized to 1 (though the value 1

does not appear if the number of samples is even and sym is True).

Examples

Plot the window and its frequency response:

>>> from scipy import signal

>>> from scipy.fftpack import fft, fftshift

>>> import matplotlib.pyplot as plt

>>> window =signal.triang(51)

>>> plt.plot(window)

>>> plt.title("Triangular window")

>>> plt.ylabel("Amplitude")

>>> plt.xlabel("Sample")

>>> plt.figure()

>>> A=fft(window, 2048)/(len(window)/2.0)

>>> freq =np.linspace(-0.5,0.5,len(A))

>>> response =20 *np.log10(np.abs(fftshift(A /abs(A).max())))

>>> plt.plot(freq, response)

>>> plt.axis([-0.5,0.5,-120,0])

>>> plt.title("Frequency response of the triangular window")

>>> plt.ylabel("Normalized magnitude [dB]")

>>> plt.xlabel("Normalized frequency [cycles per sample]")

5.22. Signal processing (scipy.signal) 679

SciPy Reference Guide, Release 0.13.0

5.22.11 Wavelets

cascade(hk[, J]) Return (x, phi, psi) at dyadic points K/2**Jfrom ﬁlter coefﬁcients.

daub(p) The coefﬁcients for the FIR low-pass ﬁlter producing Daubechies wavelets.

morlet(M[, w, s, complete]) Complex Morlet wavelet.

qmf(hk) Return high-pass qmf ﬁlter from low-pass

ricker(points, a) Return a Ricker wavelet, also known as the “Mexican hat wavelet”.

cwt(data, wavelet, widths) Continuous wavelet transform.

scipy.signal.cascade(hk,J=7)

Return (x, phi, psi) at dyadic points K/2**Jfrom ﬁlter coefﬁcients.

Parameters hk : array_like

Coefﬁcients of low-pass ﬁlter.

680 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

J: int, optional

Values will be computed at grid points K/2**J. Default is 7.

Returns x: ndarray

The dyadic points K/2**Jfor K=0...N *(2**J)-1 where

len(hk) = len(gk) = N+1.

phi : ndarray

The scaling function phi(x) at x:phi(x) = sum(hk *

phi(2x-k)), where k is from 0 to N.

psi : ndarray, optional

The wavelet function psi(x) at x:phi(x) = sum(gk *

phi(2x-k)), where k is from 0 to N. psi is only returned if gk is

not None.

Notes

The algorithm uses the vector cascade algorithm described by Strang and Nguyen in “Wavelets and Filter

Banks”. It builds a dictionary of values and slices for quick reuse. Then inserts vectors into ﬁnal vector at

the end.

scipy.signal.daub(p)

The coefﬁcients for the FIR low-pass ﬁlter producing Daubechies wavelets.

p>=1 gives the order of the zero at f=1/2. There are 2p ﬁlter coefﬁcients.

Parameters p: int

Order of the zero at f=1/2, can have values from 1 to 34.

Returns daub : ndarray

Return

scipy.signal.morlet(M,w=5.0,s=1.0,complete=True)

Complex Morlet wavelet.

Parameters M: int

Length of the wavelet.

w: ﬂoat

Omega0. Default is 5

s: ﬂoat

Scaling factor, windowed from -s*2*pi to +s*2*pi. Default is 1.

complete : bool

Whether to use the complete or the standard version.

Returns morlet : (M,) ndarray

See Also

scipy.signal.gausspulse

Notes

The standard version:

pi**-0.25 *exp(1j*w*x) *exp(-0.5*(x**2))

This commonly used wavelet is often referred to simply as the Morlet wavelet. Note that this simpliﬁed version

can cause admissibility problems at low values of w.

The complete version:

pi**-0.25 *(exp(1j*w*x) -exp(-0.5*(w**2))) *exp(-0.5*(x**2))

5.22. Signal processing (scipy.signal) 681

SciPy Reference Guide, Release 0.13.0

The complete version of the Morlet wavelet, with a correction term to improve admissibility. For w greater than

5, the correction term is negligible.

Note that the energy of the return wavelet is not normalised according to s.

The fundamental frequency of this wavelet in Hz is given by f=2*s*w*r/Mwhere r is the sampling rate.

scipy.signal.qmf(hk)

Return high-pass qmf ﬁlter from low-pass

Parameters hk : array_like

Coefﬁcients of high-pass ﬁlter.

scipy.signal.ricker(points,a)

Return a Ricker wavelet, also known as the “Mexican hat wavelet”.

It models the function:

A (1 - x^2/a^2) exp(-t^2/a^2),

where A = 2/sqrt(3a)pi^1/3.

Parameters points : int

Number of points in vector. Will be centered around 0.

a: scalar

Width parameter of the wavelet.

Returns vector : (N,) ndarray

Array of length points in shape of ricker curve.

Examples

>>> from scipy import signal

>>> import matplotlib.pyplot as plt

>>> points =100

>>> a=4.0

>>> vec2 =signal.ricker(points, a)

>>> print(len(vec2))

100

>>> plt.plot(vec2)

>>> plt.show()

682 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.signal.cwt(data,wavelet,widths)

Continuous wavelet transform.

Performs a continuous wavelet transform on data, using the wavelet function. A CWT performs a convolution

with data using the wavelet function, which is characterized by a width parameter and length parameter.

Parameters data : (N,) ndarray

data on which to perform the transform.

wavelet : function

Wavelet function, which should take 2 arguments. The ﬁrst ar-

gument is the number of points that the returned vector will have

(len(wavelet(width,length)) == length). The second is a width parameter,

deﬁning the size of the wavelet (e.g. standard deviation of a gaussian). See

ricker, which satisﬁes these requirements.

widths : (M,) sequence

Widths to use for transform.

Returns cwt: (M, N) ndarray

Will have shape of (len(data), len(widths)).

Notes

>>> length =min(10 *width[ii], len(data))

>>> cwt[ii,:] =scipy.signal.convolve(data, wavelet(width[ii],

... length), mode=’same’)

Examples

>>> from scipy import signal

>>> sig =np.random.rand(20)-0.5

>>> wavelet =signal.ricker

>>> widths =np.arange(1,11)

>>> cwtmatr =signal.cwt(sig, wavelet, widths)

5.22.12 Peak ﬁnding

5.22. Signal processing (scipy.signal) 683

SciPy Reference Guide, Release 0.13.0

find_peaks_cwt(vector, widths[, wavelet, ...]) Attempt to ﬁnd the peaks in a 1-D array.

argrelmin(data[, axis, order, mode]) Calculate the relative minima of data.

argrelmax(data[, axis, order, mode]) Calculate the relative maxima of data.

argrelextrema(data, comparator[, axis, ...]) Calculate the relative extrema of data.

scipy.signal.find_peaks_cwt(vector,widths,wavelet=None,max_distances=None,

gap_thresh=None,min_length=None,min_snr=1,noise_perc=10)

Attempt to ﬁnd the peaks in a 1-D array.

The general approach is to smooth vector by convolving it with wavelet(width) for each width in widths. Relative

maxima which appear at enough length scales, and with sufﬁciently high SNR, are accepted. New in version

0.11.0.

Parameters vector : ndarray

1-D array in which to ﬁnd the peaks.

widths : sequence

1-D array of widths to use for calculating the CWT matrix. In general, this

range should cover the expected width of peaks of interest.

wavelet : callable, optional

Should take a single variable and return a 1-D array to convolve with vector.

Should be normalized to unit area. Default is the ricker wavelet.

max_distances : ndarray, optional

At each row, a ridge line is only connected if the relative max at row[n] is

within max_distances[n] from the relative max at row[n+1]. De-

fault value is widths/4.

gap_thresh : ﬂoat, optional

If a relative maximum is not found within max_distances, there will be a

gap. A ridge line is discontinued if there are more than gap_thresh points

without connecting a new relative maximum. Default is 2.

min_length : int, optional

Minimum length a ridge line needs to be acceptable. Default is

cwt.shape[0] / 4, ie 1/4-th the number of widths.

min_snr : ﬂoat, optional

Minimum SNR ratio. Default 1. The signal is the value of the cwt matrix

at the shortest length scale (cwt[0, loc]), the noise is the noise_perc‘th

percentile of datapoints contained within a window of ‘window_size around

cwt[0, loc].

noise_perc : ﬂoat, optional

When calculating the noise ﬂoor, percentile of data points examined below

which to consider noise. Calculated using stats.scoreatpercentile. Default

is 10.

See Also

cwt

Notes

This approach was designed for ﬁnding sharp peaks among noisy data, however with proper parameter selection

it should function well for different peak shapes.

The algorithm is as follows:

1.Perform a continuous wavelet transform on vector, for the supplied widths. This

is a convolution of vector with wavelet(width) for each width in widths. See cwt

684 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

2.Identify “ridge lines” in the cwt matrix. These are relative maxima at each row,

connected across adjacent rows. See identify_ridge_lines

3.Filter the ridge_lines using ﬁlter_ridge_lines.

References

[R114]

Examples

>>> from scipy import signal

>>> xs =np.arange(0, np.pi, 0.05)

>>> data =np.sin(xs)

>>> peakind =signal.find_peaks_cwt(data, np.arange(1,10))

>>> peakind, xs[peakind], data[peakind]

([32], array([ 1.6]), array([ 0.9995736]))

scipy.signal.argrelmin(data,axis=0,order=1,mode=’clip’)

Calculate the relative minima of data. New in version 0.11.0.

Parameters data : ndarray

Array in which to ﬁnd the relative minima.

axis : int, optional

Axis over which to select from data. Default is 0.

order : int, optional

How many points on each side to use for the comparison to consider

comparator(n, n+x) to be True.

mode : str, optional

How the edges of the vector are treated. Available options are ‘wrap’ (wrap

around) or ‘clip’ (treat overﬂow as the same as the last (or ﬁrst) element).

Default ‘clip’. See numpy.take

Returns extrema : ndarray

Indices of the minima, as an array of integers.

See Also

argrelextrema,argrelmax

Notes

This function uses argrelextrema with np.less as comparator.

scipy.signal.argrelmax(data,axis=0,order=1,mode=’clip’)

Calculate the relative maxima of data. New in version 0.11.0.

Parameters data : ndarray

Array in which to ﬁnd the relative maxima.

axis : int, optional

Axis over which to select from data. Default is 0.

order : int, optional

How many points on each side to use for the comparison to consider

comparator(n, n+x) to be True.

mode : str, optional

How the edges of the vector are treated. Available options are ‘wrap’ (wrap

around) or ‘clip’ (treat overﬂow as the same as the last (or ﬁrst) element).

Default ‘clip’. See numpy.take.

Returns extrema : ndarray

Indices of the maxima, as an array of integers.

5.22. Signal processing (scipy.signal) 685

SciPy Reference Guide, Release 0.13.0

See Also

argrelextrema,argrelmin

Notes

This function uses argrelextrema with np.greater as comparator.

scipy.signal.argrelextrema(data,comparator,axis=0,order=1,mode=’clip’)

Calculate the relative extrema of data. New in version 0.11.0.

Parameters data : ndarray

Array in which to ﬁnd the relative extrema.

comparator : callable

Function to use to compare two data points. Should take 2 numbers as

arguments.

axis : int, optional

Axis over which to select from data. Default is 0.

order : int, optional

How many points on each side to use for the comparison to consider

comparator(n, n+x) to be True.

mode : str, optional

How the edges of the vector are treated. ‘wrap’ (wrap around) or ‘clip’

(treat overﬂow as the same as the last (or ﬁrst) element). Default is ‘clip’.

See numpy.take.

Returns extrema : ndarray

Indices of the extrema, as an array of integers (same format as np.argmin,

np.argmax).

See Also

argrelmin,argrelmax

5.22.13 Spectral Analysis

periodogram(x[, fs, window, nfft, detrend, ...]) Estimate power spectral density using a periodogram.

welch(x[, fs, window, nperseg, noverlap, ...]) Estimate power spectral density using Welch’s method.

lombscargle(x, y, freqs) Computes the Lomb-Scargle periodogram.

scipy.signal.periodogram(x,fs=1.0,window=None,nfft=None,detrend=’constant’,re-

turn_onesided=True,scaling=’density’,axis=-1)

Estimate power spectral density using a periodogram.

Parameters x: array_like

Time series of measurement values

fs : ﬂoat, optional

Sampling frequency of the xtime series in units of Hz. Defaults to 1.0.

window : str or tuple or array_like, optional

Desired window to use. See get_window for a list of windows and re-

quired parameters. If window is an array it will be used directly as the

window. Defaults to None; equivalent to ‘boxcar’.

nfft : int, optional

Length of the FFT used. If None the length of xwill be used.

detrend : str or function, optional

686 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Speciﬁes how to detrend xprior to computing the spectrum. If detrend is

a string, it is passed as the type argument to detrend. If it is a function,

it should return a detrended array. Defaults to ‘constant’.

return_onesided : bool, optional

If True, return a one-sided spectrum for real data. If False return a two-

sided spectrum. Note that for complex data, a two-sided spectrum is always

returned.

scaling : { ‘density’, ‘spectrum’ }, optional

Selects between computing the power spectral density (‘density’) where

Pxx has units of V**2/Hz if xis measured in V and computing the power

spectrum (‘spectrum’) where Pxx has units of V**2 if xis measured in V.

Defaults to ‘density’

axis : int, optional

Axis along which the periodogram is computed; the default is over the last

axis (i.e. axis=-1).

Returns f: ndarray

Array of sample frequencies.

Pxx : ndarray

Power spectral density or power spectrum of x.

See Also

welch Estimate power spectral density using Welch’s method

lombscargle

Lomb-Scargle periodogram for unevenly sampled data

Notes

New in version 0.12.0.

Examples

>>> from scipy import signal

>>> import matplotlib.pyplot as plt

Generate a test signal, a 2 Vrms sine wave at 1234 Hz, corrupted by 0.001 V**2/Hz of white noise sampled at

10 kHz.

>>> fs =10e3

>>> N=1e5

>>> amp =2*np.sqrt(2)

>>> freq =1234.0

>>> noise_power =0.001 *fs /2

>>> time =np.arange(N) /fs

>>> x=amp*np.sin(2*np.pi*freq*time)

>>> x+= np.random.normal(scale=np.sqrt(noise_power), size=time.shape)

Compute and plot the power spectral density.

>>> f, Pxx_den =signal.periodogram(x, fs)

>>> plt.semilogy(f, Pxx_den)

>>> plt.xlabel(’frequency [Hz]’)

>>> plt.ylabel(’PSD [V**2/Hz]’)

>>> plt.show()

5.22. Signal processing (scipy.signal) 687

SciPy Reference Guide, Release 0.13.0

If we average the last half of the spectral density, to exclude the peak, we can recover the noise power on the

signal.

>>> np.mean(Pxx_den[256:])

0.0009924865443739191

Now compute and plot the power spectrum.

>>> f, Pxx_spec =signal.periodogram(x, fs, ’flattop’, scaling=’spectrum’)

>>> plt.figure()

>>> plt.semilogy(f, np.sqrt(Pxx_spec))

>>> plt.xlabel(’frequency [Hz]’)

>>> plt.ylabel(’Linear spectrum [V RMS]’)

>>> plt.show()

The peak height in the power spectrum is an estimate of the RMS amplitude.

688 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

>>> np.sqrt(Pxx_spec.max())

2.0077340678640727

scipy.signal.welch(x,fs=1.0,window=’hanning’,nperseg=256,noverlap=None,nfft=None,de-

trend=’constant’,return_onesided=True,scaling=’density’,axis=-1)

Estimate power spectral density using Welch’s method.

Welch’s method [R134] computes an estimate of the power spectral density by dividing the data into overlapping

segments, computing a modiﬁed periodogram for each segment and averaging the periodograms.

Parameters x: array_like

Time series of measurement values

fs : ﬂoat, optional

Sampling frequency of the xtime series in units of Hz. Defaults to 1.0.

window : str or tuple or array_like, optional

Desired window to use. See get_window for a list of windows and re-

quired parameters. If window is array_like it will be used directly as the

window and its length will be used for nperseg. Defaults to ‘hanning’.

nperseg : int, optional

Length of each segment. Defaults to 256.

noverlap: int, optional

Number of points to overlap between segments. If None, noverlap =

nperseg / 2. Defaults to None.

nfft : int, optional

Length of the FFT used, if a zero padded FFT is desired. If None, the FFT

length is nperseg. Defaults to None.

detrend : str or function, optional

Speciﬁes how to detrend each segment. If detrend is a string, it is passed

as the type argument to detrend. If it is a function, it takes a segment

and returns a detrended segment. Defaults to ‘constant’.

return_onesided : bool, optional

If True, return a one-sided spectrum for real data. If False return a two-

sided spectrum. Note that for complex data, a two-sided spectrum is always

returned.

scaling : { ‘density’, ‘spectrum’ }, optional

Selects between computing the power spectral density (‘density’) where

Pxx has units of V**2/Hz if x is measured in V and computing the power

spectrum (‘spectrum’) where Pxx has units of V**2 if x is measured in V.

Defaults to ‘density’.

axis : int, optional

Axis along which the periodogram is computed; the default is over the last

axis (i.e. axis=-1).

Returns f: ndarray

Array of sample frequencies.

Pxx : ndarray

Power spectral density or power spectrum of x.

See Also

periodogram

Simple, optionally modiﬁed periodogram

lombscargle

Lomb-Scargle periodogram for unevenly sampled data

5.22. Signal processing (scipy.signal) 689

SciPy Reference Guide, Release 0.13.0

Notes

An appropriate amount of overlap will depend on the choice of window and on your requirements. For the

default ‘hanning’ window an overlap of 50% is a reasonable trade off between accurately estimating the signal

power, while not over counting any of the data. Narrower windows may require a larger overlap.

If noverlap is 0, this method is equivalent to Bartlett’s method [R135]. New in version 0.12.0.

References

[R134],[R135]

Examples

>>> from scipy import signal

>>> import matplotlib.pyplot as plt

Generate a test signal, a 2 Vrms sine wave at 1234 Hz, corrupted by 0.001 V**2/Hz of white noise sampled at

10 kHz.

>>> fs =10e3

>>> N=1e5

>>> amp =2*np.sqrt(2)

>>> freq =1234.0

>>> noise_power =0.001 *fs /2

>>> time =np.arange(N) /fs

>>> x=amp*np.sin(2*np.pi*freq*time)

>>> x+= np.random.normal(scale=np.sqrt(noise_power), size=time.shape)

Compute and plot the power spectral density.

>>> f, Pxx_den =signal.welch(x, fs, 1024)

>>> plt.semilogy(f, Pxx_den)

>>> plt.xlabel(’frequency [Hz]’)

>>> plt.ylabel(’PSD [V**2/Hz]’)

>>> plt.show()

If we average the last half of the spectral density, to exclude the peak, we can recover the noise power on the

signal.

>>> np.mean(Pxx_den[256:])

0.0009924865443739191

Now compute and plot the power spectrum.

>>> f, Pxx_spec =signal.welch(x, fs, ’flattop’,1024, scaling=’spectrum’)

>>> plt.figure()

>>> plt.semilogy(f, np.sqrt(Pxx_spec))

>>> plt.xlabel(’frequency [Hz]’)

>>> plt.ylabel(’Linear spectrum [V RMS]’)

>>> plt.show()

The peak height in the power spectrum is an estimate of the RMS amplitude.

>>> np.sqrt(Pxx_spec.max())

2.0077340678640727

690 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.signal.lombscargle(x,y,freqs)

Computes the Lomb-Scargle periodogram.

The Lomb-Scargle periodogram was developed by Lomb [R129] and further extended by Scargle [R130] to

ﬁnd, and test the signiﬁcance of weak periodic signals with uneven temporal sampling.

The computed periodogram is unnormalized, it takes the value (A**2) *N/4 for a harmonic signal with

amplitude A for sufﬁciently large N.

Parameters x: array_like

Sample times.

y: array_like

Measurement values.

freqs : array_like

Angular frequencies for output periodogram.

Returns pgram : array_like

Lomb-Scargle periodogram.

Raises ValueError

If the input arrays xand ydo not have the same shape.

Notes

This subroutine calculates the periodogram using a slightly modiﬁed algorithm due to Townsend [R131] which

allows the periodogram to be calculated using only a single pass through the input arrays for each frequency.

The algorithm running time scales roughly as O(x * freqs) or O(N^2) for a large number of samples and fre-

quencies.

References

[R129],[R130],[R131]

Examples

>>> import scipy.signal

First deﬁne some input parameters for the signal:

>>> A=2.

>>> w=1.

>>> phi =0.5 *np.pi

>>> nin =1000

>>> nout =100000

>>> frac_points =0.9 # Fraction of points to select

Randomly select a fraction of an array with timesteps:

>>> r=np.random.rand(nin)

>>> x=np.linspace(0.01,10*np.pi, nin)

>>> x=x[r >= frac_points]

>>> normval =x.shape[0]# For normalization of the periodogram

Plot a sine wave for the selected times:

>>> y=A*np.sin(w*x+phi)

Deﬁne the array of frequencies for which to compute the periodogram:

>>> f=np.linspace(0.01,10, nout)

5.22. Signal processing (scipy.signal) 691

SciPy Reference Guide, Release 0.13.0

Calculate Lomb-Scargle periodogram:

>>> pgram =sp.signal.lombscargle(x, y, f)

Now make a plot of the input data:

>>> plt.subplot(2,1,1)

<matplotlib.axes.AxesSubplot object at 0x102154f50>

>>> plt.plot(x, y, ’b+’)

[<matplotlib.lines.Line2D object at 0x102154a10>]

Then plot the normalized periodogram:

>>> plt.subplot(2,1,2)

<matplotlib.axes.AxesSubplot object at 0x104b0a990>

>>> plt.plot(f, np.sqrt(4*(pgram/normval)))

[<matplotlib.lines.Line2D object at 0x104b2f910>]

>>> plt.show()

5.23 Sparse matrices (scipy.sparse)

SciPy 2-D sparse matrix package for numeric data.

5.23.1 Contents

Sparse matrix classes

bsr_matrix(arg1[, shape, dtype, copy, blocksize]) Block Sparse Row matrix

coo_matrix(arg1[, shape, dtype, copy]) A sparse matrix in COOrdinate format.

csc_matrix(arg1[, shape, dtype, copy]) Compressed Sparse Column matrix

csr_matrix(arg1[, shape, dtype, copy]) Compressed Sparse Row matrix

dia_matrix(arg1[, shape, dtype, copy]) Sparse matrix with DIAgonal storage

dok_matrix(arg1[, shape, dtype, copy]) Dictionary Of Keys based sparse matrix.

lil_matrix(arg1[, shape, dtype, copy]) Row-based linked list sparse matrix

class scipy.sparse.bsr_matrix(arg1,shape=None,dtype=None,copy=False,blocksize=None)

Block Sparse Row matrix

This can be instantiated in several ways:

bsr_matrix(D, [blocksize=(R,C)])

with a dense matrix or rank-2 ndarray D

bsr_matrix(S, [blocksize=(R,C)])

with another sparse matrix S (equivalent to S.tobsr())

bsr_matrix((M, N), [blocksize=(R,C), dtype])

to construct an empty matrix with shape (M, N) dtype is optional, defaulting to

dtype=’d’.

bsr_matrix((data, ij), [blocksize=(R,C), shape=(M, N)])

where data and ij satisfy a[ij[0, k], ij[1, k]] = data[k]

bsr_matrix((data, indices, indptr), [shape=(M, N)])

is the standard BSR representation where the block column indices for row i

692 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

are stored in indices[indptr[i]:indptr[i+1]] and their correspond-

ing block values are stored in data[ indptr[i]: indptr[i+1] ]. If

the shape parameter is not supplied, the matrix dimensions are inferred from the

index arrays.

Notes

Sparse matrices can be used in arithmetic operations: they support addition, subtraction, multiplication, division,

and matrix power.

Summary of BSR format

The Block Compressed Row (BSR) format is very similar to the Compressed Sparse Row (CSR) format. BSR is

appropriate for sparse matrices with dense sub matrices like the last example below. Block matrices often arise

in vector-valued ﬁnite element discretizations. In such cases, BSR is considerably more efﬁcient than CSR and

CSC for many sparse arithmetic operations.

Blocksize

The blocksize (R,C) must evenly divide the shape of the matrix (M,N). That is, R and C must satisfy the

relationship M%R=0and N%C=0.

If no blocksize is speciﬁed, a simple heuristic is applied to determine an appropriate blocksize.

Examples

>>> from scipy.sparse import bsr_matrix

>>> bsr_matrix((3,4), dtype=np.int8).todense()

matrix([[0, 0, 0, 0],

[0, 0, 0, 0],

[0, 0, 0, 0]], dtype=int8)

>>> row =np.array([0,0,1,2,2,2])

>>> col =np.array([0,2,2,0,1,2])

>>> data =np.array([1,2,3,4,5,6])

>>> bsr_matrix((data, (row,col)), shape=(3,3)).todense()

matrix([[1, 0, 2],

[0, 0, 3],

[4, 5, 6]])

>>> indptr =np.array([0,2,3,6])

>>> indices =np.array([0,2,2,0,1,2])

>>> data =np.array([1,2,3,4,5,6]).repeat(4).reshape(6,2,2)

>>> bsr_matrix((data,indices,indptr), shape=(6,6)).todense()

matrix([[1, 1, 0, 0, 2, 2],

[1, 1, 0, 0, 2, 2],

[0, 0, 0, 0, 3, 3],

[4, 4, 5, 5, 6, 6],

[4, 4, 5, 5, 6, 6]])

Attributes

has_sorted_indices Determine whether the matrix has sorted indices

bsr_matrix.has_sorted_indices

Determine whether the matrix has sorted indices

5.23. Sparse matrices (scipy.sparse) 693

SciPy Reference Guide, Release 0.13.0

Returns

•True: if the indices of the matrix are in sorted order

•False: otherwise

dtype (dtype) Data type of the matrix

shape (2-tuple) Shape of the matrix

ndim (int) Number of dimensions (this is always 2)

nnz Number of nonzero elements

data Data array of the matrix

indices BSR format index array

indptr BSR format index pointer array

blocksize Block size of the matrix

Methods

arcsin() Element-wise arcsin.

arcsinh() Element-wise arcsinh.

arctan() Element-wise arctan.

arctanh() Element-wise arctanh.

asformat(format) Return this matrix in a given sparse format

asfptype() Upcast matrix to a ﬂoating point format (if necessary)

astype(t)

ceil() Element-wise ceil.

check_format([full_check]) check whether the matrix format is valid

conj()

conjugate()

copy()

deg2rad() Element-wise deg2rad.

diagonal() Returns the main diagonal of the matrix

dot(other) Ordinary dot product ..

eliminate_zeros()

expm1() Element-wise expm1.

floor() Element-wise ﬂoor.

getH()

get_shape()

getcol(j) Returns a copy of column j of the matrix, as an (m x 1) sparse

getdata(ind)

getformat()

getmaxprint()

getnnz()

getrow(i) Returns a copy of row i of the matrix, as a (1 x n) sparse

log1p() Element-wise log1p.

matmat(other)

matvec(other)

max() Maximum of the elements of this matrix.

mean([axis]) Average the matrix over the given axis.

min() Minimum of the elements of this matrix.

multiply(other) Point-wise multiplication by another matrix, vector, or

nonzero() nonzero indices

prune() Remove empty space after all non-zero elements.

rad2deg() Element-wise rad2deg.

reshape(shape)

Continued on next page

694 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Table 5.122 – continued from previous page

rint() Element-wise rint.

set_shape(shape)

setdiag(values[, k]) Fills the diagonal elements {a_ii} with the values from the given sequence.

sign() Element-wise sign.

sin() Element-wise sin.

sinh() Element-wise sinh.

sort_indices() Sort the indices of this matrix in place

sorted_indices() Return a copy of this matrix with sorted indices

sqrt() Element-wise sqrt.

sum([axis]) Sum the matrix over the given axis.

sum_duplicates()

tan() Element-wise tan.

tanh() Element-wise tanh.

toarray([order, out]) See the docstring for spmatrix.toarray.

tobsr([blocksize, copy])

tocoo([copy]) Convert this matrix to COOrdinate format.

tocsc()

tocsr()

todense([order, out]) Return a dense matrix representation of this matrix.

todia()

todok()

tolil()

transpose()

trunc() Element-wise trunc.

bsr_matrix.arcsin()

Element-wise arcsin.

See numpy.arcsin for more information.

bsr_matrix.arcsinh()

Element-wise arcsinh.

See numpy.arcsinh for more information.

bsr_matrix.arctan()

Element-wise arctan.

See numpy.arctan for more information.

bsr_matrix.arctanh()

Element-wise arctanh.

See numpy.arctanh for more information.

bsr_matrix.asformat(format)

Return this matrix in a given sparse format

Parameters format : {string, None}

desired sparse matrix format

•None for no format conversion

•“csr” for csr_matrix format

•“csc” for csc_matrix format

•“lil” for lil_matrix format

•“dok” for dok_matrix format and so on

5.23. Sparse matrices (scipy.sparse) 695

SciPy Reference Guide, Release 0.13.0

bsr_matrix.asfptype()

Upcast matrix to a ﬂoating point format (if necessary)

bsr_matrix.astype(t)

bsr_matrix.ceil()

Element-wise ceil.

See numpy.ceil for more information.

bsr_matrix.check_format(full_check=True)

check whether the matrix format is valid

Parameters:

full_check: True - rigorous check, O(N) operations : default False - basic check, O(1)

operations

bsr_matrix.conj()

bsr_matrix.conjugate()

bsr_matrix.copy()

bsr_matrix.deg2rad()

Element-wise deg2rad.

See numpy.deg2rad for more information.

bsr_matrix.diagonal()

Returns the main diagonal of the matrix

bsr_matrix.dot(other)

Ordinary dot product

Examples

>>> import numpy as np

>>> from scipy.sparse import csr_matrix

>>> A=csr_matrix([[1,2,0], [0,0,3], [4,0,5]])

>>> v=np.array([1,0,-1])

>>> A.dot(v)

array([ 1, -3, -1], dtype=int64)

bsr_matrix.eliminate_zeros()

bsr_matrix.expm1()

Element-wise expm1.

See numpy.expm1 for more information.

bsr_matrix.floor()

Element-wise ﬂoor.

See numpy.ﬂoor for more information.

bsr_matrix.getH()

696 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

bsr_matrix.get_shape()

bsr_matrix.getcol(j)

Returns a copy of column j of the matrix, as an (m x 1) sparse matrix (column vector).

bsr_matrix.getdata(ind)

bsr_matrix.getformat()

bsr_matrix.getmaxprint()

bsr_matrix.getnnz()

bsr_matrix.getrow(i)

Returns a copy of row i of the matrix, as a (1 x n) sparse matrix (row vector).

bsr_matrix.log1p()

Element-wise log1p.

See numpy.log1p for more information.

bsr_matrix.matmat(other)

bsr_matrix.matvec(other)

bsr_matrix.max()

Maximum of the elements of this matrix.

This takes all elements into account, not just the non-zero ones.

Returns amax : self.dtype

Maximum element.

bsr_matrix.mean(axis=None)

Average the matrix over the given axis. If the axis is None, average over both rows and columns, returning

a scalar.

bsr_matrix.min()

Minimum of the elements of this matrix.

This takes all elements into account, not just the non-zero ones.

Returns amin : self.dtype

Minimum element.

bsr_matrix.multiply(other)

Point-wise multiplication by another matrix, vector, or scalar.

bsr_matrix.nonzero()

nonzero indices

Returns a tuple of arrays (row,col) containing the indices of the non-zero elements of the matrix.

Examples

5.23. Sparse matrices (scipy.sparse) 697

SciPy Reference Guide, Release 0.13.0

>>> from scipy.sparse import csr_matrix

>>> A=csr_matrix([[1,2,0],[0,0,3],[4,0,5]])

>>> A.nonzero()

(array([0, 0, 1, 2, 2]), array([0, 1, 2, 0, 2]))

bsr_matrix.prune()

Remove empty space after all non-zero elements.

bsr_matrix.rad2deg()

Element-wise rad2deg.

See numpy.rad2deg for more information.

bsr_matrix.reshape(shape)

bsr_matrix.rint()

Element-wise rint.

See numpy.rint for more information.

bsr_matrix.set_shape(shape)

bsr_matrix.setdiag(values,k=0)

Fills the diagonal elements {a_ii} with the values from the given sequence. If k != 0, ﬁlls the off-diagonal

elements {a_{i,i+k}} instead.

values may have any length. If the diagonal is longer than values, then the remaining diagonal entries will

not be set. If values if longer than the diagonal, then the remaining values are ignored.

bsr_matrix.sign()

Element-wise sign.

See numpy.sign for more information.

bsr_matrix.sin()

Element-wise sin.

See numpy.sin for more information.

bsr_matrix.sinh()

Element-wise sinh.

See numpy.sinh for more information.

bsr_matrix.sort_indices()

Sort the indices of this matrix in place

bsr_matrix.sorted_indices()

Return a copy of this matrix with sorted indices

bsr_matrix.sqrt()

Element-wise sqrt.

See numpy.sqrt for more information.

bsr_matrix.sum(axis=None)

Sum the matrix over the given axis. If the axis is None, sum over both rows and columns, returning a

scalar.

bsr_matrix.sum_duplicates()

698 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

bsr_matrix.tan()

Element-wise tan.

See numpy.tan for more information.

bsr_matrix.tanh()

Element-wise tanh.

See numpy.tanh for more information.

bsr_matrix.toarray(order=None,out=None)

See the docstring for spmatrix.toarray.

bsr_matrix.tobsr(blocksize=None,copy=False)

bsr_matrix.tocoo(copy=True)

Convert this matrix to COOrdinate format.

When copy=False the data array will be shared between this matrix and the resultant coo_matrix.

bsr_matrix.tocsc()

bsr_matrix.tocsr()

bsr_matrix.todense(order=None,out=None)

Return a dense matrix representation of this matrix.

Parameters order : {‘C’, ‘F’}, optional

Whether to store multi-dimensional data in C (row-major) or Fortran

(column-major) order in memory. The default is ‘None’, indicating

the NumPy default of C-ordered. Cannot be speciﬁed in conjunction

with the out argument.

out : ndarray, 2-dimensional, optional

If speciﬁed, uses this array (or numpy.matrix) as the output buffer

instead of allocating a new array to return. The provided array must

have the same shape and dtype as the sparse matrix on which you are

calling the method.

Returns arr : numpy.matrix, 2-dimensional

A NumPy matrix object with the same shape and containing the

same data represented by the sparse matrix, with the requested

memory order. If out was passed and was an array (rather than a

numpy.matrix), it will be ﬁlled with the appropriate values and

returned wrapped in a numpy.matrix object that shares the same

memory.

bsr_matrix.todia()

bsr_matrix.todok()

bsr_matrix.tolil()

bsr_matrix.transpose()

bsr_matrix.trunc()

Element-wise trunc.

5.23. Sparse matrices (scipy.sparse) 699

SciPy Reference Guide, Release 0.13.0

See numpy.trunc for more information.

class scipy.sparse.coo_matrix(arg1,shape=None,dtype=None,copy=False)

A sparse matrix in COOrdinate format.

Also known as the ‘ijv’ or ‘triplet’ format.

This can be instantiated in several ways:

coo_matrix(D)

with a dense matrix D

coo_matrix(S)with another sparse matrix S (equivalent to S.tocoo())

coo_matrix((M, N), [dtype])

to construct an empty matrix with shape (M, N) dtype is optional, defaulting to

dtype=’d’.

coo_matrix((data, (i, j)), [shape=(M, N)])

to construct from three arrays:

1.data[:] the entries of the matrix, in any order

2.i[:] the row indices of the matrix entries

3.j[:] the column indices of the matrix entries

Where A[i[k], j[k]] = data[k]. When shape is not speciﬁed, it is in-

ferred from the index arrays

Notes

Sparse matrices can be used in arithmetic operations: they support addition, subtraction, multiplication, division,

and matrix power.

Advantages of the COO format

•facilitates fast conversion among sparse formats

•permits duplicate entries (see example)

•very fast conversion to and from CSR/CSC formats

Disadvantages of the COO format

•does not directly support:

–arithmetic operations

–slicing

Intended Usage

•COO is a fast format for constructing sparse matrices

•Once a matrix has been constructed, convert to CSR or CSC format for fast arith-

metic and matrix vector operations

•By default when converting to CSR or CSC format, duplicate (i,j) entries will be

summed together. This facilitates efﬁcient construction of ﬁnite element matrices

and the like. (see example)

Examples

>>> from scipy.sparse import coo_matrix

>>> coo_matrix((3,4), dtype=np.int8).todense()

matrix([[0, 0, 0, 0],

[0, 0, 0, 0],

[0, 0, 0, 0]], dtype=int8)

700 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

>>> row =np.array([0,3,1,0])

>>> col =np.array([0,3,1,2])

>>> data =np.array([4,5,7,9])

>>> coo_matrix((data,(row,col)), shape=(4,4)).todense()

matrix([[4, 0, 9, 0],

[0, 7, 0, 0],

[0, 0, 0, 0],

[0, 0, 0, 5]])

>>> # example with duplicates

>>> row =np.array([0,0,1,3,1,0,0])

>>> col =np.array([0,2,1,3,1,0,0])

>>> data =np.array([1,1,1,1,1,1,1])

>>> coo_matrix((data, (row,col)), shape=(4,4)).todense()

matrix([[3, 0, 1, 0],

[0, 2, 0, 0],

[0, 0, 0, 0],

[0, 0, 0, 1]])

Attributes

dtype (dtype) Data type of the matrix

shape (2-tuple) Shape of the matrix

ndim (int) Number of dimensions (this is always 2)

nnz Number of nonzero elements

data COO format data array of the matrix

row COO format row index array of the matrix

col COO format column index array of the matrix

Methods

arcsin() Element-wise arcsin.

arcsinh() Element-wise arcsinh.

arctan() Element-wise arctan.

arctanh() Element-wise arctanh.

asformat(format) Return this matrix in a given sparse format

asfptype() Upcast matrix to a ﬂoating point format (if necessary)

astype(t)

ceil() Element-wise ceil.

conj()

conjugate()

copy()

deg2rad() Element-wise deg2rad.

diagonal() Returns the main diagonal of the matrix

dot(other) Ordinary dot product ..

expm1() Element-wise expm1.

floor() Element-wise ﬂoor.

getH()

get_shape()

getcol(j) Returns a copy of column j of the matrix, as an (m x 1) sparse

getformat()

getmaxprint()

Continued on next page

5.23. Sparse matrices (scipy.sparse) 701

SciPy Reference Guide, Release 0.13.0

Table 5.123 – continued from previous page

getnnz()

getrow(i) Returns a copy of row i of the matrix, as a (1 x n) sparse

log1p() Element-wise log1p.

max() Maximum of the elements of this matrix.

mean([axis]) Average the matrix over the given axis.

min() Minimum of the elements of this matrix.

multiply(other) Point-wise multiplication by another matrix

nonzero() nonzero indices

rad2deg() Element-wise rad2deg.

reshape(shape)

rint() Element-wise rint.

set_shape(shape)

setdiag(values[, k]) Fills the diagonal elements {a_ii} with the values from the given sequence.

sign() Element-wise sign.

sin() Element-wise sin.

sinh() Element-wise sinh.

sqrt() Element-wise sqrt.

sum([axis]) Sum the matrix over the given axis.

tan() Element-wise tan.

tanh() Element-wise tanh.

toarray([order, out]) See the docstring for spmatrix.toarray.

tobsr([blocksize])

tocoo([copy])

tocsc() Return a copy of this matrix in Compressed Sparse Column format

tocsr() Return a copy of this matrix in Compressed Sparse Row format

todense([order, out]) Return a dense matrix representation of this matrix.

todia()

todok()

tolil()

transpose([copy])

trunc() Element-wise trunc.

coo_matrix.arcsin()

Element-wise arcsin.

See numpy.arcsin for more information.

coo_matrix.arcsinh()

Element-wise arcsinh.

See numpy.arcsinh for more information.

coo_matrix.arctan()

Element-wise arctan.

See numpy.arctan for more information.

coo_matrix.arctanh()

Element-wise arctanh.

See numpy.arctanh for more information.

coo_matrix.asformat(format)

Return this matrix in a given sparse format

Parameters format : {string, None}

702 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

desired sparse matrix format

•None for no format conversion

•“csr” for csr_matrix format

•“csc” for csc_matrix format

•“lil” for lil_matrix format

•“dok” for dok_matrix format and so on

coo_matrix.asfptype()

Upcast matrix to a ﬂoating point format (if necessary)

coo_matrix.astype(t)

coo_matrix.ceil()

Element-wise ceil.

See numpy.ceil for more information.

coo_matrix.conj()

coo_matrix.conjugate()

coo_matrix.copy()

coo_matrix.deg2rad()

Element-wise deg2rad.

See numpy.deg2rad for more information.

coo_matrix.diagonal()

Returns the main diagonal of the matrix

coo_matrix.dot(other)

Ordinary dot product

Examples

>>> import numpy as np

>>> from scipy.sparse import csr_matrix

>>> A=csr_matrix([[1,2,0], [0,0,3], [4,0,5]])

>>> v=np.array([1,0,-1])

>>> A.dot(v)

array([ 1, -3, -1], dtype=int64)

coo_matrix.expm1()

Element-wise expm1.

See numpy.expm1 for more information.

coo_matrix.floor()

Element-wise ﬂoor.

See numpy.ﬂoor for more information.

coo_matrix.getH()

coo_matrix.get_shape()

5.23. Sparse matrices (scipy.sparse) 703

SciPy Reference Guide, Release 0.13.0

coo_matrix.getcol(j)

Returns a copy of column j of the matrix, as an (m x 1) sparse matrix (column vector).

coo_matrix.getformat()

coo_matrix.getmaxprint()

coo_matrix.getnnz()

coo_matrix.getrow(i)

Returns a copy of row i of the matrix, as a (1 x n) sparse matrix (row vector).

coo_matrix.log1p()

Element-wise log1p.

See numpy.log1p for more information.

coo_matrix.max()

Maximum of the elements of this matrix.

This takes all elements into account, not just the non-zero ones.

Returns amax : self.dtype

Maximum element.

coo_matrix.mean(axis=None)

Average the matrix over the given axis. If the axis is None, average over both rows and columns, returning

a scalar.

coo_matrix.min()

Minimum of the elements of this matrix.

This takes all elements into account, not just the non-zero ones.

Returns amin : self.dtype

Minimum element.

coo_matrix.multiply(other)

Point-wise multiplication by another matrix

coo_matrix.nonzero()

nonzero indices

Returns a tuple of arrays (row,col) containing the indices of the non-zero elements of the matrix.

Examples

>>> from scipy.sparse import csr_matrix

>>> A=csr_matrix([[1,2,0],[0,0,3],[4,0,5]])

>>> A.nonzero()

(array([0, 0, 1, 2, 2]), array([0, 1, 2, 0, 2]))

coo_matrix.rad2deg()

Element-wise rad2deg.

See numpy.rad2deg for more information.

coo_matrix.reshape(shape)

704 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

coo_matrix.rint()

Element-wise rint.

See numpy.rint for more information.

coo_matrix.set_shape(shape)

coo_matrix.setdiag(values,k=0)

Fills the diagonal elements {a_ii} with the values from the given sequence. If k != 0, ﬁlls the off-diagonal

elements {a_{i,i+k}} instead.

values may have any length. If the diagonal is longer than values, then the remaining diagonal entries will

not be set. If values if longer than the diagonal, then the remaining values are ignored.

coo_matrix.sign()

Element-wise sign.

See numpy.sign for more information.

coo_matrix.sin()

Element-wise sin.

See numpy.sin for more information.

coo_matrix.sinh()

Element-wise sinh.

See numpy.sinh for more information.

coo_matrix.sqrt()

Element-wise sqrt.

See numpy.sqrt for more information.

coo_matrix.sum(axis=None)

Sum the matrix over the given axis. If the axis is None, sum over both rows and columns, returning a

scalar.

coo_matrix.tan()

Element-wise tan.

See numpy.tan for more information.

coo_matrix.tanh()

Element-wise tanh.

See numpy.tanh for more information.

coo_matrix.toarray(order=None,out=None)

See the docstring for spmatrix.toarray.

coo_matrix.tobsr(blocksize=None)

coo_matrix.tocoo(copy=False)

coo_matrix.tocsc()

Return a copy of this matrix in Compressed Sparse Column format

Duplicate entries will be summed together.

5.23. Sparse matrices (scipy.sparse) 705

SciPy Reference Guide, Release 0.13.0

Examples

>>> from numpy import array

>>> from scipy.sparse import coo_matrix

>>> row =array([0,0,1,3,1,0,0])

>>> col =array([0,2,1,3,1,0,0])

>>> data =array([1,1,1,1,1,1,1])

>>> A=coo_matrix( (data,(row,col)), shape=(4,4)).tocsc()

>>> A.todense()

matrix([[3, 0, 1, 0],

[0, 2, 0, 0],

[0, 0, 0, 0],

[0, 0, 0, 1]])

coo_matrix.tocsr()

Return a copy of this matrix in Compressed Sparse Row format

Duplicate entries will be summed together.

Examples

>>> from numpy import array

>>> from scipy.sparse import coo_matrix

>>> row =array([0,0,1,3,1,0,0])

>>> col =array([0,2,1,3,1,0,0])

>>> data =array([1,1,1,1,1,1,1])

>>> A=coo_matrix( (data,(row,col)), shape=(4,4)).tocsr()

>>> A.todense()

matrix([[3, 0, 1, 0],

[0, 2, 0, 0],

[0, 0, 0, 0],

[0, 0, 0, 1]])

coo_matrix.todense(order=None,out=None)

Return a dense matrix representation of this matrix.

Parameters order : {‘C’, ‘F’}, optional

Whether to store multi-dimensional data in C (row-major) or Fortran

(column-major) order in memory. The default is ‘None’, indicating

the NumPy default of C-ordered. Cannot be speciﬁed in conjunction

with the out argument.

out : ndarray, 2-dimensional, optional

If speciﬁed, uses this array (or numpy.matrix) as the output buffer

instead of allocating a new array to return. The provided array must

have the same shape and dtype as the sparse matrix on which you are

calling the method.

Returns arr : numpy.matrix, 2-dimensional

A NumPy matrix object with the same shape and containing the

same data represented by the sparse matrix, with the requested

memory order. If out was passed and was an array (rather than a

numpy.matrix), it will be ﬁlled with the appropriate values and

returned wrapped in a numpy.matrix object that shares the same

memory.

coo_matrix.todia()

coo_matrix.todok()

706 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

coo_matrix.tolil()

coo_matrix.transpose(copy=False)

coo_matrix.trunc()

Element-wise trunc.

See numpy.trunc for more information.

class scipy.sparse.csc_matrix(arg1,shape=None,dtype=None,copy=False)

Compressed Sparse Column matrix

This can be instantiated in several ways:

csc_matrix(D)with a dense matrix or rank-2 ndarray D

csc_matrix(S) with another sparse matrix S (equivalent to S.tocsc())

csc_matrix((M, N), [dtype])

to construct an empty matrix with shape (M, N) dtype is optional, defaulting to dtype=’d’.

csc_matrix((data, ij), [shape=(M, N)])

where data and ij satisfy the relationship a[ij[0, k], ij[1, k]] = data[k]

csc_matrix((data, indices, indptr), [shape=(M, N)])

is the standard CSC representation where the row indices for column i are stored in

indices[indptr[i]:indptr[i+1]] and their corresponding values are stored in

data[indptr[i]:indptr[i+1]]. If the shape parameter is not supplied, the matrix

dimensions are inferred from the index arrays.

Notes

Sparse matrices can be used in arithmetic operations: they support addition, subtraction, multiplication, division,

and matrix power.

Advantages of the CSC format

•efﬁcient arithmetic operations CSC + CSC, CSC * CSC, etc.

•efﬁcient column slicing

•fast matrix vector products (CSR, BSR may be faster)

Disadvantages of the CSC format

•slow row slicing operations (consider CSR)

•changes to the sparsity structure are expensive (consider LIL or DOK)

Examples

>>> from scipy.sparse import *

>>> from scipy import *

>>> csc_matrix( (3,4), dtype=int8 ).todense()

matrix([[0, 0, 0, 0],

[0, 0, 0, 0],

[0, 0, 0, 0]], dtype=int8)

>>> row =array([0,2,2,0,1,2])

>>> col =array([0,0,1,2,2,2])

>>> data =array([1,2,3,4,5,6])

>>> csc_matrix( (data,(row,col)), shape=(3,3) ).todense()

matrix([[1, 0, 4],

[0, 0, 5],

[2, 3, 6]])

5.23. Sparse matrices (scipy.sparse) 707

SciPy Reference Guide, Release 0.13.0

>>> indptr =array([0,2,3,6])

>>> indices =array([0,2,2,0,1,2])

>>> data =array([1,2,3,4,5,6])

>>> csc_matrix( (data,indices,indptr), shape=(3,3) ).todense()

matrix([[1, 0, 4],

[0, 0, 5],

[2, 3, 6]])

Attributes

has_sorted_indices Determine whether the matrix has sorted indices

csc_matrix.has_sorted_indices

Determine whether the matrix has sorted indices

Returns

•True: if the indices of the matrix are in sorted order

•False: otherwise

dtype (dtype) Data type of the matrix

shape (2-tuple) Shape of the matrix

ndim (int) Number of dimensions (this is always 2)

nnz Number of nonzero elements

data Data array of the matrix

indices CSC format index array

indptr CSC format index pointer array

Methods

arcsin() Element-wise arcsin.

arcsinh() Element-wise arcsinh.

arctan() Element-wise arctan.

arctanh() Element-wise arctanh.

asformat(format) Return this matrix in a given sparse format

asfptype() Upcast matrix to a ﬂoating point format (if necessary)

astype(t)

ceil() Element-wise ceil.

check_format([full_check]) check whether the matrix format is valid

conj()

conjugate()

copy()

deg2rad() Element-wise deg2rad.

diagonal() Returns the main diagonal of the matrix

dot(other) Ordinary dot product ..

eliminate_zeros() Remove zero entries from the matrix

expm1() Element-wise expm1.

floor() Element-wise ﬂoor.

getH()

get_shape()

getcol(i) Returns a copy of column i of the matrix, as a (m x 1)

getformat()

getmaxprint()

Continued on next page

708 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Table 5.125 – continued from previous page

getnnz()

getrow(i) Returns a copy of row i of the matrix, as a (1 x n)

log1p() Element-wise log1p.

max() Maximum of the elements of this matrix.

mean([axis]) Average the matrix over the given axis.

min() Minimum of the elements of this matrix.

multiply(other) Point-wise multiplication by another matrix, vector, or

nonzero() nonzero indices

prune() Remove empty space after all non-zero elements.

rad2deg() Element-wise rad2deg.

reshape(shape)

rint() Element-wise rint.

set_shape(shape)

setdiag(values[, k]) Fills the diagonal elements {a_ii} with the values from the given sequence.

sign() Element-wise sign.

sin() Element-wise sin.

sinh() Element-wise sinh.

sort_indices() Sort the indices of this matrix in place

sorted_indices() Return a copy of this matrix with sorted indices

sqrt() Element-wise sqrt.

sum([axis]) Sum the matrix over the given axis.

sum_duplicates() Eliminate duplicate matrix entries by adding them together

tan() Element-wise tan.

tanh() Element-wise tanh.

toarray([order, out]) See the docstring for spmatrix.toarray.

tobsr([blocksize])

tocoo([copy]) Return a COOrdinate representation of this matrix

tocsc([copy])

tocsr()

todense([order, out]) Return a dense matrix representation of this matrix.

todia()

todok()

tolil()

transpose([copy])

trunc() Element-wise trunc.

csc_matrix.arcsin()

Element-wise arcsin.

See numpy.arcsin for more information.

csc_matrix.arcsinh()

Element-wise arcsinh.

See numpy.arcsinh for more information.

csc_matrix.arctan()

Element-wise arctan.

See numpy.arctan for more information.

csc_matrix.arctanh()

Element-wise arctanh.

See numpy.arctanh for more information.

5.23. Sparse matrices (scipy.sparse) 709

SciPy Reference Guide, Release 0.13.0

csc_matrix.asformat(format)

Return this matrix in a given sparse format

Parameters format : {string, None}

desired sparse matrix format

•None for no format conversion

•“csr” for csr_matrix format

•“csc” for csc_matrix format

•“lil” for lil_matrix format

•“dok” for dok_matrix format and so on

csc_matrix.asfptype()

Upcast matrix to a ﬂoating point format (if necessary)

csc_matrix.astype(t)

csc_matrix.ceil()

Element-wise ceil.

See numpy.ceil for more information.

csc_matrix.check_format(full_check=True)

check whether the matrix format is valid

Parameters - full_check : {bool}

•True - rigorous check, O(N) operations : default

•False - basic check, O(1) operations

csc_matrix.conj()

csc_matrix.conjugate()

csc_matrix.copy()

csc_matrix.deg2rad()

Element-wise deg2rad.

See numpy.deg2rad for more information.

csc_matrix.diagonal()

Returns the main diagonal of the matrix

csc_matrix.dot(other)

Ordinary dot product

Examples

>>> import numpy as np

>>> from scipy.sparse import csr_matrix

>>> A=csr_matrix([[1,2,0], [0,0,3], [4,0,5]])

>>> v=np.array([1,0,-1])

>>> A.dot(v)

array([ 1, -3, -1], dtype=int64)

csc_matrix.eliminate_zeros()

Remove zero entries from the matrix

The is an in place operation

710 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

csc_matrix.expm1()

Element-wise expm1.

See numpy.expm1 for more information.

csc_matrix.floor()

Element-wise ﬂoor.

See numpy.ﬂoor for more information.

csc_matrix.getH()

csc_matrix.get_shape()

csc_matrix.getcol(i)

Returns a copy of column i of the matrix, as a (m x 1) CSC matrix (column vector).

csc_matrix.getformat()

csc_matrix.getmaxprint()

csc_matrix.getnnz()

csc_matrix.getrow(i)

Returns a copy of row i of the matrix, as a (1 x n) CSR matrix (row vector).

csc_matrix.log1p()

Element-wise log1p.

See numpy.log1p for more information.

csc_matrix.max()

Maximum of the elements of this matrix.

This takes all elements into account, not just the non-zero ones.

Returns amax : self.dtype

Maximum element.

csc_matrix.mean(axis=None)

Average the matrix over the given axis. If the axis is None, average over both rows and columns, returning

a scalar.

csc_matrix.min()

Minimum of the elements of this matrix.

This takes all elements into account, not just the non-zero ones.

Returns amin : self.dtype

Minimum element.

csc_matrix.multiply(other)

Point-wise multiplication by another matrix, vector, or scalar.

csc_matrix.nonzero()

nonzero indices

Returns a tuple of arrays (row,col) containing the indices of the non-zero elements of the matrix.

5.23. Sparse matrices (scipy.sparse) 711

SciPy Reference Guide, Release 0.13.0

Examples

>>> from scipy.sparse import csr_matrix

>>> A=csr_matrix([[1,2,0],[0,0,3],[4,0,5]])

>>> A.nonzero()

(array([0, 0, 1, 2, 2]), array([0, 1, 2, 0, 2]))

csc_matrix.prune()

Remove empty space after all non-zero elements.

csc_matrix.rad2deg()

Element-wise rad2deg.

See numpy.rad2deg for more information.

csc_matrix.reshape(shape)

csc_matrix.rint()

Element-wise rint.

See numpy.rint for more information.

csc_matrix.set_shape(shape)

csc_matrix.setdiag(values,k=0)

Fills the diagonal elements {a_ii} with the values from the given sequence. If k != 0, ﬁlls the off-diagonal

elements {a_{i,i+k}} instead.

values may have any length. If the diagonal is longer than values, then the remaining diagonal entries will

not be set. If values if longer than the diagonal, then the remaining values are ignored.

csc_matrix.sign()

Element-wise sign.

See numpy.sign for more information.

csc_matrix.sin()

Element-wise sin.

See numpy.sin for more information.

csc_matrix.sinh()

Element-wise sinh.

See numpy.sinh for more information.

csc_matrix.sort_indices()

Sort the indices of this matrix in place

csc_matrix.sorted_indices()

Return a copy of this matrix with sorted indices

csc_matrix.sqrt()

Element-wise sqrt.

See numpy.sqrt for more information.

csc_matrix.sum(axis=None)

Sum the matrix over the given axis. If the axis is None, sum over both rows and columns, returning a

scalar.

712 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

csc_matrix.sum_duplicates()

Eliminate duplicate matrix entries by adding them together

The is an in place operation

csc_matrix.tan()

Element-wise tan.

See numpy.tan for more information.

csc_matrix.tanh()

Element-wise tanh.

See numpy.tanh for more information.

csc_matrix.toarray(order=None,out=None)

See the docstring for spmatrix.toarray.

csc_matrix.tobsr(blocksize=None)

csc_matrix.tocoo(copy=True)

Return a COOrdinate representation of this matrix

When copy=False the index and data arrays are not copied.

csc_matrix.tocsc(copy=False)

csc_matrix.tocsr()

csc_matrix.todense(order=None,out=None)

Return a dense matrix representation of this matrix.

Parameters order : {‘C’, ‘F’}, optional

Whether to store multi-dimensional data in C (row-major) or Fortran

(column-major) order in memory. The default is ‘None’, indicating

the NumPy default of C-ordered. Cannot be speciﬁed in conjunction

with the out argument.

out : ndarray, 2-dimensional, optional

If speciﬁed, uses this array (or numpy.matrix) as the output buffer

instead of allocating a new array to return. The provided array must

have the same shape and dtype as the sparse matrix on which you are

calling the method.

Returns arr : numpy.matrix, 2-dimensional

A NumPy matrix object with the same shape and containing the

same data represented by the sparse matrix, with the requested

memory order. If out was passed and was an array (rather than a

numpy.matrix), it will be ﬁlled with the appropriate values and

returned wrapped in a numpy.matrix object that shares the same

memory.

csc_matrix.todia()

csc_matrix.todok()

csc_matrix.tolil()

5.23. Sparse matrices (scipy.sparse) 713

SciPy Reference Guide, Release 0.13.0

csc_matrix.transpose(copy=False)

csc_matrix.trunc()

Element-wise trunc.

See numpy.trunc for more information.

class scipy.sparse.csr_matrix(arg1,shape=None,dtype=None,copy=False)

Compressed Sparse Row matrix

This can be instantiated in several ways:

csr_matrix(D) with a dense matrix or rank-2 ndarray D

csr_matrix(S) with another sparse matrix S (equivalent to S.tocsr())

csr_matrix((M, N), [dtype])

to construct an empty matrix with shape (M, N) dtype is optional, defaulting to

dtype=’d’.

csr_matrix((data, ij), [shape=(M, N)])

where data and ij satisfy the relationship a[ij[0, k], ij[1, k]] =

data[k]

csr_matrix((data, indices, indptr), [shape=(M, N)])

is the standard CSR representation where the column indices for row i are stored in

indices[indptr[i]:indptr[i+1]] and their corresponding values are

stored in data[indptr[i]:indptr[i+1]]. If the shape parameter is not

supplied, the matrix dimensions are inferred from the index arrays.

Notes

Sparse matrices can be used in arithmetic operations: they support addition, subtraction, multiplication, division,

and matrix power.

Advantages of the CSR format

•efﬁcient arithmetic operations CSR + CSR, CSR * CSR, etc.

•efﬁcient row slicing

•fast matrix vector products

Disadvantages of the CSR format

•slow column slicing operations (consider CSC)

•changes to the sparsity structure are expensive (consider LIL or DOK)

Examples

>>> from scipy.sparse import *

>>> from scipy import *

>>> csr_matrix( (3,4), dtype=int8 ).todense()

matrix([[0, 0, 0, 0],

[0, 0, 0, 0],

[0, 0, 0, 0]], dtype=int8)

>>> row =array([0,0,1,2,2,2])

>>> col =array([0,2,2,0,1,2])

>>> data =array([1,2,3,4,5,6])

>>> csr_matrix( (data,(row,col)), shape=(3,3) ).todense()

matrix([[1, 0, 2],

[0, 0, 3],

[4, 5, 6]])

714 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

>>> indptr =array([0,2,3,6])

>>> indices =array([0,2,2,0,1,2])

>>> data =array([1,2,3,4,5,6])

>>> csr_matrix( (data,indices,indptr), shape=(3,3) ).todense()

matrix([[1, 0, 2],

[0, 0, 3],

[4, 5, 6]])

Attributes

has_sorted_indices Determine whether the matrix has sorted indices

csr_matrix.has_sorted_indices

Determine whether the matrix has sorted indices

Returns

•True: if the indices of the matrix are in sorted order

•False: otherwise

dtype (dtype) Data type of the matrix

shape (2-tuple) Shape of the matrix

ndim (int) Number of dimensions (this is always 2)

nnz Number of nonzero elements

data CSR format data array of the matrix

indices CSR format index array of the matrix

indptr CSR format index pointer array of the matrix

Methods

arcsin() Element-wise arcsin.

arcsinh() Element-wise arcsinh.

arctan() Element-wise arctan.

arctanh() Element-wise arctanh.

asformat(format) Return this matrix in a given sparse format

asfptype() Upcast matrix to a ﬂoating point format (if necessary)

astype(t)

ceil() Element-wise ceil.

check_format([full_check]) check whether the matrix format is valid

conj()

conjugate()

copy()

deg2rad() Element-wise deg2rad.

diagonal() Returns the main diagonal of the matrix

dot(other) Ordinary dot product ..

eliminate_zeros() Remove zero entries from the matrix

expm1() Element-wise expm1.

floor() Element-wise ﬂoor.

getH()

get_shape()

getcol(i) Returns a copy of column i of the matrix, as a (m x 1)

getformat()

getmaxprint()

Continued on next page

5.23. Sparse matrices (scipy.sparse) 715

SciPy Reference Guide, Release 0.13.0

Table 5.127 – continued from previous page

getnnz()

getrow(i) Returns a copy of row i of the matrix, as a (1 x n)

log1p() Element-wise log1p.

max() Maximum of the elements of this matrix.

mean([axis]) Average the matrix over the given axis.

min() Minimum of the elements of this matrix.

multiply(other) Point-wise multiplication by another matrix, vector, or

nonzero() nonzero indices

prune() Remove empty space after all non-zero elements.

rad2deg() Element-wise rad2deg.

reshape(shape)

rint() Element-wise rint.

set_shape(shape)

setdiag(values[, k]) Fills the diagonal elements {a_ii} with the values from the given sequence.

sign() Element-wise sign.

sin() Element-wise sin.

sinh() Element-wise sinh.

sort_indices() Sort the indices of this matrix in place

sorted_indices() Return a copy of this matrix with sorted indices

sqrt() Element-wise sqrt.

sum([axis]) Sum the matrix over the given axis.

sum_duplicates() Eliminate duplicate matrix entries by adding them together

tan() Element-wise tan.

tanh() Element-wise tanh.

toarray([order, out]) See the docstring for spmatrix.toarray.

tobsr([blocksize, copy])

tocoo([copy]) Return a COOrdinate representation of this matrix

tocsc()

tocsr([copy])

todense([order, out]) Return a dense matrix representation of this matrix.

todia()

todok()

tolil()

transpose([copy])

trunc() Element-wise trunc.

csr_matrix.arcsin()

Element-wise arcsin.

See numpy.arcsin for more information.

csr_matrix.arcsinh()

Element-wise arcsinh.

See numpy.arcsinh for more information.

csr_matrix.arctan()

Element-wise arctan.

See numpy.arctan for more information.

csr_matrix.arctanh()

Element-wise arctanh.

See numpy.arctanh for more information.

716 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

csr_matrix.asformat(format)

Return this matrix in a given sparse format

Parameters format : {string, None}

desired sparse matrix format

•None for no format conversion

•“csr” for csr_matrix format

•“csc” for csc_matrix format

•“lil” for lil_matrix format

•“dok” for dok_matrix format and so on

csr_matrix.asfptype()

Upcast matrix to a ﬂoating point format (if necessary)

csr_matrix.astype(t)

csr_matrix.ceil()

Element-wise ceil.

See numpy.ceil for more information.

csr_matrix.check_format(full_check=True)

check whether the matrix format is valid

Parameters - full_check : {bool}

•True - rigorous check, O(N) operations : default

•False - basic check, O(1) operations

csr_matrix.conj()

csr_matrix.conjugate()

csr_matrix.copy()

csr_matrix.deg2rad()

Element-wise deg2rad.

See numpy.deg2rad for more information.

csr_matrix.diagonal()

Returns the main diagonal of the matrix

csr_matrix.dot(other)

Ordinary dot product

Examples

>>> import numpy as np

>>> from scipy.sparse import csr_matrix

>>> A=csr_matrix([[1,2,0], [0,0,3], [4,0,5]])

>>> v=np.array([1,0,-1])

>>> A.dot(v)

array([ 1, -3, -1], dtype=int64)

csr_matrix.eliminate_zeros()

Remove zero entries from the matrix

The is an in place operation

5.23. Sparse matrices (scipy.sparse) 717

SciPy Reference Guide, Release 0.13.0

csr_matrix.expm1()

Element-wise expm1.

See numpy.expm1 for more information.

csr_matrix.floor()

Element-wise ﬂoor.

See numpy.ﬂoor for more information.

csr_matrix.getH()

csr_matrix.get_shape()

csr_matrix.getcol(i)

Returns a copy of column i of the matrix, as a (m x 1) CSR matrix (column vector).

csr_matrix.getformat()

csr_matrix.getmaxprint()

csr_matrix.getnnz()

csr_matrix.getrow(i)

Returns a copy of row i of the matrix, as a (1 x n) CSR matrix (row vector).

csr_matrix.log1p()

Element-wise log1p.

See numpy.log1p for more information.

csr_matrix.max()

Maximum of the elements of this matrix.

This takes all elements into account, not just the non-zero ones.

Returns amax : self.dtype

Maximum element.

csr_matrix.mean(axis=None)

Average the matrix over the given axis. If the axis is None, average over both rows and columns, returning

a scalar.

csr_matrix.min()

Minimum of the elements of this matrix.

This takes all elements into account, not just the non-zero ones.

Returns amin : self.dtype

Minimum element.

csr_matrix.multiply(other)

Point-wise multiplication by another matrix, vector, or scalar.

csr_matrix.nonzero()

nonzero indices

Returns a tuple of arrays (row,col) containing the indices of the non-zero elements of the matrix.

718 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Examples

>>> from scipy.sparse import csr_matrix

>>> A=csr_matrix([[1,2,0],[0,0,3],[4,0,5]])

>>> A.nonzero()

(array([0, 0, 1, 2, 2]), array([0, 1, 2, 0, 2]))

csr_matrix.prune()

Remove empty space after all non-zero elements.

csr_matrix.rad2deg()

Element-wise rad2deg.

See numpy.rad2deg for more information.

csr_matrix.reshape(shape)

csr_matrix.rint()

Element-wise rint.

See numpy.rint for more information.

csr_matrix.set_shape(shape)

csr_matrix.setdiag(values,k=0)

Fills the diagonal elements {a_ii} with the values from the given sequence. If k != 0, ﬁlls the off-diagonal

elements {a_{i,i+k}} instead.

values may have any length. If the diagonal is longer than values, then the remaining diagonal entries will

not be set. If values if longer than the diagonal, then the remaining values are ignored.

csr_matrix.sign()

Element-wise sign.

See numpy.sign for more information.

csr_matrix.sin()

Element-wise sin.

See numpy.sin for more information.

csr_matrix.sinh()

Element-wise sinh.

See numpy.sinh for more information.

csr_matrix.sort_indices()

Sort the indices of this matrix in place

csr_matrix.sorted_indices()

Return a copy of this matrix with sorted indices

csr_matrix.sqrt()

Element-wise sqrt.

See numpy.sqrt for more information.

csr_matrix.sum(axis=None)

Sum the matrix over the given axis. If the axis is None, sum over both rows and columns, returning a

scalar.

5.23. Sparse matrices (scipy.sparse) 719

SciPy Reference Guide, Release 0.13.0

csr_matrix.sum_duplicates()

Eliminate duplicate matrix entries by adding them together

The is an in place operation

csr_matrix.tan()

Element-wise tan.

See numpy.tan for more information.

csr_matrix.tanh()

Element-wise tanh.

See numpy.tanh for more information.

csr_matrix.toarray(order=None,out=None)

See the docstring for spmatrix.toarray.

csr_matrix.tobsr(blocksize=None,copy=True)

csr_matrix.tocoo(copy=True)

Return a COOrdinate representation of this matrix

When copy=False the index and data arrays are not copied.

csr_matrix.tocsc()

csr_matrix.tocsr(copy=False)

csr_matrix.todense(order=None,out=None)

Return a dense matrix representation of this matrix.

Parameters order : {‘C’, ‘F’}, optional

Whether to store multi-dimensional data in C (row-major) or Fortran

(column-major) order in memory. The default is ‘None’, indicating

the NumPy default of C-ordered. Cannot be speciﬁed in conjunction

with the out argument.

out : ndarray, 2-dimensional, optional

If speciﬁed, uses this array (or numpy.matrix) as the output buffer

instead of allocating a new array to return. The provided array must

have the same shape and dtype as the sparse matrix on which you are

calling the method.

Returns arr : numpy.matrix, 2-dimensional

A NumPy matrix object with the same shape and containing the

same data represented by the sparse matrix, with the requested

memory order. If out was passed and was an array (rather than a

numpy.matrix), it will be ﬁlled with the appropriate values and

returned wrapped in a numpy.matrix object that shares the same

memory.

csr_matrix.todia()

csr_matrix.todok()

csr_matrix.tolil()

720 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

csr_matrix.transpose(copy=False)

csr_matrix.trunc()

Element-wise trunc.

See numpy.trunc for more information.

class scipy.sparse.dia_matrix(arg1,shape=None,dtype=None,copy=False)

Sparse matrix with DIAgonal storage

This can be instantiated in several ways:

dia_matrix(D)with a dense matrix

dia_matrix(S) with another sparse matrix S (equivalent to S.todia())

dia_matrix((M, N), [dtype])

to construct an empty matrix with shape (M, N), dtype is optional, defaulting to

dtype=’d’.

dia_matrix((data, offsets), shape=(M, N))

where the data[k,:] stores the diagonal entries for diagonal offsets[k]

(See example below)

Notes

Sparse matrices can be used in arithmetic operations: they support addition, subtraction, multiplication, division,

and matrix power.

Examples

>>> from scipy.sparse import *

>>> from scipy import *

>>> dia_matrix( (3,4), dtype=int8).todense()

matrix([[0, 0, 0, 0],

[0, 0, 0, 0],

[0, 0, 0, 0]], dtype=int8)

>>> data =array([[1,2,3,4]]).repeat(3,axis=0)

>>> offsets =array([0,-1,2])

>>> dia_matrix( (data,offsets), shape=(4,4)).todense()

matrix([[1, 0, 3, 0],

[1, 2, 0, 4],

[0, 2, 3, 0],

[0, 0, 3, 4]])

Attributes

nnz number of nonzero values

dia_matrix.nnz

number of nonzero values

explicit zero values are included in this number

5.23. Sparse matrices (scipy.sparse) 721

SciPy Reference Guide, Release 0.13.0

dtype (dtype) Data type of the matrix

shape (2-tuple) Shape of the matrix

ndim (int) Number of dimensions (this is always 2)

data DIA format data array of the matrix

offsets DIA format offset array of the matrix

Methods

arcsin() Element-wise arcsin.

arcsinh() Element-wise arcsinh.

arctan() Element-wise arctan.

arctanh() Element-wise arctanh.

asformat(format) Return this matrix in a given sparse format

asfptype() Upcast matrix to a ﬂoating point format (if necessary)

astype(t)

ceil() Element-wise ceil.

conj()

conjugate()

copy()

deg2rad() Element-wise deg2rad.

diagonal() Returns the main diagonal of the matrix

dot(other) Ordinary dot product ..

expm1() Element-wise expm1.

floor() Element-wise ﬂoor.

getH()

get_shape()

getcol(j) Returns a copy of column j of the matrix, as an (m x 1) sparse

getformat()

getmaxprint()

getnnz() number of nonzero values

getrow(i) Returns a copy of row i of the matrix, as a (1 x n) sparse

log1p() Element-wise log1p.

mean([axis]) Average the matrix over the given axis.

multiply(other) Point-wise multiplication by another matrix

nonzero() nonzero indices

rad2deg() Element-wise rad2deg.

reshape(shape)

rint() Element-wise rint.

set_shape(shape)

setdiag(values[, k]) Fills the diagonal elements {a_ii} with the values from the given sequence.

sign() Element-wise sign.

sin() Element-wise sin.

sinh() Element-wise sinh.

sqrt() Element-wise sqrt.

sum([axis]) Sum the matrix over the given axis.

tan() Element-wise tan.

tanh() Element-wise tanh.

toarray([order, out]) Return a dense ndarray representation of this matrix.

tobsr([blocksize])

tocoo()

tocsc()

Continued on next page

722 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Table 5.129 – continued from previous page

tocsr()

todense([order, out]) Return a dense matrix representation of this matrix.

todia([copy])

todok()

tolil()

transpose()

trunc() Element-wise trunc.

dia_matrix.arcsin()

Element-wise arcsin.

See numpy.arcsin for more information.

dia_matrix.arcsinh()

Element-wise arcsinh.

See numpy.arcsinh for more information.

dia_matrix.arctan()

Element-wise arctan.

See numpy.arctan for more information.

dia_matrix.arctanh()

Element-wise arctanh.

See numpy.arctanh for more information.

dia_matrix.asformat(format)

Return this matrix in a given sparse format

Parameters format : {string, None}

desired sparse matrix format

•None for no format conversion

•“csr” for csr_matrix format

•“csc” for csc_matrix format

•“lil” for lil_matrix format

•“dok” for dok_matrix format and so on

dia_matrix.asfptype()

Upcast matrix to a ﬂoating point format (if necessary)

dia_matrix.astype(t)

dia_matrix.ceil()

Element-wise ceil.

See numpy.ceil for more information.

dia_matrix.conj()

dia_matrix.conjugate()

dia_matrix.copy()

5.23. Sparse matrices (scipy.sparse) 723

SciPy Reference Guide, Release 0.13.0

dia_matrix.deg2rad()

Element-wise deg2rad.

See numpy.deg2rad for more information.

dia_matrix.diagonal()

Returns the main diagonal of the matrix

dia_matrix.dot(other)

Ordinary dot product

Examples

>>> import numpy as np

>>> from scipy.sparse import csr_matrix

>>> A=csr_matrix([[1,2,0], [0,0,3], [4,0,5]])

>>> v=np.array([1,0,-1])

>>> A.dot(v)

array([ 1, -3, -1], dtype=int64)

dia_matrix.expm1()

Element-wise expm1.

See numpy.expm1 for more information.

dia_matrix.floor()

Element-wise ﬂoor.

See numpy.ﬂoor for more information.

dia_matrix.getH()

dia_matrix.get_shape()

dia_matrix.getcol(j)

Returns a copy of column j of the matrix, as an (m x 1) sparse matrix (column vector).

dia_matrix.getformat()

dia_matrix.getmaxprint()

dia_matrix.getnnz()

number of nonzero values

explicit zero values are included in this number

dia_matrix.getrow(i)

Returns a copy of row i of the matrix, as a (1 x n) sparse matrix (row vector).

dia_matrix.log1p()

Element-wise log1p.

See numpy.log1p for more information.

dia_matrix.mean(axis=None)

Average the matrix over the given axis. If the axis is None, average over both rows and columns, returning

a scalar.

dia_matrix.multiply(other)

Point-wise multiplication by another matrix

724 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

dia_matrix.nonzero()

nonzero indices

Returns a tuple of arrays (row,col) containing the indices of the non-zero elements of the matrix.

Examples

>>> from scipy.sparse import csr_matrix

>>> A=csr_matrix([[1,2,0],[0,0,3],[4,0,5]])

>>> A.nonzero()

(array([0, 0, 1, 2, 2]), array([0, 1, 2, 0, 2]))

dia_matrix.rad2deg()

Element-wise rad2deg.

See numpy.rad2deg for more information.

dia_matrix.reshape(shape)

dia_matrix.rint()

Element-wise rint.

See numpy.rint for more information.

dia_matrix.set_shape(shape)

dia_matrix.setdiag(values,k=0)

Fills the diagonal elements {a_ii} with the values from the given sequence. If k != 0, ﬁlls the off-diagonal

elements {a_{i,i+k}} instead.

values may have any length. If the diagonal is longer than values, then the remaining diagonal entries will

not be set. If values if longer than the diagonal, then the remaining values are ignored.

dia_matrix.sign()

Element-wise sign.

See numpy.sign for more information.

dia_matrix.sin()

Element-wise sin.

See numpy.sin for more information.

dia_matrix.sinh()

Element-wise sinh.

See numpy.sinh for more information.

dia_matrix.sqrt()

Element-wise sqrt.

See numpy.sqrt for more information.

dia_matrix.sum(axis=None)

Sum the matrix over the given axis. If the axis is None, sum over both rows and columns, returning a

scalar.

dia_matrix.tan()

Element-wise tan.

See numpy.tan for more information.

5.23. Sparse matrices (scipy.sparse) 725

SciPy Reference Guide, Release 0.13.0

dia_matrix.tanh()

Element-wise tanh.

See numpy.tanh for more information.

dia_matrix.toarray(order=None,out=None)

Return a dense ndarray representation of this matrix.

Parameters order : {‘C’, ‘F’}, optional

Whether to store multi-dimensional data in C (row-major) or Fortran

(column-major) order in memory. The default is ‘None’, indicating

the NumPy default of C-ordered. Cannot be speciﬁed in conjunction

with the out argument.

out : ndarray, 2-dimensional, optional

If speciﬁed, uses this array as the output buffer instead of allocating a

new array to return. The provided array must have the same shape and

dtype as the sparse matrix on which you are calling the method. For

most sparse types, out is required to be memory contiguous (either C

or Fortran ordered).

Returns arr : ndarray, 2-dimensional

An array with the same shape and containing the same data repre-

sented by the sparse matrix, with the requested memory order. If out

was passed, the same object is returned after being modiﬁed in-place

to contain the appropriate values.

dia_matrix.tobsr(blocksize=None)

dia_matrix.tocoo()

dia_matrix.tocsc()

dia_matrix.tocsr()

dia_matrix.todense(order=None,out=None)

Return a dense matrix representation of this matrix.

Parameters order : {‘C’, ‘F’}, optional

Whether to store multi-dimensional data in C (row-major) or Fortran

(column-major) order in memory. The default is ‘None’, indicating

the NumPy default of C-ordered. Cannot be speciﬁed in conjunction

with the out argument.

out : ndarray, 2-dimensional, optional

If speciﬁed, uses this array (or numpy.matrix) as the output buffer

instead of allocating a new array to return. The provided array must

have the same shape and dtype as the sparse matrix on which you are

calling the method.

Returns arr : numpy.matrix, 2-dimensional

A NumPy matrix object with the same shape and containing the

same data represented by the sparse matrix, with the requested

memory order. If out was passed and was an array (rather than a

numpy.matrix), it will be ﬁlled with the appropriate values and

returned wrapped in a numpy.matrix object that shares the same

memory.

dia_matrix.todia(copy=False)

726 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

dia_matrix.todok()

dia_matrix.tolil()

dia_matrix.transpose()

dia_matrix.trunc()

Element-wise trunc.

See numpy.trunc for more information.

class scipy.sparse.dok_matrix(arg1,shape=None,dtype=None,copy=False)

Dictionary Of Keys based sparse matrix.

This is an efﬁcient structure for constructing sparse matrices incrementally.

This can be instantiated in several ways:

dok_matrix(D)

with a dense matrix, D

dok_matrix(S)with a sparse matrix, S

dok_matrix((M,N), [dtype])

create the matrix with initial shape (M,N) dtype is optional, defaulting to

dtype=’d’

Notes

Sparse matrices can be used in arithmetic operations: they support addition, subtraction, multiplication, division,

and matrix power.

Allows for efﬁcient O(1) access of individual elements. Duplicates are not allowed. Can be efﬁciently converted

to a coo_matrix once constructed.

Examples

>>> from scipy.sparse import *

>>> from scipy import *

>>> S=dok_matrix((5,5), dtype=float32)

>>> for iin range(5):

>>> for jin range(5):

>>> S[i,j] =i+j# Update element

Attributes

dtype (dtype) Data type of the matrix

shape (2-tuple) Shape of the matrix

ndim (int) Number of dimensions (this is always 2)

nnz Number of nonzero elements

Methods

asformat(format) Return this matrix in a given sparse format

asfptype() Upcast matrix to a ﬂoating point format (if necessary)

astype(t)

clear(() -> None. Remove all items from D.)

Continued on next page

5.23. Sparse matrices (scipy.sparse) 727

SciPy Reference Guide, Release 0.13.0

Table 5.130 – continued from previous page

conj()

conjtransp() Return the conjugate transpose

conjugate()

copy()

diagonal() Returns the main diagonal of the matrix

dot(other) Ordinary dot product ..

fromkeys(...) v defaults to None.

get(key[, default]) This overrides the dict.get method, providing type checking

getH()

get_shape()

getcol(j) Returns a copy of column j of the matrix as a (m x 1)

getformat()

getmaxprint()

getnnz()

getrow(i) Returns a copy of row i of the matrix as a (1 x n)

has_key((k) -> True if D has a key k, else False)

items(() -> list of D’s (key, value) pairs, ...)

iteritems(() -> an iterator over the (key, ...)

iterkeys(() -> an iterator over the keys of D)

itervalues(...)

keys(() -> list of D’s keys)

mean([axis]) Average the matrix over the given axis.

multiply(other) Point-wise multiplication by another matrix

nonzero() nonzero indices

pop((k[,d]) -> v, ...) If key is not found, d is returned if given, otherwise KeyError is raised

popitem(() -> (k, v), ...) 2-tuple; but raise KeyError if D is empty.

reshape(shape)

resize(shape) Resize the matrix in-place to dimensions given by ‘shape’.

set_shape(shape)

setdefault((k[,d]) -> D.get(k,d), ...)

setdiag(values[, k]) Fills the diagonal elements {a_ii} with the values from the given sequence.

sum([axis]) Sum the matrix over the given axis.

toarray([order, out]) See the docstring for spmatrix.toarray.

tobsr([blocksize])

tocoo() Return a copy of this matrix in COOrdinate format

tocsc() Return a copy of this matrix in Compressed Sparse Column format

tocsr() Return a copy of this matrix in Compressed Sparse Row format

todense([order, out]) Return a dense matrix representation of this matrix.

todia()

todok([copy])

tolil()

transpose() Return the transpose

update(([E, ...) If E present and has a .keys() method, does: for k in E: D[k] = E[k]

values(() -> list of D’s values)

viewitems(...)

viewkeys(...)

viewvalues(...)

dok_matrix.asformat(format)

Return this matrix in a given sparse format

728 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Parameters format : {string, None}

desired sparse matrix format

•None for no format conversion

•“csr” for csr_matrix format

•“csc” for csc_matrix format

•“lil” for lil_matrix format

•“dok” for dok_matrix format and so on

dok_matrix.asfptype()

Upcast matrix to a ﬂoating point format (if necessary)

dok_matrix.astype(t)

dok_matrix.clear() →None. Remove all items from D.

dok_matrix.conj()

dok_matrix.conjtransp()

Return the conjugate transpose

dok_matrix.conjugate()

dok_matrix.copy()

dok_matrix.diagonal()

Returns the main diagonal of the matrix

dok_matrix.dot(other)

Ordinary dot product

Examples

>>> import numpy as np

>>> from scipy.sparse import csr_matrix

>>> A=csr_matrix([[1,2,0], [0,0,3], [4,0,5]])

>>> v=np.array([1,0,-1])

>>> A.dot(v)

array([ 1, -3, -1], dtype=int64)

static dok_matrix.fromkeys(S[,v])→New dict with keys from S and values equal to v.

v defaults to None.

dok_matrix.get(key,default=0.0)

This overrides the dict.get method, providing type checking but otherwise equivalent functionality.

dok_matrix.getH()

dok_matrix.get_shape()

dok_matrix.getcol(j)

Returns a copy of column j of the matrix as a (m x 1) DOK matrix.

dok_matrix.getformat()

5.23. Sparse matrices (scipy.sparse) 729

SciPy Reference Guide, Release 0.13.0

dok_matrix.getmaxprint()

dok_matrix.getnnz()

dok_matrix.getrow(i)

Returns a copy of row i of the matrix as a (1 x n) DOK matrix.

dok_matrix.has_key(k)→True if D has a key k, else False

dok_matrix.items() →list of D’s (key, value) pairs, as 2-tuples

dok_matrix.iteritems() →an iterator over the (key, value) items of D

dok_matrix.iterkeys() →an iterator over the keys of D

dok_matrix.itervalues() →an iterator over the values of D

dok_matrix.keys() →list of D’s keys

dok_matrix.mean(axis=None)

Average the matrix over the given axis. If the axis is None, average over both rows and columns, returning

a scalar.

dok_matrix.multiply(other)

Point-wise multiplication by another matrix

dok_matrix.nonzero()

nonzero indices

Returns a tuple of arrays (row,col) containing the indices of the non-zero elements of the matrix.

Examples

>>> from scipy.sparse import csr_matrix

>>> A=csr_matrix([[1,2,0],[0,0,3],[4,0,5]])

>>> A.nonzero()

(array([0, 0, 1, 2, 2]), array([0, 1, 2, 0, 2]))

dok_matrix.pop(k[,d])→v, remove speciﬁed key and return the corresponding value.

If key is not found, d is returned if given, otherwise KeyError is raised

dok_matrix.popitem() →(k, v), remove and return some (key, value) pair as a

2-tuple; but raise KeyError if D is empty.

dok_matrix.reshape(shape)

dok_matrix.resize(shape)

Resize the matrix in-place to dimensions given by ‘shape’.

Any non-zero elements that lie outside the new shape are removed.

dok_matrix.set_shape(shape)

730 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

dok_matrix.setdefault(k[,d])→D.get(k,d), also set D[k]=d if k not in D

dok_matrix.setdiag(values,k=0)

Fills the diagonal elements {a_ii} with the values from the given sequence. If k != 0, ﬁlls the off-diagonal

elements {a_{i,i+k}} instead.

values may have any length. If the diagonal is longer than values, then the remaining diagonal entries will

not be set. If values if longer than the diagonal, then the remaining values are ignored.

dok_matrix.sum(axis=None)

Sum the matrix over the given axis. If the axis is None, sum over both rows and columns, returning a

scalar.

dok_matrix.toarray(order=None,out=None)

See the docstring for spmatrix.toarray.

dok_matrix.tobsr(blocksize=None)

dok_matrix.tocoo()

Return a copy of this matrix in COOrdinate format

dok_matrix.tocsc()

Return a copy of this matrix in Compressed Sparse Column format

dok_matrix.tocsr()

Return a copy of this matrix in Compressed Sparse Row format

dok_matrix.todense(order=None,out=None)

Return a dense matrix representation of this matrix.

Parameters order : {‘C’, ‘F’}, optional

Whether to store multi-dimensional data in C (row-major) or Fortran

(column-major) order in memory. The default is ‘None’, indicating

the NumPy default of C-ordered. Cannot be speciﬁed in conjunction

with the out argument.

out : ndarray, 2-dimensional, optional

If speciﬁed, uses this array (or numpy.matrix) as the output buffer

instead of allocating a new array to return. The provided array must

have the same shape and dtype as the sparse matrix on which you are

calling the method.

Returns arr : numpy.matrix, 2-dimensional

A NumPy matrix object with the same shape and containing the

same data represented by the sparse matrix, with the requested

memory order. If out was passed and was an array (rather than a

numpy.matrix), it will be ﬁlled with the appropriate values and

returned wrapped in a numpy.matrix object that shares the same

memory.

dok_matrix.todia()

dok_matrix.todok(copy=False)

dok_matrix.tolil()

dok_matrix.transpose()

Return the transpose

5.23. Sparse matrices (scipy.sparse) 731

SciPy Reference Guide, Release 0.13.0

dok_matrix.update([E],**F)→None. Update D from dict/iterable E and F.

If E present and has a .keys() method, does: for k in E: D[k] = E[k] If E present and lacks .keys() method,

does: for (k, v) in E: D[k] = v In either case, this is followed by: for k in F: D[k] = F[k]

dok_matrix.values() →list of D’s values

dok_matrix.viewitems() →a set-like object providing a view on D’s items

dok_matrix.viewkeys() →a set-like object providing a view on D’s keys

dok_matrix.viewvalues() →an object providing a view on D’s values

class scipy.sparse.lil_matrix(arg1,shape=None,dtype=None,copy=False)

Row-based linked list sparse matrix

This is an efﬁcient structure for constructing sparse matrices incrementally.

This can be instantiated in several ways:

lil_matrix(D) with a dense matrix or rank-2 ndarray D

lil_matrix(S) with another sparse matrix S (equivalent to S.tolil())

lil_matrix((M, N), [dtype])

to construct an empty matrix with shape (M, N) dtype is optional, defaulting to

dtype=’d’.

Notes

Sparse matrices can be used in arithmetic operations: they support addition, subtraction, multiplication, division,

and matrix power.

Advantages of the LIL format

•supports ﬂexible slicing

•changes to the matrix sparsity structure are efﬁcient

Disadvantages of the LIL format

•arithmetic operations LIL + LIL are slow (consider CSR or CSC)

•slow column slicing (consider CSC)

•slow matrix vector products (consider CSR or CSC)

Intended Usage

•LIL is a convenient format for constructing sparse matrices

•once a matrix has been constructed, convert to CSR or CSC format for fast arith-

metic and matrix vector operations

•consider using the COO format when constructing large matrices

Data Structure

•An array (self.rows) of rows, each of which is a sorted list of column indices

of non-zero elements.

•The corresponding nonzero values are stored in similar fashion in self.data.

732 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Attributes

dtype (dtype) Data type of the matrix

shape (2-tuple) Shape of the matrix

ndim (int) Number of dimensions (this is always 2)

nnz Number of nonzero elements

data LIL format data array of the matrix

rows LIL format row index array of the matrix

Methods

asformat(format) Return this matrix in a given sparse format

asfptype() Upcast matrix to a ﬂoating point format (if necessary)

astype(t)

conj()

conjugate()

copy()

diagonal() Returns the main diagonal of the matrix

dot(other) Ordinary dot product ..

getH()

get_shape()

getcol(j) Returns a copy of column j of the matrix, as an (m x 1) sparse

getformat()

getmaxprint()

getnnz()

getrow(i) Returns a copy of the ‘i’th row.

getrowview(i) Returns a view of the ‘i’th row (without copying).

mean([axis]) Average the matrix over the given axis.

multiply(other) Point-wise multiplication by another matrix

nonzero() nonzero indices

reshape(shape)

set_shape(shape)

setdiag(values[, k]) Fills the diagonal elements {a_ii} with the values from the given sequence.

sum([axis]) Sum the matrix over the given axis.

toarray([order, out]) See the docstring for spmatrix.toarray.

tobsr([blocksize])

tocoo()

tocsc() Return Compressed Sparse Column format arrays for this matrix.

tocsr() Return Compressed Sparse Row format arrays for this matrix.

todense([order, out]) Return a dense matrix representation of this matrix.

todia()

todok()

tolil([copy])

transpose()

lil_matrix.asformat(format)

Return this matrix in a given sparse format

Parameters format : {string, None}

desired sparse matrix format

•None for no format conversion

•“csr” for csr_matrix format

5.23. Sparse matrices (scipy.sparse) 733

SciPy Reference Guide, Release 0.13.0

•“csc” for csc_matrix format

•“lil” for lil_matrix format

•“dok” for dok_matrix format and so on

lil_matrix.asfptype()

Upcast matrix to a ﬂoating point format (if necessary)

lil_matrix.astype(t)

lil_matrix.conj()

lil_matrix.conjugate()

lil_matrix.copy()

lil_matrix.diagonal()

Returns the main diagonal of the matrix

lil_matrix.dot(other)

Ordinary dot product

Examples

>>> import numpy as np

>>> from scipy.sparse import csr_matrix

>>> A=csr_matrix([[1,2,0], [0,0,3], [4,0,5]])

>>> v=np.array([1,0,-1])

>>> A.dot(v)

array([ 1, -3, -1], dtype=int64)

lil_matrix.getH()

lil_matrix.get_shape()

lil_matrix.getcol(j)

Returns a copy of column j of the matrix, as an (m x 1) sparse matrix (column vector).

lil_matrix.getformat()

lil_matrix.getmaxprint()

lil_matrix.getnnz()

lil_matrix.getrow(i)

Returns a copy of the ‘i’th row.

lil_matrix.getrowview(i)

Returns a view of the ‘i’th row (without copying).

lil_matrix.mean(axis=None)

Average the matrix over the given axis. If the axis is None, average over both rows and columns, returning

a scalar.

734 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

lil_matrix.multiply(other)

Point-wise multiplication by another matrix

lil_matrix.nonzero()

nonzero indices

Returns a tuple of arrays (row,col) containing the indices of the non-zero elements of the matrix.

Examples

>>> from scipy.sparse import csr_matrix

>>> A=csr_matrix([[1,2,0],[0,0,3],[4,0,5]])

>>> A.nonzero()

(array([0, 0, 1, 2, 2]), array([0, 1, 2, 0, 2]))

lil_matrix.reshape(shape)

lil_matrix.set_shape(shape)

lil_matrix.setdiag(values,k=0)

Fills the diagonal elements {a_ii} with the values from the given sequence. If k != 0, ﬁlls the off-diagonal

elements {a_{i,i+k}} instead.

values may have any length. If the diagonal is longer than values, then the remaining diagonal entries will

not be set. If values if longer than the diagonal, then the remaining values are ignored.

lil_matrix.sum(axis=None)

Sum the matrix over the given axis. If the axis is None, sum over both rows and columns, returning a

scalar.

lil_matrix.toarray(order=None,out=None)

See the docstring for spmatrix.toarray.

lil_matrix.tobsr(blocksize=None)

lil_matrix.tocoo()

lil_matrix.tocsc()

Return Compressed Sparse Column format arrays for this matrix.

lil_matrix.tocsr()

Return Compressed Sparse Row format arrays for this matrix.

lil_matrix.todense(order=None,out=None)

Return a dense matrix representation of this matrix.

Parameters order : {‘C’, ‘F’}, optional

Whether to store multi-dimensional data in C (row-major) or Fortran

(column-major) order in memory. The default is ‘None’, indicating

the NumPy default of C-ordered. Cannot be speciﬁed in conjunction

with the out argument.

out : ndarray, 2-dimensional, optional

If speciﬁed, uses this array (or numpy.matrix) as the output buffer

instead of allocating a new array to return. The provided array must

have the same shape and dtype as the sparse matrix on which you are

calling the method.

Returns arr : numpy.matrix, 2-dimensional

5.23. Sparse matrices (scipy.sparse) 735

SciPy Reference Guide, Release 0.13.0

A NumPy matrix object with the same shape and containing the

same data represented by the sparse matrix, with the requested

memory order. If out was passed and was an array (rather than a

numpy.matrix), it will be ﬁlled with the appropriate values and

returned wrapped in a numpy.matrix object that shares the same

memory.

lil_matrix.todia()

lil_matrix.todok()

lil_matrix.tolil(copy=False)

lil_matrix.transpose()

Functions

Building sparse matrices:

eye(m[, n, k, dtype, format]) Sparse matrix with ones on diagonal

identity(n[, dtype, format]) Identity matrix in sparse format

kron(A, B[, format]) kronecker product of sparse matrices A and B

kronsum(A, B[, format]) kronecker sum of sparse matrices A and B

diags(diagonals, offsets[, shape, format, dtype]) Construct a sparse matrix from diagonals.

spdiags(data, diags, m, n[, format]) Return a sparse matrix from diagonals.

block_diag(mats[, format, dtype]) Build a block diagonal sparse matrix from provided matrices.

tril(A[, k, format]) Return the lower triangular portion of a matrix in sparse format

triu(A[, k, format]) Return the upper triangular portion of a matrix in sparse format

bmat(blocks[, format, dtype]) Build a sparse matrix from sparse sub-blocks

hstack(blocks[, format, dtype]) Stack sparse matrices horizontally (column wise)

vstack(blocks[, format, dtype]) Stack sparse matrices vertically (row wise)

rand(m, n[, density, format, dtype, ...]) Generate a sparse matrix of the given shape and density with uniformely distributed values.

scipy.sparse.eye(m,n=None,k=0,dtype=<type ‘ﬂoat’>,format=None)

Sparse matrix with ones on diagonal

Returns a sparse (m x n) matrix where the k-th diagonal is all ones and everything else is zeros.

Parameters n: integer

Number of rows in the matrix.

m: integer, optional

Number of columns. Default: n

k: integer, optional

Diagonal to place ones on. Default: 0 (main diagonal)

dtype :

Data type of the matrix

format : string

Sparse format of the result, e.g. format=”csr”, etc.

736 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Examples

>>> from scipy import sparse

>>> sparse.eye(3).todense()

matrix([[ 1., 0., 0.],

[ 0., 1., 0.],

[ 0., 0., 1.]])

>>> sparse.eye(3, dtype=np.int8)

<3x3 sparse matrix of type ’<type ’numpy.int8’>’

with 3 stored elements (1 diagonals) in DIAgonal format>

scipy.sparse.identity(n,dtype=’d’,format=None)

Identity matrix in sparse format

Returns an identity matrix with shape (n,n) using a given sparse format and dtype.

Parameters n: integer

Shape of the identity matrix.

dtype :

Data type of the matrix

format : string

Sparse format of the result, e.g. format=”csr”, etc.

Examples

>>> identity(3).todense()

matrix([[ 1., 0., 0.],

[ 0., 1., 0.],

[ 0., 0., 1.]])

>>> identity(3, dtype=’int8’, format=’dia’)

<3x3 sparse matrix of type ’<type ’numpy.int8’>’

with 3 stored elements (1 diagonals) in DIAgonal format>

scipy.sparse.kron(A,B,format=None)

kronecker product of sparse matrices A and B

Parameters A: sparse or dense matrix

ﬁrst matrix of the product

B: sparse or dense matrix

second matrix of the product

format : string

format of the result (e.g. “csr”)

Returns kronecker product in a sparse matrix format

Examples

>>> A=csr_matrix(array([[0,2],[5,0]]))

>>> B=csr_matrix(array([[1,2],[3,4]]))

>>> kron(A,B).todense()

matrix([[ 0, 0, 2, 4],

[ 0, 0, 6, 8],

[ 5, 10, 0, 0],

[15, 20, 0, 0]])

>>> kron(A,[[1,2],[3,4]]).todense()

matrix([[ 0, 0, 2, 4],

[ 0, 0, 6, 8],

5.23. Sparse matrices (scipy.sparse) 737

SciPy Reference Guide, Release 0.13.0

[ 5, 10, 0, 0],

[15, 20, 0, 0]])

scipy.sparse.kronsum(A,B,format=None)

kronecker sum of sparse matrices A and B

Kronecker sum of two sparse matrices is a sum of two Kronecker products kron(I_n,A) + kron(B,I_m) where

A has shape (m,m) and B has shape (n,n) and I_m and I_n are identity matrices of shape (m,m) and (n,n)

respectively.

Parameters A

square matrix

format : string

format of the result (e.g. “csr”)

Returns kronecker sum in a sparse matrix format

scipy.sparse.diags(diagonals,offsets,shape=None,format=None,dtype=None)

Construct a sparse matrix from diagonals. New in version 0.11.

Parameters diagonals : sequence of array_like

Sequence of arrays containing the matrix diagonals, corresponding to off-

sets.

offsets : sequence of int

Diagonals to set:

•k = 0 the main diagonal

•k > 0 the k-th upper diagonal

•k < 0 the k-th lower diagonal

shape : tuple of int, optional

Shape of the result. If omitted, a square matrix large enough to contain the

diagonals is returned.

format : {“dia”, “csr”, “csc”, “lil”, ...}, optional

Matrix format of the result. By default (format=None) an appropriate sparse

matrix format is returned. This choice is subject to change.

dtype : dtype, optional

Data type of the matrix.

See Also

spdiags construct matrix from diagonals

Notes

This function differs from spdiags in the way it handles off-diagonals.

The result from diags is the sparse equivalent of:

np.diag(diagonals[0], offsets[0])

+ ...

+np.diag(diagonals[k], offsets[k])

Repeated diagonal offsets are disallowed.

Examples

738 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

>>> diagonals =[[1,2,3,4], [1,2,3], [1,2]]

>>> diags(diagonals, [0,-1,2]).todense()

matrix([[1, 0, 1, 0],

[1, 2, 0, 2],

[0, 2, 3, 0],

[0, 0, 3, 4]])

Broadcasting of scalars is supported (but shape needs to be speciﬁed):

>>> diags([1,-2,1], [-1,0,1], shape=(4,4)).todense()

matrix([[-2., 1., 0., 0.],

[ 1., -2., 1., 0.],

[ 0., 1., -2., 1.],

[ 0., 0., 1., -2.]])

If only one diagonal is wanted (as in numpy.diag), the following works as well:

>>> diags([1,2,3], 1).todense()

matrix([[ 0., 1., 0., 0.],

[ 0., 0., 2., 0.],

[ 0., 0., 0., 3.],

[ 0., 0., 0., 0.]])

scipy.sparse.spdiags(data,diags,m,n,format=None)

Return a sparse matrix from diagonals.

Parameters data : array_like

matrix diagonals stored row-wise

diags : diagonals to set

•k = 0 the main diagonal

•k > 0 the k-th upper diagonal

•k < 0 the k-th lower diagonal

m, n : int

shape of the result

format : format of the result (e.g. “csr”)

By default (format=None) an appropriate sparse matrix format is returned.

This choice is subject to change.

See Also

diags more convenient form of this function

dia_matrixthe sparse DIAgonal format.

Examples

>>> data =array([[1,2,3,4],[1,2,3,4],[1,2,3,4]])

>>> diags =array([0,-1,2])

>>> spdiags(data, diags, 4,4).todense()

matrix([[1, 0, 3, 0],

[1, 2, 0, 4],

[0, 2, 3, 0],

[0, 0, 3, 4]])

scipy.sparse.block_diag(mats,format=None,dtype=None)

Build a block diagonal sparse matrix from provided matrices. New in version 0.11.0.

Parameters A, B, ... : sequence of matrices

Input matrices.

format : str, optional

5.23. Sparse matrices (scipy.sparse) 739

SciPy Reference Guide, Release 0.13.0

The sparse format of the result (e.g. “csr”). If not given, the matrix is

returned in “coo” format.

dtype : dtype speciﬁer, optional

The data-type of the output matrix. If not given, the dtype is determined

from that of blocks.

Returns res : sparse matrix

See Also

bmat,diags

Examples

>>> A=coo_matrix([[1,2], [3,4]])

>>> B=coo_matrix([[5], [6]])

>>> C=coo_matrix([[7]])

>>> block_diag((A, B, C)).todense()

matrix([[1, 2, 0, 0],

[3, 4, 0, 0],

[0, 0, 5, 0],

[0, 0, 6, 0],

[0, 0, 0, 7]])

scipy.sparse.tril(A,k=0,format=None)

Return the lower triangular portion of a matrix in sparse format

Returns the elements on or below the k-th diagonal of the matrix A.

•k = 0 corresponds to the main diagonal

•k > 0 is above the main diagonal

•k < 0 is below the main diagonal

Parameters A: dense or sparse matrix

Matrix whose lower trianglar portion is desired.

k: integer

The top-most diagonal of the lower triangle.

format : string

Sparse format of the result, e.g. format=”csr”, etc.

Returns L: sparse matrix

Lower triangular portion of A in sparse format.

See Also

triu upper triangle in sparse format

Examples

>>> from scipy.sparse import csr_matrix

>>> A=csr_matrix( [[1,2,0,0,3],[4,5,0,6,7],[0,0,8,9,0]], dtype=’int32’ )

>>> A.todense()

matrix([[1, 2, 0, 0, 3],

[4, 5, 0, 6, 7],

[0, 0, 8, 9, 0]])

>>> tril(A).todense()

matrix([[1, 0, 0, 0, 0],

[4, 5, 0, 0, 0],

[0, 0, 8, 0, 0]])

>>> tril(A).nnz

740 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

>>> tril(A, k=1).todense()

matrix([[1, 2, 0, 0, 0],

[4, 5, 0, 0, 0],

[0, 0, 8, 9, 0]])

>>> tril(A, k=-1).todense()

matrix([[0, 0, 0, 0, 0],

[4, 0, 0, 0, 0],

[0, 0, 0, 0, 0]])

>>> tril(A, format=’csc’)

<3x5 sparse matrix of type ’<type ’numpy.int32’>’

with 4 stored elements in Compressed Sparse Column format>

scipy.sparse.triu(A,k=0,format=None)

Return the upper triangular portion of a matrix in sparse format

Returns the elements on or above the k-th diagonal of the matrix A.

•k = 0 corresponds to the main diagonal

•k > 0 is above the main diagonal

•k < 0 is below the main diagonal

Parameters A: dense or sparse matrix

Matrix whose upper trianglar portion is desired.

k: integer

The bottom-most diagonal of the upper triangle.

format : string

Sparse format of the result, e.g. format=”csr”, etc.

Returns L: sparse matrix

Upper triangular portion of A in sparse format.

See Also

tril lower triangle in sparse format

Examples

>>> from scipy.sparse import csr_matrix

>>> A=csr_matrix( [[1,2,0,0,3],[4,5,0,6,7],[0,0,8,9,0]], dtype=’int32’ )

>>> A.todense()

matrix([[1, 2, 0, 0, 3],

[4, 5, 0, 6, 7],

[0, 0, 8, 9, 0]])

>>> triu(A).todense()

matrix([[1, 2, 0, 0, 3],

[0, 5, 0, 6, 7],

[0, 0, 8, 9, 0]])

>>> triu(A).nnz

>>> triu(A, k=1).todense()

matrix([[0, 2, 0, 0, 3],

[0, 0, 0, 6, 7],

[0, 0, 0, 9, 0]])

>>> triu(A, k=-1).todense()

matrix([[1, 2, 0, 0, 3],

[4, 5, 0, 6, 7],

[0, 0, 8, 9, 0]])

>>> triu(A, format=’csc’)

5.23. Sparse matrices (scipy.sparse) 741

SciPy Reference Guide, Release 0.13.0

<3x5 sparse matrix of type ’<type ’numpy.int32’>’

with 8 stored elements in Compressed Sparse Column format>

scipy.sparse.bmat(blocks,format=None,dtype=None)

Build a sparse matrix from sparse sub-blocks

Parameters blocks : array_like

Grid of sparse matrices with compatible shapes. An entry of None implies

an all-zero matrix.

format : {‘bsr’, ‘coo’, ‘csc’, ‘csr’, ‘dia’, ‘dok’, ‘lil’}, optional

The sparse format of the result (e.g. “csr”). If not given, the matrix is

returned in “coo” format.

dtype : dtype speciﬁer, optional

The data-type of the output matrix. If not given, the dtype is determined

from that of blocks.

Returns bmat : sparse matrix

A “coo” sparse matrix or type of sparse matrix identiﬁed by format.

See Also

block_diag,diags

Examples

>>> from scipy.sparse import coo_matrix, bmat

>>> A=coo_matrix([[1,2],[3,4]])

>>> B=coo_matrix([[5],[6]])

>>> C=coo_matrix([[7]])

>>> bmat( [[A,B],[None,C]] ).todense()

matrix([[1, 2, 5],

[3, 4, 6],

[0, 0, 7]])

>>> bmat( [[A,None],[None,C]] ).todense()

matrix([[1, 2, 0],

[3, 4, 0],

[0, 0, 7]])

scipy.sparse.hstack(blocks,format=None,dtype=None)

Stack sparse matrices horizontally (column wise)

Parameters blocks

sequence of sparse matrices with compatible shapes

format : string

sparse format of the result (e.g. “csr”) by default an appropriate sparse

matrix format is returned. This choice is subject to change.

See Also

vstack stack sparse matrices vertically (row wise)

Examples

>>> from scipy.sparse import coo_matrix, hstack

>>> A=coo_matrix([[1,2],[3,4]])

>>> B=coo_matrix([[5],[6]])

>>> hstack( [A,B] ).todense()

742 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

matrix([[1, 2, 5],

[3, 4, 6]])

scipy.sparse.vstack(blocks,format=None,dtype=None)

Stack sparse matrices vertically (row wise)

Parameters blocks

sequence of sparse matrices with compatible shapes

format : string

sparse format of the result (e.g. “csr”) by default an appropriate sparse

matrix format is returned. This choice is subject to change.

See Also

hstack stack sparse matrices horizontally (column wise)

Examples

>>> from scipy.sparse import coo_matrix, vstack

>>> A=coo_matrix([[1,2],[3,4]])

>>> B=coo_matrix([[5,6]])

>>> vstack( [A,B] ).todense()

matrix([[1, 2],

[3, 4],

[5, 6]])

scipy.sparse.rand(m,n,density=0.01,format=’coo’,dtype=None,random_state=None)

Generate a sparse matrix of the given shape and density with uniformely distributed values.

Parameters m, n : int

shape of the matrix

density : real

density of the generated matrix: density equal to one means a full matrix,

density of 0 means a matrix with no non-zero items.

format : str

sparse matrix format.

dtype : dtype

type of the returned matrix values.

random_state : {numpy.random.RandomState, int}, optional

Random number generator or random seed. If not given, the singleton

numpy.random will be used.

Notes

Only ﬂoat types are supported for now.

Identifying sparse matrices:

issparse(x)

isspmatrix(x)

isspmatrix_csc(x)

isspmatrix_csr(x)

isspmatrix_bsr(x)

isspmatrix_lil(x)

isspmatrix_dok(x)

isspmatrix_coo(x)

Continued on next page

5.23. Sparse matrices (scipy.sparse) 743

SciPy Reference Guide, Release 0.13.0

Table 5.133 – continued from previous page

isspmatrix_dia(x)

scipy.sparse.issparse(x)

scipy.sparse.isspmatrix(x)

scipy.sparse.isspmatrix_csc(x)

scipy.sparse.isspmatrix_csr(x)

scipy.sparse.isspmatrix_bsr(x)

scipy.sparse.isspmatrix_lil(x)

scipy.sparse.isspmatrix_dok(x)

scipy.sparse.isspmatrix_coo(x)

scipy.sparse.isspmatrix_dia(x)

Submodules

csgraph

linalg

Compressed Sparse Graph Routines (scipy.sparse.csgraph)Fast graph algorithms based on sparse matrix

representations.

connected_components(csgraph[, directed, ...]) Analyze the connected components of a sparse graph ..

laplacian(csgraph[, normed, return_diag]) Return the Laplacian matrix of a directed graph.

shortest_path(csgraph[, method, directed, ...]) Perform a shortest-path graph search on a positive directed or undirected graph.

dijkstra(csgraph[, directed, indices, ...]) Dijkstra algorithm using Fibonacci Heaps

floyd_warshall(csgraph[, directed, ...]) Compute the shortest path lengths using the Floyd-Warshall algorithm

bellman_ford(csgraph[, directed, indices, ...]) Compute the shortest path lengths using the Bellman-Ford algorithm.

johnson(csgraph[, directed, indices, ...]) Compute the shortest path lengths using Johnson’s algorithm.

breadth_first_order(csgraph, i_start[, ...]) Return a breadth-ﬁrst ordering starting with speciﬁed node.

depth_first_order(csgraph, i_start[, ...]) Return a depth-ﬁrst ordering starting with speciﬁed node.

breadth_first_tree(csgraph, i_start[, directed]) Return the tree generated by a breadth-ﬁrst search

depth_first_tree(csgraph, i_start[, directed]) Return a tree generated by a depth-ﬁrst search.

minimum_spanning_tree(csgraph[, overwrite]) Return a minimum spanning tree of an undirected graph

Contents

744 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

scipy.sparse.csgraph.connected_components(csgraph,directed=True,connection=’weak’,re-

turn_labels=True)

Analyze the connected components of a sparse graph New in version 0.11.0.

Parameters csgraph : array_like or sparse matrix

The N x N matrix representing the compressed sparse graph. The input

csgraph will be converted to csr format for the calculation.

directed : bool, optional

If True (default), then operate on a directed graph: only move from point

i to point j along paths csgraph[i, j]. If False, then ﬁnd the shortest path

on an undirected graph: the algorithm can progress from point i to j along

csgraph[i, j] or csgraph[j, i].

connection : str, optional

[’weak’|’strong’]. For directed graphs, the type of connection to use. Nodes

i and j are strongly connected if a path exists both from i to j and from j to

i. Nodes i and j are weakly connected if only one of these paths exists. If

directed == False, this keyword is not referenced.

return_labels : str, optional

If True (default), then return the labels for each of the connected compo-

nents.

Returns n_components: int

The number of connected components.

labels: ndarray

The length-N array of labels of the connected components.

References

[R12]

scipy.sparse.csgraph.laplacian(csgraph,normed=False,return_diag=False)

Return the Laplacian matrix of a directed graph.

For non-symmetric graphs the out-degree is used in the computation.

Parameters csgraph : array_like or sparse matrix, 2 dimensions

compressed-sparse graph, with shape (N, N).

normed : bool, optional

If True, then compute normalized Laplacian.

return_diag : bool, optional

If True, then return diagonal as well as laplacian.

Returns lap : ndarray

The N x N laplacian matrix of graph.

diag : ndarray

The length-N diagonal of the laplacian matrix. diag is returned only if re-

turn_diag is True.

Notes

The Laplacian matrix of a graph is sometimes referred to as the “Kirchoff matrix” or the “admittance matrix”,

and is useful in many parts of spectral graph theory. In particular, the eigen-decomposition of the laplacian

matrix can give insight into many properties of the graph.

For non-symmetric directed graphs, the laplacian is computed using the out-degree of each node.

Examples

>>> from scipy.sparse import csgraph

>>> G=np.arange(5)*np.arange(5)[:, np.newaxis]

>>> G

5.23. Sparse matrices (scipy.sparse) 745

SciPy Reference Guide, Release 0.13.0

array([[ 0, 0, 0, 0, 0],

[ 0, 1, 2, 3, 4],

[ 0, 2, 4, 6, 8],

[ 0, 3, 6, 9, 12],

[ 0, 4, 8, 12, 16]])

>>> csgraph.laplacian(G, normed=False)

array([[ 0, 0, 0, 0, 0],

[ 0, 9, -2, -3, -4],

[ 0, -2, 16, -6, -8],

[ 0, -3, -6, 21, -12],

[ 0, -4, -8, -12, 24]])

scipy.sparse.csgraph.shortest_path(csgraph,method=’auto’,directed=True,re-

turn_predecessors=False,unweighted=False,over-

write=False)

Perform a shortest-path graph search on a positive directed or undirected graph. New in version 0.11.0.

Parameters csgraph : array, matrix, or sparse matrix, 2 dimensions

The N x N array of distances representing the input graph.

method : string [’auto’|’FW’|’D’], optional

Algorithm to use for shortest paths. Options are:

‘auto’ – (default) select the best among ‘FW’, ‘D’, ‘BF’, or ‘J’

based on the input data.

‘FW’ – Floyd-Warshall algorithm. Computational cost is

approximately O[N^3]. The input csgraph

will be converted to a dense representation.

‘D’ – Dijkstra’s algorithm with Fibonacci heaps. Computational

cost is approximately O[N(N*k +

N*log(N))], where kis the average

number of connected edges per node. The

input csgraph will be converted to a csr

representation.

‘BF’ – Bellman-Ford algorithm. This algorithm can be used when

weights are negative. If a negative cycle is

encountered, an error will be raised. Com-

putational cost is approximately O[N(N^2

k)], where kis the average number of

connected edges per node. The input csgraph

will be converted to a csr representation.

‘J’ – Johnson’s algorithm. Like the Bellman-Ford algorithm,

Johnson’s algorithm is designed for use

when the weights are negative. It combines

the Bellman-Ford algorithm with Dijkstra’s

algorithm for faster computation.

directed : bool, optional

If True (default), then ﬁnd the shortest path on a directed graph: only move

from point i to point j along paths csgraph[i, j]. If False, then ﬁnd the

shortest path on an undirected graph: the algorithm can progress from point

i to j along csgraph[i, j] or csgraph[j, i]

return_predecessors : bool, optional

If True, return the size (N, N) predecesor matrix

746 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

unweighted : bool, optional

If True, then ﬁnd unweighted distances. That is, rather than ﬁnding the path

between each point such that the sum of weights is minimized, ﬁnd the path

such that the number of edges is minimized.

overwrite : bool, optional

If True, overwrite csgraph with the result. This applies only if method ==

‘FW’ and csgraph is a dense, c-ordered array with dtype=ﬂoat64.

Returns dist_matrix : ndarray

The N x N matrix of distances between graph nodes. dist_matrix[i,j] gives

the shortest distance from point i to point j along the graph.

predecessors : ndarray

Returned only if return_predecessors == True. The N x N matrix of prede-

cessors, which can be used to reconstruct the shortest paths. Row i of the

predecessor matrix contains information on the shortest paths from point

i: each entry predecessors[i, j] gives the index of the previous node in the

path from point i to point j. If no path exists between point i and j, then

predecessors[i, j] = -9999

Raises NegativeCycleError:

if there are negative cycles in the graph

Notes

As currently implemented, Dijkstra’s algorithm and Johnson’s algorithm do not work for graphs with direction-

dependent distances when directed == False. i.e., if csgraph[i,j] and csgraph[j,i] are non-equal edges,

method=’D’ may yield an incorrect result.

scipy.sparse.csgraph.dijkstra(csgraph,directed=True,indices=None,re-

turn_predecessors=False,unweighted=False)

Dijkstra algorithm using Fibonacci Heaps New in version 0.11.0.

Parameters csgraph : array, matrix, or sparse matrix, 2 dimensions

The N x N array of non-negative distances representing the input graph.

directed : bool, optional

If True (default), then ﬁnd the shortest path on a directed graph: only move

from point i to point j along paths csgraph[i, j]. If False, then ﬁnd the

shortest path on an undirected graph: the algorithm can progress from point

i to j along csgraph[i, j] or csgraph[j, i]

indices : array_like or int, optional

if speciﬁed, only compute the paths for the points at the given indices.

return_predecessors : bool, optional

If True, return the size (N, N) predecesor matrix

unweighted : bool, optional

If True, then ﬁnd unweighted distances. That is, rather than ﬁnding the path

between each point such that the sum of weights is minimized, ﬁnd the path

such that the number of edges is minimized.

Returns dist_matrix : ndarray

The matrix of distances between graph nodes. dist_matrix[i,j] gives the

shortest distance from point i to point j along the graph.

predecessors : ndarray

Returned only if return_predecessors == True. The matrix of predecessors,

which can be used to reconstruct the shortest paths. Row i of the prede-

cessor matrix contains information on the shortest paths from point i: each

entry predecessors[i, j] gives the index of the previous node in the path from

point i to point j. If no path exists between point i and j, then predecessors[i,

j] = -9999

5.23. Sparse matrices (scipy.sparse) 747

SciPy Reference Guide, Release 0.13.0

Notes

As currently implemented, Dijkstra’s algorithm does not work for graphs with direction-dependent distances

when directed == False. i.e., if csgraph[i,j] and csgraph[j,i] are not equal and both are nonzero, setting di-

rected=False will not yield the correct result.

Also, this routine does not work for graphs with negative distances. Negative distances can lead to inﬁnite cycles

that must be handled by specialized algorithms such as Bellman-Ford’s algorithm or Johnson’s algorithm.

scipy.sparse.csgraph.floyd_warshall(csgraph,directed=True,return_predecessors=False,un-

weighted=False,overwrite=False)

Compute the shortest path lengths using the Floyd-Warshall algorithm New in version 0.11.0.

Parameters csgraph : array, matrix, or sparse matrix, 2 dimensions

The N x N array of distances representing the input graph.

directed : bool, optional

If True (default), then ﬁnd the shortest path on a directed graph: only move

from point i to point j along paths csgraph[i, j]. If False, then ﬁnd the

shortest path on an undirected graph: the algorithm can progress from point

i to j along csgraph[i, j] or csgraph[j, i]

return_predecessors : bool, optional

If True, return the size (N, N) predecesor matrix

unweighted : bool, optional

If True, then ﬁnd unweighted distances. That is, rather than ﬁnding the path

between each point such that the sum of weights is minimized, ﬁnd the path

such that the number of edges is minimized.

overwrite : bool, optional

If True, overwrite csgraph with the result. This applies only if csgraph is a

dense, c-ordered array with dtype=ﬂoat64.

Returns dist_matrix : ndarray

The N x N matrix of distances between graph nodes. dist_matrix[i,j] gives

the shortest distance from point i to point j along the graph.

predecessors : ndarray

Returned only if return_predecessors == True. The N x N matrix of prede-

cessors, which can be used to reconstruct the shortest paths. Row i of the

predecessor matrix contains information on the shortest paths from point

i: each entry predecessors[i, j] gives the index of the previous node in the

path from point i to point j. If no path exists between point i and j, then

predecessors[i, j] = -9999

Raises NegativeCycleError:

if there are negative cycles in the graph

scipy.sparse.csgraph.bellman_ford(csgraph,directed=True,indices=None,re-

turn_predecessors=False,unweighted=False)

Compute the shortest path lengths using the Bellman-Ford algorithm.

The Bellman-ford algorithm can robustly deal with graphs with negative weights. If a negative cycle is detected,

an error is raised. For graphs without negative edge weights, dijkstra’s algorithm may be faster. New in version

0.11.0.

Parameters csgraph : array, matrix, or sparse matrix, 2 dimensions

The N x N array of distances representing the input graph.

directed : bool, optional

If True (default), then ﬁnd the shortest path on a directed graph: only move

from point i to point j along paths csgraph[i, j]. If False, then ﬁnd the

shortest path on an undirected graph: the algorithm can progress from point

i to j along csgraph[i, j] or csgraph[j, i]

indices : array_like or int, optional

if speciﬁed, only compute the paths for the points at the given indices.

748 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

return_predecessors : bool, optional

If True, return the size (N, N) predecesor matrix

unweighted : bool, optional

If True, then ﬁnd unweighted distances. That is, rather than ﬁnding the path

between each point such that the sum of weights is minimized, ﬁnd the path

such that the number of edges is minimized.

Returns dist_matrix : ndarray

The N x N matrix of distances between graph nodes. dist_matrix[i,j] gives

the shortest distance from point i to point j along the graph.

predecessors : ndarray

Returned only if return_predecessors == True. The N x N matrix of prede-

cessors, which can be used to reconstruct the shortest paths. Row i of the

predecessor matrix contains information on the shortest paths from point

i: each entry predecessors[i, j] gives the index of the previous node in the

path from point i to point j. If no path exists between point i and j, then

predecessors[i, j] = -9999

Raises NegativeCycleError:

if there are negative cycles in the graph

Notes

This routine is specially designed for graphs with negative edge weights. If all edge weights are positive, then

Dijkstra’s algorithm is a better choice.

scipy.sparse.csgraph.johnson(csgraph,directed=True,indices=None,re-

turn_predecessors=False,unweighted=False)

Compute the shortest path lengths using Johnson’s algorithm.

Johnson’s algorithm combines the Bellman-Ford algorithm and Dijkstra’s algorithm to quickly ﬁnd shortest

paths in a way that is robust to the presence of negative cycles. If a negative cycle is detected, an error is raised.

For graphs without negative edge weights, dijkstra() may be faster. New in version 0.11.0.

Parameters csgraph : array, matrix, or sparse matrix, 2 dimensions

The N x N array of distances representing the input graph.

directed : bool, optional

If True (default), then ﬁnd the shortest path on a directed graph: only move

from point i to point j along paths csgraph[i, j]. If False, then ﬁnd the

shortest path on an undirected graph: the algorithm can progress from point

i to j along csgraph[i, j] or csgraph[j, i]

indices : array_like or int, optional

if speciﬁed, only compute the paths for the points at the given indices.

return_predecessors : bool, optional

If True, return the size (N, N) predecesor matrix

unweighted : bool, optional

If True, then ﬁnd unweighted distances. That is, rather than ﬁnding the path

between each point such that the sum of weights is minimized, ﬁnd the path

such that the number of edges is minimized.

Returns dist_matrix : ndarray

The N x N matrix of distances between graph nodes. dist_matrix[i,j] gives

the shortest distance from point i to point j along the graph.

predecessors : ndarray

Returned only if return_predecessors == True. The N x N matrix of prede-

cessors, which can be used to reconstruct the shortest paths. Row i of the

predecessor matrix contains information on the shortest paths from point

i: each entry predecessors[i, j] gives the index of the previous node in the

path from point i to point j. If no path exists between point i and j, then

predecessors[i, j] = -9999

Raises NegativeCycleError:

5.23. Sparse matrices (scipy.sparse) 749

SciPy Reference Guide, Release 0.13.0

if there are negative cycles in the graph

Notes

This routine is specially designed for graphs with negative edge weights. If all edge weights are positive, then

Dijkstra’s algorithm is a better choice.

scipy.sparse.csgraph.breadth_first_order(csgraph,i_start,directed=True,re-

turn_predecessors=True)

Return a breadth-ﬁrst ordering starting with speciﬁed node.

Note that a breadth-ﬁrst order is not unique, but the tree which it generates is unique. New in version 0.11.0.

Parameters csgraph : array_like or sparse matrix

The N x N compressed sparse graph. The input csgraph will be converted

to csr format for the calculation.

i_start : int

The index of starting node.

directed : bool, optional

If True (default), then operate on a directed graph: only move from point

i to point j along paths csgraph[i, j]. If False, then ﬁnd the shortest path

on an undirected graph: the algorithm can progress from point i to j along

csgraph[i, j] or csgraph[j, i].

return_predecessors : bool, optional

If True (default), then return the predecesor array (see below).

Returns node_array : ndarray, one dimension

The breadth-ﬁrst list of nodes, starting with speciﬁed node. The length of

node_array is the number of nodes reachable from the speciﬁed node.

predecessors : ndarray, one dimension

Returned only if return_predecessors is True. The length-N list of prede-

cessors of each node in a breadth-ﬁrst tree. If node i is in the tree, then its

parent is given by predecessors[i]. If node i is not in the tree (and for the

parent node) then predecessors[i] = -9999.

scipy.sparse.csgraph.depth_first_order(csgraph,i_start,directed=True,re-

turn_predecessors=True)

Return a depth-ﬁrst ordering starting with speciﬁed node.

Note that a depth-ﬁrst order is not unique. Furthermore, for graphs with cycles, the tree generated by a depth-ﬁrst

search is not unique either. New in version 0.11.0.

Parameters csgraph : array_like or sparse matrix

The N x N compressed sparse graph. The input csgraph will be converted

to csr format for the calculation.

i_start : int

The index of starting node.

directed : bool, optional

If True (default), then operate on a directed graph: only move from point

i to point j along paths csgraph[i, j]. If False, then ﬁnd the shortest path

on an undirected graph: the algorithm can progress from point i to j along

csgraph[i, j] or csgraph[j, i].

return_predecessors : bool, optional

If True (default), then return the predecesor array (see below).

Returns node_array : ndarray, one dimension

The breadth-ﬁrst list of nodes, starting with speciﬁed node. The length of

node_array is the number of nodes reachable from the speciﬁed node.

predecessors : ndarray, one dimension

Returned only if return_predecessors is True. The length-N list of prede-

cessors of each node in a breadth-ﬁrst tree. If node i is in the tree, then its

750 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

parent is given by predecessors[i]. If node i is not in the tree (and for the

parent node) then predecessors[i] = -9999.

scipy.sparse.csgraph.breadth_first_tree(csgraph,i_start,directed=True)

Return the tree generated by a breadth-ﬁrst search

Note that a breadth-ﬁrst tree from a speciﬁed node is unique. New in version 0.11.0.

Parameters csgraph : array_like or sparse matrix

The N x N matrix representing the compressed sparse graph. The input

csgraph will be converted to csr format for the calculation.

i_start : int

The index of starting node.

directed : bool, optional

If True (default), then operate on a directed graph: only move from point

i to point j along paths csgraph[i, j]. If False, then ﬁnd the shortest path

on an undirected graph: the algorithm can progress from point i to j along

csgraph[i, j] or csgraph[j, i].

Returns cstree : csr matrix

The N x N directed compressed-sparse representation of the breadth- ﬁrst

tree drawn from csgraph, starting at the speciﬁed node.

Examples

The following example shows the computation of a depth-ﬁrst tree over a simple four-component graph, starting

at node 0:

input graph breadth first tree from (0)

(0) (0)

/ \ / \

3 8 3 8

/ \ / \

(3)---5---(1) (3) (1)

\ / /

6 2 2

\ / /

(2) (2)

In compressed sparse representation, the solution looks like this:

>>> from scipy.sparse import csr_matrix

>>> from scipy.sparse.csgraph import breadth_first_tree

>>> X=csr_matrix([[0,8,0,3],

... [0,0,2,5],

... [0,0,0,6],

... [0,0,0,0]])

>>> Tcsr =breadth_first_tree(X, 0, directed=False)

>>> Tcsr.toarray().astype(int)

array([[0, 8, 0, 3],

[0, 0, 2, 0],

[0, 0, 0, 0],

[0, 0, 0, 0]])

Note that the resulting graph is a Directed Acyclic Graph which spans the graph. A breadth-ﬁrst tree from a

given node is unique.

scipy.sparse.csgraph.depth_first_tree(csgraph,i_start,directed=True)

Return a tree generated by a depth-ﬁrst search.

5.23. Sparse matrices (scipy.sparse) 751

SciPy Reference Guide, Release 0.13.0

Note that a tree generated by a depth-ﬁrst search is not unique: it depends on the order that the children of each

node are searched. New in version 0.11.0.

Parameters csgraph : array_like or sparse matrix

The N x N matrix representing the compressed sparse graph. The input

csgraph will be converted to csr format for the calculation.

i_start : int

The index of starting node.

directed : bool, optional

If True (default), then operate on a directed graph: only move from point

i to point j along paths csgraph[i, j]. If False, then ﬁnd the shortest path

on an undirected graph: the algorithm can progress from point i to j along

csgraph[i, j] or csgraph[j, i].

Returns cstree : csr matrix

The N x N directed compressed-sparse representation of the depth- ﬁrst tree

drawn from csgraph, starting at the speciﬁed node.

Examples

The following example shows the computation of a depth-ﬁrst tree over a simple four-component graph, starting

at node 0:

input graph depth first tree from (0)

(0) (0)

/ \ \

3 8 8

/ \ \

(3)---5---(1) (3) (1)

\ / \ /

6 2 6 2

\ / \ /

(2) (2)

In compressed sparse representation, the solution looks like this:

>>> from scipy.sparse import csr_matrix

>>> from scipy.sparse.csgraph import depth_first_tree

>>> X=csr_matrix([[0,8,0,3],

... [0,0,2,5],

... [0,0,0,6],

... [0,0,0,0]])

>>> Tcsr =depth_first_tree(X, 0, directed=False)

>>> Tcsr.toarray().astype(int)

array([[0, 8, 0, 0],

[0, 0, 2, 0],

[0, 0, 0, 6],

[0, 0, 0, 0]])

Note that the resulting graph is a Directed Acyclic Graph which spans the graph. Unlike a breadth-ﬁrst tree, a

depth-ﬁrst tree of a given graph is not unique if the graph contains cycles. If the above solution had begun with

the edge connecting nodes 0 and 3, the result would have been different.

scipy.sparse.csgraph.minimum_spanning_tree(csgraph,overwrite=False)

Return a minimum spanning tree of an undirected graph

A minimum spanning tree is a graph consisting of the subset of edges which together connect all connected

nodes, while minimizing the total sum of weights on the edges. This is computed using the Kruskal algorithm.

New in version 0.11.0.

752 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Parameters csgraph : array_like or sparse matrix, 2 dimensions

The N x N matrix representing an undirected graph over N nodes (see notes

below).

overwrite : bool, optional

if true, then parts of the input graph will be overwritten for efﬁciency.

Returns span_tree : csr matrix

The N x N compressed-sparse representation of the undirected minimum

spanning tree over the input (see notes below).

Notes

This routine uses undirected graphs as input and output. That is, if graph[i, j] and graph[j, i] are both zero,

then nodes i and j do not have an edge connecting them. If either is nonzero, then the two are connected by the

minimum nonzero value of the two.

Examples

The following example shows the computation of a minimum spanning tree over a simple four-component

graph:

input graph minimum spanning tree

(0) (0)

/ \ /

3 8 3

/ \ /

(3)---5---(1) (3)---5---(1)

\ / /

6 2 2

\ / /

(2) (2)

It is easy to see from inspection that the minimum spanning tree involves removing the edges with weights 8

and 6. In compressed sparse representation, the solution looks like this:

>>> from scipy.sparse import csr_matrix

>>> from scipy.sparse.csgraph import minimum_spanning_tree

>>> X=csr_matrix([[0,8,0,3],

... [0,0,2,5],

... [0,0,0,6],

... [0,0,0,0]])

>>> Tcsr =minimum_spanning_tree(X)

>>> Tcsr.toarray().astype(int)

array([[0, 0, 0, 3],

[0, 0, 2, 5],

[0, 0, 0, 0],

[0, 0, 0, 0]])

Graph Representations This module uses graphs which are stored in a matrix format. A graph with N nodes can

be represented by an (N x N) adjacency matrix G. If there is a connection from node i to node j, then G[i, j] = w, where

w is the weight of the connection. For nodes i and j which are not connected, the value depends on the representation:

• for dense array representations, non-edges are represented by G[i, j] = 0, inﬁnity, or NaN.

• for dense masked representations (of type np.ma.MaskedArray), non-edges are represented by masked values.

This can be useful when graphs with zero-weight edges are desired.

5.23. Sparse matrices (scipy.sparse) 753

SciPy Reference Guide, Release 0.13.0

• for sparse array representations, non-edges are represented by non-entries in the matrix. This sort of sparse

representation also allows for edges with zero weights.

As a concrete example, imagine that you would like to represent the following undirected graph:

(0)

/ \

1 2

/ \

(2) (1)

This graph has three nodes, where node 0 and 1 are connected by an edge of weight 2, and nodes 0 and 2 are connected

by an edge of weight 1. We can construct the dense, masked, and sparse representations as follows, keeping in mind

that an undirected graph is represented by a symmetric matrix:

>>> G_dense =np.array([[0,2,1],

... [2,0,0],

... [1,0,0]])

>>> G_masked =np.ma.masked_values(G_dense, 0)

>>> from scipy.sparse import csr_matrix

>>> G_sparse =csr_matrix(G_dense)

This becomes more difﬁcult when zero edges are signiﬁcant. For example, consider the situation when we slightly

modify the above graph:

(0)

/ \

0 2

/ \

(2) (1)

This is identical to the previous graph, except nodes 0 and 2 are connected by an edge of zero weight. In this case, the

dense representation above leads to ambiguities: how can non-edges be represented if zero is a meaningful value? In

this case, either a masked or sparse representation must be used to eliminate the ambiguity:

>>> G2_data =np.array([[np.inf, 2,0],

... [2, np.inf, np.inf],

... [0, np.inf, np.inf]])

>>> G2_masked =np.ma.masked_invalid(G2_data)

>>> from scipy.sparse.csgraph import csgraph_from_dense

>>> # G2_sparse = csr_matrix(G2_data) would give the wrong result

>>> G2_sparse =csgraph_from_dense(G2_data, null_value=np.inf)

>>> G2_sparse.data

array([ 2., 0., 2., 0.])

Here we have used a utility routine from the csgraph submodule in order to convert the dense representation to a sparse

representation which can be understood by the algorithms in submodule. By viewing the data array, we can see that

the zero values are explicitly encoded in the graph.

Directed vs. Undirected Matrices may represent either directed or undirected graphs. This is speciﬁed throughout

the csgraph module by a boolean keyword. Graphs are assumed to be directed by default. In a directed graph, traversal

from node i to node j can be accomplished over the edge G[i, j], but not the edge G[j, i]. In a non-directed graph,

754 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

traversal from node i to node j can be accomplished over either G[i, j] or G[j, i]. If both edges are not null, and the two

have unequal weights, then the smaller of the two is used. Note that a symmetric matrix will represent an undirected

graph, regardless of whether the ‘directed’ keyword is set to True or False. In this case, using directed=True

generally leads to more efﬁcient computation.

The routines in this module accept as input either scipy.sparse representations (csr, csc, or lil format), masked repre-

sentations, or dense representations with non-edges indicated by zeros, inﬁnities, and NaN entries.

Functions

bellman_ford(csgraph[, directed, indices, ...]) Compute the shortest path lengths using the Bellman-Ford algorithm.

breadth_first_order(csgraph, i_start[, ...]) Return a breadth-ﬁrst ordering starting with speciﬁed node.

breadth_first_tree(csgraph, i_start[, directed]) Return the tree generated by a breadth-ﬁrst search

connected_components(csgraph[, directed, ...]) Analyze the connected components of a sparse graph ..

construct_dist_matrix(graph, predecessors[, ...]) Construct distance matrix from a predecessor matrix ..

cs_graph_components(*args, **kwds) cs_graph_components is deprecated!

csgraph_from_dense(graph[, null_value, ...]) Construct a CSR-format sparse graph from a dense matrix.

csgraph_from_masked(graph) Construct a CSR-format graph from a masked array.

csgraph_masked_from_dense(graph[, ...]) Construct a masked array graph representation from a dense matrix.

csgraph_to_dense(csgraph[, null_value]) Convert a sparse graph representation to a dense representation ..

depth_first_order(csgraph, i_start[, ...]) Return a depth-ﬁrst ordering starting with speciﬁed node.

depth_first_tree(csgraph, i_start[, directed]) Return a tree generated by a depth-ﬁrst search.

dijkstra(csgraph[, directed, indices, ...]) Dijkstra algorithm using Fibonacci Heaps

floyd_warshall(csgraph[, directed, ...]) Compute the shortest path lengths using the Floyd-Warshall algorithm

johnson(csgraph[, directed, indices, ...]) Compute the shortest path lengths using Johnson’s algorithm.

laplacian(csgraph[, normed, return_diag]) Return the Laplacian matrix of a directed graph.

minimum_spanning_tree(csgraph[, overwrite]) Return a minimum spanning tree of an undirected graph

reconstruct_path(csgraph, predecessors[, ...]) Construct a tree from a graph and a predecessor list.

shortest_path(csgraph[, method, directed, ...]) Perform a shortest-path graph search on a positive directed or undirected graph.

Classes

Tester Nose test runner.

Exceptions

NegativeCycleError

Sparse linear algebra (scipy.sparse.linalg)

LinearOperator(shape, matvec[, rmatvec, ...]) Common interface for performing matrix vector products

aslinearoperator(A) Return A as a LinearOperator.

Abstract linear operators

class scipy.sparse.linalg.LinearOperator(shape,matvec,rmatvec=None,matmat=None,

dtype=None)

Common interface for performing matrix vector products

Many iterative methods (e.g. cg, gmres) do not need to know the individual entries of a matrix to solve a linear

system A*x=b. Such solvers only require the computation of matrix vector products, A*v where v is a dense

5.23. Sparse matrices (scipy.sparse) 755

SciPy Reference Guide, Release 0.13.0

vector. This class serves as an abstract interface between iterative solvers and matrix-like objects.

Parameters shape : tuple

Matrix dimensions (M,N)

matvec : callable f(v)

Returns returns A * v.

Other Parameters

rmatvec : callable f(v)

Returns A^H * v, where A^H is the conjugate transpose of A.

matmat : callable f(V)

Returns A * V, where V is a dense matrix with dimensions (N,K).

dtype : dtype

Data type of the matrix.

See Also

aslinearoperator

Construct LinearOperators

Notes

The user-deﬁned matvec() function must properly handle the case where v has shape (N,) as well as the (N,1)

case. The shape of the return type is handled internally by LinearOperator.

LinearOperator instances can also be multiplied, added with each other and exponentiated, to produce a new

linear operator.

Examples

>>> from scipy.sparse.linalg import LinearOperator

>>> from scipy import *

>>> def mv(v):

... return array([ 2*v[0], 3*v[1]])

...

>>> A=LinearOperator( (2,2), matvec=mv )

>>> A

<2x2 LinearOperator with unspecified dtype>

>>> A.matvec( ones(2) )

array([ 2., 3.])

>>> A*ones(2)

array([ 2., 3.])

Attributes

args (tuple) For linear operators describing products etc. of other linear operators, the operands of the

binary operation.

Methods

__call__(x)

dot(other)

matmat(X) Matrix-matrix multiplication

matvec(x) Matrix-vector multiplication

LinearOperator.__call__(x)

756 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

LinearOperator.dot(other)

LinearOperator.matmat(X)

Matrix-matrix multiplication

Performs the operation y=A*X where A is an MxN linear operator and X dense N*K matrix or ndarray.

Parameters X: {matrix, ndarray}

An array with shape (N,K).

Returns Y: {matrix, ndarray}

A matrix or ndarray with shape (M,K) depending on the type of the

X argument.

Notes

This matmat wraps any user-speciﬁed matmat routine to ensure that y has the correct type.

LinearOperator.matvec(x)

Matrix-vector multiplication

Performs the operation y=A*x where A is an MxN linear operator and x is a column vector or rank-1

array.

Parameters x: {matrix, ndarray}

An array with shape (N,) or (N,1).

Returns y: {matrix, ndarray}

A matrix or ndarray with shape (M,) or (M,1) depending on the type

and shape of the x argument.

Notes

This matvec wraps the user-speciﬁed matvec routine to ensure that y has the correct shape and type.

scipy.sparse.linalg.aslinearoperator(A)

Return A as a LinearOperator.

‘A’ may be any of the following types:

•ndarray

•matrix

•sparse matrix (e.g. csr_matrix, lil_matrix, etc.)

•LinearOperator

•An object with .shape and .matvec attributes

See the LinearOperator documentation for additonal information.

Examples

>>> from scipy import matrix

>>> M=matrix( [[1,2,3],[4,5,6]], dtype=’int32’ )

>>> aslinearoperator( M )

<2x3 LinearOperator with dtype=int32>

inv(A) Compute the inverse of a sparse matrix ..

expm(A) Compute the matrix exponential using Pade approximation.

expm_multiply(A, B[, start, stop, num, endpoint]) Compute the action of the matrix exponential of A on B.

Matrix Operations

scipy.sparse.linalg.inv(A)

5.23. Sparse matrices (scipy.sparse) 757

SciPy Reference Guide, Release 0.13.0

Compute the inverse of a sparse matrix New in version 0.12.0.

Parameters A: (M,M) ndarray or sparse matrix

square matrix to be inverted

Returns Ainv : (M,M) ndarray or sparse matrix

inverse of A

Notes

This computes the sparse inverse of A. If the inverse of Ais expected to be non-sparse, it will likely be faster to

convert Ato dense and use scipy.linalg.inv.

scipy.sparse.linalg.expm(A)

Compute the matrix exponential using Pade approximation. New in version 0.12.0.

Parameters A: (M,M) array or sparse matrix

2D Array or Matrix (sparse or dense) to be exponentiated

Returns expA : (M,M) ndarray

Matrix exponential of A

Notes

This is algorithm (6.1) which is a simpliﬁcation of algorithm (5.1).

References

[R17]

scipy.sparse.linalg.expm_multiply(A,B,start=None,stop=None,num=None,endpoint=None)

Compute the action of the matrix exponential of A on B.

Parameters A: transposable linear operator

The operator whose exponential is of interest.

B: ndarray

The matrix or vector to be multiplied by the matrix exponential of A.

start : scalar, optional

The starting time point of the sequence.

stop : scalar, optional

The end time point of the sequence, unless endpoint is set to False. In that

case, the sequence consists of all but the last of num + 1 evenly spaced

time points, so that stop is excluded. Note that the step size changes when

endpoint is False.

num : int, optional

Number of time points to use.

endpoint : bool, optional

If True, stop is the last time point. Otherwise, it is not included.

Returns expm_A_B : ndarray

The result of the action etkAB.

Notes

The optional arguments deﬁning the sequence of evenly spaced time points are compatible with the arguments

of numpy.linspace.

The output ndarray shape is somewhat complicated so I explain it here. The ndim of the output could be either

1, 2, or 3. It would be 1 if you are computing the expm action on a single vector at a single time point. It would

be 2 if you are computing the expm action on a vector at multiple time points, or if you are computing the expm

action on a matrix at a single time point. It would be 3 if you want the action on a matrix with multiple columns

at multiple time points. If multiple time points are requested, expm_A_B[0] will always be the action of the

expm at the ﬁrst time point, regardless of whether the action is on a vector or a matrix.

758 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

References

[R18],[R19]

onenormest(A[, t, itmax, compute_v, compute_w]) Compute a lower bound of the 1-norm of a sparse matrix.

Matrix norms

scipy.sparse.linalg.onenormest(A,t=2,itmax=5,compute_v=False,compute_w=False)

Compute a lower bound of the 1-norm of a sparse matrix. New in version 0.13.0.

Parameters A: ndarray or other linear operator

A linear operator that can be transposed and that can produce matrix prod-

ucts.

t: int, optional

A positive parameter controlling the tradeoff between accuracy versus time

and memory usage. Larger values take longer and use more memory but

give more accurate output.

itmax : int, optional

Use at most this many iterations.

compute_v : bool, optional

Request a norm-maximizing linear operator input vector if True.

compute_w : bool, optional

Request a norm-maximizing linear operator output vector if True.

Returns est : ﬂoat

An underestimate of the 1-norm of the sparse matrix.

v: ndarray, optional

The vector such that ||Av||_1 == est*||v||_1. It can be thought of as an input

to the linear operator that gives an output with particularly large norm.

w: ndarray, optional

The vector Av which has relatively large 1-norm. It can be thought of as an

output of the linear operator that is relatively large in norm compared to the

input.

Notes

This is algorithm 2.4 of [1].

In [2] it is described as follows. “This algorithm typically requires the evaluation of about 4t matrix-vector

products and almost invariably produces a norm estimate (which is, in fact, a lower bound on the norm) correct

to within a factor 3.”

References

[R25],[R26]

Solving linear problems Direct methods for linear equation systems:

spsolve(A, b[, permc_spec, use_umfpack]) Solve the sparse linear system Ax=b, where b may be a vector or a matrix.

factorized(A) Return a fuction for solving a sparse linear system, with A pre-factorized.

scipy.sparse.linalg.spsolve(A,b,permc_spec=None,use_umfpack=True)

Solve the sparse linear system Ax=b, where b may be a vector or a matrix.

Parameters A: ndarray or sparse matrix

5.23. Sparse matrices (scipy.sparse) 759

SciPy Reference Guide, Release 0.13.0

The square matrix A will be converted into CSC or CSR form

b: ndarray or sparse matrix

The matrix or vector representing the right hand side of the equation. If a

vector, b.size must

permc_spec : str, optional

How to permute the columns of the matrix for sparsity preservation. (de-

fault: ‘COLAMD’)

•NATURAL: natural ordering.

•MMD_ATA: minimum degree ordering on the structure of

A^T A.

•MMD_AT_PLUS_A: minimum degree ordering on the struc-

ture of A^T+A.

•COLAMD: approximate minimum degree column ordering

use_umfpack : bool (optional)

if True (default) then use umfpack for the solution. This is only referenced

if b is a vector.

Returns x: ndarray or sparse matrix

the solution of the sparse linear equation. If b is a vector, then x is a vector

of size A.shape[1] If b is a matrix, then x is a matrix of size (A.shape[1],

b.shape[1])

Notes

For solving the matrix expression AX = B, this solver assumes the resulting matrix X is sparse, as is often the

case for very sparse inputs. If the resulting X is dense, the construction of this sparse result will be relatively

expensive. In that case, consider converting A to a dense matrix and using scipy.linalg.solve or its variants.

scipy.sparse.linalg.factorized(A)

Return a fuction for solving a sparse linear system, with A pre-factorized.

Parameters A: (N, N) array_like

Input.

Returns solve : callable

To solve the linear system of equations given in A, the solve callable should

be passed an ndarray of shape (N,).

Examples

>>> A=np.array([[ 3. ,2. ,-1. ],

[ 2. , -2. , 4. ],

[-1. , 0.5, -1. ]])

>>> solve =factorized( A ) # Makes LU decomposition.

>>> rhs1 =np.array([1,-2,0])

>>> x1 =solve( rhs1 ) # Uses the LU factors.

array([ 1., -2., -2.])

Iterative methods for linear equation systems:

bicg(A, b[, x0, tol, maxiter, xtype, M, ...]) Use BIConjugate Gradient iteration to solve A x = b

bicgstab(A, b[, x0, tol, maxiter, xtype, M, ...]) Use BIConjugate Gradient STABilized iteration to solve A x = b

cg(A, b[, x0, tol, maxiter, xtype, M, callback]) Use Conjugate Gradient iteration to solve A x = b

cgs(A, b[, x0, tol, maxiter, xtype, M, callback]) Use Conjugate Gradient Squared iteration to solve A x = b

gmres(A, b[, x0, tol, restart, maxiter, ...]) Use Generalized Minimal RESidual iteration to solve A x = b.

lgmres(A, b[, x0, tol, maxiter, M, ...]) Solve a matrix equation using the LGMRES algorithm.

Continued on next page

760 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Table 5.144 – continued from previous page

minres(A, b[, x0, shift, tol, maxiter, ...]) Use MINimum RESidual iteration to solve Ax=b

qmr(A, b[, x0, tol, maxiter, xtype, M1, M2, ...]) Use Quasi-Minimal Residual iteration to solve A x = b

scipy.sparse.linalg.bicg(A,b,x0=None,tol=1e-05,maxiter=None,xtype=None,M=None,call-

back=None)

Use BIConjugate Gradient iteration to solve A x = b

Parameters A: {sparse matrix, dense matrix, LinearOperator}

The real or complex N-by-N matrix of the linear system It is required that

the linear operator can produce Ax and A^T x.

b: {array, matrix}

Right hand side of the linear system. Has shape (N,) or (N,1).

Returns x: {array, matrix}

The converged solution.

info : integer

Provides convergence information:

0 : successful exit >0 : convergence to tolerance not

achieved, number of iterations <0 : illegal input or break-

down

Other Parameters

x0 : {array, matrix}

Starting guess for the solution.

tol : ﬂoat

Tolerance to achieve. The algorithm terminates when either the relative or

the absolute residual is below tol.

maxiter : integer

Maximum number of iterations. Iteration will stop after maxiter steps even

if the speciﬁed tolerance has not been achieved.

M: {sparse matrix, dense matrix, LinearOperator}

Preconditioner for A. The preconditioner should approximate the inverse

of A. Effective preconditioning dramatically improves the rate of conver-

gence, which implies that fewer iterations are needed to reach a given error

tolerance.

callback : function

User-supplied function to call after each iteration. It is called as call-

back(xk), where xk is the current solution vector.

xtype : {‘f’,’d’,’F’,’D’}

This parameter is deprecated – avoid using it.

The type of the result. If None, then it will be determined from A.dtype.char

and b. If A does not have a typecode method then it will compute

A.matvec(x0) to get a typecode. To save the extra computation when A

does not have a typecode attribute use xtype=0 for the same type as b or

use xtype=’f’,’d’,’F’,or ‘D’. This parameter has been superceeded by Lin-

earOperator.

scipy.sparse.linalg.bicgstab(A,b,x0=None,tol=1e-05,maxiter=None,xtype=None,M=None,

callback=None)

Use BIConjugate Gradient STABilized iteration to solve A x = b

Parameters A: {sparse matrix, dense matrix, LinearOperator}

The real or complex N-by-N matrix of the linear system Amust represent a

hermitian, positive deﬁnite matrix

b: {array, matrix}

Right hand side of the linear system. Has shape (N,) or (N,1).

Returns x: {array, matrix}

The converged solution.

5.23. Sparse matrices (scipy.sparse) 761

SciPy Reference Guide, Release 0.13.0

info : integer

Provides convergence information:

0 : successful exit >0 : convergence to tolerance not

achieved, number of iterations <0 : illegal input or break-

down

Other Parameters

x0 : {array, matrix}

Starting guess for the solution.

tol : ﬂoat

Tolerance to achieve. The algorithm terminates when either the relative or

the absolute residual is below tol.

maxiter : integer

Maximum number of iterations. Iteration will stop after maxiter steps even

if the speciﬁed tolerance has not been achieved.

M: {sparse matrix, dense matrix, LinearOperator}

Preconditioner for A. The preconditioner should approximate the inverse

of A. Effective preconditioning dramatically improves the rate of conver-

gence, which implies that fewer iterations are needed to reach a given error

tolerance.

callback : function

User-supplied function to call after each iteration. It is called as call-

back(xk), where xk is the current solution vector.

xtype : {‘f’,’d’,’F’,’D’}

This parameter is deprecated – avoid using it.

The type of the result. If None, then it will be determined from A.dtype.char

and b. If A does not have a typecode method then it will compute

A.matvec(x0) to get a typecode. To save the extra computation when A

does not have a typecode attribute use xtype=0 for the same type as b or

use xtype=’f’,’d’,’F’,or ‘D’. This parameter has been superceeded by Lin-

earOperator.

scipy.sparse.linalg.cg(A,b,x0=None,tol=1e-05,maxiter=None,xtype=None,M=None,call-

back=None)

Use Conjugate Gradient iteration to solve A x = b

Parameters A: {sparse matrix, dense matrix, LinearOperator}

The real or complex N-by-N matrix of the linear system Amust represent a

hermitian, positive deﬁnite matrix

b: {array, matrix}

Right hand side of the linear system. Has shape (N,) or (N,1).

Returns x: {array, matrix}

The converged solution.

info : integer

Provides convergence information:

0 : successful exit >0 : convergence to tolerance not

achieved, number of iterations <0 : illegal input or break-

down

Other Parameters

x0 : {array, matrix}

Starting guess for the solution.

tol : ﬂoat

Tolerance to achieve. The algorithm terminates when either the relative or

the absolute residual is below tol.

maxiter : integer

Maximum number of iterations. Iteration will stop after maxiter steps even

if the speciﬁed tolerance has not been achieved.

M: {sparse matrix, dense matrix, LinearOperator}

762 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Preconditioner for A. The preconditioner should approximate the inverse

of A. Effective preconditioning dramatically improves the rate of conver-

gence, which implies that fewer iterations are needed to reach a given error

tolerance.

callback : function

User-supplied function to call after each iteration. It is called as call-

back(xk), where xk is the current solution vector.

xtype : {‘f’,’d’,’F’,’D’}

This parameter is deprecated – avoid using it.

The type of the result. If None, then it will be determined from A.dtype.char

and b. If A does not have a typecode method then it will compute

A.matvec(x0) to get a typecode. To save the extra computation when A

does not have a typecode attribute use xtype=0 for the same type as b or

use xtype=’f’,’d’,’F’,or ‘D’. This parameter has been superceeded by Lin-

earOperator.

scipy.sparse.linalg.cgs(A,b,x0=None,tol=1e-05,maxiter=None,xtype=None,M=None,call-

back=None)

Use Conjugate Gradient Squared iteration to solve A x = b

Parameters A: {sparse matrix, dense matrix, LinearOperator}

The real-valued N-by-N matrix of the linear system

b: {array, matrix}

Right hand side of the linear system. Has shape (N,) or (N,1).

Returns x: {array, matrix}

The converged solution.

info : integer

Provides convergence information:

0 : successful exit >0 : convergence to tolerance not

achieved, number of iterations <0 : illegal input or break-

down

Other Parameters

x0 : {array, matrix}

Starting guess for the solution.

tol : ﬂoat

Tolerance to achieve. The algorithm terminates when either the relative or

the absolute residual is below tol.

maxiter : integer

Maximum number of iterations. Iteration will stop after maxiter steps even

if the speciﬁed tolerance has not been achieved.

M: {sparse matrix, dense matrix, LinearOperator}

Preconditioner for A. The preconditioner should approximate the inverse

of A. Effective preconditioning dramatically improves the rate of conver-

gence, which implies that fewer iterations are needed to reach a given error

tolerance.

callback : function

User-supplied function to call after each iteration. It is called as call-

back(xk), where xk is the current solution vector.

xtype : {‘f’,’d’,’F’,’D’}

This parameter is deprecated – avoid using it.

The type of the result. If None, then it will be determined from A.dtype.char

and b. If A does not have a typecode method then it will compute

A.matvec(x0) to get a typecode. To save the extra computation when A

does not have a typecode attribute use xtype=0 for the same type as b or

use xtype=’f’,’d’,’F’,or ‘D’. This parameter has been superceeded by Lin-

earOperator.

5.23. Sparse matrices (scipy.sparse) 763

SciPy Reference Guide, Release 0.13.0

scipy.sparse.linalg.gmres(A,b,x0=None,tol=1e-05,restart=None,maxiter=None,xtype=None,

M=None,callback=None,restrt=None)

Use Generalized Minimal RESidual iteration to solve A x = b.

Parameters A: {sparse matrix, dense matrix, LinearOperator}

The real or complex N-by-N matrix of the linear system.

b: {array, matrix}

Right hand side of the linear system. Has shape (N,) or (N,1).

Returns x: {array, matrix}

The converged solution.

info : int

Provides convergence information:

•0 : successful exit

•>0 : convergence to tolerance not achieved, number of

iterations

•<0 : illegal input or breakdown

Other Parameters

x0 : {array, matrix}

Starting guess for the solution (a vector of zeros by default).

tol : ﬂoat

Tolerance to achieve. The algorithm terminates when either the relative or

the absolute residual is below tol.

restart : int, optional

Number of iterations between restarts. Larger values increase iteration cost,

but may be necessary for convergence. Default is 20.

maxiter : int, optional

Maximum number of iterations. Iteration will stop after maxiter steps even

if the speciﬁed tolerance has not been achieved.

xtype : {‘f’,’d’,’F’,’D’}

This parameter is DEPRECATED — avoid using it.

The type of the result. If None, then it will be determined from A.dtype.char

and b. If A does not have a typecode method then it will compute

A.matvec(x0) to get a typecode. To save the extra computation when A

does not have a typecode attribute use xtype=0 for the same type as b or

use xtype=’f’,’d’,’F’,or ‘D’. This parameter has been superceeded by Lin-

earOperator.

M: {sparse matrix, dense matrix, LinearOperator}

Inverse of the preconditioner of A. M should approximate the inverse of A

and be easy to solve for (see Notes). Effective preconditioning dramatically

improves the rate of convergence, which implies that fewer iterations are

needed to reach a given error tolerance. By default, no preconditioner is

used.

callback : function

User-supplied function to call after each iteration. It is called as call-

back(rk), where rk is the current residual vector.

restrt : int, optional

DEPRECATED - use restart instead.

See Also

LinearOperator

Notes

A preconditioner, P, is chosen such that P is close to A but easy to solve for. The preconditioner parameter

required by this routine is M = P^-1. The inverse should preferably not be calculated explicitly. Rather, use

764 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

the following template to produce M:

# Construct a linear operator that computes P^-1 *x.

import scipy.sparse.linalg as spla

M_x =lambda x: spla.spsolve(P, x)

M=spla.LinearOperator((n, n), M_x)

scipy.sparse.linalg.lgmres(A,b,x0=None,tol=1e-05,maxiter=1000,M=None,callback=None,

inner_m=30,outer_k=3,outer_v=None,store_outer_Av=True)

Solve a matrix equation using the LGMRES algorithm.

The LGMRES algorithm [BJM] [BPh] is designed to avoid some problems in the convergence in restarted

GMRES, and often converges in fewer iterations.

Parameters A: {sparse matrix, dense matrix, LinearOperator}

The real or complex N-by-N matrix of the linear system.

b: {array, matrix}

Right hand side of the linear system. Has shape (N,) or (N,1).

x0 : {array, matrix}

Starting guess for the solution.

tol : ﬂoat

Tolerance to achieve. The algorithm terminates when either the relative or

the absolute residual is below tol.

maxiter : int

Maximum number of iterations. Iteration will stop after maxiter steps even

if the speciﬁed tolerance has not been achieved.

M: {sparse matrix, dense matrix, LinearOperator}

Preconditioner for A. The preconditioner should approximate the inverse

of A. Effective preconditioning dramatically improves the rate of conver-

gence, which implies that fewer iterations are needed to reach a given error

tolerance.

callback : function

User-supplied function to call after each iteration. It is called as call-

back(xk), where xk is the current solution vector.

inner_m : int, optional

Number of inner GMRES iterations per each outer iteration.

outer_k : int, optional

Number of vectors to carry between inner GMRES iterations. According to

[BJM], good values are in the range of 1...3. However, note that if you want

to use the additional vectors to accelerate solving multiple similar problems,

larger values may be beneﬁcial.

outer_v : list of tuples, optional

List containing tuples (v, Av) of vectors and corresponding matrix-

vector products, used to augment the Krylov subspace, and carried between

inner GMRES iterations. The element Av can be None if the matrix-vector

product should be re-evaluated. This parameter is modiﬁed in-place by

lgmres, and can be used to pass “guess” vectors in and out of the algo-

rithm when solving similar problems.

store_outer_Av : bool, optional

Whether LGMRES should store also A*v in addition to vectors vin the

outer_v list. Default is True.

Returns x: array or matrix

The converged solution.

info : int

Provides convergence information:

•0 : successful exit

5.23. Sparse matrices (scipy.sparse) 765

SciPy Reference Guide, Release 0.13.0

•>0 : convergence to tolerance not achieved, number of iter-

ations

•<0 : illegal input or breakdown

Notes

The LGMRES algorithm [BJM] [BPh] is designed to avoid the slowing of convergence in restarted GMRES, due

to alternating residual vectors. Typically, it often outperforms GMRES(m) of comparable memory requirements

by some measure, or at least is not much worse.

Another advantage in this algorithm is that you can supply it with ‘guess’ vectors in the outer_v argument that

augment the Krylov subspace. If the solution lies close to the span of these vectors, the algorithm converges

faster. This can be useful if several very similar matrices need to be inverted one after another, such as in

Newton-Krylov iteration where the Jacobian matrix often changes little in the nonlinear steps.

References

[BJM],[BPh]

scipy.sparse.linalg.minres(A,b,x0=None,shift=0.0,tol=1e-05,maxiter=None,xtype=None,

M=None,callback=None,show=False,check=False)

Use MINimum RESidual iteration to solve Ax=b

MINRES minimizes norm(A*x - b) for a real symmetric matrix A. Unlike the Conjugate Gradient method, A

can be indeﬁnite or singular.

If shift != 0 then the method solves (A - shift*I)x = b

Parameters A: {sparse matrix, dense matrix, LinearOperator}

The real symmetric N-by-N matrix of the linear system

b: {array, matrix}

Right hand side of the linear system. Has shape (N,) or (N,1).

Returns x: {array, matrix}

The converged solution.

info : integer

Provides convergence information:

0 : successful exit >0 : convergence to tolerance not

achieved, number of iterations <0 : illegal input or break-

down

Other Parameters

x0 : {array, matrix}

Starting guess for the solution.

tol : ﬂoat

Tolerance to achieve. The algorithm terminates when either the relative or

the absolute residual is below tol.

maxiter : integer

Maximum number of iterations. Iteration will stop after maxiter steps even

if the speciﬁed tolerance has not been achieved.

M: {sparse matrix, dense matrix, LinearOperator}

Preconditioner for A. The preconditioner should approximate the inverse

of A. Effective preconditioning dramatically improves the rate of conver-

gence, which implies that fewer iterations are needed to reach a given error

tolerance.

callback : function

User-supplied function to call after each iteration. It is called as call-

back(xk), where xk is the current solution vector.

xtype : {‘f’,’d’,’F’,’D’}

This parameter is deprecated – avoid using it.

766 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

The type of the result. If None, then it will be determined from A.dtype.char

and b. If A does not have a typecode method then it will compute

A.matvec(x0) to get a typecode. To save the extra computation when A

does not have a typecode attribute use xtype=0 for the same type as b or

use xtype=’f’,’d’,’F’,or ‘D’. This parameter has been superceeded by Lin-

earOperator.

Notes

THIS FUNCTION IS EXPERIMENTAL AND SUBJECT TO CHANGE!

References

Solution of sparse indeﬁnite systems of linear equations,

C. C. Paige and M. A. Saunders (1975), SIAM J. Numer. Anal. 12(4), pp. 617-629.

http://www.stanford.edu/group/SOL/software/minres.html

This ﬁle is a translation of the following MATLAB implementation:

http://www.stanford.edu/group/SOL/software/minres/matlab/

scipy.sparse.linalg.qmr(A,b,x0=None,tol=1e-05,maxiter=None,xtype=None,M1=None,

M2=None,callback=None)

Use Quasi-Minimal Residual iteration to solve A x = b

Parameters A: {sparse matrix, dense matrix, LinearOperator}

The real-valued N-by-N matrix of the linear system. It is required that the

linear operator can produce Ax and A^T x.

b: {array, matrix}

Right hand side of the linear system. Has shape (N,) or (N,1).

Returns x: {array, matrix}

The converged solution.

info : integer

Provides convergence information:

0 : successful exit >0 : convergence to tolerance not

achieved, number of iterations <0 : illegal input or break-

down

Other Parameters

x0 : {array, matrix}

Starting guess for the solution.

tol : ﬂoat

Tolerance to achieve. The algorithm terminates when either the relative or

the absolute residual is below tol.

maxiter : integer

Maximum number of iterations. Iteration will stop after maxiter steps even

if the speciﬁed tolerance has not been achieved.

M1 : {sparse matrix, dense matrix, LinearOperator}

Left preconditioner for A.

M2 : {sparse matrix, dense matrix, LinearOperator}

Right preconditioner for A. Used together with the left preconditioner M1.

The matrix M1*A*M2 should have better conditioned than A alone.

callback : function

User-supplied function to call after each iteration. It is called as call-

back(xk), where xk is the current solution vector.

xtype : {‘f’,’d’,’F’,’D’}

This parameter is DEPRECATED – avoid using it.

The type of the result. If None, then it will be determined from A.dtype.char

and b. If A does not have a typecode method then it will compute

A.matvec(x0) to get a typecode. To save the extra computation when A

5.23. Sparse matrices (scipy.sparse) 767

SciPy Reference Guide, Release 0.13.0

does not have a typecode attribute use xtype=0 for the same type as b or

use xtype=’f’,’d’,’F’,or ‘D’. This parameter has been superceeded by Lin-

earOperator.

See Also

LinearOperator

Iterative methods for least-squares problems:

lsqr(A, b[, damp, atol, btol, conlim, ...]) Find the least-squares solution to a large, sparse, linear system of equations.

lsmr(A, b[, damp, atol, btol, conlim, ...]) Iterative solver for least-squares problems.

scipy.sparse.linalg.lsqr(A,b,damp=0.0,atol=1e-08,btol=1e-08,conlim=100000000.0,

iter_lim=None,show=False,calc_var=False)

Find the least-squares solution to a large, sparse, linear system of equations.

The function solves Ax = b or min ||b - Ax||^2 or min ||Ax - b||^2 + d^2 ||x||^2.

The matrix A may be square or rectangular (over-determined or under-determined), and may have any rank.

1. Unsymmetric equations -- solve A*x=b

2. Linear least squares -- solve A*x=b

in the least-squares sense

3. Damped least squares -- solve ( A )*x=(b)

( damp*I) (0)

in the least-squares sense

Parameters A: {sparse matrix, ndarray, LinearOperatorLinear}

Representation of an m-by-n matrix. It is required that the linear operator

can produce Ax and A^T x.

b: (m,) ndarray

Right-hand side vector b.

damp : ﬂoat

Damping coefﬁcient.

atol, btol : ﬂoat

Stopping tolerances. If both are 1.0e-9 (say), the ﬁnal residual norm should

be accurate to about 9 digits. (The ﬁnal x will usually have fewer correct

digits, depending on cond(A) and the size of damp.)

conlim : ﬂoat

Another stopping tolerance. lsqr terminates if an estimate of cond(A)

exceeds conlim. For compatible systems Ax = b,conlim could be as large

as 1.0e+12 (say). For least-squares problems, conlim should be less than

1.0e+8. Maximum precision can be obtained by setting atol = btol =

conlim = zero, but the number of iterations may then be excessive.

iter_lim : int

Explicit limitation on number of iterations (for safety).

show : bool

Display an iteration log.

calc_var : bool

Whether to estimate diagonals of (A’A + damp^2*I)^{-1}.

Returns x: ndarray of ﬂoat

The ﬁnal solution.

istop : int

768 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

Gives the reason for termination. 1 means x is an approximate solution to

Ax = b. 2 means x approximately solves the least-squares problem.

itn : int

Iteration number upon termination.

r1norm : ﬂoat

norm(r), where r=b-Ax.

r2norm : ﬂoat

sqrt( norm(r)^2 + damp^2 *norm(x)^2 ). Equal to r1norm

if damp == 0.

anorm : ﬂoat

Estimate of Frobenius norm of Abar = [[A]; [damp*I]].

acond : ﬂoat

Estimate of cond(Abar).

arnorm : ﬂoat

Estimate of norm(A’*r - damp^2*x).

xnorm : ﬂoat

norm(x)

var : ndarray of ﬂoat

If calc_var is True, estimates all diagonals of (A’A)^{-1} (if damp

== 0) or more generally (A’A + damp^2*I)^{-1}. This is well de-

ﬁned if A has full column rank or damp > 0. (Not sure what var means if

rank(A) < n and damp = 0.)

Notes

LSQR uses an iterative method to approximate the solution. The number of iterations required to reach a certain

accuracy depends strongly on the scaling of the problem. Poor scaling of the rows or columns of A should

therefore be avoided where possible.

For example, in problem 1 the solution is unaltered by row-scaling. If a row of A is very small or large compared

to the other rows of A, the corresponding row of ( A b ) should be scaled up or down.

In problems 1 and 2, the solution x is easily recovered following column-scaling. Unless better information is

known, the nonzero columns of A should be scaled so that they all have the same Euclidean norm (e.g., 1.0).

In problem 3, there is no freedom to re-scale if damp is nonzero. However, the value of damp should be assigned

only after attention has been paid to the scaling of A.

The parameter damp is intended to help regularize ill-conditioned systems, by preventing the true solution from

being very large. Another aid to regularization is provided by the parameter acond, which may be used to

terminate iterations before the computed solution becomes very large.

If some initial estimate x0 is known and if damp == 0, one could proceed as follows:

1.Compute a residual vector r0=b-A*x0.

2.Use LSQR to solve the system A*dx = r0.

3.Add the correction dx to obtain a ﬁnal solution x=x0+dx.

This requires that x0 be available before and after the call to LSQR. To judge the beneﬁts, suppose LSQR

takes k1 iterations to solve A*x = b and k2 iterations to solve A*dx = r0. If x0 is “good”, norm(r0) will be

smaller than norm(b). If the same stopping tolerances atol and btol are used for each system, k1 and k2 will be

similar, but the ﬁnal solution x0 + dx should be more accurate. The only way to reduce the total work is to use

a larger stopping tolerance for the second system. If some value btol is suitable for A*x = b, the larger value

btol*norm(b)/norm(r0) should be suitable for A*dx = r0.

Preconditioning is another way to reduce the number of iterations. If it is possible to solve a related system M*x

= b efﬁciently, where M approximates A in some helpful way (e.g. M - A has low rank or its elements are

small relative to those of A), LSQR may converge more rapidly on the system A*M(inverse)*z=b, after

which x can be recovered by solving M*x = z.

5.23. Sparse matrices (scipy.sparse) 769

SciPy Reference Guide, Release 0.13.0

If A is symmetric, LSQR should not be used!

Alternatives are the symmetric conjugate-gradient method (cg) and/or SYMMLQ. SYMMLQ is an implemen-

tation of symmetric cg that applies to any symmetric A and will converge more rapidly than LSQR. If A is

positive deﬁnite, there are other implementations of symmetric cg that require slightly less work per iteration

than SYMMLQ (but will take the same number of iterations).

References

[R22],[R23],[R24]

scipy.sparse.linalg.lsmr(A,b,damp=0.0,atol=1e-06,btol=1e-06,conlim=100000000.0,max-

iter=None,show=False)

Iterative solver for least-squares problems.

lsmr solves the system of linear equations Ax = b. If the system is inconsistent, it solves the least-squares

problem min ||b - Ax||_2. A is a rectangular matrix of dimension m-by-n, where all cases are allowed:

m = n, m > n, or m < n. B is a vector of length m. The matrix A may be dense or sparse (usually sparse). New

in version 0.11.0.

Parameters A: {matrix, sparse matrix, ndarray, LinearOperator}

Matrix A in the linear system.

b: (m,) ndarray

Vector b in the linear system.

damp : ﬂoat

Damping factor for regularized least-squares. lsmr solves the regularized

least-squares problem:

min ||(b) - ( A )x||

||(0) (damp*I) ||_2

where damp is a scalar. If damp is None or 0, the system is solved without

regularization.

atol, btol : ﬂoat

Stopping tolerances. lsmr continues iterations until a certain backward er-

ror estimate is smaller than some quantity depending on atol and btol. Let

r=b-Axbe the residual vector for the current approximate solution

x. If Ax = b seems to be consistent, lsmr terminates when norm(r)

<= atol *norm(A) *norm(x) + btol *norm(b). Other-

wise, lsmr terminates when norm(A^{T} r) <= atol *norm(A)

*norm(r). If both tolerances are 1.0e-6 (say), the ﬁnal norm(r)

should be accurate to about 6 digits. (The ﬁnal x will usually have fewer

correct digits, depending on cond(A) and the size of LAMBDA.) If atol

or btol is None, a default value of 1.0e-6 will be used. Ideally, they should

be estimates of the relative error in the entries of A and B respectively. For

example, if the entries of Ahave 7 correct digits, set atol = 1e-7. This pre-

vents the algorithm from doing unnecessary work beyond the uncertainty

of the input data.

conlim : ﬂoat

lsmr terminates if an estimate of cond(A) exceeds conlim. For compati-

ble systems Ax = b, conlim could be as large as 1.0e+12 (say). For least-

squares problems, conlim should be less than 1.0e+8. If conlim is None,

the default value is 1e+8. Maximum precision can be obtained by setting

atol = btol = conlim = 0, but the number of iterations may then

be excessive.

maxiter : int

770 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

lsmr terminates if the number of iterations reaches maxiter. The default is

maxiter = min(m, n). For ill-conditioned systems, a larger value of

maxiter may be needed.

show : bool

Print iterations logs if show=True.

Returns x: ndarray of ﬂoat

Least-square solution returned.

istop : int

istop gives the reason for stopping:

istop = 0 means x=0 is a solution.

= 1 means x is an approximate solution to A*x = B,

according to atol and btol.

= 2 means x approximately solves the least-squares problem

according to atol.

= 3 means COND(A) seems to be greater than CONLIM.

= 4 is the same as 1 with atol = btol = eps (machine

precision)

= 5 is the same as 2 with atol = eps.

= 6 is the same as 3 with CONLIM = 1/eps.

= 7 means ITN reached maxiter before the other stopping

conditions were satisfied.

itn : int

Number of iterations used.

normr : ﬂoat

norm(b-Ax)

normar : ﬂoat

norm(A^T (b - Ax))

norma : ﬂoat

norm(A)

conda : ﬂoat

Condition number of A.

normx : ﬂoat

norm(x)

References

[R20],[R21]

Matrix factorizations Eigenvalue problems:

eigs(A[, k, M, sigma, which, v0, ncv, ...]) Find k eigenvalues and eigenvectors of the square matrix A.

eigsh(A[, k, M, sigma, which, v0, ncv, ...]) Find k eigenvalues and eigenvectors of the real symmetric square matrix

lobpcg(A, X[, B, M, Y, tol, maxiter, ...]) Solve symmetric partial eigenproblems with optional preconditioning

scipy.sparse.linalg.eigs(A,k=6,M=None,sigma=None,which=’LM’,v0=None,ncv=None,max-

iter=None,tol=0,return_eigenvectors=True,Minv=None,OPinv=None,

OPpart=None)

Find k eigenvalues and eigenvectors of the square matrix A.

Solves A*x[i] = w[i] *x[i], the standard eigenvalue problem for w[i] eigenvalues with correspond-

ing eigenvectors x[i].

If M is speciﬁed, solves A*x[i] = w[i] *M*x[i], the generalized eigenvalue problem for w[i]

eigenvalues with corresponding eigenvectors x[i]

5.23. Sparse matrices (scipy.sparse) 771

SciPy Reference Guide, Release 0.13.0

Parameters A: ndarray, sparse matrix or LinearOperator

An array, sparse matrix, or LinearOperator representing the operation A*

x, where A is a real or complex square matrix.

k: int, optional

The number of eigenvalues and eigenvectors desired. kmust be smaller

than N. It is not possible to compute all eigenvectors of a matrix.

M: ndarray, sparse matrix or LinearOperator, optional

An array, sparse matrix, or LinearOperator representing the operation M*x

for the generalized eigenvalue problem

A*x=w*M*x.

M must represent a real, symmetric matrix if A is real, and must represent

a complex, hermitian matrix if A is complex. For best results, the data type

of M should be the same as that of A. Additionally:

If sigma is None, M is positive deﬁnite

If sigma is speciﬁed, M is positive semi-deﬁnite

If sigma is None, eigs requires an operator to compute the solution of the

linear equation M*x=b. This is done internally via a (sparse) LU de-

composition for an explicit matrix M, or via an iterative solver for a general

linear operator. Alternatively, the user can supply the matrix or operator

Minv, which gives x = Minv *b = M^-1 *b.

sigma : real or complex, optional

Find eigenvalues near sigma using shift-invert mode. This requires an op-

erator to compute the solution of the linear system [A - sigma *M]

*x=b, where M is the identity matrix if unspeciﬁed. This is computed

internally via a (sparse) LU decomposition for explicit matrices A & M, or

via an iterative solver if either A or M is a general linear operator. Alter-

natively, the user can supply the matrix or operator OPinv, which gives x

= OPinv *b = [A - sigma *M]^-1 *b. For a real matrix A,

shift-invert can either be done in imaginary mode or real mode, speciﬁed

by the parameter OPpart (‘r’ or ‘i’). Note that when sigma is speciﬁed, the

keyword ‘which’ (below) refers to the shifted eigenvalues w’[i] where:

If A is real and OPpart == ‘r’ (default),

w’[i] = 1/2 *[1/(w[i]-sigma)

+ 1/(w[i]-conj(sigma))].

If A is real and OPpart == ‘i’,

w’[i] = 1/2i *

[1/(w[i]-sigma) -

1/(w[i]-conj(sigma))].

If A is complex, w’[i] = 1/(w[i]-sigma).

v0 : ndarray, optional

Starting vector for iteration.

ncv : int, optional

The number of Lanczos vectors generated ncv must be greater than k; it is

recommended that ncv > 2*k.

which : str, [’LM’ | ‘SM’ | ‘LR’ | ‘SR’ | ‘LI’ | ‘SI’], optional

Which keigenvectors and eigenvalues to ﬁnd:

‘LM’ : largest magnitude

‘SM’ : smallest magnitude

‘LR’ : largest real part

‘SR’ : smallest real part

‘LI’ : largest imaginary part

‘SI’ : smallest imaginary part

When sigma != None, ‘which’ refers to the shifted eigenvalues w’[i] (see

discussion in ‘sigma’, above). ARPACK is generally better at ﬁnding large

772 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

values than small values. If small eigenvalues are desired, consider using

shift-invert mode for better performance.

maxiter : int, optional

Maximum number of Arnoldi update iterations allowed

tol : ﬂoat, optional

Relative accuracy for eigenvalues (stopping criterion) The default value of

0 implies machine precision.

return_eigenvectors : bool, optional

Return eigenvectors (True) in addition to eigenvalues

Minv : ndarray, sparse matrix or LinearOperator, optional

See notes in M, above.

OPinv : ndarray, sparse matrix or LinearOperator, optional

See notes in sigma, above.

OPpart : {‘r’ or ‘i’}, optional

See notes in sigma, above

Returns w: ndarray

Array of k eigenvalues.

v: ndarray

An array of keigenvectors. v[:, i] is the eigenvector corresponding to

the eigenvalue w[i].

Raises ArpackNoConvergence

When the requested convergence is not obtained. The currently con-

verged eigenvalues and eigenvectors can be found as eigenvalues and

eigenvectors attributes of the exception object.

See Also

eigsh eigenvalues and eigenvectors for symmetric matrix A

svds singular value decomposition for a matrix A

Notes

This function is a wrapper to the ARPACK [R13] SNEUPD, DNEUPD, CNEUPD, ZNEUPD, functions which

use the Implicitly Restarted Arnoldi Method to ﬁnd the eigenvalues and eigenvectors [R14].

References

[R13],[R14]

Examples

Find 6 eigenvectors of the identity matrix:

>>> id =np.eye(13)

>>> vals, vecs =sp.sparse.linalg.eigs(id, k=6)

>>> vals

array([ 1.+0.j, 1.+0.j, 1.+0.j, 1.+0.j, 1.+0.j, 1.+0.j])

>>> vecs.shape

(13, 6)

scipy.sparse.linalg.eigsh(A,k=6,M=None,sigma=None,which=’LM’,v0=None,ncv=None,

maxiter=None,tol=0,return_eigenvectors=True,Minv=None,

OPinv=None,mode=’normal’)

Find k eigenvalues and eigenvectors of the real symmetric square matrix or complex hermitian matrix A.

Solves A*x[i] = w[i] *x[i], the standard eigenvalue problem for w[i] eigenvalues with correspond-

ing eigenvectors x[i].

5.23. Sparse matrices (scipy.sparse) 773

SciPy Reference Guide, Release 0.13.0

If M is speciﬁed, solves A*x[i] = w[i] *M*x[i], the generalized eigenvalue problem for w[i]

eigenvalues with corresponding eigenvectors x[i]

Parameters A: An N x N matrix, array, sparse matrix, or LinearOperator representing

the operation A * x, where A is a real symmetric matrix For buckling mode

(see below) A must additionally be positive-deﬁnite

k: integer

The number of eigenvalues and eigenvectors desired. kmust be smaller

than N. It is not possible to compute all eigenvectors of a matrix.

Returns w: array

Array of k eigenvalues

v: array

An array of k eigenvectors The v[i] is the eigenvector corresponding to the

eigenvector w[i]

Other Parameters

M: An N x N matrix, array, sparse matrix, or linear operator representing

the operation M * x for the generalized eigenvalue problem

A*x=w*M*x.

M must represent a real, symmetric matrix if A is real, and must represent

a complex, hermitian matrix if A is complex. For best results, the data type

of M should be the same as that of A. Additionally:

If sigma is None, M is symmetric positive deﬁnite

If sigma is speciﬁed, M is symmetric positive semi-deﬁnite

In buckling mode, M is symmetric indeﬁnite.

If sigma is None, eigsh requires an operator to compute the solution of

the linear equation M*x=b. This is done internally via a (sparse)

LU decomposition for an explicit matrix M, or via an iterative solver for

a general linear operator. Alternatively, the user can supply the matrix or

operator Minv, which gives x = Minv *b = M^-1 *b.

sigma : real

Find eigenvalues near sigma using shift-invert mode. This requires an op-

erator to compute the solution of the linear system [A - sigma * M] x = b,

where M is the identity matrix if unspeciﬁed. This is computed internally

via a (sparse) LU decomposition for explicit matrices A & M, or via an it-

erative solver if either A or M is a general linear operator. Alternatively, the

user can supply the matrix or operator OPinv, which gives x = OPinv *

b = [A - sigma *M]^-1 *b. Note that when sigma is speciﬁed,

the keyword ‘which’ refers to the shifted eigenvalues w’[i] where:

if mode == ‘normal’, w’[i] = 1 / (w[i] -

sigma).

if mode == ‘cayley’, w’[i] = (w[i] + sigma) /

(w[i] - sigma).

if mode == ‘buckling’, w’[i] = w[i] / (w[i] -

sigma).

(see further discussion in ‘mode’ below)

v0 : ndarray

Starting vector for iteration.

ncv : int

The number of Lanczos vectors generated ncv must be greater than k and

smaller than n; it is recommended that ncv > 2*k.

which : str [’LM’ | ‘SM’ | ‘LA’ | ‘SA’ | ‘BE’]

If A is a complex hermitian matrix, ‘BE’ is invalid. Which keigenvectors

and eigenvalues to ﬁnd:

‘LM’ : Largest (in magnitude) eigenvalues

‘SM’ : Smallest (in magnitude) eigenvalues

774 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

‘LA’ : Largest (algebraic) eigenvalues

‘SA’ : Smallest (algebraic) eigenvalues

‘BE’ : Half (k/2) from each end of the spectrum

When k is odd, return one more (k/2+1) from the high end. When sigma !=

None, ‘which’ refers to the shifted eigenvalues w’[i] (see discussion in

‘sigma’, above). ARPACK is generally better at ﬁnding large values than

small values. If small eigenvalues are desired, consider using shift-invert

mode for better performance.

maxiter : int

Maximum number of Arnoldi update iterations allowed

tol : ﬂoat

Relative accuracy for eigenvalues (stopping criterion). The default value of

0 implies machine precision.

Minv : N x N matrix, array, sparse matrix, or LinearOperator

See notes in M, above

OPinv : N x N matrix, array, sparse matrix, or LinearOperator

See notes in sigma, above.

return_eigenvectors : bool

Return eigenvectors (True) in addition to eigenvalues

mode : string [’normal’ | ‘buckling’ | ‘cayley’]

Specify strategy to use for shift-invert mode. This argument applies only

for real-valued A and sigma != None. For shift-invert mode, ARPACK in-

ternally solves the eigenvalue problem OP *x’[i] = w’[i] *B*

x’[i] and transforms the resulting Ritz vectors x’[i] and Ritz values w’[i]

into the desired eigenvectors and eigenvalues of the problem A*x[i]

= w[i] *M*x[i]. The modes are as follows:

‘normal’ : OP = [A - sigma * M]^-1 * M, B = M, w’[i]

= 1 / (w[i] - sigma)

‘buckling’ : OP = [A - sigma * M]^-1 * A, B = A, w’[i] =

w[i] / (w[i] - sigma)

‘cayley’ : OP = [A - sigma * M]^-1 * [A + sigma * M],

B = M, w’[i] = (w[i] + sigma) / (w[i] - sigma)

The choice of mode will affect which eigenvalues are selected by the key-

word ‘which’, and can also impact the stability of convergence (see [2] for

a discussion)

Raises ArpackNoConvergence

When the requested convergence is not obtained.

The currently converged eigenvalues and eigenvectors can be found as

eigenvalues and eigenvectors attributes of the exception object.

See Also

eigs eigenvalues and eigenvectors for a general (nonsymmetric) matrix A

svds singular value decomposition for a matrix A

Notes

This function is a wrapper to the ARPACK [R15] SSEUPD and DSEUPD functions which use the Implicitly

Restarted Lanczos Method to ﬁnd the eigenvalues and eigenvectors [R16].

References

[R15],[R16]

5.23. Sparse matrices (scipy.sparse) 775

SciPy Reference Guide, Release 0.13.0

Examples

>>> id =np.eye(13)

>>> vals, vecs =sp.sparse.linalg.eigsh(id, k=6)

>>> vals

array([ 1.+0.j, 1.+0.j, 1.+0.j, 1.+0.j, 1.+0.j, 1.+0.j])

>>> vecs.shape

(13, 6)

scipy.sparse.linalg.lobpcg(A,X,B=None,M=None,Y=None,tol=None,maxiter=20,

largest=True,verbosityLevel=0,retLambdaHistory=False,re-

tResidualNormsHistory=False)

Solve symmetric partial eigenproblems with optional preconditioning

This function implements the Locally Optimal Block Preconditioned Conjugate Gradient Method (LOBPCG).

Parameters A: {sparse matrix, dense matrix, LinearOperator}

The symmetric linear operator of the problem, usually a sparse matrix. Of-

ten called the “stiffness matrix”.

X: array_like

Initial approximation to the k eigenvectors. If A has shape=(n,n) then X

should have shape shape=(n,k).

B: {dense matrix, sparse matrix, LinearOperator}, optional

the right hand side operator in a generalized eigenproblem. by default, B =

Identity often called the “mass matrix”

M: {dense matrix, sparse matrix, LinearOperator}, optional

preconditioner to A; by default M = Identity M should approximate the

inverse of A

Y: array_like, optional

n-by-sizeY matrix of constraints, sizeY < n The iterations will be performed

in the B-orthogonal complement of the column-space of Y. Y must be full

rank.

Returns w: array

Array of k eigenvalues

v: array

An array of k eigenvectors. V has the same shape as X.

Other Parameters

tol : scalar, optional

Solver tolerance (stopping criterion) by default: tol=n*sqrt(eps)

maxiter : integer, optional

maximum number of iterations by default: maxiter=min(n,20)

largest : boolean, optional

when True, solve for the largest eigenvalues, otherwise the smallest

verbosityLevel : integer, optional

controls solver output. default: verbosityLevel = 0.

retLambdaHistory : boolean, optional

whether to return eigenvalue history

retResidualNormsHistory : boolean, optional

whether to return history of residual norms

Notes

If both retLambdaHistory and retResidualNormsHistory are True, the return tuple has the following format

(lambda, V, lambda history, residual norms history)

Singular values problems:

776 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

svds(A[, k, ncv, tol, which, v0, maxiter, ...]) Compute the largest k singular values/vectors for a sparse matrix.

scipy.sparse.linalg.svds(A,k=6,ncv=None,tol=0,which=’LM’,v0=None,maxiter=None,re-

turn_singular_vectors=True)

Compute the largest k singular values/vectors for a sparse matrix.

Parameters A: sparse matrix

Array to compute the SVD on, of shape (M, N)

k: int, optional

Number of singular values and vectors to compute.

ncv : integer, optional

The number of Lanczos vectors generated ncv must be greater than k+1 and

smaller than n; it is recommended that ncv > 2*k

tol : ﬂoat, optional

Tolerance for singular values. Zero (default) means machine precision.

which : str, [’LM’ | ‘SM’], optional

Which ksingular values to ﬁnd:

•‘LM’ : largest singular values

•‘SM’ : smallest singular values

New in version 0.12.0.

v0 : ndarray, optional

Starting vector for iteration, of length min(A.shape). Should be an (approx-

imate) right singular vector if N > M and a right singular vector otherwise.

New in version 0.12.0.

maxiter: integer, optional

Maximum number of iterations. New in version 0.12.0.

return_singular_vectors : bool, optional

Return singular vectors (True) in addition to singular values New in version

0.12.0.

Returns

——-

u: ndarray, shape=(M, k)

Unitary matrix having left singular vectors as columns.

s: ndarray, shape=(k,)

The singular values.

vt : ndarray, shape=(k, N)

Unitary matrix having right singular vectors as rows.

Notes

This is a naive implementation using ARPACK as an eigensolver on A.H * A or A * A.H, depending on which

one is more efﬁcient.

Complete or incomplete LU factorizations

splu(A[, permc_spec, diag_pivot_thresh, ...]) Compute the LU decomposition of a sparse, square matrix.

spilu(A[, drop_tol, ﬁll_factor, drop_rule, ...]) Compute an incomplete LU decomposition for a sparse, square matrix.

scipy.sparse.linalg.splu(A,permc_spec=None,diag_pivot_thresh=None,drop_tol=None,re-

lax=None,panel_size=None,options={})

Compute the LU decomposition of a sparse, square matrix.

Parameters A: sparse matrix

Sparse matrix to factorize. Should be in CSR or CSC format.

permc_spec : str, optional

5.23. Sparse matrices (scipy.sparse) 777

SciPy Reference Guide, Release 0.13.0

How to permute the columns of the matrix for sparsity preservation. (de-

fault: ‘COLAMD’)

•NATURAL: natural ordering.

•MMD_ATA: minimum degree ordering on the structure of

A^T A.

•MMD_AT_PLUS_A: minimum degree ordering on the struc-

ture of A^T+A.

•COLAMD: approximate minimum degree column ordering

diag_pivot_thresh : ﬂoat, optional

Threshold used for a diagonal entry to be an acceptable pivot. See SuperLU

user’s guide for details [SLU]

drop_tol : ﬂoat, optional

(deprecated) No effect.

relax : int, optional

Expert option for customizing the degree of relaxing supernodes. See Su-

perLU user’s guide for details [SLU]

panel_size : int, optional

Expert option for customizing the panel size. See SuperLU user’s guide for

details [SLU]

options : dict, optional

Dictionary containing additional expert options to SuperLU. See SuperLU

user guide [SLU] (section 2.4 on the ‘Options’ argument) for more de-

tails. For example, you can specify options=dict(Equil=False,

IterRefine=’SINGLE’)) to turn equilibration off and perform a sin-

gle iterative reﬁnement.

Returns invA : scipy.sparse.linalg.dsolve._superlu.SciPyLUType

Object, which has a solve method.

See Also

spilu incomplete LU decomposition

Notes

This function uses the SuperLU library.

References

[SLU]

scipy.sparse.linalg.spilu(A,drop_tol=None,ﬁll_factor=None,drop_rule=None,

permc_spec=None,diag_pivot_thresh=None,relax=None,

panel_size=None,options=None)

Compute an incomplete LU decomposition for a sparse, square matrix.

The resulting object is an approximation to the inverse of A.

Parameters A: (N, N) array_like

Sparse matrix to factorize

drop_tol : ﬂoat, optional

Drop tolerance (0 <= tol <= 1) for an incomplete LU decomposition. (de-

fault: 1e-4)

ﬁll_factor : ﬂoat, optional

Speciﬁes the ﬁll ratio upper bound (>= 1.0) for ILU. (default: 10)

drop_rule : str, optional

Comma-separated string of drop rules to use. Available rules: basic,

prows,column,area,secondary,dynamic,interp. (Default:

basic,area)

See SuperLU documentation for details.

778 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

milu : str, optional

Which version of modiﬁed ILU to use. (Choices: silu,smilu_1,

smilu_2 (default), smilu_3.)

Remaining other options

Same as for splu

Returns invA_approx : scipy.sparse.linalg.dsolve._superlu.SciPyLUType

Object, which has a solve method.

See Also

splu complete LU decomposition

Notes

To improve the better approximation to the inverse, you may need to increase ﬁll_factor AND decrease drop_tol.

This function uses the SuperLU library.

ArpackNoConvergence(msg, eigenvalues, ...) ARPACK iteration did not converge

ArpackError(info[, infodict]) ARPACK error

Exceptions

exception scipy.sparse.linalg.ArpackNoConvergence(msg,eigenvalues,eigenvectors)

ARPACK iteration did not converge

Attributes

eigenvalues (ndarray) Partial result. Converged eigenvalues.

eigenvectors (ndarray) Partial result. Converged eigenvectors.

5.23. Sparse matrices (scipy.sparse) 779

SciPy Reference Guide, Release 0.13.0

exception scipy.sparse.linalg.ArpackError(info,infodict={‘c’: {0: ‘Normal exit.’,1: ‘Maximum

number of iterations taken. All possible eigenvalues

of OP has been found. IPARAM(5) returns the num-

ber of wanted converged Ritz values.’,2: ‘No longer

an informational error. Deprecated starting with re-

lease 2 of ARPACK.’,3: ‘No shifts could be applied

during a cycle of the Implicitly restarted Arnoldi it-

eration. One possibility is to increase the size of

NCV relative to NEV. ‘,-9999: ‘Could not build an

Arnoldi factorization. IPARAM(5) returns the size

of the current Arnoldi factorization. The user is ad-

vised to check that enough workspace and array stor-

age has been allocated.’,-13: “NEV and WHICH

= ‘BE’ are incompatable.”,-12: ‘IPARAM(1) must

be equal to 0 or 1.’,-1: ‘N must be positive.’,-10:

‘IPARAM(7) must be 1,2,3.’,-9: ‘Starting vec-

tor is zero.’,-8: ‘Error return from LAPACK eigen-

value calculation;’,-7: ‘Length of private work ar-

ray WORKL is not sufﬁcient.’,-6: “BMAT must be

one of ‘I’ or ‘G’.”,-5: ” WHICH must be one of

‘LM’,‘SM’,‘LR’,‘SR’,‘LI’,‘SI”’,-4: ‘The max-

imum number of Arnoldi update iterations allowed

must be greater than zero.’,-3: ‘NCV-NEV >= 2 and

less than or equal to N.’,-2: ‘NEV must be posi-

tive.’,-11: “IPARAM(7) = 1 and BMAT = ‘G’ are in-

compatable.”},‘s’: {0: ‘Normal exit.’,1: ‘Maximum

number of iterations taken. All possible eigenvalues

of OP has been found. IPARAM(5) returns the num-

ber of wanted converged Ritz values.’,2: ‘No longer

an informational error. Deprecated starting with re-

lease 2 of ARPACK.’,3: ‘No shifts could be applied

during a cycle of the Implicitly restarted Arnoldi it-

eration. One possibility is to increase the size of

NCV relative to NEV. ‘,-9999: ‘Could not build an

Arnoldi factorization. IPARAM(5) returns the size

of the current Arnoldi factorization. The user is ad-

vised to check that enough workspace and array stor-

age has been allocated.’,-13: “NEV and WHICH =

‘BE’ are incompatable.”,-12: ‘IPARAM(1) must be

equal to 0 or 1.’,-2: ‘NEV must be positive.’,-10:

‘IPARAM(7) must be 1,2,3,4.’,-9: ‘Starting vec-

tor is zero.’,-8: ‘Error return from LAPACK eigen-

value calculation;’,-7: ‘Length of private work ar-

ray WORKL is not sufﬁcient.’,-6: “BMAT must be

one of ‘I’ or ‘G’.”,-5: ” WHICH must be one of

‘LM’,‘SM’,‘LR’,‘SR’,‘LI’,‘SI”’,-4: ‘The max-

imum number of Arnoldi update iterations allowed

must be greater than zero.’,-3: ‘NCV-NEV >= 2

and less than or equal to N.’,-1: ‘N must be posi-

tive.’,-11: “IPARAM(7) = 1 and BMAT = ‘G’ are in-

compatable.”},‘z’: {0: ‘Normal exit.’,1: ‘Maximum

number of iterations taken. All possible eigenvalues

of OP has been found. IPARAM(5) returns the num-

ber of wanted converged Ritz values.’,2: ‘No longer

an informational error. Deprecated starting with re-

lease 2 of ARPACK.’,3: ‘No shifts could be applied

during a cycle of the Implicitly restarted Arnoldi it-

eration. One possibility is to increase the size of

NCV relative to NEV. ‘,-9999: ‘Could not build an

Arnoldi factorization. IPARAM(5) returns the size

of the current Arnoldi factorization. The user is ad-

vised to check that enough workspace and array stor-

age has been allocated.’,-13: “NEV and WHICH

= ‘BE’ are incompatable.”,-12: ‘IPARAM(1) must

be equal to 0 or 1.’,-1: ‘N must be positive.’,-10:

‘IPARAM(7) must be 1,2,3.’,-9: ‘Starting vec-

tor is zero.’,-8: ‘Error return from LAPACK eigen-

value calculation;’,-7: ‘Length of private work ar-

ray WORKL is not sufﬁcient.’,-6: “BMAT must be

one of ‘I’ or ‘G’.”,-5: ” WHICH must be one of

‘LM’,‘SM’,‘LR’,‘SR’,‘LI’,‘SI”’,-4: ‘The max-

imum number of Arnoldi update iterations allowed

must be greater than zero.’,-3: ‘NCV-NEV >= 2 and

less than or equal to N.’,-2: ‘NEV must be posi-

tive.’,-11: “IPARAM(7) = 1 and BMAT = ‘G’ are in-

compatable.”},‘d’: {0: ‘Normal exit.’,1: ‘Maximum

number of iterations taken. All possible eigenvalues

of OP has been found. IPARAM(5) returns the num-

ber of wanted converged Ritz values.’,2: ‘No longer

an informational error. Deprecated starting with re-

lease 2 of ARPACK.’,3: ‘No shifts could be applied

during a cycle of the Implicitly restarted Arnoldi it-

eration. One possibility is to increase the size of

NCV relative to NEV. ‘,-9999: ‘Could not build an

Arnoldi factorization. IPARAM(5) returns the size

of the current Arnoldi factorization. The user is ad-

vised to check that enough workspace and array stor-

age has been allocated.’,-13: “NEV and WHICH =

‘BE’ are incompatable.”,-12: ‘IPARAM(1) must be

equal to 0 or 1.’,-2: ‘NEV must be positive.’,-10:

‘IPARAM(7) must be 1,2,3,4.’,-9: ‘Starting vec-

tor is zero.’,-8: ‘Error return from LAPACK eigen-

value calculation;’,-7: ‘Length of private work ar-

ray WORKL is not sufﬁcient.’,-6: “BMAT must be

one of ‘I’ or ‘G’.”,-5: ” WHICH must be one of

‘LM’,‘SM’,‘LR’,‘SR’,‘LI’,‘SI”’,-4: ‘The max-

imum number of Arnoldi update iterations allowed

must be greater than zero.’,-3: ‘NCV-NEV >= 2 and

less than or equal to N.’,-1: ‘N must be positive.’,

-11: “IPARAM(7) = 1 and BMAT = ‘G’ are incom-

patable.”}})

780 Chapter 5. Reference

SciPy Reference Guide, Release 0.13.0

ARPACK error