Idas Guide

User Manual:

Open the PDF directly: View PDF .
Page Count: 368 [warning: Documents this large are best viewed by clicking the View PDF Link!]

List of Tables
List of Figures
Introduction
Mathematical Considerations
Code Organization
- SUNDIALS organization
- IDAS organization
Using IDAS for IVP Solution
Using IDAS for Forward Sensitivity Analysis
Using IDAS for Adjoint Sensitivity Analysis
Description of the NVECTOR module
Description of the SUNMatrix module
Description of the SUNLinearSolver module
Description of the SUNNonlinearSolver module
SUNDIALS Package Installation Procedure
IDAS Constants
- IDAS input constants
- IDAS output constants
Bibliography
Index

User Documentation for idas v3.0.0

(sundials v4.0.0)

Radu Serban, Cosmin Petra, and Alan C. Hindmarsh

Center for Applied Scientiﬁc Computing

Lawrence Livermore National Laboratory

December 7, 2018

UCRL-SM-208112

DISCLAIMER

This document was prepared as an account of work sponsored by an agency of the United States

government. Neither the United States government nor Lawrence Livermore National Security, LLC,

nor any of their employees makes any warranty, expressed or implied, or assumes any legal liability or

responsibility for the accuracy, completeness, or usefulness of any information, apparatus, product, or

process disclosed, or represents that its use would not infringe privately owned rights. Reference herein

to any speciﬁc commercial product, process, or service by trade name, trademark, manufacturer, or

otherwise does not necessarily constitute or imply its endorsement, recommendation, or favoring by

the United States government or Lawrence Livermore National Security, LLC. The views and opinions

of authors expressed herein do not necessarily state or reﬂect those of the United States government

or Lawrence Livermore National Security, LLC, and shall not be used for advertising or product

endorsement purposes.

This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore

National Laboratory under Contract DE-AC52-07NA27344.

Approved for public release; further dissemination unlimited

Contents

List of Tables ix

List of Figures xi

1 Introduction 1

1.1 Changes from previous versions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2

1.2 ReadingthisUserGuide................................... 8

1.3 SUNDIALSReleaseLicense................................. 9

1.3.1 CopyrightNotices .................................. 9

1.3.1.1 SUNDIALS Copyright . . . . . . . . . . . . . . . . . . . . . . . . . . . 9

1.3.1.2 ARKodeCopyright ............................ 9

1.3.2 BSDLicense ..................................... 10

2 Mathematical Considerations 11

2.1 IVPsolution ......................................... 11

2.2 Preconditioning........................................ 15

2.3 Rootﬁnding .......................................... 16

2.4 Purequadratureintegration................................. 17

2.5 Forward sensitivity analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17

2.5.1 Forward sensitivity methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18

2.5.2 Selection of the absolute tolerances for sensitivity variables . . . . . . . . . . . 19

2.5.3 Evaluation of the sensitivity right-hand side . . . . . . . . . . . . . . . . . . . . 19

2.5.4 Quadratures depending on forward sensitivities . . . . . . . . . . . . . . . . . . 20

2.6 Adjoint sensitivity analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20

2.6.1 Sensitivity of G(p) .................................. 20

2.6.2 Sensitivity of g(T, p) ................................. 21

2.6.3 Checkpointingscheme ................................ 22

2.7 Second-order sensitivity analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23

3 Code Organization 25

3.1 SUNDIALSorganization................................... 25

3.2 IDASorganization ...................................... 25

4 Using IDAS for IVP Solution 31

4.1 Access to library and header ﬁles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31

4.2 Datatypes .......................................... 32

4.2.1 Floatingpointtypes ................................. 32

4.2.2 Integer types used for vector and matrix indices . . . . . . . . . . . . . . . . . 32

4.3 Headerﬁles .......................................... 33

4.4 A skeleton of the user’s main program . . . . . . . . . . . . . . . . . . . . . . . . . . . 34

4.5 User-callablefunctions.................................... 37

4.5.1 IDAS initialization and deallocation functions . . . . . . . . . . . . . . . . . . . 38

4.5.2 IDAS tolerance speciﬁcation functions . . . . . . . . . . . . . . . . . . . . . . . 38

iii

4.5.3 Linear solver interface functions . . . . . . . . . . . . . . . . . . . . . . . . . . 40

4.5.4 Nonlinear solver interface function . . . . . . . . . . . . . . . . . . . . . . . . . 42

4.5.5 Initial condition calculation function . . . . . . . . . . . . . . . . . . . . . . . . 42

4.5.6 Rootﬁnding initialization function . . . . . . . . . . . . . . . . . . . . . . . . . 43

4.5.7 IDASsolverfunction................................. 44

4.5.8 Optional input functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45

4.5.8.1 Main solver optional input functions . . . . . . . . . . . . . . . . . . . 45

4.5.8.2 Linear solver interface optional input functions . . . . . . . . . . . . . 51

4.5.8.3 Initial condition calculation optional input functions . . . . . . . . . . 55

4.5.8.4 Rootﬁnding optional input functions . . . . . . . . . . . . . . . . . . . 57

4.5.9 Interpolated output function . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58

4.5.10 Optional output functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58

4.5.10.1 SUNDIALS version information . . . . . . . . . . . . . . . . . . . . . 58

4.5.10.2 Main solver optional output functions . . . . . . . . . . . . . . . . . . 60

4.5.10.3 Initial condition calculation optional output functions . . . . . . . . . 66

4.5.10.4 Rootﬁnding optional output functions . . . . . . . . . . . . . . . . . . 66

4.5.10.5 idals linear solver interface optional output functions . . . . . . . . . 67

4.5.11 IDAS reinitialization function . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71

4.6 User-suppliedfunctions ................................... 72

4.6.1 Residualfunction................................... 72

4.6.2 Error message handler function . . . . . . . . . . . . . . . . . . . . . . . . . . . 73

4.6.3 Errorweightfunction ................................ 73

4.6.4 Rootﬁndingfunction................................. 74

4.6.5 Jacobian construction (matrix-based linear solvers) . . . . . . . . . . . . . . . . 74

4.6.6 Jacobian-vector product (matrix-free linear solvers) . . . . . . . . . . . . . . . 76

4.6.7 Jacobian-vector product setup (matrix-free linear solvers) . . . . . . . . . . . . 77

4.6.8 Preconditioner solve (iterative linear solvers) . . . . . . . . . . . . . . . . . . . 78

4.6.9 Preconditioner setup (iterative linear solvers) . . . . . . . . . . . . . . . . . . . 78

4.7 Integration of pure quadrature equations . . . . . . . . . . . . . . . . . . . . . . . . . . 79

4.7.1 Quadrature initialization and deallocation functions . . . . . . . . . . . . . . . 81

4.7.2 IDASsolverfunction................................. 82

4.7.3 Quadrature extraction functions . . . . . . . . . . . . . . . . . . . . . . . . . . 82

4.7.4 Optional inputs for quadrature integration . . . . . . . . . . . . . . . . . . . . . 83

4.7.5 Optional outputs for quadrature integration . . . . . . . . . . . . . . . . . . . . 84

4.7.6 User-supplied function for quadrature integration . . . . . . . . . . . . . . . . . 85

4.8 A parallel band-block-diagonal preconditioner module . . . . . . . . . . . . . . . . . . 86

5 Using IDAS for Forward Sensitivity Analysis 93

5.1 A skeleton of the user’s main program . . . . . . . . . . . . . . . . . . . . . . . . . . . 93

5.2 User-callable routines for forward sensitivity analysis . . . . . . . . . . . . . . . . . . . 96

5.2.1 Forward sensitivity initialization and deallocation functions . . . . . . . . . . . 96

5.2.2 Forward sensitivity tolerance speciﬁcation functions . . . . . . . . . . . . . . . 98

5.2.3 Forward sensitivity nonlinear solver interface functions . . . . . . . . . . . . . . 100

5.2.4 Forward sensitivity initial condition calculation function . . . . . . . . . . . . . 101

5.2.5 IDASsolverfunction................................. 101

5.2.6 Forward sensitivity extraction functions . . . . . . . . . . . . . . . . . . . . . . 101

5.2.7 Optional inputs for forward sensitivity analysis . . . . . . . . . . . . . . . . . . 103

5.2.8 Optional outputs for forward sensitivity analysis . . . . . . . . . . . . . . . . . 105

5.2.8.1 Main solver optional output functions . . . . . . . . . . . . . . . . . . 105

5.2.8.2 Initial condition calculation optional output functions . . . . . . . . . 108

5.3 User-supplied routines for forward sensitivity analysis . . . . . . . . . . . . . . . . . . 108

5.4 Integration of quadrature equations depending on forward sensitivities . . . . . . . . . 109

5.4.1 Sensitivity-dependent quadrature initialization and deallocation . . . . . . . . . 111

5.4.2 IDASsolverfunction................................. 112

5.4.3 Sensitivity-dependent quadrature extraction functions . . . . . . . . . . . . . . 112

5.4.4 Optional inputs for sensitivity-dependent quadrature integration . . . . . . . . 114

5.4.5 Optional outputs for sensitivity-dependent quadrature integration . . . . . . . 116

5.4.6 User-supplied function for sensitivity-dependent quadrature integration . . . . 117

5.5 Note on using partial error control . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 118

6 Using IDAS for Adjoint Sensitivity Analysis 121

6.1 A skeleton of the user’s main program . . . . . . . . . . . . . . . . . . . . . . . . . . . 121

6.2 User-callable functions for adjoint sensitivity analysis . . . . . . . . . . . . . . . . . . . 124

6.2.1 Adjoint sensitivity allocation and deallocation functions . . . . . . . . . . . . . 124

6.2.2 Adjoint sensitivity optional input . . . . . . . . . . . . . . . . . . . . . . . . . . 126

6.2.3 Forward integration function . . . . . . . . . . . . . . . . . . . . . . . . . . . . 126

6.2.4 Backward problem initialization functions . . . . . . . . . . . . . . . . . . . . . 127

6.2.5 Tolerance speciﬁcation functions for backward problem . . . . . . . . . . . . . . 129

6.2.6 Linear solver initialization functions for backward problem . . . . . . . . . . . 130

6.2.7 Initial condition calculation functions for backward problem . . . . . . . . . . . 131

6.2.8 Backward integration function . . . . . . . . . . . . . . . . . . . . . . . . . . . 132

6.2.9 Optional input functions for the backward problem . . . . . . . . . . . . . . . . 134

6.2.9.1 Main solver optional input functions . . . . . . . . . . . . . . . . . . . 134

6.2.9.2 Linear solver interface optional input functions . . . . . . . . . . . . . 134

6.2.10 Optional output functions for the backward problem . . . . . . . . . . . . . . . 138

6.2.10.1 Main solver optional output functions . . . . . . . . . . . . . . . . . . 138

6.2.10.2 Initial condition calculation optional output function . . . . . . . . . 139

6.2.11 Backward integration of quadrature equations . . . . . . . . . . . . . . . . . . . 140

6.2.11.1 Backward quadrature initialization functions . . . . . . . . . . . . . . 140

6.2.11.2 Backward quadrature extraction function . . . . . . . . . . . . . . . . 141

6.2.11.3 Optional input/output functions for backward quadrature integration 142

6.3 User-supplied functions for adjoint sensitivity analysis . . . . . . . . . . . . . . . . . . 142

6.3.1 DAE residual for the backward problem . . . . . . . . . . . . . . . . . . . . . . 142

6.3.2 DAE residual for the backward problem depending on the forward sensitivities 143

6.3.3 Quadrature right-hand side for the backward problem . . . . . . . . . . . . . . 144

6.3.4 Sensitivity-dependent quadrature right-hand side for the backward problem . . 145

6.3.5 Jacobian construction for the backward problem (matrix-based linear solvers) . 146

6.3.6 Jacobian-vector product for the backward problem (matrix-free linear solvers) . 148

6.3.7 Jacobian-vector product setup for the backward problem (matrix-free linear

solvers) ........................................ 150

6.3.8 Preconditioner solve for the backward problem (iterative linear solvers) . . . . 151

6.3.9 Preconditioner setup for the backward problem (iterative linear solvers) . . . . 153

6.4 Using the band-block-diagonal preconditioner for backward problems . . . . . . . . . . 154

6.4.1 Usage of IDABBDPRE for the backward problem . . . . . . . . . . . . . . . . 154

6.4.2 User-supplied functions for IDABBDPRE . . . . . . . . . . . . . . . . . . . . . 156

7 Description of the NVECTOR module 159

7.1 NVECTOR functions used by IDAS . . . . . . . . . . . . . . . . . . . . . . . . . . . . 168

7.2 The NVECTOR SERIAL implementation . . . . . . . . . . . . . . . . . . . . . . . . . 169

7.2.1 NVECTOR SERIAL accessor macros . . . . . . . . . . . . . . . . . . . . . . . 169

7.2.2 NVECTOR SERIAL functions . . . . . . . . . . . . . . . . . . . . . . . . . . . 170

7.2.3 NVECTOR SERIAL Fortran interfaces . . . . . . . . . . . . . . . . . . . . . . 173

7.3 The NVECTOR PARALLEL implementation . . . . . . . . . . . . . . . . . . . . . . . 173

7.3.1 NVECTOR PARALLEL accessor macros . . . . . . . . . . . . . . . . . . . . . 174

7.3.2 NVECTOR PARALLEL functions . . . . . . . . . . . . . . . . . . . . . . . . . 175

7.3.3 NVECTOR PARALLEL Fortran interfaces . . . . . . . . . . . . . . . . . . . . 178

7.4 The NVECTOR OPENMP implementation . . . . . . . . . . . . . . . . . . . . . . . . 178

7.4.1 NVECTOR OPENMP accessor macros . . . . . . . . . . . . . . . . . . . . . . 178

7.4.2 NVECTOR OPENMP functions . . . . . . . . . . . . . . . . . . . . . . . . . . 179

7.4.3 NVECTOR OPENMP Fortran interfaces . . . . . . . . . . . . . . . . . . . . . 182

7.5 The NVECTOR PTHREADS implementation . . . . . . . . . . . . . . . . . . . . . . 183

7.5.1 NVECTOR PTHREADS accessor macros . . . . . . . . . . . . . . . . . . . . . 183

7.5.2 NVECTOR PTHREADS functions . . . . . . . . . . . . . . . . . . . . . . . . . 184

7.5.3 NVECTOR PTHREADS Fortran interfaces . . . . . . . . . . . . . . . . . . . . 187

7.6 The NVECTOR PARHYP implementation . . . . . . . . . . . . . . . . . . . . . . . . 188

7.6.1 NVECTOR PARHYP functions . . . . . . . . . . . . . . . . . . . . . . . . . . 188

7.7 The NVECTOR PETSC implementation . . . . . . . . . . . . . . . . . . . . . . . . . 191

7.7.1 NVECTOR PETSC functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 192

7.8 The NVECTOR CUDA implementation . . . . . . . . . . . . . . . . . . . . . . . . . . 195

7.8.1 NVECTOR CUDA functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 196

7.9 The NVECTOR RAJA implementation . . . . . . . . . . . . . . . . . . . . . . . . . . 201

7.9.1 NVECTOR RAJA functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 202

7.10 The NVECTOR OPENMPDEV implementation . . . . . . . . . . . . . . . . . . . . . 205

7.10.1 NVECTOR OPENMPDEV accessor macros . . . . . . . . . . . . . . . . . . . . 205

7.10.2 NVECTOR OPENMPDEV functions . . . . . . . . . . . . . . . . . . . . . . . 206

7.11NVECTORExamples .................................... 210

8 Description of the SUNMatrix module 215

8.1 SUNMatrix functions used by IDAS . . . . . . . . . . . . . . . . . . . . . . . . . . . . 218

8.2 The SUNMatrix Dense implementation . . . . . . . . . . . . . . . . . . . . . . . . . . 218

8.2.1 SUNMatrix Dense accessor macros . . . . . . . . . . . . . . . . . . . . . . . . . 219

8.2.2 SUNMatrix Dense functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 220

8.2.3 SUNMatrix Dense Fortran interfaces . . . . . . . . . . . . . . . . . . . . . . . . 221

8.3 The SUNMatrix Band implementation . . . . . . . . . . . . . . . . . . . . . . . . . . . 222

8.3.1 SUNMatrix Band accessor macros . . . . . . . . . . . . . . . . . . . . . . . . . 224

8.3.2 SUNMatrix Band functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 225

8.3.3 SUNMatrix Band Fortran interfaces . . . . . . . . . . . . . . . . . . . . . . . . 227

8.4 The SUNMatrix Sparse implementation . . . . . . . . . . . . . . . . . . . . . . . . . . 228

8.4.1 SUNMatrix Sparse accessor macros . . . . . . . . . . . . . . . . . . . . . . . . . 231

8.4.2 SUNMatrix Sparse functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 232

8.4.3 SUNMatrix Sparse Fortran interfaces . . . . . . . . . . . . . . . . . . . . . . . 234

9 Description of the SUNLinearSolver module 237

9.1 TheSUNLinearSolverAPI.................................. 238

9.1.1 SUNLinearSolver corefunctions .......................... 238

9.1.2 SUNLinearSolver setfunctions........................... 240

9.1.3 SUNLinearSolver getfunctions........................... 241

9.1.4 Functions provided by sundials packages..................... 242

9.1.5 SUNLinearSolver returncodes........................... 243

9.1.6 The generic SUNLinearSolver module....................... 244

9.2 Compatibility of SUNLinearSolver modules........................ 245

9.3 Implementing a custom SUNLinearSolver module .................... 245

9.3.1 Intendedusecases .................................. 246

9.4 IDAS SUNLinearSolver interface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 247

9.4.1 Lagged matrix information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 248

9.4.2 Iterative linear solver tolerance . . . . . . . . . . . . . . . . . . . . . . . . . . . 248

9.5 The SUNLinearSolver Dense implementation . . . . . . . . . . . . . . . . . . . . . . . 249

9.5.1 SUNLinearSolver Dense description . . . . . . . . . . . . . . . . . . . . . . . . 249

9.5.2 SUNLinearSolver Dense functions . . . . . . . . . . . . . . . . . . . . . . . . . 249

9.5.3 SUNLinearSolver Dense Fortran interfaces . . . . . . . . . . . . . . . . . . . . . 250

9.5.4 SUNLinearSolver Dense content . . . . . . . . . . . . . . . . . . . . . . . . . . 251

9.6 The SUNLinearSolver Band implementation . . . . . . . . . . . . . . . . . . . . . . . . 252

9.6.1 SUNLinearSolver Band description . . . . . . . . . . . . . . . . . . . . . . . . . 252

9.6.2 SUNLinearSolver Band functions . . . . . . . . . . . . . . . . . . . . . . . . . . 252

9.6.3 SUNLinearSolver Band Fortran interfaces . . . . . . . . . . . . . . . . . . . . . 253

9.6.4 SUNLinearSolver Band content . . . . . . . . . . . . . . . . . . . . . . . . . . . 254

9.7 The SUNLinearSolver LapackDense implementation . . . . . . . . . . . . . . . . . . . 254

9.7.1 SUNLinearSolver LapackDense description . . . . . . . . . . . . . . . . . . . . 254

9.7.2 SUNLinearSolver LapackDense functions . . . . . . . . . . . . . . . . . . . . . 255

9.7.3 SUNLinearSolver LapackDense Fortran interfaces . . . . . . . . . . . . . . . . . 255

9.7.4 SUNLinearSolver LapackDense content . . . . . . . . . . . . . . . . . . . . . . 256

9.8 The SUNLinearSolver LapackBand implementation . . . . . . . . . . . . . . . . . . . . 256

9.8.1 SUNLinearSolver LapackBand description . . . . . . . . . . . . . . . . . . . . . 257

9.8.2 SUNLinearSolver LapackBand functions . . . . . . . . . . . . . . . . . . . . . . 257

9.8.3 SUNLinearSolver LapackBand Fortran interfaces . . . . . . . . . . . . . . . . . 258

9.8.4 SUNLinearSolver LapackBand content . . . . . . . . . . . . . . . . . . . . . . . 259

9.9 The SUNLinearSolver KLU implementation . . . . . . . . . . . . . . . . . . . . . . . . 259

9.9.1 SUNLinearSolver KLU description . . . . . . . . . . . . . . . . . . . . . . . . . 259

9.9.2 SUNLinearSolver KLU functions . . . . . . . . . . . . . . . . . . . . . . . . . . 260

9.9.3 SUNLinearSolver KLU Fortran interfaces . . . . . . . . . . . . . . . . . . . . . 262

9.9.4 SUNLinearSolver KLU content . . . . . . . . . . . . . . . . . . . . . . . . . . . 264

9.10 The SUNLinearSolver SuperLUMT implementation . . . . . . . . . . . . . . . . . . . . 264

9.10.1 SUNLinearSolver SuperLUMT description . . . . . . . . . . . . . . . . . . . . . 265

9.10.2 SUNLinearSolver SuperLUMT functions . . . . . . . . . . . . . . . . . . . . . . 265

9.10.3 SUNLinearSolver SuperLUMT Fortran interfaces . . . . . . . . . . . . . . . . . 267

9.10.4 SUNLinearSolver SuperLUMT content . . . . . . . . . . . . . . . . . . . . . . . 268

9.11 The SUNLinearSolver SPGMRimplementation...................... 269

9.11.1 SUNLinearSolver SPGMR description . . . . . . . . . . . . . . . . . . . . . . . 269

9.11.2 SUNLinearSolver SPGMR functions . . . . . . . . . . . . . . . . . . . . . . . . 269

9.11.3 SUNLinearSolver SPGMR Fortran interfaces . . . . . . . . . . . . . . . . . . . 271

9.11.4 SUNLinearSolver SPGMR content . . . . . . . . . . . . . . . . . . . . . . . . . 274

9.12 The SUNLinearSolver SPFGMR implementation . . . . . . . . . . . . . . . . . . . . . 275

9.12.1 SUNLinearSolver SPFGMR description . . . . . . . . . . . . . . . . . . . . . . 275

9.12.2 SUNLinearSolver SPFGMR functions . . . . . . . . . . . . . . . . . . . . . . . 276

9.12.3 SUNLinearSolver SPFGMR Fortran interfaces . . . . . . . . . . . . . . . . . . 278

9.12.4 SUNLinearSolver SPFGMR content . . . . . . . . . . . . . . . . . . . . . . . . 280

9.13 The SUNLinearSolver SPBCGS implementation . . . . . . . . . . . . . . . . . . . . . . 282

9.13.1 SUNLinearSolver SPBCGS description . . . . . . . . . . . . . . . . . . . . . . . 282

9.13.2 SUNLinearSolver SPBCGS functions . . . . . . . . . . . . . . . . . . . . . . . . 282

9.13.3 SUNLinearSolver SPBCGS Fortran interfaces . . . . . . . . . . . . . . . . . . . 284

9.13.4 SUNLinearSolver SPBCGS content . . . . . . . . . . . . . . . . . . . . . . . . . 286

9.14 The SUNLinearSolver SPTFQMR implementation . . . . . . . . . . . . . . . . . . . . 287

9.14.1 SUNLinearSolver SPTFQMR description . . . . . . . . . . . . . . . . . . . . . 287

9.14.2 SUNLinearSolver SPTFQMR functions . . . . . . . . . . . . . . . . . . . . . . 288

9.14.3 SUNLinearSolver SPTFQMR Fortran interfaces . . . . . . . . . . . . . . . . . . 289

9.14.4 SUNLinearSolver SPTFQMR content . . . . . . . . . . . . . . . . . . . . . . . 291

9.15 The SUNLinearSolver PCG implementation . . . . . . . . . . . . . . . . . . . . . . . . 292

9.15.1 SUNLinearSolver PCG description . . . . . . . . . . . . . . . . . . . . . . . . . 293

9.15.2 SUNLinearSolver PCG functions . . . . . . . . . . . . . . . . . . . . . . . . . . 294

9.15.3 SUNLinearSolver PCG Fortran interfaces . . . . . . . . . . . . . . . . . . . . . 295

9.15.4 SUNLinearSolver PCG content . . . . . . . . . . . . . . . . . . . . . . . . . . . 297

9.16SUNLinearSolverExamples ................................. 298

vii

10 Description of the SUNNonlinearSolver module 301

10.1 The SUNNonlinearSolver API . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 301

10.1.1 SUNNonlinearSolver core functions . . . . . . . . . . . . . . . . . . . . . . . . . 301

10.1.2 SUNNonlinearSolver set functions . . . . . . . . . . . . . . . . . . . . . . . . . 303

10.1.3 SUNNonlinearSolver get functions . . . . . . . . . . . . . . . . . . . . . . . . . 304

10.1.4 Functions provided by SUNDIALS integrators . . . . . . . . . . . . . . . . . . 305

10.1.5 SUNNonlinearSolver return codes . . . . . . . . . . . . . . . . . . . . . . . . . . 307

10.1.6 The generic SUNNonlinearSolver module . . . . . . . . . . . . . . . . . . . . . 307

10.1.7 Usage with sensitivity enabled integrators . . . . . . . . . . . . . . . . . . . . . 308

10.1.8 Implementing a Custom SUNNonlinearSolver Module . . . . . . . . . . . . . . 310

10.2 The SUNNonlinearSolver Newton implementation . . . . . . . . . . . . . . . . . . . . 310

10.2.1 SUNNonlinearSolver Newton description . . . . . . . . . . . . . . . . . . . . . . 310

10.2.2 SUNNonlinearSolver Newton functions . . . . . . . . . . . . . . . . . . . . . . . 311

10.2.3 SUNNonlinearSolver Newton Fortran interfaces . . . . . . . . . . . . . . . . . . 312

10.2.4 SUNNonlinearSolver Newton content . . . . . . . . . . . . . . . . . . . . . . . . 313

10.3 The SUNNonlinearSolver FixedPoint implementation . . . . . . . . . . . . . . . . . . . 313

10.3.1 SUNNonlinearSolver FixedPoint description . . . . . . . . . . . . . . . . . . . . 313

10.3.2 SUNNonlinearSolver FixedPoint functions . . . . . . . . . . . . . . . . . . . . . 314

10.3.3 SUNNonlinearSolver FixedPoint Fortran interfaces . . . . . . . . . . . . . . . . 315

10.3.4 SUNNonlinearSolver FixedPoint content . . . . . . . . . . . . . . . . . . . . . . 316

A SUNDIALS Package Installation Procedure 319

A.1 CMake-basedinstallation .................................. 320

A.1.1 Conﬁguring, building, and installing on Unix-like systems . . . . . . . . . . . . 320

A.1.2 Conﬁguration options (Unix/Linux) . . . . . . . . . . . . . . . . . . . . . . . . 322

A.1.3 Conﬁgurationexamples ............................... 329

A.1.4 Working with external Libraries . . . . . . . . . . . . . . . . . . . . . . . . . . 329

A.1.5 Testing the build and installation . . . . . . . . . . . . . . . . . . . . . . . . . . 331

A.2 Building and Running Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 332

A.3 Conﬁguring, building, and installing on Windows . . . . . . . . . . . . . . . . . . . . . 332

A.4 Installed libraries and exported header ﬁles . . . . . . . . . . . . . . . . . . . . . . . . 332

B IDAS Constants 339

B.1 IDASinputconstants .................................... 339

B.2 IDASoutputconstants.................................... 339

Bibliography 343

Index 347

viii

List of Tables

4.1 sundials linear solver interfaces and vector implementations that can be used for each. 37

4.2 Optional inputs for idas and idals ............................. 46

4.3 Optional outputs from idas and idals ........................... 59

5.1 Forward sensitivity optional inputs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103

5.2 Forward sensitivity optional outputs . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105

7.1 Vector Identiﬁcations associated with vector kernels supplied with sundials. ..... 161

7.2 Description of the NVECTOR operations . . . . . . . . . . . . . . . . . . . . . . . . . 162

7.3 Description of the NVECTOR fused operations . . . . . . . . . . . . . . . . . . . . . . 165

7.4 Description of the NVECTOR vector array operations . . . . . . . . . . . . . . . . . . 166

7.5 List of vector functions usage by idas codemodules ................... 214

8.1 Identiﬁers associated with matrix kernels supplied with sundials. ........... 216

8.2 Description of the SUNMatrix operations.......................... 216

8.3 sundials matrix interfaces and vector implementations that can be used for each. . . 217

8.4 List of matrix functions usage by idas codemodules ................... 218

9.1 Description of the SUNLinearSolver errorcodes ..................... 244

9.2 sundials matrix-based linear solvers and matrix implementations that can be used for

each............................................... 245

9.3 List of linear solver function usage in the idals interface................. 248

10.1 Description of the SUNNonlinearSolver returncodes................... 307

A.1 sundials librariesandheaderﬁles ............................. 334

List of Figures

2.1 Illustration of the checkpointing algorithm for generation of the forward solution during

the integration of the adjoint system. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23

3.1 High-level diagram of the sundials suite ......................... 26

3.2 Organization of the sundials suite............................. 27

3.3 Overall structure diagram of the ida package ....................... 28

8.1 Diagram of the storage for a sunmatrix band object .................. 223

8.2 Diagram of the storage for a compressed-sparse-column matrix . . . . . . . . . . . . . 230

A.1 Initial ccmake conﬁgurationscreen ............................. 321

A.2 Changing the instdir ..................................... 322

Chapter 1

Introduction

idas is part of a software family called sundials: SUite of Nonlinear and DIﬀerential/ALgebraic

equation Solvers [26]. This suite consists of cvode,arkode,kinsol, and ida, and variants of these

with sensitivity analysis capabilities, cvodes and idas.

idas is a general purpose solver for the initial value problem (IVP) for systems of diﬀerential-

algebraic equations (DAEs). The name IDAS stands for Implicit Diﬀerential-Algebraic solver with

Sensitivity capabilities. idas is an extension of the ida solver within sundials, itself based on

daspk [7,8]; however, like all sundials solvers, idas is written in ANSI-standard C rather than

Fortran77. Its most notable features are that, (1) in the solution of the underlying nonlinear system

at each time step, it oﬀers a choice of Newton/direct methods and a choice of Inexact Newton/Krylov

(iterative) methods; (2) it is written in a data-independent manner in that it acts on generic vectors

and matrices without any assumptions on the underlying organization of the data; and (3) it provides

a ﬂexible, extensible framework for sensitivity analysis, using either forward or adjoint methods. Thus

idas shares signiﬁcant modules previously written within CASC at LLNL to support the ordinary

diﬀerential equation (ODE) solvers cvode [27,15] and pvode [11,12], the DAE solver ida [30] on

which idas is based, the sensitivity-enabled ODE solver cvodes [28,42], and also the nonlinear system

solver kinsol [16].

At present, idas may utilize a variety of Krylov methods provided in sundials that can be used

in conjuction with Newton iteration: these include the GMRES (Generalized Minimal RESidual) [41],

FGMRES (Flexible Generalized Minimum RESidual) [40], Bi-CGStab (Bi-Conjugate Gradient Stabi-

lized) [44], TFQMR (Transpose-Free Quasi-Minimal Residual) [23], and PCG (Preconditioned Con-

jugate Gradient) [24] linear iterative methods. As Krylov methods, these require little matrix storage

for solving the Newton equations as compared to direct methods. However, the algorithms allow

for a user-supplied preconditioner matrix, and, for most problems, preconditioning is essential for an

eﬃcient solution.

For very large DAE systems, the Krylov methods are preferable over direct linear solver methods,

and are often the only feasible choice. Among the Krylov methods in sundials, we recommend

GMRES as the best overall choice. However, users are encouraged to compare all options, especially

if encountering convergence failures with GMRES. Bi-CGFStab and TFQMR have an advantage

in storage requirements, in that the number of workspace vectors they require is ﬁxed, while that

number for GMRES depends on the desired Krylov subspace size. FGMRES has an advantage in

that it is designed to support preconditioners that vary between iterations (e.g. iterative methods).

PCG exhibits rapid convergence and minimal workspace vectors, but only works for symmetric linear

systems.

idas is written with a functionality that is a superset of that of ida. Sensitivity analysis capabili-

ties, both forward and adjoint, have been added to the main integrator. Enabling forward sensitivity

computations in idas will result in the code integrating the so-called sensitivity equations simultane-

ously with the original IVP, yielding both the solution and its sensitivity with respect to parameters

in the model. Adjoint sensitivity analysis, most useful when the gradients of relatively few functionals

of the solution with respect to many parameters are sought, involves integration of the original IVP

2 Introduction

forward in time followed by the integration of the so-called adjoint equations backward in time. idas

provides the infrastructure needed to integrate any ﬁnal-condition ODE dependent on the solution of

the original IVP (in particular the adjoint system).

There are several motivations for choosing the Clanguage for idas. First, a general movement away

from Fortran and toward Cin scientiﬁc computing was apparent. Second, the pointer, structure,

and dynamic memory allocation features in Care extremely useful in software of this complexity,

with the great variety of method options oﬀered. Finally, we prefer Cover C++ for idas because of

the wider availability of Ccompilers, the potentially greater eﬃciency of C, and the greater ease of

interfacing the solver to applications written in extended Fortran.

1.1 Changes from previous versions

Changes in v3.0.0

idas’ previous direct and iterative linear solver interfaces, idadls and idaspils, have been merged

into a single uniﬁed linear solver interface, idals, to support any valid sunlinsol module. This

includes the “DIRECT” and “ITERATIVE” types as well as the new “MATRIX ITERATIVE” type.

Details regarding how idals utilizes linear solvers of each type as well as discussion regarding intended

use cases for user-supplied sunlinsol implementations are included in Chapter 9. All idas example

programs and the standalone linear solver examples have been updated to use the uniﬁed linear solver

interface.

The uniﬁed interface for the new idals module is very similar to the previous idadls and idaspils

interfaces. To minimize challenges in user migration to the new names, the previous Croutine names

may still be used; these will be deprecated in future releases, so we recommend that users migrate to

the new names soon.

The names of all constructor routines for sundials-provided sunlinsol implementations have

been updated to follow the naming convention SUNLinSol *where *is the name of the linear solver.

The new names are SUNLinSol Band,SUNLinSol Dense,SUNLinSol KLU,SUNLinSol LapackBand,

SUNLinSol LapackDense,SUNLinSol PCG,SUNLinSol SPBCGS,SUNLinSol SPFGMR,SUNLinSol SPGMR,

SUNLinSol SPTFQMR, and SUNLinSol SuperLUMT. Solver-speciﬁc “set” routine names have been simi-

larly standardized. To minimize challenges in user migration to the new names, the previous routine

names may still be used; these will be deprecated in future releases, so we recommend that users mi-

grate to the new names soon. All idas example programs and the standalone linear solver examples

have been updated to use the new naming convention.

The SUNBandMatrix constructor has been simpliﬁed to remove the storage upper bandwidth ar-

gument.

sundials integrators have been updated to utilize generic nonlinear solver modules deﬁned through

the sunnonlinsol API. This API will ease the addition of new nonlinear solver options and allow for

external or user-supplied nonlinear solvers. The sunnonlinsol API and sundials provided modules

are described in Chapter 10 and follow the same object oriented design and implementation used by

the nvector,sunmatrix, and sunlinsol modules. Currently two sunnonlinsol implementations

are provided, sunnonlinsol newton and sunnonlinsol fixedpoint. These replicate the previ-

ous integrator speciﬁc implementations of a Newton iteration and a ﬁxed-point iteration (previously

referred to as a functional iteration), respectively. Note the sunnonlinsol fixedpoint module can

optionally utilize Anderson’s method to accelerate convergence. Example programs using each of these

nonlinear solver modules in a standalone manner have been added and all idas example programs

have been updated to use generic sunnonlinsol modules.

By default idas uses the sunnonlinsol newton module. Since idas previously only used an

internal implementation of a Newton iteration no changes are required to user programs and func-

tions for setting the nonlinear solver options (e.g., IDASetMaxNonlinIters) or getting nonlinear solver

statistics (e.g., IDAGetNumNonlinSolvIters) remain unchanged and internally call generic sunnon-

linsol functions as needed. While sundials includes a ﬁxed-point nonlinear solver module, it is not

currently supported in idas. For details on attaching a user-supplied nonlinear solver to idas see

Chapter 4,5, and 6.

1.1 Changes from previous versions 3

Three fused vector operations and seven vector array operations have been added to the nvec-

tor API. These optional operations are disabled by default and may be activated by calling vector

speciﬁc routines after creating an nvector (see Chapter 7for more details). The new operations are

intended to increase data reuse in vector operations, reduce parallel communication on distributed

memory systems, and lower the number of kernel launches on systems with accelerators. The fused op-

erations are N VLinearCombination,N VScaleAddMulti, and N VDotProdMulti and the vector array

operations are N VLinearCombinationVectorArray,N VScaleVectorArray,N VConstVectorArray,

N VWrmsNormVectorArray,N VWrmsNormMaskVectorArray,N VScaleAddMultiVectorArray, and

N VLinearCombinationVectorArray. If an nvector implementation deﬁnes any of these operations

as NULL, then standard nvector operations will automatically be called as necessary to complete the

computation.

Multiple updates to nvector cuda were made:

•Changed N VGetLength Cuda to return the global vector length instead of the local vector length.

•Added N VGetLocalLength Cuda to return the local vector length.

•Added N VGetMPIComm Cuda to return the MPI communicator used.

•Removed the accessor functions in the namespace suncudavec.

•Changed the N VMake Cuda function to take a host data pointer and a device data pointer instead

of an N VectorContent Cuda object.

•Added the ability to set the cudaStream t used for execution of the nvector cuda kernels.

See the function N VSetCudaStreams Cuda.

•Added N VNewManaged Cuda,N VMakeManaged Cuda, and N VIsManagedMemory Cuda functions

to accommodate using managed memory with the nvector cuda.

Multiple changes to nvector raja were made:

•Changed N VGetLength Raja to return the global vector length instead of the local vector length.

•Added N VGetLocalLength Raja to return the local vector length.

•Added N VGetMPIComm Raja to return the MPI communicator used.

•Removed the accessor functions in the namespace suncudavec.

A new nvector implementation for leveraging OpenMP 4.5+ device oﬄoading has been added,

nvector openmpdev. See §7.10 for more details.

Changes in v2.2.1

The changes in this minor release include the following:

•Fixed a bug in the cuda nvector where the N VInvTest operation could write beyond the

allocated vector data.

•Fixed library installation path for multiarch systems. This ﬁx changes the default library instal-

lation path to CMAKE INSTALL PREFIX/CMAKE INSTALL LIBDIR from CMAKE INSTALL PREFIX/lib.

CMAKE INSTALL LIBDIR is automatically set, but is available as a CMake option that can modi-

ﬁed.

4 Introduction

Changes in v2.2.0

Fixed a bug in idas where the saved residual value used in the nonlinear solve for consistent initial

conditions was passed as temporary workspace and could be overwritten.

Fixed a thread-safety issue when using ajdoint sensitivity analysis.

Fixed a problem with setting sunindextype which would occur with some compilers (e.g. arm-

clang) that did not deﬁne STDC VERSION .

Added hybrid MPI/CUDA and MPI/RAJA vectors to allow use of more than one MPI rank when

using a GPU system. The vectors assume one GPU device per MPI rank.

Changed the name of the raja nvector library to libsundials nveccudaraja.lib from

libsundials nvecraja.lib to better reﬂect that we only support cuda as a backend for raja cur-

rently.

Several changes were made to the build system:

•CMake 3.1.3 is now the minimum required CMake version.

•Deprecate the behavior of the SUNDIALS INDEX TYPE CMake option and added the

SUNDIALS INDEX SIZE CMake option to select the sunindextype integer size.

•The native CMake FindMPI module is now used to locate an MPI installation.

•If MPI is enabled and MPI compiler wrappers are not set, the build system will check if

CMAKE <language> COMPILER can compile MPI programs before trying to locate and use an

MPI installation.

•The previous options for setting MPI compiler wrappers and the executable for running MPI

programs have been have been depreated. The new options that align with those used in native

CMake FindMPI module are MPI C COMPILER,MPI CXX COMPILER,MPI Fortran COMPILER, and

MPIEXEC EXECUTABLE.

•When a Fortran name-mangling scheme is needed (e.g., LAPACK ENABLE is ON) the build system

will infer the scheme from the Fortran compiler. If a Fortran compiler is not available or the in-

ferred or default scheme needs to be overridden, the advanced options SUNDIALS F77 FUNC CASE

and SUNDIALS F77 FUNC UNDERSCORES can be used to manually set the name-mangling scheme

and bypass trying to infer the scheme.

•Parts of the main CMakeLists.txt ﬁle were moved to new ﬁles in the src and example directories

to make the CMake conﬁguration ﬁle structure more modular.

Changes in v2.1.2

The changes in this minor release include the following:

•Updated the minimum required version of CMake to 2.8.12 and enabled using rpath by default

to locate shared libraries on OSX.

•Fixed Windows speciﬁc problem where sunindextype was not correctly deﬁned when using

64-bit integers for the sundials index type. On Windows sunindextype is now deﬁned as the

MSVC basic type int64.

•Added sparse SUNMatrix “Reallocate” routine to allow speciﬁcation of the nonzero storage.

1.1 Changes from previous versions 5

•Updated the KLU sunlinsol module to set constants for the two reinitialization types, and

ﬁxed a bug in the full reinitialization approach where the sparse SUNMatrix pointer would go

out of scope on some architectures.

•Updated the “ScaleAdd” and “ScaleAddI” implementations in the sparse SUNMatrix module

to more optimally handle the case where the target matrix contained suﬃcient storage for the

sum, but had the wrong sparsity pattern. The sum now occurs in-place, by performing the sum

backwards in the existing storage. However, it is still more eﬃcient if the user-supplied Jacobian

routine allocates storage for the sum I+γJ manually (with zero entries if needed).

•Changed the LICENSE install path to instdir/include/sundials.

Changes in v2.1.1

The changes in this minor release include the following:

•Fixed a potential memory leak in the spgmr and spfgmr linear solvers: if “Initialize” was

called multiple times then the solver memory was reallocated (without being freed).

•Updated KLU SUNLinearSolver module to use a typedef for the precision-speciﬁc solve function

to be used (to avoid compiler warnings).

•Added missing typecasts for some (void*) pointers (again, to avoid compiler warnings).

•Bugﬁx in sunmatrix sparse.c where we had used int instead of sunindextype in one location.

•Added missing #include <stdio.h> in nvector and sunmatrix header ﬁles.

•Added missing prototype for IDASpilsGetNumJTSetupEvals.

•Fixed an indexing bug in the cuda nvector implementation of N VWrmsNormMask and revised

the raja nvector implementation of N VWrmsNormMask to work with mask arrays using values

other than zero or one. Replaced double with realtype in the raja vector test functions.

In addition to the changes above, minor corrections were also made to the example programs, build

system, and user documentation.

Changes in v2.1.0

Added nvector print functions that write vector data to a speciﬁed ﬁle (e.g., N VPrintFile Serial).

Added make test and make test install options to the build system for testing sundials after

building with make and installing with make install respectively.

Changes in v2.0.0

All interfaces to matrix structures and linear solvers have been reworked, and all example programs

have been updated. The goal of the redesign of these interfaces was to provide more encapsulation and

to ease interfacing of custom linear solvers and interoperability with linear solver libraries. Speciﬁc

changes include:

•Added generic sunmatrix module with three provided implementations: dense, banded and

sparse. These replicate previous SUNDIALS Dls and Sls matrix structures in a single object-

oriented API.

•Added example problems demonstrating use of generic sunmatrix modules.

•Added generic SUNLinearSolver module with eleven provided implementations: sundials na-

tive dense, sundials native banded, LAPACK dense, LAPACK band, KLU, SuperLU MT,

SPGMR, SPBCGS, SPTFQMR, SPFGMR, and PCG. These replicate previous SUNDIALS

generic linear solvers in a single object-oriented API.

6 Introduction

•Added example problems demonstrating use of generic SUNLinearSolver modules.

•Expanded package-provided direct linear solver (Dls) interfaces and scaled, preconditioned, iter-

ative linear solver (Spils) interfaces to utilize generic sunmatrix and SUNLinearSolver objects.

•Removed package-speciﬁc, linear solver-speciﬁc, solver modules (e.g. CVDENSE,KINBAND,IDAKLU,

ARKSPGMR) since their functionality is entirely replicated by the generic Dls/Spils interfaces

and SUNLinearSolver/SUNMATRIX modules. The exception is CVDIAG, a diagonal approximate

Jacobian solver available to cvode and cvodes.

•Converted all sundials example problems and ﬁles to utilize the new generic sunmatrix and

SUNLinearSolver objects, along with updated Dls and Spils linear solver interfaces.

•Added Spils interface routines to arkode,cvode,cvodes,ida, and idas to allow speciﬁcation

of a user-provided ”JTSetup” routine. This change supports users who wish to set up data

structures for the user-provided Jacobian-times-vector (”JTimes”) routine, and where the cost

of one JTSetup setup per Newton iteration can be amortized between multiple JTimes calls.

Two additional nvector implementations were added – one for cuda and one for raja vectors.

These vectors are supplied to provide very basic support for running on GPU architectures. Users are

advised that these vectors both move all data to the GPU device upon construction, and speedup will

only be realized if the user also conducts the right-hand-side function evaluation on the device. In

addition, these vectors assume the problem ﬁts on one GPU. Further information about raja, users

are referred to the web site, https://software.llnl.gov/RAJA/. These additions are accompanied by

additions to various interface functions and to user documentation.

All indices for data structures were updated to a new sunindextype that can be conﬁgured to

be a 32- or 64-bit integer data index type. sunindextype is deﬁned to be int32 t or int64 t when

portable types are supported, otherwise it is deﬁned as int or long int. The Fortran interfaces

continue to use long int for indices, except for their sparse matrix interface that now uses the new

sunindextype. This new ﬂexible capability for index types includes interfaces to PETSc, hypre,

SuperLU MT, and KLU with either 32-bit or 64-bit capabilities depending how the user conﬁgures

sundials.

To avoid potential namespace conﬂicts, the macros deﬁning booleantype values TRUE and FALSE

have been changed to SUNTRUE and SUNFALSE respectively.

Temporary vectors were removed from preconditioner setup and solve routines for all packages. It

is assumed that all necessary data for user-provided preconditioner operations will be allocated and

stored in user-provided data structures.

The ﬁle include/sundials fconfig.h was added. This ﬁle contains sundials type information

for use in Fortran programs.

The build system was expanded to support many of the xSDK-compliant keys. The xSDK is

a movement in scientiﬁc software to provide a foundation for the rapid and eﬃcient production of

high-quality, sustainable extreme-scale scientiﬁc applications. More information can be found at,

https://xsdk.info.

Added functions SUNDIALSGetVersion and SUNDIALSGetVersionNumber to get sundials release

version information at runtime.

In addition, numerous changes were made to the build system. These include the addition of

separate BLAS ENABLE and BLAS LIBRARIES CMake variables, additional error checking during CMake

conﬁguration, minor bug ﬁxes, and renaming CMake options to enable/disable examples for greater

clarity and an added option to enable/disable Fortran 77 examples. These changes included changing

EXAMPLES ENABLE to EXAMPLES ENABLE C, changing CXX ENABLE to EXAMPLES ENABLE CXX, changing

F90 ENABLE to EXAMPLES ENABLE F90, and adding an EXAMPLES ENABLE F77 option.

A bug ﬁx was done to add a missing prototype for IDASetMaxBacksIC in ida.h.

Corrections and additions were made to the examples, to installation-related ﬁles, and to the user

documentation.

1.1 Changes from previous versions 7

Changes in v1.3.0

Two additional nvector implementations were added – one for Hypre (parallel) ParVector vectors,

and one for PETSc vectors. These additions are accompanied by additions to various interface func-

tions and to user documentation.

Each nvector module now includes a function, N VGetVectorID, that returns the nvector

module name.

An optional input function was added to set a maximum number of linesearch backtracks in

the initial condition calculation, and four user-callable functions were added to support the use of

LAPACK linear solvers in solving backward problems for adjoint sensitivity analysis.

For each linear solver, the various solver performance counters are now initialized to 0 in both the

solver speciﬁcation function and in solver linit function. This ensures that these solver counters are

initialized upon linear solver instantiation as well as at the beginning of the problem solution.

A bug in for-loop indices was ﬁxed in IDAAckpntAllocVectors. A bug was ﬁxed in the interpo-

lation functions used in solving backward problems.

A memory leak was ﬁxed in the banded preconditioner interface. In addition, updates were done

to return integers from linear solver and preconditioner ’free’ functions.

In interpolation routines for backward problems, added logic to bypass sensitivity interpolation if

input sensitivity argument is NULL.

The Krylov linear solver Bi-CGstab was enhanced by removing a redundant dot product. Various

additions and corrections were made to the interfaces to the sparse solvers KLU and SuperLU MT,

including support for CSR format when using KLU.

New examples were added for use of the OpenMP vector and for use of sparse direct solvers within

sensitivity integrations.

Minor corrections and additions were made to the idas solver, to the examples, to installation-

related ﬁles, and to the user documentation.

Changes in v1.2.0

Two major additions were made to the linear system solvers that are available for use with the idas

solver. First, in the serial case, an interface to the sparse direct solver KLU was added. Second,

an interface to SuperLU MT, the multi-threaded version of SuperLU, was added as a thread-parallel

sparse direct solver option, to be used with the serial version of the NVECTOR module. As part of

these additions, a sparse matrix (CSC format) structure was added to idas.

Otherwise, only relatively minor modiﬁcations were made to idas:

In IDARootfind, a minor bug was corrected, where the input array rootdir was ignored, and a

line was added to break out of root-search loop if the initial interval size is below the tolerance ttol.

In IDALapackBand, the line smu = MIN(N-1,mu+ml) was changed to smu = mu + ml to correct an

illegal input error for DGBTRF/DGBTRS.

An option was added in the case of Adjoint Sensitivity Analysis with dense or banded Jacobian:

With a call to IDADlsSetDenseJacFnBS or IDADlsSetBandJacFnBS, the user can specify a user-

supplied Jacobian function of type IDADls***JacFnBS, for the case where the backward problem

depends on the forward sensitivities.

A minor bug was ﬁxed regarding the testing of the input tstop on the ﬁrst call to IDASolve.

For the Adjoint Sensitivity Analysis case in which the backward problem depends on the forward

sensitivities, options have been added to allow for user-supplied pset,psolve, and jtimes functions.

In order to avoid possible name conﬂicts, the mathematical macro and function names MIN,MAX,

SQR,RAbs,RSqrt,RExp,RPowerI, and RPowerR were changed to SUNMIN,SUNMAX,SUNSQR,SUNRabs,

SUNRsqrt,SUNRexp,SRpowerI, and SUNRpowerR, respectively. These names occur in both the solver

and in various example programs.

In the User Guide, a paragraph was added in Section 6.2.1 on IDAAdjReInit, and a paragraph

was added in Section 6.2.9 on IDAGetAdjY.

Two new nvector modules have been added for thread-parallel computing environments — one

for OpenMP, denoted NVECTOR OPENMP, and one for Pthreads, denoted NVECTOR PTHREADS.

8 Introduction

With this version of sundials, support and documentation of the Autotools mode of installation

is being dropped, in favor of the CMake mode, which is considered more widely portable.

Changes in v1.1.0

One signiﬁcant design change was made with this release: The problem size and its relatives, band-

width parameters, related internal indices, pivot arrays, and the optional output lsflag have all

been changed from type int to type long int, except for the problem size and bandwidths in user

calls to routines specifying BLAS/LAPACK routines for the dense/band linear solvers. The function

NewIntArray is replaced by a pair NewIntArray/NewLintArray, for int and long int arrays, re-

spectively. In a minor change to the user interface, the type of the index which in IDAS was changed

from long int to int.

Errors in the logic for the integration of backward problems were identiﬁed and ﬁxed.

A large number of minor errors have been ﬁxed. Among these are the following: A missing

vector pointer setting was added in IDASensLineSrch. In IDACompleteStep, conditionals around

lines loading a new column of three auxiliary divided diﬀerence arrays, for a possible order increase,

were ﬁxed. After the solver memory is created, it is set to zero before being ﬁlled. In each linear solver

interface function, the linear solver memory is freed on an error return, and the **Free function now

includes a line setting to NULL the main memory pointer to the linear solver memory. A memory leak

was ﬁxed in two of the IDASp***Free functions. In the rootﬁnding functions IDARcheck1/IDARcheck2,

when an exact zero is found, the array glo of gvalues at the left endpoint is adjusted, instead of

shifting the tlocation tlo slightly. In the installation ﬁles, we modiﬁed the treatment of the macro

SUNDIALS USE GENERIC MATH, so that the parameter GENERIC MATH LIB is either deﬁned

(with no value) or not deﬁned.

1.2 Reading this User Guide

The structure of this document is as follows:

•In Chapter 2, we give short descriptions of the numerical methods implemented by idas for

the solution of initial value problems for systems of DAEs, continue with short descriptions of

preconditioning (§2.2) and rootﬁnding (§2.3), and then give an overview of the mathematical

aspects of sensitivity analysis, both forward (§2.5) and adjoint (§2.6).

•The following chapter describes the structure of the sundials suite of solvers (§3.1) and the

software organization of the idas solver (§3.2).

•Chapter 4is the main usage document for idas for simulation applications. It includes a complete

description of the user interface for the integration of DAE initial value problems. Readers that

are not interested in using idas for sensitivity analysis can then skip the next two chapters.

•Chapter 5describes the usage of idas for forward sensitivity analysis as an extension of its IVP

integration capabilities. We begin with a skeleton of the user main program, with emphasis

on the steps that are required in addition to those already described in Chapter 4. Following

that we provide detailed descriptions of the user-callable interface routines speciﬁc to forward

sensitivity analysis and of the additonal optional user-deﬁned routines.

•Chapter 6describes the usage of idas for adjoint sensitivity analysis. We begin by describing

the idas checkpointing implementation for interpolation of the original IVP solution during

integration of the adjoint system backward in time, and with an overview of a user’s main

program. Following that we provide complete descriptions of the user-callable interface routines

for adjoint sensitivity analysis as well as descriptions of the required additional user-deﬁned

routines.

•Chapter 7gives a brief overview of the generic nvector module shared amongst the various

components of sundials, as well as details on the nvector implementations provided with

sundials.

1.3 SUNDIALS Release License 9

•Chapter 8gives a brief overview of the generic sunmatrix module shared among the vari-

ous components of sundials, and details on the sunmatrix implementations provided with

sundials: a dense implementation (§8.2), a banded implementation (§8.3) and a sparse imple-

mentation (§8.4).

•Chapter 9gives a brief overview of the generic sunlinsol module shared among the various

components of sundials. This chapter contains details on the sunlinsol implementations

provided with sundials. The chapter also contains details on the sunlinsol implementations

provided with sundials that interface with external linear solver libraries.

•Chapter 10 describes the sunnonlinsol API and nonlinear solver implementations shared

among the various components of sundials.

•Finally, in the appendices, we provide detailed instructions for the installation of idas, within

the structure of sundials (Appendix A), as well as a list of all the constants used for input to

and output from idas functions (Appendix B).

Finally, the reader should be aware of the following notational conventions in this user guide:

program listings and identiﬁers (such as IDAInit) within textual explanations appear in typewriter

type style; ﬁelds in Cstructures (such as content) appear in italics; and packages or modules, such

as idals, are written in all capitals. Usage and installation instructions that constitute important

warnings are marked with a triangular symbol in the margin.

1.3 SUNDIALS Release License

The SUNDIALS packages are released open source, under a BSD license. The only requirements of

the BSD license are preservation of copyright and a standard disclaimer of liability. Our Copyright

notice is below along with the license.

**PLEASE NOTE** If you are using SUNDIALS with any third party libraries linked in (e.g.,

LaPACK, KLU, SuperLU MT, petsc, or hypre), be sure to review the respective license of the package

as that license may have more restrictive terms than the SUNDIALS license. For example, if someone

builds SUNDIALS with a statically linked KLU, the build is subject to terms of the LGPL license

(which is what KLU is released with) and *not* the SUNDIALS BSD license anymore.

1.3.1 Copyright Notices

All SUNDIALS packages except ARKode are subject to the following Copyright notice.

1.3.1.1 SUNDIALS Copyright

National Laboratory. Written by A.C. Hindmarsh, D.R. Reynolds, R. Serban, C.S. Woodward, S.D.

Cohen, A.G. Taylor, S. Peles, L.E. Banks, and D. Shumaker.

UCRL-CODE-155951 (CVODE)

UCRL-CODE-155950 (CVODES)

UCRL-CODE-155952 (IDA)

UCRL-CODE-237203 (IDAS)

LLNL-CODE-665877 (KINSOL)

1.3.1.2 ARKode Copyright

Methodist University and Lawrence Livermore National Security Written by D.R. Reynolds, D.J.

Gardner, A.C. Hindmarsh, C.S. Woodward, and J.M. Sexton.

10 Introduction

LLNL-CODE-667205 (ARKODE)

1.3.2 BSD License

Redistribution and use in source and binary forms, with or without modiﬁcation, are permitted

provided that the following conditions are met:

1. Redistributions of source code must retain the above copyright notice, this list of conditions

and the disclaimer below.

2. Redistributions in binary form must reproduce the above copyright notice, this list of conditions

and the disclaimer (as noted below) in the documentation and/or other materials provided with the

distribution.

3. Neither the name of the LLNS/LLNL nor the names of its contributors may be used to endorse

or promote products derived from this software without speciﬁc prior written permission.

THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS

“AS IS” AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED

TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTIC-

ULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL LAWRENCE LIVERMORE NA-

TIONAL SECURITY, LLC, THE U.S. DEPARTMENT OF ENERGY OR CONTRIBUTORS BE

LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CON-

SEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUB-

STITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS IN-

TERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN

CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE)

ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE

POSSIBILITY OF SUCH DAMAGE.

Additional BSD Notice

1. This notice is required to be provided under our contract with the U.S. Department of Energy

(DOE). This work was produced at Lawrence Livermore National Laboratory under Contract

No. DE-AC52-07NA27344 with the DOE.

2. Neither the United States Government nor Lawrence Livermore National Security, LLC nor any

of their employees, makes any warranty, express or implied, or assumes any liability or respon-

sibility for the accuracy, completeness, or usefulness of any information, apparatus, product, or

process disclosed, or represents that its use would not infringe privately-owned rights.

3. Also, reference herein to any speciﬁc commercial products, process, or services by trade name,

trademark, manufacturer or otherwise does not necessarily constitute or imply its endorsement,

recommendation, or favoring by the United States Government or Lawrence Livermore National

Security, LLC. The views and opinions of authors expressed herein do not necessarily state or

reﬂect those of the United States Government or Lawrence Livermore National Security, LLC,

and shall not be used for advertising or product endorsement purposes.

Chapter 2

Mathematical Considerations

idas solves the initial-value problem (IVP) for a DAE system of the general form

F(t, y, ˙y)=0, y(t0) = y0,˙y(t0) = ˙y0,(2.1)

where y, ˙y, and Fare vectors in RN,tis the independent variable, ˙y=dy/dt, and initial values y0,

˙y0are given. (Often tis time, but it certainly need not be.)

Additionally, if (2.1) depends on some parameters p∈RNp, i.e.

F(t, y, ˙y, p)=0

y(t0) = y0(p),˙y(t0) = ˙y0(p),(2.2)

idas can also compute ﬁrst order derivative information, performing either forward sensitivity analysis

or adjoint sensitivity analysis. In the ﬁrst case, idas computes the sensitivities of the solution with

respect to the parameters p, while in the second case, idas computes the gradient of a derived function

with respect to the parameters p.

2.1 IVP solution

Prior to integrating a DAE initial-value problem, an important requirement is that the pair of vectors

y0and ˙y0are both initialized to satisfy the DAE residual F(t0, y0,˙y0) = 0. For a class of problems that

includes so-called semi-explicit index-one systems, idas provides a routine that computes consistent

initial conditions from a user’s initial guess [8]. For this, the user must identify sub-vectors of y(not

necessarily contiguous), denoted ydand ya, which are its diﬀerential and algebraic parts, respectively,

such that Fdepends on ˙ydbut not on any components of ˙ya. The assumption that the system is

“index one” means that for a given tand yd, the system F(t, y, ˙y) = 0 deﬁnes yauniquely. In this

case, a solver within idas computes yaand ˙ydat t=t0, given ydand an initial guess for ya. A second

available option with this solver also computes all of y(t0) given ˙y(t0); this is intended mainly for quasi-

steady-state problems, where ˙y(t0) = 0 is given. In both cases, ida solves the system F(t0, y0,˙y0)=0

for the unknown components of y0and ˙y0, using Newton iteration augmented with a line search global

strategy. In doing this, it makes use of the existing machinery that is to be used for solving the linear

systems during the integration, in combination with certain tricks involving the step size (which is set

artiﬁcially for this calculation). For problems that do not fall into either of these categories, the user

is responsible for passing consistent values, or risks failure in the numerical integration.

The integration method used in idas is the variable-order, variable-coeﬃcient BDF (Backward

Diﬀerentiation Formula), in ﬁxed-leading-coeﬃcient form [4]. The method order ranges from 1 to 5,

with the BDF of order qgiven by the multistep formula

i=0

αn,iyn−i=hn˙yn,(2.3)

12 Mathematical Considerations

where ynand ˙ynare the computed approximations to y(tn) and ˙y(tn), respectively, and the step size

is hn=tn−tn−1. The coeﬃcients αn,i are uniquely determined by the order q, and the history of the

step sizes. The application of the BDF (2.3) to the DAE system (2.1) results in a nonlinear algebraic

system to be solved at each step:

G(yn)≡F tn, yn, h−1

i=0

αn,iyn−i!= 0 .(2.4)

By default idas solves (2.4) with a Newton iteration but idas also allows for user-deﬁned nonlinear

solvers (see Chapter 10). Each Newton iteration requires the soution of a linear system of the form

J[yn(m+1) −yn(m)] = −G(yn(m)),(2.5)

where yn(m)is the m-th approximation to yn. Here Jis some approximation to the system Jacobian

J=∂G

∂y =∂F

∂y +α∂F

∂˙y,(2.6)

where α=αn,0/hn. The scalar αchanges whenever the step size or method order changes.

For the solution of the linear systems within the Newton iteration, idas provides several choices,

including the option of a user-supplied linear solver module (see Chapter 9). The linear solver modules

distributed with sundials are organized in two families, a direct family comprising direct linear solvers

for dense, banded, or sparse matrices and a spils family comprising scaled preconditioned iterative

(Krylov) linear solvers. The methods oﬀered through these modules are as follows:

•dense direct solvers, using either an internal implementation or a BLAS/LAPACK implementa-

tion (serial or threaded vector modules only),

•band direct solvers, using either an internal implementation or a BLAS/LAPACK implementa-

tion (serial or threaded vector modules only),

•sparse direct solver interfaces, using either the KLU sparse solver library [17,1], or the thread-

enabled SuperLU MT sparse solver library [35,19,2] (serial or threaded vector modules only)

[Note that users will need to download and install the klu or superlumt packages independent

of idas],

•spgmr, a scaled preconditioned GMRES (Generalized Minimal Residual method) solver without

restarts,

•spfgmr, a scaled preconditioned FGMRES (Flexible Generalized Minimal Residual method)

solver,

•spbcgs, a scaled preconditioned Bi-CGStab (Bi-Conjugate Gradient Stable method) solver,

•sptfqmr, a scaled preconditioned TFQMR (Transpose-Free Quasi-Minimal Residual method)

solver, or

•pcg, a scaled preconditioned CG (Conjugate Gradient method) solver.

For large stiﬀ systems, where direct methods are not feasible, the combination of a BDF integrator and

a preconditioned Krylov method yields a powerful tool because it combines established methods for

stiﬀ integration, nonlinear iteration, and Krylov (linear) iteration with a problem-speciﬁc treatment

of the dominant source of stiﬀness, in the form of the user-supplied preconditioner matrix [6]. For

the spils linear solvers with idas, preconditioning is allowed only on the left (see §2.2). Note that

the dense, band, and sparse direct linear solvers can only be used with serial and threaded vector

representations.

2.1 IVP solution 13

In the process of controlling errors at various levels, idas uses a weighted root-mean-square norm,

denoted k · kWRMS, for all error-like quantities. The multiplicative weights used are based on the

current solution and on the relative and absolute tolerances input by the user, namely

Wi= 1/[rtol · |yi|+atoli].(2.7)

Because 1/Wirepresents a tolerance in the component yi, a vector whose norm is 1 is regarded as

“small.” For brevity, we will usually drop the subscript WRMS on norms in what follows.

In the case of a matrix-based linear solver, the default Newton iteration is a Modiﬁed Newton

iteration, in that the Jacobian Jis ﬁxed (and usually out of date) throughout the nonlinear iterations,

with a coeﬃcient ¯αin place of αin J. However, in the case that a matrix-free iterative linear solver is

used, the default Newton iteration is an Inexact Newton iteration, in which Jis applied in a matrix-

free manner, with matrix-vector products Jv obtained by either diﬀerence quotients or a user-supplied

routine. In this case, the linear residual J∆y+Gis nonzero but controlled. With the default Newton

iteration, the matrix Jand preconditioner matrix Pare updated as infrequently as possible to balance

the high costs of matrix operations against other costs. Speciﬁcally, this matrix update occurs when:

•starting the problem,

•the value ¯αat the last update is such that α/¯α < 3/5 or α/¯α > 5/3, or

•a non-fatal convergence failure occurred with an out-of-date Jor P.

The above strategy balances the high cost of frequent matrix evaluations and preprocessing with

the slow convergence due to infrequent updates. To reduce storage costs on an update, Jacobian

information is always reevaluated from scratch.

The default stopping test for nonlinear solver iterations in idas ensures that the iteration error

yn−yn(m)is small relative to yitself. For this, we estimate the linear convergence rate at all iterations

m > 1 as

R=δm

δ11

m−1

where the δm=yn(m)−yn(m−1) is the correction at iteration m= 1,2, . . .. The nonlinear solver

iteration is halted if R > 0.9. The convergence test at the m-th iteration is then

Skδmk<0.33 ,(2.8)

where S=R/(R−1) whenever m > 1 and R≤0.9. The user has the option of changing the constant

in the convergence test from its default value of 0.33. The quantity Sis set to S= 20 initially and

whenever Jor Pis updated, and it is reset to S= 100 on a step with α6= ¯α. Note that at m= 1, the

convergence test (2.8) uses an old value for S. Therefore, at the ﬁrst nonlinear solver iteration, we

make an additional test and stop the iteration if kδ1k<0.33 ·10−4(since such a δ1is probably just

noise and therefore not appropriate for use in evaluating R). We allow only a small number (default

value 4) of nonlinear iterations. If convergence fails with Jor Pcurrent, we are forced to reduce the

step size hn, and we replace hnby hn/4. The integration is halted after a preset number (default

value 10) of convergence failures. Both the maximum number of allowable nonlinear iterations and

the maximum number of nonlinear convergence failures can be changed by the user from their default

values.

When an iterative method is used to solve the linear system, to minimize the eﬀect of linear

iteration errors on the nonlinear and local integration error controls, we require the preconditioned

linear residual to be small relative to the allowed error in the nonlinear iteration, i.e., kP−1(Jx+G)k<

0.05 ·0.33. The safety factor 0.05 can be changed by the user.

When the Jacobian is stored using either dense or band sunmatrix objects, the Jacobian Jdeﬁned

in (2.6) can be either supplied by the user or have idas compute one internally by diﬀerence quotients.

In the latter case, we use the approximation

Jij = [Fi(t, y +σjej,˙y+ασjej)−Fi(t, y, ˙y)]/σj,with

σj=√Umax {|yj|,|h˙yj|,1/Wj}sign(h˙yj),

14 Mathematical Considerations

where Uis the unit roundoﬀ, his the current step size, and Wjis the error weight for the component

yjdeﬁned by (2.7). We note that with sparse and user-supplied sunmatrix objects, the Jacobian

must be supplied by a user routine.

In the case of an iterative linear solver, if a routine for Jv is not supplied, such products are

approximated by

Jv = [F(t, y +σv, ˙y+ασv)−F(t, y, ˙y)]/σ ,

where the increment σ=√N. As an option, the user can specify a constant factor that is inserted

into this expression for σ.

During the course of integrating the system, idas computes an estimate of the local truncation

error, LTE, at the n-th time step, and requires this to satisfy the inequality

kLTEkWRMS ≤1.

Asymptotically, LTE varies as hq+1 at step size hand order q, as does the predictor-corrector diﬀerence

∆n≡yn−yn(0). Thus there is a constant Csuch that

LTE = C∆n+O(hq+2),

and so the norm of LTE is estimated as |C| · k∆nk. In addition, idas requires that the error in the

associated polynomial interpolant over the current step be bounded by 1 in norm. The leading term

of the norm of this error is bounded by ¯

Ck∆nkfor another constant ¯

C. Thus the local error test in

idas is

max{|C|,¯

C}k∆nk ≤ 1.(2.9)

A user option is available by which the algebraic components of the error vector are omitted from the

test (2.9), if these have been so identiﬁed.

In idas, the local error test is tightly coupled with the logic for selecting the step size and order.

First, there is an initial phase that is treated specially; for the ﬁrst few steps, the step size is doubled

and the order raised (from its initial value of 1) on every step, until (a) the local error test (2.9) fails,

(b) the order is reduced (by the rules given below), or (c) the order reaches 5 (the maximum). For

step and order selection on the general step, idas uses a diﬀerent set of local error estimates, based

on the asymptotic behavior of the local error in the case of ﬁxed step sizes. At each of the orders q0

equal to q,q−1 (if q > 1), q−2 (if q > 2), or q+ 1 (if q < 5), there are constants C(q0) such that the

norm of the local truncation error at order q0satisﬁes

LTE(q0) = C(q0)kφ(q0+ 1)k+O(hq0+2),

where φ(k) is a modiﬁed divided diﬀerence of order kthat is retained by idas (and behaves asymp-

totically as hk). Thus the local truncation errors are estimated as ELTE(q0) = C(q0)kφ(q0+ 1)kto

select step sizes. But the choice of order in idas is based on the requirement that the scaled derivative

norms, khky(k)k, are monotonically decreasing with k, for knear q. These norms are again estimated

using the φ(k), and in fact

khq0+1y(q0+1)k ≈ T(q0)≡(q0+ 1)ELTE(q0).

The step/order selection begins with a test for monotonicity that is made even before the local error

test is performed. Namely, the order is reset to q0=q−1 if (a) q= 2 and T(1) ≤T(2)/2, or (b) q > 2

and max{T(q−1), T (q−2)} ≤ T(q); otherwise q0=q. Next the local error test (2.9) is performed,

and if it fails, the step is redone at order q←q0and a new step size h0. The latter is based on the

hq+1 asymptotic behavior of ELTE(q), and, with safety factors, is given by

η=h0/h = 0.9/[2 ELTE(q)]1/(q+1) .

The value of ηis adjusted so that 0.25 ≤η≤0.9 before setting h←h0=ηh. If the local error test

fails a second time, idas uses η= 0.25, and on the third and subsequent failures it uses q= 1 and

η= 0.25. After 10 failures, idas returns with a give-up message.

2.2 Preconditioning 15

As soon as the local error test has passed, the step and order for the next step may be adjusted.

No such change is made if q0=q−1 from the prior test, if q= 5, or if qwas increased on the previous

step. Otherwise, if the last q+ 1 steps were taken at a constant order q < 5 and a constant step size,

idas considers raising the order to q+ 1. The logic is as follows: (a) If q= 1, then reset q= 2 if

T(2) < T (1)/2. (b) If q > 1 then

•reset q←q−1 if T(q−1) ≤min{T(q), T (q+ 1)};

•else reset q←q+ 1 if T(q+ 1) < T (q);

•leave qunchanged otherwise [then T(q−1) > T (q)≤T(q+ 1)].

In any case, the new step size h0is set much as before:

η=h0/h = 1/[2 ELTE(q)]1/(q+1) .

The value of ηis adjusted such that (a) if η > 2, ηis reset to 2; (b) if η≤1, ηis restricted to

0.5≤η≤0.9; and (c) if 1 < η < 2 we use η= 1. Finally his reset to h0=ηh. Thus we do not

increase the step size unless it can be doubled. See [4] for details.

idas permits the user to impose optional inequality constraints on individual components of the

solution vector y. Any of the following four constraints can be imposed: yi>0, yi<0, yi≥0,

or yi≤0. The constraint satisfaction is tested after a successful nonlinear system solution. If any

constraint fails, we declare a convergence failure of the nonlinear iteration and reduce the step size.

Rather than cutting the step size by some arbitrary factor, idas estimates a new step size h0using a

linear approximation of the components in ythat failed the constraint test (including a safety factor

of 0.9 to cover the strict inequality case). These additional constraints are also imposed during the

calculation of consistent initial conditions.

Normally, idas takes steps until a user-deﬁned output value t=tout is overtaken, and then

computes y(tout) by interpolation. However, a “one step” mode option is available, where control

returns to the calling program after each step. There are also options to force idas not to integrate

past a given stopping point t=tstop.

2.2 Preconditioning

When using a nonlinear solver that requires the solution of a linear system of the form J∆y=−G(e.g.,

the default Newton iteration), idas makes repeated use of a linear solver. If this linear system solve

is done with one of the scaled preconditioned iterative linear solvers supplied with sundials, these

solvers are rarely successful if used without preconditioning; it is generally necessary to precondition

the system in order to obtain acceptable eﬃciency. A system Ax =bcan be preconditioned on the

left, on the right, or on both sides. The Krylov method is then applied to a system with the matrix

P−1A, or AP −1, or P−1

LAP −1

R, instead of A. However, within idas, preconditioning is allowed only on

the left, so that the iterative method is applied to systems (P−1J)∆y=−P−1G. Left preconditioning

is required to make the norm of the linear residual in the nonlinear iteration meaningful; in general,

kJ∆y+Gkis meaningless, since the weights used in the WRMS-norm correspond to y.

In order to improve the convergence of the Krylov iteration, the preconditioner matrix Pshould in

some sense approximate the system matrix A. Yet at the same time, in order to be cost-eﬀective, the

matrix Pshould be reasonably eﬃcient to evaluate and solve. Finding a good point in this tradeoﬀ be-

tween rapid convergence and low cost can be very diﬃcult. Good choices are often problem-dependent

(for example, see [6] for an extensive study of preconditioners for reaction-transport systems).

Typical preconditioners used with idas are based on approximations to the iteration matrix of

the systems involved; in other words, P≈∂F

∂y +α∂F

∂˙y, where αis a scalar inversely proportional to

the integration step size h. Because the Krylov iteration occurs within a nonlinear solver iteration

and further also within a time integration, and since each of these iterations has its own test for

convergence, the preconditioner may use a very crude approximation, as long as it captures the

dominant numerical feature(s) of the system. We have found that the combination of a preconditioner

16 Mathematical Considerations

with the Newton-Krylov iteration, using even a fairly poor approximation to the Jacobian, can be

surprisingly superior to using the same matrix without Krylov acceleration (i.e., a modiﬁed Newton

iteration), as well as to using the Newton-Krylov method with no preconditioning.

2.3 Rootﬁnding

The idas solver has been augmented to include a rootﬁnding feature. This means that, while inte-

grating the Initial Value Problem (2.1), idas can also ﬁnd the roots of a set of user-deﬁned functions

gi(t, y, ˙y) that depend on t, the solution vector y=y(t), and its t−derivative ˙y(t). The number of

these root functions is arbitrary, and if more than one giis found to have a root in any given interval,

the various root locations are found and reported in the order that they occur on the taxis, in the

direction of integration.

Generally, this rootﬁnding feature ﬁnds only roots of odd multiplicity, corresponding to changes in

sign of gi(t, y(t),˙y(t)), denoted gi(t) for short. If a user root function has a root of even multiplicity (no

sign change), it will probably be missed by idas. If such a root is desired, the user should reformulate

the root function so that it changes sign at the desired root.

The basic scheme used is to check for sign changes of any gi(t) over each time step taken, and then

(when a sign change is found) to home in on the root (or roots) with a modiﬁed secant method [25].

In addition, each time gis computed, idas checks to see if gi(t) = 0 exactly, and if so it reports this as

a root. However, if an exact zero of any giis found at a point t,idas computes gat t+δfor a small

increment δ, slightly further in the direction of integration, and if any gi(t+δ) = 0 also, idas stops

and reports an error. This way, each time idas takes a time step, it is guaranteed that the values of

all giare nonzero at some past value of t, beyond which a search for roots is to be done.

At any given time in the course of the time-stepping, after suitable checking and adjusting has

been done, idas has an interval (tlo, thi] in which roots of the gi(t) are to be sought, such that thi is

further ahead in the direction of integration, and all gi(tlo)6= 0. The endpoint thi is either tn, the end

of the time step last taken, or the next requested output time tout if this comes sooner. The endpoint

tlo is either tn−1, or the last output time tout (if this occurred within the last step), or the last root

location (if a root was just located within this step), possibly adjusted slightly toward tnif an exact

zero was found. The algorithm checks gat thi for zeros and for sign changes in (tlo, thi). If no sign

changes are found, then either a root is reported (if some gi(thi) = 0) or we proceed to the next time

interval (starting at thi). If one or more sign changes were found, then a loop is entered to locate the

root to within a rather tight tolerance, given by

τ= 100 ∗U∗(|tn|+|h|) (U= unit roundoﬀ) .

Whenever sign changes are seen in two or more root functions, the one deemed most likely to have

its root occur ﬁrst is the one with the largest value of |gi(thi)|/|gi(thi)−gi(tlo)|, corresponding to the

closest to tlo of the secant method values. At each pass through the loop, a new value tmid is set,

strictly within the search interval, and the values of gi(tmid) are checked. Then either tlo or thi is reset

to tmid according to which subinterval is found to have the sign change. If there is none in (tlo, tmid)

but some gi(tmid) = 0, then that root is reported. The loop continues until |thi −tlo|< τ, and then

the reported root location is thi.

In the loop to locate the root of gi(t), the formula for tmid is

tmid =thi −(thi −tlo)gi(thi)/[gi(thi)−αgi(tlo)] ,

where αa weight parameter. On the ﬁrst two passes through the loop, αis set to 1, making tmid

the secant method value. Thereafter, αis reset according to the side of the subinterval (low vs high,

i.e. toward tlo vs toward thi) in which the sign change was found in the previous two passes. If the

two sides were opposite, αis set to 1. If the two sides were the same, αis halved (if on the low

side) or doubled (if on the high side). The value of tmid is closer to tlo when α < 1 and closer to thi

when α > 1. If the above value of tmid is within τ/2 of tlo or thi, it is adjusted inward, such that its

fractional distance from the endpoint (relative to the interval size) is between .1 and .5 (.5 being the

midpoint), and the actual distance from the endpoint is at least τ/2.

2.4 Pure quadrature integration 17

2.4 Pure quadrature integration

In many applications, and most notably during the backward integration phase of an adjoint sensitivity

analysis run (see §2.6) it is of interest to compute integral quantities of the form

z(t) = Zt

q(τ, y(τ),˙y(τ), p)dτ . (2.10)

The most eﬀective approach to compute z(t) is to extend the original problem with the additional

ODEs (obtained by applying Leibnitz’s diﬀerentiation rule):

˙z=q(t, y, ˙y, p), z(t0)=0.(2.11)

Note that this is equivalent to using a quadrature method based on the underlying linear multistep

polynomial representation for y(t).

This can be done at the “user level” by simply exposing to idas the extended DAE system

(2.2)+(2.10). However, in the context of an implicit integration solver, this approach is not desir-

able since the nonlinear solver module will require the Jacobian (or Jacobian-vector product) of this

extended DAE. Moreover, since the additional states, z, do not enter the right-hand side of the ODE

(2.10) and therefore the residual of the extended DAE system does not depend on z, it is much more

eﬃcient to treat the ODE system (2.10) separately from the original DAE system (2.2) by “taking

out” the additional states zfrom the nonlinear system (2.4) that must be solved in the correction step

of the LMM. Instead, “corrected” values znare computed explicitly as

zn=1

αn,0 hnq(tn, yn,˙yn, p)−

i=1

αn,izn−i!,

once the new approximation ynis available.

The quadrature variables zcan be optionally included in the error test, in which case corresponding

relative and absolute tolerances must be provided.

2.5 Forward sensitivity analysis

Typically, the governing equations of complex, large-scale models depend on various parameters,

through the right-hand side vector and/or through the vector of initial conditions, as in (2.2). In

addition to numerically solving the DAEs, it may be desirable to determine the sensitivity of the results

with respect to the model parameters. Such sensitivity information can be used to estimate which

parameters are most inﬂuential in aﬀecting the behavior of the simulation or to evaluate optimization

gradients (in the setting of dynamic optimization, parameter estimation, optimal control, etc.).

The solution sensitivity with respect to the model parameter piis deﬁned as the vector si(t) =

∂y(t)/∂piand satisﬁes the following forward sensitivity equations (or sensitivity equations for short):

∂F

∂y si+∂F

∂˙y˙si+∂F

∂pi

= 0

si(t0) = ∂y0(p)

∂pi

,˙si(t0) = ∂˙y0(p)

∂pi

(2.12)

obtained by applying the chain rule of diﬀerentiation to the original DAEs (2.2).

When performing forward sensitivity analysis, idas carries out the time integration of the combined

system, (2.2) and (2.12), by viewing it as a DAE system of size N(Ns+ 1), where Nsis the number

of model parameters pi, with respect to which sensitivities are desired (Ns≤Np). However, major

improvements in eﬃciency can be made by taking advantage of the special form of the sensitivity

equations as linearizations of the original DAEs. In particular, the original DAE system and all

sensitivity systems share the same Jacobian matrix Jin (2.6).

The sensitivity equations are solved with the same linear multistep formula that was selected

for the original DAEs and the same linear solver is used in the correction phase for both state and

sensitivity variables. In addition, idas oﬀers the option of including (full error control) or excluding

(partial error control) the sensitivity variables from the local error test.

18 Mathematical Considerations

2.5.1 Forward sensitivity methods

In what follows we brieﬂy describe three methods that have been proposed for the solution of the

combined DAE and sensitivity system for the vector ˆy= [y, s1, . . . , sNs].

•Staggered Direct In this approach [14], the nonlinear system (2.4) is ﬁrst solved and, once an

acceptable numerical solution is obtained, the sensitivity variables at the new step are found

by directly solving (2.12) after the BDF discretization is used to eliminate ˙si. Although the

system matrix of the above linear system is based on exactly the same information as the

matrix Jin (2.6), it must be updated and factored at every step of the integration, in contrast

to an evaluation of Jwhich is updated only occasionally. For problems with many parameters

(relative to the problem size), the staggered direct method can outperform the methods described

below [34]. However, the computational cost associated with matrix updates and factorizations

makes this method unattractive for problems with many more states than parameters (such as

those arising from semidiscretization of PDEs) and is therefore not implemented in idas.

•Simultaneous Corrector In this method [37], the discretization is applied simultaneously to both

the original equations (2.2) and the sensitivity systems (2.12) resulting in an “extended” non-

linear system ˆ

G(ˆyn) = 0 where ˆyn= [yn, . . . , si, . . .]. This combined nonlinear system can be

solved using a modiﬁed Newton method as in (2.5) by solving the corrector equation

J[ˆyn(m+1) −ˆyn(m)] = −ˆ

G(ˆyn(m)) (2.13)

at each iteration, where

J=





J1J

J20J

.......

JNs0. . . 0J







Jis deﬁned as in (2.6), and Ji= (∂/∂y) [Fysi+F˙y˙si+Fpi]. It can be shown that 2-step

quadratic convergence can be retained by using only the block-diagonal portion of ˆ

Jin the

corrector equation (2.13). This results in a decoupling that allows the reuse of Jwithout

additional matrix factorizations. However, the sum Fysi+F˙y˙si+Fpimust still be reevaluated

at each step of the iterative process (2.13) to update the sensitivity portions of the residual ˆ

•Staggered corrector In this approach [22], as in the staggered direct method, the nonlinear system

(2.4) is solved ﬁrst using the Newton iteration (2.5). Then, for each sensitivity vector ξ≡si, a

separate Newton iteration is used to solve the sensitivity system (2.12):

J[ξn(m+1) −ξn(m)] =

−"Fy(tn, yn,˙yn)ξn(m)+F˙y(tn, yn,˙yn)·h−1

n αn,0ξn(m)+

i=1

αn,iξn−i!+Fpi(tn, yn,˙yn)#.

(2.14)

In other words, a modiﬁed Newton iteration is used to solve a linear system. In this approach,

the matrices ∂F/∂y,∂F/∂ ˙yand vectors ∂F/∂pineed be updated only once per integration step,

after the state correction phase (2.5) has converged.

idas implements both the simultaneous corrector method and the staggered corrector method.

An important observation is that the staggered corrector method, combined with a Krylov linear

solver, eﬀectively results in a staggered direct method. Indeed, the Krylov solver requires only the

action of the matrix Jon a vector, and this can be provided with the current Jacobian information.

Therefore, the modiﬁed Newton procedure (2.14) will theoretically converge after one iteration.

2.5 Forward sensitivity analysis 19

2.5.2 Selection of the absolute tolerances for sensitivity variables

If the sensitivities are included in the error test, idas provides an automated estimation of absolute

tolerances for the sensitivity variables based on the absolute tolerance for the corresponding state

variable. The relative tolerance for sensitivity variables is set to be the same as for the state variables.

The selection of absolute tolerances for the sensitivity variables is based on the observation that

the sensitivity vector siwill have units of [y]/[pi]. With this, the absolute tolerance for the j-th

component of the sensitivity vector siis set to atolj/|¯pi|, where atoljare the absolute tolerances for

the state variables and ¯pis a vector of scaling factors that are dimensionally consistent with the model

parameters pand give an indication of their order of magnitude. This choice of relative and absolute

tolerances is equivalent to requiring that the weighted root-mean-square norm of the sensitivity vector

siwith weights based on sibe the same as the weighted root-mean-square norm of the vector of scaled

sensitivities ¯si=|¯pi|siwith weights based on the state variables (the scaled sensitivities ¯sibeing

dimensionally consistent with the state variables). However, this choice of tolerances for the simay

be a poor one, and the user of idas can provide diﬀerent values as an option.

2.5.3 Evaluation of the sensitivity right-hand side

There are several methods for evaluating the residual functions in the sensitivity systems (2.12):

analytic evaluation, automatic diﬀerentiation, complex-step approximation, and ﬁnite diﬀerences (or

directional derivatives). idas provides all the software hooks for implementing interfaces to automatic

diﬀerentiation (AD) or complex-step approximation; future versions will include a generic interface

to AD-generated functions. At the present time, besides the option for analytical sensitivity right-

hand sides (user-provided), idas can evaluate these quantities using various ﬁnite diﬀerence-based

approximations to evaluate the terms (∂F/∂y)si+ (∂F/∂ ˙y) ˙siand (∂F/∂pi), or using directional

derivatives to evaluate [(∂F/∂y)si+ (∂F/∂ ˙y) ˙si+ (∂F/∂pi)]. As is typical for ﬁnite diﬀerences, the

proper choice of perturbations is a delicate matter. idas takes into account several problem-related

features: the relative DAE error tolerance rtol, the machine unit roundoﬀ U, the scale factor ¯pi, and

the weighted root-mean-square norm of the sensitivity vector si.

Using central ﬁnite diﬀerences as an example, the two terms (∂F/∂y)si+ (∂F/∂ ˙y) ˙siand ∂F/∂pi

in (2.12) can be evaluated either separately:

∂F

∂y si+∂F

∂˙y˙si≈F(t, y +σysi,˙y+σy˙si, p)−F(t, y −σysi,˙y−σy˙si, p)

2σy

,(2.15)

∂F

∂pi≈F(t, y, ˙y, p +σiei)−F(t, y, ˙y, p −σiei)

2σi

,(2.15’)

σi=|¯pi|pmax(rtol, U ), σy=1

max(1/σi,ksikWRMS/|¯pi|),

or simultaneously:

∂F

∂y si+∂F

∂˙y˙si+∂F

∂pi≈F(t, y +σsi,˙y+σ˙si, p +σei)−F(t, y −σsi,˙y−σ˙si, p −σei)

2σ,(2.16)

σ= min(σi, σy),

or by adaptively switching between (2.15)+(2.15’) and (2.16), depending on the relative size of the

two ﬁnite diﬀerence increments σiand σy. In the adaptive scheme, if ρ= max(σi/σy, σy/σi), we use

separate evaluations if ρ>ρmax (an input value), and simultaneous evaluations otherwise.

These procedures for choosing the perturbations (σi,σy,σ) and switching between derivative

formulas have also been implemented for one-sided diﬀerence formulas. Forward ﬁnite diﬀerences can

be applied to (∂F/∂y)si+ (∂F/∂ ˙y) ˙siand ∂F

∂piseparately, or the single directional derivative formula

∂F

∂y si+∂F

∂˙y˙si+∂F

∂pi≈F(t, y +σsi,˙y+σ˙si, p +σei)−F(t, y, ˙y, p)

20 Mathematical Considerations

can be used. In idas, the default value of ρmax = 0 indicates the use of the second-order centered

directional derivative formula (2.16) exclusively. Otherwise, the magnitude of ρmax and its sign (pos-

itive or negative) indicates whether this switching is done with regard to (centered or forward) ﬁnite

diﬀerences, respectively.

2.5.4 Quadratures depending on forward sensitivities

If pure quadrature variables are also included in the problem deﬁnition (see §2.4), idas does not carry

their sensitivities automatically. Instead, we provide a more general feature through which integrals

depending on both the states yof (2.2) and the state sensitivities siof (2.12) can be evaluated. In

other words, idas provides support for computing integrals of the form:

¯z(t) = Zt

¯q(τ, y(τ),˙y(τ), s1(τ), . . . , sNp(τ), p)dτ .

If the sensitivities of the quadrature variables zof (2.10) are desired, these can then be computed

by using:

¯qi=qysi+q˙y˙si+qpi, i = 1, . . . , Np,

as integrands for ¯z, where qy,q˙y, and qpare the partial derivatives of the integrand function qof

(2.10).

As with the quadrature variables z, the new variables ¯zare also excluded from any nonlinear solver

phase and “corrected” values ¯znare obtained through explicit formulas.

2.6 Adjoint sensitivity analysis

In the forward sensitivity approach described in the previous section, obtaining sensitivities with

respect to Nsparameters is roughly equivalent to solving an DAE system of size (1 + Ns)N. This

can become prohibitively expensive, especially for large-scale problems, if sensitivities with respect

to many parameters are desired. In this situation, the adjoint sensitivity method is a very attractive

alternative, provided that we do not need the solution sensitivities si, but rather the gradients with

respect to model parameters of a relatively few derived functionals of the solution. In other words, if

y(t) is the solution of (2.2), we wish to evaluate the gradient dG/dp of

G(p) = ZT

g(t, y, p)dt , (2.17)

or, alternatively, the gradient dg/dp of the function g(t, y, p) at the ﬁnal time t=T. The function g

must be smooth enough that ∂g/∂y and ∂g/∂p exist and are bounded.

In what follows, we only sketch the analysis for the sensitivity problem for both Gand g. For

details on the derivation see [13].

2.6.1 Sensitivity of G(p)

We focus ﬁrst on solving the sensitivity problem for G(p) deﬁned by (2.17). Introducing a Lagrange

multiplier λ, we form the augmented objective function

I(p) = G(p)−ZT

λ∗F(t, y, ˙y, p)dt.

Since F(t, y, ˙y, p) = 0, the sensitivity of Gwith respect to pis

dp =dI

dp =ZT

(gp+gyyp)dt −ZT

λ∗(Fp+Fyyp+F˙y˙yp)dt, (2.18)

2.6 Adjoint sensitivity analysis 21

where subscripts on functions such as For gare used to denote partial derivatives. By integration

by parts, we have

λ∗F˙y˙ypdt = (λ∗F˙yyp)|T

t0−ZT

(λ∗F˙y)0ypdt,

where (···)0denotes the t−derivative. Thus equation (2.18) becomes

dp =ZT

(gp−λ∗Fp)dt −ZT

[−gy+λ∗Fy−(λ∗F˙y)0]ypdt −(λ∗F˙yyp)|T

t0.(2.19)

Now by requiring λto satisfy

(λ∗F˙y)0−λ∗Fy=−gy,(2.20)

we obtain

dp =ZT

(gp−λ∗Fp)dt −(λ∗F˙yyp)|T

t0.(2.21)

Note that ypat t=t0is the sensitivity of the initial conditions with respect to p, which is easily ob-

tained. To ﬁnd the initial conditions (at t=T) for the adjoint system, we must take into consideration

the structure of the DAE system.

For index-0 and index-1 DAE systems, we can simply take

λ∗F˙y|t=T= 0,(2.22)

yielding the sensitivity equation for dG/dp

dp =ZT

(gp−λ∗Fp)dt + (λ∗F˙yyp)|t=t0.(2.23)

This choice will not suﬃce for a Hessenberg index-2 DAE system. For a derivation of proper ﬁnal

conditions in such cases, see [13].

The ﬁrst thing to notice about the adjoint system (2.20) is that there is no explicit speciﬁcation

of the parameters p; this implies that, once the solution λis found, the formula (2.21) can then be

used to ﬁnd the gradient of Gwith respect to any of the parameters p. The second important remark

is that the adjoint system (2.20) is a terminal value problem which depends on the solution y(t) of

the original IVP (2.2). Therefore, a procedure is needed for providing the states yobtained during

a forward integration phase of (2.2) to idas during the backward integration phase of (2.20). The

approach adopted in idas, based on checkpointing, is described in §2.6.3 below.

2.6.2 Sensitivity of g(T, p)

Now let us consider the computation of dg/dp(T). From dg/dp(T)=(d/dT )(dG/dp) and equation

(2.21), we have

dp = (gp−λ∗Fp)(T)−ZT

λ∗

TFpdt + (λ∗

TF˙yyp)|t=t0−d(λ∗F˙yyp)

dT (2.24)

where λTdenotes ∂λ/∂T . For index-0 and index-1 DAEs, we obtain

d(λ∗F˙yyp)|t=T

dT = 0,

while for a Hessenberg index-2 DAE system we have

d(λ∗F˙yyp)|t=T

dT =−d(gya(CB)−1f2

dt t=T

22 Mathematical Considerations

The corresponding adjoint equations are

(λ∗

TF˙y)0−λ∗

TFy= 0.(2.25)

For index-0 and index-1 DAEs (as shown above, the index-2 case is diﬀerent), to ﬁnd the boundary

condition for this equation we write λas λ(t, T ) because it depends on both tand T. Then

λ∗(T, T )F˙y|t=T= 0.

Taking the total derivative, we obtain

(λt+λT)∗(T, T )F˙y|t=T+λ∗(T, T )dF ˙y

dt |t=T= 0.

Since λtis just ˙

λ, we have the boundary condition

(λ∗

TF˙y)|t=T=−λ∗(T, T )dF ˙y

dt +˙

λ∗F˙y|t=T.

For the index-one DAE case, the above relation and (2.20) yield

(λ∗

TF˙y)|t=T= [gy−λ∗Fy]|t=T.(2.26)

For the regular implicit ODE case, F˙yis invertible; thus we have λ(T, T ) = 0, which leads to λT(T) =

−˙

λ(T). As with the ﬁnal conditions for λ(T) in (2.20), the above selection for λT(T) is not suﬃcient

for index-two Hessenberg DAEs (see [13] for details).

2.6.3 Checkpointing scheme

During the backward integration, the evaluation of the right-hand side of the adjoint system requires,

at the current time, the states ywhich were computed during the forward integration phase. Since

idas implements variable-step integration formulas, it is unlikely that the states will be available at

the desired time and so some form of interpolation is needed. The idas implementation being also

variable-order, it is possible that during the forward integration phase the order may be reduced as

low as ﬁrst order, which means that there may be points in time where only yand ˙yare available.

These requirements therefore limit the choices for possible interpolation schemes. idas implements

two interpolation methods: a cubic Hermite interpolation algorithm and a variable-degree polynomial

interpolation method which attempts to mimic the BDF interpolant for the forward integration.

However, especially for large-scale problems and long integration intervals, the number and size

of the vectors yand ˙ythat would need to be stored make this approach computationally intractable.

Thus, idas settles for a compromise between storage space and execution time by implementing a so-

called checkpointing scheme. At the cost of at most one additional forward integration, this approach

oﬀers the best possible estimate of memory requirements for adjoint sensitivity analysis. To begin

with, based on the problem size Nand the available memory, the user decides on the number Nd

of data pairs (y, ˙y) if cubic Hermite interpolation is selected, or on the number Ndof yvectors in

the case of variable-degree polynomial interpolation, that can be kept in memory for the purpose of

interpolation. Then, during the ﬁrst forward integration stage, after every Ndintegration steps a

checkpoint is formed by saving enough information (either in memory or on disk) to allow for a hot

restart, that is a restart which will exactly reproduce the forward integration. In order to avoid storing

Jacobian-related data at each checkpoint, a reevaluation of the iteration matrix is forced before each

checkpoint. At the end of this stage, we are left with Nccheckpoints, including one at t0. During the

backward integration stage, the adjoint variables are integrated backwards from Tto t0, going from

one checkpoint to the previous one. The backward integration from checkpoint i+ 1 to checkpoint i

is preceded by a forward integration from ito i+ 1 during which the Ndvectors y(and, if necessary

˙y) are generated and stored in memory for interpolation1

1The degree of the interpolation polynomial is always that of the current BDF order for the forward interpolation at

2.7 Second-order sensitivity analysis 23

t0t1t2t3tf

Forward pass

Backward pass

. . . .

. . .

Figure 2.1: Illustration of the checkpointing algorithm for generation of the forward solution during

the integration of the adjoint system.

This approach transfers the uncertainty in the number of integration steps in the forward inte-

gration phase to uncertainty in the ﬁnal number of checkpoints. However, Ncis much smaller than

the number of steps taken during the forward integration, and there is no major penalty for writ-

ing/reading the checkpoint data to/from a temporary ﬁle. Note that, at the end of the ﬁrst forward

integration stage, interpolation data are available from the last checkpoint to the end of the interval

of integration. If no checkpoints are necessary (Ndis larger than the number of integration steps

taken in the solution of (2.2)), the total cost of an adjoint sensitivity computation can be as low as

one forward plus one backward integration. In addition, idas provides the capability of reusing a set

of checkpoints for multiple backward integrations, thus allowing for eﬃcient computation of gradients

of several functionals (2.17).

Finally, we note that the adjoint sensitivity module in idas provides the necessary infrastructure

to integrate backwards in time any DAE terminal value problem dependent on the solution of the

IVP (2.2), including adjoint systems (2.20) or (2.25), as well as any other quadrature ODEs that may

be needed in evaluating the integrals in (2.21). In particular, for DAE systems arising from semi-

discretization of time-dependent PDEs, this feature allows for integration of either the discretized

adjoint PDE system or the adjoint of the discretized PDE.

2.7 Second-order sensitivity analysis

In some applications (e.g., dynamically-constrained optimization) it may be desirable to compute

second-order derivative information. Considering the DAE problem (2.2) and some model output

functional2g(y), the Hessian d2g/dp2can be obtained in a forward sensitivity analysis setting as

d2g

dp2=gy⊗INpypp +yT

pgyyyp,

where ⊗is the Kronecker product. The second-order sensitivities are solution of the matrix DAE

system:

F˙y⊗INp·˙ypp +Fy⊗INp·ypp +IN⊗˙yT

p·(F˙y˙y˙yp+Fy˙yyp) + IN⊗yT

p·(Fy˙y˙yp+Fyyyp)=0

ypp(t0) = ∂2y0

∂p2,˙ypp(t0) = ∂2˙y0

∂p2,

the ﬁrst point to the right of the time at which the interpolated value is sought (unless too close to the i-th checkpoint, in

which case it uses the BDF order at the right-most relevant point). However, because of the FLC BDF implementation

(see §2.1), the resulting interpolation polynomial is only an approximation to the underlying BDF interpolant.

The Hermite cubic interpolation option is present because it was implemented chronologically ﬁrst and it is also used

by other adjoint solvers (e.g. daspkadjoint). The variable-degree polynomial is more memory-eﬃcient (it requires only

half of the memory storage of the cubic Hermite interpolation) and is more accurate.

2For the sake of simpliﬁty in presentation, we do not include explicit dependencies of gon time tor parameters p.

Moreover, we only consider the case in which the dependency of the original DAE (2.2) on the parameters pis through

its initial conditions only. For details on the derivation in the general case, see [38].

24 Mathematical Considerations

where ypdenotes the ﬁrst-order sensitivity matrix, the solution of Npsystems (2.12), and ypp is a

third-order tensor. It is easy to see that, except for situations in which the number of parameters Np

is very small, the computational cost of this so-called forward-over-forward approach is exorbitant as

it requires the solution of Np+N2

padditional DAE systems of the same dimension as (2.2).

A much more eﬃcient alternative is to compute Hessian-vector products using a so-called forward-

over-adjoint approach. This method is based on using the same “trick” as the one used in computing

gradients of pointwise functionals with the adjoint method, namely applying a formal directional for-

ward derivation to the gradient of (2.21) (or the equivalent one for a pointwise functional g(T, y(T))).

With that, the cost of computing a full Hessian is roughly equivalent to the cost of computing the gra-

dient with forward sensitivity analysis. However, Hessian-vector products can be cheaply computed

with one additional adjoint solve.

As an illustration3, consider the ODE problem

˙y=f(t, y), y(t0) = y0(p),

depending on some parameters pthrough the initial conditions only and consider the model functional

output G(p) = Rtf

t0g(t, y)dt. It can be shown that the product between the Hessian of G(with respect

to the parameters p) and some vector ucan be computed as

∂2G

∂p2u=λT⊗INpyppu+yT

pµt=t0,

where λand µare solutions of

−˙µ=fT

yµ+λT⊗Infyy s;µ(tf)=0

−˙

λ=fT

yλ+gT

y;λ(tf)=0

˙s=fys;s(t0) = y0pu.

(2.27)

In the above equation, s=ypuis a linear combination of the columns of the sensitivity matrix yp.

The forward-over-adjoint approach hinges crucially on the fact that scan be computed at the cost of

a forward sensitivity analysis with respect to a single parameter (the last ODE problem above) which

is possible due to the linearity of the forward sensitivity equations (2.12).

Therefore (and this is also valid for the DAE case), the cost of computing the Hessian-vector

product is roughly that of two forward and two backward integrations of a system of DAEs of size

N. For more details, including the corresponding formulas for a pointwise model functional output,

see the work by Ozyurt and Barton [38] who discuss this problem for ODE initial value problems. As

far as we know, there is no published equivalent work on DAE problems. However, the derivations

given in [38] for ODE problems can be extended to DAEs with some careful consideration given to

the derivation of proper ﬁnal conditions on the adjoint systems, following the ideas presented in [13].

To allow the foward-over-adjoint approach described above, idas provides support for:

•the integration of multiple backward problems depending on the same underlying forward prob-

lem (2.2), and

•the integration of backward problems and computation of backward quadratures depending on

both the states yand forward sensitivities (for this particular application, s) of the original

problem (2.2).

3The derivation for the general DAE case is too involved for the purposes of this discussion.

Chapter 3

Code Organization

3.1 SUNDIALS organization

The family of solvers referred to as sundials consists of the solvers cvode and arkode (for ODE

systems), kinsol (for nonlinear algebraic systems), and ida (for diﬀerential-algebraic systems). In

addition, sundials also includes variants of cvode and ida with sensitivity analysis capabilities

(using either forward or adjoint methods), called cvodes and idas, respectively.

The various solvers of this family share many subordinate modules. For this reason, it is organized

as a family, with a directory structure that exploits that sharing (see Figs. 3.1 and 3.2). The following

is a list of the solver packages presently available, and the basic functionality of each:

•cvode, a solver for stiﬀ and nonstiﬀ ODE systems dy/dt =f(t, y) based on Adams and BDF

methods;

•cvodes, a solver for stiﬀ and nonstiﬀ ODE systems with sensitivity analysis capabilities;

•arkode, a solver for ODE systems Mdy/dt =fE(t, y) + fI(t, y) based on additive Runge-Kutta

methods;

•ida, a solver for diﬀerential-algebraic systems F(t, y, ˙y) = 0 based on BDF methods;

•idas, a solver for diﬀerential-algebraic systems with sensitivity analysis capabilities;

•kinsol, a solver for nonlinear algebraic systems F(u) = 0.

3.2 IDAS organization

The idas package is written in the ANSI Clanguage. The following summarizes the basic structure

of the package, although knowledge of this structure is not necessary for its use.

The overall organization of the idas package is shown in Figure 3.3. The central integration

module, implemented in the ﬁles idas.h,idas impl.h, and idas.c, deals with the evaluation of

integration coeﬃcients, estimation of local error, selection of stepsize and order, and interpolation to

user output points, among other issues.

idas utilizes generic linear and nonlinear solver modules deﬁned by the sunlinsol API (see Chap-

ter 9) and sunnonlinsol API (see Chapter 10) respectively. As such, idas has no knowledge of the

method being used to solve the linear and nonlinear systems that arise in each time step. For any given

user problem, there exists a single nonlinear solver interface and, if necessary, one of the linear system

solver interfaces is speciﬁed, and invoked as needed during the integration. While sundials includes a

ﬁxed-point nonlinear solver module, it is not currently supported in idas (note the ﬁxed-point module

is listed in Figure 3.1 but not Figure 3.3).

In addition, if forward sensitivity analysis is turned on, the main module will integrate the forward

sensitivity equations simultaneously with the original IVP. The sensitivity variables may be included

26 Code Organization

SUNDIALS

CVODE CVODES ARKODE IDAS KINSOLIDA

VECTOR MODULES

SERIAL PARALLEL

(MPI)

PTHREADSOPENMP

CUDA RAJA

PARHYP

(HYPRE) PETSC

NVECTOR API SUNMATRIX API

MATRIX MODULES

DENSE

BAND

SPARSE

Cut Here

SUNLINEARSOLVER API

LINEAR SOLVER MODULES

MATRIX-BASED

DENSE

SUPERLU_MT

BAND

KLU

LAPACK

DENSE

LAPACK

BAND

MATRIX-FREE

SPTFQMR

SPBCG

SPFGMR

PCG

SPGMR

SUNNONLINEARSOLVER API

NONLINEAR SOLVER MODULES

NEWTON

FIXED POINT

Figure 3.1: High-level diagram of the sundials suite

in the local error control mechanism of the main integrator. idas provides two diﬀerent strategies

for dealing with the correction stage for the sensitivity variables: IDA SIMULTANEOUS IDA STAGGERED

(see §2.5). The idas package includes an algorithm for the approximation of the sensitivity equations

residuals by diﬀerence quotients, but the user has the option of supplying these residual functions

directly.

The adjoint sensitivity module (ﬁle idaa.c) provides the infrastructure needed for the backward

integration of any system of DAEs which depends on the solution of the original IVP, in particular the

adjoint system and any quadratures required in evaluating the gradient of the objective functional.

This module deals with the setup of the checkpoints, the interpolation of the forward solution during

the backward integration, and the backward integration of the adjoint equations.

idas now has a single uniﬁed linear solver interface, idals, supporting both direct and iterative

linear solvers built using the generic sunlinsol API (see Chapter 9). These solvers may utilize a

sunmatrix object (see Chapter 8) for storing Jacobian information, or they may be matrix-free.

Since idas can operate on any valid sunlinsol implementation, the set of linear solver modules

available to idas will expand as new sunlinsol modules are developed.

For users employing dense or banded Jacobian matrices, idals includes algorithms for their ap-

proximation through diﬀerence quotients, but the user also has the option of supplying the Jacobian

(or an approximation to it) directly. This user-supplied routine is required when using sparse or

user-supplied Jacobian matrices.

For users employing matrix-free iterative linear solvers, idals includes an algorithm for the approx-

imation by diﬀerence quotients of the product between the Jacobian matrix and a vector, Jv. Again,

the user has the option of providing routines for this operation, in two phases: setup (preprocessing

of Jacobian data) and multiplication.

For preconditioned iterative methods, the preconditioning must be supplied by the user, again

in two phases: setup and solve. While there is no default choice of preconditioner analogous to

the diﬀerence-quotient approximation in the direct case, the references [6,10], together with the

example and demonstration programs included with idas, oﬀer considerable assistance in building

3.2 IDAS organization 27

sundials-x.x.x

include src examples docconfig

Cut Here

test

cvode

cvodes

arkode

ida

idas

kinsol

sundials

nvector

sunmatrix

sunlinsol

sunnonlinsol nvec_*

sunmat_*

sunlinsol_*

cvode

cvodes

arkode

ida

idas

kinsol

sundials

fcmix

sunnonlinsol

cvode

cvodes

arkode

ida

idas

kinsol

sundials

nvector

sunmatrix

sunlinsol

sunnonlinsol

(a) Directory structure of the sundials source tree

sundials-x.x.x

Cut Here

cvode

cvodes

arkode

ida

idas

kinsol

nvector

serial parallel openmp

pthread

rajacuda

parhyp petsc

C_serial C_parallel

CXX_serial CXX_parallel F77_serial

F77_parallel F90_serial F90_parallel

C_openmp C_parhyp

sunlinsol

dense

band

lapackdense

lapackband

klu superlumt

spgmr spfgmr sptfqmr

spbcg pcg

sunmatrix

dense band sparse

serial parallel

fcmix_serial fcmix_parallel

C_openmp

fcmix_opemp

petsc

serial parallel

parhyp cuda raja

C_openmp fcmix_serial

fcmix_parallel

examples

serial parallel C_openmp

serial parallel

fcmix_serial fcmix_parallel

C_openmp

sunnonlinsol

newton fixed point

(b) Directory structure of the sundials examples

Figure 3.2: Organization of the sundials suite

28 Code Organization

SUNMATRIX API

MATRIX

MODULES

DENSE

BAND

SPARSE

Cut Here

SUNLINEARSOLVER API

LINEAR SOLVER

MODULES

MATRIX-BASED

DENSE

SUPERLU_MT

BAND

KLU

LAPACK

DENSE

LAPACK

BAND

MATRIX-FREE

SPTFQMR

SPBCG

SPFGMR

PCG

SPGMR

SUNNONLINEARSOLVER API

NONLINEAR SOLVER

MODULES

NEWTON

VECTOR

MODULES

SERIAL

PARALLEL (MPI)

PTHREADS

OPENMP

CUDA

RAJA

PARHYP (HYPRE)

PETSC

NVECTOR API

IDALS:

LINEAR SOLVER INTERFACE

IDANLS:

NONLINEAR SOLVER INTERFACE

PRECONDITIONER MODULES

IDABBDPRE

IDASSUNDIALS IDAADJOINT

Figure 3.3: Overall structure diagram of the ida package. Modules speciﬁc to ida begin with “IDA”

(idals,idabbdpre, and idanls), all other items correspond to generic solver and auxiliary modules.

Note also that the LAPACK, klu and superlumt support is through interfaces to external packages.

Users will need to download and compile those packages independently.

3.2 IDAS organization 29

preconditioners.

idas’ linear solver interface consists of four primary routines, devoted to (1) memory allocation

and initialization, (2) setup of the matrix data involved, (3) solution of the system, and (4) freeing

of memory. The setup and solution phases are separate because the evaluation of Jacobians and

preconditioners is done only periodically during the integration, as required to achieve convergence.

The call list within the central idas module to each of the four associated functions is ﬁxed, thus

allowing the central module to be completely independent of the linear system method.

idas also provides a preconditioner module, idabbdpre, for use with any of the Krylov iterative

linear solvers. It works in conjunction with nvector parallel and generates a preconditioner that

is a block-diagonal matrix with each block being a banded matrix.

All state information used by idas to solve a given problem is saved in a structure, and a pointer

to that structure is returned to the user. There is no global data in the idas package, and so, in this

respect, it is reentrant. State information speciﬁc to the linear solver is saved in a separate structure,

a pointer to which resides in the idas memory structure. The reentrancy of idas was motivated by

the situation where two or more problems are solved by intermixed calls to the package from one user

program.

Chapter 4

Using IDAS for IVP Solution

This chapter is concerned with the use of idas for the integration of DAEs in a C language setting.

The following sections treat the header ﬁles, the layout of the user’s main program, description of

the idas user-callable functions, and description of user-supplied functions. This usage is essentially

equivalent to using ida [30].

The sample programs described in the companion document [43] may also be helpful. Those codes

may be used as templates (with the removal of some lines involved in testing), and are included in

the idas package.

The user should be aware that not all sunlinsol and sunmatrix modules are compatible with

all nvector implementations. Details on compatibility are given in the documentation for each

sunmatrix module (Chapter 8) and each sunlinsol module (Chapter 9). For example, nvec-

tor parallel is not compatible with the dense, banded, or sparse sunmatrix types, or with the

corresponding dense, banded, or sparse sunlinsol modules. Please check Chapters 8and 9to verify

compatibility between these modules. In addition to that documentation, we note that the precon-

ditioner module idabbdpre can only be used with nvector parallel. It is not recommended to

use a threaded vector module with SuperLU MT unless it is the nvector openmp module, and

SuperLU MT is also compiled with OpenMP.

idas uses various constants for both input and output. These are deﬁned as needed in this chapter,

but for convenience are also listed separately in Appendix B.

4.1 Access to library and header ﬁles

At this point, it is assumed that the installation of idas, following the procedure described in Appendix

A, has been completed successfully.

Regardless of where the user’s application program resides, its associated compilation and load

commands must make reference to the appropriate locations for the library and header ﬁles required

by idas. The relevant library ﬁles are

•libdir/libsundials idas.lib,

•libdir/libsundials nvec*.lib,

where the ﬁle extension .lib is typically .so for shared libraries and .a for static libraries. The relevant

header ﬁles are located in the subdirectories

•incdir/include/idas

•incdir/include/sundials

•incdir/include/nvector

•incdir/include/sunmatrix

32 Using IDAS for IVP Solution

•incdir/include/sunlinsol

•incdir/include/sunnonlinsol

The directories libdir and incdir are the install library and include directories, respectively. For

a default installation, these are instdir/lib and instdir/include, respectively, where instdir is the

directory where sundials was installed (see Appendix A).

Note that an application cannot link to both the ida and idas libraries because both contain

user-callable functions with the same names (to ensure that idas is backward compatible with ida).

Therefore, applications that contain both DAE problems and DAEs with sensitivity analysis, should

use idas.

4.2 Data types

The sundials types.h ﬁle contains the deﬁnition of the type realtype, which is used by the sundials

solvers for all ﬂoating-point data, the deﬁnition of the integer type sunindextype, which is used

for vector and matrix indices, and booleantype, which is used for certain logic operations within

sundials.

4.2.1 Floating point types

The type realtype can be float,double, or long double, with the default being double. The user

can change the precision of the sundials solvers arithmetic at the conﬁguration stage (see §A.1.2).

Additionally, based on the current precision, sundials types.h deﬁnes BIG REAL to be the largest

value representable as a realtype,SMALL REAL to be the smallest value representable as a realtype,

and UNIT ROUNDOFF to be the diﬀerence between 1.0 and the minimum realtype greater than 1.0.

Within sundials, real constants are set by way of a macro called RCONST. It is this macro that

needs the ability to branch on the deﬁnition realtype. In ANSI C, a ﬂoating-point constant with no

suﬃx is stored as a double. Placing the suﬃx “F” at the end of a ﬂoating point constant makes it a

float, whereas using the suﬃx “L” makes it a long double. For example,

#define A 1.0

#define B 1.0F

#define C 1.0L

deﬁnes Ato be a double constant equal to 1.0, Bto be a float constant equal to 1.0, and Cto be

along double constant equal to 1.0. The macro call RCONST(1.0) automatically expands to 1.0 if

realtype is double, to 1.0F if realtype is float, or to 1.0L if realtype is long double.sundials

uses the RCONST macro internally to declare all of its ﬂoating-point constants.

A user program which uses the type realtype and the RCONST macro to handle ﬂoating-point

constants is precision-independent except for any calls to precision-speciﬁc standard math library

functions. (Our example programs use both realtype and RCONST.) Users can, however, use the type

double,float, or long double in their code (assuming that this usage is consistent with the typedef

for realtype). Thus, a previously existing piece of ANSI Ccode can use sundials without modifying

the code to use realtype, so long as the sundials libraries use the correct precision (for details see

§A.1.2).

4.2.2 Integer types used for vector and matrix indices

The type sunindextype can be either a 32- or 64-bit signed integer. The default is the portable

int64 t type, and the user can change it to int32 t at the conﬁguration stage. The conﬁguration

system will detect if the compiler does not support portable types, and will replace int32 t and

int64 t with int and long int, respectively, to ensure use of the desired sizes on Linux, Mac OS X,

and Windows platforms. sundials currently does not support unsigned integer types for vector and

matrix indices, although these could be added in the future if there is suﬃcient demand.

4.3 Header ﬁles 33

A user program which uses sunindextype to handle vector and matrix indices will work with both

index storage types except for any calls to index storage-speciﬁc external libraries. (Our Cand C++

example programs use sunindextype.) Users can, however, use any one of int,long int,int32 t,

int64 t or long long int in their code, assuming that this usage is consistent with the typedef

for sunindextype on their architecture). Thus, a previously existing piece of ANSI Ccode can use

sundials without modifying the code to use sunindextype, so long as the sundials libraries use the

appropriate index storage type (for details see §A.1.2).

4.3 Header ﬁles

The calling program must include several header ﬁles so that various macros and data types can be

used. The header ﬁle that is always required is:

•idas/idas.h, the header ﬁle for idas, which deﬁnes the several types and various constants,

and includes function prototypes. This includes the header ﬁle for idals,ida/ida ls.h.

Note that idas.h includes sundials types.h, which deﬁnes the types realtype,sunindextype, and

booleantype and the constants SUNFALSE and SUNTRUE.

The calling program must also include an nvector implementation header ﬁle, of the form

nvector/nvector ***.h. See Chapter 7for the appropriate name. This ﬁle in turn includes the

header ﬁle sundials nvector.h which deﬁnes the abstract N Vector data type.

If using a non-default nonlinear solver module, or when interacting with a sunnonlinsol module

directly, the calling program must also include a sunnonlinsol implementation header ﬁle, of the form

sunnonlinsol/sunnonlinsol ***.h where *** is the name of the nonlinear solver module (see Chap-

ter 10 for more information). This ﬁle in turn includes the header ﬁle sundials nonlinearsolver.h

which deﬁnes the abstract SUNNonlinearSolver data type.

If using a nonlinear solver that requires the solution of a linear system of the form (2.5) (e.g.,

the default Newton iteration), a linear solver module header ﬁle is also required. The header ﬁles

corresponding to the various sundials-provided linear solver modules available for use with idas are:

•Direct linear solvers:

–sunlinsol/sunlinsol dense.h, which is used with the dense linear solver module, sun-

linsol dense;

–sunlinsol/sunlinsol band.h, which is used with the banded linear solver module, sun-

linsol band;

–sunlinsol/sunlinsol lapackdense.h, which is used with the LAPACK dense linear solver

module, sunlinsol lapackdense;

–sunlinsol/sunlinsol lapackband.h, which is used with the LAPACK banded linear

solver module, sunlinsol lapackband;

–sunlinsol/sunlinsol klu.h, which is used with the klu sparse linear solver module,

sunlinsol klu;

–sunlinsol/sunlinsol superlumt.h, which is used with the superlumt sparse linear

solver module, sunlinsol superlumt;

•Iterative linear solvers:

–sunlinsol/sunlinsol spgmr.h, which is used with the scaled, preconditioned GMRES

Krylov linear solver module, sunlinsol spgmr;

–sunlinsol/sunlinsol spfgmr.h, which is used with the scaled, preconditioned FGMRES

Krylov linear solver module, sunlinsol spfgmr;

–sunlinsol/sunlinsol spbcgs.h, which is used with the scaled, preconditioned Bi-CGStab

Krylov linear solver module, sunlinsol spbcgs;

34 Using IDAS for IVP Solution

–sunlinsol/sunlinsol sptfqmr.h, which is used with the scaled, preconditioned TFQMR

Krylov linear solver module, sunlinsol sptfqmr;

–sunlinsol/sunlinsol pcg.h, which is used with the scaled, preconditioned CG Krylov

linear solver module, sunlinsol pcg;

The header ﬁles for the sunlinsol dense and sunlinsol lapackdense linear solver modules

include the ﬁle sunmatrix/sunmatrix dense.h, which deﬁnes the sunmatrix dense matrix module,

as as well as various functions and macros acting on such matrices.

The header ﬁles for the sunlinsol band and sunlinsol lapackband linear solver modules in-

clude the ﬁle sunmatrix/sunmatrix band.h, which deﬁnes the sunmatrix band matrix module, as

as well as various functions and macros acting on such matrices.

The header ﬁles for the sunlinsol klu and sunlinsol superlumt sparse linear solvers include

the ﬁle sunmatrix/sunmatrix sparse.h, which deﬁnes the sunmatrix sparse matrix module, as

well as various functions and macros acting on such matrices.

The header ﬁles for the Krylov iterative solvers include the ﬁle sundials/sundials iterative.h,

which enumerates the kind of preconditioning, and (for the spgmr and spfgmr solvers) the choices

for the Gram-Schmidt process.

Other headers may be needed, according to the choice of preconditioner, etc. For example, in the

idasFoodWeb kry p example (see [43]), preconditioning is done with a block-diagonal matrix. For this,

even though the sunlinsol spgmr linear solver is used, the header sundials/sundials dense.h is

included for access to the underlying generic dense matrix arithmetic routines.

4.4 A skeleton of the user’s main program

The following is a skeleton of the user’s main program (or calling program) for the integration of a DAE

IVP. Most of the steps are independent of the nvector,sunmatrix,sunlinsol, and sunnonlinsol

implementations used. For the steps that are not, refer to Chapter 7,8,9, and 10 for the speciﬁc

name of the function to be called or macro to be referenced.

1. Initialize parallel or multi-threaded environment, if appropriate

For example, call MPI Init to initialize MPI if used, or set num threads, the number of threads

to use within the threaded vector functions, if used.

2. Set problem dimensions etc.

This generally includes the problem size N, and may include the local vector length Nlocal.

Note: The variables Nand Nlocal should be of type sunindextype.

3. Set vectors of initial values

To set the vectors y0 and yp0 to initial values for yand ˙y, use the appropriate functions deﬁned

by the particular nvector implementation.

For native sundials vector implementations (except the cuda and raja-based ones), use a call

of the form y0 = N VMake ***(..., ydata) if the realtype array ydata containing the initial

values of yalready exists. Otherwise, create a new vector by making a call of the form y0 =

N VNew ***(...), and then set its elements by accessing the underlying data with a call of the

form ydata = N VGetArrayPointer(y0). See §7.2-7.5 for details.

For the hypre and petsc vector wrappers, ﬁrst create and initialize the underlying vector and

then create an nvector wrapper with a call of the form y0 = N VMake ***(yvec), where yvec

is a hypre or petsc vector. Note that calls like N VNew ***(...) and N VGetArrayPointer(...)

are not available for these vector wrappers. See §7.6 and §7.7 for details.

If using either the cuda- or raja-based vector implementations use a call of the form y0 =

N VMake ***(..., c) where cis a pointer to a suncudavec or sunrajavec vector class if this class

already exists. Otherwise, create a new vector by making a call of the form y0 = N VNew ***(...),

4.4 A skeleton of the user’s main program 35

and then set its elements by accessing the underlying data where it is located with a call of the

form N VGetDeviceArrayPointer *** or N VGetHostArrayPointer ***. Note that the vector

class will allocate memory on both the host and device when instantiated. See §7.8-7.9 for details.

Set the vector yp0 of initial conditions for ˙ysimilarly.

4. Create idas object

Call ida mem = IDACreate() to create the idas memory block. IDACreate returns a pointer to

the idas memory structure. See §4.5.1 for details. This void * pointer must then be passed as

the ﬁrst argument to all subsequent idas function calls.

5. Initialize idas solver

Call IDAInit(...) to provide required problem speciﬁcations (residual function, initial time, and

initial conditions), allocate internal memory for idas, and initialize idas.IDAInit returns an

error ﬂag to indicate success or an illegal argument value. See §4.5.1 for details.

6. Specify integration tolerances

Call IDASStolerances(...) or IDASVtolerances(...) to specify, respectively, a scalar relative

tolerance and scalar absolute tolerance, or a scalar relative tolerance and a vector of absolute

tolerances. Alternatively, call IDAWFtolerances to specify a function which sets directly the

weights used in evaluating WRMS vector norms. See §4.5.2 for details.

7. Create matrix object

If a nonlinear solver requiring a linear solver will be used (e.g., the default Newton iteration)

and the linear solver will be a matrix-based linear solver, then a template Jacobian matrix must

be created by using the appropriate constructor function deﬁned by the particular sunmatrix

implementation.

For the sundials-supplied sunmatrix implementations, the matrix object may be created using

a call of the form

SUNMatrix J = SUNBandMatrix(...);

SUNMatrix J = SUNDenseMatrix(...);

SUNMatrix J = SUNSparseMatrix(...);

NOTE: The dense, banded, and sparse matrix objects are usable only in a serial or threaded

environment.

8. Create linear solver object

If a nonlinear solver requiring a linear solver is chosen (e.g., the default Newton iteration), then

the desired linear solver object must be created by calling the appropriate constructor function

deﬁned by the particular sunlinsol implementation.

For any of the sundials-supplied sunlinsol implementations, the linear solver object may be

created using a call of the form

SUNLinearSolver LS = SUNLinSol *(...);

where *can be replaced with “Dense”, “SPGMR”, or other options, as discussed in §4.5.3 and

Chapter 9.

9. Set linear solver optional inputs

Call *Set* functions from the selected linear solver module to change optional inputs speciﬁc to

that linear solver. See the documentation for each sunlinsol module in Chapter 9for details.

36 Using IDAS for IVP Solution

10. Attach linear solver module

If a nonlinear solver requiring a linear solver is chosen (e.g., the default Newton iteration), then

initialize the idals linear solver interface by attaching the linear solver object (and matrix object,

if applicable) with the following call (for details see §4.5.3):

ier = IDASetLinearSolver(...);

11. Set optional inputs

Optionally, call IDASet* functions to change from their default values any optional inputs that

control the behavior of idas. See §4.5.8.1 and §4.5.8 for details.

12. Create nonlinear solver object (optional)

If using a non-default nonlinear solver (see §4.5.4), then create the desired nonlinear solver object

by calling the appropriate constructor function deﬁned by the particular sunnonlinsol imple-

mentation (e.g., NLS = SUNNonlinSol ***(...); where *** is the name of the nonlinear solver

(see Chapter 10 for details).

13. Attach nonlinear solver module (optional)

If using a non-default nonlinear solver, then initialize the nonlinear solver interface by attaching the

nonlinear solver object by calling ier = IDASetNonlinearSolver(ida mem, NLS); (see §4.5.4 for

details).

14. Set nonlinear solver optional inputs (optional)

Call the appropriate set functions for the selected nonlinear solver module to change optional

inputs speciﬁc to that nonlinear solver. These must be called after IDAInit if using the default

nonlinear solver or after attaching a new nonlinear solver to idas, otherwise the optional inputs

will be overridden by idas defaults. See Chapter 10 for more information on optional inputs.

15. Correct initial values

Optionally, call IDACalcIC to correct the initial values y0 and yp0 passed to IDAInit. See §4.5.5.

Also see §4.5.8.3 for relevant optional input calls.

16. Specify rootﬁnding problem

Optionally, call IDARootInit to initialize a rootﬁnding problem to be solved during the integration

of the DAE system. See §4.5.6 for details, and see §4.5.8.4 for relevant optional input calls.

17. Advance solution in time

For each point at which output is desired, call flag = IDASolve(ida mem, tout, &tret, yret,

ypret, itask). Here itask speciﬁes the return mode. The vector yret (which can be the same

as the vector y0 above) will contain y(t), while the vector ypret (which can be the same as the

vector yp0 above) will contain ˙y(t). See §4.5.7 for details.

18. Get optional outputs

Call IDA*Get* functions to obtain optional output. See §4.5.10 for details.

19. Deallocate memory for solution vectors

Upon completion of the integration, deallocate memory for the vectors yret and ypret (or yand

yp) by calling the appropriate destructor function deﬁned by the nvector implementation:

N VDestroy(yret);

and similarly for ypret.

20. Free solver memory

4.5 User-callable functions 37

IDAFree(&ida mem) to free the memory allocated for idas.

21. Free nonlinear solver memory (optional)

If a non-default nonlinear solver was used, then call SUNNonlinSolFree(NLS) to free any memory

allocated for the sunnonlinsol object.

22. Free linear solver and matrix memory

Call SUNLinSolFree and SUNMatDestroy to free any memory allocated for the linear solver and

matrix objects created above.

23. Finalize MPI, if used

Call MPI Finalize() to terminate MPI.

sundials provides some linear solvers only as a means for users to get problems running and not

as highly eﬃcient solvers. For example, if solving a dense system, we suggest using the LAPACK

solvers if the size of the linear system is >50,000. (Thanks to A. Nicolai for his testing and rec-

ommendation.) Table 4.1 shows the linear solver interfaces available as sunlinsol modules and the

vector implementations required for use. As an example, one cannot use the dense direct solver inter-

faces with the MPI-based vector implementation. However, as discussed in Chapter 9the sundials

packages operate on generic sunlinsol objects, allowing a user to develop their own solvers should

they so desire.

Table 4.1: sundials linear solver interfaces and vector implementations that can be used for each.

Linear Solver

Serial

Parallel

(MPI)

OpenMP

pThreads

hypre

petsc

cuda

raja

User

Supp.

Dense X X X X

Band X X X X

LapackDense X X X X

LapackBand X X X X

klu X X X X

superlumt X X X X

spgmr X X X X X X X X X

spfgmr X X X X X X X X X

spbcgs X X X X X X X X X

sptfqmr X X X X X X X X X

pcg X X X X X X X X X

User Supp. X X X X X X X X X

4.5 User-callable functions

This section describes the idas functions that are called by the user to set up and solve a DAE. Some of

these are required. However, starting with §4.5.8, the functions listed involve optional inputs/outputs

or restarting, and those paragraphs can be skipped for a casual use of idas. In any case, refer to §4.4

for the correct order of these calls.

On an error, each user-callable function returns a negative value and sends an error message to

the error handler routine, which prints the message on stderr by default. However, the user can set

a ﬁle as error output or can provide his own error handler function (see §4.5.8.1).

38 Using IDAS for IVP Solution

4.5.1 IDAS initialization and deallocation functions

The following three functions must be called in the order listed. The last one is to be called only after

the DAE solution is complete, as it frees the idas memory block created and allocated by the ﬁrst

two calls.

IDACreate

Call ida mem = IDACreate();

Description The function IDACreate instantiates an idas solver object.

Arguments IDACreate has no arguments.

Return value If successful, IDACreate returns a pointer to the newly created idas memory block (of

type void *). Otherwise it returns NULL.

IDAInit

Call flag = IDAInit(ida mem, res, t0, y0, yp0);

Description The function IDAInit provides required problem and solution speciﬁcations, allocates

internal memory, and initializes idas.

Arguments ida mem (void *) pointer to the idas memory block returned by IDACreate.

res (IDAResFn) is the Cfunction which computes the residual function Fin the

DAE. This function has the form res(t, yy, yp, resval, user data). For

full details see §4.6.1.

t0 (realtype) is the initial value of t.

y0 (N Vector) is the initial value of y.

yp0 (N Vector) is the initial value of ˙y.

Return value The return value flag (of type int) will be one of the following:

IDA SUCCESS The call to IDAInit was successful.

IDA MEM NULL The idas memory block was not initialized through a previous call to

IDACreate.

IDA MEM FAIL A memory allocation request has failed.

IDA ILL INPUT An input argument to IDAInit has an illegal value.

Notes If an error occurred, IDAInit also sends an error message to the error handler function.

IDAFree

Call IDAFree(&ida mem);

Description The function IDAFree frees the pointer allocated by a previous call to IDACreate.

Arguments The argument is the pointer to the idas memory block (of type void *).

Return value The function IDAFree has no return value.

4.5.2 IDAS tolerance speciﬁcation functions

One of the following three functions must be called to specify the integration tolerances (or directly

specify the weights used in evaluating WRMS vector norms). Note that this call must be made after

the call to IDAInit.

4.5 User-callable functions 39

IDASStolerances

Call flag = IDASStolerances(ida mem, reltol, abstol);

Description The function IDASStolerances speciﬁes scalar relative and absolute tolerances.

Arguments ida mem (void *) pointer to the idas memory block returned by IDACreate.

reltol (realtype) is the scalar relative error tolerance.

abstol (realtype) is the scalar absolute error tolerance.

Return value The return value flag (of type int) will be one of the following:

IDA SUCCESS The call to IDASStolerances was successful.

IDA MEM NULL The idas memory block was not initialized through a previous call to

IDACreate.

IDA NO MALLOC The allocation function IDAInit has not been called.

IDA ILL INPUT One of the input tolerances was negative.

IDASVtolerances

Call flag = IDASVtolerances(ida mem, reltol, abstol);

Description The function IDASVtolerances speciﬁes scalar relative tolerance and vector absolute

tolerances.

Arguments ida mem (void *) pointer to the idas memory block returned by IDACreate.

reltol (realtype) is the scalar relative error tolerance.

abstol (N Vector) is the vector of absolute error tolerances.

Return value The return value flag (of type int) will be one of the following:

IDA SUCCESS The call to IDASVtolerances was successful.

IDA MEM NULL The idas memory block was not initialized through a previous call to

IDACreate.

IDA NO MALLOC The allocation function IDAInit has not been called.

IDA ILL INPUT The relative error tolerance was negative or the absolute tolerance had

a negative component.

Notes This choice of tolerances is important when the absolute error tolerance needs to be

diﬀerent for each component of the state vector y.

IDAWFtolerances

Call flag = IDAWFtolerances(ida mem, efun);

Description The function IDAWFtolerances speciﬁes a user-supplied function efun that sets the

multiplicative error weights Wifor use in the weighted RMS norm, which are normally

deﬁned by Eq. (2.7).

Arguments ida mem (void *) pointer to the idas memory block returned by IDACreate.

efun (IDAEwtFn) is the Cfunction which deﬁnes the ewt vector (see §4.6.3).

Return value The return value flag (of type int) will be one of the following:

IDA SUCCESS The call to IDAWFtolerances was successful.

IDA MEM NULL The idas memory block was not initialized through a previous call to

IDACreate.

IDA NO MALLOC The allocation function IDAInit has not been called.

General advice on choice of tolerances. For many users, the appropriate choices for tolerance

values in reltol and abstol are a concern. The following pieces of advice are relevant.

(1) The scalar relative tolerance reltol is to be set to control relative errors. So reltol=10−4

means that errors are controlled to .01%. We do not recommend using reltol larger than 10−3.

40 Using IDAS for IVP Solution

On the other hand, reltol should not be so small that it is comparable to the unit roundoﬀ of the

machine arithmetic (generally around 10−15).

(2) The absolute tolerances abstol (whether scalar or vector) need to be set to control absolute

errors when any components of the solution vector ymay be so small that pure relative error control

is meaningless. For example, if y[i] starts at some nonzero value, but in time decays to zero, then

pure relative error control on y[i] makes no sense (and is overly costly) after y[i] is below some

noise level. Then abstol (if scalar) or abstol[i] (if a vector) needs to be set to that noise level. If

the diﬀerent components have diﬀerent noise levels, then abstol should be a vector. See the example

idasRoberts dns in the idas package, and the discussion of it in the idas Examples document [43].

In that problem, the three components vary between 0 and 1, and have diﬀerent noise levels; hence the

abstol vector. It is impossible to give any general advice on abstol values, because the appropriate

noise levels are completely problem-dependent. The user or modeler hopefully has some idea as to

what those noise levels are.

(3) Finally, it is important to pick all the tolerance values conservatively, because they control the

error committed on each individual time step. The ﬁnal (global) errors are a sort of accumulation of

those per-step errors. A good rule of thumb is to reduce the tolerances by a factor of .01 from the actual

desired limits on errors. So if you want .01% accuracy (globally), a good choice is reltol= 10−6. But

in any case, it is a good idea to do a few experiments with the tolerances to see how the computed

solution values vary as tolerances are reduced.

Advice on controlling unphysical negative values. In many applications, some components

in the true solution are always positive or non-negative, though at times very small. In the numerical

solution, however, small negative (hence unphysical) values can then occur. In most cases, these values

are harmless, and simply need to be controlled, not eliminated. The following pieces of advice are

relevant.

(1) The way to control the size of unwanted negative computed values is with tighter absolute

tolerances. Again this requires some knowledge of the noise level of these components, which may or

may not be diﬀerent for diﬀerent components. Some experimentation may be needed.

(2) If output plots or tables are being generated, and it is important to avoid having negative

numbers appear there (for the sake of avoiding a long explanation of them, if nothing else), then

eliminate them, but only in the context of the output medium. Then the internal values carried by

the solver are unaﬀected. Remember that a small negative value in yret returned by idas, with

magnitude comparable to abstol or less, is equivalent to zero as far as the computation is concerned.

(3) The user’s residual routine res should never change a negative value in the solution vector yy

to a non-negative value, as a ”solution” to this problem. This can cause instability. If the res routine

cannot tolerate a zero or negative value (e.g., because there is a square root or log of it), then the

oﬀending value should be changed to zero or a tiny positive number in a temporary variable (not in

the input yy vector) for the purposes of computing F(t, y, ˙y).

(4) idas provides the option of enforcing positivity or non-negativity on components. Also, such

constraints can be enforced by use of the recoverable error return feature in the user-supplied residual

function. However, because these options involve some extra overhead cost, they should only be

exercised if the use of absolute tolerances to control the computed values is unsuccessful.

4.5.3 Linear solver interface functions

As previously explained, if the nonlinear solver requires the solution of linear systems of the form (2.5)

(e.g., the default Newton iteration, then solution of these linear systems is handled with the idals

linear solver interface. This interface supports all valid sunlinsol modules. Here, matrix-based

sunlinsol modules utilize sunmatrix objects to store the Jacobian matrix J=∂F/∂y +α∂F/∂ ˙y

and factorizations used throughout the solution process. Conversely, matrix-free sunlinsol modules

instead use iterative methods to solve the linear systems of equations, and only require the action of

the Jacobian on a vector, Jv.

With most iterative linear solvers, preconditioning can be done on the left only, on the right only,

on both the left and the right, or not at all. The exceptions to this rule are spfgmr that supports

4.5 User-callable functions 41

right preconditioning only and pcg that performs symmetric preconditioning. However, in idas only

left preconditioning is supported. For the speciﬁcation of a preconditioner, see the iterative linear

solver sections in §4.5.8 and §4.6. A preconditioner matrix Pmust approximate the Jacobian J, at

least crudely.

To specify a generic linear solver to idas, after the call to IDACreate but before any calls to

IDASolve, the user’s program must create the appropriate sunlinsol object and call the function

IDASetLinearSolver, as documented below. To create the SUNLinearSolver object, the user may

call one of the sundials-packaged sunlinsol module constructor routines via a call of the form

SUNLinearSolver LS = SUNLinSol_*(...);

The current list of such constructor routines includes SUNLinSol Dense,SUNLinSol Band,

SUNLinSol LapackDense,SUNLinSol LapackBand,SUNLinSol KLU,SUNLinSol SuperLUMT,

SUNLinSol SPGMR,SUNLinSol SPFGMR,SUNLinSol SPBCGS,SUNLinSol SPTFQMR, and SUNLinSol PCG.

Alternately, a user-supplied SUNLinearSolver module may be created and used instead. The use

of each of the generic linear solvers involves certain constants, functions and possibly some macros,

that are likely to be needed in the user code. These are available in the corresponding header ﬁle

associated with the speciﬁc sunmatrix or sunlinsol module in question, as described in Chapters

8and 9.

Once this solver object has been constructed, the user should attach it to idas via a call to

IDASetLinearSolver. The ﬁrst argument passed to this function is the idas memory pointer returned

by IDACreate; the second argument is the desired sunlinsol object to use for solving systems. The

third argument is an optional sunmatrix object to accompany matrix-based sunlinsol inputs (for

matrix-free linear solvers, the third argument should be NULL). A call to this function initializes the

idals linear solver interface, linking it to the main idas integrator, and allows the user to specify

additional parameters and routines pertinent to their choice of linear solver.

IDASetLinearSolver

Call flag = IDASetLinearSolver(ida mem, LS, J);

Description The function IDASetLinearSolver attaches a generic sunlinsol object LS and corre-

sponding template Jacobian sunmatrix object J(if applicable) to idas, initializing the

idals linear solver interface.

Arguments ida mem (void *) pointer to the idas memory block.

LS (SUNLinearSolver)sunlinsol object to use for solving linear systems of the

form (2.5.

J(SUNMatrix)sunmatrix object for used as a template for the Jacobian (or

NULL if not applicable).

Return value The return value flag (of type int) is one of

IDALS SUCCESS The idals initialization was successful.

IDALS MEM NULL The ida mem pointer is NULL.

IDALS ILL INPUT The idals interface is not compatible with the LS or Jinput objects

or is incompatible with the current nvector module.

IDALS SUNLS FAIL A call to the LS object failed.

IDALS MEM FAIL A memory allocation request failed.

Notes If LS is a matrix-based linear solver, then the template Jacobian matrix Jwill be used

in the solve process, so if additional storage is required within the sunmatrix object

(e.g., for factorization of a banded matrix), ensure that the input object is allocated

with suﬃcient size (see the documentation of the particular sunmatrix type in Chapter

8for further information).

The previous routines IDADlsSetLinearSolver and IDASpilsSetLinearSolver are

now wrappers for this routine, and may still be used for backward-compatibility. How-

ever, these will be deprecated in future releases, so we recommend that users transition

to the new routine name soon.

42 Using IDAS for IVP Solution

4.5.4 Nonlinear solver interface function

By default idas uses the sunnonlinsol implementation of Newton’s method deﬁned by the sunnon-

linsol newton module (see §10.2). To specify a diﬀerent nonlinear solver in idas, the user’s program

must create a sunnonlinsol object by calling the appropriate constructor routine. The user must

then attach the sunnonlinsol object to idas by calling IDASetNonlinearSolver, as documented

below.

When changing the nonlinear solver in idas,IDASetNonlinearSolver must be called after IDAInit.

If any calls to IDASolve have been made, then idas will need to be reinitialized by calling IDAReInit

to ensure that the nonlinear solver is initialized correctly before any subsequent calls to IDASolve.

The ﬁrst argument passed to the routine IDASetNonlinearSolver is the idas memory pointer

returned by IDACreate and the second argument is the sunnonlinsol object to use for solving the

nonlinear system 2.4. A call to this function attaches the nonlinear solver to the main idas integrator.

We note that at present, the sunnonlinsol object must be of type SUNNONLINEARSOLVER ROOTFIND.

IDASetNonlinearSolver

Call flag = IDASetNonlinearSolver(ida mem, NLS);

Description The function IDASetNonLinearSolver attaches a sunnonlinsol object (NLS) to idas.

Arguments ida mem (void *) pointer to the idas memory block.

NLS (SUNNonlinearSolver)sunnonlinsol object to use for solving nonlinear sys-

tems.

Return value The return value flag (of type int) is one of

IDA SUCCESS The nonlinear solver was successfully attached.

IDA MEM NULL The ida mem pointer is NULL.

IDA ILL INPUT The sunnonlinsol object is NULL, does not implement the required

nonlinear solver operations, is not of the correct type, or the residual

function, convergence test function, or maximum number of nonlinear

iterations could not be set.

Notes When forward sensitivity analysis capabilities are enabled and the IDA STAGGERED cor-

rector method is used this function sets the nonlinear solver method for correcting state

variables (see §5.2.3 for more details).

4.5.5 Initial condition calculation function

IDACalcIC calculates corrected initial conditions for the DAE system for certain index-one problems

including a class of systems of semi-implicit form. (See §2.1 and Ref. [8].) It uses Newton iteration

combined with a linesearch algorithm. Calling IDACalcIC is optional. It is only necessary when

the initial conditions do not satisfy the given system. Thus if y0 and yp0 are known to satisfy

F(t0, y0,˙y0) = 0, then a call to IDACalcIC is generally not necessary.

A call to the function IDACalcIC must be preceded by successful calls to IDACreate and IDAInit

(or IDAReInit), and by a successful call to the linear system solver speciﬁcation function. The call to

IDACalcIC should precede the call(s) to IDASolve for the given problem.

IDACalcIC

Call flag = IDACalcIC(ida mem, icopt, tout1);

Description The function IDACalcIC corrects the initial values y0 and yp0 at time t0.

Arguments ida mem (void *) pointer to the idas memory block.

icopt (int) is one of the following two options for the initial condition calculation.

icopt=IDA YA YDP INIT directs IDACalcIC to compute the algebraic compo-

nents of yand diﬀerential components of ˙y, given the diﬀerential components

4.5 User-callable functions 43

of y. This option requires that the N Vector id was set through IDASetId,

specifying the diﬀerential and algebraic components.

icopt=IDA Y INIT directs IDACalcIC to compute all components of y, given

˙y. In this case, id is not required.

tout1 (realtype) is the ﬁrst value of tat which a solution will be requested (from

IDASolve). This value is needed here only to determine the direction of inte-

gration and rough scale in the independent variable t.

Return value The return value flag (of type int) will be one of the following:

IDA SUCCESS IDASolve succeeded.

IDA MEM NULL The argument ida mem was NULL.

IDA NO MALLOC The allocation function IDAInit has not been called.

IDA ILL INPUT One of the input arguments was illegal.

IDA LSETUP FAIL The linear solver’s setup function failed in an unrecoverable man-

ner.

IDA LINIT FAIL The linear solver’s initialization function failed.

IDA LSOLVE FAIL The linear solver’s solve function failed in an unrecoverable man-

ner.

IDA BAD EWT Some component of the error weight vector is zero (illegal), either

for the input value of y0 or a corrected value.

IDA FIRST RES FAIL The user’s residual function returned a recoverable error ﬂag on

the ﬁrst call, but IDACalcIC was unable to recover.

IDA RES FAIL The user’s residual function returned a nonrecoverable error ﬂag.

IDA NO RECOVERY The user’s residual function, or the linear solver’s setup or solve

function had a recoverable error, but IDACalcIC was unable to

recover.

IDA CONSTR FAIL IDACalcIC was unable to ﬁnd a solution satisfying the inequality

constraints.

IDA LINESEARCH FAIL The linesearch algorithm failed to ﬁnd a solution with a step

larger than steptol in weighted RMS norm, and within the

allowed number of backtracks.

IDA CONV FAIL IDACalcIC failed to get convergence of the Newton iterations.

Notes All failure return values are negative and therefore a test flag <0 will trap all

IDACalcIC failures.

Note that IDACalcIC will correct the values of y(t0) and ˙y(t0) which were speciﬁed

in the previous call to IDAInit or IDAReInit. To obtain the corrected values, call

IDAGetconsistentIC (see §4.5.10.3).

4.5.6 Rootﬁnding initialization function

While integrating the IVP, idas has the capability of ﬁnding the roots of a set of user-deﬁned functions.

To activate the rootﬁnding algorithm, call the following function. This is normally called only once,

prior to the ﬁrst call to IDASolve, but if the rootﬁnding problem is to be changed during the solution,

IDARootInit can also be called prior to a continuation call to IDASolve.

IDARootInit

Call flag = IDARootInit(ida mem, nrtfn, g);

Description The function IDARootInit speciﬁes that the roots of a set of functions gi(t, y, ˙y) are to

be found while the IVP is being solved.

Arguments ida mem (void *) pointer to the idas memory block returned by IDACreate.

44 Using IDAS for IVP Solution

nrtfn (int) is the number of root functions gi.

g(IDARootFn) is the Cfunction which deﬁnes the nrtfn functions gi(t, y, ˙y)

whose roots are sought. See §4.6.4 for details.

Return value The return value flag (of type int) is one of

IDA SUCCESS The call to IDARootInit was successful.

IDA MEM NULL The ida mem argument was NULL.

IDA MEM FAIL A memory allocation failed.

IDA ILL INPUT The function gis NULL, but nrtfn>0.

Notes If a new IVP is to be solved with a call to IDAReInit, where the new IVP has no

rootﬁnding problem but the prior one did, then call IDARootInit with nrtfn= 0.

4.5.7 IDAS solver function

This is the central step in the solution process, the call to perform the integration of the DAE. One

of the input arguments (itask) speciﬁes one of two modes as to where idas is to return a solution.

But these modes are modiﬁed if the user has set a stop time (with IDASetStopTime) or requested

rootﬁnding.

IDASolve

Call flag = IDASolve(ida mem, tout, &tret, yret, ypret, itask);

Description The function IDASolve integrates the DAE over an interval in t.

Arguments ida mem (void *) pointer to the idas memory block.

tout (realtype) the next time at which a computed solution is desired.

tret (realtype) the time reached by the solver (output).

yret (N Vector) the computed solution vector y.

ypret (N Vector) the computed solution vector ˙y.

itask (int) a ﬂag indicating the job of the solver for the next user step. The

IDA NORMAL task is to have the solver take internal steps until it has reached or

just passed the user speciﬁed tout parameter. The solver then interpolates in

order to return approximate values of y(tout) and ˙y(tout). The IDA ONE STEP

option tells the solver to just take one internal step and return the solution at

the point reached by that step.

Return value IDASolve returns vectors yret and ypret and a corresponding independent variable

value t=tret, such that (yret,ypret) are the computed values of (y(t), ˙y(t)).

In IDA NORMAL mode with no errors, tret will be equal to tout and yret =y(tout),

ypret = ˙y(tout).

The return value flag (of type int) will be one of the following:

IDA SUCCESS IDASolve succeeded.

IDA TSTOP RETURN IDASolve succeeded by reaching the stop point speciﬁed through

the optional input function IDASetStopTime.

IDA ROOT RETURN IDASolve succeeded and found one or more roots. In this case,

tret is the location of the root. If nrtfn >1, call IDAGetRootInfo

to see which giwere found to have a root. See §4.5.10.4 for more

information.

IDA MEM NULL The ida mem argument was NULL.

IDA ILL INPUT One of the inputs to IDASolve was illegal, or some other input

to the solver was either illegal or missing. The latter category

includes the following situations: (a) The tolerances have not been

set. (b) A component of the error weight vector became zero during

4.5 User-callable functions 45

internal time-stepping. (c) The linear solver initialization function

(called by the user after calling IDACreate) failed to set the linear

solver-speciﬁc lsolve ﬁeld in ida mem. (d) A root of one of the

root functions was found both at a point tand also very near t. In

any case, the user should see the printed error message for details.

IDA TOO MUCH WORK The solver took mxstep internal steps but could not reach tout.

The default value for mxstep is MXSTEP DEFAULT = 500.

IDA TOO MUCH ACC The solver could not satisfy the accuracy demanded by the user for

some internal step.

IDA ERR FAIL Error test failures occurred too many times (MXNEF = 10) during

one internal time step or occurred with |h|=hmin.

IDA CONV FAIL Convergence test failures occurred too many times (MXNCF = 10)

during one internal time step or occurred with |h|=hmin.

IDA LINIT FAIL The linear solver’s initialization function failed.

IDA LSETUP FAIL The linear solver’s setup function failed in an unrecoverable man-

ner.

IDA LSOLVE FAIL The linear solver’s solve function failed in an unrecoverable manner.

IDA CONSTR FAIL The inequality constraints were violated and the solver was unable

to recover.

IDA REP RES ERR The user’s residual function repeatedly returned a recoverable error

ﬂag, but the solver was unable to recover.

IDA RES FAIL The user’s residual function returned a nonrecoverable error ﬂag.

IDA RTFUNC FAIL The rootﬁnding function failed.

Notes The vector yret can occupy the same space as the vector y0 of initial conditions that

was passed to IDAInit, and the vector ypret can occupy the same space as yp0.

In the IDA ONE STEP mode, tout is used on the ﬁrst call only, and only to get the

direction and rough scale of the independent variable.

All failure return values are negative and therefore a test flag <0 will trap all IDASolve

failures.

On any error return in which one or more internal steps were taken by IDASolve, the

returned values of tret,yret, and ypret correspond to the farthest point reached in

the integration. On all other error returns, these values are left unchanged from the

previous IDASolve return.

4.5.8 Optional input functions

There are numerous optional input parameters that control the behavior of the idas solver. idas

provides functions that can be used to change these optional input parameters from their default

values. Table 4.2 lists all optional input functions in idas which are then described in detail in the

remainder of this section. For the most casual use of idas, the reader can skip to §4.6.

We note that, on an error return, all these functions also send an error message to the error handler

function. We also note that all error return values are negative, so a test flag <0 will catch any

error.

4.5.8.1 Main solver optional input functions

The calls listed here can be executed in any order. However, if the user’s program calls either

IDASetErrFile or IDASetErrHandlerFn, then that call should appear ﬁrst, in order to take eﬀect for

any later error message.

46 Using IDAS for IVP Solution

Table 4.2: Optional inputs for idas and idals

Optional input Function name Default

IDAS main solver

Pointer to an error ﬁle IDASetErrFile stderr

Error handler function IDASetErrHandlerFn internal fn.

User data IDASetUserData NULL

Maximum order for BDF method IDASetMaxOrd 5

Maximum no. of internal steps before tout IDASetMaxNumSteps 500

Initial step size IDASetInitStep estimated

Maximum absolute step size IDASetMaxStep ∞

Value of tstop IDASetStopTime ∞

Maximum no. of error test failures IDASetMaxErrTestFails 10

Maximum no. of nonlinear iterations IDASetMaxNonlinIters 4

Maximum no. of convergence failures IDASetMaxConvFails 10

Maximum no. of error test failures IDASetMaxErrTestFails 7

Coeﬀ. in the nonlinear convergence test IDASetNonlinConvCoef 0.33

Suppress alg. vars. from error test IDASetSuppressAlg SUNFALSE

Variable types (diﬀerential/algebraic) IDASetId NULL

Inequality constraints on solution IDASetConstraints NULL

Direction of zero-crossing IDASetRootDirection both

Disable rootﬁnding warnings IDASetNoInactiveRootWarn none

IDAS initial conditions calculation

Coeﬀ. in the nonlinear convergence test IDASetNonlinConvCoefIC 0.0033

Maximum no. of steps IDASetMaxNumStepsIC 5

Maximum no. of Jacobian/precond. evals. IDASetMaxNumJacsIC 4

Maximum no. of Newton iterations IDASetMaxNumItersIC 10

Max. linesearch backtracks per Newton iter. IDASetMaxBacksIC 100

Turn oﬀ linesearch IDASetLineSearchOffIC SUNFALSE

Lower bound on Newton step IDASetStepToleranceIC uround2/3

IDALS linear solver interface

Jacobian function IDASetJacFn DQ

Jacobian-times-vector function IDASetJacTimes NULL, DQ

Preconditioner functions IDASetPreconditioner NULL, NULL

Ratio between linear and nonlinear tolerances IDASetEpsLin 0.05

Increment factor used in DQ Jv approx. IDASetIncrementFactor 1.0

4.5 User-callable functions 47

IDASetErrFile

Call flag = IDASetErrFile(ida mem, errfp);

Description The function IDASetErrFile speciﬁes the pointer to the ﬁle where all idas messages

should be directed when the default idas error handler function is used.

Arguments ida mem (void *) pointer to the idas memory block.

errfp (FILE *) pointer to output ﬁle.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

Notes The default value for errfp is stderr.

Passing a value NULL disables all future error message output (except for the case in

which the idas memory pointer is NULL). This use of IDASetErrFile is strongly dis-

couraged.

If IDASetErrFile is to be called, it should be called before any other optional input

functions, in order to take eﬀect for any later error message.

IDASetErrHandlerFn

Call flag = IDASetErrHandlerFn(ida mem, ehfun, eh data);

Description The function IDASetErrHandlerFn speciﬁes the optional user-deﬁned function to be

used in handling error messages.

Arguments ida mem (void *) pointer to the idas memory block.

ehfun (IDAErrHandlerFn) is the user’s Cerror handler function (see §4.6.2).

eh data (void *) pointer to user data passed to ehfun every time it is called.

Return value The return value flag (of type int) is one of

IDA SUCCESS The function ehfun and data pointer eh data have been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

Notes Error messages indicating that the idas solver memory is NULL will always be directed

to stderr.

IDASetUserData

Call flag = IDASetUserData(ida mem, user data);

Description The function IDASetUserData speciﬁes the user data block user data and attaches it

to the main idas memory block.

Arguments ida mem (void *) pointer to the idas memory block.

user data (void *) pointer to the user data.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

Notes If speciﬁed, the pointer to user data is passed to all user-supplied functions that have

it as an argument. Otherwise, a NULL pointer is passed.

If user data is needed in user linear solver or preconditioner functions, the call to

IDASetUserData must be made before the call to specify the linear solver.

48 Using IDAS for IVP Solution

IDASetMaxOrd

Call flag = IDASetMaxOrd(ida mem, maxord);

Description The function IDASetMaxOrd speciﬁes the maximum order of the linear multistep method.

Arguments ida mem (void *) pointer to the idas memory block.

maxord (int) value of the maximum method order. This must be positive.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA ILL INPUT The input value maxord is ≤0, or larger than its previous value.

Notes The default value is 5. If the input value exceeds 5, the value 5 will be used. Since

maxord aﬀects the memory requirements for the internal idas memory block, its value

cannot be increased past its previous value.

IDASetMaxNumSteps

Call flag = IDASetMaxNumSteps(ida mem, mxsteps);

Description The function IDASetMaxNumSteps speciﬁes the maximum number of steps to be taken

by the solver in its attempt to reach the next output time.

Arguments ida mem (void *) pointer to the idas memory block.

mxsteps (long int) maximum allowed number of steps.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

Notes Passing mxsteps = 0 results in idas using the default value (500).

Passing mxsteps <0 disables the test (not recommended).

IDASetInitStep

Call flag = IDASetInitStep(ida mem, hin);

Description The function IDASetInitStep speciﬁes the initial step size.

Arguments ida mem (void *) pointer to the idas memory block.

hin (realtype) value of the initial step size to be attempted. Pass 0.0 to have

idas use the default value.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

Notes By default, idas estimates the initial step as the solution of kh˙ykWRMS = 1/2, with an

added restriction that |h| ≤ .001|tout - t0|.

IDASetMaxStep

Call flag = IDASetMaxStep(ida mem, hmax);

Description The function IDASetMaxStep speciﬁes the maximum absolute value of the step size.

Arguments ida mem (void *) pointer to the idas memory block.

hmax (realtype) maximum absolute value of the step size.

Return value The return value flag (of type int) is one of

4.5 User-callable functions 49

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA ILL INPUT Either hmax is not positive or it is smaller than the minimum allowable

step.

Notes Pass hmax= 0 to obtain the default value ∞.

IDASetStopTime

Call flag = IDASetStopTime(ida mem, tstop);

Description The function IDASetStopTime speciﬁes the value of the independent variable tpast

which the solution is not to proceed.

Arguments ida mem (void *) pointer to the idas memory block.

tstop (realtype) value of the independent variable past which the solution should

not proceed.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA ILL INPUT The value of tstop is not beyond the current tvalue, tn.

Notes The default, if this routine is not called, is that no stop time is imposed.

IDASetMaxErrTestFails

Call flag = IDASetMaxErrTestFails(ida mem, maxnef);

Description The function IDASetMaxErrTestFails speciﬁes the maximum number of error test

failures in attempting one step.

Arguments ida mem (void *) pointer to the idas memory block.

maxnef (int) maximum number of error test failures allowed on one step (>0).

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

Notes The default value is 7.

IDASetMaxNonlinIters

Call flag = IDASetMaxNonlinIters(ida mem, maxcor);

Description The function IDASetMaxNonlinIters speciﬁes the maximum number of nonlinear solver

iterations at one step.

Arguments ida mem (void *) pointer to the idas memory block.

maxcor (int) maximum number of nonlinear solver iterations allowed on one step

(>0).

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA MEM FAIL The sunnonlinsol module is NULL.

Notes The default value is 3.

50 Using IDAS for IVP Solution

IDASetMaxConvFails

Call flag = IDASetMaxConvFails(ida mem, maxncf);

Description The function IDASetMaxConvFails speciﬁes the maximum number of nonlinear solver

convergence failures at one step.

Arguments ida mem (void *) pointer to the idas memory block.

maxncf (int) maximum number of allowable nonlinear solver convergence failures on

one step (>0).

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

Notes The default value is 10.

IDASetNonlinConvCoef

Call flag = IDASetNonlinConvCoef(ida mem, nlscoef);

Description The function IDASetNonlinConvCoef speciﬁes the safety factor in the nonlinear con-

vergence test; see Chapter 2, Eq. (2.8).

Arguments ida mem (void *) pointer to the idas memory block.

nlscoef (realtype) coeﬃcient in nonlinear convergence test (>0.0).

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA ILL INPUT The value of nlscoef is <= 0.0.

Notes The default value is 0.33.

IDASetSuppressAlg

Call flag = IDASetSuppressAlg(ida mem, suppressalg);

Description The function IDASetSuppressAlg indicates whether or not to suppress algebraic vari-

ables in the local error test.

Arguments ida mem (void *) pointer to the idas memory block.

suppressalg (booleantype) indicates whether to suppress (SUNTRUE) or not (SUNFALSE)

the algebraic variables in the local error test.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

Notes The default value is SUNFALSE.

If suppressalg=SUNTRUE is selected, then the id vector must be set (through IDASetId)

to specify the algebraic components.

In general, the use of this option (with suppressalg = SUNTRUE) is discouraged when

solving DAE systems of index 1, whereas it is generally encouraged for systems of index

2 or more. See pp. 146-147 of Ref. [4] for more on this issue.

4.5 User-callable functions 51

IDASetId

Call flag = IDASetId(ida mem, id);

Description The function IDASetId speciﬁes algebraic/diﬀerential components in the yvector.

Arguments ida mem (void *) pointer to the idas memory block.

id (N Vector) state vector. A value of 1.0 indicates a diﬀerential variable, while

0.0 indicates an algebraic variable.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

Notes The vector id is required if the algebraic variables are to be suppressed from the lo-

cal error test (see IDASetSuppressAlg) or if IDACalcIC is to be called with icopt =

IDA YA YDP INIT (see §4.5.5).

IDASetConstraints

Call flag = IDASetConstraints(ida mem, constraints);

Description The function IDASetConstraints speciﬁes a vector deﬁning inequality constraints for

each component of the solution vector y.

Arguments ida mem (void *) pointer to the idas memory block.

constraints (N Vector) vector of constraint ﬂags. If constraints[i] is

0.0 then no constraint is imposed on yi.

1.0 then yiwill be constrained to be yi≥0.0.

−1.0 then yiwill be constrained to be yi≤0.0.

2.0 then yiwill be constrained to be yi>0.0.

−2.0 then yiwill be constrained to be yi<0.0.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA ILL INPUT The constraints vector contains illegal values or the simultaneous cor-

rector option has been selected when doing forward sensitivity analysis.

Notes The presence of a non-NULL constraints vector that is not 0.0 in all components will

cause constraint checking to be performed. However, a call with 0.0 in all components

of constraints will result in an illegal input return.

Constraint checking when doing forward sensitivity analysis with the simultaneous cor-

rector option is currently disallowed and will result in an illegal input return.

4.5.8.2 Linear solver interface optional input functions

The mathematical explanation of the linear solver methods available to idas is provided in §2.1. We

group the user-callable routines into four categories: general routines concerning the overall idals

linear solver interface, optional inputs for matrix-based linear solvers, optional inputs for matrix-free

linear solvers, and optional inputs for iterative linear solvers. We note that the matrix-based and

matrix-free groups are mutually exclusive, whereas the “iterative” tag can apply to either case.

When using matrix-based linear solver modules, the idals solver interface needs a function to com-

pute an approximation to the Jacobian matrix J(t, y, ˙y). This function must be of type IDALsJacFn.

The user can supply a Jacobian function, or if using a dense or banded matrix Jcan use the de-

fault internal diﬀerence quotient approximation that comes with the idals interface. To specify a

user-supplied Jacobian function jac,idals provides the function IDASetJacFn. The idals interface

52 Using IDAS for IVP Solution

passes the pointer user data to the Jacobian function. This allows the user to create an arbitrary

structure with relevant problem data and access it during the execution of the user-supplied Jacobian

function, without using global data in the program. The pointer user data may be speciﬁed through

IDASetUserData.

IDASetJacFn

Call flag = IDASetJacFn(ida mem, jac);

Description The function IDASetJacFn speciﬁes the Jacobian approximation function to be used for

a matrix-based solver within the idals interface.

Arguments ida mem (void *) pointer to the idas memory block.

jac (IDALsJacFn) user-deﬁned Jacobian approximation function.

Return value The return value flag (of type int) is one of

IDALS SUCCESS The optional value has been successfully set.

IDALS MEM NULL The ida mem pointer is NULL.

IDALS LMEM NULL The idals linear solver interface has not been initialized.

Notes This function must be called after the idals linear solver interface has been initialized

through a call to IDASetLinearSolver.

By default, idals uses an internal diﬀerence quotient function for dense and band

matrices. If NULL is passed to jac, this default function is used. An error will occur if

no jac is supplied when using other matrix types.

The function type IDALsJacFn is described in §4.6.5.

The previous routine IDADlsSetJacFn is now a wrapper for this routine, and may still

be used for backward-compatibility. However, this will be deprecated in future releases,

so we recommend that users transition to the new routine name soon.

When using matrix-free linear solver modules, the idals solver interface requires a function to compute

an approximation to the product between the Jacobian matrix J(t, y) and a vector v. The user can

supply a Jacobian-times-vector approximation function, or use the default internal diﬀerence quotient

function that comes with the idals solver interface. A user-deﬁned Jacobian-vector function must

be of type IDALsJacTimesVecFn and can be speciﬁed through a call to IDASetJacTimes (see §4.6.6

for speciﬁcation details). The evaluation and processing of any Jacobian-related data needed by the

user’s Jacobian-times-vector function may be done in the optional user-supplied function jtsetup

(see §4.6.7 for speciﬁcation details). The pointer user data received through IDASetUserData (or

a pointer to NULL if user data was not speciﬁed) is passed to the Jacobian-times-vector setup and

product functions, jtsetup and jtimes, each time they are called. This allows the user to create an

arbitrary structure with relevant problem data and access it during the execution of the user-supplied

preconditioner functions without using global data in the program.

IDASetJacTimes

Call flag = IDASetJacTimes(ida mem, jsetup, jtimes);

Description The function IDASetJacTimes speciﬁes the Jacobian-vector setup and product func-

tions.

Arguments ida mem (void *) pointer to the idas memory block.

jtsetup (IDALsJacTimesSetupFn) user-deﬁned function to set up the Jacobian-vector

product. Pass NULL if no setup is necessary.

jtimes (IDALsJacTimesVecFn) user-deﬁned Jacobian-vector product function.

Return value The return value flag (of type int) is one of

IDALS SUCCESS The optional value has been successfully set.

IDALS MEM NULL The ida mem pointer is NULL.

4.5 User-callable functions 53

IDALS LMEM NULL The idals linear solver has not been initialized.

IDALS SUNLS FAIL An error occurred when setting up the system matrix-times-vector

routines in the sunlinsol object used by the idals interface.

Notes The default is to use an internal ﬁnite diﬀerence quotient for jtimes and to omit

jtsetup. If NULL is passed to jtimes, these defaults are used. A user may specify

non-NULL jtimes and NULL jtsetup inputs.

This function must be called after the idals linear solver interface has been initialized

through a call to IDASetLinearSolver.

The function type IDALsJacTimesSetupFn is described in §4.6.7.

The function type IDALsJacTimesVecFn is described in §4.6.6.

The previous routine IDASpilsSetJacTimes is now a wrapper for this routine, and may

still be used for backward-compatibility. However, this will be deprecated in future

releases, so we recommend that users transition to the new routine name soon.

Alternately, when using the default diﬀerence-quotient approximation to the Jacobian-vector product,

the user may specify the factor to use in setting increments for the ﬁnite-diﬀerence approximation,

via a call to IDASetIncrementFactor:

IDASetIncrementFactor

Call flag = IDASetIncrementFactor(ida mem, dqincfac);

Description The function IDASetIncrementFactor speciﬁes the increment factor to be used in the

diﬀerence-quotient approximation to the product Jv. Speciﬁcally, Jv is approximated

via the formula

Jv =1

σ[F(t, ˜y, ˜y0)−F(t, y, y0)] ,

where ˜y=y+σv, ˜y0=y0+cjσv,cjis a BDF parameter proportional to the step size,

σ=√Ndqincfac, and Nis the number of equations in the DAE system.

Arguments ida mem (void *) pointer to the idas memory block.

dqincfac (realtype) user-speciﬁed increment factor (positive).

Return value The return value flag (of type int) is one of

IDALS SUCCESS The optional value has been successfully set.

IDALS MEM NULL The ida mem pointer is NULL.

IDALS LMEM NULL The idals linear solver has not been initialized.

IDALS ILL INPUT The speciﬁed value of dqincfac is ≤0.

Notes The default value is 1.0.

This function must be called after the idals linear solver interface has been initialized

through a call to IDASetLinearSolver.

The previous routine IDASpilsSetIncrementFactor is now a wrapper for this routine,

and may still be used for backward-compatibility. However, this will be deprecated in

future releases, so we recommend that users transition to the new routine name soon.

When using an iterative linear solver, the user may supply a preconditioning operator to aid in

solution of the system. This operator consists of two user-supplied functions, psetup and psolve,

that are supplied to ida using the function IDASetPreconditioner. The psetup function supplied

to this routine should handle evaluation and preprocessing of any Jacobian data needed by the user’s

preconditioner solve function, psolve. Both of these functions are fully speciﬁed in §4.6. The user

data pointer received through IDASetUserData (or a pointer to NULL if user data was not speciﬁed) is

passed to the psetup and psolve functions. This allows the user to create an arbitrary structure with

relevant problem data and access it during the execution of the user-supplied preconditioner functions

without using global data in the program.

54 Using IDAS for IVP Solution

Also, as described in §2.1, the idals interface requires that iterative linear solvers stop when the

norm of the preconditioned residual satisﬁes

krk ≤ L

where is the nonlinear solver tolerance, and the default L= 0.05; this value may be modiﬁed by

the user through the IDASetEpsLin function.

IDASetPreconditioner

Call flag = IDASetPreconditioner(ida mem, psetup, psolve);

Description The function IDASetPreconditioner speciﬁes the preconditioner setup and solve func-

tions.

Arguments ida mem (void *) pointer to the idas memory block.

psetup (IDALsPrecSetupFn) user-deﬁned function to set up the preconditioner. Pass

NULL if no setup is necessary.

psolve (IDALsPrecSolveFn) user-deﬁned preconditioner solve function.

Return value The return value flag (of type int) is one of

IDALS SUCCESS The optional values have been successfully set.

IDALS MEM NULL The ida mem pointer is NULL.

IDALS LMEM NULL The idals linear solver has not been initialized.

IDALS SUNLS FAIL An error occurred when setting up preconditioning in the sunlinsol

object used by the idals interface.

Notes The default is NULL for both arguments (i.e., no preconditioning).

This function must be called after the idals linear solver interface has been initialized

through a call to IDASetLinearSolver.

The function type IDALsPrecSolveFn is described in §4.6.8.

The function type IDALsPrecSetupFn is described in §4.6.9.

The previous routine IDASpilsSetPreconditioner is now a wrapper for this routine,

and may still be used for backward-compatibility. However, this will be deprecated in

future releases, so we recommend that users transition to the new routine name soon.

IDASetEpsLin

Call flag = IDASetEpsLin(ida mem, eplifac);

Description The function IDASetEpsLin speciﬁes the factor by which the Krylov linear solver’s

convergence test constant is reduced from the nonlinear iteration test constant.

Arguments ida mem (void *) pointer to the idas memory block.

eplifac (realtype) linear convergence safety factor (≥0.0).

Return value The return value flag (of type int) is one of

IDALS SUCCESS The optional value has been successfully set.

IDALS MEM NULL The ida mem pointer is NULL.

IDALS LMEM NULL The idals linear solver has not been initialized.

IDALS ILL INPUT The factor eplifac is negative.

Notes The default value is 0.05.

This function must be called after the idals linear solver interface has been initialized

through a call to IDASetLinearSolver.

If eplifac= 0.0 is passed, the default value is used.

4.5 User-callable functions 55

The previous routine IDASpilsSetEpsLin is now a wrapper for this routine, and may

still be used for backward-compatibility. However, this will be deprecated in future

releases, so we recommend that users transition to the new routine name soon.

4.5.8.3 Initial condition calculation optional input functions

The following functions can be called just prior to calling IDACalcIC to set optional inputs controlling

the initial condition calculation.

IDASetNonlinConvCoefIC

Call flag = IDASetNonlinConvCoefIC(ida mem, epiccon);

Description The function IDASetNonlinConvCoefIC speciﬁes the positive constant in the Newton

iteration convergence test within the initial condition calculation.

Arguments ida mem (void *) pointer to the idas memory block.

epiccon (realtype) coeﬃcient in the Newton convergence test (>0).

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA ILL INPUT The epiccon factor is <= 0.0.

Notes The default value is 0.01 ·0.33.

This test uses a weighted RMS norm (with weights deﬁned by the tolerances). For

new initial value vectors yand ˙yto be accepted, the norm of J−1F(t0, y, ˙y) must be ≤

epiccon, where Jis the system Jacobian.

IDASetMaxNumStepsIC

Call flag = IDASetMaxNumStepsIC(ida mem, maxnh);

Description The function IDASetMaxNumStepsIC speciﬁes the maximum number of steps allowed

when icopt=IDA YA YDP INIT in IDACalcIC, where happears in the system Jacobian,

J=∂F/∂y + (1/h)∂F/∂ ˙y.

Arguments ida mem (void *) pointer to the idas memory block.

maxnh (int) maximum allowed number of values for h.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA ILL INPUT maxnh is non-positive.

Notes The default value is 5.

IDASetMaxNumJacsIC

Call flag = IDASetMaxNumJacsIC(ida mem, maxnj);

Description The function IDASetMaxNumJacsIC speciﬁes the maximum number of the approximate

Jacobian or preconditioner evaluations allowed when the Newton iteration appears to

be slowly converging.

Arguments ida mem (void *) pointer to the idas memory block.

maxnj (int) maximum allowed number of Jacobian or preconditioner evaluations.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

56 Using IDAS for IVP Solution

IDA MEM NULL The ida mem pointer is NULL.

IDA ILL INPUT maxnj is non-positive.

Notes The default value is 4.

IDASetMaxNumItersIC

Call flag = IDASetMaxNumItersIC(ida mem, maxnit);

Description The function IDASetMaxNumItersIC speciﬁes the maximum number of Newton itera-

tions allowed in any one attempt to solve the initial conditions calculation problem.

Arguments ida mem (void *) pointer to the idas memory block.

maxnit (int) maximum number of Newton iterations.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA ILL INPUT maxnit is non-positive.

Notes The default value is 10.

IDASetMaxBacksIC

Call flag = IDASetMaxBacksIC(ida mem, maxbacks);

Description The function IDASetMaxBacksIC speciﬁes the maximum number of linesearch back-

tracks allowed in any Newton iteration, when solving the initial conditions calculation

problem.

Arguments ida mem (void *) pointer to the idas memory block.

maxbacks (int) maximum number of linesearch backtracks per Newton step.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA ILL INPUT maxbacks is non-positive.

Notes The default value is 100.

If IDASetMaxBacksIC is called in a Forward Sensitivity Analysis, the the limit maxbacks

applies in the calculation of both the initial state values and the initial sensititivies.

IDASetLineSearchOffIC

Call flag = IDASetLineSearchOffIC(ida mem, lsoff);

Description The function IDASetLineSearchOffIC speciﬁes whether to turn on or oﬀ the linesearch

algorithm.

Arguments ida mem (void *) pointer to the idas memory block.

lsoff (booleantype) a ﬂag to turn oﬀ (SUNTRUE) or keep (SUNFALSE) the linesearch

algorithm.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

Notes The default value is SUNFALSE.

4.5 User-callable functions 57

IDASetStepToleranceIC

Call flag = IDASetStepToleranceIC(ida mem, steptol);

Description The function IDASetStepToleranceIC speciﬁes a positive lower bound on the Newton

step.

Arguments ida mem (void *) pointer to the idas memory block.

steptol (int) Minimum allowed WRMS-norm of the Newton step (>0.0).

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA ILL INPUT The steptol tolerance is <= 0.0.

Notes The default value is (unit roundoﬀ)2/3.

4.5.8.4 Rootﬁnding optional input functions

The following functions can be called to set optional inputs to control the rootﬁnding algorithm.

IDASetRootDirection

Call flag = IDASetRootDirection(ida mem, rootdir);

Description The function IDASetRootDirection speciﬁes the direction of zero-crossings to be lo-

cated and returned to the user.

Arguments ida mem (void *) pointer to the idas memory block.

rootdir (int *) state array of length nrtfn, the number of root functions gi, as spec-

iﬁed in the call to the function IDARootInit. A value of 0 for rootdir[i]

indicates that crossing in either direction should be reported for gi. A value

of +1 or −1 indicates that the solver should report only zero-crossings where

giis increasing or decreasing, respectively.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA ILL INPUT rootﬁnding has not been activated through a call to IDARootInit.

Notes The default behavior is to locate both zero-crossing directions.

IDASetNoInactiveRootWarn

Call flag = IDASetNoInactiveRootWarn(ida mem);

Description The function IDASetNoInactiveRootWarn disables issuing a warning if some root func-

tion appears to be identically zero at the beginning of the integration.

Arguments ida mem (void *) pointer to the idas memory block.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

Notes idas will not report the initial conditions as a possible zero-crossing (assuming that one

or more components giare zero at the initial time). However, if it appears that some gi

is identically zero at the initial time (i.e., giis zero at the initial time and after the ﬁrst

step), idas will issue a warning which can be disabled with this optional input function.

58 Using IDAS for IVP Solution

4.5.9 Interpolated output function

An optional function IDAGetDky is available to obtain additional output values. This function must be

called after a successful return from IDASolve and provides interpolated values of yor its derivatives

of order up to the last internal order used for any value of tin the last internal step taken by idas.

The call to the IDAGetDky function has the following form:

IDAGetDky

Call flag = IDAGetDky(ida mem, t, k, dky);

Description The function IDAGetDky computes the interpolated values of the kth derivative of yfor

any value of tin the last internal step taken by idas. The value of kmust be non-

negative and smaller than the last internal order used. A value of 0 for kmeans that

the yis interpolated. The value of tmust satisfy tn−hu≤t≤tn, where tndenotes

the current internal time reached, and huis the last internal step size used successfully.

Arguments ida mem (void *) pointer to the idas memory block.

t(realtype) time at which to interpolate.

k(int) integer specifying the order of the derivative of ywanted.

dky (N Vector) vector containing the interpolated kth derivative of y(t).

Return value The return value flag (of type int) is one of

IDA SUCCESS IDAGetDky succeeded.

IDA MEM NULL The ida mem argument was NULL.

IDA BAD T t is not in the interval [tn−hu, tn].

IDA BAD K k is not one of {0,1, . . . , klast}.

IDA BAD DKY dky is NULL.

Notes It is only legal to call the function IDAGetDky after a successful return from IDASolve.

Functions IDAGetCurrentTime,IDAGetLastStep and IDAGetLastOrder (see §4.5.10.2)

can be used to access tn,huand klast.

4.5.10 Optional output functions

idas provides an extensive list of functions that can be used to obtain solver performance information.

Table 4.3 lists all optional output functions in idas, which are then described in detail in the remainder

of this section.

Some of the optional outputs, especially the various counters, can be very useful in determining

how successful the idas solver is in doing its job. For example, the counters nsteps and nrevals

provide a rough measure of the overall cost of a given run, and can be compared among runs with

diﬀering input options to suggest which set of options is most eﬃcient. The ratio nniters/nsteps

measures the performance of the nonlinear solver in solving the nonlinear systems at each time step;

typical values for this range from 1.1 to 1.8. The ratio njevals/nniters (in the case of a matrix-

based linear solver), and the ratio npevals/nniters (in the case of an iterative linear solver) measure

the overall degree of nonlinearity in these systems, and also the quality of the approximate Jacobian

or preconditioner being used. Thus, for example, njevals/nniters can indicate if a user-supplied

Jacobian is inaccurate, if this ratio is larger than for the case of the corresponding internal Jacobian.

The ratio nliters/nniters measures the performance of the Krylov iterative linear solver, and thus

(indirectly) the quality of the preconditioner.

4.5.10.1 SUNDIALS version information

The following functions provide a way to get sundials version information at runtime.

4.5 User-callable functions 59

Table 4.3: Optional outputs from idas and idals

Optional output Function name

IDAS main solver

Size of idas real and integer workspace IDAGetWorkSpace

Cumulative number of internal steps IDAGetNumSteps

No. of calls to residual function IDAGetNumResEvals

No. of calls to linear solver setup function IDAGetNumLinSolvSetups

No. of local error test failures that have occurred IDAGetNumErrTestFails

Order used during the last step IDAGetLastOrder

Order to be attempted on the next step IDAGetCurrentOrder

Order reductions due to stability limit detection IDAGetNumStabLimOrderReds

Actual initial step size used IDAGetActualInitStep

Step size used for the last step IDAGetLastStep

Step size to be attempted on the next step IDAGetCurrentStep

Current internal time reached by the solver IDAGetCurrentTime

Suggested factor for tolerance scaling IDAGetTolScaleFactor

Error weight vector for state variables IDAGetErrWeights

Estimated local errors IDAGetEstLocalErrors

No. of nonlinear solver iterations IDAGetNumNonlinSolvIters

No. of nonlinear convergence failures IDAGetNumNonlinSolvConvFails

Array showing roots found IDAGetRootInfo

No. of calls to user root function IDAGetNumGEvals

Name of constant associated with a return ﬂag IDAGetReturnFlagName

IDAS initial conditions calculation

Number of backtrack operations IDAGetNumBacktrackops

Corrected initial conditions IDAGetConsistentIC

IDALS linear solver interface

Size of real and integer workspace IDAGetLinWorkSpace

No. of Jacobian evaluations IDAGetNumJacEvals

No. of residual calls for ﬁnite diﬀ. Jacobian[-vector] evals. IDAGetNumLinResEvals

No. of linear iterations IDAGetNumLinIters

No. of linear convergence failures IDAGetNumLinConvFails

No. of preconditioner evaluations IDAGetNumPrecEvals

No. of preconditioner solves IDAGetNumPrecSolves

No. of Jacobian-vector setup evaluations IDAGetNumJTSetupEvals

No. of Jacobian-vector product evaluations IDAGetNumJtimesEvals

Last return from a linear solver function IDAGetLastLinFlag

Name of constant associated with a return ﬂag IDAGetLinReturnFlagName

60 Using IDAS for IVP Solution

SUNDIALSGetVersion

Call flag = SUNDIALSGetVersion(version, len);

Description The function SUNDIALSGetVersion ﬁlls a character array with sundials version infor-

mation.

Arguments version (char *) character array to hold the sundials version information.

len (int) allocated length of the version character array.

Return value If successful, SUNDIALSGetVersion returns 0 and version contains the sundials ver-

sion information. Otherwise, it returns −1 and version is not set (the input character

array is too short).

Notes A string of 25 characters should be suﬃcient to hold the version information. Any

trailing characters in the version array are removed.

SUNDIALSGetVersionNumber

Call flag = SUNDIALSGetVersionNumber(&major, &minor, &patch, label, len);

Description The function SUNDIALSGetVersionNumber set integers for the sundials major, minor,

and patch release numbers and ﬁlls a character array with the release label if applicable.

Arguments major (int)sundials release major version number.

minor (int)sundials release minor version number.

patch (int)sundials release patch version number.

label (char *) character array to hold the sundials release label.

len (int) allocated length of the label character array.

Return value If successful, SUNDIALSGetVersionNumber returns 0 and the major,minor,patch, and

label values are set. Otherwise, it returns −1 and the values are not set (the input

character array is too short).

Notes A string of 10 characters should be suﬃcient to hold the label information. If a label

is not used in the release version, no information is copied to label. Any trailing

characters in the label array are removed.

4.5.10.2 Main solver optional output functions

idas provides several user-callable functions that can be used to obtain diﬀerent quantities that may

be of interest to the user, such as solver workspace requirements, solver performance statistics, as well

as additional data from the idas memory block (a suggested tolerance scaling factor, the error weight

vector, and the vector of estimated local errors). Also provided are functions to extract statistics

related to the performance of the sunnonlinsol nonlinear solver being used. As a convenience, ad-

ditional extraction functions provide the optional outputs in groups. These optional output functions

are described next.

IDAGetWorkSpace

Call flag = IDAGetWorkSpace(ida mem, &lenrw, &leniw);

Description The function IDAGetWorkSpace returns the idas real and integer workspace sizes.

Arguments ida mem (void *) pointer to the idas memory block.

lenrw (long int) number of real values in the idas workspace.

leniw (long int) number of integer values in the idas workspace.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

4.5 User-callable functions 61

Notes In terms of the problem size N, the maximum method order maxord, and the number

nrtfn of root functions (see §4.5.6), the actual size of the real workspace, in realtype

words, is given by the following:

•base value: lenrw = 55 + (m+ 6) ∗Nr+ 3∗nrtfn;

•with IDASVtolerances:lenrw =lenrw +Nr;

•with constraint checking (see IDASetConstraints): lenrw =lenrw +Nr;

•with id speciﬁed (see IDASetId): lenrw =lenrw +Nr;

where m= max(maxord,3), and Nris the number of real words in one N Vector (≈N).

The size of the integer workspace (without distinction between int and long int words)

is given by:

•base value: leniw = 38 + (m+ 6) ∗Ni+nrtfn;

•with IDASVtolerances:leniw =leniw +Ni;

•with constraint checking: lenrw =lenrw +Ni;

•with id speciﬁed: lenrw =lenrw +Ni;

where Niis the number of integer words in one N Vector (= 1 for nvector serial

and 2*npes for nvector parallel on npes processors).

For the default value of maxord, with no rootﬁnding, no id, no constraints, and with

no call to IDASVtolerances, these lengths are given roughly by: lenrw = 55 + 11N,

leniw = 49.

Note that additional memory is allocated if quadratures and/or forward sensitivity

integration is enabled. See §4.7.1 and §5.2.1 for more details.

IDAGetNumSteps

Call flag = IDAGetNumSteps(ida mem, &nsteps);

Description The function IDAGetNumSteps returns the cumulative number of internal steps taken

by the solver (total so far).

Arguments ida mem (void *) pointer to the idas memory block.

nsteps (long int) number of steps taken by idas.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDAGetNumResEvals

Call flag = IDAGetNumResEvals(ida mem, &nrevals);

Description The function IDAGetNumResEvals returns the number of calls to the user’s residual

evaluation function.

Arguments ida mem (void *) pointer to the idas memory block.

nrevals (long int) number of calls to the user’s res function.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

Notes The nrevals value returned by IDAGetNumResEvals does not account for calls made to

res from a linear solver or preconditioner module.

62 Using IDAS for IVP Solution

IDAGetNumLinSolvSetups

Call flag = IDAGetNumLinSolvSetups(ida mem, &nlinsetups);

Description The function IDAGetNumLinSolvSetups returns the cumulative number of calls made

to the linear solver’s setup function (total so far).

Arguments ida mem (void *) pointer to the idas memory block.

nlinsetups (long int) number of calls made to the linear solver setup function.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDAGetNumErrTestFails

Call flag = IDAGetNumErrTestFails(ida mem, &netfails);

Description The function IDAGetNumErrTestFails returns the cumulative number of local error

test failures that have occurred (total so far).

Arguments ida mem (void *) pointer to the idas memory block.

netfails (long int) number of error test failures.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDAGetLastOrder

Call flag = IDAGetLastOrder(ida mem, &klast);

Description The function IDAGetLastOrder returns the integration method order used during the

last internal step.

Arguments ida mem (void *) pointer to the idas memory block.

klast (int) method order used on the last internal step.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDAGetCurrentOrder

Call flag = IDAGetCurrentOrder(ida mem, &kcur);

Description The function IDAGetCurrentOrder returns the integration method order to be used on

the next internal step.

Arguments ida mem (void *) pointer to the idas memory block.

kcur (int) method order to be used on the next internal step.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

4.5 User-callable functions 63

IDAGetLastStep

Call flag = IDAGetLastStep(ida mem, &hlast);

Description The function IDAGetLastStep returns the integration step size taken on the last internal

step.

Arguments ida mem (void *) pointer to the idas memory block.

hlast (realtype) step size taken on the last internal step by idas, or last artiﬁcial

step size used in IDACalcIC, whichever was called last.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDAGetCurrentStep

Call flag = IDAGetCurrentStep(ida mem, &hcur);

Description The function IDAGetCurrentStep returns the integration step size to be attempted on

the next internal step.

Arguments ida mem (void *) pointer to the idas memory block.

hcur (realtype) step size to be attempted on the next internal step.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDAGetActualInitStep

Call flag = IDAGetActualInitStep(ida mem, &hinused);

Description The function IDAGetActualInitStep returns the value of the integration step size used

on the ﬁrst step.

Arguments ida mem (void *) pointer to the idas memory block.

hinused (realtype) actual value of initial step size.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

Notes Even if the value of the initial integration step size was speciﬁed by the user through

a call to IDASetInitStep, this value might have been changed by idas to ensure that

the step size is within the prescribed bounds (hmin ≤h0≤hmax), or to meet the local

error test.

IDAGetCurrentTime

Call flag = IDAGetCurrentTime(ida mem, &tcur);

Description The function IDAGetCurrentTime returns the current internal time reached by the

solver.

Arguments ida mem (void *) pointer to the idas memory block.

tcur (realtype) current internal time reached.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

64 Using IDAS for IVP Solution

IDAGetTolScaleFactor

Call flag = IDAGetTolScaleFactor(ida mem, &tolsfac);

Description The function IDAGetTolScaleFactor returns a suggested factor by which the user’s

tolerances should be scaled when too much accuracy has been requested for some internal

step.

Arguments ida mem (void *) pointer to the idas memory block.

tolsfac (realtype) suggested scaling factor for user tolerances.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDAGetErrWeights

Call flag = IDAGetErrWeights(ida mem, eweight);

Description The function IDAGetErrWeights returns the solution error weights at the current time.

These are the Wigiven by Eq. (2.7) (or by the user’s IDAEwtFn).

Arguments ida mem (void *) pointer to the idas memory block.

eweight (N Vector) solution error weights at the current time.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

Notes The user must allocate space for eweight.

IDAGetEstLocalErrors

Call flag = IDAGetEstLocalErrors(ida mem, ele);

Description The function IDAGetEstLocalErrors returns the estimated local errors.

Arguments ida mem (void *) pointer to the idas memory block.

ele (N Vector) estimated local errors at the current time.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

Notes The user must allocate space for ele.

The values returned in ele are only valid if IDASolve returned a non-negative value.

The ele vector, togther with the eweight vector from IDAGetErrWeights, can be used

to determine how the various components of the system contributed to the estimated

local error test. Speciﬁcally, that error test uses the RMS norm of a vector whose

components are the products of the components of these two vectors. Thus, for example,

if there were recent error test failures, the components causing the failures are those

with largest values for the products, denoted loosely as eweight[i]*ele[i].

IDAGetIntegratorStats

Call flag = IDAGetIntegratorStats(ida mem, &nsteps, &nrevals, &nlinsetups,

&netfails, &klast, &kcur, &hinused,

&hlast, &hcur, &tcur);

Description The function IDAGetIntegratorStats returns the idas integrator statistics as a group.

Arguments ida mem (void *) pointer to the idas memory block.

4.5 User-callable functions 65

nsteps (long int) cumulative number of steps taken by idas.

nrevals (long int) cumulative number of calls to the user’s res function.

nlinsetups (long int) cumulative number of calls made to the linear solver setup

function.

netfails (long int) cumulative number of error test failures.

klast (int) method order used on the last internal step.

kcur (int) method order to be used on the next internal step.

hinused (realtype) actual value of initial step size.

hlast (realtype) step size taken on the last internal step.

hcur (realtype) step size to be attempted on the next internal step.

tcur (realtype) current internal time reached.

Return value The return value flag (of type int) is one of

IDA SUCCESS the optional output values have been successfully set.

IDA MEM NULL the ida mem pointer is NULL.

IDAGetNumNonlinSolvIters

Call flag = IDAGetNumNonlinSolvIters(ida mem, &nniters);

Description The function IDAGetNumNonlinSolvIters returns the cumulative number of nonlinear

iterations performed.

Arguments ida mem (void *) pointer to the idas memory block.

nniters (long int) number of nonlinear iterations performed.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA MEM FAIL The sunnonlinsol module is NULL.

IDAGetNumNonlinSolvConvFails

Call flag = IDAGetNumNonlinSolvConvFails(ida mem, &nncfails);

Description The function IDAGetNumNonlinSolvConvFails returns the cumulative number of non-

linear convergence failures that have occurred.

Arguments ida mem (void *) pointer to the idas memory block.

nncfails (long int) number of nonlinear convergence failures.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDAGetNonlinSolvStats

Call flag = IDAGetNonlinSolvStats(ida mem, &nniters, &nncfails);

Description The function IDAGetNonlinSolvStats returns the idas nonlinear solver statistics as a

group.

Arguments ida mem (void *) pointer to the idas memory block.

nniters (long int) cumulative number of nonlinear iterations performed.

nncfails (long int) cumulative number of nonlinear convergence failures.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

66 Using IDAS for IVP Solution

IDA MEM NULL The ida mem pointer is NULL.

IDA MEM FAIL The sunnonlinsol module is NULL.

IDAGetReturnFlagName

Call name = IDAGetReturnFlagName(flag);

Description The function IDAGetReturnFlagName returns the name of the idas constant correspond-

ing to flag.

Arguments The only argument, of type int, is a return ﬂag from an idas function.

Return value The return value is a string containing the name of the corresponding constant.

4.5.10.3 Initial condition calculation optional output functions

IDAGetNumBcktrackOps

Call flag = IDAGetNumBacktrackOps(ida mem, &nbacktr);

Description The function IDAGetNumBacktrackOps returns the number of backtrack operations done

in the linesearch algorithm in IDACalcIC.

Arguments ida mem (void *) pointer to the idas memory block.

nbacktr (long int) the cumulative number of backtrack operations.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDAGetConsistentIC

Call flag = IDAGetConsistentIC(ida mem, yy0 mod, yp0 mod);

Description The function IDAGetConsistentIC returns the corrected initial conditions calculated

by IDACalcIC.

Arguments ida mem (void *) pointer to the idas memory block.

yy0 mod (N Vector) consistent solution vector.

yp0 mod (N Vector) consistent derivative vector.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA ILL INPUT The function was not called before the ﬁrst call to IDASolve.

IDA MEM NULL The ida mem pointer is NULL.

Notes If the consistent solution vector or consistent derivative vector is not desired, pass NULL

for the corresponding argument.

The user must allocate space for yy0 mod and yp0 mod (if not NULL).

4.5.10.4 Rootﬁnding optional output functions

There are two optional output functions associated with rootﬁnding.

4.5 User-callable functions 67

IDAGetRootInfo

Call flag = IDAGetRootInfo(ida mem, rootsfound);

Description The function IDAGetRootInfo returns an array showing which functions were found to

have a root.

Arguments ida mem (void *) pointer to the idas memory block.

rootsfound (int *) array of length nrtfn with the indices of the user functions gi

found to have a root. For i= 0,...,nrtfn −1, rootsfound[i]6= 0 if gihas a

root, and = 0 if not.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output values have been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

Notes Note that, for the components gifor which a root was found, the sign of rootsfound[i]

indicates the direction of zero-crossing. A value of +1 indicates that giis increasing,

while a value of −1 indicates a decreasing gi.

The user must allocate memory for the vector rootsfound.

IDAGetNumGEvals

Call flag = IDAGetNumGEvals(ida mem, &ngevals);

Description The function IDAGetNumGEvals returns the cumulative number of calls to the user root

function g.

Arguments ida mem (void *) pointer to the idas memory block.

ngevals (long int) number of calls to the user’s function gso far.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

4.5.10.5 idals linear solver interface optional output functions

The following optional outputs are available from the idals modules: workspace requirements, number

of calls to the Jacobian routine, number of calls to the residual routine for ﬁnite-diﬀerence Jacobian

or Jacobian-vector product approximation, number of linear iterations, number of linear convergence

failures, number of calls to the preconditioner setup and solve routines, number of calls to the Jacobian-

vector setup and product routines, and last return value from an idals function. Note that, where

the name of an output would otherwise conﬂict with the name of an optional output from the main

solver, a suﬃx LS (for Linear Solver) has been added (e.g., lenrwLS).

IDAGetLinWorkSpace

Call flag = IDAGetLinWorkSpace(ida mem, &lenrwLS, &leniwLS);

Description The function IDAGetLinWorkSpace returns the sizes of the real and integer workspaces

used by the idals linear solver interface.

Arguments ida mem (void *) pointer to the idas memory block.

lenrwLS (long int) the number of real values in the idals workspace.

leniwLS (long int) the number of integer values in the idals workspace.

Return value The return value flag (of type int) is one of

IDALS SUCCESS The optional output value has been successfully set.

IDALS MEM NULL The ida mem pointer is NULL.

IDALS LMEM NULL The idals linear solver has not been initialized.

68 Using IDAS for IVP Solution

Notes The workspace requirements reported by this routine correspond only to memory allo-

cated within this interface and to memory allocated by the sunlinsol object attached

to it. The template Jacobian matrix allocated by the user outside of idals is not

included in this report.

The previous routines IDADlsGetWorkspace and IDASpilsGetWorkspace are now wrap-

pers for this routine, and may still be used for backward-compatibility. However, these

will be deprecated in future releases, so we recommend that users transition to the new

routine name soon.

IDAGetNumJacEvals

Call flag = IDAGetNumJacEvals(ida mem, &njevals);

Description The function IDAGetNumJacEvals returns the cumulative number of calls to the idals

Jacobian approximation function.

Arguments ida mem (void *) pointer to the idas memory block.

njevals (long int) the cumulative number of calls to the Jacobian function (total so

far).

Return value The return value flag (of type int) is one of

IDALS SUCCESS The optional output value has been successfully set.

IDALS MEM NULL The ida mem pointer is NULL.

IDALS LMEM NULL The idals linear solver has not been initialized.

Notes The previous routine IDADlsGetNumJacEvals is now a wrapper for this routine, and

may still be used for backward-compatibility. However, this will be deprecated in future

releases, so we recommend that users transition to the new routine name soon.

IDAGetNumLinResEvals

Call flag = IDAGetNumLinResEvals(ida mem, &nrevalsLS);

Description The function IDAGetNumLinResEvals returns the cumulative number of calls to the user

residual function due to the ﬁnite diﬀerence Jacobian approximation or ﬁnite diﬀerence

Jacobian-vector product approximation.

Arguments ida mem (void *) pointer to the idas memory block.

nrevalsLS (long int) the cumulative number of calls to the user residual function.

Return value The return value flag (of type int) is one of

IDALS SUCCESS The optional output value has been successfully set.

IDALS MEM NULL The ida mem pointer is NULL.

IDALS LMEM NULL The idals linear solver has not been initialized.

Notes The value nrevalsLS is incremented only if one of the default internal diﬀerence quotient

functions is used.

The previous routines IDADlsGetNumRhsEvals and IDASpilsGetNumRhsEvals are now

wrappers for this routine, and may still be used for backward-compatibility. However,

these will be deprecated in future releases, so we recommend that users transition to

the new routine name soon.

IDAGetNumLinIters

Call flag = IDAGetNumLinIters(ida mem, &nliters);

Description The function IDAGetNumLinIters returns the cumulative number of linear iterations.

Arguments ida mem (void *) pointer to the idas memory block.

4.5 User-callable functions 69

nliters (long int) the current number of linear iterations.

Return value The return value flag (of type int) is one of

IDALS SUCCESS The optional output value has been successfully set.

IDALS MEM NULL The ida mem pointer is NULL.

IDALS LMEM NULL The idals linear solver has not been initialized.

Notes The previous routine IDASpilsGetNumLinIters is now a wrapper for this routine, and

may still be used for backward-compatibility. However, this will be deprecated in future

releases, so we recommend that users transition to the new routine name soon.

IDAGetNumLinConvFails

Call flag = IDAGetNumLinConvFails(ida mem, &nlcfails);

Description The function IDAGetNumLinConvFails returns the cumulative number of linear conver-

gence failures.

Arguments ida mem (void *) pointer to the idas memory block.

nlcfails (long int) the current number of linear convergence failures.

Return value The return value flag (of type int) is one of

IDALS SUCCESS The optional output value has been successfully set.

IDALS MEM NULL The ida mem pointer is NULL.

IDALS LMEM NULL The idals linear solver has not been initialized.

Notes The previous routine IDASpilsGetNumConvFails is now a wrapper for this routine, and

may still be used for backward-compatibility. However, this will be deprecated in future

releases, so we recommend that users transition to the new routine name soon.

IDAGetNumPrecEvals

Call flag = IDAGetNumPrecEvals(ida mem, &npevals);

Description The function IDAGetNumPrecEvals returns the cumulative number of preconditioner

evaluations, i.e., the number of calls made to psetup.

Arguments ida mem (void *) pointer to the idas memory block.

npevals (long int) the cumulative number of calls to psetup.

Return value The return value flag (of type int) is one of

IDALS SUCCESS The optional output value has been successfully set.

IDALS MEM NULL The ida mem pointer is NULL.

IDALS LMEM NULL The idals linear solver has not been initialized.

Notes The previous routine IDASpilsGetNumPrecEvals is now a wrapper for this routine, and

may still be used for backward-compatibility. However, this will be deprecated in future

releases, so we recommend that users transition to the new routine name soon.

IDAGetNumPrecSolves

Call flag = IDAGetNumPrecSolves(ida mem, &npsolves);

Description The function IDAGetNumPrecSolves returns the cumulative number of calls made to

the preconditioner solve function, psolve.

Arguments ida mem (void *) pointer to the idas memory block.

npsolves (long int) the cumulative number of calls to psolve.

Return value The return value flag (of type int) is one of

IDALS SUCCESS The optional output value has been successfully set.

70 Using IDAS for IVP Solution

IDALS MEM NULL The ida mem pointer is NULL.

IDALS LMEM NULL The idals linear solver has not been initialized.

Notes The previous routine IDASpilsGetNumPrecSolves is now a wrapper for this routine,

and may still be used for backward-compatibility. However, this will be deprecated in

future releases, so we recommend that users transition to the new routine name soon.

IDAGetNumJTSetupEvals

Call flag = IDAGetNumJTSetupEvals(ida mem, &njtsetup);

Description The function IDAGetNumJTSetupEvals returns the cumulative number of calls made to

the Jacobian-vector setup function jtsetup.

Arguments ida mem (void *) pointer to the idas memory block.

njtsetup (long int) the current number of calls to jtsetup.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA LMEM NULL The ida linear solver has not been initialized.

Notes The previous routine IDASpilsGetNumJTSetupEvals is now a wrapper for this routine,

and may still be used for backward-compatibility. However, this will be deprecated in

future releases, so we recommend that users transition to the new routine name soon.

IDAGetNumJtimesEvals

Call flag = IDAGetNumJtimesEvals(ida mem, &njvevals);

Description The function IDAGetNumJtimesEvals returns the cumulative number of calls made to

the Jacobian-vector function, jtimes.

Arguments ida mem (void *) pointer to the idas memory block.

njvevals (long int) the cumulative number of calls to jtimes.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA LMEM NULL The ida linear solver has not been initialized.

Notes The previous routine IDASpilsGetNumJtimesEvals is now a wrapper for this routine,

and may still be used for backward-compatibility. However, this will be deprecated in

future releases, so we recommend that users transition to the new routine name soon.

IDAGetLastLinFlag

Call flag = IDAGetLastLinFlag(ida mem, &lsflag);

Description The function IDAGetLastLinFlag returns the last return value from an idals routine.

Arguments ida mem (void *) pointer to the idas memory block.

lsflag (long int) the value of the last return ﬂag from an idals function.

Return value The return value flag (of type int) is one of

IDALS SUCCESS The optional output value has been successfully set.

IDALS MEM NULL The ida mem pointer is NULL.

IDALS LMEM NULL The idals linear solver has not been initialized.

4.5 User-callable functions 71

Notes If the idals setup function failed (i.e., IDASolve returned IDA LSETUP FAIL) when

using the sunlinsol dense or sunlinsol band modules, then the value of lsflag is

equal to the column index (numbered from one) at which a zero diagonal element was

encountered during the LU factorization of the (dense or banded) Jacobian matrix.

If the idals setup function failed when using another sunlinsol module, then lsflag

will be SUNLS PSET FAIL UNREC,SUNLS ASET FAIL UNREC, or

SUNLS PACKAGE FAIL UNREC.

If the idals solve function failed (IDASolve returned IDA LSOLVE FAIL), lsflag con-

tains the error return ﬂag from the sunlinsol object, which will be one of:

SUNLS MEM NULL, indicating that the sunlinsol memory is NULL;

SUNLS ATIMES FAIL UNREC, indicating an unrecoverable failure in the J∗vfunction;

SUNLS PSOLVE FAIL UNREC, indicating that the preconditioner solve function psolve

failed unrecoverably; SUNLS GS FAIL, indicating a failure in the Gram-Schmidt proce-

dure (generated only in spgmr or spfgmr); SUNLS QRSOL FAIL, indicating that the

matrix Rwas found to be singular during the QR solve phase (spgmr and spfgmr

only); or SUNLS PACKAGE FAIL UNREC, indicating an unrecoverable failure in an external

iterative linear solver package.

The previous routines IDADlsGetLastFlag and IDASpilsGetLastFlag are now wrap-

pers for this routine, and may still be used for backward-compatibility. However, these

will be deprecated in future releases, so we recommend that users transition to the new

routine name soon.

IDAGetLinReturnFlagName

Call name = IDAGetLinReturnFlagName(lsflag);

Description The function IDAGetLinReturnFlagName returns the name of the ida constant corre-

sponding to lsflag.

Arguments The only argument, of type long int, is a return ﬂag from an idals function.

Return value The return value is a string containing the name of the corresponding constant.

If 1 ≤lsflag ≤N(LU factorization failed), this function returns “NONE”.

Notes The previous routines IDADlsGetReturnFlagName and IDASpilsGetReturnFlagName

are now wrappers for this routine, and may still be used for backward-compatibility.

However, these will be deprecated in future releases, so we recommend that users tran-

sition to the new routine name soon.

4.5.11 IDAS reinitialization function

The function IDAReInit reinitializes the main idas solver for the solution of a new problem, where

a prior call to IDAInit has been made. The new problem must have the same size as the previous

one. IDAReInit performs the same input checking and initializations that IDAInit does, but does

no memory allocation, as it assumes that the existing internal memory is suﬃcient for the new prob-

lem. A call to IDAReInit deletes the solution history that was stored internally during the previous

integration. Following a successful call to IDAReInit, call IDASolve again for the solution of the new

problem.

The use of IDAReInit requires that the maximum method order, maxord, is no larger for the new

problem than for the problem speciﬁed in the last call to IDAInit. In addition, the same nvector

module set for the previous problem will be reused for the new problem.

If there are changes to the linear solver speciﬁcations, make the appropriate calls to either the

linear solver objects themselves, or to the idals interface routines, as described in §4.5.3.

If there are changes to any optional inputs, make the appropriate IDASet*** calls, as described in

§4.5.8. Otherwise, all solver inputs set previously remain in eﬀect.

72 Using IDAS for IVP Solution

One important use of the IDAReInit function is in the treating of jump discontinuities in the

residual function. Except in cases of fairly small jumps, it is usually more eﬃcient to stop at each point

of discontinuity and restart the integrator with a readjusted DAE model, using a call to IDAReInit.

To stop when the location of the discontinuity is known, simply make that location a value of tout. To

stop when the location of the discontinuity is determined by the solution, use the rootﬁnding feature.

In either case, it is critical that the residual function not incorporate the discontinuity, but rather have

a smooth extention over the discontinuity, so that the step across it (and subsequent rootﬁnding, if

used) can be done eﬃciently. Then use a switch within the residual function (communicated through

user data) that can be ﬂipped between the stopping of the integration and the restart, so that the

restarted problem uses the new values (which have jumped). Similar comments apply if there is to be

a jump in the dependent variable vector.

IDAReInit

Call flag = IDAReInit(ida mem, t0, y0, yp0);

Description The function IDAReInit provides required problem speciﬁcations and reinitializes idas.

Arguments ida mem (void *) pointer to the idas memory block.

t0 (realtype) is the initial value of t.

y0 (N Vector) is the initial value of y.

yp0 (N Vector) is the initial value of ˙y.

Return value The return value flag (of type int) will be one of the following:

IDA SUCCESS The call to IDAReInit was successful.

IDA MEM NULL The idas memory block was not initialized through a previous call to

IDACreate.

IDA NO MALLOC Memory space for the idas memory block was not allocated through a

previous call to IDAInit.

IDA ILL INPUT An input argument to IDAReInit has an illegal value.

Notes If an error occurred, IDAReInit also sends an error message to the error handler func-

tion.

4.6 User-supplied functions

The user-supplied functions consist of one function deﬁning the DAE residual, (optionally) a function

that handles error and warning messages, (optionally) a function that provides the error weight vector,

(optionally) one or two functions that provide Jacobian-related information for the linear solver, and

(optionally) one or two functions that deﬁne the preconditioner for use in any of the Krylov iteration

algorithms.

4.6.1 Residual function

The user must provide a function of type IDAResFn deﬁned as follows:

IDAResFn

Deﬁnition typedef int (*IDAResFn)(realtype tt, N Vector yy, N Vector yp,

N Vector rr, void *user data);

Purpose This function computes the problem residual for given values of the independent variable

t, state vector y, and derivative ˙y.

Arguments tt is the current value of the independent variable.

yy is the current value of the dependent variable vector, y(t).

yp is the current value of ˙y(t).

4.6 User-supplied functions 73

rr is the output residual vector F(t, y, ˙y).

user data is a pointer to user data, the same as the user data parameter passed to

IDASetUserData.

Return value An IDAResFn function type should return a value of 0 if successful, a positive value

if a recoverable error occurred (e.g., yy has an illegal value), or a negative value if a

nonrecoverable error occurred. In the last case, the integrator halts. If a recoverable

error occurred, the integrator will attempt to correct and retry.

Notes A recoverable failure error return from the IDAResFn is typically used to ﬂag a value

of the dependent variable ythat is “illegal” in some way (e.g., negative where only a

non-negative value is physically meaningful). If such a return is made, idas will attempt

to recover (possibly repeating the nonlinear solve, or reducing the step size) in order to

avoid this recoverable error return.

For eﬃciency reasons, the DAE residual function is not evaluated at the converged solu-

tion of the nonlinear solver. Therefore, in general, a recoverable error in that converged

value cannot be corrected. (It may be detected when the right-hand side function is

called the ﬁrst time during the following integration step, but a successful step cannot

be undone.) However, if the user program also includes quadrature integration, the

state variables can be checked for legality in the call to IDAQuadRhsFn, which is called

at the converged solution of the nonlinear system, and therefore idas can be ﬂagged to

attempt to recover from such a situation. Also, if sensitivity analysis is performed with

the staggered method, the DAE residual function is called at the converged solution of

the nonlinear system, and a recoverable error at that point can be ﬂagged, and idas

will then try to correct it.

Allocation of memory for yp is handled within idas.

4.6.2 Error message handler function

As an alternative to the default behavior of directing error and warning messages to the ﬁle pointed to

by errfp (see IDASetErrFile), the user may provide a function of type IDAErrHandlerFn to process

any such messages. The function type IDAErrHandlerFn is deﬁned as follows:

IDAErrHandlerFn

Deﬁnition typedef void (*IDAErrHandlerFn)(int error code, const char *module,

const char *function, char *msg,

void *eh data);

Purpose This function processes error and warning messages from idas and its sub-modules.

Arguments error code is the error code.

module is the name of the idas module reporting the error.

function is the name of the function in which the error occurred.

msg is the error message.

eh data is a pointer to user data, the same as the eh data parameter passed to

IDASetErrHandlerFn.

Return value A IDAErrHandlerFn function has no return value.

Notes error code is negative for errors and positive (IDA WARNING) for warnings. If a function

that returns a pointer to memory encounters an error, it sets error code to 0.

4.6.3 Error weight function

As an alternative to providing the relative and absolute tolerances, the user may provide a function of

type IDAEwtFn to compute a vector ewt containing the multiplicative weights Wiused in the WRMS

74 Using IDAS for IVP Solution

norm kvkWRMS =q(1/N )PN

1(Wi·vi)2. These weights will used in place of those deﬁned by Eq.

(2.7). The function type IDAEwtFn is deﬁned as follows:

IDAEwtFn

Deﬁnition typedef int (*IDAEwtFn)(N Vector y, N Vector ewt, void *user data);

Purpose This function computes the WRMS error weights for the vector y.

Arguments yis the value of the dependent variable vector at which the weight vector is

to be computed.

ewt is the output vector containing the error weights.

user data is a pointer to user data, the same as the user data parameter passed to

IDASetUserData.

Return value An IDAEwtFn function type must return 0 if it successfully set the error weights and −1

otherwise.

Notes Allocation of memory for ewt is handled within idas.

The error weight vector must have all components positive. It is the user’s responsiblity

to perform this test and return −1 if it is not satisﬁed.

4.6.4 Rootﬁnding function

If a rootﬁnding problem is to be solved during the integration of the DAE system, the user must

supply a Cfunction of type IDARootFn, deﬁned as follows:

IDARootFn

Deﬁnition typedef int (*IDARootFn)(realtype t, N Vector y, N Vector yp,

realtype *gout, void *user data);

Purpose This function computes a vector-valued function g(t, y, ˙y) such that the roots of the

nrtfn components gi(t, y, ˙y) are to be found during the integration.

Arguments tis the current value of the independent variable.

yis the current value of the dependent variable vector, y(t).

yp is the current value of ˙y(t), the t−derivative of y.

gout is the output array, of length nrtfn, with components gi(t, y, ˙y).

user data is a pointer to user data, the same as the user data parameter passed to

IDASetUserData.

Return value An IDARootFn should return 0 if successful or a non-zero value if an error occurred (in

which case the integration is halted and IDASolve returns IDA RTFUNC FAIL).

Notes Allocation of memory for gout is handled within idas.

4.6.5 Jacobian construction (matrix-based linear solvers)

If a matrix-based linear solver module is used (i.e. a non-NULL sunmatrix object was supplied to

IDASetLinearSolver), the user may provide a function of type IDALsJacFn deﬁned as follows:

IDALsJacFn

Deﬁnition typedef int (*IDALsJacFn)(realtype tt, realtype cj,

N Vector yy, N Vector yp, N Vector rr,

SUNMatrix Jac, void *user data,

N Vector tmp1, N Vector tmp2, N Vector tmp3);

4.6 User-supplied functions 75

Purpose This function computes the Jacobian matrix Jof the DAE system (or an approximation

to it), deﬁned by Eq. (2.6).

Arguments tt is the current value of the independent variable t.

cj is the scalar in the system Jacobian, proportional to the inverse of the step

size (αin Eq. (2.6) ).

yy is the current value of the dependent variable vector, y(t).

yp is the current value of ˙y(t).

rr is the current value of the residual vector F(t, y, ˙y).

Jac is the output (approximate) Jacobian matrix (of type SUNMatrix), J=

∂F/∂y +cj ∂F/∂ ˙y.

user data is a pointer to user data, the same as the user data parameter passed to

IDASetUserData.

tmp1

tmp2

tmp3 are pointers to memory allocated for variables of type N Vector which can

be used by IDALsJacFn function as temporary storage or work space.

Return value An IDALsJacFn should return 0 if successful, a positive value if a recoverable error

occurred, or a negative value if a nonrecoverable error occurred.

In the case of a recoverable eror return, the integrator will attempt to recover by reducing

the stepsize, and hence changing αin (2.6).

Notes Information regarding the structure of the speciﬁc sunmatrix structure (e.g., number

of rows, upper/lower bandwidth, sparsity type) may be obtained through using the

implementation-speciﬁc sunmatrix interface functions (see Chapter 8for details).

Prior to calling the user-supplied Jacobian function, the Jacobian matrix J(t, y) is zeroed

out, so only nonzero elements need to be loaded into Jac.

If the user’s IDALsJacFn function uses diﬀerence quotient approximations, it may need

to access quantities not in the call list. These quantities may include the current stepsize,

the error weights, etc. To obtain these, the user will need to add a pointer to ida mem to

user data and then use the IDAGet* functions described in §4.5.10.2. The unit roundoﬀ

can be accessed as UNIT ROUNDOFF deﬁned in sundials types.h.

dense:

A user-supplied dense Jacobian function must load the Neq ×Neq dense matrix Jac

with an approximation to the Jacobian matrix J(t, y, ˙y) at the point (tt,yy,yp). The

accessor macros SM ELEMENT D and SM COLUMN D allow the user to read and write dense

matrix elements without making explicit references to the underlying representation of

the sunmatrix dense type. SM ELEMENT D(J, i, j) references the (i,j)-th element

of the dense matrix Jac (with i,j= 0 . . . N−1). This macro is meant for small

problems for which eﬃciency of access is not a major concern. Thus, in terms of

the indices mand nranging from 1 to N, the Jacobian element Jm,n can be set using

the statement SM ELEMENT D(J, m-1, n-1) = Jm,n. Alternatively, SM COLUMN D(J, j)

returns a pointer to the ﬁrst element of the j-th column of Jac (with j= 0 . . . N−1),

and the elements of the j-th column can then be accessed using ordinary array indexing.

Consequently, Jm,n can be loaded using the statements col n = SM COLUMN D(J, n-1);

col n[m-1] = Jm,n. For large problems, it is more eﬃcient to use SM COLUMN D than to

use SM ELEMENT D. Note that both of these macros number rows and columns starting

from 0. The sunmatrix dense type and accessor macros are documented in §8.2.

banded:

A user-supplied banded Jacobian function must load the Neq ×Neq banded matrix

Jac with an approximation to the Jacobian matrix J(t, y, ˙y) at the point (tt,yy,yp).

The accessor macros SM ELEMENT B,SM COLUMN B, and SM COLUMN ELEMENT B allow the

76 Using IDAS for IVP Solution

user to read and write banded matrix elements without making speciﬁc references to

the underlying representation of the sunmatrix band type. SM ELEMENT B(J, i, j)

references the (i,j)-th element of the banded matrix Jac, counting from 0. This

macro is meant for use in small problems for which eﬃciency of access is not a major

concern. Thus, in terms of the indices mand nranging from 1 to Nwith (m, n)

within the band deﬁned by mupper and mlower, the Jacobian element Jm,n can be

loaded using the statement SM ELEMENT B(J, m-1, n-1) = Jm,n. The elements within

the band are those with -mupper ≤m-n ≤mlower. Alternatively, SM COLUMN B(J,

j) returns a pointer to the diagonal element of the j-th column of Jac, and if we

assign this address to realtype *col j, then the i-th element of the j-th column

is given by SM COLUMN ELEMENT B(col j, i, j), counting from 0. Thus, for (m, n)

within the band, Jm,n can be loaded by setting col n = SM COLUMN B(J, n-1); and

SM COLUMN ELEMENT B(col n, m-1, n-1) = Jm,n. The elements of the j-th column

can also be accessed via ordinary array indexing, but this approach requires knowledge

of the underlying storage for a band matrix of type sunmatrix band. The array col n

can be indexed from −mupper to mlower. For large problems, it is more eﬃcient to

use SM COLUMN B and SM COLUMN ELEMENT B than to use the SM ELEMENT B macro. As

in the dense case, these macros all number rows and columns starting from 0. The

sunmatrix band type and accessor macros are documented in §8.3.

sparse:

A user-supplied sparse Jacobian function must load the Neq ×Neq compressed-sparse-

column or compressed-sparse-row matrix Jac with an approximation to the Jacobian

matrix J(t, y, ˙y) at the point (tt,yy,yp). Storage for Jac already exists on entry to

this function, although the user should ensure that suﬃcient space is allocated in Jac

to hold the nonzero values to be set; if the existing space is insuﬃcient the user may

reallocate the data and index arrays as needed. The amount of allocated space in a

sunmatrix sparse object may be accessed using the macro SM NNZ S or the routine

SUNSparseMatrix NNZ. The sunmatrix sparse type and accessor macros are docu-

mented in §8.4.

The previous function type IDADlsJacFn is identical to IDALsJacFn, and may still be

used for backward-compatibility. However, this will be deprecated in future releases, so

we recommend that users transition to the new function type name soon.

4.6.6 Jacobian-vector product (matrix-free linear solvers)

If a matrix-free linear solver is to be used (i.e., a NULL-valued sunmatrix was supplied to

IDASetLinearSolver), the user may provide a function of type IDALsJacTimesVecFn in the following

form, to compute matrix-vector products Jv. If such a function is not supplied, the default is a

diﬀerence quotient approximation to these products.

IDALsJacTimesVecFn

Deﬁnition typedef int (*IDALsJacTimesVecFn)(realtype tt, N Vector yy,

N Vector yp, N Vector rr,

N Vector v, N Vector Jv,

realtype cj, void *user data,

N Vector tmp1, N Vector tmp2);

Purpose This function computes the product Jv of the DAE system Jacobian J(or an approxi-

mation to it) and a given vector v, where Jis deﬁned by Eq. (2.6).

Arguments tt is the current value of the independent variable.

yy is the current value of the dependent variable vector, y(t).

yp is the current value of ˙y(t).

rr is the current value of the residual vector F(t, y, ˙y).

4.6 User-supplied functions 77

vis the vector by which the Jacobian must be multiplied to the right.

Jv is the computed output vector.

cj is the scalar in the system Jacobian, proportional to the inverse of the step

size (αin Eq. (2.6) ).

user data is a pointer to user data, the same as the user data parameter passed to

IDASetUserData.

tmp1

tmp2 are pointers to memory allocated for variables of type N Vector which can

be used by IDALsJacTimesVecFn as temporary storage or work space.

Return value The value returned by the Jacobian-times-vector function should be 0 if successful. A

nonzero value indicates that a nonrecoverable error occurred.

Notes This function must return a value of J∗vthat uses the current value of J, i.e. as

evaluated at the current (t, y, ˙y).

If the user’s IDALsJacTimesVecFn function uses diﬀerence quotient approximations, it

may need to access quantities not in the call list. These include the current stepsize, the

error weights, etc. To obtain these, the user will need to add a pointer to ida mem to

user data and then use the IDAGet* functions described in §4.5.10.2. The unit roundoﬀ

can be accessed as UNIT ROUNDOFF deﬁned in sundials types.h.

The previous function type IDASpilsJacTimesVecFn is identical to

IDALsJacTimesVecFn, and may still be used for backward-compatibility. However, this

will be deprecated in future releases, so we recommend that users transition to the new

function type name soon.

4.6.7 Jacobian-vector product setup (matrix-free linear solvers)

If the user’s Jacobian-times-vector requires that any Jacobian-related data be preprocessed or evalu-

ated, then this needs to be done in a user-supplied function of type IDALsJacTimesSetupFn, deﬁned

as follows:

IDAJacTimesSetupFn

Deﬁnition typedef int (*IDAJacTimesSetupFn)(realtype tt, N Vector yy,

N Vector yp, N Vector rr,

realtype cj, void *user data);

Purpose This function preprocesses and/or evaluates Jacobian data needed by the Jacobian-

times-vector routine.

Arguments tt is the current value of the independent variable.

yy is the current value of the dependent variable vector, y(t).

yp is the current value of ˙y(t).

rr is the current value of the residual vector F(t, y, ˙y).

cj is the scalar in the system Jacobian, proportional to the inverse of the step

size (αin Eq. (2.6) ).

user data is a pointer to user data, the same as the user data parameter passed to

IDASetUserData.

Return value The value returned by the Jacobian-vector setup function should be 0 if successful,

positive for a recoverable error (in which case the step will be retried), or negative for

an unrecoverable error (in which case the integration is halted).

Notes Each call to the Jacobian-vector setup function is preceded by a call to the IDAResFn

user function with the same (t,y, yp) arguments. Thus, the setup function can use any

auxiliary data that is computed and saved during the evaluation of the DAE residual.

78 Using IDAS for IVP Solution

If the user’s IDALsJacTimesVecFn function uses diﬀerence quotient approximations, it

may need to access quantities not in the call list. These include the current stepsize, the

error weights, etc. To obtain these, the user will need to add a pointer to ida mem to

user data and then use the IDAGet* functions described in §4.5.10.2. The unit roundoﬀ

can be accessed as UNIT ROUNDOFF deﬁned in sundials types.h.

The previous function type IDASpilsJacTimesSetupFn is identical to

IDALsJacTimesSetupFn, and may still be used for backward-compatibility. However,

this will be deprecated in future releases, so we recommend that users transition to the

new function type name soon.

4.6.8 Preconditioner solve (iterative linear solvers)

If a user-supplied preconditioner is to be used with a sunlinsol solver module, then the user must

provide a function to solve the linear system P z =rwhere Pis a left preconditioner matrix which

approximates (at least crudely) the Jacobian matrix J=∂F/∂y +cj ∂F/∂ ˙y. This function must be

of type IDALsPrecSolveFn, deﬁned as follows:

IDALsPrecSolveFn

Deﬁnition typedef int (*IDALsPrecSolveFn)(realtype tt, N Vector yy,

N Vector yp, N Vector rr,

N Vector rvec, N Vector zvec,

realtype cj, realtype delta,

void *user data);

Purpose This function solves the preconditioning system P z =r.

Arguments tt is the current value of the independent variable.

yy is the current value of the dependent variable vector, y(t).

yp is the current value of ˙y(t).

rr is the current value of the residual vector F(t, y, ˙y).

rvec is the right-hand side vector rof the linear system to be solved.

zvec is the computed output vector.

cj is the scalar in the system Jacobian, proportional to the inverse of the step

size (αin Eq. (2.6) ).

delta is an input tolerance to be used if an iterative method is employed in the

solution. In that case, the residual vector Res =r−P z of the system should

be made less than delta in weighted l2norm, i.e., pPi(Resi·ewti)2<

delta. To obtain the N Vector ewt, call IDAGetErrWeights (see §4.5.10.2).

user data is a pointer to user data, the same as the user data parameter passed to

the function IDASetUserData.

Return value The value to be returned by the preconditioner solve function is a ﬂag indicating whether

it was successful. This value should be 0 if successful, positive for a recoverable error

(in which case the step will be retried), negative for an unrecoverable error (in which

case the integration is halted).

Notes The previous function type IDASpilsPrecSolveFn is identical to IDALsPrecSolveFn,

and may still be used for backward-compatibility. However, this will be deprecated in

future releases, so we recommend that users transition to the new function type name

soon.

4.6.9 Preconditioner setup (iterative linear solvers)

If the user’s preconditioner requires that any Jacobian-related data be evaluated or preprocessed, then

this needs to be done in a user-supplied function of type IDALsPrecSetupFn, deﬁned as follows:

4.7 Integration of pure quadrature equations 79

IDALsPrecSetupFn

Deﬁnition typedef int (*IDALsPrecSetupFn)(realtype tt, N Vector yy,

N Vector yp, N Vector rr,

realtype cj, void *user data);

Purpose This function evaluates and/or preprocesses Jacobian-related data needed by the pre-

conditioner.

Arguments tt is the current value of the independent variable.

yy is the current value of the dependent variable vector, y(t).

yp is the current value of ˙y(t).

rr is the current value of the residual vector F(t, y, ˙y).

cj is the scalar in the system Jacobian, proportional to the inverse of the step

size (αin Eq. (2.6) ).

user data is a pointer to user data, the same as the user data parameter passed to

the function IDASetUserData.

Return value The value returned by the preconditioner setup function is a ﬂag indicating whether it

was successful. This value should be 0 if successful, positive for a recoverable error (in

which case the step will be retried), negative for an unrecoverable error (in which case

the integration is halted).

Notes The operations performed by this function might include forming a crude approximate

Jacobian, and performing an LU factorization on the resulting approximation.

Each call to the preconditioner setup function is preceded by a call to the IDAResFn

user function with the same (tt,yy,yp) arguments. Thus the preconditioner setup

function can use any auxiliary data that is computed and saved during the evaluation

of the DAE residual.

This function is not called in advance of every call to the preconditioner solve function,

but rather is called only as often as needed to achieve convergence in the nonlinear

solver.

If the user’s IDALsPrecSetupFn function uses diﬀerence quotient approximations, it

may need to access quantities not in the call list. These include the current stepsize,

the error weights, etc. To obtain these, the user will need to add a pointer to ida mem to

user data and then use the IDAGet* functions described in §4.5.10.2. The unit roundoﬀ

can be accessed as UNIT ROUNDOFF deﬁned in sundials types.h.

The previous function type IDASpilsPrecSetupFn is identical to IDALsPrecSetupFn,

and may still be used for backward-compatibility. However, this will be deprecated in

future releases, so we recommend that users transition to the new function type name

soon.

4.7 Integration of pure quadrature equations

idas allows the DAE system to include pure quadratures. In this case, it is more eﬃcient to treat

the quadratures separately by excluding them from the nonlinear solution stage. To do this, begin

by excluding the quadrature variables from the vectors yy and yp and the quadrature equations from

within res. Thus a separate vector yQ of quadrature variables is to satisfy (d/dt)yQ =fQ(t, y, ˙y). The

following is an overview of the sequence of calls in a user’s main program in this situation. Steps that

are unchanged from the skeleton program presented in §4.4 are grayed out.

1. Initialize parallel or multi-threaded environment, if appropriate

2. Set problem dimensions, etc.

80 Using IDAS for IVP Solution

This generally includes N, the problem size N(excluding quadrature variables), Nq, the number

of quadrature variables, and may include the local vector length Nlocal (excluding quadrature

variables), and local number of quadrature variables Nqlocal.

3. Set vectors of initial values

4. Create idas object

5. Initialize idas solver

6. Specify integration tolerances

7. Create matrix object

8. Create linear solver object

9. Set linear solver optional inputs

10. Attach linear solver module

11. Set optional inputs

12. Create nonlinear solver object

13. Attach nonlinear solver module

14. Set nonlinear solver optional inputs

15. Correct initial values

16. Set vector of initial values for quadrature variables

Typically, the quadrature variables should be initialized to 0.

17. Initialize quadrature integration

Call IDAQuadInit to specify the quadrature equation right-hand side function and to allocate

internal memory related to quadrature integration. See §4.7.1 for details.

18. Set optional inputs for quadrature integration

Call IDASetQuadErrCon to indicate whether or not quadrature variables should be used in the

step size control mechanism. If so, one of the IDAQuad*tolerances functions must be called to

specify the integration tolerances for quadrature variables. See §4.7.4 for details.

19. Advance solution in time

20. Extract quadrature variables

Call IDAGetQuad or IDAGetQuadDky to obtain the values of the quadrature variables or their

derivatives at the current time. See §4.7.3 for details.

21. Get optional outputs

22. Get quadrature optional outputs

Call IDAGetQuad* functions to obtain optional output related to the integration of quadratures.

See §4.7.5 for details.

23. Deallocate memory for solution vectors and for the vector of quadrature variables

24. Free solver memory

4.7 Integration of pure quadrature equations 81

25. Free nonlinear solver memory

26. Free linear solver and matrix memory

27. Finalize MPI, if used

IDAQuadInit can be called and quadrature-related optional inputs (step 18 above) can be set, any-

where between steps 4and 19.

4.7.1 Quadrature initialization and deallocation functions

The function IDAQuadInit activates integration of quadrature equations and allocates internal mem-

ory related to these calculations. The form of the call to this function is as follows:

IDAQuadInit

Call flag = IDAQuadInit(ida mem, rhsQ, yQ0);

Description The function IDAQuadInit provides required problem speciﬁcations, allocates internal

memory, and initializes quadrature integration.

Arguments ida mem (void *) pointer to the idas memory block returned by IDACreate.

rhsQ (IDAQuadRhsFn) is the Cfunction which computes fQ, the right-hand side of

the quadrature equations. This function has the form fQ(t, yy, yp, rhsQ,

user data) (for full details see §4.7.6).

yQ0 (N Vector) is the initial value of yQ.

Return value The return value flag (of type int) will be one of the following:

IDA SUCCESS The call to IDAQuadInit was successful.

IDA MEM NULL The idas memory was not initialized by a prior call to IDACreate.

IDA MEM FAIL A memory allocation request failed.

Notes If an error occurred, IDAQuadInit also sends an error message to the error handler

function.

In terms of the number of quadrature variables Nqand maximum method order maxord, the size of

the real workspace is increased as follows:

•Base value: lenrw =lenrw + (maxord+5)Nq

•If IDAQuadSVtolerances is called: lenrw =lenrw +Nq

and the size of the integer workspace is increased as follows:

•Base value: leniw =leniw + (maxord+5)Nq

•If IDAQuadSVtolerances is called: leniw =leniw +Nq

The function IDAQuadReInit, useful during the solution of a sequence of problems of same size,

reinitializes the quadrature-related internal memory and must follow a call to IDAQuadInit (and

maybe a call to IDAReInit). The number Nq of quadratures is assumed to be unchanged from the

prior call to IDAQuadInit. The call to the IDAQuadReInit function has the following form:

IDAQuadReInit

Call flag = IDAQuadReInit(ida mem, yQ0);

Description The function IDAQuadReInit provides required problem speciﬁcations and reinitializes

the quadrature integration.

Arguments ida mem (void *) pointer to the idas memory block.

yQ0 (N Vector) is the initial value of yQ.

82 Using IDAS for IVP Solution

Return value The return value flag (of type int) will be one of the following:

IDA SUCCESS The call to IDAReInit was successful.

IDA MEM NULL The idas memory was not initialized by a prior call to IDACreate.

IDA NO QUAD Memory space for the quadrature integration was not allocated by a prior

call to IDAQuadInit.

Notes If an error occurred, IDAQuadReInit also sends an error message to the error handler

function.

IDAQuadFree

Call IDAQuadFree(ida mem);

Description The function IDAQuadFree frees the memory allocated for quadrature integration.

Arguments The argument is the pointer to the idas memory block (of type void *).

Return value The function IDAQuadFree has no return value.

Notes In general, IDAQuadFree need not be called by the user as it is invoked automatically

by IDAFree.

4.7.2 IDAS solver function

Even if quadrature integration was enabled, the call to the main solver function IDASolve is exactly the

same as in §4.5.7. However, in this case the return value flag can also be one of the following:

IDA QRHS FAIL The quadrature right-hand side function failed in an unrecoverable man-

ner.

IDA FIRST QRHS ERR The quadrature right-hand side function failed at the ﬁrst call.

IDA REP QRHS ERR Convergence test failures occurred too many times due to repeated recov-

erable errors in the quadrature right-hand side function. This value will

also be returned if the quadrature right-hand side function had repeated

recoverable errors during the estimation of an initial step size (assuming

the quadrature variables are included in the error tests).

4.7.3 Quadrature extraction functions

If quadrature integration has been initialized by a call to IDAQuadInit, or reinitialized by a call to

IDAQuadReInit, then idas computes both a solution and quadratures at time t. However, IDASolve

will still return only the solution yin y. Solution quadratures can be obtained using the following

function:

IDAGetQuad

Call flag = IDAGetQuad(ida mem, &tret, yQ);

Description The function IDAGetQuad returns the quadrature solution vector after a successful return

from IDASolve.

Arguments ida mem (void *) pointer to the memory previously allocated by IDAInit.

tret (realtype) the time reached by the solver (output).

yQ (N Vector) the computed quadrature vector.

Return value The return value flag of IDAGetQuad is one of:

IDA SUCCESS IDAGetQuad was successful.

IDA MEM NULL ida mem was NULL.

IDA NO QUAD Quadrature integration was not initialized.

IDA BAD DKY yQ is NULL.

4.7 Integration of pure quadrature equations 83

The function IDAGetQuadDky computes the k-th derivatives of the interpolating polynomials for the

quadrature variables at time t. This function is called by IDAGetQuad with k=0and with the current

time at which IDASolve has returned, but may also be called directly by the user.

IDAGetQuadDky

Call flag = IDAGetQuadDky(ida mem, t, k, dkyQ);

Description The function IDAGetQuadDky returns derivatives of the quadrature solution vector after

a successful return from IDASolve.

Arguments ida mem (void *) pointer to the memory previously allocated by IDAInit.

t(realtype) the time at which quadrature information is requested. The time

tmust fall within the interval deﬁned by the last successful step taken by idas.

k(int) order of the requested derivative. This must be ≤klast.

dkyQ (N Vector) the vector containing the derivative. This vector must be allocated

by the user.

Return value The return value flag of IDAGetQuadDky is one of:

IDA SUCCESS IDAGetQuadDky succeeded.

IDA MEM NULL The pointer to ida mem was NULL.

IDA NO QUAD Quadrature integration was not initialized.

IDA BAD DKY The vector dkyQ is NULL.

IDA BAD K k is not in the range 0,1, ..., klast.

IDA BAD T The time tis not in the allowed range.

4.7.4 Optional inputs for quadrature integration

idas provides the following optional input functions to control the integration of quadrature equa-

tions.

IDASetQuadErrCon

Call flag = IDASetQuadErrCon(ida mem, errconQ);

Description The function IDASetQuadErrCon speciﬁes whether or not the quadrature variables are

to be used in the step size control mechanism within idas. If they are, the user must

call either IDAQuadSStolerances or IDAQuadSVtolerances to specify the integration

tolerances for the quadrature variables.

Arguments ida mem (void *) pointer to the idas memory block.

errconQ (booleantype) speciﬁes whether quadrature variables are included (SUNTRUE)

or not (SUNFALSE) in the error control mechanism.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL

IDA NO QUAD Quadrature integration has not been initialized.

Notes By default, errconQ is set to SUNFALSE.

It is illegal to call IDASetQuadErrCon before a call to IDAQuadInit.

If the quadrature variables are part of the step size control mechanism, one of the following

functions must be called to specify the integration tolerances for quadrature variables.

84 Using IDAS for IVP Solution

IDAQuadSStolerances

Call flag = IDAQuadSVtolerances(ida mem, reltolQ, abstolQ);

Description The function IDAQuadSStolerances speciﬁes scalar relative and absolute tolerances.

Arguments ida mem (void *) pointer to the idas memory block.

reltolQ (realtype) is the scalar relative error tolerance.

abstolQ (realtype) is the scalar absolute error tolerance.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional value has been successfully set.

IDA NO QUAD Quadrature integration was not initialized.

IDA MEM NULL The ida mem pointer is NULL.

IDA ILL INPUT One of the input tolerances was negative.

IDAQuadSVtolerances

Call flag = IDAQuadSVtolerances(ida mem, reltolQ, abstolQ);

Description The function IDAQuadSVtolerances speciﬁes scalar relative and vector absolute toler-

ances.

Arguments ida mem (void *) pointer to the idas memory block.

reltolQ (realtype) is the scalar relative error tolerance.

abstolQ (N Vector) is the vector absolute error tolerance.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional value has been successfully set.

IDA NO QUAD Quadrature integration was not initialized.

IDA MEM NULL The ida mem pointer is NULL.

IDA ILL INPUT One of the input tolerances was negative.

4.7.5 Optional outputs for quadrature integration

idas provides the following functions that can be used to obtain solver performance information

related to quadrature integration.

IDAGetQuadNumRhsEvals

Call flag = IDAGetQuadNumRhsEvals(ida mem, &nrhsQevals);

Description The function IDAGetQuadNumRhsEvals returns the number of calls made to the user’s

quadrature right-hand side function.

Arguments ida mem (void *) pointer to the idas memory block.

nrhsQevals (long int) number of calls made to the user’s rhsQ function.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA NO QUAD Quadrature integration has not been initialized.

4.7 Integration of pure quadrature equations 85

IDAGetQuadNumErrTestFails

Call flag = IDAGetQuadNumErrTestFails(ida mem, &nQetfails);

Description The function IDAGetQuadNumErrTestFails returns the number of local error test fail-

ures due to quadrature variables.

Arguments ida mem (void *) pointer to the idas memory block.

nQetfails (long int) number of error test failures due to quadrature variables.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA NO QUAD Quadrature integration has not been initialized.

IDAGetQuadErrWeights

Call flag = IDAGetQuadErrWeights(ida mem, eQweight);

Description The function IDAGetQuadErrWeights returns the quadrature error weights at the cur-

rent time.

Arguments ida mem (void *) pointer to the idas memory block.

eQweight (N Vector) quadrature error weights at the current time.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA NO QUAD Quadrature integration has not been initialized.

Notes The user must allocate memory for eQweight.

If quadratures were not included in the error control mechanism (through a call to

IDASetQuadErrCon with errconQ = SUNTRUE), IDAGetQuadErrWeights does not set

the eQweight vector.

IDAGetQuadStats

Call flag = IDAGetQuadStats(ida mem, &nrhsQevals, &nQetfails);

Description The function IDAGetQuadStats returns the idas integrator statistics as a group.

Arguments ida mem (void *) pointer to the idas memory block.

nrhsQevals (long int) number of calls to the user’s rhsQ function.

nQetfails (long int) number of error test failures due to quadrature variables.

Return value The return value flag (of type int) is one of

IDA SUCCESS the optional output values have been successfully set.

IDA MEM NULL the ida mem pointer is NULL.

IDA NO QUAD Quadrature integration has not been initialized.

4.7.6 User-supplied function for quadrature integration

For integration of quadrature equations, the user must provide a function that deﬁnes the right-hand

side of the quadrature equations (in other words, the integrand function of the integral that must be

evaluated). This function must be of type IDAQuadRhsFn deﬁned as follows:

86 Using IDAS for IVP Solution

IDAQuadRhsFn

Deﬁnition typedef int (*IDAQuadRhsFn)(realtype t, N Vector yy, N Vector yp,

N Vector rhsQ, void *user data);

Purpose This function computes the quadrature equation right-hand side for a given value of the

independent variable tand state vectors yand ˙y.

Arguments tis the current value of the independent variable.

yy is the current value of the dependent variable vector, y(t).

yp is the current value of the dependent variable derivative vector, ˙y(t).

rhsQ is the output vector fQ(t, y, ˙y).

user data is the user data pointer passed to IDASetUserData.

Return value A IDAQuadRhsFn should return 0 if successful, a positive value if a recoverable error

occurred (in which case idas will attempt to correct), or a negative value if it failed

unrecoverably (in which case the integration is halted and IDA QRHS FAIL is returned).

Notes Allocation of memory for rhsQ is automatically handled within idas.

Both yand rhsQ are of type N Vector, but they typically have diﬀerent internal repre-

sentations. It is the user’s responsibility to access the vector data consistently (including

the use of the correct accessor macros from each nvector implementation). For the

sake of computational eﬃciency, the vector functions in the two nvector implementa-

tions provided with idas do not perform any consistency checks with respect to their

NVector arguments (see §7.2 and §7.3).

There is one situation in which recovery is not possible even if IDAQuadRhsFn function

returns a recoverable error ﬂag. This is when this occurs at the very ﬁrst call to the

IDAQuadRhsFn (in which case idas returns IDA FIRST QRHS ERR).

4.8 A parallel band-block-diagonal preconditioner module

A principal reason for using a parallel DAE solver such as idas lies in the solution of partial diﬀerential

equations (PDEs). Moreover, the use of a Krylov iterative method for the solution of many such

problems is motivated by the nature of the underlying linear system of equations (2.5) that must be

solved at each time step. The linear algebraic system is large, sparse, and structured. However, if a

Krylov iterative method is to be eﬀective in this setting, then a nontrivial preconditioner needs to be

used. Otherwise, the rate of convergence of the Krylov iterative method is usually unacceptably slow.

Unfortunately, an eﬀective preconditioner tends to be problem-speciﬁc.

However, we have developed one type of preconditioner that treats a rather broad class of PDE-

based problems. It has been successfully used for several realistic, large-scale problems [32] and is

included in a software module within the idas package. This module works with the parallel vector

module nvector parallel and generates a preconditioner that is a block-diagonal matrix with each

block being a band matrix. The blocks need not have the same number of super- and sub-diagonals,

and these numbers may vary from block to block. This Band-Block-Diagonal Preconditioner module

is called idabbdpre.

One way to envision these preconditioners is to think of the domain of the computational PDE

problem as being subdivided into Mnon-overlapping sub-domains. Each of these sub-domains is then

assigned to one of the Mprocessors to be used to solve the DAE system. The basic idea is to isolate the

preconditioning so that it is local to each processor, and also to use a (possibly cheaper) approximate

residual function. This requires the deﬁnition of a new function G(t, y, ˙y) which approximates the

function F(t, y, ˙y) in the deﬁnition of the DAE system (2.1). However, the user may set G=F.

Corresponding to the domain decomposition, there is a decomposition of the solution vectors yand ˙y

into Mdisjoint blocks ymand ˙ym, and a decomposition of Ginto blocks Gm. The block Gmdepends

on ymand ˙ym, and also on components of ym0and ˙ym0associated with neighboring sub-domains

4.8 A parallel band-block-diagonal preconditioner module 87

(so-called ghost-cell data). Let ¯ymand ¯

˙ymdenote ymand ˙ym(respectively) augmented with those

other components on which Gmdepends. Then we have

G(t, y, ˙y)=[G1(t, ¯y1,¯

˙y1), G2(t, ¯y2,¯

˙y2), . . . , GM(t, ¯yM,¯

˙yM)]T,(4.1)

and each of the blocks Gm(t, ¯ym,¯

˙ym) is uncoupled from the others.

The preconditioner associated with this decomposition has the form

P=diag[P1, P2, . . . , PM] (4.2)

where

Pm≈∂Gm/∂ym+α∂Gm/∂ ˙ym(4.3)

This matrix is taken to be banded, with upper and lower half-bandwidths mudq and mldq deﬁned as

the number of non-zero diagonals above and below the main diagonal, respectively. The diﬀerence

quotient approximation is computed using mudq +mldq +2 evaluations of Gm, but only a matrix of

bandwidth mukeep +mlkeep +1 is retained.

Neither pair of parameters need be the true half-bandwidths of the Jacobians of the local block of

G, if smaller values provide a more eﬃcient preconditioner. Such an eﬃciency gain may occur if the

couplings in the DAE system outside a certain bandwidth are considerably weaker than those within

the band. Reducing mukeep and mlkeep while keeping mudq and mldq at their true values, discards

the elements outside the narrower band. Reducing both pairs has the additional eﬀect of lumping the

outer Jacobian elements into the computed elements within the band, and requires more caution and

experimentation.

The solution of the complete linear system

P x =b(4.4)

reduces to solving each of the equations

Pmxm=bm(4.5)

and this is done by banded LU factorization of Pmfollowed by a banded backsolve.

Similar block-diagonal preconditioners could be considered with diﬀerent treatment of the blocks

Pm. For example, incomplete LU factorization or an iterative method could be used instead of banded

LU factorization.

The idabbdpre module calls two user-provided functions to construct P: a required function

Gres (of type IDABBDLocalFn) which approximates the residual function G(t, y, ˙y)≈F(t, y, ˙y) and

which is computed locally, and an optional function Gcomm (of type IDABBDCommFn) which performs

all inter-process communication necessary to evaluate the approximate residual G. These are in

addition to the user-supplied residual function res. Both functions take as input the same pointer

user data as passed by the user to IDASetUserData and passed to the user’s function res. The user

is responsible for providing space (presumably within user data) for components of yy and yp that

are communicated by Gcomm from the other processors, and that are then used by Gres, which should

not do any communication.

IDABBDLocalFn

Deﬁnition typedef int (*IDABBDLocalFn)(sunindextype Nlocal, realtype tt,

N Vector yy, N Vector yp, N Vector gval,

void *user data);

Purpose This Gres function computes G(t, y, ˙y). It loads the vector gval as a function of tt,

yy, and yp.

Arguments Nlocal is the local vector length.

tt is the value of the independent variable.

yy is the dependent variable.

yp is the derivative of the dependent variable.

88 Using IDAS for IVP Solution

gval is the output vector.

user data is a pointer to user data, the same as the user data parameter passed to

IDASetUserData.

Return value An IDABBDLocalFn function type should return 0 to indicate success, 1 for a recoverable

error, or -1 for a non-recoverable error.

Notes This function must assume that all inter-processor communication of data needed to

calculate gval has already been done, and this data is accessible within user data.

The case where Gis mathematically identical to Fis allowed.

IDABBDCommFn

Deﬁnition typedef int (*IDABBDCommFn)(sunindextype Nlocal, realtype tt,

N Vector yy, N Vector yp, void *user data);

Purpose This Gcomm function performs all inter-processor communications necessary for the ex-

ecution of the Gres function above, using the input vectors yy and yp.

Arguments Nlocal is the local vector length.

tt is the value of the independent variable.

yy is the dependent variable.

yp is the derivative of the dependent variable.

user data is a pointer to user data, the same as the user data parameter passed to

IDASetUserData.

Return value An IDABBDCommFn function type should return 0 to indicate success, 1 for a recoverable

error, or -1 for a non-recoverable error.

Notes The Gcomm function is expected to save communicated data in space deﬁned within the

structure user data.

Each call to the Gcomm function is preceded by a call to the residual function res with

the same (tt,yy,yp) arguments. Thus Gcomm can omit any communications done by

res if relevant to the evaluation of Gres. If all necessary communication was done in

res, then Gcomm =NULL can be passed in the call to IDABBDPrecInit (see below).

Besides the header ﬁles required for the integration of the DAE problem (see §4.3), to use the

idabbdpre module, the main program must include the header ﬁle idas bbdpre.h which declares

the needed function prototypes.

The following is a summary of the usage of this module and describes the sequence of calls in

the user main program. Steps that are unchanged from the user main program presented in §4.4 are

grayed-out.

1. Initialize MPI

2. Set problem dimensions etc.

3. Set vectors of initial values

4. Create idas object

5. Initialize idas solver

6. Specify integration tolerances

7. Create linear solver object

When creating the iterative linear solver object, specify the use of left preconditioning (PREC LEFT)

as idas only supports left preconditioning.

8. Set linear solver optional inputs

4.8 A parallel band-block-diagonal preconditioner module 89

9. Attach linear solver module

10. Set optional inputs

Note that the user should not overwrite the preconditioner setup function or solve function through

calls to idIDASetPreconditioner optional input function.

11. Initialize the idabbdpre preconditioner module

Specify the upper and lower bandwidths mudq,mldq and mukeep,mlkeep and call

flag = IDABBDPrecInit(ida mem, Nlocal, mudq, mldq,

mukeep, mlkeep, dq rel yy, Gres, Gcomm);

to allocate memory and initialize the internal preconditioner data. The last two arguments of

IDABBDPrecInit are the two user-supplied functions described above.

12. Create nonlinear solver object

13. Attach nonlinear solver module

14. Set nonlinear solver optional inputs

15. Correct initial values

16. Specify rootﬁnding problem

17. Advance solution in time

18. Get optional outputs

Additional optional outputs associated with idabbdpre are available by way of two routines

described below, IDABBDPrecGetWorkSpace and IDABBDPrecGetNumGfnEvals.

19. Deallocate memory for solution vectors

20. Free solver memory

21. Free nonlinear solver memory

22. Free linear solver memory

23. Finalize MPI

The user-callable functions that initialize (step 11 above) or re-initialize the idabbdpre preconditioner

module are described next.

IDABBDPrecInit

Call flag = IDABBDPrecInit(ida mem, Nlocal, mudq, mldq,

mukeep, mlkeep, dq rel yy, Gres, Gcomm);

Description The function IDABBDPrecInit initializes and allocates (internal) memory for the id-

abbdpre preconditioner.

Arguments ida mem (void *) pointer to the idas memory block.

Nlocal (sunindextype) local vector dimension.

mudq (sunindextype) upper half-bandwidth to be used in the diﬀerence-quotient

Jacobian approximation.

mldq (sunindextype) lower half-bandwidth to be used in the diﬀerence-quotient

Jacobian approximation.

mukeep (sunindextype) upper half-bandwidth of the retained banded approximate

Jacobian block.

90 Using IDAS for IVP Solution

mlkeep (sunindextype) lower half-bandwidth of the retained banded approximate

Jacobian block.

dq rel yy (realtype) the relative increment in components of yused in the diﬀerence

quotient approximations. The default is dq rel yy=√unit roundoﬀ, which

can be speciﬁed by passing dq rel yy= 0.0.

Gres (IDABBDLocalFn) the Cfunction which computes the local residual approx-

imation G(t, y, ˙y).

Gcomm (IDABBDCommFn) the optional Cfunction which performs all inter-process

communication required for the computation of G(t, y, ˙y).

Return value The return value flag (of type int) is one of

IDALS SUCCESS The call to IDABBDPrecInit was successful.

IDALS MEM NULL The ida mem pointer was NULL.

IDALS MEM FAIL A memory allocation request has failed.

IDALS LMEM NULL An idals linear solver memory was not attached.

IDALS ILL INPUT The supplied vector implementation was not compatible with the

block band preconditioner.

Notes If one of the half-bandwidths mudq or mldq to be used in the diﬀerence-quotient cal-

culation of the approximate Jacobian is negative or exceeds the value Nlocal−1, it is

replaced by 0 or Nlocal−1 accordingly.

The half-bandwidths mudq and mldq need not be the true half-bandwidths of the Jaco-

bian of the local block of G, when smaller values may provide a greater eﬃciency.

Also, the half-bandwidths mukeep and mlkeep of the retained banded approximate

Jacobian block may be even smaller, to reduce storage and computation costs further.

For all four half-bandwidths, the values need not be the same on every processor.

The idabbdpre module also provides a reinitialization function to allow for a sequence of prob-

lems of the same size, with the same linear solver choice, provided there is no change in local N,

mukeep, or mlkeep. After solving one problem, and after calling IDAReInit to re-initialize idas for

a subsequent problem, a call to IDABBDPrecReInit can be made to change any of the following: the

half-bandwidths mudq and mldq used in the diﬀerence-quotient Jacobian approximations, the relative

increment dq rel yy, or one of the user-supplied functions Gres and Gcomm. If there is a change in

any of the linear solver inputs, an additional call to the “Set” routines provided by the sunlinsol

module, and/or one or more of the corresponding IDASet*** functions, must also be made (in the

proper order).

IDABBDPrecReInit

Call flag = IDABBDPrecReInit(ida mem, mudq, mldq, dq rel yy);

Description The function IDABBDPrecReInit reinitializes the idabbdpre preconditioner.

Arguments ida mem (void *) pointer to the idas memory block.

mudq (sunindextype) upper half-bandwidth to be used in the diﬀerence-quotient

Jacobian approximation.

mldq (sunindextype) lower half-bandwidth to be used in the diﬀerence-quotient

Jacobian approximation.

dq rel yy (realtype) the relative increment in components of yused in the diﬀerence

quotient approximations. The default is dq rel yy =√unit roundoﬀ, which

can be speciﬁed by passing dq rel yy = 0.0.

Return value The return value flag (of type int) is one of

IDALS SUCCESS The call to IDABBDPrecReInit was successful.

IDALS MEM NULL The ida mem pointer was NULL.

4.8 A parallel band-block-diagonal preconditioner module 91

IDALS LMEM NULL An idals linear solver memory was not attached.

IDALS PMEM NULL The function IDABBDPrecInit was not previously called.

Notes If one of the half-bandwidths mudq or mldq is negative or exceeds the value Nlocal−1,

it is replaced by 0 or Nlocal−1, accordingly.

The following two optional output functions are available for use with the idabbdpre module:

IDABBDPrecGetWorkSpace

Call flag = IDABBDPrecGetWorkSpace(ida mem, &lenrwBBDP, &leniwBBDP);

Description The function IDABBDPrecGetWorkSpace returns the local sizes of the idabbdpre real

and integer workspaces.

Arguments ida mem (void *) pointer to the idas memory block.

lenrwBBDP (long int) local number of real values in the idabbdpre workspace.

leniwBBDP (long int) local number of integer values in the idabbdpre workspace.

Return value The return value flag (of type int) is one of

IDALS SUCCESS The optional output value has been successfully set.

IDALS MEM NULL The ida mem pointer was NULL.

IDALS PMEM NULL The idabbdpre preconditioner has not been initialized.

Notes The workspace requirements reported by this routine correspond only to memory allo-

cated within the idabbdpre module (the banded matrix approximation, banded sun-

linsol object, temporary vectors). These values are local to each process.

The workspaces referred to here exist in addition to those given by the corresponding

function IDAGetLinWorkSpace.

IDABBDPrecGetNumGfnEvals

Call flag = IDABBDPrecGetNumGfnEvals(ida mem, &ngevalsBBDP);

Description The function IDABBDPrecGetNumGfnEvals returns the cumulative number of calls to

the user Gres function due to the ﬁnite diﬀerence approximation of the Jacobian blocks

used within idabbdpre’s preconditioner setup function.

Arguments ida mem (void *) pointer to the idas memory block.

ngevalsBBDP (long int) the cumulative number of calls to the user Gres function.

Return value The return value flag (of type int) is one of

IDALS SUCCESS The optional output value has been successfully set.

IDALS MEM NULL The ida mem pointer was NULL.

IDALS PMEM NULL The idabbdpre preconditioner has not been initialized.

In addition to the ngevalsBBDP Gres evaluations, the costs associated with idabbdpre also include

nlinsetups LU factorizations, nlinsetups calls to Gcomm,npsolves banded backsolve calls, and

nrevalsLS residual function evaluations, where nlinsetups is an optional idas output (see §4.5.10.2),

and npsolves and nrevalsLS are linear solver optional outputs (see §4.5.10.5).

Chapter 5

Using IDAS for Forward Sensitivity

Analysis

This chapter describes the use of idas to compute solution sensitivities using forward sensitivity anal-

ysis. One of our main guiding principles was to design the idas user interface for forward sensitivity

analysis as an extension of that for IVP integration. Assuming a user main program and user-deﬁned

support routines for IVP integration have already been deﬁned, in order to perform forward sensitivity

analysis the user only has to insert a few more calls into the main program and (optionally) deﬁne

an additional routine which computes the residuals for sensitivity systems (2.12). The only departure

from this philosophy is due to the IDAResFn type deﬁnition (§4.6.1). Without changing the deﬁnition

of this type, the only way to pass values of the problem parameters to the DAE residual function is

to require the user data structure user data to contain a pointer to the array of real parameters p.

idas uses various constants for both input and output. These are deﬁned as needed in this chapter,

but for convenience are also listed separately in Appendix B.

We begin with a brief overview, in the form of a skeleton user program. Following that are detailed

descriptions of the interface to the various user-callable routines and of the user-supplied routines that

were not already described in Chapter 4.

5.1 A skeleton of the user’s main program

The following is a skeleton of the user’s main program (or calling program) as an application of idas.

The user program is to have these steps in the order indicated, unless otherwise noted. For the sake

of brevity, we defer many of the details to the later sections. As in §4.4, most steps are independent

of the nvector,sunmatrix,sunlinsol, and sunnonlinsol implementations used. For the steps

that are not, refer to Chapter 7,8,9, and 10 for the speciﬁc name of the function to be called or

macro to be referenced.

Diﬀerences between the user main program in §4.4 and the one below start only at step (16). Steps

that are unchanged from the skeleton program presented in §4.4 are grayed out.

First, note that no additional header ﬁles need be included for forward sensitivity analysis beyond

those for IVP solution (§4.4).

1. Initialize parallel or multi-threaded environment, if appropriate

2. Set problem dimensions etc.

3. Set vectors of initial values

4. Create idas object

5. Initialize idas solver

94 Using IDAS for Forward Sensitivity Analysis

6. Specify integration tolerances

7. Create matrix object

8. Create linear solver object

9. Set linear solver optional inputs

10. Attach linear solver module

11. Set optional inputs

12. Create nonlinear solver object

13. Attach nonlinear solver module

14. Set nonlinear solver optional inputs

15. Initialize quadrature problem, if not sensitivity-dependent

16. Deﬁne the sensitivity problem

•Number of sensitivities (required)

Set Ns =Ns, the number of parameters with respect to which sensitivities are to be computed.

•Problem parameters (optional)

If idas is to evaluate the residuals of the sensitivity systems, set p, an array of Np real

parameters upon which the IVP depends. Only parameters with respect to which sensitivities

are (potentially) desired need to be included. Attach pto the user data structure user data.

For example, user data->p = p;

If the user provides a function to evaluate the sensitivity residuals, pneed not be speciﬁed.

•Parameter list (optional)

If idas is to evaluate the sensitivity residuals, set plist, an array of Ns integers to specify the

parameters pwith respect to which solution sensitivities are to be computed. If sensitivities

with respect to the j-th parameter p[j] (0 ≤j<Np) are desired, set plisti=j, for some

i= 0, . . . , Ns−1.

If plist is not speciﬁed, idas will compute sensitivities with respect to the ﬁrst Ns parame-

ters; i.e., plisti=i(i= 0, . . . , Ns−1).

If the user provides a function to evaluate the sensitivity residuals, plist need not be spec-

iﬁed.

•Parameter scaling factors (optional)

If idas is to estimate tolerances for the sensitivity solution vectors (based on tolerances for

the state solution vector) or if idas is to evaluate the residuals of the sensitivity systems

using the internal diﬀerence-quotient function, the results will be more accurate if order of

magnitude information is provided.

Set pbar, an array of Ns positive scaling factors. Typically, if pi6= 0, the value ¯pi=|pplisti|

can be used.

If pbar is not speciﬁed, idas will use ¯pi= 1.0.

If the user provides a function to evaluate the sensitivity residual and speciﬁes tolerances for

the sensitivity variables, pbar need not be speciﬁed.

Note that the names for p,pbar,plist, as well as the ﬁeld pof user data are arbitrary, but they

must agree with the arguments passed to IDASetSensParams below.

5.1 A skeleton of the user’s main program 95

17. Set sensitivity initial conditions

Set the Ns vectors yS0[i] and ypS0[i] of initial values for sensitivities (for i= 0,..., Ns −1),

using the appropriate functions deﬁned by the particular nvector implementation chosen.

First, create an array of Ns vectors by making the appropriate call

yS0 = N VCloneVectorArray ***(Ns, y0);

yS0 = N VCloneVectorArrayEmpty ***(Ns, y0);

Here the argument y0 serves only to provide the N Vector type for cloning.

Then, for each i= 0,...,Ns −1, load initial values for the i-th sensitivity vector yS0[i].

Set the initial conditions for the Ns sensitivity derivative vectors ypS0 of ˙ysimilarly.

18. Activate sensitivity calculations

Call flag = IDASensInit(...); to activate forward sensitivity computations and allocate inter-

nal memory for idas related to sensitivity calculations (see §5.2.1).

19. Set sensitivity tolerances

Call IDASensSStolerances,IDASensSVtolerances, or IDASensEEtolerances. See §5.2.2.

20. Set sensitivity analysis optional inputs

Call IDASetSens* routines to change from their default values any optional inputs that control

the behavior of idas in computing forward sensitivities. See §5.2.7.

21. Create sensitivity nonlinear solver object (optional)

If using a non-default nonlinear solver (see §5.2.3), then create the desired nonlinear solver object

by calling the appropriate constructor function deﬁned by the particular sunnonlinsol imple-

mentation e.g.,

NLSSens = SUNNonlinSol_***Sens(...);

where *** is the name of the nonlinear solver and ... are constructor speciﬁc arguments (see

Chapter 10 for details).

22. Attach the sensitvity nonlinear solver module (optional)

If using a non-default nonlinear solver, then initialize the nonlinear solver interface by attaching

the nonlinear solver object by calling

ier = IDASetNonlinearSolverSensSim(ida_mem, NLSSens);

when using the IDA SIMULTANEOUS corrector method or

ier = IDASetNonlinearSolverSensStg(ida_mem, NLSSens);

when using the IDA STAGGERED corrector method (see §5.2.3 for details).

23. Set sensitivity nonlinear solver optional inputs (optional)

Call the appropriate set functions for the selected nonlinear solver module to change optional

inputs speciﬁc to that nonlinear solver. These must be called after IDASensInit if using the

default nonlinear solver or after attaching a new nonlinear solver to idas, otherwise the optional

inputs will be overridden by idas defaults. See Chapter 10 for more information on optional

inputs.

96 Using IDAS for Forward Sensitivity Analysis

24. Correct initial values

25. Specify rootﬁnding problem

26. Advance solution in time

27. Extract sensitivity solution

After each successful return from IDASolve, the solution of the original IVP is available in the y

argument of IDASolve, while the sensitivity solution can be extracted into yS and ypS (which can

be the same as yS0 and ypS0, respectively) by calling one of the following routines: IDAGetSens,

IDAGetSens1,IDAGetSensDky or IDAGetSensDky1 (see §5.2.6).

28. Get optional outputs

29. Deallocate memory for solution vector

30. Deallocate memory for sensitivity vectors

Upon completion of the integration, deallocate memory for the vectors contained in yS0 and ypS0:

N VDestroyVectorArray ***(yS0, Ns);

If yS was created from realtype arrays yS i, it is the user’s responsibility to also free the space

for the arrays yS i, and likewise for ypS.

31. Free user data structure

32. Free solver memory

33. Free nonlinear solver memory

34. Free vector speciﬁcation memory

35. Free linear solver and matrix memory

36. Finalize MPI, if used

5.2 User-callable routines for forward sensitivity analysis

This section describes the idas functions, in addition to those presented in §4.5, that are called by

the user to set up and solve a forward sensitivity problem.

5.2.1 Forward sensitivity initialization and deallocation functions

Activation of forward sensitivity computation is done by calling IDASensInit. The form of the call

to this routine is as follows:

IDASensInit

Call flag = IDASensInit(ida mem, Ns, ism, resS, yS0, ypS0);

Description The routine IDASensInit activates forward sensitivity computations and allocates in-

ternal memory related to sensitivity calculations.

Arguments ida mem (void *) pointer to the idas memory block returned by IDACreate.

Ns (int) the number of sensitivities to be computed.

ism (int) a ﬂag used to select the sensitivity solution method. Its value can be

either IDA SIMULTANEOUS or IDA STAGGERED:

5.2 User-callable routines for forward sensitivity analysis 97

•In the IDA SIMULTANEOUS approach, the state and sensitivity variables are

corrected at the same time. If the default Newton nonlinear solver is used,

this amounts to performing a modiﬁed Newton iteration on the combined

nonlinear system;

•In the IDA STAGGERED approach, the correction step for the sensitivity

variables takes place at the same time for all sensitivity equations, but

only after the correction of the state variables has converged and the state

variables have passed the local error test;

resS (IDASensResFn) is the Cfunction which computes the residual of the sensitiv-

ity DAE. For full details see §5.3.

yS0 (N Vector *) a pointer to an array of Ns vectors containing the initial values

of the sensitivities of y.

ypS0 (N Vector *) a pointer to an array of Ns vectors containing the initial values

of the sensitivities of ˙y.

Return value The return value flag (of type int) will be one of the following:

IDA SUCCESS The call to IDASensInit was successful.

IDA MEM NULL The idas memory block was not initialized through a previous call to

IDACreate.

IDA MEM FAIL A memory allocation request has failed.

IDA ILL INPUT An input argument to IDASensInit has an illegal value.

Notes Passing resS=NULL indicates using the default internal diﬀerence quotient sensitivity

residual routine.

If an error occurred, IDASensInit also prints an error message to the ﬁle speciﬁed by

the optional input errfp.

In terms of the problem size N, number of sensitivity vectors Ns, and maximum method order maxord,

the size of the real workspace is increased as follows:

•Base value: lenrw =lenrw + (maxord+5)NsN

•With IDASensSVtolerances:lenrw =lenrw +NsN

the size of the integer workspace is increased as follows:

•Base value: leniw =leniw + (maxord+5)NsNi

•With IDASensSVtolerances:leniw =leniw +NsNi,

where Niis the number of integer words in one N Vector.

The routine IDASensReInit, useful during the solution of a sequence of problems of same size,

reinitializes the sensitivity-related internal memory and must follow a call to IDASensInit (and maybe

a call to IDAReInit). The number Ns of sensitivities is assumed to be unchanged since the call to

IDASensInit. The call to the IDASensReInit function has the form:

IDASensReInit

Call flag = IDASensReInit(ida mem, ism, yS0, ypS0);

Description The routine IDASensReInit reinitializes forward sensitivity computations.

Arguments ida mem (void *) pointer to the idas memory block returned by IDACreate.

ism (int) a ﬂag used to select the sensitivity solution method. Its value can be

either IDA SIMULTANEOUS or IDA STAGGERED.

yS0 (N Vector *) a pointer to an array of Ns variables of type N Vector containing

the initial values of the sensitivities of y.

98 Using IDAS for Forward Sensitivity Analysis

ypS0 (N Vector *) a pointer to an array of Ns variables of type N Vector containing

the initial values of the sensitivities of ˙y.

Return value The return value flag (of type int) will be one of the following:

IDA SUCCESS The call to IDAReInit was successful.

IDA MEM NULL The idas memory block was not initialized through a previous call to

IDACreate.

IDA NO SENS Memory space for sensitivity integration was not allocated through a

previous call to IDASensInit.

IDA ILL INPUT An input argument to IDASensReInit has an illegal value.

IDA MEM FAIL A memory allocation request has failed.

Notes All arguments of IDASensReInit are the same as those of IDASensInit.

If an error occurred, IDASensReInit also prints an error message to the ﬁle speciﬁed

by the optional input errfp.

To deallocate all forward sensitivity-related memory (allocated in a prior call to IDASensInit), the

user must call

IDASensFree

Call IDASensFree(ida mem);

Description The function IDASensFree frees the memory allocated for forward sensitivity compu-

tations by a previous call to IDASensInit.

Arguments The argument is the pointer to the idas memory block (of type void *).

Return value The function IDASensFree has no return value.

Notes In general, IDASensFree need not be called by the user as it is invoked automatically

by IDAFree.

After a call to IDASensFree, forward sensitivity computations can be reactivated only

by calling IDASensInit again.

To activate and deactivate forward sensitivity calculations for successive idas runs, without having

to allocate and deallocate memory, the following function is provided:

IDASensToggleOff

Call IDASensToggleOff(ida mem);

Description The function IDASensToggleOff deactivates forward sensitivity calculations. It does

not deallocate sensitivity-related memory.

Arguments ida mem (void *) pointer to the memory previously allocated by IDAInit.

Return value The return value flag of IDASensToggle is one of:

IDA SUCCESS IDASensToggleOff was successful.

IDA MEM NULL ida mem was NULL.

Notes Since sensitivity-related memory is not deallocated, sensitivities can be reactivated at

a later time (using IDASensReInit).

5.2.2 Forward sensitivity tolerance speciﬁcation functions

One of the following three functions must be called to specify the integration tolerances for sensitivities.

Note that this call must be made after the call to IDASensInit.

5.2 User-callable routines for forward sensitivity analysis 99

IDASensSStolerances

Call flag = IDASensSStolerances(ida mem, reltolS, abstolS);

Description The function IDASensSStolerances speciﬁes scalar relative and absolute tolerances.

Arguments ida mem (void *) pointer to the idas memory block returned by IDACreate.

reltolS (realtype) is the scalar relative error tolerance.

abstolS (realtype*) is a pointer to an array of length Ns containing the scalar absolute

error tolerances.

Return value The return ﬂag flag (of type int) will be one of the following:

IDA SUCCESS The call to IDASStolerances was successful.

IDA MEM NULL The idas memory block was not initialized through a previous call to

IDACreate.

IDA NO SENS The sensitivity allocation function IDASensInit has not been called.

IDA ILL INPUT One of the input tolerances was negative.

IDASensSVtolerances

Call flag = IDASensSVtolerances(ida mem, reltolS, abstolS);

Description The function IDASensSVtolerances speciﬁes scalar relative tolerance and vector abso-

lute tolerances.

Arguments ida mem (void *) pointer to the idas memory block returned by IDACreate.

reltolS (realtype) is the scalar relative error tolerance.

abstolS (N Vector*) is an array of Ns variables of type N Vector. The N Vector from

abstolS[is] speciﬁes the vector tolerances for is-th sensitivity.

Return value The return ﬂag flag (of type int) will be one of the following:

IDA SUCCESS The call to IDASVtolerances was successful.

IDA MEM NULL The idas memory block was not initialized through a previous call to

IDACreate.

IDA NO SENS The sensitivity allocation function IDASensInit has not been called.

IDA ILL INPUT The relative error tolerance was negative or one of the absolute tolerance

vectors had a negative component.

Notes This choice of tolerances is important when the absolute error tolerance needs to be

diﬀerent for each component of any vector yS[i].

IDASensEEtolerances

Call flag = IDASensEEtolerances(ida mem);

Description When IDASensEEtolerances is called, idas will estimate tolerances for sensitivity vari-

ables based on the tolerances supplied for states variables and the scaling factors ¯p.

Arguments ida mem (void *) pointer to the idas memory block returned by IDACreate.

Return value The return ﬂag flag (of type int) will be one of the following:

IDA SUCCESS The call to IDASensEEtolerances was successful.

IDA MEM NULL The idas memory block was not initialized through a previous call to

IDACreate.

IDA NO SENS The sensitivity allocation function IDASensInit has not been called.

100 Using IDAS for Forward Sensitivity Analysis

5.2.3 Forward sensitivity nonlinear solver interface functions

As in the pure DAE case, when computing solution sensitivities using forward sensitivitiy analysis idas

uses the sunnonlinsol implementation of Newton’s method deﬁned by the sunnonlinsol newton

module (see §10.2) by default. To specify a diﬀerent nonlinear solver in idas, the user’s program

must create a sunnonlinsol object by calling the appropriate constructor routine. The user must

then attach the sunnonlinsol object to idas by calling either IDASetNonlinearSolverSensSim

when using the IDA SIMULTANEOUS corrector option, or IDASetNonlinearSolver (see §4.5.4) and

IDASetNonlinearSolverSensStg when using the IDA STAGGERED corrector option, as documented

below.

When changing the nonlinear solver in idas,IDASetNonlinearSolver must be called after IDAInit;

similarly IDASetNonlinearSolverSensSim and IDASetNonlinearSolverStg must be called after

IDASensInit. If any calls to IDASolve have been made, then idas will need to be reinitialized

by calling IDAReInit to ensure that the nonlinear solver is initialized correctly before any subsequent

calls to IDASolve.

The ﬁrst argument passed to the routines IDASetNonlinearSolverSensSim and

IDASetNonlinearSolverSensStg is the idas memory pointer returned by IDACreate and the second

argument is the sunnonlinsol object to use for solving the nonlinear system 2.4. A call to this

function attaches the nonlinear solver to the main idas integrator. We note that at present, the

sunnonlinsol object must be of type SUNNONLINEARSOLVER ROOTFIND.

IDASetNonlinearSolverSensSim

Call flag = IDASetNonlinearSolverSensSim(ida mem, NLS);

Description The function IDASetNonLinearSolverSensSim attaches a sunnonlinsol object (NLS)

to idas when using the IDA SIMULTANEOUS approach to correct the state and sensitivity

variables at the same time.

Arguments ida mem (void *) pointer to the idas memory block.

NLS (SUNNonlinearSolver)sunnonlinsol object to use for solving nonlinear sys-

tems.

Return value The return value flag (of type int) is one of

IDA SUCCESS The nonlinear solver was successfully attached.

IDA MEM NULL The ida mem pointer is NULL.

IDA ILL INPUT The sunnonlinsol object is NULL, does not implement the required

nonlinear solver operations, is not of the correct type, or the residual

function, convergence test function, or maximum number of nonlinear

iterations could not be set.

IDASetNonlinearSolverSensStg

Call flag = IDASetNonlinearSolverSensStg(ida mem, NLS);

Description The function IDASetNonLinearSolverSensStg attaches a sunnonlinsol object (NLS)

to idas when using the IDA STAGGERED approach to correct the sensitivity variables

after the correction of the state variables.

Arguments ida mem (void *) pointer to the idas memory block.

NLS (SUNNonlinearSolver)sunnonlinsol object to use for solving nonlinear sys-

tems.

Return value The return value flag (of type int) is one of

IDA SUCCESS The nonlinear solver was successfully attached.

IDA MEM NULL The ida mem pointer is NULL.

5.2 User-callable routines for forward sensitivity analysis 101

IDA ILL INPUT The sunnonlinsol object is NULL, does not implement the required

nonlinear solver operations, is not of the correct type, or the residual

function, convergence test function, or maximum number of nonlinear

iterations could not be set.

Notes This function only attaches the sunnonlinsol object for correcting the sensitivity

variables. To attach a sunnonlinsol object for the state variable correction use

IDASetNonlinearSolver (see §4.5.4).

5.2.4 Forward sensitivity initial condition calculation function

IDACalcIC also calculates corrected initial conditions for sensitivity variables of a DAE system. When

used for initial conditions calculation of the forward sensitivities, IDACalcIC must be preceded by

successful calls to IDASensInit (or IDASensReInit) and should precede the call(s) to IDASolve. For

restrictions that apply for initial conditions calculation of the state variables, see §4.5.5.

Calling IDACalcIC is optional. It is only necessary when the initial conditions do not satisfy the

sensitivity systems. Even if forward sensitivity analysis was enabled, the call to the initial conditions

calculation function IDACalcIC is exactly the same as for state variables.

flag = IDACalcIC(ida_mem, icopt, tout1);

See §4.5.5 for a list of possible return values.

5.2.5 IDAS solver function

Even if forward sensitivity analysis was enabled, the call to the main solver function IDASolve is

exactly the same as in §4.5.7. However, in this case the return value flag can also be one of the

following:

IDA SRES FAIL The sensitivity residual function failed in an unrecoverable manner.

IDA REP SRES ERR The user’s residual function repeatedly returned a recoverable error ﬂag, but the

solver was unable to recover.

5.2.6 Forward sensitivity extraction functions

If forward sensitivity computations have been initialized by a call to IDASensInit, or reinitialized by

a call to IDASensReInit, then idas computes both a solution and sensitivities at time t. However,

IDASolve will still return only the solutions yand ˙yin yret and ypret, respectively. Solution

sensitivities can be obtained through one of the following functions:

IDAGetSens

Call flag = IDAGetSens(ida mem, &tret, yS);

Description The function IDAGetSens returns the sensitivity solution vectors after a successful return

from IDASolve.

Arguments ida mem (void *) pointer to the memory previously allocated by IDAInit.

tret (realtype) the time reached by the solver (output).

yS (N Vector *) the array of Ns computed forward sensitivity vectors.

Return value The return value flag of IDAGetSens is one of:

IDA SUCCESS IDAGetSens was successful.

IDA MEM NULL ida mem was NULL.

IDA NO SENS Forward sensitivity analysis was not initialized.

IDA BAD DKY yS is NULL.

Notes Note that the argument tret is an output for this function. Its value will be the same

as that returned at the last IDASolve call.

102 Using IDAS for Forward Sensitivity Analysis

The function IDAGetSensDky computes the k-th derivatives of the interpolating polynomials for the

sensitivity variables at time t. This function is called by IDAGetSens with k= 0, but may also be

called directly by the user.

IDAGetSensDky

Call flag = IDAGetSensDky(ida mem, t, k, dkyS);

Description The function IDAGetSensDky returns derivatives of the sensitivity solution vectors after

a successful return from IDASolve.

Arguments ida mem (void *) pointer to the memory previously allocated by IDAInit.

t(realtype) speciﬁes the time at which sensitivity information is requested.

The time tmust fall within the interval deﬁned by the last successful step

taken by idas.

k(int) order of derivatives.

dkyS (N Vector *) array of Ns vectors containing the derivatives on output. The

space for dkyS must be allocated by the user.

Return value The return value flag of IDAGetSensDky is one of:

IDA SUCCESS IDAGetSensDky succeeded.

IDA MEM NULL ida mem was NULL.

IDA NO SENS Forward sensitivity analysis was not initialized.

IDA BAD DKY dkyS or one of the vectors dkyS[i] is NULL.

IDA BAD K k is not in the range 0,1, ..., klast.

IDA BAD T The time tis not in the allowed range.

Forward sensitivity solution vectors can also be extracted separately for each parameter in turn

through the functions IDAGetSens1 and IDAGetSensDky1, deﬁned as follows:

IDAGetSens1

Call flag = IDAGetSens1(ida mem, &tret, is, yS);

Description The function IDAGetSens1 returns the is-th sensitivity solution vector after a successful

return from IDASolve.

Arguments ida mem (void *) pointer to the memory previously allocated by IDAInit.

tret (realtype *) the time reached by the solver (output).

is (int) speciﬁes which sensitivity vector is to be returned (0 ≤is< Ns).

yS (N Vector) the computed forward sensitivity vector.

Return value The return value flag of IDAGetSens1 is one of:

IDA SUCCESS IDAGetSens1 was successful.

IDA MEM NULL ida mem was NULL.

IDA NO SENS Forward sensitivity analysis was not initialized.

IDA BAD IS The index is is not in the allowed range.

IDA BAD DKY yS is NULL.

IDA BAD T The time tis not in the allowed range.

Notes Note that the argument tret is an output for this function. Its value will be the same

as that returned at the last IDASolve call.

5.2 User-callable routines for forward sensitivity analysis 103

IDAGetSensDky1

Call flag = IDAGetSensDky1(ida mem, t, k, is, dkyS);

Description The function IDAGetSensDky1 returns the k-th derivative of the is-th sensitivity solu-

tion vector after a successful return from IDASolve.

Arguments ida mem (void *) pointer to the memory previously allocated by IDAInit.

t(realtype) speciﬁes the time at which sensitivity information is requested.

The time tmust fall within the interval deﬁned by the last successful step

taken by idas.

k(int) order of derivative.

is (int) speciﬁes the sensitivity derivative vector to be returned (0 ≤is< Ns).

dkyS (N Vector) the vector containing the derivative on output. The space for dkyS

must be allocated by the user.

Return value The return value flag of IDAGetSensDky1 is one of:

IDA SUCCESS IDAGetQuadDky1 succeeded.

IDA MEM NULL ida mem was NULL.

IDA NO SENS Forward sensitivity analysis was not initialized.

IDA BAD DKY dkyS is NULL.

IDA BAD IS The index is is not in the allowed range.

IDA BAD K k is not in the range 0,1, ..., klast.

IDA BAD T The time tis not in the allowed range.

5.2.7 Optional inputs for forward sensitivity analysis

Optional input variables that control the computation of sensitivities can be changed from their default

values through calls to IDASetSens* functions. Table 5.1 lists all forward sensitivity optional input

functions in idas which are described in detail in the remainder of this section.

IDASetSensParams

Call flag = IDASetSensParams(ida mem, p, pbar, plist);

Description The function IDASetSensParams speciﬁes problem parameter information for sensitivity

calculations.

Arguments ida mem (void *) pointer to the idas memory block.

p(realtype *) a pointer to the array of real problem parameters used to evalu-

ate F(t, y, ˙y, p). If non-NULL,pmust point to a ﬁeld in the user’s data structure

user data passed to the user’s residual function. (See §5.1).

pbar (realtype *) an array of Ns positive scaling factors. If non-NULL,pbar must

have all its components >0.0. (See §5.1).

plist (int *) an array of Ns non-negative indices to specify which components of p

to use in estimating the sensitivity equations. If non-NULL,plist must have

all components ≥0. (See §5.1).

Table 5.1: Forward sensitivity optional inputs

Optional input Routine name Default

Sensitivity scaling factors IDASetSensParams NULL

DQ approximation method IDASetSensDQMethod centered,0.0

Error control strategy IDASetSensErrCon SUNFALSE

Maximum no. of nonlinear iterations IDASetSensMaxNonlinIters 3

104 Using IDAS for Forward Sensitivity Analysis

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA NO SENS Forward sensitivity analysis was not initialized.

IDA ILL INPUT An argument has an illegal value.

Notes This function must be preceded by a call to IDASensInit.

IDASetSensDQMethod

Call flag = IDASetSensDQMethod(ida mem, DQtype, DQrhomax);

Description The function IDASetSensDQMethod speciﬁes the diﬀerence quotient strategy in the case

in which the residual of the sensitivity equations are to be computed by idas.

Arguments ida mem (void *) pointer to the idas memory block.

DQtype (int) speciﬁes the diﬀerence quotient type and can be either IDA CENTERED or

IDA FORWARD.

DQrhomax (realtype) positive value of the selection parameter used in deciding switch-

ing between a simultaneous or separate approximation of the two terms in the

sensitivity residual.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA ILL INPUT An argument has an illegal value.

Notes If DQrhomax = 0.0, then no switching is performed. The approximation is done simul-

taneously using either centered or forward ﬁnite diﬀerences, depending on the value of

DQtype. For values of DQrhomax ≥1.0, the simultaneous approximation is used when-

ever the estimated ﬁnite diﬀerence perturbations for states and parameters are within

a factor of DQrhomax, and the separate approximation is used otherwise. Note that a

value DQrhomax <1.0 will eﬀectively disable switching. See §2.5 for more details.

The default value are DQtype=IDA CENTERED and DQrhomax= 0.0.

IDASetSensErrCon

Call flag = IDASetSensErrCon(ida mem, errconS);

Description The function IDASetSensErrCon speciﬁes the error control strategy for sensitivity vari-

ables.

Arguments ida mem (void *) pointer to the idas memory block.

errconS (booleantype) speciﬁes whether sensitivity variables are included (SUNTRUE)

or not (SUNFALSE) in the error control mechanism.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

Notes By default, errconS is set to SUNFALSE. If errconS=SUNTRUE then both state variables

and sensitivity variables are included in the error tests. If errconS=SUNFALSE then

the sensitivity variables are excluded from the error tests. Note that, in any event, all

variables are considered in the convergence tests.

5.2 User-callable routines for forward sensitivity analysis 105

IDASetSensMaxNonlinIters

Call flag = IDASetSensMaxNonlinIters(ida mem, maxcorS);

Description The function IDASetSensMaxNonlinIters speciﬁes the maximum number of nonlinear

solver iterations for sensitivity variables per step.

Arguments ida mem (void *) pointer to the idas memory block.

maxcorS (int) maximum number of nonlinear solver iterations allowed per step (>0).

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA MEM FAIL The ida mem sunnonlinsol module is NULL.

Notes The default value is 3.

5.2.8 Optional outputs for forward sensitivity analysis

5.2.8.1 Main solver optional output functions

Optional output functions that return statistics and solver performance information related to forward

sensitivity computations are listed in Table 5.2 and described in detail in the remainder of this section.

IDAGetSensNumResEvals

Call flag = IDAGetSensNumResEvals(ida mem, &nfSevals);

Description The function IDAGetSensNumResEvals returns the number of calls to the sensitivity

residual function.

Arguments ida mem (void *) pointer to the idas memory block.

nfSevals (long int) number of calls to the sensitivity residual function.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA NO SENS Forward sensitivity analysis was not initialized.

IDAGetNumResEvalsSens

Call flag = IDAGetNumResEvalsSens(ida mem, &nfevalsS);

Description The function IDAGetNumResEvalsSEns returns the number of calls to the user’s residual

function due to the internal ﬁnite diﬀerence approximation of the sensitivity residuals.

Table 5.2: Forward sensitivity optional outputs

Optional output Routine name

No. of calls to sensitivity residual function IDAGetSensNumResEvals

No. of calls to residual function for sensitivity IDAGetNumResEvalsSens

No. of sensitivity local error test failures IDAGetSensNumErrTestFails

No. of calls to lin. solv. setup routine for sens. IDAGetSensNumLinSolvSetups

Sensitivity-related statistics as a group IDAGetSensStats

Error weight vector for sensitivity variables IDAGetSensErrWeights

No. of sens. nonlinear solver iterations IDAGetSensNumNonlinSolvIters

No. of sens. convergence failures IDAGetSensNumNonlinSolvConvFails

Sens. nonlinear solver statistics as a group IDAGetSensNonlinSolvStats

106 Using IDAS for Forward Sensitivity Analysis

Arguments ida mem (void *) pointer to the idas memory block.

nfevalsS (long int) number of calls to the user residual function for sensitivity resid-

uals.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA NO SENS Forward sensitivity analysis was not initialized.

Notes This counter is incremented only if the internal ﬁnite diﬀerence approximation routines

are used for the evaluation of the sensitivity residuals.

IDAGetSensNumErrTestFails

Call flag = IDAGetSensNumErrTestFails(ida mem, &nSetfails);

Description The function IDAGetSensNumErrTestFails returns the number of local error test fail-

ures for the sensitivity variables that have occurred.

Arguments ida mem (void *) pointer to the idas memory block.

nSetfails (long int) number of error test failures.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA NO SENS Forward sensitivity analysis was not initialized.

Notes This counter is incremented only if the sensitivity variables have been included in the

error test (see IDASetSensErrCon in §5.2.7). Even in that case, this counter is not

incremented if the ism=IDA SIMULTANEOUS sensitivity solution method has been used.

IDAGetSensNumLinSolvSetups

Call flag = IDAGetSensNumLinSolvSetups(ida mem, &nlinsetupsS);

Description The function IDAGetSensNumLinSolvSetups returns the number of calls to the linear

solver setup function due to forward sensitivity calculations.

Arguments ida mem (void *) pointer to the idas memory block.

nlinsetupsS (long int) number of calls to the linear solver setup function.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA NO SENS Forward sensitivity analysis was not initialized.

Notes This counter is incremented only if a nonlinear solver requiring linear solves has been

used and staggered sensitivity solution method (ism=IDA STAGGERED) was speciﬁed in

the call to IDASensInit (see §5.2.1).

IDAGetSensStats

Call flag = IDAGetSensStats(ida mem, &nfSevals, &nfevalsS, &nSetfails,

&nlinsetupsS);

Description The function IDAGetSensStats returns all of the above sensitivity-related solver statis-

tics as a group.

Arguments ida mem (void *) pointer to the idas memory block.

nfSevals (long int) number of calls to the sensitivity residual function.

5.2 User-callable routines for forward sensitivity analysis 107

nfevalsS (long int) number of calls to the user-supplied residual function.

nSetfails (long int) number of error test failures.

nlinsetupsS (long int) number of calls to the linear solver setup function.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional output values have been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA NO SENS Forward sensitivity analysis was not initialized.

IDAGetSensErrWeights

Call flag = IDAGetSensErrWeights(ida mem, eSweight);

Description The function IDAGetSensErrWeights returns the sensitivity error weight vectors at the

current time. These are the reciprocals of the Wiof (2.7) for the sensitivity variables.

Arguments ida mem (void *) pointer to the idas memory block.

eSweight (N Vector S) pointer to the array of error weight vectors.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA NO SENS Forward sensitivity analysis was not initialized.

Notes The user must allocate memory for eweightS.

IDAGetSensNumNonlinSolvIters

Call flag = IDAGetSensNumNonlinSolvIters(ida mem, &nSniters);

Description The function IDAGetSensNumNonlinSolvIters returns the number of nonlinear itera-

tions performed for sensitivity calculations.

Arguments ida mem (void *) pointer to the idas memory block.

nSniters (long int) number of nonlinear iterations performed.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA NO SENS Forward sensitivity analysis was not initialized.

IDA MEM FAIL The sunnonlinsol module is NULL.

Notes This counter is incremented only if ism was IDA STAGGERED in the call to IDASensInit

(see §5.2.1).

IDAGetSensNumNonlinSolvConvFails

Call flag = IDAGetSensNumNonlinSolvConvFails(ida mem, &nSncfails);

Description The function IDAGetSensNumNonlinSolvConvFails returns the number of nonlinear

convergence failures that have occurred for sensitivity calculations.

Arguments ida mem (void *) pointer to the idas memory block.

nSncfails (long int) number of nonlinear convergence failures.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA NO SENS Forward sensitivity analysis was not initialized.

Notes This counter is incremented only if ism was IDA STAGGERED in the call to IDASensInit

(see §5.2.1).

108 Using IDAS for Forward Sensitivity Analysis

IDAGetSensNonlinSolvStats

Call flag = IDAGetSensNonlinSolvStats(ida mem, &nSniters, &nSncfails);

Description The function IDAGetSensNonlinSolvStats returns the sensitivity-related nonlinear

solver statistics as a group.

Arguments ida mem (void *) pointer to the idas memory block.

nSniters (long int) number of nonlinear iterations performed.

nSncfails (long int) number of nonlinear convergence failures.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional output values have been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA NO SENS Forward sensitivity analysis was not initialized.

IDA MEM FAIL The sunnonlinsol module is NULL.

5.2.8.2 Initial condition calculation optional output functions

The sensitivity consistent initial conditions found by idas (after a successful call to IDACalcIC) can

be obtained by calling the following function:

IDAGetSensConsistentIC

Call flag = IDAGetSensConsistentIC(ida mem, yyS0 mod, ypS0 mod);

Description The function IDAGetSensConsistentIC returns the corrected initial conditions calcu-

lated by IDACalcIC for sensitivities variables.

Arguments ida mem (void *) pointer to the idas memory block.

yyS0 mod (N Vector *) a pointer to an array of Ns vectors containing consistent sensi-

tivity vectors.

ypS0 mod (N Vector *) a pointer to an array of Ns vectors containing consistent sensi-

tivity derivative vectors.

Return value The return value flag (of type int) is one of

IDA SUCCESS IDAGetSensConsistentIC succeeded.

IDA MEM NULL The ida mem pointer is NULL.

IDA NO SENS The function IDASensInit has not been previously called.

IDA ILL INPUT IDASolve has been already called.

Notes If the consistent sensitivity vectors or consistent derivative vectors are not desired, pass

NULL for the corresponding argument.

The user must allocate space for yyS0 mod and ypS0 mod (if not NULL).

5.3 User-supplied routines for forward sensitivity analysis

In addition to the required and optional user-supplied routines described in §4.6, when using idas for

forward sensitivity analysis, the user has the option of providing a routine that calculates the residual

of the sensitivity equations (2.12).

By default, idas uses diﬀerence quotient approximation routines for the residual of the sensitivity

equations. However, idas allows the option for user-deﬁned sensitivity residual routines (which also

provides a mechanism for interfacing idas to routines generated by automatic diﬀerentiation).

The user may provide the residuals of the sensitivity equations (2.12), for all sensitivity parameters

at once, through a function of type IDASensResFn deﬁned by:

5.4 Integration of quadrature equations depending on forward sensitivities 109

IDASensResFn

Deﬁnition typedef int (*IDASensResFn)(int Ns, realtype t,

N Vector yy, N Vector yp, N Vector resval,

N Vector *yS, N Vector *ypS,

N Vector *resvalS, void *user data,

N Vector tmp1, N Vector tmp2, N Vector tmp3);

Purpose This function computes the sensitivity residual for all sensitivity equations. It must com-

pute the vectors (∂F/∂y)si(t)+(∂F/∂ ˙y) ˙si(t)+(∂F/∂pi) and store them in resvalS[i].

Arguments Ns is the number of sensitivities.

tis the current value of the independent variable.

yy is the current value of the state vector, y(t).

yp is the current value of ˙y(t).

resval contains the current value Fof the original DAE residual.

yS contains the current values of the sensitivities si.

ypS contains the current values of the sensitivity derivatives ˙si.

resvalS contains the output sensitivity residual vectors.

user data is a pointer to user data.

tmp1

tmp2

tmp3 are N Vectors of length Nwhich can be used as temporary storage.

Return value An IDASensResFn should return 0 if successful, a positive value if a recoverable error

occurred (in which case idas will attempt to correct), or a negative value if it failed

unrecoverably (in which case the integration is halted and IDA SRES FAIL is returned).

Notes There is one situation in which recovery is not possible even if IDASensResFn function

returns a recoverable error ﬂag. That is when this occurs at the very ﬁrst call to the

IDASensResFn, in which case idas returns IDA FIRST RES FAIL.

5.4 Integration of quadrature equations depending on forward

sensitivities

idas provides support for integration of quadrature equations that depends not only on the state

variables but also on forward sensitivities.

The following is an overview of the sequence of calls in a user’s main program in this situation.

Steps that are unchanged from the skeleton program presented in §5.1 are grayed out. See also §4.7.

1. Initialize parallel or multi-threaded environment

2. Set problem dimensions, etc.

3. Set vectors of initial values

4. Create idas object

5. Initialize idas solver

6. Specify integration tolerances

7. Create matrix object

8. Create linear solver object

9. Set linear solver optional inputs

110 Using IDAS for Forward Sensitivity Analysis

10. Attach linear solver module

11. Set optional inputs

12. Create nonlinear solver object

13. Attach nonlinear solver module

14. Set nonlinear solver optional inputs

15. Initialize sensitivity-independent quadrature problem

16. Deﬁne the sensitivity problem

17. Set sensitivity initial conditions

18. Activate sensitivity calculations

19. Set sensitivity tolerances

20. Set sensitivity analysis optional inputs

21. Create sensitivity nonlinear solver object

22. Attach the sensitvity nonlinear solver module

23. Set sensitivity nonlinear solver optional inputs

24. Set vector of initial values for quadrature variables

Typically, the quadrature variables should be initialized to 0.

25. Initialize sensitivity-dependent quadrature integration

Call IDAQuadSensInit to specify the quadrature equation right-hand side function and to allocate

internal memory related to quadrature integration. See §5.4.1 for details.

26. Set optional inputs for sensitivity-dependent quadrature integration

Call IDASetQuadSensErrCon to indicate whether or not quadrature variables should be used in

the step size control mechanism. If so, one of the IDAQuadSens*tolerances functions must be

called to specify the integration tolerances for quadrature variables. See §5.4.4 for details.

27. Advance solution in time

28. Extract sensitivity-dependent quadrature variables

Call IDAGetQuadSens,IDAGetQuadSens1,IDAGetQuadSensDky or IDAGetQuadSensDky1 to obtain

the values of the quadrature variables or their derivatives at the current time. See §5.4.3 for details.

29. Get optional outputs

30. Extract sensitivity solution

31. Get sensitivity-dependent quadrature optional outputs

Call IDAGetQuadSens* functions to obtain optional output related to the integration of sensitivity-

dependent quadratures. See §5.4.5 for details.

32. Deallocate memory for solutions vector

33. Deallocate memory for sensitivity vectors

5.4 Integration of quadrature equations depending on forward sensitivities 111

34. Deallocate memory for sensitivity-dependent quadrature variables

35. Free solver memory

36. Free nonlinear solver memory

37. Free vector speciﬁcation memory

38. Free linear solver and matrix memory

39. Finalize MPI, if used

Note: IDAQuadSensInit (step 25 above) can be called and quadrature-related optional inputs (step

26 above) can be set, anywhere between steps 16 and 27.

5.4.1 Sensitivity-dependent quadrature initialization and deallocation

The function IDAQuadSensInit activates integration of quadrature equations depending on sensitiv-

ities and allocates internal memory related to these calculations. If rhsQS is input as NULL, then

idas uses an internal function that computes diﬀerence quotient approximations to the functions

¯qi= (∂q/∂y)si+ (∂q/∂ ˙y) ˙si+∂q/∂pi, in the notation of (2.10). The form of the call to this function

is as follows:

IDAQuadSensInit

Call flag = IDAQuadSensInit(ida mem, rhsQS, yQS0);

Description The function IDAQuadSensInit provides required problem speciﬁcations, allocates in-

ternal memory, and initializes quadrature integration.

Arguments ida mem (void *) pointer to the idas memory block returned by IDACreate.

rhsQS (IDAQuadSensRhsFn) is the Cfunction which computes fQS , the right-hand

side of the sensitivity-dependent quadrature equations (for full details see

§5.4.6).

yQS0 (N Vector *) contains the initial values of sensitivity-dependent quadratures.

Return value The return value flag (of type int) will be one of the following:

IDA SUCCESS The call to IDAQuadSensInit was successful.

IDA MEM NULL The idas memory was not initialized by a prior call to IDACreate.

IDA MEM FAIL A memory allocation request failed.

IDA NO SENS The sensitivities were not initialized by a prior call to IDASensInit.

IDA ILL INPUT The parameter yQS0 is NULL.

Notes Before calling IDAQuadSensInit, the user must enable the sensitivites by calling

IDASensInit.

If an error occurred, IDAQuadSensInit also sends an error message to the error handler

function.

In terms of the number of quadrature variables Nqand maximum method order maxord, the size of

the real workspace is increased as follows:

•Base value: lenrw =lenrw + (maxord+5)Nq

•If IDAQuadSensSVtolerances is called: lenrw =lenrw +NqNs

and the size of the integer workspace is increased as follows:

•Base value: leniw =leniw + (maxord+5)Nq

•If IDAQuadSensSVtolerances is called: leniw =leniw +NqNs

112 Using IDAS for Forward Sensitivity Analysis

The function IDAQuadSensReInit, useful during the solution of a sequence of problems of same

size, reinitializes the quadrature related internal memory and must follow a call to IDAQuadSensInit.

The number Nq of quadratures as well as the number Ns of sensitivities are assumed to be unchanged

from the prior call to IDAQuadSensInit. The call to the IDAQuadSensReInit function has the form:

IDAQuadSensReInit

Call flag = IDAQuadSensReInit(ida mem, yQS0);

Description The function IDAQuadSensReInit provides required problem speciﬁcations and reini-

tializes the sensitivity-dependent quadrature integration.

Arguments ida mem (void *) pointer to the idas memory block.

yQS0 (N Vector *) contains the initial values of sensitivity-dependent quadratures.

Return value The return value flag (of type int) will be one of the following:

IDA SUCCESS The call to IDAQuadSensReInit was successful.

IDA MEM NULL The idas memory was not initialized by a prior call to IDACreate.

IDA NO SENS Memory space for the sensitivity calculation was not allocated by a

prior call to IDASensInit.

IDA NO QUADSENS Memory space for the sensitivity quadratures integration was not

allocated by a prior call to IDAQuadSensInit.

IDA ILL INPUT The parameter yQS0 is NULL.

Notes If an error occurred, IDAQuadSensReInit also sends an error message to the error

handler function.

IDAQuadSensFree

Call IDAQuadSensFree(ida mem);

Description The function IDAQuadSensFree frees the memory allocated for sensitivity quadrature

integration.

Arguments The argument is the pointer to the idas memory block (of type void *).

Return value The function IDAQuadSensFree has no return value.

Notes In general, IDAQuadSensFree need not be called by the user as it is called automatically

by IDAFree.

5.4.2 IDAS solver function

Even if quadrature integration was enabled, the call to the main solver function IDASolve is exactly the

same as in §4.5.7. However, in this case the return value flag can also be one of the following:

IDA QSRHS FAIL The sensitivity quadrature right-hand side function failed in an unrecoverable

manner.

IDA FIRST QSRHS ERR The sensitivity quadrature right-hand side function failed at the ﬁrst call.

IDA REP QSRHS ERR Convergence test failures occurred too many times due to repeated recover-

able errors in the quadrature right-hand side function. The IDA REP RES ERR

will also be returned if the quadrature right-hand side function had repeated

recoverable errors during the estimation of an initial step size (assuming the

sensitivity quadrature variables are included in the error tests).

5.4.3 Sensitivity-dependent quadrature extraction functions

If sensitivity-dependent quadratures have been initialized by a call to IDAQuadSensInit, or reinitial-

ized by a call to IDAQuadSensReInit, then idas computes a solution, sensitivities, and quadratures

depending on sensitivities at time t. However, IDASolve will still return only the solutions yand ˙y.

Sensitivity-dependent quadratures can be obtained using one of the following functions:

5.4 Integration of quadrature equations depending on forward sensitivities 113

IDAGetQuadSens

Call flag = IDAGetQuadSens(ida mem, &tret, yQS);

Description The function IDAGetQuadSens returns the quadrature sensitivity solution vectors after

a successful return from IDASolve.

Arguments ida mem (void *) pointer to the memory previously allocated by IDAInit.

tret (realtype) the time reached by the solver (output).

yQS (N Vector *) array of Ns computed sensitivity-dependent quadrature vectors.

Return value The return value flag of IDAGetQuadSens is one of:

IDA SUCCESS IDAGetQuadSens was successful.

IDA MEM NULL ida mem was NULL.

IDA NO SENS Sensitivities were not activated.

IDA NO QUADSENS Quadratures depending on the sensitivities were not activated.

IDA BAD DKY yQS or one of the yQS[i] is NULL.

The function IDAGetQuadSensDky computes the k-th derivatives of the interpolating polynomials for

the sensitivity-dependent quadrature variables at time t. This function is called by IDAGetQuadSens

with k=0, but may also be called directly by the user.

IDAGetQuadSensDky

Call flag = IDAGetQuadSensDky(ida mem, t, k, dkyQS);

Description The function IDAGetQuadSensDky returns derivatives of the quadrature sensitivities

solution vectors after a successful return from IDASolve.

Arguments ida mem (void *) pointer to the memory previously allocated by IDAInit.

t(realtype) the time at which information is requested. The time tmust fall

within the interval deﬁned by the last successful step taken by idas.

k(int) order of the requested derivative.

dkyQS (N Vector *) array of Ns vectors containing the derivatives. This vector array

must be allocated by the user.

Return value The return value flag of IDAGetQuadSensDky is one of:

IDA SUCCESS IDAGetQuadSensDky succeeded.

IDA MEM NULL ida mem was NULL.

IDA NO SENS Sensitivities were not activated.

IDA NO QUADSENS Quadratures depending on the sensitivities were not activated.

IDA BAD DKY dkyQS or one of the vectors dkyQS[i] is NULL.

IDA BAD K k is not in the range 0,1, ..., klast.

IDA BAD T The time tis not in the allowed range.

Quadrature sensitivity solution vectors can also be extracted separately for each parameter in turn

through the functions IDAGetQuadSens1 and IDAGetQuadSensDky1, deﬁned as follows:

IDAGetQuadSens1

Call flag = IDAGetQuadSens1(ida mem, &tret, is, yQS);

Description The function IDAGetQuadSens1 returns the is-th sensitivity of quadratures after a

successful return from IDASolve.

Arguments ida mem (void *) pointer to the memory previously allocated by IDAInit.

tret (realtype) the time reached by the solver (output).

is (int) speciﬁes which sensitivity vector is to be returned (0 ≤is< Ns).

yQS (N Vector) the computed sensitivity-dependent quadrature vector.

114 Using IDAS for Forward Sensitivity Analysis

Return value The return value flag of IDAGetQuadSens1 is one of:

IDA SUCCESS IDAGetQuadSens1 was successful.

IDA MEM NULL ida mem was NULL.

IDA NO SENS Forward sensitivity analysis was not initialized.

IDA NO QUADSENS Quadratures depending on the sensitivities were not activated.

IDA BAD IS The index is is not in the allowed range.

IDA BAD DKY yQS is NULL.

IDAGetQuadSensDky1

Call flag = IDAGetQuadSensDky1(ida mem, t, k, is, dkyQS);

Description The function IDAGetQuadSensDky1 returns the k-th derivative of the is-th sensitivity

solution vector after a successful return from IDASolve.

Arguments ida mem (void *) pointer to the memory previously allocated by IDAInit.

t(realtype) speciﬁes the time at which sensitivity information is requested.

The time tmust fall within the interval deﬁned by the last successful step

taken by idas.

k(int) order of derivative.

is (int) speciﬁes the sensitivity derivative vector to be returned (0 ≤is< Ns).

dkyQS (N Vector) the vector containing the derivative. The space for dkyQS must be

allocated by the user.

Return value The return value flag of IDAGetQuadSensDky1 is one of:

IDA SUCCESS IDAGetQuadDky1 succeeded.

IDA MEM NULL ida mem was NULL.

IDA NO SENS Forward sensitivity analysis was not initialized.

IDA NO QUADSENS Quadratures depending on the sensitivities were not activated.

IDA BAD DKY dkyQS is NULL.

IDA BAD IS The index is is not in the allowed range.

IDA BAD K k is not in the range 0,1, ..., klast.

IDA BAD T The time tis not in the allowed range.

5.4.4 Optional inputs for sensitivity-dependent quadrature integration

idas provides the following optional input functions to control the integration of sensitivity-dependent

quadrature equations.

IDASetQuadSensErrCon

Call flag = IDASetQuadSensErrCon(ida mem, errconQS)

Description The function IDASetQuadSensErrCon speciﬁes whether or not the quadrature variables

are to be used in the local error control mechanism. If they are, the user must specify

the error tolerances for the quadrature variables by calling IDAQuadSensSStolerances,

IDAQuadSensSVtolerances, or IDAQuadSensEEtolerances.

Arguments ida mem (void *) pointer to the idas memory block.

errconQS (booleantype) speciﬁes whether sensitivity quadrature variables are included

(SUNTRUE) or not (SUNFALSE) in the error control mechanism.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

5.4 Integration of quadrature equations depending on forward sensitivities 115

IDA NO SENS Sensitivities were not activated.

IDA NO QUADSENS Quadratures depending on the sensitivities were not activated.

Notes By default, errconQS is set to SUNFALSE.

It is illegal to call IDASetQuadSensErrCon before a call to IDAQuadSensInit.

If the quadrature variables are part of the step size control mechanism, one of the following

functions must be called to specify the integration tolerances for quadrature variables.

IDAQuadSensSStolerances

Call flag = IDAQuadSensSVtolerances(ida mem, reltolQS, abstolQS);

Description The function IDAQuadSensSStolerances speciﬁes scalar relative and absolute toler-

ances.

Arguments ida mem (void *) pointer to the idas memory block.

reltolQS (realtype) is the scalar relative error tolerance.

abstolQS (realtype*) is a pointer to an array containing the Ns scalar absolute error

tolerances.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA NO SENS Sensitivities were not activated.

IDA NO QUADSENS Quadratures depending on the sensitivities were not activated.

IDA ILL INPUT One of the input tolerances was negative.

IDAQuadSensSVtolerances

Call flag = IDAQuadSensSVtolerances(ida mem, reltolQS, abstolQS);

Description The function IDAQuadSensSVtolerances speciﬁes scalar relative and vector absolute

tolerances.

Arguments ida mem (void *) pointer to the idas memory block.

reltolQS (realtype) is the scalar relative error tolerance.

abstolQS (N Vector*) is an array of Ns variables of type N Vector. The N Vector from

abstolS[is] speciﬁes the vector tolerances for is-th quadrature sensitivity.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional value has been successfully set.

IDA NO QUAD Quadrature integration was not initialized.

IDA MEM NULL The ida mem pointer is NULL.

IDA NO SENS Sensitivities were not activated.

IDA NO QUADSENS Quadratures depending on the sensitivities were not activated.

IDA ILL INPUT One of the input tolerances was negative.

IDAQuadSensEEtolerances

Call flag = IDAQuadSensEEtolerances(ida mem);

Description The function IDAQuadSensEEtolerances speciﬁes that the tolerances for the sensitivity-

dependent quadratures should be estimated from those provided for the pure quadrature

variables.

Arguments ida mem (void *) pointer to the idas memory block.

Return value The return value flag (of type int) is one of:

116 Using IDAS for Forward Sensitivity Analysis

IDA SUCCESS The optional value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA NO SENS Sensitivities were not activated.

IDA NO QUADSENS Quadratures depending on the sensitivities were not activated.

Notes When IDAQuadSensEEtolerances is used, before calling IDASolve, integration of pure

quadratures must be initialized (see 4.7.1) and tolerances for pure quadratures must be

also speciﬁed (see 4.7.4).

5.4.5 Optional outputs for sensitivity-dependent quadrature integration

idas provides the following functions that can be used to obtain solver performance information

related to quadrature integration.

IDAGetQuadSensNumRhsEvals

Call flag = IDAGetQuadSensNumRhsEvals(ida mem, &nrhsQSevals);

Description The function IDAGetQuadSensNumRhsEvals returns the number of calls made to the

user’s quadrature right-hand side function.

Arguments ida mem (void *) pointer to the idas memory block.

nrhsQSevals (long int) number of calls made to the user’s rhsQS function.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA NO QUADSENS Sensitivity-dependent quadrature integration has not been initialized.

IDAGetQuadSensNumErrTestFails

Call flag = IDAGetQuadSensNumErrTestFails(ida mem, &nQSetfails);

Description The function IDAGetQuadSensNumErrTestFails returns the number of local error test

failures due to quadrature variables.

Arguments ida mem (void *) pointer to the idas memory block.

nQSetfails (long int) number of error test failures due to quadrature variables.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA NO QUADSENS Sensitivity-dependent quadrature integration has not been initialized.

IDAGetQuadSensErrWeights

Call flag = IDAGetQuadSensErrWeights(ida mem, eQSweight);

Description The function IDAGetQuadSensErrWeights returns the quadrature error weights at the

current time.

Arguments ida mem (void *) pointer to the idas memory block.

eQSweight (N Vector *) array of quadrature error weight vectors at the current time.

Return value The return value flag (of type int) is one of:

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA NO QUADSENS Sensitivity-dependent quadrature integration has not been initialized.

5.4 Integration of quadrature equations depending on forward sensitivities 117

Notes The user must allocate memory for eQSweight.

If quadratures were not included in the error control mechanism (through a call to

IDASetQuadSensErrCon with errconQS=SUNTRUE), IDAGetQuadSensErrWeights does

not set the eQSweight vector.

IDAGetQuadSensStats

Call flag = IDAGetQuadSensStats(ida mem, &nrhsQSevals, &nQSetfails);

Description The function IDAGetQuadSensStats returns the idas integrator statistics as a group.

Arguments ida mem (void *) pointer to the idas memory block.

nrhsQSevals (long int) number of calls to the user’s rhsQS function.

nQSetfails (long int) number of error test failures due to quadrature variables.

Return value The return value flag (of type int) is one of

IDA SUCCESS the optional output values have been successfully set.

IDA MEM NULL the ida mem pointer is NULL.

IDA NO QUADSENS Sensitivity-dependent quadrature integration has not been initialized.

5.4.6 User-supplied function for sensitivity-dependent quadrature integra-

tion

For the integration of sensitivity-dependent quadrature equations, the user must provide a function

that deﬁnes the right-hand side of the sensitivity quadrature equations. For sensitivities of quadratures

(2.10) with integrands q, the appropriate right-hand side functions are given by ¯qi= (∂q/∂y)si+

(∂q/∂ ˙y) ˙si+∂q/∂pi. This user function must be of type IDAQuadSensRhsFn, deﬁned as follows:

IDAQuadSensRhsFn

Deﬁnition typedef int (*IDAQuadSensRhsFn)(int Ns, realtype t, N Vector yy,

N Vector yp, N Vector *yyS, N Vector *ypS,

N Vector rrQ, N Vector *rhsvalQS,

void *user data, N Vector tmp1,

N Vector tmp2, N Vector tmp3)

Purpose This function computes the sensitivity quadrature equation right-hand side for a given

value of the independent variable tand state vector y.

Arguments Ns is the number of sensitivity vectors.

tis the current value of the independent variable.

yy is the current value of the dependent variable vector, y(t).

yp is the current value of the dependent variable vector, ˙y(t).

yyS is an array of Ns variables of type N Vector containing the dependent sen-

sitivity vectors si.

ypS is an array of Ns variables of type N Vector containing the dependent sen-

sitivity derivatives ˙si.

rrQ is the current value of the quadrature right-hand side q.

rhsvalQS contains the Ns output vectors.

user data is the user data pointer passed to IDASetUserData.

tmp1

tmp2

tmp3 are N Vectors which can be used as temporary storage.

118 Using IDAS for Forward Sensitivity Analysis

Return value An IDAQuadSensRhsFn should return 0 if successful, a positive value if a recoverable

error occurred (in which case idas will attempt to correct), or a negative value if it failed

unrecoverably (in which case the integration is halted and IDA QRHS FAIL is returned).

Notes Allocation of memory for rhsvalQS is automatically handled within idas.

Both yy and yp are of type N Vector and both yyS and ypS are pointers to an array

containing Ns vectors of type N Vector. It is the user’s responsibility to access the vector

data consistently (including the use of the correct accessor macros from each nvector

implementation). For the sake of computational eﬃciency, the vector functions in the

two nvector implementations provided with idas do not perform any consistency

checks with respect to their N Vector arguments (see §7.2 and §7.3).

There is one situation in which recovery is not possible even if IDAQuadSensRhsFn

function returns a recoverable error ﬂag. That is when this occurs at the very ﬁrst call

to the IDAQuadSensRhsFn, in which case idas returns IDA FIRST QSRHS ERR).

5.5 Note on using partial error control

For some problems, when sensitivities are excluded from the error control test, the behavior of idas

may appear at ﬁrst glance to be erroneous. One would expect that, in such cases, the sensitivity

variables would not inﬂuence in any way the step size selection.

The short explanation of this behavior is that the step size selection implemented by the error

control mechanism in idas is based on the magnitude of the correction calculated by the nonlinear

solver. As mentioned in §5.2.1, even with partial error control selected in the call to IDASensInit,

the sensitivity variables are included in the convergence tests of the nonlinear solver.

When using the simultaneous corrector method (§2.5), the nonlinear system that is solved at each

step involves both the state and sensitivity equations. In this case, it is easy to see how the sensitivity

variables may aﬀect the convergence rate of the nonlinear solver and therefore the step size selection.

The case of the staggered corrector approach is more subtle. The sensitivity variables at a given

step are computed only once the solver for the nonlinear state equations has converged. However, if

the nonlinear system corresponding to the sensitivity equations has convergence problems, idas will

attempt to improve the initial guess by reducing the step size in order to provide a better prediction

of the sensitivity variables. Moreover, even if there are no convergence failures in the solution of the

sensitivity system, idas may trigger a call to the linear solver’s setup routine which typically involves

reevaluation of Jacobian information (Jacobian approximation in the case of idadense and idaband,

or preconditioner data in the case of the Krylov solvers). The new Jacobian information will be used

by subsequent calls to the nonlinear solver for the state equations and, in this way, potentially aﬀect

the step size selection.

When using the simultaneous corrector method it is not possible to decide whether nonlinear solver

convergence failures or calls to the linear solver setup routine have been triggered by convergence

problems due to the state or the sensitivity equations. When using one of the staggered corrector

methods, however, these situations can be identiﬁed by carefully monitoring the diagnostic information

provided through optional outputs. If there are no convergence failures in the sensitivity nonlinear

solver, and none of the calls to the linear solver setup routine were made by the sensitivity nonlinear

solver, then the step size selection is not aﬀected by the sensitivity variables.

Finally, the user must be warned that the eﬀect of appending sensitivity equations to a given system

of DAEs on the step size selection (through the mechanisms described above) is problem-dependent

and can therefore lead to either an increase or decrease of the total number of steps that idas takes to

complete the simulation. At ﬁrst glance, one would expect that the impact of the sensitivity variables,

if any, would be in the direction of increasing the step size and therefore reducing the total number

of steps. The argument for this is that the presence of the sensitivity variables in the convergence

test of the nonlinear solver can only lead to additional iterations (and therefore a smaller iteration

error), or to additional calls to the linear solver setup routine (and therefore more up-to-date Jacobian

information), both of which will lead to larger steps being taken by idas. However, this is true only

5.5 Note on using partial error control 119

locally. Overall, a larger integration step taken at a given time may lead to step size reductions at

later times, due to either nonlinear solver convergence failures or error test failures.

Chapter 6

Using IDAS for Adjoint Sensitivity

Analysis

This chapter describes the use of idas to compute sensitivities of derived functions using adjoint sensi-

tivity analysis. As mentioned before, the adjoint sensitivity module of idas provides the infrastructure

for integrating backward in time any system of DAEs that depends on the solution of the original IVP,

by providing various interfaces to the main idas integrator, as well as several supporting user-callable

functions. For this reason, in the following sections we refer to the backward problem and not to the

adjoint problem when discussing details relevant to the DAEs that are integrated backward in time.

The backward problem can be the adjoint problem (2.20) or (2.25), and can be augmented with some

quadrature diﬀerential equations.

idas uses various constants for both input and output. These are deﬁned as needed in this chapter,

but for convenience are also listed separately in Appendix B.

We begin with a brief overview, in the form of a skeleton user program. Following that are detailed

descriptions of the interface to the various user-callable functions and of the user-supplied functions

that were not already described in Chapter 4.

6.1 A skeleton of the user’s main program

The following is a skeleton of the user’s main program as an application of idas. The user program

is to have these steps in the order indicated, unless otherwise noted. For the sake of brevity, we defer

many of the details to the later sections. As in §4.4, most steps are independent of the nvector,

sunmatrix,sunlinsol, and sunnonlinsol implementations used. For the steps that are not, refer

to Chapters 7,8,9, and 10 for the speciﬁc name of the function to be called or macro to be referenced.

Steps that are unchanged from the skeleton programs presented in §4.4,§5.1, and §5.4, are grayed

out.

1. Include necessary header ﬁles

The idas.h header ﬁle also deﬁnes additional types, constants, and function prototypes for the

adjoint sensitivity module user-callable functions. In addition, the main program should include an

nvector implementation header ﬁle (for the particular implementation used) and, if a nonlinear

solver requiring a linear solver (e.g., the default Newton iteration) will be used, the header ﬁle of

the desired linear solver module.

2. Initialize parallel or multi-threaded environment

Forward problem

3. Set problem dimensions etc. for the forward problem

122 Using IDAS for Adjoint Sensitivity Analysis

4. Set initial conditions for the forward problem

5. Create idas object for the forward problem

6. Initialize idas solver for the forward problem

7. Specify integration tolerances for forward problem

8. Set optional inputs for the forward problem

9. Create matrix object for the forward problem

10. Create linear solver object for the forward problem

11. Set linear solver optional inputs for the forward problem

12. Attach linear solver module for the forward problem

13. Create nonlinear solver module for the forward problem

14. Attach nonlinear solver module for the forward problem

15. Set nonlinear solver optional inputs for the forward problem

16. Initialize quadrature problem or problems for forward problems, using IDAQuadInit

and/or IDAQuadSensInit.

17. Initialize forward sensitivity problem

18. Specify rootﬁnding

19. Allocate space for the adjoint computation

Call IDAAdjInit() to allocate memory for the combined forward-backward problem (see §6.2.1

for details). This call requires Nd, the number of steps between two consecutive checkpoints.

IDAAdjInit also speciﬁes the type of interpolation used (see §2.6.3).

20. Integrate forward problem

Call IDASolveF, a wrapper for the idas main integration function IDASolve, either in IDA NORMAL

mode to the time tout or in IDA ONE STEP mode inside a loop (if intermediate solutions of the

forward problem are desired (see §6.2.3)). The ﬁnal value of tret is then the maximum allowable

value for the endpoint Tof the backward problem.

Backward problem(s)

21. Set problem dimensions etc. for the backward problem

This generally includes NB, the number of variables in the backward problem and possibly the

local vector length NBlocal.

22. Set initial values for the backward problem

Set the endpoint time tB0 =T, and set the corresponding vectors yB0 and ypB0 at which the

backward problem starts.

23. Create the backward problem

Call IDACreateB, a wrapper for IDACreate, to create the idas memory block for the new backward

problem. Unlike IDACreate, the function IDACreateB does not return a pointer to the newly

created memory block (see §6.2.4). Instead, this pointer is attached to the internal adjoint memory

block (created by IDAAdjInit) and returns an identiﬁer called which that the user must later

specify in any actions on the newly created backward problem.

6.1 A skeleton of the user’s main program 123

24. Allocate memory for the backward problem

Call IDAInitB (or IDAInitBS, when the backward problem depends on the forward sensitivi-

ties). The two functions are actually wrappers for IDAInit and allocate internal memory, specify

problem data, and initialize idas at tB0 for the backward problem (see §6.2.4).

25. Specify integration tolerances for backward problem

Call IDASStolerancesB(...) or IDASVtolerancesB(...) to specify a scalar relative tolerance

and scalar absolute tolerance, or a scalar relative tolerance and a vector of absolute tolerances,

respectively. The functions are wrappers for IDASStolerances(...) and IDASVtolerances(...)

but they require an extra argument which, the identiﬁer of the backward problem returned by

IDACreateB. See §6.2.5 for more information.

26. Set optional inputs for the backward problem

Call IDASet*B functions to change from their default values any optional inputs that control the

behavior of idas. Unlike their counterparts for the forward problem, these functions take an extra

argument which, the identiﬁer of the backward problem returned by IDACreateB (see §6.2.9).

27. Create matrix object for the backward problem

If a nonlinear solver requiring a linear solve will be used (e.g., the the default Newton iteration) and

the linear solver will be a direct linear solver, then a template Jacobian matrix must be created by

calling the appropriate constructor function deﬁned by the particular sunmatrix implementation.

NOTE: The dense, banded, and sparse matrix objects are usable only in a serial or threaded

environment.

Note also that it is not required to use the same matrix type for both the forward and the backward

problems.

28. Create linear solver object for the backward problem

If a nonlinear solver requiring a linear solver is chosen (e.g., the default Newton iteration), then the

desired linear solver object for the backward problem must be created by calling the appropriate

constructor function deﬁned by the particular sunlinsol implementation.

Note that it is not required to use the same linear solver module for both the forward and the

backward problems; for example, the forward problem could be solved with the sunlinsol dense

linear solver module and the backward problem with sunlinsol spgmr linear solver module.

29. Set linear solver interface optional inputs for the backward problem

Call IDASet*B functions to change optional inputs speciﬁc to the linear solver interface. See §6.2.9

for details.

30. Attach linear solver module for the backward problem

If a nonlinear solver requiring a linear solver is chosen for the backward problem (e.g., the default

Newton iteration), then initialize the idals linear solver interface by attaching the linear solver

object (and matrix object, if applicable) with the following call (for details see §4.5.3):

ier = IDASetLinearSolverB(...);

31. Create nonlinear solver object for the backward problem (optional)

If using a non-default nonlinear solver for the backward problem, then create the desired nonlinear

solver object by calling the appropriate constructor function deﬁned by the particular sunnon-

linsol implementation e.g., NLSB = SUNNonlinSol ***(...); where *** is the name of the

nonlinear solver (see Chapter 10 for details).

32. Attach nonlinear solver module for the backward problem (optional)

124 Using IDAS for Adjoint Sensitivity Analysis

If using a non-default nonlinear solver for the backward problem, then initialize the nonlinear

solver interface by attaching the nonlinear solver object by calling

ier = IDASetNonlinearSolverB(idaode mem, NLSB); (see §4.5.4 for details).

33. Initialize quadrature calculation

If additional quadrature equations must be evaluated, call IDAQuadInitB or IDAQuadInitBS (if

quadrature depends also on the forward sensitivities) as shown in §6.2.11.1. These functions are

wrappers around IDAQuadInit and can be used to initialize and allocate memory for quadrature

integration. Optionally, call IDASetQuad*B functions to change from their default values optional

inputs that control the integration of quadratures during the backward phase.

34. Integrate backward problem

Call IDASolveB, a second wrapper around the idas main integration function IDASolve, to inte-

grate the backward problem from tB0 (see §6.2.8). This function can be called either in IDA NORMAL

or IDA ONE STEP mode. Typically, IDASolveB will be called in IDA NORMAL mode with an end time

equal to the initial time t0of the forward problem.

35. Extract quadrature variables

If applicable, call IDAGetQuadB, a wrapper around IDAGetQuad, to extract the values of the quadra-

ture variables at the time returned by the last call to IDASolveB. See §6.2.11.2.

36. Deallocate memory

Upon completion of the backward integration, call all necessary deallocation functions. These

include appropriate destructors for the vectors yand yB, a call to IDAFree to free the idas

memory block for the forward problem. If one or more additional adjoint sensitivity analyses are

to be done for this problem, a call to IDAAdjFree (see §6.2.1) may be made to free and deallocate

the memory allocated for the backward problems, followed by a call to IDAAdjInit.

37. Free the nonlinear solver memory for the forward and backward problems

38. Free linear solver and matrix memory for the forward and backward problems

39. Finalize MPI, if used

The above user interface to the adjoint sensitivity module in idas was motivated by the desire to

keep it as close as possible in look and feel to the one for DAE IVP integration. Note that if steps

(21)-(35) are not present, a program with the above structure will have the same functionality as one

described in §4.4 for integration of DAEs, albeit with some overhead due to the checkpointing scheme.

If there are multiple backward problems associated with the same forward problem, repeat steps

(21)-(35) above for each successive backward problem. In the process, each call to IDACreateB creates

a new value of the identiﬁer which.

6.2 User-callable functions for adjoint sensitivity analysis

6.2.1 Adjoint sensitivity allocation and deallocation functions

After the setup phase for the forward problem, but before the call to IDASolveF, memory for the

combined forward-backward problem must be allocated by a call to the function IDAAdjInit. The

form of the call to this function is

IDAAdjInit

Call flag = IDAAdjInit(ida mem, Nd, interpType);

6.2 User-callable functions for adjoint sensitivity analysis 125

Description The function IDAAdjInit updates idas memory block by allocating the internal memory

needed for backward integration. Space is allocated for the Nd =Ndinterpolation data

points, and a linked list of checkpoints is initialized.

Arguments ida mem (void *) is the pointer to the idas memory block returned by a previous

call to IDACreate.

Nd (long int) is the number of integration steps between two consecutive

checkpoints.

interpType (int) speciﬁes the type of interpolation used and can be IDA POLYNOMIAL

or IDA HERMITE, indicating variable-degree polynomial and cubic Hermite

interpolation, respectively (see §2.6.3).

Return value The return value flag (of type int) is one of:

IDA SUCCESS IDAAdjInit was successful.

IDA MEM FAIL A memory allocation request has failed.

IDA MEM NULL ida mem was NULL.

IDA ILL INPUT One of the parameters was invalid: Nd was not positive or interpType

is not one of the IDA POLYNOMIAL or IDA HERMITE.

Notes The user must set Nd so that all data needed for interpolation of the forward problem

solution between two checkpoints ﬁts in memory. IDAAdjInit attempts to allocate

space for (2Nd+3) variables of type N Vector.

If an error occurred, IDAAdjInit also sends a message to the error handler function.

IDAAdjReInit

Call flag = IDAAdjReInit(ida mem);

Description The function IDAAdjReInit reinitializes the idas memory block for ASA, assuming

that the number of steps between check points and the type of interpolation remain

unchanged.

Arguments ida mem (void *) is the pointer to the idas memory block returned by a previous call

to IDACreate.

Return value The return value flag (of type int) is one of:

IDA SUCCESS IDAAdjReInit was successful.

IDA MEM NULL ida mem was NULL.

IDA NO ADJ The function IDAAdjInit was not previously called.

Notes The list of check points (and associated memory) is deleted.

The list of backward problems is kept. However, new backward problems can be added

to this list by calling IDACreateB. If a new list of backward problems is also needed, then

free the adjoint memory (by calling IDAAdjFree) and reinitialize ASA with IDAAdjInit.

The idas memory for the forward and backward problems can be reinitialized separately

by calling IDAReInit and IDAReInitB, respectively.

IDAAdjFree

Call IDAAdjFree(ida mem);

Description The function IDAAdjFree frees the memory related to backward integration allocated

by a previous call to IDAAdjInit.

Arguments The only argument is the idas memory block pointer returned by a previous call to

IDACreate.

Return value The function IDAAdjFree has no return value.

126 Using IDAS for Adjoint Sensitivity Analysis

Notes This function frees all memory allocated by IDAAdjInit. This includes workspace

memory, the linked list of checkpoints, memory for the interpolation data, as well as

the idas memory for the backward integration phase.

Unless one or more further calls to IDAAdjInit are to be made, IDAAdjFree should not

be called by the user, as it is invoked automatically by IDAFree.

6.2.2 Adjoint sensitivity optional input

At any time during the integration of the forward problem, the user can disable the checkpointing of

the forward sensitivities by calling the following function:

IDAAdjSetNoSensi

Call flag = IDAAdjSetNoSensi(ida mem);

Description The function IDAAdjSetNoSensi instructs IDASolveF not to save checkpointing data

for forward sensitivities any more.

Arguments ida mem (void *) pointer to the idas memory block.

Return value The return flag (of type int) is one of:

IDA SUCCESS The call to IDACreateB was successful.

IDA MEM NULL The ida mem was NULL.

IDA NO ADJ The function IDAAdjInit has not been previously called.

6.2.3 Forward integration function

The function IDASolveF is very similar to the idas function IDASolve (see §4.5.7) in that it integrates

the solution of the forward problem and returns the solution (y, ˙y). At the same time, however,

IDASolveF stores checkpoint data every Nd integration steps. IDASolveF can be called repeatedly

by the user. Note that IDASolveF is used only for the forward integration pass within an Adjoint

Sensitivity Analysis. It is not for use in Forward Sensitivity Analysis; for that, see Chapter 5. The

call to this function has the form

IDASolveF

Call flag = IDASolveF(ida mem, tout, &tret, yret, ypret, itask, &ncheck);

Description The function IDASolveF integrates the forward problem over an interval in tand saves

checkpointing data.

Arguments ida mem (void *) pointer to the idas memory block.

tout (realtype) the next time at which a computed solution is desired.

tret (realtype) the time reached by the solver (output).

yret (N Vector) the computed solution vector y.

ypret (N Vector) the computed solution vector ˙y.

itask (int) a ﬂag indicating the job of the solver for the next step. The IDA NORMAL

task is to have the solver take internal steps until it has reached or just passed

the user-speciﬁed tout parameter. The solver then interpolates in order to

return an approximate value of y(tout) and ˙y(tout). The IDA ONE STEP option

tells the solver to take just one internal step and return the solution at the

point reached by that step.

ncheck (int) the number of (internal) checkpoints stored so far.

Return value On return, IDASolveF returns vectors yret,ypret and a corresponding independent

variable value t=tret, such that yret is the computed value of y(t) and ypret the

value of ˙y(t). Additionally, it returns in ncheck the number of internal checkpoints

6.2 User-callable functions for adjoint sensitivity analysis 127

saved; the total number of checkpoint intervals is ncheck+1. The return value flag (of

type int) will be one of the following. For more details see §4.5.7.

IDA SUCCESS IDASolveF succeeded.

IDA TSTOP RETURN IDASolveF succeeded by reaching the optional stopping point.

IDA ROOT RETURN IDASolveF succeeded and found one or more roots. In this case,

tret is the location of the root. If nrtfn >1, call IDAGetRootInfo

to see which giwere found to have a root.

IDA NO MALLOC The function IDAInit has not been previously called.

IDA ILL INPUT One of the inputs to IDASolveF is illegal.

IDA TOO MUCH WORK The solver took mxstep internal steps but could not reach tout.

IDA TOO MUCH ACC The solver could not satisfy the accuracy demanded by the user for

some internal step.

IDA ERR FAILURE Error test failures occurred too many times during one internal

time step or occurred with |h|=hmin.

IDA CONV FAILURE Convergence test failures occurred too many times during one in-

ternal time step or occurred with |h|=hmin.

IDA LSETUP FAIL The linear solver’s setup function failed in an unrecoverable man-

ner.

IDA LSOLVE FAIL The linear solver’s solve function failed in an unrecoverable manner.

IDA NO ADJ The function IDAAdjInit has not been previously called.

IDA MEM FAIL A memory allocation request has failed (in an attempt to allocate

space for a new checkpoint).

Notes All failure return values are negative and therefore a test flag<0 will trap all IDASolveF

failures.

At this time, IDASolveF stores checkpoint information in memory only. Future versions

will provide for a safeguard option of dumping checkpoint data into a temporary ﬁle as

needed. The data stored at each checkpoint is basically a snapshot of the idas internal

memory block and contains enough information to restart the integration from that

time and to proceed with the same step size and method order sequence as during the

forward integration.

In addition, IDASolveF also stores interpolation data between consecutive checkpoints

so that, at the end of this ﬁrst forward integration phase, interpolation information is

already available from the last checkpoint forward. In particular, if no checkpoints were

necessary, there is no need for the second forward integration phase.

It is illegal to change the integration tolerances between consecutive calls to IDASolveF,

as this information is not captured in the checkpoint data.

6.2.4 Backward problem initialization functions

The functions IDACreateB and IDAInitB (or IDAInitBS) must be called in the order listed. They

instantiate an idas solver object, provide problem and solution speciﬁcations, and allocate internal

memory for the backward problem.

IDACreateB

Call flag = IDACreateB(ida mem, &which);

Description The function IDACreateB instantiates an idas solver object for the backward problem.

Arguments ida mem (void *) pointer to the idas memory block returned by IDACreate.

which (int) contains the identiﬁer assigned by idas for the newly created backward

problem. Any call to IDA*B functions requires such an identiﬁer.

128 Using IDAS for Adjoint Sensitivity Analysis

Return value The return flag (of type int) is one of:

IDA SUCCESS The call to IDACreateB was successful.

IDA MEM NULL The ida mem was NULL.

IDA NO ADJ The function IDAAdjInit has not been previously called.

IDA MEM FAIL A memory allocation request has failed.

There are two initialization functions for the backward problem – one for the case when the

backward problem does not depend on the forward sensitivities, and one for the case when it does.

These two functions are described next.

The function IDAInitB initializes the backward problem when it does not depend on the for-

ward sensitivities. It is essentially wrapper for IDAInit with some particularization for backward

integration, as described below.

IDAInitB

Call flag = IDAInitB(ida mem, which, resB, tB0, yB0, ypB0);

Description The function IDAInitB provides problem speciﬁcation, allocates internal memory, and

initializes the backward problem.

Arguments ida mem (void *) pointer to the idas memory block returned by IDACreate.

which (int) represents the identiﬁer of the backward problem.

resB (IDAResFnB) is the Cfunction which computes fB, the residual of the back-

ward DAE problem. This function has the form resB(t, y, yp, yB, ypB,

resvalB, user dataB) (for full details see §6.3.1).

tB0 (realtype) speciﬁes the endpoint Twhere ﬁnal conditions are provided for the

backward problem, normally equal to the endpoint of the forward integration.

yB0 (N Vector) is the initial value (at t=tB0) of the backward solution.

ypB0 (N Vector) is the initial derivative value (at t=tB0) of the backward solution.

Return value The return flag (of type int) will be one of the following:

IDA SUCCESS The call to IDAInitB was successful.

IDA NO MALLOC The function IDAInit has not been previously called.

IDA MEM NULL The ida mem was NULL.

IDA NO ADJ The function IDAAdjInit has not been previously called.

IDA BAD TB0 The ﬁnal time tB0 was outside the interval over which the forward

problem was solved.

IDA ILL INPUT The parameter which represented an invalid identiﬁer, or one of yB0,

ypB0,resB was NULL.

Notes The memory allocated by IDAInitB is deallocated by the function IDAAdjFree.

For the case when backward problem also depends on the forward sensitivities, user must call

IDAInitBS instead of IDAInitB. Only the third argument of each function diﬀers between these

functions.

IDAInitBS

Call flag = IDAInitBS(ida mem, which, resBS, tB0, yB0, ypB0);

Description The function IDAInitBS provides problem speciﬁcation, allocates internal memory, and

initializes the backward problem.

Arguments ida mem (void *) pointer to the idas memory block returned by IDACreate.

which (int) represents the identiﬁer of the backward problem.

resBS (IDAResFnBS) is the Cfunction which computes f B, the residual or the back-

ward DAE problem. This function has the form resBS(t, y, yp, yS, ypS,

yB, ypB, resvalB, user dataB) (for full details see §6.3.2).

6.2 User-callable functions for adjoint sensitivity analysis 129

tB0 (realtype) speciﬁes the endpoint Twhere ﬁnal conditions are provided for

the backward problem.

yB0 (N Vector) is the initial value (at t=tB0) of the backward solution.

ypB0 (N Vector) is the initial derivative value (at t=tB0) of the backward solution.

Return value The return flag (of type int) will be one of the following:

IDA SUCCESS The call to IDAInitB was successful.

IDA NO MALLOC The function IDAInit has not been previously called.

IDA MEM NULL The ida mem was NULL.

IDA NO ADJ The function IDAAdjInit has not been previously called.

IDA BAD TB0 The ﬁnal time tB0 was outside the interval over which the forward

problem was solved.

IDA ILL INPUT The parameter which represented an invalid identiﬁer, or one of yB0,

ypB0,resB was NULL, or sensitivities were not active during the forward

integration.

Notes The memory allocated by IDAInitBS is deallocated by the function IDAAdjFree.

The function IDAReInitB reinitializes idas for the solution of a series of backward problems, each

identiﬁed by a value of the parameter which.IDAReInitB is essentially a wrapper for IDAReInit, and

so all details given for IDAReInit in §4.5.11 apply here. Also, IDAReInitB can be called to reinitialize

a backward problem even if it has been initialized with the sensitivity-dependent version IDAInitBS.

Before calling IDAReInitB for a new backward problem, call any desired solution extraction functions

IDAGet** associated with the previous backward problem. The call to the IDAReInitB function has

the form

IDAReInitB

Call flag = IDAReInitB(ida mem, which, tB0, yB0, ypB0)

Description The function IDAReInitB reinitializes an idas backward problem.

Arguments ida mem (void *) pointer to idas memory block returned by IDACreate.

which (int) represents the identiﬁer of the backward problem.

tB0 (realtype) speciﬁes the endpoint Twhere ﬁnal conditions are provided for

the backward problem.

yB0 (N Vector) is the initial value (at t=tB0) of the backward solution.

ypB0 (N Vector) is the initial derivative value (at t=tB0) of the backward solution.

Return value The return value flag (of type int) will be one of the following:

IDA SUCCESS The call to IDAReInitB was successful.

IDA NO MALLOC The function IDAInit has not been previously called.

IDA MEM NULL The ida mem memory block pointer was NULL.

IDA NO ADJ The function IDAAdjInit has not been previously called.

IDA BAD TB0 The ﬁnal time tB0 is outside the interval over which the forward problem

was solved.

IDA ILL INPUT The parameter which represented an invalid identiﬁer, or one of yB0,

ypB0 was NULL.

6.2.5 Tolerance speciﬁcation functions for backward problem

One of the following two functions must be called to specify the integration tolerances for the backward

problem. Note that this call must be made after the call to IDAInitB or IDAInitBS.

130 Using IDAS for Adjoint Sensitivity Analysis

IDASStolerancesB

Call flag = IDASStolerances(ida mem, which, reltolB, abstolB);

Description The function IDASStolerancesB speciﬁes scalar relative and absolute tolerances.

Arguments ida mem (void *) pointer to the idas memory block returned by IDACreate.

which (int) represents the identiﬁer of the backward problem.

reltolB (realtype) is the scalar relative error tolerance.

abstolB (realtype) is the scalar absolute error tolerance.

Return value The return flag (of type int) will be one of the following:

IDA SUCCESS The call to IDASStolerancesB was successful.

IDA MEM NULL The idas memory block was not initialized through a previous call to

IDACreate.

IDA NO MALLOC The allocation function IDAInit has not been called.

IDA NO ADJ The function IDAAdjInit has not been previously called.

IDA ILL INPUT One of the input tolerances was negative.

IDASVtolerancesB

Call flag = IDASVtolerancesB(ida mem, which, reltolB, abstolB);

Description The function IDASVtolerancesB speciﬁes scalar relative tolerance and vector absolute

tolerances.

Arguments ida mem (void *) pointer to the idas memory block returned by IDACreate.

which (int) represents the identiﬁer of the backward problem.

reltol (realtype) is the scalar relative error tolerance.

abstol (N Vector) is the vector of absolute error tolerances.

Return value The return flag (of type int) will be one of the following:

IDA SUCCESS The call to IDASVtolerancesB was successful.

IDA MEM NULL The idas memory block was not initialized through a previous call to

IDACreate.

IDA NO MALLOC The allocation function IDAInit has not been called.

IDA NO ADJ The function IDAAdjInit has not been previously called.

IDA ILL INPUT The relative error tolerance was negative or the absolute tolerance had

a negative component.

Notes This choice of tolerances is important when the absolute error tolerance needs to be

diﬀerent for each component of the DAE state vector y.

6.2.6 Linear solver initialization functions for backward problem

All idas linear solver modules available for forward problems are available for the backward problem.

They should be created as for the forward problem then attached to the memory structure for the

backward problem using the following function.

IDASetLinearSolverB

Call flag = IDASetLinearSolverB(ida mem, which, LS, A);

Description The function IDASetLinearSolverB attaches a generic sunlinsol object LS and cor-

responding template Jacobian sunmatrix object A(if applicable) to idas, initializing

the idals linear solver interface for solution of the backward problem.

Arguments ida mem (void *) pointer to the idas memory block.

which (int) represents the identiﬁer of the backward problem returned by IDACreateB.

6.2 User-callable functions for adjoint sensitivity analysis 131

LS (SUNLinearSolver)sunlinsol object to use for solving linear systems for the

backward problem.

A(SUNMatrix)sunmatrix object for used as a template for the Jacobian for

the backward problem (or NULL if not applicable).

Return value The return value flag (of type int) is one of

IDALS SUCCESS The idals initialization was successful.

IDALS MEM NULL The ida mem pointer is NULL.

IDALS ILL INPUT The idals interface is not compatible with the LS or Ainput objects

or is incompatible with the current nvector module.

IDALS MEM FAIL A memory allocation request failed.

IDALS NO ADJ The function IDAAdjInit has not been previously called.

IDALS ILL INPUT The parameter which represented an invalid identiﬁer.

Notes If LS is a matrix-based linear solver, then the template Jacobian matrix Awill be used

in the solve process, so if additional storage is required within the sunmatrix object

(e.g. for factorization of a banded matrix), ensure that the input object is allocated

with suﬃcient size (see the documentation of the particular sunmatrix type in Chapter

8for further information).

The previous routines IDADlsSetLinearSolverB and IDASpilsSetLinearSolverB are

now wrappers for this routine, and may still be used for backward-compatibility. How-

ever, these will be deprecated in future releases, so we recommend that users transition

to the new routine name soon.

6.2.7 Initial condition calculation functions for backward problem

idas provides support for calculation of consistent initial conditions for certain backward index-one

problems of semi-implicit form through the functions IDACalcICB and IDACalcICBS. Calling them is

optional. It is only necessary when the initial conditions do not satisfy the adjoint system.

The above functions provide the same functionality for backward problems as IDACalcIC with

parameter icopt =IDA YA YDP INIT provides for forward problems (see §4.5.5): compute the algebraic

components of yB and diﬀerential components of ˙yB, given the diﬀerential components of yB. They

require that the IDASetIdB was previously called to specify the diﬀerential and algebraic components.

Both functions require forward solutions at the ﬁnal time tB0.IDACalcICBS also needs forward

sensitivities at the ﬁnal time tB0.

IDACalcICB

Call flag = IDACalcICB(ida mem, which, tBout1, N Vector yfin, N Vector ypfin);

Description The function IDACalcICB corrects the initial values yB0 and ypB0 at time tB0 for the

backward problem.

Arguments ida mem (void *) pointer to the idas memory block.

which (int) is the identiﬁer of the backward problem.

tBout1 (realtype) is the ﬁrst value of tat which a solution will be requested (from

IDASolveB). This value is needed here only to determine the direction of inte-

gration and rough scale in the independent variable t.

yfin (N Vector) the forward solution at the ﬁnal time tB0.

ypfin (N Vector) the forward solution derivative at the ﬁnal time tB0.

Return value The return value flag (of type int) can be any that is returned by IDACalcIC (see

§4.5.5). However IDACalcICB can also return one of the following:

IDA NO ADJ IDAAdjInit has not been previously called.

IDA ILL INPUT Parameter which represented an invalid identiﬁer.

132 Using IDAS for Adjoint Sensitivity Analysis

Notes All failure return values are negative and therefore a test flag <0 will trap all

IDACalcICB failures.

Note that IDACalcICB will correct the values of yB(tB0) and ˙yB(tB0) which were

speciﬁed in the previous call to IDAInitB or IDAReInitB. To obtain the corrected values,

call IDAGetconsistentICB (see §6.2.10.2).

In the case where the backward problem also depends on the forward sensitivities, user must call

the following function to correct the initial conditions:

IDACalcICBS

Call flag = IDACalcICBS(ida mem, which, tBout1, N Vector yfin, N Vector ypfin,

N Vector ySfin, N Vector ypSfin);

Description The function IDACalcICBS corrects the initial values yB0 and ypB0 at time tB0 for the

backward problem.

Arguments ida mem (void *) pointer to the idas memory block.

which (int) is the identiﬁer of the backward problem.

tBout1 (realtype) is the ﬁrst value of tat which a solution will be requested (from

IDASolveB).This value is needed here only to determine the direction of inte-

gration and rough scale in the independent variable t.

yfin (N Vector) the forward solution at the ﬁnal time tB0.

ypfin (N Vector) the forward solution derivative at the ﬁnal time tB0.

ySfin (N Vector *) a pointer to an array of Ns vectors containing the sensitivities

of the forward solution at the ﬁnal time tB0.

ypSfin (N Vector *) a pointer to an array of Ns vectors containing the derivatives of

the forward solution sensitivities at the ﬁnal time tB0.

Return value The return value flag (of type int) can be any that is returned by IDACalcIC (see

§4.5.5). However IDACalcICBS can also return one of the following:

IDA NO ADJ IDAAdjInit has not been previously called.

IDA ILL INPUT Parameter which represented an invalid identiﬁer, sensitivities were not

active during forward integration, or IDAInitBS (or IDAReInitBS) has

not been previously called.

Notes All failure return values are negative and therefore a test flag <0 will trap all

IDACalcICBS failures.

Note that IDACalcICBS will correct the values of yB(tB0) and ˙yB(tB0) which were

speciﬁed in the previous call to IDAInitBS or IDAReInitBS. To obtain the corrected

values, call IDAGetConsistentICB (see §6.2.10.2).

6.2.8 Backward integration function

The function IDASolveB performs the integration of the backward problem. It is essentially a wrapper

for the idas main integration function IDASolve and, in the case in which checkpoints were needed,

it evolves the solution of the backward problem through a sequence of forward-backward integration

pairs between consecutive checkpoints. In each pair, the ﬁrst run integrates the original IVP forward

in time and stores interpolation data; the second run integrates the backward problem backward in

time and performs the required interpolation to provide the solution of the IVP to the backward

problem.

The function IDASolveB does not return the solution yB itself. To obtain that, call the function

IDAGetB, which is also described below.

The IDASolveB function does not support rootﬁnding, unlike IDASoveF, which supports the ﬁnding

of roots of functions of (t, y, ˙y). If rootﬁnding was performed by IDASolveF, then for the sake of

eﬃciency, it should be disabled for IDASolveB by ﬁrst calling IDARootInit with nrtfn = 0.

The call to IDASolveB has the form

6.2 User-callable functions for adjoint sensitivity analysis 133

IDASolveB

Call flag = IDASolveB(ida mem, tBout, itaskB);

Description The function IDASolveB integrates the backward DAE problem.

Arguments ida mem (void *) pointer to the idas memory returned by IDACreate.

tBout (realtype) the next time at which a computed solution is desired.

itaskB (int) a ﬂag indicating the job of the solver for the next step. The IDA NORMAL

task is to have the solver take internal steps until it has reached or just passed

the user-speciﬁed value tBout. The solver then interpolates in order to return

an approximate value of yB(tBout). The IDA ONE STEP option tells the solver

to take just one internal step in the direction of tBout and return.

Return value The return value flag (of type int) will be one of the following. For more details see

§4.5.7.

IDA SUCCESS IDASolveB succeeded.

IDA MEM NULL The ida mem was NULL.

IDA NO ADJ The function IDAAdjInit has not been previously called.

IDA NO BCK No backward problem has been added to the list of backward prob-

lems by a call to IDACreateB

IDA NO FWD The function IDASolveF has not been previously called.

IDA ILL INPUT One of the inputs to IDASolveB is illegal.

IDA BAD ITASK The itaskB argument has an illegal value.

IDA TOO MUCH WORK The solver took mxstep internal steps but could not reach tBout.

IDA TOO MUCH ACC The solver could not satisfy the accuracy demanded by the user for

some internal step.

IDA ERR FAILURE Error test failures occurred too many times during one internal

time step.

IDA CONV FAILURE Convergence test failures occurred too many times during one in-

ternal time step.

IDA LSETUP FAIL The linear solver’s setup function failed in an unrecoverable man-

ner.

IDA SOLVE FAIL The linear solver’s solve function failed in an unrecoverable manner.

IDA BCKMEM NULL The idas memory for the backward problem was not created with

a call to IDACreateB.

IDA BAD TBOUT The desired output time tBout is outside the interval over which

the forward problem was solved.

IDA REIFWD FAIL Reinitialization of the forward problem failed at the ﬁrst checkpoint

(corresponding to the initial time of the forward problem).

IDA FWD FAIL An error occurred during the integration of the forward problem.

Notes All failure return values are negative and therefore a test flag<0 will trap all IDASolveB

failures.

In the case of multiple checkpoints and multiple backward problems, a given call to

IDASolveB in IDA ONE STEP mode may not advance every problem one step, depending

on the relative locations of the current times reached. But repeated calls will eventually

advance all problems to tBout.

To obtain the solution yB to the backward problem, call the function IDAGetB as follows:

IDAGetB

Call flag = IDAGetB(ida mem, which, &tret, yB, ypB);

Description The function IDAGetB provides the solution yB of the backward DAE problem.

134 Using IDAS for Adjoint Sensitivity Analysis

Arguments ida mem (void *) pointer to the idas memory returned by IDACreate.

which (int) the identiﬁer of the backward problem.

tret (realtype) the time reached by the solver (output).

yB (N Vector) the backward solution at time tret.

ypB (N Vector) the backward solution derivative at time tret.

Return value The return value flag (of type int) will be one of the following.

IDA SUCCESS IDAGetB was successful.

IDA MEM NULL ida mem is NULL.

IDA NO ADJ The function IDAAdjInit has not been previously called.

IDA ILL INPUT The parameter which is an invalid identiﬁer.

Notes The user must allocate space for yB and ypB.

6.2.9 Optional input functions for the backward problem

6.2.9.1 Main solver optional input functions

The adjoint module in idas provides wrappers for most of the optional input functions deﬁned in

§4.5.8.1. The only diﬀerence is that the user must specify the identiﬁer which of the backward

problem within the list managed by idas.

The optional input functions deﬁned for the backward problem are:

flag = IDASetNonlinearSolverB(ida_mem, which, NLSB);

flag = IDASetUserDataB(ida_mem, which, user_dataB);

flag = IDASetMaxOrdB(ida_mem, which, maxordB);

flag = IDASetMaxNumStepsB(ida_mem, which, mxstepsB);

flag = IDASetInitStepB(ida_mem, which, hinB)

flag = IDASetMaxStepB(ida_mem, which, hmaxB);

flag = IDASetSuppressAlgB(ida_mem, which, suppressalgB);

flag = IDASetIdB(ida_mem, which, idB);

flag = IDASetConstraintsB(ida_mem, which, constraintsB);

Their return value flag (of type int) can have any of the return values of their counterparts, but it

can also be IDA NO ADJ if IDAAdjInit has not been called, or IDA ILL INPUT if which was an invalid

identiﬁer.

6.2.9.2 Linear solver interface optional input functions

When using matrix-based linear solver modules for the backward problem, i.e., a non-NULL sunmatrix

object Awas passed to IDASetLinearSolverB, the idals linear solver interface needs a function to

compute an approximation to the Jacobian matrix. This can be attached through a call to either

IDASetJacFnB or idaIDASetJacFnBS, with the second used when the backward problem depends on

the forward sensitivities.

IDASetJacFnB

Call flag = IDASetJacFnB(ida mem, which, jacB);

Description The function IDASetJacFnB speciﬁes the Jacobian approximation function to be used

for the backward problem.

Arguments ida mem (void *) pointer to the idas memory block.

which (int) represents the identiﬁer of the backward problem.

jacB (IDALsJacFnB) user-deﬁned Jacobian approximation function.

Return value The return value flag (of type int) is one of

IDALS SUCCESS IDASetJacFnB succeeded.

6.2 User-callable functions for adjoint sensitivity analysis 135

IDALS MEM NULL The ida mem was NULL.

IDALS NO ADJ The function IDAAdjInit has not been previously called.

IDALS LMEM NULL The linear solver has not been initialized with a call to

IDASetLinearSolverB.

IDALS ILL INPUT The parameter which represented an invalid identiﬁer.

Notes The function type IDALsJacFnB is described in §6.3.5.

The previous routine IDADlsSetJacFnB is now a wrapper for this routine, and may still

be used for backward-compatibility. However, this will be deprecated in future releases,

so we recommend that users transition to the new routine name soon.

IDASetJacFnBS

Call flag = IDASetJacFnBS(ida mem, which, jacBS);

Description The function IDASetJacFnBS speciﬁes the Jacobian approximation function to be used

for the backward problem in the case where the backward problem depends on the

forward sensitivities.

Arguments ida mem (void *) pointer to the idas memory block.

which (int) represents the identiﬁer of the backward problem.

jacBS (IDALJacFnBS) user-deﬁned Jacobian approximation function.

Return value The return value flag (of type int) is one of

IDALS SUCCESS IDASetJacFnBS succeeded.

IDALS MEM NULL The ida mem was NULL.

IDALS NO ADJ The function IDAAdjInit has not been previously called.

IDALS LMEM NULL The linear solver has not been initialized with a call to

IDASetLinearSolverBS.

IDALS ILL INPUT The parameter which represented an invalid identiﬁer.

Notes The function type IDALsJacFnBS is described in §6.3.5.

The previous routine IDADlsSetJacFnBS is now a wrapper for this routine, and may

still be used for backward-compatibility. However, this will be deprecated in future

releases, so we recommend that users transition to the new routine name soon.

When using a matrix-free linear solver module for the backward problem, the idals linear solver

interface requires a function to compute an approximation to the product between the Jacobian matrix

J(t, y) and a vector v. This may be performed internally using a diﬀerence-quotient approximation,

or it may be supplied by the user by calling one of the following two functions:

IDASetJacTimesB

Call flag = IDASetJacTimesB(ida mem, which, jsetupB, jtimesB);

Description The function IDASetJacTimesB speciﬁes the Jacobian-vector setup and product func-

tions to be used.

Arguments ida mem (void *) pointer to the idas memory block.

which (int) the identiﬁer of the backward problem.

jtsetupB (IDALsJacTimesSetupFnB) user-deﬁned function to set up the Jacobian-vector

product. Pass NULL if no setup is necessary.

jtimesB (IDALsJacTimesVecFnB) user-deﬁned Jacobian-vector product function.

Return value The return value flag (of type int) is one of:

IDALS SUCCESS The optional value has been successfully set.

IDALS MEM NULL The ida mem memory block pointer was NULL.

136 Using IDAS for Adjoint Sensitivity Analysis

IDALS LMEM NULL The idals linear solver has not been initialized.

IDALS NO ADJ The function IDAAdjInit has not been previously called.

IDALS ILL INPUT The parameter which represented an invalid identiﬁer.

Notes The function types IDALsJacTimesVecFnB and IDALsJacTimesSetupFnB are described

in §6.3.6.

The previous routine IDASpilsSetJacTimesB is now a wrapper for this routine, and

may still be used for backward-compatibility. However, this will be deprecated in future

releases, so we recommend that users transition to the new routine name soon.

IDASetJacTimesBS

Call flag = IDASetJacTimesBS(ida mem, which, jsetupBS, jtimesBS);

Description The function IDASetJacTimesBS speciﬁes the Jacobian-vector product setup and eval-

uation functions to be used, in the case where the backward problem depends on the

forward sensitivities.

Arguments ida mem (void *) pointer to the idas memory block.

which (int) the identiﬁer of the backward problem.

jtsetupBS (IDALsJacTimesSetupFnBS) user-deﬁned function to set up the Jacobian-

vector product. Pass NULL if no setup is necessary.

jtimesBS (IDALsJacTimesVecFnBS) user-deﬁned Jacobian-vector product function.

Return value The return value flag (of type int) is one of:

IDALS SUCCESS The optional value has been successfully set.

IDALS MEM NULL The ida mem memory block pointer was NULL.

IDALS LMEM NULL The idals linear solver has not been initialized.

IDALS NO ADJ The function IDAAdjInit has not been previously called.

IDALS ILL INPUT The parameter which represented an invalid identiﬁer.

Notes The function types IDALsJacTimesVecFnBS and IDALsJacTimesSetupFnBS are described

in §6.3.6.

The previous routine IDASpilsSetJacTimesBS is now a wrapper for this routine, and

may still be used for backward-compatibility. However, this will be deprecated in future

releases, so we recommend that users transition to the new routine name soon.

Alternately, when using the default diﬀerence-quotient approximation to the Jacobian-vector product

for the backward problem, the user may specify the factor to use in setting increments for the ﬁnite-

diﬀerence approximation, via a call to IDASetIncrementFactorB:

IDASetIncrementFactorB

Call flag = IDASetIncrementFactorB(ida mem, which, dqincfacB);

Description The function IDASetIncrementFactorB speciﬁes the factor in the increments used in the

diﬀerence quotient approximations to matrix-vector products for the backward problem.

This routine can be used in both the cases wherethe backward problem does and does

not depend on the forward sensitvities.

Arguments ida mem (void *) pointer to the idas memory block.

which (int) the identiﬁer of the backward problem.

dqincfacB (realtype) diﬀerence quotient approximation factor.

Return value The return value flag (of type int) is one of

IDALS SUCCESS The optional value has been successfully set.

IDALS MEM NULL The ida mem pointer is NULL.

6.2 User-callable functions for adjoint sensitivity analysis 137

IDALS LMEM NULL The idals linear solver has not been initialized.

IDALS NO ADJ The function IDAAdjInit has not been previously called.

IDALS ILL INPUT The value of eplifacB is negative.

IDALS ILL INPUT The parameter which represented an invalid identiﬁer.

Notes The default value is 1.0.

The previous routine IDASpilsSetIncrementFactorB is now a wrapper for this routine,

and may still be used for backward-compatibility. However, this will be deprecated in

future releases, so we recommend that users transition to the new routine name soon.

When using an iterative linear solver for the backward problem, the user may supply a preconditioning

operator to aid in solution of the system, or she/he may adjust the convergence tolerance factor for

the iterative linear solver. These may be accomplished through calling the following functions:

IDASetPreconditionerB

Call flag = IDASetPreconditionerB(ida mem, which, psetupB, psolveB);

Description The function IDASetPrecSolveFnB speciﬁes the preconditioner setup and solve func-

tions for the backward integration.

Arguments ida mem (void *) pointer to the idas memory block.

which (int) the identiﬁer of the backward problem.

psetupB (IDALsPrecSetupFnB) user-deﬁned preconditioner setup function.

psolveB (IDALsPrecSolveFnB) user-deﬁned preconditioner solve function.

Return value The return value flag (of type int) is one of:

IDALS SUCCESS The optional value has been successfully set.

IDALS MEM NULL The ida mem memory block pointer was NULL.

IDALS LMEM NULL The idals linear solver has not been initialized.

IDALS NO ADJ The function IDAAdjInit has not been previously called.

IDALS ILL INPUT The parameter which represented an invalid identiﬁer.

Notes The function types IDALsPrecSolveFnB and IDALsPrecSetupFnB are described in §6.3.8

and §6.3.9, respectively. The psetupB argument may be NULL if no setup operation is

involved in the preconditioner.

The previous routine IDASpilsSetPreconditionerB is now a wrapper for this routine,

and may still be used for backward-compatibility. However, this will be deprecated in

future releases, so we recommend that users transition to the new routine name soon.

IDASetPreconditionerBS

Call flag = IDASetPreconditionerBS(ida mem, which, psetupBS, psolveBS);

Description The function IDASetPrecSolveFnBS speciﬁes the preconditioner setup and solve func-

tions for the backward integration, in the case where the backward problem depends on

the forward sensitivities.

Arguments ida mem (void *) pointer to the idas memory block.

which (int) the identiﬁer of the backward problem.

psetupBS (IDALsPrecSetupFnBS) user-deﬁned preconditioner setup function.

psolveBS (IDALsPrecSolveFnBS) user-deﬁned preconditioner solve function.

Return value The return value flag (of type int) is one of:

IDALS SUCCESS The optional value has been successfully set.

IDALS MEM NULL The ida mem memory block pointer was NULL.

IDALS LMEM NULL The idals linear solver has not been initialized.

138 Using IDAS for Adjoint Sensitivity Analysis

IDALS NO ADJ The function IDAAdjInit has not been previously called.

IDALS ILL INPUT The parameter which represented an invalid identiﬁer.

Notes The function types IDALsPrecSolveFnBS and IDALsPrecSetupFnBS are described in

§6.3.8 and §6.3.9, respectively. The psetupBS argument may be NULL if no setup oper-

ation is involved in the preconditioner.

The previous routine IDASpilsSetPreconditionerBS is now a wrapper for this routine,

and may still be used for backward-compatibility. However, this will be deprecated in

future releases, so we recommend that users transition to the new routine name soon.

IDASetEpsLinB

Call flag = IDASetEpsLinB(ida mem, which, eplifacB);

Description The function IDASetEpsLinB speciﬁes the factor by which the Krylov linear solver’s

convergence test constant is reduced from the nonlinear iteration test constant. (See

§2.1). This routine can be used in both the cases wherethe backward problem does and

does not depend on the forward sensitvities.

Arguments ida mem (void *) pointer to the idas memory block.

which (int) the identiﬁer of the backward problem.

eplifacB (realtype) linear convergence safety factor (>= 0.0).

Return value The return value flag (of type int) is one of

IDALS SUCCESS The optional value has been successfully set.

IDALS MEM NULL The ida mem pointer is NULL.

IDALS LMEM NULL The idals linear solver has not been initialized.

IDALS NO ADJ The function IDAAdjInit has not been previously called.

IDALS ILL INPUT The value of eplifacB is negative.

IDALS ILL INPUT The parameter which represented an invalid identiﬁer.

Notes The default value is 0.05.

Passing a value eplifacB= 0.0 also indicates using the default value.

The previous routine IDASpilsSetEpsLinB is now a wrapper for this routine, and may

still be used for backward-compatibility. However, this will be deprecated in future

releases, so we recommend that users transition to the new routine name soon.

6.2.10 Optional output functions for the backward problem

6.2.10.1 Main solver optional output functions

The user of the adjoint module in idas has access to any of the optional output functions described

in §4.5.10, both for the main solver and for the linear solver modules. The ﬁrst argument of these

IDAGet* and IDA*Get* functions is the pointer to the idas memory block for the backward problem.

In order to call any of these functions, the user must ﬁrst call the following function to obtain this

pointer:

IDAGetAdjIDABmem

Call ida memB = IDAGetAdjIDABmem(ida mem, which);

Description The function IDAGetAdjIDABmem returns a pointer to the idas memory block for the

backward problem.

Arguments ida mem (void *) pointer to the idas memory block created by IDACreate.

which (int) the identiﬁer of the backward problem.

6.2 User-callable functions for adjoint sensitivity analysis 139

Return value The return value, ida memB (of type void *), is a pointer to the idas memory for the

backward problem.

Notes The user should not modify ida memB in any way.

Optional output calls should pass ida memB as the ﬁrst argument; thus, for example, to

get the number of integration steps: flag = IDAGetNumSteps(idas memB,&nsteps).

To get values of the forward solution during a backward integration, use the following function.

The input value of twould typically be equal to that at which the backward solution has just been

obtained with IDAGetB. In any case, it must be within the last checkpoint interval used by IDASolveB.

IDAGetAdjY

Call flag = IDAGetAdjY(ida mem, t, y, yp);

Description The function IDAGetAdjY returns the interpolated value of the forward solution yand

its derivative during a backward integration.

Arguments ida mem (void *) pointer to the idas memory block created by IDACreate.

t(realtype) value of the independent variable at which yis desired (input).

y(N Vector) forward solution y(t).

yp (N Vector) forward solution derivative ˙y(t).

Return value The return value flag (of type int) is one of:

IDA SUCCESS IDAGetAdjY was successful.

IDA MEM NULL ida mem was NULL.

IDA GETY BADT The value of twas outside the current checkpoint interval.

Notes The user must allocate space for yand yp.

IDAGetAdjCheckPointsInfo

Call flag = IDAGetAdjCheckPointsInfo(ida mem, IDAadjCheckPointRec *ckpnt);

Description The function IDAGetAdjCheckPointsInfo loads an array of ncheck+1 records of type

IDAadjCheckPointRec. The user must allocate space for the array ckpnt.

Arguments ida mem (void *) pointer to the idas memory block created by IDACreate.

ckpnt (IDAadjCheckPointRec *) array of ncheck+1 checkpoint records, each of type

IDAadjCheckPointRec.

Return value The return value is IDA SUCCESS if successful, or IDA MEM NULL if ida mem is NULL, or

IDA NO ADJ if ASA was not initialized.

Notes The members of each record ckpnt[i] are:

•ckpnt[i].my addr (void *) address of current checkpoint in ida mem->ida adj mem

•ckpnt[i].next addr (void *) address of next checkpoint

•ckpnt[i].t0 (realtype) start of checkpoint interval

•ckpnt[i].t1 (realtype) end of checkpoint interval

•ckpnt[i].nstep (long int) step counter at ckeckpoint t0

•ckpnt[i].order (int) method order at checkpoint t0

•ckpnt[i].step (realtype) step size at checkpoint t0

6.2.10.2 Initial condition calculation optional output function

140 Using IDAS for Adjoint Sensitivity Analysis

IDAGetConsistentICB

Call flag = IDAGetConsistentICB(ida mem, which, yB0 mod, ypB0 mod);

Description The function IDAGetConsistentICB returns the corrected initial conditions for back-

ward problem calculated by IDACalcICB.

Arguments ida mem (void *) pointer to the idas memory block.

which is the identiﬁer of the backward problem.

yB0 mod (N Vector) consistent initial vector.

ypB0 mod (N Vector) consistent initial derivative vector.

Return value The return value flag (of type int) is one of

IDA SUCCESS The optional output value has been successfully set.

IDA MEM NULL The ida mem pointer is NULL.

IDA NO ADJ IDAAdjInit has not been previously called.

IDA ILL INPUT Parameter which did not refer a valid backward problem identiﬁer.

Notes If the consistent solution vector or consistent derivative vector is not desired, pass NULL

for the corresponding argument.

The user must allocate space for yB0 mod and ypB0 mod (if not NULL).

6.2.11 Backward integration of quadrature equations

Not only the backward problem but also the backward quadrature equations may or may not depend on

the forward sensitivities. Accordingly, one of the IDAQuadInitB or IDAQuadInitBS should be used to

allocate internal memory and to initialize backward quadratures. For any other operation (extraction,

optional input/output, reinitialization, deallocation), the same function is called regardless of whether

or not the quadratures are sensitivity-dependent.

6.2.11.1 Backward quadrature initialization functions

The function IDAQuadInitB initializes and allocates memory for the backward integration of quadra-

ture equations that do not depende on forward sensititvities. It has the following form:

IDAQuadInitB

Call flag = IDAQuadInitB(ida mem, which, rhsQB, yQB0);

Description The function IDAQuadInitB provides required problem speciﬁcations, allocates internal

memory, and initializes backward quadrature integration.

Arguments ida mem (void *) pointer to the idas memory block.

which (int) the identiﬁer of the backward problem.

rhsQB (IDAQuadRhsFnB) is the Cfunction which computes f QB, the residual of the

backward quadrature equations. This function has the form rhsQB(t, y, yp,

yB, ypB, rhsvalBQ, user dataB) (see §6.3.3).

yQB0 (N Vector) is the value of the quadrature variables at tB0.

Return value The return value flag (of type int) will be one of the following:

IDA SUCCESS The call to IDAQuadInitB was successful.

IDA MEM NULL ida mem was NULL.

IDA NO ADJ The function IDAAdjInit has not been previously called.

IDA MEM FAIL A memory allocation request has failed.

IDA ILL INPUT The parameter which is an invalid identiﬁer.

The function IDAQuadInitBS initializes and allocates memory for the backward integration of

quadrature equations that depend on the forward sensitivities.

6.2 User-callable functions for adjoint sensitivity analysis 141

IDAQuadInitBS

Call flag = IDAQuadInitBS(ida mem, which, rhsQBS, yQBS0);

Description The function IDAQuadInitBS provides required problem speciﬁcations, allocates internal

memory, and initializes backward quadrature integration.

Arguments ida mem (void *) pointer to the idas memory block.

which (int) the identiﬁer of the backward problem.

rhsQBS (IDAQuadRhsFnBS) is the Cfunction which computes f QBS, the residual of

the backward quadrature equations. This function has the form rhsQBS(t,

y, yp, yS, ypS, yB, ypB, rhsvalBQS, user dataB) (see §6.3.4).

yQBS0 (N Vector) is the value of the sensitivity-dependent quadrature variables at

tB0.

Return value The return value flag (of type int) will be one of the following:

IDA SUCCESS The call to IDAQuadInitBS was successful.

IDA MEM NULL ida mem was NULL.

IDA NO ADJ The function IDAAdjInit has not been previously called.

IDA MEM FAIL A memory allocation request has failed.

IDA ILL INPUT The parameter which is an invalid identiﬁer.

The integration of quadrature equations during the backward phase can be re-initialized by calling

the following function. Before calling IDAQuadReInitB for a new backward problem, call any desired

solution extraction functions IDAGet** associated with the previous backward problem.

IDAQuadReInitB

Call flag = IDAQuadReInitB(ida mem, which, yQB0);

Description The function IDAQuadReInitB re-initializes the backward quadrature integration.

Arguments ida mem (void *) pointer to the idas memory block.

which (int) the identiﬁer of the backward problem.

yQB0 (N Vector) is the value of the quadrature variables at tB0.

Return value The return value flag (of type int) will be one of the following:

IDA SUCCESS The call to IDAQuadReInitB was successful.

IDA MEM NULL ida mem was NULL.

IDA NO ADJ The function IDAAdjInit has not been previously called.

IDA MEM FAIL A memory allocation request has failed.

IDA NO QUAD Quadrature integration was not activated through a previous call to

IDAQuadInitB.

IDA ILL INPUT The parameter which is an invalid identiﬁer.

Notes IDAQuadReInitB can be used after a call to either IDAQuadInitB or IDAQuadInitBS.

6.2.11.2 Backward quadrature extraction function

To extract the values of the quadrature variables at the last return time of IDASolveB,idas provides

a wrapper for the function IDAGetQuad (see §4.7.3). The call to this function has the form

IDAGetQuadB

Call flag = IDAGetQuadB(ida mem, which, &tret, yQB);

Description The function IDAGetQuadB returns the quadrature solution vector after a successful

return from IDASolveB.

Arguments ida mem (void *) pointer to the idas memory.

142 Using IDAS for Adjoint Sensitivity Analysis

tret (realtype) the time reached by the solver (output).

yQB (N Vector) the computed quadrature vector.

Return value

Notes T

he user must allocate space for yQB. The return value flag of IDAGetQuadB is one of:

IDA SUCCESS IDAGetQuadB was successful.

IDA MEM NULL ida mem is NULL.

IDA NO ADJ The function IDAAdjInit has not been previously called.

IDA NO QUAD Quadrature integration was not initialized.

IDA BAD DKY yQB was NULL.

IDA ILL INPUT The parameter which is an invalid identiﬁer.

6.2.11.3 Optional input/output functions for backward quadrature integration

Optional values controlling the backward integration of quadrature equations can be changed from

their default values through calls to one of the following functions which are wrappers for the corre-

sponding optional input functions deﬁned in §4.7.4. The user must specify the identiﬁer which of the

backward problem for which the optional values are speciﬁed.

flag = IDASetQuadErrConB(ida_mem, which, errconQ);

flag = IDAQuadSStolerancesB(ida_mem, which, reltolQ, abstolQ);

flag = IDAQuadSVtolerancesB(ida_mem, which, reltolQ, abstolQ);

Their return value flag (of type int) can have any of the return values of its counterparts, but it

can also be IDA NO ADJ if the function IDAAdjInit has not been previously called or IDA ILL INPUT

if the parameter which was an invalid identiﬁer.

Access to optional outputs related to backward quadrature integration can be obtained by calling

the corresponding IDAGetQuad* functions (see §4.7.5). A pointer ida memB to the idas memory block

for the backward problem, required as the ﬁrst argument of these functions, can be obtained through

a call to the functions IDAGetAdjIDABmem (see §6.2.10).

6.3 User-supplied functions for adjoint sensitivity analysis

In addition to the required DAE residual function and any optional functions for the forward problem,

when using the adjoint sensitivity module in idas, the user must supply one function deﬁning the

backward problem DAE and, optionally, functions to supply Jacobian-related information and one or

two functions that deﬁne the preconditioner (if applicable for the choice of sunlinsol object) for the

backward problem. Type deﬁnitions for all these user-supplied functions are given below.

6.3.1 DAE residual for the backward problem

The user must provide a resB function of type IDAResFnB deﬁned as follows:

IDAResFnB

Deﬁnition typedef int (*IDAResFnB)(realtype t, N Vector y, N Vector yp,

N Vector yB, N Vector ypB,

N Vector resvalB, void *user dataB);

Purpose This function evaluates the residual of the backward problem DAE system. This could

be (2.20) or (2.25).

Arguments tis the current value of the independent variable.

6.3 User-supplied functions for adjoint sensitivity analysis 143

yis the current value of the forward solution vector.

yp is the current value of the forward solution derivative vector.

yB is the current value of the backward dependent variable vector.

ypB is the current value of the backward dependent derivative vector.

resvalB is the output vector containing the residual for the backward DAE problem.

user dataB is a pointer to user data, same as passed to IDASetUserDataB.

Return value An IDAResFnB should return 0 if successful, a positive value if a recoverable error oc-

curred (in which case idas will attempt to correct), or a negative value if an unre-

coverabl failure occurred (in which case the integration stops and IDASolveB returns

IDA RESFUNC FAIL).

Notes Allocation of memory for resvalB is handled within idas.

The y,yp,yB,ypB, and resvalB arguments are all of type N Vector, but yB,ypB, and

resvalB typically have diﬀerent internal representations from yand yp. It is the user’s

responsibility to access the vector data consistently (including the use of the correct

accessor macros from each nvector implementation). For the sake of computational

eﬃciency, the vector functions in the two nvector implementations provided with idas

do not perform any consistency checks with respect to their N Vector arguments (see

§7.2 and §7.3).

The user dataB pointer is passed to the user’s resB function every time it is called and

can be the same as the user data pointer used for the forward problem.

Before calling the user’s resB function, idas needs to evaluate (through interpolation)

the values of the states from the forward integration. If an error occurs in the inter-

polation, idas triggers an unrecoverable failure in the residual function which will halt

the integration and IDASolveB will return IDA RESFUNC FAIL.

6.3.2 DAE residual for the backward problem depending on the forward

sensitivities

The user must provide a resBS function of type IDAResFnBS deﬁned as follows:

IDAResFnBS

Deﬁnition typedef int (*IDAResFnBS)(realtype t, N Vector y, N Vector yp,

N Vector *yS, N Vector *ypS,

N Vector yB, N Vector ypB,

N Vector resvalB, void *user dataB);

Purpose This function evaluates the residual of the backward problem DAE system. This could

be (2.20) or (2.25).

Arguments tis the current value of the independent variable.

yis the current value of the forward solution vector.

yp is the current value of the forward solution derivative vector.

yS a pointer to an array of Ns vectors containing the sensitivities of the forward

solution.

ypS a pointer to an array of Ns vectors containing the derivatives of the forward

sensitivities.

yB is the current value of the backward dependent variable vector.

ypB is the current value of the backward dependent derivative vector.

resvalB is the output vector containing the residual for the backward DAE problem.

user dataB is a pointer to user data, same as passed to IDASetUserDataB.

144 Using IDAS for Adjoint Sensitivity Analysis

Return value An IDAResFnBS should return 0 if successful, a positive value if a recoverable error

occurred (in which case idas will attempt to correct), or a negative value if an unre-

coverable error occurred (in which case the integration stops and IDASolveB returns

IDA RESFUNC FAIL).

Notes Allocation of memory for resvalB is handled within idas.

The y,yp,yB,ypB, and resvalB arguments are all of type N Vector, but yB,ypB,

and resvalB typically have diﬀerent internal representations from yand yp. Likewise

for each yS[i] and ypS[i]. It is the user’s responsibility to access the vector data

consistently (including the use of the correct accessor macros from each nvector im-

plementation). For the sake of computational eﬃciency, the vector functions in the two

nvector implementations provided with idas do not perform any consistency checks

with respect to their N Vector arguments (see §7.2 and §7.3).

The user dataB pointer is passed to the user’s resBS function every time it is called

and can be the same as the user data pointer used for the forward problem.

Before calling the user’s resBS function, idas needs to evaluate (through interpolation)

the values of the states from the forward integration. If an error occurs in the inter-

polation, idas triggers an unrecoverable failure in the residual function which will halt

the integration and IDASolveB will return IDA RESFUNC FAIL.

6.3.3 Quadrature right-hand side for the backward problem

The user must provide an fQB function of type IDAQuadRhsFnB deﬁned by

IDAQuadRhsFnB

Deﬁnition typedef int (*IDAQuadRhsFnB)(realtype t, N Vector y, N Vector yp,

N Vector yB, N Vector ypB,

N Vector rhsvalBQ, void *user dataB);

Purpose This function computes the quadrature equation right-hand side for the backward prob-

lem.

Arguments tis the current value of the independent variable.

yis the current value of the forward solution vector.

yp is the current value of the forward solution derivative vector.

yB is the current value of the backward dependent variable vector.

ypB is the current value of the backward dependent derivative vector.

rhsvalBQ is the output vector containing the residual for the backward quadrature

equations.

user dataB is a pointer to user data, same as passed to IDASetUserDataB.

Return value An IDAQuadRhsFnB should return 0 if successful, a positive value if a recoverable er-

ror occurred (in which case idas will attempt to correct), or a negative value if it

failed unrecoverably (in which case the integration is halted and IDASolveB returns

IDA QRHSFUNC FAIL).

Notes Allocation of memory for rhsvalBQ is handled within idas.

The y,yp,yB,ypB, and rhsvalBQ arguments are all of type N Vector, but they typi-

cally all have diﬀerent internal representations. It is the user’s responsibility to access

the vector data consistently (including the use of the correct accessor macros from each

nvector implementation). For the sake of computational eﬃciency, the vector func-

tions in the two nvector implementations provided with idas do not perform any

consistency checks with repsect to their N Vector arguments (see §7.2 and §7.3).

The user dataB pointer is passed to the user’s fQB function every time it is called and

can be the same as the user data pointer used for the forward problem.

6.3 User-supplied functions for adjoint sensitivity analysis 145

Before calling the user’s fQB function, idas needs to evaluate (through interpolation) the

values of the states from the forward integration. If an error occurs in the interpolation,

idas triggers an unrecoverable failure in the quadrature right-hand side function which

will halt the integration and IDASolveB will return IDA QRHSFUNC FAIL.

6.3.4 Sensitivity-dependent quadrature right-hand side for the backward

problem

The user must provide an fQBS function of type IDAQuadRhsFnBS deﬁned by

IDAQuadRhsFnBS

Deﬁnition typedef int (*IDAQuadRhsFnBS)(realtype t, N Vector y, N Vector yp,

N Vector *yS, N Vector *ypS,

N Vector yB, N Vector ypB,

N Vector rhsvalBQS, void *user dataB);

Purpose This function computes the quadrature equation residual for the backward problem.

Arguments tis the current value of the independent variable.

yis the current value of the forward solution vector.

yp is the current value of the forward solution derivative vector.

yS a pointer to an array of Ns vectors containing the sensitivities of the forward

solution.

ypS a pointer to an array of Ns vectors containing the derivatives of the forward

sensitivities.

yB is the current value of the backward dependent variable vector.

ypB is the current value of the backward dependent derivative vector.

rhsvalBQS is the output vector containing the residual for the backward quadrature

equations.

user dataB is a pointer to user data, same as passed to IDASetUserDataB.

Return value An IDAQuadRhsFnBS should return 0 if successful, a positive value if a recoverable er-

ror occurred (in which case idas will attempt to correct), or a negative value if it

failed unrecoverably (in which case the integration is halted and IDASolveB returns

IDA QRHSFUNC FAIL).

Notes Allocation of memory for rhsvalBQS is handled within idas.

The y,yp,yB,ypB, and rhsvalBQS arguments are all of type N Vector, but they typically

do not all have the same internal representations. Likewise for each yS[i] and ypS[i].

It is the user’s responsibility to access the vector data consistently (including the use

of the correct accessor macros from each nvector implementation). For the sake

of computational eﬃciency, the vector functions in the two nvector implementations

provided with idas do not perform any consistency checks with repsect to their N Vector

arguments (see §7.2 and §7.3).

The user dataB pointer is passed to the user’s fQBS function every time it is called and

can be the same as the user data pointer used for the forward problem.

Before calling the user’s fQBS function, idas needs to evaluate (through interpolation)

the values of the states from the forward integration. If an error occurs in the interpo-

lation, idas triggers an unrecoverable failure in the quadrature right-hand side function

which will halt the integration and IDASolveB will return IDA QRHSFUNC FAIL.

146 Using IDAS for Adjoint Sensitivity Analysis

6.3.5 Jacobian construction for the backward problem (matrix-based lin-

ear solvers)

If a matrix-based linear solver module is is used for the backward problem (i.e., IDASetLinearSolverB

is called with non-NULL sunmatrix argument in the step described in §6.1), the user may provide a

function of type IDALsJacFnB or IDALsJacFnBS (see §6.2.9), deﬁned as follows:

IDALsJacFnB

Deﬁnition typedef int (*IDALsJacFnB)(realtype tt, realtype cjB,

N Vector yy, N Vector yp,

N Vector yB, N Vector ypB,

N Vector resvalB,

SUNMatrix JacB, void *user dataB,

N Vector tmp1B, N Vector tmp2B,

N Vector tmp3B);

Purpose This function computes the Jacobian of the backward problem (or an approximation to

it).

Arguments tt is the current value of the independent variable.

cjB is the scalar in the system Jacobian, proportional to the inverse of the step

size (αin Eq. (2.6) ).

yy is the current value of the forward solution vector.

yp is the current value of the forward solution derivative vector.

yB is the current value of the backward dependent variable vector.

ypB is the current value of the backward dependent derivative vector.

resvalB is the current value of the residual for the backward problem.

JacB is the output approximate Jacobian matrix.

user dataB is a pointer to user data — the parameter passed to IDASetUserDataB.

tmp1B

tmp2B

tmp3B are pointers to memory allocated for variables of type N Vector which can

be used by the IDALsJacFnB function as temporary storage or work space.

Return value An IDALsJacFnB should return 0 if successful, a positive value if a recoverable error

occurred (in which case idas will attempt to correct, while idals sets last flag to

IDALS JACFUNC RECVR), or a negative value if it failed unrecoverably (in which case the

integration is halted, IDASolveB returns IDA LSETUP FAIL and idals sets last flag to

IDALS JACFUNC UNRECVR).

Notes A user-supplied Jacobian function must load the matrix JacB with an approximation

to the Jacobian matrix at the point (tt,yy,yB), where yy is the solution of the original

IVP at time tt, and yB is the solution of the backward problem at the same time.

Information regarding the structure of the speciﬁc sunmatrix structure (e.g. number

of rows, upper/lower bandwidth, sparsity type) may be obtained through using the

implementation-speciﬁc sunmatrix interface functions (see Chapter 8for details). Only

nonzero elements need to be loaded into JacB as this matrix is set to zero before the

call to the Jacobian function.

Before calling the user’s IDALsJacFnB,idas needs to evaluate (through interpolation)

the values of the states from the forward integration. If an error occurs in the in-

terpolation, idas triggers an unrecoverable failure in the Jacobian function which will

halt the integration (IDASolveB returns IDA LSETUP FAIL and idals sets last flag to

IDALS JACFUNC UNRECVR).

6.3 User-supplied functions for adjoint sensitivity analysis 147

The previous function type IDADlsJacFnB is identical to IDALsJacFnB, and may still

be used for backward-compatibility. However, this will be deprecated in future releases,

so we recommend that users transition to the new function type name soon.

IDALsJacFnBS

Deﬁnition typedef int (*IDALsJacFnBS)(realtype tt, realtype cjB,

N Vector yy, N Vector yp,

N Vector *yS, N Vector *ypS,

N Vector yB, N Vector ypB,

N Vector resvalB,

SUNMatrix JacB, void *user dataB,

N Vector tmp1B, N Vector tmp2B,

N Vector tmp3B);

Purpose This function computes the Jacobian of the backward problem (or an approximation to

it), in the case where the backward problem depends on the forward sensitivities.

Arguments tt is the current value of the independent variable.

cjB is the scalar in the system Jacobian, proportional to the inverse of the step

size (αin Eq. (2.6) ).

yy is the current value of the forward solution vector.

yp is the current value of the forward solution derivative vector.

yS a pointer to an array of Ns vectors containing the sensitivities of the forward

solution.

ypS a pointer to an array of Ns vectors containing the derivatives of the forward

solution sensitivities.

yB is the current value of the backward dependent variable vector.

ypB is the current value of the backward dependent derivative vector.

resvalB is the current value of the residual for the backward problem.

JacB is the output approximate Jacobian matrix.

user dataB is a pointer to user data — the parameter passed to IDASetUserDataB.

tmp1B

tmp2B

tmp3B are pointers to memory allocated for variables of type N Vector which can

be used by IDALsJacFnBS as temporary storage or work space.

Return value An IDALsJacFnBS should return 0 if successful, a positive value if a recoverable error

occurred (in which case idas will attempt to correct, while idals sets last flag to

IDALS JACFUNC RECVR), or a negative value if it failed unrecoverably (in which case the

integration is halted, IDASolveB returns IDA LSETUP FAIL and idals sets last flag to

IDALS JACFUNC UNRECVR).

Notes A user-supplied dense Jacobian function must load the matrix JacB with an approxima-

tion to the Jacobian matrix at the point (tt,yy,yS,yB), where yy is the solution of the

original IVP at time tt,yS is the array of forward sensitivities at time tt, and yB is the

solution of the backward problem at the same time. Information regarding the struc-

ture of the speciﬁc sunmatrix structure (e.g. number of rows, upper/lower bandwidth,

sparsity type) may be obtained through using the implementation-speciﬁc sunmatrix

interface functions (see Chapter 8for details). Only nonzero elements need to be loaded

into JacB as this matrix is set to zero before the call to the Jacobian function.

Before calling the user’s IDALsJacFnBS,idas needs to evaluate (through interpolation)

the values of the states from the forward integration. If an error occurs in the in-

terpolation, idas triggers an unrecoverable failure in the Jacobian function which will

148 Using IDAS for Adjoint Sensitivity Analysis

halt the integration (IDASolveB returns IDA LSETUP FAIL and idals sets last flag to

IDALS JACFUNC UNRECVR).

The previous function type IDADlsJacFnBS is identical to IDALsJacFnBS, and may still

be used for backward-compatibility. However, this will be deprecated in future releases,

so we recommend that users transition to the new function type name soon.

6.3.6 Jacobian-vector product for the backward problem (matrix-free lin-

ear solvers)

If a matrix-free linear solver is selected for the backward problem (i.e., IDASetLinearSolverB is

called with NULL-valued sunmatrix argument in the steps described in §6.1), the user may provide a

function of type

IDALsJacTimesVecFnB or IDALsJacTimesVecFnBS in the following form, to compute matrix-vector

products Jv. If such a function is not supplied, the default is a diﬀerence quotient approximation to

these products.

IDALsJacTimesVecFnB

Deﬁnition typedef int (*IDALsJacTimesVecFnB)(realtype t,

N Vector yy, N Vector yp,

N Vector yB, N Vector ypB,

N Vector resvalB,

N Vector vB, N Vector JvB,

realtype cjB, void *user dataB,

N Vector tmp1B, N Vector tmp2B);

Purpose This function computes the action of the backward problem Jacobian JB on a given

vector vB.

Arguments tis the current value of the independent variable.

yy is the current value of the forward solution vector.

yp is the current value of the forward solution derivative vector.

yB is the current value of the backward dependent variable vector.

ypB is the current value of the backward dependent derivative vector.

resvalB is the current value of the residual for the backward problem.

vB is the vector by which the Jacobian must be multiplied.

JvB is the computed output vector, JB*vB.

cjB is the scalar in the system Jacobian, proportional to the inverse of the step

size (αin Eq. (2.6) ).

user dataB is a pointer to user data — the same as the user dataB parameter passed

to IDASetUserDataB.

tmp1B

tmp2B are pointers to memory allocated for variables of type N Vector which can

be used by IDALsJacTimesVecFnB as temporary storage or work space.

Return value The return value of a function of type IDALsJtimesVecFnB should be 0 if successful or

nonzero if an error was encountered, in which case the integration is halted.

Notes A user-supplied Jacobian-vector product function must load the vector JvB with the

product of the Jacobian of the backward problem at the point (t,y,yB) and the vector

vB. Here, yis the solution of the original IVP at time tand yB is the solution of the

backward problem at the same time. The rest of the arguments are equivalent to those

passed to a function of type IDALsJacTimesVecFn (see §4.6.6). If the backward problem

is the adjoint of ˙y=f(t, y), then this function is to compute −(∂f/∂y)TvB.

6.3 User-supplied functions for adjoint sensitivity analysis 149

The previous function type IDASpilsJacTimesVecFnB is identical to

IDALsJacTimesVecFnB, and may still be used for backward-compatibility. However,

this will be deprecated in future releases, so we recommend that users transition to the

new function type name soon.

IDALsJacTimesVecFnBS

Deﬁnition typedef int (*IDALsJacTimesVecFnBS)(realtype t,

N Vector yy, N Vector yp,

N Vector *yyS, N Vector *ypS,

N Vector yB, N Vector ypB,

N Vector resvalB,

N Vector vB, N Vector JvB,

realtype cjB, void *user dataB,

N Vector tmp1B, N Vector tmp2B);

Purpose This function computes the action of the backward problem Jacobian JB on a given

vector vB, in the case where the backward problem depends on the forward sensitivities.

Arguments tis the current value of the independent variable.

yy is the current value of the forward solution vector.

yp is the current value of the forward solution derivative vector.

yyS a pointer to an array of Ns vectors containing the sensitivities of the forward

solution.

ypS a pointer to an array of Ns vectors containing the derivatives of the forward

sensitivities.

yB is the current value of the backward dependent variable vector.

ypB is the current value of the backward dependent derivative vector.

resvalB is the current value of the residual for the backward problem.

vB is the vector by which the Jacobian must be multiplied.

JvB is the computed output vector, JB*vB.

cjB is the scalar in the system Jacobian, proportional to the inverse of the step

size (αin Eq. (2.6) ).

user dataB is a pointer to user data — the same as the user dataB parameter passed

to IDASetUserDataB.

tmp1B

tmp2B are pointers to memory allocated for variables of type N Vector which can

be used by IDALsJacTimesVecFnBS as temporary storage or work space.

Return value The return value of a function of type IDALsJtimesVecFnBS should be 0 if successful

or nonzero if an error was encountered, in which case the integration is halted.

Notes A user-supplied Jacobian-vector product function must load the vector JvB with the

product of the Jacobian of the backward problem at the point (t,y,yB) and the vector

vB. Here, yis the solution of the original IVP at time tand yB is the solution of the

backward problem at the same time. The rest of the arguments are equivalent to those

passed to a function of type IDALsJacTimesVecFn (see §4.6.6).

The previous function type IDASpilsJacTimesVecFnBS is identical to

IDALsJacTimesVecFnBS, and may still be used for backward-compatibility. However,

this will be deprecated in future releases, so we recommend that users transition to the

new function type name soon.

150 Using IDAS for Adjoint Sensitivity Analysis

6.3.7 Jacobian-vector product setup for the backward problem (matrix-

free linear solvers)

If the user’s Jacobian-times-vector requires that any Jacobian-related data be preprocessed or eval-

uated, then this needs to be done in a user-supplied function of type IDALsJacTimesSetupFnB or

IDALsJacTimesSetupFnBS, deﬁned as follows:

IDALsJacTimesSetupFnB

Deﬁnition typedef int (*IDALsJacTimesSetupFnB)(realtype tt,

N Vector yy, N Vector yp,

N Vector yB, N Vector ypB,

N Vector resvalB,

realtype cjB, void *user dataB);

Purpose This function preprocesses and/or evaluates Jacobian data needed by the Jacobian-

times-vector routine for the backward problem.

Arguments tt is the current value of the independent variable.

yy is the current value of the dependent variable vector, y(t).

yp is the current value of ˙y(t).

yB is the current value of the backward dependent variable vector.

ypB is the current value of the backward dependent derivative vector.

resvalB is the current value of the residual for the backward problem.

cjB is the scalar in the system Jacobian, proportional to the inverse of the step

size (αin Eq. (2.6) ).

user dataB is a pointer to user data — the same as the user dataB parameter passed

to IDASetUserDataB.

Return value The value returned by the Jacobian-vector setup function should be 0 if successful,

positive for a recoverable error (in which case the step will be retried), or negative for

an unrecoverable error (in which case the integration is halted).

Notes Each call to the Jacobian-vector setup function is preceded by a call to the backward

problem residual user function with the same (t,y, yp, yB, ypB) arguments. Thus,

the setup function can use any auxiliary data that is computed and saved during the

evaluation of the DAE residual.

If the user’s IDALsJacTimesVecFnB function uses diﬀerence quotient approximations, it

may need to access quantities not in the call list. These include the current stepsize,

the error weights, etc. To obtain these, the user will need to add a pointer to ida mem

to user dataB and then use the IDAGet* functions described in §4.5.10.2. The unit

roundoﬀ can be accessed as UNIT ROUNDOFF deﬁned in sundials types.h.

The previous function type IDASpilsJacTimesSetupFnB is identical to

IDALsJacTimesSetupFnB, and may still be used for backward-compatibility. However,

this will be deprecated in future releases, so we recommend that users transition to the

new function type name soon.

IDALsJacTimesSetupFnBS

Deﬁnition typedef int (*IDALsJacTimesSetupFnBS)(realtype tt,

N Vector yy, N Vector yp,

N Vector *yyS, N Vector *ypS,

N Vector yB, N Vector ypB,

N Vector resvalB,

realtype cjB, void *user dataB);

6.3 User-supplied functions for adjoint sensitivity analysis 151

Purpose This function preprocesses and/or evaluates Jacobian data needed by the Jacobian-

times-vector routine for the backward problem, in the case that the backward problem

depends on the forward sensitivities.

Arguments tt is the current value of the independent variable.

yy is the current value of the dependent variable vector, y(t).

yp is the current value of ˙y(t).

yyS a pointer to an array of Ns vectors containing the sensitivities of the forward

solution.

ypS a pointer to an array of Ns vectors containing the derivatives of the forward

sensitivities.

yB is the current value of the backward dependent variable vector.

ypB is the current value of the backward dependent derivative vector.

resvalB is the current value of the residual for the backward problem.

cjB is the scalar in the system Jacobian, proportional to the inverse of the step

size (αin Eq. (2.6) ).

user dataB is a pointer to user data — the same as the user dataB parameter passed

to IDASetUserDataB.

Return value The value returned by the Jacobian-vector setup function should be 0 if successful,

positive for a recoverable error (in which case the step will be retried), or negative for

an unrecoverable error (in which case the integration is halted).

Notes Each call to the Jacobian-vector setup function is preceded by a call to the backward

problem residual user function with the same (t,y, yp, yyS, ypS, yB, ypB) argu-

ments. Thus, the setup function can use any auxiliary data that is computed and saved

during the evaluation of the DAE residual.

If the user’s IDALsJacTimesVecFnB function uses diﬀerence quotient approximations, it

may need to access quantities not in the call list. These include the current stepsize,

the error weights, etc. To obtain these, the user will need to add a pointer to ida mem

to user dataB and then use the IDAGet* functions described in §4.5.10.2. The unit

roundoﬀ can be accessed as UNIT ROUNDOFF deﬁned in sundials types.h.

The previous function type IDASpilsJacTimesSetupFnBS is identical to

IDALsJacTimesSetupFnBS, and may still be used for backward-compatibility. However,

this will be deprecated in future releases, so we recommend that users transition to the

new function type name soon.

6.3.8 Preconditioner solve for the backward problem (iterative linear solvers)

If preconditioning is used during integration of the backward problem, then the user must provide a

function to solve the linear system P z =r, where Pis a left preconditioner matrix. This function

must have one of the following two forms:

IDALsPrecSolveFnB

Deﬁnition typedef int (*IDALsPrecSolveFnB)(realtype t,

N Vector yy, N Vector yp,

N Vector yB, N Vector ypB,

N Vector resvalB,

N Vector rvecB, N Vector zvecB,

realtype cjB, realtype deltaB,

void *user dataB);

Purpose This function solves the preconditioning system P z =rfor the backward problem.

Arguments tis the current value of the independent variable.

152 Using IDAS for Adjoint Sensitivity Analysis

yy is the current value of the forward solution vector.

yp is the current value of the forward solution derivative vector.

yB is the current value of the backward dependent variable vector.

ypB is the current value of the backward dependent derivative vector.

resvalB is the current value of the residual for the backward problem.

rvecB is the right-hand side vector rof the linear system to be solved.

zvecB is the computed output vector.

cjB is the scalar in the system Jacobian, proportional to the inverse of the step

size (αin Eq. (2.6) ).

deltaB is an input tolerance to be used if an iterative method is employed in the

solution.

user dataB is a pointer to user data — the same as the user dataB parameter passed

to the function IDASetUserDataB.

Return value The return value of a preconditioner solve function for the backward problem should be

0 if successful, positive for a recoverable error (in which case the step will be retried),

or negative for an unrecoverable error (in which case the integration is halted).

Notes The previous function type IDASpilsPrecSolveFnB is identical to IDALsPrecSolveFnB,

and may still be used for backward-compatibility. However, this will be deprecated in

future releases, so we recommend that users transition to the new function type name

soon.

IDALsPrecSolveFnBS

Deﬁnition typedef int (*IDALsPrecSolveFnBS)(realtype t,

N Vector yy, N Vector yp,

N Vector *yyS, N Vector *ypS,

N Vector yB, N Vector ypB,

N Vector resvalB,

N Vector rvecB, N Vector zvecB,

realtype cjB, realtype deltaB,

void *user dataB);

Purpose This function solves the preconditioning system P z =rfor the backward problem, for

the case in which the backward problem depends on the forward sensitivities.

Arguments tis the current value of the independent variable.

yy is the current value of the forward solution vector.

yp is the current value of the forward solution derivative vector.

yyS a pointer to an array of Ns vectors containing the sensitivities of the forward

solution.

ypS a pointer to an array of Ns vectors containing the derivatives of the forward

sensitivities.

yB is the current value of the backward dependent variable vector.

ypB is the current value of the backward dependent derivative vector.

resvalB is the current value of the residual for the backward problem.

rvecB is the right-hand side vector rof the linear system to be solved.

zvecB is the computed output vector.

cjB is the scalar in the system Jacobian, proportional to the inverse of the step

size (αin Eq. (2.6) ).

deltaB is an input tolerance to be used if an iterative method is employed in the

solution.

6.3 User-supplied functions for adjoint sensitivity analysis 153

user dataB is a pointer to user data — the same as the user dataB parameter passed

to the function IDASetUserDataB.

Return value The return value of a preconditioner solve function for the backward problem should be

0 if successful, positive for a recoverable error (in which case the step will be retried),

or negative for an unrecoverable error (in which case the integration is halted).

Notes The previous function type IDASpilsPrecSolveFnBS is identical to IDALsPrecSolveFnBS,

and may still be used for backward-compatibility. However, this will be deprecated in

future releases, so we recommend that users transition to the new function type name

soon.

6.3.9 Preconditioner setup for the backward problem (iterative linear solvers)

If the user’s preconditioner requires that any Jacobian-related data be preprocessed or evaluated, then

this needs to be done in a user-supplied function of one of the following two types:

IDALsPrecSetupFnB

Deﬁnition typedef int (*IDALsPrecSetupFnB)(realtype t,

N Vector yy, N Vector yp,

N Vector yB, N Vector ypB,

N Vector resvalB,

realtype cjB, void *user dataB);

Purpose This function preprocesses and/or evaluates Jacobian-related data needed by the pre-

conditioner for the backward problem.

Arguments The arguments of an IDALsPrecSetupFnB are as follows:

tis the current value of the independent variable.

yy is the current value of the forward solution vector.

yp is the current value of the forward solution vector.

yB is the current value of the backward dependent variable vector.

ypB is the current value of the backward dependent derivative vector.

resvalB is the current value of the residual for the backward problem.

cjB is the scalar in the system Jacobian, proportional to the inverse of the step

size (αin Eq. (2.6) ).

user dataB is a pointer to user data — the same as the user dataB parameter passed

to the function IDASetUserDataB.

Return value The return value of a preconditioner setup function for the backward problem should

be 0 if successful, positive for a recoverable error (in which case the step will be retried),

or negative for an unrecoverable error (in which case the integration is halted).

Notes The previous function type IDASpilsPrecSetupFnB is identical to IDALsPrecSetupFnB,

and may still be used for backward-compatibility. However, this will be deprecated in

future releases, so we recommend that users transition to the new function type name

soon.

IDALsPrecSetupFnBS

Deﬁnition typedef int (*IDALsPrecSetupFnBS)(realtype t,

N Vector yy, N Vector yp,

N Vector *yyS, N Vector *ypS,

N Vector yB, N Vector ypB,

N Vector resvalB,

realtype cjB, void *user dataB);

154 Using IDAS for Adjoint Sensitivity Analysis

Purpose This function preprocesses and/or evaluates Jacobian-related data needed by the pre-

conditioner for the backward problem, in the case where the backward problem depends

on the forward sensitivities.

Arguments The arguments of an IDALsPrecSetupFnBS are as follows:

tis the current value of the independent variable.

yy is the current value of the forward solution vector.

yp is the current value of the forward solution vector.

yyS a pointer to an array of Ns vectors containing the sensitivities of the forward

solution.

ypS a pointer to an array of Ns vectors containing the derivatives of the forward

sensitivities.

yB is the current value of the backward dependent variable vector.

ypB is the current value of the backward dependent derivative vector.

resvalB is the current value of the residual for the backward problem.

cjB is the scalar in the system Jacobian, proportional to the inverse of the step

size (αin Eq. (2.6) ).

user dataB is a pointer to user data — the same as the user dataB parameter passed

to the function IDASetUserDataB.

Return value The return value of a preconditioner setup function for the backward problem should

be 0 if successful, positive for a recoverable error (in which case the step will be retried),

or negative for an unrecoverable error (in which case the integration is halted).

Notes The previous function type IDASpilsPrecSetupFnBS is identical to IDALsPrecSetupFnBS,

and may still be used for backward-compatibility. However, this will be deprecated in

future releases, so we recommend that users transition to the new function type name

soon.

6.4 Using the band-block-diagonal preconditioner for back-

ward problems

As on the forward integration phase, the eﬃciency of Krylov iterative methods for the solution of

linear systems can be greatly enhanced through preconditioning. The band-block-diagonal precondi-

tioner module idabbdpre, provides interface functions through which it can be used on the backward

integration phase.

The adjoint module in idas oﬀers an interface to the band-block-diagonal preconditioner module

idabbdpre described in section §4.8. This generates a preconditioner that is a block-diagonal matrix

with each block being a band matrix and can be used with one of the Krylov linear solvers and with

the MPI-parallel vector module nvector parallel.

In order to use the idabbdpre module in the solution of the backward problem, the user must

deﬁne one or two additional functions, described at the end of this section.

6.4.1 Usage of IDABBDPRE for the backward problem

The idabbdpre module is initialized by calling the following function, after an iterative linear solver

for the backward problem has been attached to idas by calling IDASetLinearSolverB (see §6.2.6).

IDABBDPrecInitB

Call flag = IDABBDPrecInitB(ida mem, which, NlocalB, mudqB, mldqB,

mukeepB, mlkeepB, dqrelyB, GresB, GcommB);

Description The function IDABBDPrecInitB initializes and allocates memory for the idabbdpre

preconditioner for the backward problem.

6.4 Using the band-block-diagonal preconditioner for backward problems 155

Arguments ida mem (void *) pointer to the idas memory block.

which (int) the identiﬁer of the backward problem.

NlocalB (sunindextype) local vector dimension for the backward problem.

mudqB (sunindextype) upper half-bandwidth to be used in the diﬀerence-quotient

Jacobian approximation.

mldqB (sunindextype) lower half-bandwidth to be used in the diﬀerence-quotient

Jacobian approximation.

mukeepB (sunindextype) upper half-bandwidth of the retained banded approximate

Jacobian block.

mlkeepB (sunindextype) lower half-bandwidth of the retained banded approximate

Jacobian block.

dqrelyB (realtype) the relative increment in components of yB used in the diﬀerence

quotient approximations. The default is dqrelyB=√unit roundoﬀ, which can

be speciﬁed by passing dqrely= 0.0.

GresB (IDABBDLocalFnB) the Cfunction which computes GB(t, y, ˙y, yB,˙yB), the func-

tion approximating the residual of the backward problem.

GcommB (IDABBDCommFnB) the optional Cfunction which performs all interprocess com-

munication required for the computation of GB.

Return value If successful, IDABBDPrecInitB creates, allocates, and stores (internally in the idas

solver block) a pointer to the newly created idabbdpre memory block. The return

value flag (of type int) is one of:

IDALS SUCCESS The call to IDABBDPrecInitB was successful.

IDALS MEM FAIL A memory allocation request has failed.

IDALS MEM NULL The ida mem argument was NULL.

IDALS LMEM NULL No linear solver has been attached.

IDALS ILL INPUT An invalid parameter has been passed.

To reinitialize the idabbdpre preconditioner module for the backward problem, possibly with a change

in mudqB,mldqB, or dqrelyB, call the following function:

IDABBDPrecReInitB

Call flag = IDABBDPrecReInitB(ida mem, which, mudqB, mldqB, dqrelyB);

Description The function IDABBDPrecReInitB reinitializes the idabbdpre preconditioner for the

backward problem.

Arguments ida mem (void *) pointer to the idas memory block returned by IDACreate.

which (int) the identiﬁer of the backward problem.

mudqB (sunindextype) upper half-bandwidth to be used in the diﬀerence-quotient

Jacobian approximation.

mldqB (sunindextype) lower half-bandwidth to be used in the diﬀerence-quotient

Jacobian approximation.

dqrelyB (realtype) the relative increment in components of yB used in the diﬀerence

quotient approximations.

Return value The return value flag (of type int) is one of:

IDALS SUCCESS The call to IDABBDPrecReInitB was successful.

IDALS MEM FAIL A memory allocation request has failed.

IDALS MEM NULL The ida mem argument was NULL.

IDALS PMEM NULL The IDABBDPrecInitB has not been previously called.

IDALS LMEM NULL No linear solver has been attached.

IDALS ILL INPUT An invalid parameter has been passed.

For more details on idabbdpre see §4.8.

156 Using IDAS for Adjoint Sensitivity Analysis

6.4.2 User-supplied functions for IDABBDPRE

To use the idabbdpre module, the user must supply one or two functions which the module calls

to construct the preconditioner: a required function GresB (of type IDABBDLocalFnB) which approxi-

mates the residual of the backward problem and which is computed locally, and an optional function

GcommB (of type IDABBDCommFnB) which performs all interprocess communication necessary to evaluate

this approximate residual (see §4.8). The prototypes for these two functions are described below.

IDABBDLocalFnB

Deﬁnition typedef int (*IDABBDLocalFnB)(sunindextype NlocalB, realtype t,

N Vector y, N Vector yp,

N Vector yB, N Vector ypB,

N Vector gB, void *user dataB);

Purpose This GresB function loads the vector gB, an approximation to the residual of the back-

ward problem, as a function of t,y,yp, and yB and ypB.

Arguments NlocalB is the local vector length for the backward problem.

tis the value of the independent variable.

yis the current value of the forward solution vector.

yp is the current value of the forward solution derivative vector.

yB is the current value of the backward dependent variable vector.

ypB is the current value of the backward dependent derivative vector.

gB is the output vector, GB(t, y, ˙y, yB,˙yB).

user dataB is a pointer to user data — the same as the user dataB parameter passed

to IDASetUserDataB.

Return value An IDABBDLocalFnB should return 0 if successful, a positive value if a recoverable er-

ror occurred (in which case idas will attempt to correct), or a negative value if it

failed unrecoverably (in which case the integration is halted and IDASolveB returns

IDA LSETUP FAIL).

Notes This routine must assume that all interprocess communication of data needed to calcu-

late gB has already been done, and this data is accessible within user dataB.

Before calling the user’s IDABBDLocalFnB,idas needs to evaluate (through interpola-

tion) the values of the states from the forward integration. If an error occurs in the

interpolation, idas triggers an unrecoverable failure in the preconditioner setup function

which will halt the integration (IDASolveB returns IDA LSETUP FAIL).

IDABBDCommFnB

Deﬁnition typedef int (*IDABBDCommFnB)(sunindextype NlocalB, realtype t,

N Vector y, N Vector yp,

N Vector yB, N Vector ypB,

void *user dataB);

Purpose This GcommB function performs all interprocess communications necessary for the exe-

cution of the GresB function above, using the input vectors y,yp,yB and ypB.

Arguments NlocalB is the local vector length.

tis the value of the independent variable.

yis the current value of the forward solution vector.

yp is the current value of the forward solution derivative vector.

yB is the current value of the backward dependent variable vector.

ypB is the current value of the backward dependent derivative vector.

6.4 Using the band-block-diagonal preconditioner for backward problems 157

user dataB is a pointer to user data — the same as the user dataB parameter passed

to IDASetUserDataB.

Return value An IDABBDCommFnB should return 0 if successful, a positive value if a recoverable er-

ror occurred (in which case idas will attempt to correct), or a negative value if it

failed unrecoverably (in which case the integration is halted and IDASolveB returns

IDA LSETUP FAIL).

Notes The GcommB function is expected to save communicated data in space deﬁned within

the structure user dataB.

Each call to the GcommB function is preceded by a call to the function that evaluates the

residual of the backward problem with the same t,y,yp,yB and ypB arguments. If there

is no additional communication needed, then pass GcommB =NULL to IDABBDPrecInitB.

Chapter 7

Description of the NVECTOR

module

The sundials solvers are written in a data-independent manner. They all operate on generic vec-

tors (of type N Vector) through a set of operations deﬁned by the particular nvector implemen-

tation. Users can provide their own speciﬁc implementation of the nvector module, or use one of

the implementations provided with sundials. The generic operations are described below and the

implementations provided with sundials are described in the following sections.

The generic N Vector type is a pointer to a structure that has an implementation-dependent

content ﬁeld containing the description and actual data of the vector, and an ops ﬁeld pointing to a

structure with generic vector operations. The type N Vector is deﬁned as

typedef struct _generic_N_Vector *N_Vector;

struct _generic_N_Vector {

void *content;

struct _generic_N_Vector_Ops *ops;

};

The generic N Vector Ops structure is essentially a list of pointers to the various actual vector

operations, and is deﬁned as

struct _generic_N_Vector_Ops {

N_Vector_ID (*nvgetvectorid)(N_Vector);

N_Vector (*nvclone)(N_Vector);

N_Vector (*nvcloneempty)(N_Vector);

void (*nvdestroy)(N_Vector);

void (*nvspace)(N_Vector, sunindextype *, sunindextype *);

realtype* (*nvgetarraypointer)(N_Vector);

void (*nvsetarraypointer)(realtype *, N_Vector);

void (*nvlinearsum)(realtype, N_Vector, realtype, N_Vector, N_Vector);

void (*nvconst)(realtype, N_Vector);

void (*nvprod)(N_Vector, N_Vector, N_Vector);

void (*nvdiv)(N_Vector, N_Vector, N_Vector);

void (*nvscale)(realtype, N_Vector, N_Vector);

void (*nvabs)(N_Vector, N_Vector);

void (*nvinv)(N_Vector, N_Vector);

void (*nvaddconst)(N_Vector, realtype, N_Vector);

realtype (*nvdotprod)(N_Vector, N_Vector);

realtype (*nvmaxnorm)(N_Vector);

realtype (*nvwrmsnorm)(N_Vector, N_Vector);

160 Description of the NVECTOR module

realtype (*nvwrmsnormmask)(N_Vector, N_Vector, N_Vector);

realtype (*nvmin)(N_Vector);

realtype (*nvwl2norm)(N_Vector, N_Vector);

realtype (*nvl1norm)(N_Vector);

void (*nvcompare)(realtype, N_Vector, N_Vector);

booleantype (*nvinvtest)(N_Vector, N_Vector);

booleantype (*nvconstrmask)(N_Vector, N_Vector, N_Vector);

realtype (*nvminquotient)(N_Vector, N_Vector);

int (*nvlinearcombination)(int, realtype*, N_Vector*, N_Vector);

int (*nvscaleaddmulti)(int, realtype*, N_Vector, N_Vector*, N_Vector*);

int (*nvdotprodmulti)(int, N_Vector, N_Vector*, realtype*);

int (*nvlinearsumvectorarray)(int, realtype, N_Vector*, realtype,

N_Vector*, N_Vector*);

int (*nvscalevectorarray)(int, realtype*, N_Vector*, N_Vector*);

int (*nvconstvectorarray)(int, realtype, N_Vector*);

int (*nvwrmsnomrvectorarray)(int, N_Vector*, N_Vector*, realtype*);

int (*nvwrmsnomrmaskvectorarray)(int, N_Vector*, N_Vector*, N_Vector,

realtype*);

int (*nvscaleaddmultivectorarray)(int, int, realtype*, N_Vector*,

N_Vector**, N_Vector**);

int (*nvlinearcombinationvectorarray)(int, int, realtype*, N_Vector**,

N_Vector*);

};

The generic nvector module deﬁnes and implements the vector operations acting on an N Vector.

These routines are nothing but wrappers for the vector operations deﬁned by a particular nvector

implementation, which are accessed through the ops ﬁeld of the N Vector structure. To illustrate

this point we show below the implementation of a typical vector operation from the generic nvector

module, namely N VScale, which performs the scaling of a vector xby a scalar c:

void N_VScale(realtype c, N_Vector x, N_Vector z)

{

z->ops->nvscale(c, x, z);

}

Table 7.2 contains a complete list of all standard vector operations deﬁned by the generic nvector

module. Tables 7.3 and 7.4 list optional fused and vector array operations respectively.

Fused and vector array operations are intended to increase data reuse, reduce parallel commu-

nication on distributed memory systems, and lower the number of kernel launches on systems with

accelerators. If a particular nvector implementation deﬁnes a fused or vector array operation as

NULL, the generic nvector module will automatically call standard vector operations as necessary

to complete the desired operation. Currently, all fused and vector array operations are disabled by

default however, sundials provided nvector implementations deﬁne additional user-callable func-

tions to enable/disable any or all of the fused and vector array operations. See the following sections

for the implementation speciﬁc functions to enable/disable operations.

Finally, note that the generic nvector module deﬁnes the functions N VCloneVectorArray and

N VCloneVectorArrayEmpty. Both functions create (by cloning) an array of count variables of type

N Vector, each of the same type as an existing N Vector. Their prototypes are

N_Vector *N_VCloneVectorArray(int count, N_Vector w);

N_Vector *N_VCloneVectorArrayEmpty(int count, N_Vector w);

and their deﬁnitions are based on the implementation-speciﬁc N VClone and N VCloneEmpty opera-

tions, respectively.

An array of variables of type N Vector can be destroyed by calling N VDestroyVectorArray, whose

prototype is

161

Table 7.1: Vector Identiﬁcations associated with vector kernels supplied with sundials.

Vector ID Vector type ID Value

SUNDIALS NVEC SERIAL Serial 0

SUNDIALS NVEC PARALLEL Distributed memory parallel (MPI) 1

SUNDIALS NVEC OPENMP OpenMP shared memory parallel 2

SUNDIALS NVEC PTHREADS PThreads shared memory parallel 3

SUNDIALS NVEC PARHYP hypre ParHyp parallel vector 4

SUNDIALS NVEC PETSC petsc parallel vector 5

SUNDIALS NVEC OPENMPDEV OpenMP shared memory parallel with device oﬄoading 6

SUNDIALS NVEC CUSTOM User-provided custom vector 7

void N_VDestroyVectorArray(N_Vector *vs, int count);

and whose deﬁnition is based on the implementation-speciﬁc N VDestroy operation.

A particular implementation of the nvector module must:

•Specify the content ﬁeld of N Vector.

•Deﬁne and implement the vector operations. Note that the names of these routines should be

unique to that implementation in order to permit using more than one nvector module (each

with diﬀerent N Vector internal data representations) in the same code.

•Deﬁne and implement user-callable constructor and destructor routines to create and free an

N Vector with the new content ﬁeld and with ops pointing to the new vector operations.

•Optionally, deﬁne and implement additional user-callable routines acting on the newly deﬁned

N Vector (e.g., a routine to print the content for debugging purposes).

•Optionally, provide accessor macros as needed for that particular implementation to be used to

access diﬀerent parts in the content ﬁeld of the newly deﬁned N Vector.

Each nvector implementation included in sundials has a unique identiﬁer speciﬁed in enumer-

ation and shown in Table 7.1. It is recommended that a user-supplied nvector implementation use

the SUNDIALS NVEC CUSTOM identiﬁer.

162 Description of the NVECTOR module

Table 7.2: Description of the NVECTOR operations

Name Usage and Description

N VGetVectorID id = N VGetVectorID(w);

Returns the vector type identiﬁer for the vector w. It is used to determine

the vector implementation type (e.g. serial, parallel,. . . ) from the abstract

N Vector interface. Returned values are given in Table 7.1.

N VClone v = N VClone(w);

Creates a new N Vector of the same type as an existing vector wand sets

the ops ﬁeld. It does not copy the vector, but rather allocates storage for

the new vector.

N VCloneEmpty v = N VCloneEmpty(w);

Creates a new N Vector of the same type as an existing vector wand sets

the ops ﬁeld. It does not allocate storage for data.

N VDestroy N VDestroy(v);

Destroys the N Vector v and frees memory allocated for its internal data.

N VSpace N VSpace(nvSpec, &lrw, &liw);

Returns storage requirements for one N Vector.lrw contains the number

of realtype words and liw contains the number of integer words. This

function is advisory only, for use in determining a user’s total space re-

quirements; it could be a dummy function in a user-supplied nvector

module if that information is not of interest.

N VGetArrayPointer vdata = N VGetArrayPointer(v);

Returns a pointer to a realtype array from the N Vector v. Note that

this assumes that the internal data in N Vector is a contiguous array of

realtype. This routine is only used in the solver-speciﬁc interfaces to the

dense and banded (serial) linear solvers, the sparse linear solvers (serial

and threaded), and in the interfaces to the banded (serial) and band-block-

diagonal (parallel) preconditioner modules provided with sundials.

N VSetArrayPointer N VSetArrayPointer(vdata, v);

Overwrites the data in an N Vector with a given array of realtype. Note

that this assumes that the internal data in N Vector is a contiguous array

of realtype. This routine is only used in the interfaces to the dense

(serial) linear solver, hence need not exist in a user-supplied nvector

module for a parallel environment.

continued on next page

163

continued from last page

Name Usage and Description

N VLinearSum N VLinearSum(a, x, b, y, z);

Performs the operation z=ax +by, where aand bare realtype scalars

and xand yare of type N Vector:zi=axi+byi, i = 0, . . . , n −1.

N VConst N VConst(c, z);

Sets all components of the N Vector z to realtype c:zi=c, i = 0, . . . , n−

N VProd N VProd(x, y, z);

Sets the N Vector z to be the component-wise product of the N Vector

inputs xand y:zi=xiyi, i = 0, . . . , n −1.

N VDiv N VDiv(x, y, z);

Sets the N Vector z to be the component-wise ratio of the N Vector inputs

xand y:zi=xi/yi, i = 0, . . . , n −1. The yimay not be tested for 0

values. It should only be called with a ythat is guaranteed to have all

nonzero components.

N VScale N VScale(c, x, z);

Scales the N Vector x by the realtype scalar cand returns the result in

z:zi=cxi, i = 0, . . . , n −1.

N VAbs N VAbs(x, z);

Sets the components of the N Vector z to be the absolute values of the

components of the N Vector x:yi=|xi|, i = 0, . . . , n −1.

N VInv N VInv(x, z);

Sets the components of the N Vector z to be the inverses of the compo-

nents of the N Vector x:zi= 1.0/xi, i = 0, . . . , n −1. This routine may

not check for division by 0. It should be called only with an xwhich is

guaranteed to have all nonzero components.

N VAddConst N VAddConst(x, b, z);

Adds the realtype scalar bto all components of xand returns the result

in the N Vector z:zi=xi+b, i = 0, . . . , n −1.

N VDotProd d = N VDotProd(x, y);

Returns the value of the ordinary dot product of xand y:d=Pn−1

i=0 xiyi.

N VMaxNorm m = N VMaxNorm(x);

Returns the maximum norm of the N Vector x:m= maxi|xi|.

continued on next page

164 Description of the NVECTOR module

continued from last page

Name Usage and Description

N VWrmsNorm m = N VWrmsNorm(x, w)

Returns the weighted root-mean-square norm of the N Vector x with

realtype weight vector w:m=rPn−1

i=0 (xiwi)2/n.

N VWrmsNormMask m = N VWrmsNormMask(x, w, id);

Returns the weighted root mean square norm of the N Vector x with

realtype weight vector wbuilt using only the elements of xcorresponding

to positive elements of the N Vector id:

m=rPn−1

i=0 (xiwiH(idi))2/n, where H(α) = (1α > 0

0α≤0

N VMin m = N VMin(x);

Returns the smallest element of the N Vector x:m= minixi.

N VWL2Norm m = N VWL2Norm(x, w);

Returns the weighted Euclidean `2norm of the N Vector x with realtype

weight vector w:m=qPn−1

i=0 (xiwi)2.

N VL1Norm m = N VL1Norm(x);

Returns the `1norm of the N Vector x:m=Pn−1

i=0 |xi|.

N VCompare N VCompare(c, x, z);

Compares the components of the N Vector x to the realtype scalar c

and returns an N Vector z such that: zi= 1.0 if |xi| ≥ cand zi= 0.0

otherwise.

N VInvTest t = N VInvTest(x, z);

Sets the components of the N Vector z to be the inverses of the compo-

nents of the N Vector x, with prior testing for zero values: zi= 1.0/xi, i =

0, . . . , n −1. This routine returns a boolean assigned to SUNTRUE if all

components of xare nonzero (successful inversion) and returns SUNFALSE

otherwise.

N VConstrMask t = N VConstrMask(c, x, m);

Performs the following constraint tests: xi>0 if ci= 2, xi≥0 if ci= 1,

xi≤0 if ci=−1, xi<0 if ci=−2. There is no constraint on xiif ci= 0.

This routine returns a boolean assigned to SUNFALSE if any element failed

the constraint test and assigned to SUNTRUE if all passed. It also sets a

mask vector m, with elements equal to 1.0 where the constraint test failed,

and 0.0 where the test passed. This routine is used only for constraint

checking.

continued on next page

165

continued from last page

Name Usage and Description

N VMinQuotient minq = N VMinQuotient(num, denom);

This routine returns the minimum of the quotients obtained by term-wise

dividing numiby denomi. A zero element in denom will be skipped. If no

such quotients are found, then the large value BIG REAL (deﬁned in the

header ﬁle sundials types.h) is returned.

Table 7.3: Description of the NVECTOR fused operations

Name Usage and Description

N VLinearCombination ier = N VLinearCombination(nv, c, X, z);

This routine computes the linear combination of nvvectors with n

elements:

zi=

nv−1

j=0

cjxj,i, i = 0, . . . , n −1,

where cis an array of nvscalars (type realtype*), Xis an array of nv

vectors (type N Vector*), and zis the output vector (type N Vector).

If the output vector zis one of the vectors in X, then it must be the

ﬁrst vector in the vector array. The operation returns 0for success and

a non-zero value otherwise.

N VScaleAddMulti ier = N VScaleAddMulti(nv, c, x, Y, Z);

This routine scales and adds one vector to nvvectors with nelements:

zj,i =cjxi+yj,i, j = 0, . . . , nv−1i= 0, . . . , n −1,

where cis an array of nvscalars (type realtype*), xis the vector (type

N Vector) to be scaled and added to each vector in the vector array

of nvvectors Y(type N Vector*), and Z(type N Vector*) is a vector

array of nvoutput vectors. The operation returns 0for success and a

non-zero value otherwise.

continued on next page

166 Description of the NVECTOR module

continued from last page

Name Usage and Description

N VDotProdMulti ier = N VDotProdMulti(nv, x, Y, d);

This routine computes the dot product of a vector with nvother vectors:

dj=

n−1

i=0

xiyj,i, j = 0, . . . , nv−1,

where d(type realtype*) is an array of nvscalars containing the dot

products of the vector x(type N Vector) with each of the nvvectors

in the vector array Y(type N Vector*). The operation returns 0for

success and a non-zero value otherwise.

Table 7.4: Description of the NVECTOR vector array operations

Name Usage and Description

N VLinearSumVectorArray ier = N VLinearSumVectorArray(nv, a, X, b, Y,

Z);

This routine comuptes the linear sum of two vector arrays

containing nvvectors of nelements:

zj,i =axj,i +byj,i, i = 0, . . . , n −1j= 0, . . . , nv−1,

where aand bare realtype scalars and X,Y, and Z

are arrays of nvvectors (type N Vector*). The operation

returns 0for success and a non-zero value otherwise.

N VScaleVectorArray ier = N VScaleVectorArray(nv, c, X, Z);

This routine scales each vector of nelements in a vector

array of nvvectors by a potentially diﬀerent constant:

zj,i =cjxj,i, i = 0, . . . , n −1j= 0, . . . , nv−1,

where cis an array of nvscalars (type realtype*) and

Xand Zare arrays of nvvectors (type N Vector*).

The operation returns 0for success and a non-zero value

otherwise.

continued on next page

167

continued from last page

Name Usage and Description

N VConstVectorArray ier = N VConstVectorArray(nv, c, X);

This routine sets each element in a vector of nelements

in a vector array of nvvectors to the same value:

zj,i =c, i = 0, . . . , n −1j= 0, . . . , nv−1,

where cis a realtype scalar and Xis an array of nv

vectors (type N Vector*). The operation returns 0for

success and a non-zero value otherwise.

N VWrmsNormVectorArray ier = N VWrmsNormVectorArray(nv, X, W, m);

This routine computes the weighted root mean square

norm of nvvectors with nelements:

mj= 1

n−1

i=0

(xj,iwj,i)2!1/2

, j = 0, . . . , nv−1,

where m(type realtype*) contains the nvnorms of the

vectors in the vector array X(type N Vector*) with corre-

sponding weight vectors W(type N Vector*). The opera-

tion returns 0for success and a non-zero value otherwise.

N VWrmsNormMaskVectorArray ier = N VWrmsNormMaskVectorArray(nv, X, W, id,

m);

This routine computes the masked weighted root mean

square norm of nvvectors with nelements:

mj= 1

n−1

i=0

(xj,iwj,iH(idi))2!1/2

, j = 0, . . . , nv−1,

H(idi) = 1 for idi>0 and is zero otherwise, m(type

realtype*) contains the nvnorms of the vectors in

the vector array X(type NVector*) with corresponding

weight vectors W(type N Vector*) and mask vector id

(type N Vector). The operation returns 0for success and

a non-zero value otherwise.

continued on next page

168 Description of the NVECTOR module

continued from last page

Name Usage and Description

N VScaleAddMultiVectorArray ier = N VScaleAddMultiVectorArray(nv, ns, c, X,

YY, ZZ);

This routine scales and adds a vector in a vector array of

nvvectors to the corresponding vector in nsvector arrays:

zj,i =

ns−1

k=0

ckxk,j,i, i = 0, . . . , n −1j= 0, . . . , nv−1,

where cis an array of nsscalars (type realtype*), X

is a vector array of nvvectors (type idN Vector*) to be

scaled and added to the corresponding vector in each of

the nsvector arrays in the array of vector arrays Y Y (type

N Vector**) and stored in the output array of vector ar-

rays ZZ (type N Vector**). The operation returns 0for

success and a non-zero value otherwise.

N VLinearCombinationVectorArray ier = N VLinearCombinationVectorArray(nv, ns, c,

XX, Z);

This routine computes the linear combination of nsvector

arrays containing nvvectors with nelements:

zj,i =

ns−1

k=0

ckxk,j,i, i = 0, . . . , n −1j= 0, . . . , nv−1,

where cis an array of nsscalars (type realtype*), XX

(type N Vector**) is an array of nsvector arrays each

containing nvvectors to be summed into the output vector

array of nvvectors Z(type N Vector*). If the output

vector array Zis one of the vector arrays in XX, then

it must be the ﬁrst vector array in XX. The operation

returns 0for success and a non-zero value otherwise.

7.1 NVECTOR functions used by IDAS

In Table 7.5 below, we list the vector functions used in the nvector module used by the idas package.

The table also shows, for each function, which of the code modules uses the function. The idas column

shows function usage within the main integrator module, while the remaining columns show function

usage within the idas linear solvers interface, the idabbdpre preconditioner module, and the idaa

module.

At this point, we should emphasize that the idas user does not need to know anything about the

usage of vector functions by the idas code modules in order to use idas. The information is presented

as an implementation detail for the interested reader.

Special cases (numbers match markings in table):

1. These routines are only required if an internal diﬀerence-quotient routine for constructing dense

or band Jacobian matrices is used.

2. This routine is optional, and is only used in estimating space requirements for idas modules for

user feedback.

3. The optional function N VDotProdMulti is only used when Classical Gram-Schmidt is enabled

with spgmr or spfgmr. The remaining operations from Tables 7.3 and 7.4 not listed above are

7.2 The NVECTOR SERIAL implementation 169

unused and a user-supplied nvector module for idas could omit these operations.

Of the functions listed in Table 7.2,N VWL2Norm,N VL1Norm, and N VInvTest are not used by idas.

Therefore a user-supplied nvector module for idas could omit these functions.

7.2 The NVECTOR SERIAL implementation

The serial implementation of the nvector module provided with sundials,nvector serial, deﬁnes

the content ﬁeld of N Vector to be a structure containing the length of the vector, a pointer to the

beginning of a contiguous data array, and a boolean ﬂag own data which speciﬁes the ownership of

data.

struct _N_VectorContent_Serial {

sunindextype length;

booleantype own_data;

realtype *data;

};

The header ﬁle to include when using this module is nvector serial.h. The installed module

library to link to is libsundials nvecserial.lib where .lib is typically .so for shared libraries

and .a for static libraries.

7.2.1 NVECTOR SERIAL accessor macros

The following macros are provided to access the content of an nvector serial vector. The suﬃx S

in the names denotes the serial version.

•NV CONTENT S

This routine gives access to the contents of the serial vector N Vector.

The assignment v cont =NV CONTENT S(v) sets v cont to be a pointer to the serial N Vector

content structure.

Implementation:

#define NV_CONTENT_S(v) ( (N_VectorContent_Serial)(v->content) )

•NV OWN DATA S,NV DATA S,NV LENGTH S

These macros give individual access to the parts of the content of a serial N Vector.

The assignment v data = NV DATA S(v) sets v data to be a pointer to the ﬁrst component of

the data for the N Vector v. The assignment NV DATA S(v) = v data sets the component array

of vto be v data by storing the pointer v data.

The assignment v len = NV LENGTH S(v) sets v len to be the length of v. On the other hand,

the call NV LENGTH S(v) = len v sets the length of vto be len v.

Implementation:

#define NV_OWN_DATA_S(v) ( NV_CONTENT_S(v)->own_data )

#define NV_DATA_S(v) ( NV_CONTENT_S(v)->data )

#define NV_LENGTH_S(v) ( NV_CONTENT_S(v)->length )

•NV Ith S

This macro gives access to the individual components of the data array of an N Vector.

The assignment r = NV Ith S(v,i) sets rto be the value of the i-th component of v. The

assignment NV Ith S(v,i) = r sets the value of the i-th component of vto be r.

Here iranges from 0 to n−1 for a vector of length n.

Implementation:

#define NV_Ith_S(v,i) ( NV_DATA_S(v)[i] )

170 Description of the NVECTOR module

7.2.2 NVECTOR SERIAL functions

The nvector serial module deﬁnes serial implementations of all vector operations listed in Tables

7.2,7.3, and 7.4. Their names are obtained from those in Tables 7.2,7.3, and 7.4 by appending

the suﬃx Serial (e.g. N VDestroy Serial). All the standard vector operations listed in 7.2 with

the suﬃx Serial appended are callable via the Fortran 2003 interface by prepending an ‘F’ (e.g.

FN VDestroy Serial).

The module nvector serial provides the following additional user-callable routines:

N VNew Serial

Prototype N Vector N VNew Serial(sunindextype vec length);

Description This function creates and allocates memory for a serial N Vector. Its only argument is

the vector length.

F2003 Name This function is callable as FN VNew Serial when using the Fortran 2003 interface mod-

ule.

N VNewEmpty Serial

Prototype N Vector N VNewEmpty Serial(sunindextype vec length);

Description This function creates a new serial N Vector with an empty (NULL) data array.

F2003 Name This function is callable as FN VNewEmpty Serial when using the Fortran 2003 interface

module.

N VMake Serial

Prototype N Vector N VMake Serial(sunindextype vec length, realtype *v data);

Description This function creates and allocates memory for a serial vector with user-provided data

array.

(This function does not allocate memory for v data itself.)

F2003 Name This function is callable as FN VMake Serial when using the Fortran 2003 interface

module.

N VCloneVectorArray Serial

Prototype N Vector *N VCloneVectorArray Serial(int count, N Vector w);

Description This function creates (by cloning) an array of count serial vectors.

N VCloneVectorArrayEmpty Serial

Prototype N Vector *N VCloneVectorArrayEmpty Serial(int count, N Vector w);

Description This function creates (by cloning) an array of count serial vectors, each with an empty

(NULL) data array.

N VDestroyVectorArray Serial

Prototype void N VDestroyVectorArray Serial(N Vector *vs, int count);

Description This function frees memory allocated for the array of count variables of type N Vector

created with N VCloneVectorArray Serial or with

N VCloneVectorArrayEmpty Serial.

7.2 The NVECTOR SERIAL implementation 171

N VGetLength Serial

Prototype sunindextype N VGetLength Serial(N Vector v);

Description This function returns the number of vector elements.

F2003 Name This function is callable as FN VGetLength Serial when using the Fortran 2003 interface

module.

N VPrint Serial

Prototype void N VPrint Serial(N Vector v);

Description This function prints the content of a serial vector to stdout.

F2003 Name This function is callable as FN VPrint Serial when using the Fortran 2003 interface

module.

N VPrintFile Serial

Prototype void N VPrintFile Serial(N Vector v, FILE *outfile);

Description This function prints the content of a serial vector to outfile.

By default all fused and vector array operations are disabled in the nvector serial module.

The following additional user-callable routines are provided to enable or disable fused and vector

array operations for a speciﬁc vector. To ensure consistency across vectors it is recommended to ﬁrst

create a vector with N VNew Serial, enable/disable the desired operations for that vector with the

functions below, and create any additional vectors from that vector using N VClone. This guarantees

the new vectors will have the same operations enabled/disabled as cloned vectors inherit the same

enable/disable options as the vector they are cloned from while vectors created with N VNew Serial

will have the default settings for the nvector serial module.

N VEnableFusedOps Serial

Prototype int N VEnableFusedOps Serial(N Vector v, booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) all fused and vector array op-

erations in the serial vector. The return value is 0for success and -1 if the input vector

or its ops structure are NULL.

N VEnableLinearCombination Serial

Prototype int N VEnableLinearCombination Serial(N Vector v, booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear combination fused

operation in the serial vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableScaleAddMulti Serial

Prototype int N VEnableScaleAddMulti Serial(N Vector v, booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale and add a vector to

multiple vectors fused operation in the serial vector. The return value is 0for success

and -1 if the input vector or its ops structure are NULL.

172 Description of the NVECTOR module

N VEnableDotProdMulti Serial

Prototype int N VEnableDotProdMulti Serial(N Vector v, booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) the multiple dot products fused

operation in the serial vector. The return value is 0for success and -1 if the input vector

or its ops structure are NULL.

N VEnableLinearSumVectorArray Serial

Prototype int N VEnableLinearSumVectorArray Serial(N Vector v, booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear sum operation for

vector arrays in the serial vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableScaleVectorArray Serial

Prototype int N VEnableScaleVectorArray Serial(N Vector v, booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale operation for vector

arrays in the serial vector. The return value is 0for success and -1 if the input vector

or its ops structure are NULL.

N VEnableConstVectorArray Serial

Prototype int N VEnableConstVectorArray Serial(N Vector v, booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) the const operation for vector

arrays in the serial vector. The return value is 0for success and -1 if the input vector

or its ops structure are NULL.

N VEnableWrmsNormVectorArray Serial

Prototype int N VEnableWrmsNormVectorArray Serial(N Vector v, booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) the WRMS norm operation for

vector arrays in the serial vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableWrmsNormMaskVectorArray Serial

Prototype int N VEnableWrmsNormMaskVectorArray Serial(N Vector v, booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) the masked WRMS norm op-

eration for vector arrays in the serial vector. The return value is 0for success and -1 if

the input vector or its ops structure are NULL.

N VEnableScaleAddMultiVectorArray Serial

Prototype int N VEnableScaleAddMultiVectorArray Serial(N Vector v,

booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale and add a vector array

to multiple vector arrays operation in the serial vector. The return value is 0for success

and -1 if the input vector or its ops structure are NULL.

7.3 The NVECTOR PARALLEL implementation 173

N VEnableLinearCombinationVectorArray Serial

Prototype int N VEnableLinearCombinationVectorArray Serial(N Vector v,

booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear combination operation

for vector arrays in the serial vector. The return value is 0for success and -1 if the

input vector or its ops structure are NULL.

Notes

•When looping over the components of an N Vector v, it is more eﬃcient to ﬁrst obtain the

component array via v data = NV DATA S(v) and then access v data[i] within the loop than

it is to use NV Ith S(v,i) within the loop.

•N VNewEmpty Serial,N VMake Serial, and N VCloneVectorArrayEmpty Serial set the ﬁeld

own data =SUNFALSE.N VDestroy Serial and N VDestroyVectorArray Serial will not at-

tempt to free the pointer data for any NVector with own data set to SUNFALSE. In such a case,

it is the user’s responsibility to deallocate the data pointer.

•To maximize eﬃciency, vector operations in the nvector serial implementation that have

more than one N Vector argument do not check for consistent internal representation of these

vectors. It is the user’s responsibility to ensure that such routines are called with N Vector

arguments that were all created with the same internal representations.

7.2.3 NVECTOR SERIAL Fortran interfaces

The nvector serial module provides a Fortran 2003 module as well as Fortran 77 style interface

functions for use from Fortran applications.

FORTRAN 2003 interface module

The fnvector serial mod Fortran module deﬁnes interfaces to all nvector serial C functions

using the intrinsic iso c binding module which provides a standardized mechanism for interoperat-

ing with C. As noted in the Cfunction descriptions above, the interface functions are named after

the corresponding Cfunction, but with a leading ‘F’. For example, the function N VNew Serial is

interfaced as FN VNew Serial.

The Fortran 2003 nvector serial interface module can be accessed with the use statement,

i.e. use fnvector serial mod, and linking to the library libsundials fnvectorserial mod.lib in

addition to the Clibrary. For details on where the library and module ﬁle fnvector serial mod.mod

are installed see Appendix A. We note that the module is accessible from the Fortran 2003 sundials

integrators without separately linking to the libsundials fnvectorserial mod library.

FORTRAN 77 interface functions

For solvers that include a Fortran 77 interface module, the nvector serial module also includes a

Fortran-callable function FNVINITS(code, NEQ, IER), to initialize this nvector serial module.

Here code is an input solver id (1 for cvode, 2 for ida, 3 for kinsol, 4 for arkode); NEQ is the

problem size (declared so as to match C type long int); and IER is an error return ﬂag equal 0 for

success and -1 for failure.

7.3 The NVECTOR PARALLEL implementation

The nvector parallel implementation of the nvector module provided with sundials is based on

MPI. It deﬁnes the content ﬁeld of N Vector to be a structure containing the global and local lengths

of the vector, a pointer to the beginning of a contiguous local data array, an MPI communicator, and

a boolean ﬂag own data indicating ownership of the data array data.

174 Description of the NVECTOR module

struct _N_VectorContent_Parallel {

sunindextype local_length;

sunindextype global_length;

booleantype own_data;

realtype *data;

MPI_Comm comm;

};

The header ﬁle to include when using this module is nvector parallel.h. The installed module

library to link to is libsundials nvecparallel.lib where .lib is typically .so for shared libraries

and .a for static libraries.

7.3.1 NVECTOR PARALLEL accessor macros

The following macros are provided to access the content of a nvector parallel vector. The suﬃx

Pin the names denotes the distributed memory parallel version.

•NV CONTENT P

This macro gives access to the contents of the parallel vector N Vector.

The assignment v cont = NV CONTENT P(v) sets v cont to be a pointer to the N Vector content

structure of type struct N VectorContent Parallel.

Implementation:

#define NV_CONTENT_P(v) ( (N_VectorContent_Parallel)(v->content) )

•NV OWN DATA P,NV DATA P,NV LOCLENGTH P,NV GLOBLENGTH P

These macros give individual access to the parts of the content of a parallel N Vector.

The assignment v data = NV DATA P(v) sets v data to be a pointer to the ﬁrst component of

the local data for the N Vector v. The assignment NV DATA P(v) = v data sets the component

array of vto be v data by storing the pointer v data.

The assignment v llen = NV LOCLENGTH P(v) sets v llen to be the length of the local part of

v. The call NV LENGTH P(v) = llen v sets the local length of vto be llen v.

The assignment v glen = NV GLOBLENGTH P(v) sets v glen to be the global length of the vector

v. The call NV GLOBLENGTH P(v) = glen v sets the global length of vto be glen v.

Implementation:

#define NV_OWN_DATA_P(v) ( NV_CONTENT_P(v)->own_data )

#define NV_DATA_P(v) ( NV_CONTENT_P(v)->data )

#define NV_LOCLENGTH_P(v) ( NV_CONTENT_P(v)->local_length )

#define NV_GLOBLENGTH_P(v) ( NV_CONTENT_P(v)->global_length )

•NV COMM P

This macro provides access to the MPI communicator used by the nvector parallel vectors.

Implementation:

#define NV_COMM_P(v) ( NV_CONTENT_P(v)->comm )

•NV Ith P

This macro gives access to the individual components of the local data array of an N Vector.

The assignment r = NV Ith P(v,i) sets rto be the value of the i-th component of the local

part of v. The assignment NV Ith P(v,i) = r sets the value of the i-th component of the local

part of vto be r.

Here iranges from 0 to n−1, where nis the local length.

Implementation:

#define NV_Ith_P(v,i) ( NV_DATA_P(v)[i] )

7.3 The NVECTOR PARALLEL implementation 175

7.3.2 NVECTOR PARALLEL functions

The nvector parallel module deﬁnes parallel implementations of all vector operations listed in

Tables 7.2,7.3, and 7.4. Their names are obtained from those in Tables 7.2,7.3, and 7.4 by appending

the suﬃx Parallel (e.g. N VDestroy Parallel). The module nvector parallel provides the

following additional user-callable routines:

N VNew Parallel

Prototype N Vector N VNew Parallel(MPI Comm comm, sunindextype local length,

sunindextype global length);

Description This function creates and allocates memory for a parallel vector.

N VNewEmpty Parallel

Prototype N Vector N VNewEmpty Parallel(MPI Comm comm, sunindextype local length,

sunindextype global length);

Description This function creates a new parallel N Vector with an empty (NULL) data array.

N VMake Parallel

Prototype N Vector N VMake Parallel(MPI Comm comm, sunindextype local length,

sunindextype global length, realtype *v data);

Description This function creates and allocates memory for a parallel vector with user-provided data

array. This function does not allocate memory for v data itself.

N VCloneVectorArray Parallel

Prototype N Vector *N VCloneVectorArray Parallel(int count, N Vector w);

Description This function creates (by cloning) an array of count parallel vectors.

N VCloneVectorArrayEmpty Parallel

Prototype N Vector *N VCloneVectorArrayEmpty Parallel(int count, N Vector w);

Description This function creates (by cloning) an array of count parallel vectors, each with an empty

(NULL) data array.

N VDestroyVectorArray Parallel

Prototype void N VDestroyVectorArray Parallel(N Vector *vs, int count);

Description This function frees memory allocated for the array of count variables of type N Vector

created with N VCloneVectorArray Parallel or with

N VCloneVectorArrayEmpty Parallel.

N VGetLength Parallel

Prototype sunindextype N VGetLength Parallel(N Vector v);

Description This function returns the number of vector elements (global vector length).

N VGetLocalLength Parallel

Prototype sunindextype N VGetLocalLength Parallel(N Vector v);

Description This function returns the local vector length.

176 Description of the NVECTOR module

N VPrint Parallel

Prototype void N VPrint Parallel(N Vector v);

Description This function prints the local content of a parallel vector to stdout.

N VPrintFile Parallel

Prototype void N VPrintFile Parallel(N Vector v, FILE *outfile);

Description This function prints the local content of a parallel vector to outfile.

By default all fused and vector array operations are disabled in the nvector parallel module.

The following additional user-callable routines are provided to enable or disable fused and vector

array operations for a speciﬁc vector. To ensure consistency across vectors it is recommended to ﬁrst

create a vector with N VNew Parallel, enable/disable the desired operations for that vector with the

functions below, and create any additional vectors from that vector using N VClone with that vector.

This guarantees the new vectors will have the same operations enabled/disabled as cloned vectors

inherit the same enable/disable options as the vector they are cloned from while vectors created with

N VNew Parallel will have the default settings for the nvector parallel module.

N VEnableFusedOps Parallel

Prototype int N VEnableFusedOps Parallel(N Vector v, booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) all fused and vector array oper-

ations in the parallel vector. The return value is 0for success and -1 if the input vector

or its ops structure are NULL.

N VEnableLinearCombination Parallel

Prototype int N VEnableLinearCombination Parallel(N Vector v, booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear combination fused

operation in the parallel vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableScaleAddMulti Parallel

Prototype int N VEnableScaleAddMulti Parallel(N Vector v, booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale and add a vector to

multiple vectors fused operation in the parallel vector. The return value is 0for success

and -1 if the input vector or its ops structure are NULL.

N VEnableDotProdMulti Parallel

Prototype int N VEnableDotProdMulti Parallel(N Vector v, booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) the multiple dot products fused

operation in the parallel vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableLinearSumVectorArray Parallel

Prototype int N VEnableLinearSumVectorArray Parallel(N Vector v, booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear sum operation for

vector arrays in the parallel vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

7.3 The NVECTOR PARALLEL implementation 177

N VEnableScaleVectorArray Parallel

Prototype int N VEnableScaleVectorArray Parallel(N Vector v, booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale operation for vector

arrays in the parallel vector. The return value is 0for success and -1 if the input vector

or its ops structure are NULL.

N VEnableConstVectorArray Parallel

Prototype int N VEnableConstVectorArray Parallel(N Vector v, booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) the const operation for vector

arrays in the parallel vector. The return value is 0for success and -1 if the input vector

or its ops structure are NULL.

N VEnableWrmsNormVectorArray Parallel

Prototype int N VEnableWrmsNormVectorArray Parallel(N Vector v, booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) the WRMS norm operation for

vector arrays in the parallel vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableWrmsNormMaskVectorArray Parallel

Prototype int N VEnableWrmsNormMaskVectorArray Parallel(N Vector v, booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) the masked WRMS norm op-

eration for vector arrays in the parallel vector. The return value is 0for success and -1

if the input vector or its ops structure are NULL.

N VEnableScaleAddMultiVectorArray Parallel

Prototype int N VEnableScaleAddMultiVectorArray Parallel(N Vector v,

booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale and add a vector

array to multiple vector arrays operation in the parallel vector. The return value is 0

for success and -1 if the input vector or its ops structure are NULL.

N VEnableLinearCombinationVectorArray Parallel

Prototype int N VEnableLinearCombinationVectorArray Parallel(N Vector v,

booleantype tf);

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear combination operation

for vector arrays in the parallel vector. The return value is 0for success and -1 if the

input vector or its ops structure are NULL.

Notes

•When looping over the components of an N Vector v, it is more eﬃcient to ﬁrst obtain the local

component array via v data = NV DATA P(v) and then access v data[i] within the loop than

it is to use NV Ith P(v,i) within the loop.

•N VNewEmpty Parallel,N VMake Parallel, and N VCloneVectorArrayEmpty Parallel set the

ﬁeld own data =SUNFALSE.N VDestroy Parallel and N VDestroyVectorArray Parallel will

not attempt to free the pointer data for any N Vector with own data set to SUNFALSE. In such

a case, it is the user’s responsibility to deallocate the data pointer.

178 Description of the NVECTOR module

•To maximize eﬃciency, vector operations in the nvector parallel implementation that have

more than one N Vector argument do not check for consistent internal representation of these

vectors. It is the user’s responsibility to ensure that such routines are called with N Vector

arguments that were all created with the same internal representations.

7.3.3 NVECTOR PARALLEL Fortran interfaces

For solvers that include a Fortran 77 interface module, the nvector parallel module also in-

cludes a Fortran-callable function FNVINITP(COMM, code, NLOCAL, NGLOBAL, IER), to initialize

this nvector parallel module. Here COMM is the MPI communicator, code is an input solver

id (1 for cvode, 2 for ida, 3 for kinsol, 4 for arkode); NLOCAL and NGLOBAL are the local and

global vector sizes, respectively (declared so as to match C type long int); and IER is an error

return ﬂag equal 0 for success and -1 for failure. NOTE: If the header ﬁle sundials config.h de-

ﬁnes SUNDIALS MPI COMM F2C to be 1 (meaning the MPI implementation used to build sundials

includes the MPI Comm f2c function), then COMM can be any valid MPI communicator. Otherwise,

MPI COMM WORLD will be used, so just pass an integer value as a placeholder.

7.4 The NVECTOR OPENMP implementation

In situations where a user has a multi-core processing unit capable of running multiple parallel threads

with shared memory, sundials provides an implementation of nvector using OpenMP, called nvec-

tor openmp, and an implementation using Pthreads, called nvector pthreads. Testing has shown

that vectors should be of length at least 100,000 before the overhead associated with creating and

using the threads is made up by the parallelism in the vector calculations.

The OpenMP nvector implementation provided with sundials,nvector openmp, deﬁnes the

content ﬁeld of N Vector to be a structure containing the length of the vector, a pointer to the

beginning of a contiguous data array, a boolean ﬂag own data which speciﬁes the ownership of data,

and the number of threads. Operations on the vector are threaded using OpenMP.

struct _N_VectorContent_OpenMP {

sunindextype length;

booleantype own_data;

realtype *data;

int num_threads;

};

The header ﬁle to include when using this module is nvector openmp.h. The installed module

library to link to is libsundials nvecopenmp.lib where .lib is typically .so for shared libraries

and .a for static libraries. The Fortran module ﬁle to use when using the Fortran 2003 interface

to this module is fnvector openmp mod.mod.

7.4.1 NVECTOR OPENMP accessor macros

The following macros are provided to access the content of an nvector openmp vector. The suﬃx

OMP in the names denotes the OpenMP version.

•NV CONTENT OMP

This routine gives access to the contents of the OpenMP vector N Vector.

The assignment v cont =NV CONTENT OMP(v) sets v cont to be a pointer to the OpenMP

N Vector content structure.

Implementation:

#define NV_CONTENT_OMP(v) ( (N_VectorContent_OpenMP)(v->content) )

7.4 The NVECTOR OPENMP implementation 179

•NV OWN DATA OMP,NV DATA OMP,NV LENGTH OMP,NV NUM THREADS OMP

These macros give individual access to the parts of the content of a OpenMP N Vector.

The assignment v data = NV DATA OMP(v) sets v data to be a pointer to the ﬁrst component

of the data for the N Vector v. The assignment NV DATA OMP(v) = v data sets the component

array of vto be v data by storing the pointer v data.

The assignment v len = NV LENGTH OMP(v) sets v len to be the length of v. On the other

hand, the call NV LENGTH OMP(v) = len v sets the length of vto be len v.

The assignment v num threads = NV NUM THREADS OMP(v) sets v num threads to be the num-

ber of threads from v. On the other hand, the call NV NUM THREADS OMP(v) = num threads v

sets the number of threads for vto be num threads v.

Implementation:

#define NV_OWN_DATA_OMP(v) ( NV_CONTENT_OMP(v)->own_data )

#define NV_DATA_OMP(v) ( NV_CONTENT_OMP(v)->data )

#define NV_LENGTH_OMP(v) ( NV_CONTENT_OMP(v)->length )

#define NV_NUM_THREADS_OMP(v) ( NV_CONTENT_OMP(v)->num_threads )

•NV Ith OMP

This macro gives access to the individual components of the data array of an N Vector.

The assignment r = NV Ith OMP(v,i) sets rto be the value of the i-th component of v. The

assignment NV Ith OMP(v,i) = r sets the value of the i-th component of vto be r.

Here iranges from 0 to n−1 for a vector of length n.

Implementation:

#define NV_Ith_OMP(v,i) ( NV_DATA_OMP(v)[i] )

7.4.2 NVECTOR OPENMP functions

The nvector openmp module deﬁnes OpenMP implementations of all vector operations listed in

Tables 7.2,7.3, and 7.4. Their names are obtained from those in Tables 7.2,7.3, and 7.4 by appending

the suﬃx OpenMP (e.g. N VDestroy OpenMP). All the standard vector operations listed in 7.2 with

the suﬃx OpenMP appended are callable via the Fortran 2003 interface by prepending an ‘F’ (e.g.

FN VDestroy OpenMP).

The module nvector openmp provides the following additional user-callable routines:

N VNew OpenMP

Prototype N Vector N VNew OpenMP(sunindextype vec length, int num threads)

Description This function creates and allocates memory for a OpenMP N Vector. Arguments are

the vector length and number of threads.

F2003 Name This function is callable as FN VNew OpenMP when using the Fortran 2003 interface mod-

ule.

N VNewEmpty OpenMP

Prototype N Vector N VNewEmpty OpenMP(sunindextype vec length, int num threads)

Description This function creates a new OpenMP N Vector with an empty (NULL) data array.

F2003 Name This function is callable as FN VNewEmpty OpenMP when using the Fortran 2003 interface

module.

180 Description of the NVECTOR module

N VMake OpenMP

Prototype N Vector N VMake OpenMP(sunindextype vec length, realtype *v data,

int num threads);

Description This function creates and allocates memory for a OpenMP vector with user-provided

data array. This function does not allocate memory for v data itself.

F2003 Name This function is callable as FN VMake OpenMP when using the Fortran 2003 interface

module.

N VCloneVectorArray OpenMP

Prototype N Vector *N VCloneVectorArray OpenMP(int count, N Vector w)

Description This function creates (by cloning) an array of count OpenMP vectors.

N VCloneVectorArrayEmpty OpenMP

Prototype N Vector *N VCloneVectorArrayEmpty OpenMP(int count, N Vector w)

Description This function creates (by cloning) an array of count OpenMP vectors, each with an

empty (NULL) data array.

N VDestroyVectorArray OpenMP

Prototype void N VDestroyVectorArray OpenMP(N Vector *vs, int count)

Description This function frees memory allocated for the array of count variables of type N Vector

created with N VCloneVectorArray OpenMP or with N VCloneVectorArrayEmpty OpenMP.

N VGetLength OpenMP

Prototype sunindextype N VGetLength OpenMP(N Vector v)

Description This function returns number of vector elements.

F2003 Name This function is callable as FN VGetLength OpenMP when using the Fortran 2003 interface

module.

N VPrint OpenMP

Prototype void N VPrint OpenMP(N Vector v)

Description This function prints the content of an OpenMP vector to stdout.

F2003 Name This function is callable as FN VPrint OpenMP when using the Fortran 2003 interface

module.

N VPrintFile OpenMP

Prototype void N VPrintFile OpenMP(N Vector v, FILE *outfile)

Description This function prints the content of an OpenMP vector to outfile.

By default all fused and vector array operations are disabled in the nvector openmp module.

The following additional user-callable routines are provided to enable or disable fused and vector

array operations for a speciﬁc vector. To ensure consistency across vectors it is recommended to ﬁrst

create a vector with N VNew OpenMP, enable/disable the desired operations for that vector with the

functions below, and create any additional vectors from that vector using N VClone. This guarantees

the new vectors will have the same operations enabled/disabled as cloned vectors inherit the same

enable/disable options as the vector they are cloned from while vectors created with N VNew OpenMP

will have the default settings for the nvector openmp module.

7.4 The NVECTOR OPENMP implementation 181

N VEnableFusedOps OpenMP

Prototype int N VEnableFusedOps OpenMP(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) all fused and vector array op-

erations in the OpenMP vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableLinearCombination OpenMP

Prototype int N VEnableLinearCombination OpenMP(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear combination fused

operation in the OpenMP vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableScaleAddMulti OpenMP

Prototype int N VEnableScaleAddMulti OpenMP(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale and add a vector to

multiple vectors fused operation in the OpenMP vector. The return value is 0for success

and -1 if the input vector or its ops structure are NULL.

N VEnableDotProdMulti OpenMP

Prototype int N VEnableDotProdMulti OpenMP(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the multiple dot products fused

operation in the OpenMP vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableLinearSumVectorArray OpenMP

Prototype int N VEnableLinearSumVectorArray OpenMP(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear sum operation for

vector arrays in the OpenMP vector. The return value is 0for success and -1 if the

input vector or its ops structure are NULL.

N VEnableScaleVectorArray OpenMP

Prototype int N VEnableScaleVectorArray OpenMP(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale operation for vector

arrays in the OpenMP vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableConstVectorArray OpenMP

Prototype int N VEnableConstVectorArray OpenMP(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the const operation for vector

arrays in the OpenMP vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

182 Description of the NVECTOR module

N VEnableWrmsNormVectorArray OpenMP

Prototype int N VEnableWrmsNormVectorArray OpenMP(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the WRMS norm operation for

vector arrays in the OpenMP vector. The return value is 0for success and -1 if the

input vector or its ops structure are NULL.

N VEnableWrmsNormMaskVectorArray OpenMP

Prototype int N VEnableWrmsNormMaskVectorArray OpenMP(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the masked WRMS norm op-

eration for vector arrays in the OpenMP vector. The return value is 0for success and

-1 if the input vector or its ops structure are NULL.

N VEnableScaleAddMultiVectorArray OpenMP

Prototype int N VEnableScaleAddMultiVectorArray OpenMP(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale and add a vector array

to multiple vector arrays operation in the OpenMP vector. The return value is 0for

success and -1 if the input vector or its ops structure are NULL.

N VEnableLinearCombinationVectorArray OpenMP

Prototype int N VEnableLinearCombinationVectorArray OpenMP(N Vector v,

booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear combination operation

for vector arrays in the OpenMP vector. The return value is 0for success and -1 if the

input vector or its ops structure are NULL.

Notes

•When looping over the components of an N Vector v, it is more eﬃcient to ﬁrst obtain the

component array via v data = NV DATA OMP(v) and then access v data[i] within the loop

than it is to use NV Ith OMP(v,i) within the loop.

•N VNewEmpty OpenMP,N VMake OpenMP, and N VCloneVectorArrayEmpty OpenMP set the ﬁeld

own data =SUNFALSE.N VDestroy OpenMP and N VDestroyVectorArray OpenMP will not at-

tempt to free the pointer data for any N Vector with own data set to SUNFALSE. In such a case,

it is the user’s responsibility to deallocate the data pointer.

•To maximize eﬃciency, vector operations in the nvector openmp implementation that have

more than one N Vector argument do not check for consistent internal representation of these

vectors. It is the user’s responsibility to ensure that such routines are called with N Vector

arguments that were all created with the same internal representations.

7.4.3 NVECTOR OPENMP Fortran interfaces

The nvector openmp module provides a Fortran 2003 module as well as Fortran 77 style inter-

face functions for use from Fortran applications.

7.5 The NVECTOR PTHREADS implementation 183

FORTRAN 2003 interface module

The nvector openmp mod Fortran module deﬁnes interfaces to most nvector openmp C functions

using the intrinsic iso c binding module which provides a standardized mechanism for interoperat-

ing with C. As noted in the Cfunction descriptions above, the interface functions are named after

the corresponding Cfunction, but with a leading ‘F’. For example, the function N VNew OpenMP is

interfaced as FN VNew OpenMP.

The Fortran 2003 nvector openmp interface module can be accessed with the use statement,

i.e. use fnvector openmp mod, and linking to the library libsundials fnvectoropenmp mod.lib in

addition to the Clibrary. For details on where the library and module ﬁle fnvector openmp mod.mod

are installed see Appendix A.

FORTRAN 77 interface functions

For solvers that include a Fortran 77 interface module, the nvector openmp module also includes

aFortran-callable function FNVINITOMP(code, NEQ, NUMTHREADS, IER), to initialize this module.

Here code is an input solver id (1 for cvode, 2 for ida, 3 for kinsol, 4 for arkode); NEQ is the

problem size (declared so as to match C type long int); NUMTHREADS is the number of threads;

and IER is an error return ﬂag equal 0 for success and -1 for failure.

7.5 The NVECTOR PTHREADS implementation

In situations where a user has a multi-core processing unit capable of running multiple parallel threads

with shared memory, sundials provides an implementation of nvector using OpenMP, called nvec-

tor openmp, and an implementation using Pthreads, called nvector pthreads. Testing has shown

that vectors should be of length at least 100,000 before the overhead associated with creating and

using the threads is made up by the parallelism in the vector calculations.

The Pthreads nvector implementation provided with sundials, denoted nvector pthreads,

deﬁnes the content ﬁeld of N Vector to be a structure containing the length of the vector, a pointer

to the beginning of a contiguous data array, a boolean ﬂag own data which speciﬁes the ownership

of data, and the number of threads. Operations on the vector are threaded using POSIX threads

(Pthreads).

struct _N_VectorContent_Pthreads {

sunindextype length;

booleantype own_data;

realtype *data;

int num_threads;

};

The header ﬁle to include when using this module is nvector pthreads.h. The installed module

library to link to is libsundials nvecpthreads.lib where .lib is typically .so for shared libraries

and .a for static libraries.

7.5.1 NVECTOR PTHREADS accessor macros

The following macros are provided to access the content of an nvector pthreads vector. The suﬃx

PT in the names denotes the Pthreads version.

•NV CONTENT PT

This routine gives access to the contents of the Pthreads vector N Vector.

The assignment v cont =NV CONTENT PT(v) sets v cont to be a pointer to the Pthreads

N Vector content structure.

Implementation:

#define NV_CONTENT_PT(v) ( (N_VectorContent_Pthreads)(v->content) )

184 Description of the NVECTOR module

•NV OWN DATA PT,NV DATA PT,NV LENGTH PT,NV NUM THREADS PT

These macros give individual access to the parts of the content of a Pthreads N Vector.

The assignment v data = NV DATA PT(v) sets v data to be a pointer to the ﬁrst component

of the data for the N Vector v. The assignment NV DATA PT(v) = v data sets the component

array of vto be v data by storing the pointer v data.

The assignment v len = NV LENGTH PT(v) sets v len to be the length of v. On the other hand,

the call NV LENGTH PT(v) = len v sets the length of vto be len v.

The assignment v num threads = NV NUM THREADS PT(v) sets v num threads to be the number

of threads from v. On the other hand, the call NV NUM THREADS PT(v) = num threads v sets

the number of threads for vto be num threads v.

Implementation:

#define NV_OWN_DATA_PT(v) ( NV_CONTENT_PT(v)->own_data )

#define NV_DATA_PT(v) ( NV_CONTENT_PT(v)->data )

#define NV_LENGTH_PT(v) ( NV_CONTENT_PT(v)->length )

#define NV_NUM_THREADS_PT(v) ( NV_CONTENT_PT(v)->num_threads )

•NV Ith PT

This macro gives access to the individual components of the data array of an N Vector.

The assignment r = NV Ith PT(v,i) sets rto be the value of the i-th component of v. The

assignment NV Ith PT(v,i) = r sets the value of the i-th component of vto be r.

Here iranges from 0 to n−1 for a vector of length n.

Implementation:

#define NV_Ith_PT(v,i) ( NV_DATA_PT(v)[i] )

7.5.2 NVECTOR PTHREADS functions

The nvector pthreads module deﬁnes Pthreads implementations of all vector operations listed in

Tables 7.2,7.3, and 7.4. Their names are obtained from those in Tables 7.2,7.3, and 7.4 by appending

the suﬃx Pthreads (e.g. N VDestroy Pthreads). All the standard vector operations listed in 7.2

are callable via the Fortran 2003 interface by prepending an ‘F’ (e.g. FN VDestroy Pthreads). The

module nvector pthreads provides the following additional user-callable routines:

N VNew Pthreads

Prototype N Vector N VNew Pthreads(sunindextype vec length, int num threads)

Description This function creates and allocates memory for a Pthreads N Vector. Arguments are

the vector length and number of threads.

F2003 Name This function is callable as FN VNew Pthreads when using the Fortran 2003 interface

module.

N VNewEmpty Pthreads

Prototype N Vector N VNewEmpty Pthreads(sunindextype vec length, int num threads)

Description This function creates a new Pthreads N Vector with an empty (NULL) data array.

F2003 Name This function is callable as FN VNewEmpty Pthreads when using the Fortran 2003 inter-

face module.

7.5 The NVECTOR PTHREADS implementation 185

N VMake Pthreads

Prototype N Vector N VMake Pthreads(sunindextype vec length, realtype *v data,

int num threads);

Description This function creates and allocates memory for a Pthreads vector with user-provided

data array. This function does not allocate memory for v data itself.

F2003 Name This function is callable as FN VMake Pthreads when using the Fortran 2003 interface

module.

N VCloneVectorArray Pthreads

Prototype N Vector *N VCloneVectorArray Pthreads(int count, N Vector w)

Description This function creates (by cloning) an array of count Pthreads vectors.

N VCloneVectorArrayEmpty Pthreads

Prototype N Vector *N VCloneVectorArrayEmpty Pthreads(int count, N Vector w)

Description This function creates (by cloning) an array of count Pthreads vectors, each with an

empty (NULL) data array.

N VDestroyVectorArray Pthreads

Prototype void N VDestroyVectorArray Pthreads(N Vector *vs, int count)

Description This function frees memory allocated for the array of count variables of type N Vector

created with N VCloneVectorArray Pthreads or with

N VCloneVectorArrayEmpty Pthreads.

N VGetLength Pthreads

Prototype sunindextype N VGetLength Pthreads(N Vector v)

Description This function returns the number of vector elements.

F2003 Name This function is callable as FN VGetLength Pthreads when using the Fortran 2003 in-

terface module.

N VPrint Pthreads

Prototype void N VPrint Pthreads(N Vector v)

Description This function prints the content of a Pthreads vector to stdout.

F2003 Name This function is callable as FN VPrint Pthreads when using the Fortran 2003 interface

module.

N VPrintFile Pthreads

Prototype void N VPrintFile Pthreads(N Vector v, FILE *outfile)

Description This function prints the content of a Pthreads vector to outfile.

By default all fused and vector array operations are disabled in the nvector pthreads module.

The following additional user-callable routines are provided to enable or disable fused and vector

array operations for a speciﬁc vector. To ensure consistency across vectors it is recommended to ﬁrst

create a vector with N VNew Pthreads, enable/disable the desired operations for that vector with the

functions below, and create any additional vectors from that vector using N VClone. This guarantees

the new vectors will have the same operations enabled/disabled as cloned vectors inherit the same

enable/disable options as the vector they are cloned from while vectors created with N VNew Pthreads

will have the default settings for the nvector pthreads module.

186 Description of the NVECTOR module

N VEnableFusedOps Pthreads

Prototype int N VEnableFusedOps Pthreads(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) all fused and vector array op-

erations in the Pthreads vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableLinearCombination Pthreads

Prototype int N VEnableLinearCombination Pthreads(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear combination fused

operation in the Pthreads vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableScaleAddMulti Pthreads

Prototype int N VEnableScaleAddMulti Pthreads(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale and add a vector to

multiple vectors fused operation in the Pthreads vector. The return value is 0for success

and -1 if the input vector or its ops structure are NULL.

N VEnableDotProdMulti Pthreads

Prototype int N VEnableDotProdMulti Pthreads(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the multiple dot products fused

operation in the Pthreads vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableLinearSumVectorArray Pthreads

Prototype int N VEnableLinearSumVectorArray Pthreads(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear sum operation for

vector arrays in the Pthreads vector. The return value is 0for success and -1 if the

input vector or its ops structure are NULL.

N VEnableScaleVectorArray Pthreads

Prototype int N VEnableScaleVectorArray Pthreads(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale operation for vector

arrays in the Pthreads vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableConstVectorArray Pthreads

Prototype int N VEnableConstVectorArray Pthreads(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the const operation for vector

arrays in the Pthreads vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

7.5 The NVECTOR PTHREADS implementation 187

N VEnableWrmsNormVectorArray Pthreads

Prototype int N VEnableWrmsNormVectorArray Pthreads(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the WRMS norm operation for

vector arrays in the Pthreads vector. The return value is 0for success and -1 if the

input vector or its ops structure are NULL.

N VEnableWrmsNormMaskVectorArray Pthreads

Prototype int N VEnableWrmsNormMaskVectorArray Pthreads(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the masked WRMS norm op-

eration for vector arrays in the Pthreads vector. The return value is 0for success and

-1 if the input vector or its ops structure are NULL.

N VEnableScaleAddMultiVectorArray Pthreads

Prototype int N VEnableScaleAddMultiVectorArray Pthreads(N Vector v,

booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale and add a vector array

to multiple vector arrays operation in the Pthreads vector. The return value is 0for

success and -1 if the input vector or its ops structure are NULL.

N VEnableLinearCombinationVectorArray Pthreads

Prototype int N VEnableLinearCombinationVectorArray Pthreads(N Vector v,

booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear combination operation

for vector arrays in the Pthreads vector. The return value is 0for success and -1 if the

input vector or its ops structure are NULL.

Notes

•When looping over the components of an N Vector v, it is more eﬃcient to ﬁrst obtain the

component array via v data = NV DATA PT(v) and then access v data[i] within the loop than

it is to use NV Ith PT(v,i) within the loop.

•N VNewEmpty Pthreads,N VMake Pthreads, and N VCloneVectorArrayEmpty Pthreads set the

ﬁeld own data =SUNFALSE.N VDestroy Pthreads and N VDestroyVectorArray Pthreads will

not attempt to free the pointer data for any N Vector with own data set to SUNFALSE. In such

a case, it is the user’s responsibility to deallocate the data pointer.

•To maximize eﬃciency, vector operations in the nvector pthreads implementation that have

more than one N Vector argument do not check for consistent internal representation of these

vectors. It is the user’s responsibility to ensure that such routines are called with N Vector

arguments that were all created with the same internal representations.

7.5.3 NVECTOR PTHREADS Fortran interfaces

The nvector pthreads module provides a Fortran 2003 module as well as Fortran 77 style

interface functions for use from Fortran applications.

188 Description of the NVECTOR module

FORTRAN 2003 interface module

The nvector pthreads mod Fortran module deﬁnes interfaces to most nvector pthreads C func-

tions using the intrinsic iso c binding module which provides a standardized mechanism for interop-

erating with C. As noted in the Cfunction descriptions above, the interface functions are named after

the corresponding Cfunction, but with a leading ‘F’. For example, the function N VNew Pthreads is

interfaced as FN VNew Pthreads.

The Fortran 2003 nvector pthreads interface module can be accessed with the use statement,

i.e. use fnvector pthreads mod, and linking to the library libsundials fnvectorpthreads mod.lib

in addition to the Clibrary. For details on where the library and module ﬁle fnvector pthreads mod.mod

are installed see Appendix A.

FORTRAN 77 interface functions

For solvers that include a Fortran interface module, the nvector pthreads module also includes

aFortran-callable function FNVINITPTS(code, NEQ, NUMTHREADS, IER), to initialize this module.

Here code is an input solver id (1 for cvode, 2 for ida, 3 for kinsol, 4 for arkode); NEQ is the

problem size (declared so as to match C type long int); NUMTHREADS is the number of threads;

and IER is an error return ﬂag equal 0 for success and -1 for failure.

7.6 The NVECTOR PARHYP implementation

The nvector parhyp implementation of the nvector module provided with sundials is a wrapper

around hypre’s ParVector class. Most of the vector kernels simply call hypre vector operations. The

implementation deﬁnes the content ﬁeld of N Vector to be a structure containing the global and local

lengths of the vector, a pointer to an object of type HYPRE ParVector, an MPI communicator, and a

boolean ﬂag own parvector indicating ownership of the hypre parallel vector object x.

struct _N_VectorContent_ParHyp {

sunindextype local_length;

sunindextype global_length;

booleantype own_parvector;

MPI_Comm comm;

HYPRE_ParVector x;

};

The header ﬁle to include when using this module is nvector parhyp.h. The installed module library

to link to is libsundials nvecparhyp.lib where .lib is typically .so for shared libraries and .a

for static libraries.

Unlike native sundials vector types, nvector parhyp does not provide macros to access its

member variables. Note that nvector parhyp requires sundials to be built with MPI support.

7.6.1 NVECTOR PARHYP functions

The nvector parhyp module deﬁnes implementations of all vector operations listed in Tables 7.2,

7.3, and 7.4, except for N VSetArrayPointer and N VGetArrayPointer, because accessing raw vector

data is handled by low-level hypre functions. As such, this vector is not available for use with sundials

Fortran interfaces. When access to raw vector data is needed, one should extract the hypre vector ﬁrst,

and then use hypre methods to access the data. Usage examples of nvector parhyp are provided in

the cvAdvDiff non ph.c example program for cvode [31] and the ark diurnal kry ph.c example

program for arkode [39].

The names of parhyp methods are obtained from those in Tables 7.2,7.3, and 7.4 by appending

the suﬃx ParHyp (e.g. N VDestroy ParHyp). The module nvector parhyp provides the following

additional user-callable routines:

7.6 The NVECTOR PARHYP implementation 189

N VNewEmpty ParHyp

Prototype N Vector N VNewEmpty ParHyp(MPI Comm comm, sunindextype local length,

sunindextype global length)

Description This function creates a new parhyp N Vector with the pointer to the hypre vector set

to NULL.

N VMake ParHyp

Prototype N Vector N VMake ParHyp(HYPRE ParVector x)

Description This function creates an N Vector wrapper around an existing hypre parallel vector. It

does not allocate memory for xitself.

N VGetVector ParHyp

Prototype HYPRE ParVector N VGetVector ParHyp(N Vector v)

Description This function returns the underlying hypre vector.

N VCloneVectorArray ParHyp

Prototype N Vector *N VCloneVectorArray ParHyp(int count, N Vector w)

Description This function creates (by cloning) an array of count parallel vectors.

N VCloneVectorArrayEmpty ParHyp

Prototype N Vector *N VCloneVectorArrayEmpty ParHyp(int count, N Vector w)

Description This function creates (by cloning) an array of count parallel vectors, each with an empty

(NULL) data array.

N VDestroyVectorArray ParHyp

Prototype void N VDestroyVectorArray ParHyp(N Vector *vs, int count)

Description This function frees memory allocated for the array of count variables of type N Vector

created with N VCloneVectorArray ParHyp or with N VCloneVectorArrayEmpty ParHyp.

N VPrint ParHyp

Prototype void N VPrint ParHyp(N Vector v)

Description This function prints the local content of a parhyp vector to stdout.

N VPrintFile ParHyp

Prototype void N VPrintFile ParHyp(N Vector v, FILE *outfile)

Description This function prints the local content of a parhyp vector to outfile.

By default all fused and vector array operations are disabled in the nvector parhyp module.

The following additional user-callable routines are provided to enable or disable fused and vector

array operations for a speciﬁc vector. To ensure consistency across vectors it is recommended to ﬁrst

create a vector with N VMake ParHyp, enable/disable the desired operations for that vector with the

functions below, and create any additional vectors from that vector using N VClone. This guarantees

the new vectors will have the same operations enabled/disabled as cloned vectors inherit the same

enable/disable options as the vector they are cloned from while vectors created with N VMake ParHyp

will have the default settings for the nvector parhyp module.

190 Description of the NVECTOR module

N VEnableFusedOps ParHyp

Prototype int N VEnableFusedOps ParHyp(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) all fused and vector array oper-

ations in the parhyp vector. The return value is 0for success and -1 if the input vector

or its ops structure are NULL.

N VEnableLinearCombination ParHyp

Prototype int N VEnableLinearCombination ParHyp(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear combination fused

operation in the parhyp vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableScaleAddMulti ParHyp

Prototype int N VEnableScaleAddMulti ParHyp(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale and add a vector to

multiple vectors fused operation in the parhyp vector. The return value is 0for success

and -1 if the input vector or its ops structure are NULL.

N VEnableDotProdMulti ParHyp

Prototype int N VEnableDotProdMulti ParHyp(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the multiple dot products fused

operation in the parhyp vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableLinearSumVectorArray ParHyp

Prototype int N VEnableLinearSumVectorArray ParHyp(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear sum operation for

vector arrays in the parhyp vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableScaleVectorArray ParHyp

Prototype int N VEnableScaleVectorArray ParHyp(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale operation for vector

arrays in the parhyp vector. The return value is 0for success and -1 if the input vector

or its ops structure are NULL.

N VEnableConstVectorArray ParHyp

Prototype int N VEnableConstVectorArray ParHyp(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the const operation for vector

arrays in the parhyp vector. The return value is 0for success and -1 if the input vector

or its ops structure are NULL.

7.7 The NVECTOR PETSC implementation 191

N VEnableWrmsNormVectorArray ParHyp

Prototype int N VEnableWrmsNormVectorArray ParHyp(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the WRMS norm operation for

vector arrays in the parhyp vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableWrmsNormMaskVectorArray ParHyp

Prototype int N VEnableWrmsNormMaskVectorArray ParHyp(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the masked WRMS norm op-

eration for vector arrays in the parhyp vector. The return value is 0for success and -1

if the input vector or its ops structure are NULL.

N VEnableScaleAddMultiVectorArray ParHyp

Prototype int N VEnableScaleAddMultiVectorArray ParHyp(N Vector v,

booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale and add a vector array

to multiple vector arrays operation in the parhyp vector. The return value is 0for success

and -1 if the input vector or its ops structure are NULL.

N VEnableLinearCombinationVectorArray ParHyp

Prototype int N VEnableLinearCombinationVectorArray ParHyp(N Vector v,

booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear combination operation

for vector arrays in the parhyp vector. The return value is 0for success and -1 if the

input vector or its ops structure are NULL.

Notes

•When there is a need to access components of an N Vector ParHyp,v, it is recommended to

extract the hypre vector via x vec = N VGetVector ParHyp(v) and then access components

using appropriate hypre functions.

•N VNewEmpty ParHyp,N VMake ParHyp, and N VCloneVectorArrayEmpty ParHyp set the ﬁeld

own parvector to SUNFALSE.N VDestroy ParHyp and N VDestroyVectorArray ParHyp will not

attempt to delete an underlying hypre vector for any N Vector with own parvector set to

SUNFALSE. In such a case, it is the user’s responsibility to delete the underlying vector.

•To maximize eﬃciency, vector operations in the nvector parhyp implementation that have

more than one N Vector argument do not check for consistent internal representations of these

vectors. It is the user’s responsibility to ensure that such routines are called with N Vector

arguments that were all created with the same internal representations.

7.7 The NVECTOR PETSC implementation

The nvector petsc module is an nvector wrapper around the petsc vector. It deﬁnes the content

ﬁeld of a N Vector to be a structure containing the global and local lengths of the vector, a pointer

to the petsc vector, an MPI communicator, and a boolean ﬂag own data indicating ownership of the

wrapped petsc vector.

192 Description of the NVECTOR module

struct _N_VectorContent_Petsc {

sunindextype local_length;

sunindextype global_length;

booleantype own_data;

Vec *pvec;

MPI_Comm comm;

};

The header ﬁle to include when using this module is nvector petsc.h. The installed module library

to link to is libsundials nvecpetsc.lib where .lib is typically .so for shared libraries and .a for

static libraries.

Unlike native sundials vector types, nvector petsc does not provide macros to access its mem-

ber variables. Note that nvector petsc requires sundials to be built with MPI support.

7.7.1 NVECTOR PETSC functions

The nvector petsc module deﬁnes implementations of all vector operations listed in Tables 7.2,7.3,

and 7.4, except for N VGetArrayPointer and N VSetArrayPointer. As such, this vector cannot be

used with sundials Fortran interfaces. When access to raw vector data is needed, it is recommended

to extract the petsc vector ﬁrst, and then use petsc methods to access the data. Usage examples of

nvector petsc are provided in example programs for ida [29].

The names of vector operations are obtained from those in Tables 7.2,7.3, and 7.4 by appending

the suﬃx Petsc (e.g. N VDestroy Petsc). The module nvector petsc provides the following

additional user-callable routines:

N VNewEmpty Petsc

Prototype N Vector N VNewEmpty Petsc(MPI Comm comm, sunindextype local length,

sunindextype global length)

Description This function creates a new nvector wrapper with the pointer to the wrapped petsc

vector set to (NULL). It is used by the N VMake Petsc and N VClone Petsc implementa-

tions.

N VMake Petsc

Prototype N Vector N VMake Petsc(Vec *pvec)

Description This function creates and allocates memory for an nvector petsc wrapper around a

user-provided petsc vector. It does not allocate memory for the vector pvec itself.

N VGetVector Petsc

Prototype Vec *N VGetVector Petsc(N Vector v)

Description This function returns a pointer to the underlying petsc vector.

N VCloneVectorArray Petsc

Prototype N Vector *N VCloneVectorArray Petsc(int count, N Vector w)

Description This function creates (by cloning) an array of count nvector petsc vectors.

N VCloneVectorArrayEmpty Petsc

Prototype N Vector *N VCloneVectorArrayEmpty Petsc(int count, N Vector w)

Description This function creates (by cloning) an array of count nvector petsc vectors, each with

pointers to petsc vectors set to (NULL).

7.7 The NVECTOR PETSC implementation 193

N VDestroyVectorArray Petsc

Prototype void N VDestroyVectorArray Petsc(N Vector *vs, int count)

Description This function frees memory allocated for the array of count variables of type N Vector

created with N VCloneVectorArray Petsc or with N VCloneVectorArrayEmpty Petsc.

N VPrint Petsc

Prototype void N VPrint Petsc(N Vector v)

Description This function prints the global content of a wrapped petsc vector to stdout.

N VPrintFile Petsc

Prototype void N VPrintFile Petsc(N Vector v, const char fname[])

Description This function prints the global content of a wrapped petsc vector to fname.

By default all fused and vector array operations are disabled in the nvector petsc module.

The following additional user-callable routines are provided to enable or disable fused and vector

array operations for a speciﬁc vector. To ensure consistency across vectors it is recommended to ﬁrst

create a vector with N VMake Petsc, enable/disable the desired operations for that vector with the

functions below, and create any additional vectors from that vector using N VClone. This guarantees

the new vectors will have the same operations enabled/disabled as cloned vectors inherit the same

enable/disable options as the vector they are cloned from while vectors created with N VMake Petsc

will have the default settings for the nvector petsc module.

N VEnableFusedOps Petsc

Prototype int N VEnableFusedOps Petsc(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) all fused and vector array oper-

ations in the petsc vector. The return value is 0for success and -1 if the input vector

or its ops structure are NULL.

N VEnableLinearCombination Petsc

Prototype int N VEnableLinearCombination Petsc(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear combination fused

operation in the petsc vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableScaleAddMulti Petsc

Prototype int N VEnableScaleAddMulti Petsc(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale and add a vector to

multiple vectors fused operation in the petsc vector. The return value is 0for success

and -1 if the input vector or its ops structure are NULL.

N VEnableDotProdMulti Petsc

Prototype int N VEnableDotProdMulti Petsc(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the multiple dot products fused

operation in the petsc vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

194 Description of the NVECTOR module

N VEnableLinearSumVectorArray Petsc

Prototype int N VEnableLinearSumVectorArray Petsc(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear sum operation for

vector arrays in the petsc vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableScaleVectorArray Petsc

Prototype int N VEnableScaleVectorArray Petsc(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale operation for vector

arrays in the petsc vector. The return value is 0for success and -1 if the input vector

or its ops structure are NULL.

N VEnableConstVectorArray Petsc

Prototype int N VEnableConstVectorArray Petsc(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the const operation for vector

arrays in the petsc vector. The return value is 0for success and -1 if the input vector

or its ops structure are NULL.

N VEnableWrmsNormVectorArray Petsc

Prototype int N VEnableWrmsNormVectorArray Petsc(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the WRMS norm operation for

vector arrays in the petsc vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableWrmsNormMaskVectorArray Petsc

Prototype int N VEnableWrmsNormMaskVectorArray Petsc(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the masked WRMS norm op-

eration for vector arrays in the petsc vector. The return value is 0for success and -1

if the input vector or its ops structure are NULL.

N VEnableScaleAddMultiVectorArray Petsc

Prototype int N VEnableScaleAddMultiVectorArray Petsc(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale and add a vector array

to multiple vector arrays operation in the petsc vector. The return value is 0for success

and -1 if the input vector or its ops structure are NULL.

N VEnableLinearCombinationVectorArray Petsc

Prototype int N VEnableLinearCombinationVectorArray Petsc(N Vector v,

booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear combination operation

for vector arrays in the petsc vector. The return value is 0for success and -1 if the

input vector or its ops structure are NULL.

7.8 The NVECTOR CUDA implementation 195

Notes

•When there is a need to access components of an N Vector Petsc,v, it is recommeded to

extract the petsc vector via x vec = N VGetVector Petsc(v) and then access components

using appropriate petsc functions.

•The functions N VNewEmpty Petsc,N VMake Petsc, and N VCloneVectorArrayEmpty Petsc set

the ﬁeld own data to SUNFALSE.N VDestroy Petsc and N VDestroyVectorArray Petsc will not

attempt to free the pointer pvec for any N Vector with own data set to SUNFALSE. In such a

case, it is the user’s responsibility to deallocate the pvec pointer.

•To maximize eﬃciency, vector operations in the nvector petsc implementation that have

more than one N Vector argument do not check for consistent internal representations of these

vectors. It is the user’s responsibility to ensure that such routines are called with N Vector

arguments that were all created with the same internal representations.

7.8 The NVECTOR CUDA implementation

The nvector cuda module is an experimental nvector implementation in the cuda language.

The module allows for sundials vector kernels to run on GPU devices. It is intended for users

who are already familiar with cuda and GPU programming. Building this vector module requires a

CUDA compiler and, by extension, a C++ compiler. The class Vector in the namespace suncudavec

manages the vector data layout:

template <class T, class I>

class Vector {

I size_;

I mem_size_;

I global_size_;

T* h_vec_;

T* d_vec_;

ThreadPartitioning<T, I>* partStream_;

ThreadPartitioning<T, I>* partReduce_;

bool ownPartitioning_;

bool ownData_;

bool managed_mem_;

SUNMPI_Comm comm_;

...

};

The class members are vector size (length), size of the vector data memory block, pointers to vector

data on the host and the device, pointers to ThreadPartitioning implementations that handle thread

partitioning for streaming and reduction vector kernels, a boolean ﬂag that signals if the vector owns

the thread partitioning, a boolean ﬂag that signals if the vector owns the data, a boolean ﬂag that

signals if managed memory is used for the data arrays, and the MPI communicator. The class Vector

inherits from the empty structure

struct _N_VectorContent_Cuda {};

to interface the C++ class with the nvector C code. Due to the rapid progress of cuda development,

we expect that the suncudavec::Vector class will change frequently in future sundials releases. The

code is structured so that it can tolerate signiﬁcant changes in the suncudavec::Vector class without

requiring changes to the user API.

When instantiated with N VNew Cuda, the class Vector will allocate memory on both the host and

the device. Alternatively, a user can provide host and device data arrays by using the N VMake Cuda

196 Description of the NVECTOR module

constructor. To use cuda managed memory, the constructors N VNewManaged Cuda and

N VMakeManaged Cuda are provided. Details on each of these constructors are provided below.

The nvector cuda module can be utilized for single-node parallelism or in a distributed context

with MPI. In the single-node case the header ﬁle to include nvector cuda.h and the library to

link to is libsundials nveccuda.lib . In the a distributed setting the header ﬁle to include is

nvector mpicuda.h and the library to link to is libsundials nvecmpicuda.lib . The extension,

.lib, is typically .so for shared libraries and .a for static libraries. Only one of these libraries may

be linked to when creating an executable or library. sundials must be built with MPI support if the

distributed library is desired.

7.8.1 NVECTOR CUDA functions

Unlike other native sundials vector types, nvector cuda does not provide macros to access its

member variables. Instead, user should use the accessor functions:

N VGetLength Cuda

Prototype sunindextype N VGetLength Cuda(N Vector v)

Description This function returns the global length of the vector.

N VGetLocalLength Cuda

Prototype sunindextype N VGetLocalLength Cuda(N Vector v)

Description This function returns the local length of the vector.

Note: This function is for use in a distributed context and is deﬁned in the header

nvector mpicuda.h and the library to link to is libsundials nvecmpicuda.lib.

N VGetHostArrayPointer Cuda

Prototype realtype *N VGetHostArrayPointer Cuda(N Vector v)

Description This function returns a pointer to the vector data on the host.

N VGetDeviceArrayPointer Cuda

Prototype realtype *N VGetDeviceArrayPointer Cuda(N Vector v)

Description This function returns a pointer to the vector data on the device.

N VGetMPIComm Cuda

Prototype MPI Comm N VGetMPIComm Cuda(N Vector v)

Description This function returns the MPI communicator for the vector.

Note: This function is for use in a distributed context and is deﬁned in the header

nvector mpicuda.h and the library to link to is libsundials nvecmpicuda.lib.

N VIsManagedMemory Cuda

Prototype booleantype *N VIsManagedMemory Cuda(N Vector v)

Description This function returns a boolean ﬂag indicating if the vector data is allocated in managed

memory or not.

7.8 The NVECTOR CUDA implementation 197

The nvector cuda module deﬁnes implementations of all vector operations listed in Tables 7.2,

7.3, and 7.4, except for N VGetArrayPointer and N VSetArrayPointer. As such, this vector cannot be

used with the sundials Fortran interfaces, nor with the sundials direct solvers and preconditioners.

Instead, the nvector cuda module provides separate functions to access data on the host and on

the device. It also provides methods for copying from the host to the device and vice versa. Usage

examples of nvector cuda are provided in some example programs for cvode [31].

The names of vector operations are obtained from those in Tables 7.2,7.3, and 7.4 by appending the

suﬃx Cuda (e.g. N VDestroy Cuda). The module nvector cuda provides the following functions:

N VNew Cuda

Single-node usage

Prototype N Vector N VNew Cuda(sunindextype length)

Description This function creates and allocates memory for a cuda N Vector. The vector data array

is allocated on both the host and device. In the single-node setting, the only input is

the vector length. This constructor is deﬁned in the header nvector cuda.h and the

library to link to is libsundials nveccuda.lib.

Distributed-memory parallel usage

Prototype N Vector N VNew Cuda(MPI Comm comm, sunindextype local length,

sunindextype global length)

Description This function creates and allocates memory for a cuda N Vector. The vector data

array is allocated on both the host and device. When used in a distributed context

with MPI, the arguments are the MPI communicator, the local vector length, and the

global vector length. This constructor is deﬁned in the header nvector mpicuda.h and

the library to link to is libsundials nvecmpicuda.lib.

N VNewManaged Cuda

Single-node usage

Prototype N Vector N VNewManaged Cuda(sunindextype length)

Description This function creates and allocates memory for a cuda N Vector on a single node. The

vector data array is allocated in managed memory. In the single-node setting, the only

input is the vector length. This constructor is deﬁned in the header nvector cuda.h

and the library to link to is libsundials nveccuda.lib.

Distributed-memory parallel usage

Prototype N Vector N VNewManaged Cuda(MPI Comm comm, sunindextype local length,

sunindextype global length)

Description This function creates and allocates memory for a cuda N Vector on a single node. The

vector data array is allocated in managed memory. When used in a distributed context

with MPI, the arguments are the MPI communicator, the local vector lenght, and the

global vector length. This constructor is deﬁned in the header nvector mpicuda.h and

the library to link to is libsundials nvecmpicuda.lib.

N VNewEmpty Cuda

Prototype N Vector N VNewEmpty Cuda()

Description This function creates a new nvector wrapper with the pointer to the wrapped cuda

vector set to NULL. It is used by the N VNew Cuda,N VMake Cuda, and N VClone Cuda

implementations.

198 Description of the NVECTOR module

N VMake Cuda

Single-node usage

Prototype N Vector N VMake Cuda(sunindextype length, realtype *h vdata,

realtype *d vdata)

Description This function creates an nvector cuda with user-supplied vector data arrays h vdata

and d vdata. This function does not allocate memory for data itself. In the single-

node setting, the inputs are the vector length, the host data array, and the device data.

This constructor is deﬁned in the header nvector cuda.h and the library to link to is

libsundials nveccuda.lib.

Distributed-memory parallel usage

Prototype N Vector N VMake Cuda(MPI Comm comm, sunindextype local length,

sunindextype global length, realtype *h vdata,

realtype *d vdata)

Description This function creates an nvector cuda with user-supplied vector data arrays h vdata

and d vdata. This function does not allocate memory for data itself. When used in

adistributed context with MPI, the arguments are the MPI communicator, the local

vector lenght, the global vector length, the host data array, and the device data array.

This constructor is deﬁned in the header nvector mpicuda.h and the library to link to

is libsundials nvecmpicuda.lib.

N VMakeManaged Cuda

Single-node usage

Prototype N Vector N VMakeManaged Cuda(sunindextype length, realtype *vdata)

Description This function creates an nvector cuda with a user-supplied managed memory data

array. This function does not allocate memory for data itself. In the single-node setting,

the inputs are the vector length and the managed data array. This constructor is deﬁned

in the header nvector cuda.h and the library to link to is libsundials nveccuda.lib.

Distributed-memory parallel usage

Prototype N Vector N VMakeManaged Cuda(MPI Comm comm, sunindextype local length,

sunindextype global length, realtype *vdata)

Description This function creates an nvector cuda with a user-supplied managed memory data

array. This function does not allocate memory for data itself. When used in a distributed

context with MPI, the arguments are the MPI communicator, the local vector lenght,

the global vector length, the managed data array. This constructor is deﬁned in the

header nvector mpicuda.h and the library to link to is libsundials nvecmpicuda.lib.

The module nvector cuda also provides the following user-callable routines:

N VSetCudaStream Cuda

Prototype void N VSetCudaStream Cuda(N Vector v, cudaStream t *stream)

Description This function sets the cuda stream that all vector kernels will be launched on. By

default an nvector cuda uses the default cuda stream.

Note: All vectors used in a single instance of a SUNDIALS solver must use the same

cuda stream, and the cuda stream must be set prior to solver initialization. Addi-

tionally, if manually instantiating the stream and reduce ThreadPartitioning of a

suncudavec::Vector, ensure that they use the same cuda stream.

7.8 The NVECTOR CUDA implementation 199

N VCopyToDevice Cuda

Prototype realtype *N VCopyToDevice Cuda(N Vector v)

Description This function copies host vector data to the device.

N VCopyFromDevice Cuda

Prototype realtype *N VCopyFromDevice Cuda(N Vector v)

Description This function copies vector data from the device to the host.

N VPrint Cuda

Prototype void N VPrint Cuda(N Vector v)

Description This function prints the content of a cuda vector to stdout.

N VPrintFile Cuda

Prototype void N VPrintFile Cuda(N Vector v, FILE *outfile)

Description This function prints the content of a cuda vector to outfile.

By default all fused and vector array operations are disabled in the nvector cuda module.

The following additional user-callable routines are provided to enable or disable fused and vector

array operations for a speciﬁc vector. To ensure consistency across vectors it is recommended to

ﬁrst create a vector with N VNew Cuda, enable/disable the desired operations for that vector with the

functions below, and create any additional vectors from that vector using N VClone. This guarantees

the new vectors will have the same operations enabled/disabled as cloned vectors inherit the same

enable/disable options as the vector they are cloned from while vectors created with N VNew Cuda will

have the default settings for the nvector cuda module.

N VEnableFusedOps Cuda

Prototype int N VEnableFusedOps Cuda(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) all fused and vector array op-

erations in the cuda vector. The return value is 0for success and -1 if the input vector

or its ops structure are NULL.

N VEnableLinearCombination Cuda

Prototype int N VEnableLinearCombination Cuda(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear combination fused

operation in the cuda vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableScaleAddMulti Cuda

Prototype int N VEnableScaleAddMulti Cuda(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale and add a vector to

multiple vectors fused operation in the cuda vector. The return value is 0for success

and -1 if the input vector or its ops structure are NULL.

200 Description of the NVECTOR module

N VEnableDotProdMulti Cuda

Prototype int N VEnableDotProdMulti Cuda(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the multiple dot products fused

operation in the cuda vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableLinearSumVectorArray Cuda

Prototype int N VEnableLinearSumVectorArray Cuda(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear sum operation for

vector arrays in the cuda vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableScaleVectorArray Cuda

Prototype int N VEnableScaleVectorArray Cuda(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale operation for vector

arrays in the cuda vector. The return value is 0for success and -1 if the input vector

or its ops structure are NULL.

N VEnableConstVectorArray Cuda

Prototype int N VEnableConstVectorArray Cuda(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the const operation for vector

arrays in the cuda vector. The return value is 0for success and -1 if the input vector

or its ops structure are NULL.

N VEnableWrmsNormVectorArray Cuda

Prototype int N VEnableWrmsNormVectorArray Cuda(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the WRMS norm operation for

vector arrays in the cuda vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableWrmsNormMaskVectorArray Cuda

Prototype int N VEnableWrmsNormMaskVectorArray Cuda(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the masked WRMS norm op-

eration for vector arrays in the cuda vector. The return value is 0for success and -1 if

the input vector or its ops structure are NULL.

N VEnableScaleAddMultiVectorArray Cuda

Prototype int N VEnableScaleAddMultiVectorArray Cuda(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale and add a vector array

to multiple vector arrays operation in the cuda vector. The return value is 0for success

and -1 if the input vector or its ops structure are NULL.

7.9 The NVECTOR RAJA implementation 201

N VEnableLinearCombinationVectorArray Cuda

Prototype int N VEnableLinearCombinationVectorArray Cuda(N Vector v,

booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear combination operation

for vector arrays in the cuda vector. The return value is 0for success and -1 if the

input vector or its ops structure are NULL.

Notes

•When there is a need to access components of an N Vector Cuda,v, it is recommeded to use

functions N VGetDeviceArrayPointer Cuda or N VGetHostArrayPointer Cuda.

•To maximize eﬃciency, vector operations in the nvector cuda implementation that have more

than one N Vector argument do not check for consistent internal representations of these vectors.

It is the user’s responsibility to ensure that such routines are called with N Vector arguments

that were all created with the same internal representations.

7.9 The NVECTOR RAJA implementation

The nvector raja module is an experimental nvector implementation using the raja hardware

abstraction layer. In this implementation, raja allows for sundials vector kernels to run on GPU

devices. The module is intended for users who are already familiar with raja and GPU programming.

Building this vector module requires a C++11 compliant compiler and a CUDA software development

toolkit. Besides the cuda backend, raja has other backends such as serial, OpenMP, and OpenACC.

These backends are not used in this sundials release. Class Vector in namespace sunrajavec

manages the vector data layout:

template <class T, class I>

class Vector {

I size_;

I mem_size_;

I global_size_;

T* h_vec_;

T* d_vec_;

SUNMPI_Comm comm_;

...

};

The class members are: vector size (length), size of the vector data memory block, the global vector

size (length), pointers to vector data on the host and on the device, and the MPI communicator. The

class Vector inherits from an empty structure

struct _N_VectorContent_Raja {

};

to interface the C++ class with the nvector C code. When instantiated, the class Vector will

allocate memory on both the host and the device. Due to the rapid progress of raja development, we

expect that the sunrajavec::Vector class will change frequently in future sundials releases. The

code is structured so that it can tolerate signiﬁcant changes in the sunrajavec::Vector class without

requiring changes to the user API.

The nvector raja module can be utilized for single-node parallelism or in a distributed con-

text with MPI. The header ﬁle to include when using this module for single-node parallelism is

nvector raja.h. The header ﬁle to include when using this module in the distributed case is

nvector mpiraja.h. The installed module libraries to link to are libsundials nvecraja.lib in

the single-node case, or libsundials nvecmpicudaraja.lib in the distributed case. Only one one

202 Description of the NVECTOR module

of these libraries may be linked to when creating an executable or library. sundials must be built

with MPI support if the distributed library is desired.

7.9.1 NVECTOR RAJA functions

Unlike other native sundials vector types, nvector raja does not provide macros to access its

member variables. Instead, user should use the accessor functions:

N VGetLength Raja

Prototype sunindextype N VGetLength Raja(N Vector v)

Description This function returns the global length of the vector.

N VGetLocalLength Raja

Prototype sunindextype N VGetLocalLength Raja(N Vector v)

Description This function returns the local length of the vector.

Note: This function is for use in a distributed context and is deﬁned in the header

nvector mpiraja.h and the library to link to is libsundials nvecmpicudaraja.lib.

N VGetHostArrayPointer Raja

Prototype realtype *N VGetHostArrayPointer Raja(N Vector v)

Description This function returns a pointer to the vector data on the host.

N VGetDeviceArrayPointer Raja

Prototype realtype *N VGetDeviceArrayPointer Raja(N Vector v)

Description This function returns a pointer to the vector data on the device.

N VGetMPIComm Raja

Prototype MPI Comm N VGetMPIComm Raja(N Vector v)

Description This function returns the MPI communicator for the vector.

Note: This function is for use in a distributed context and is deﬁned in the header

nvector mpiraja.h and the library to link to is libsundials nvecmpicudaraja.lib.

The nvector raja module deﬁnes the implementations of all vector operations listed in Tables

7.2,7.3, and 7.4, except for N VDotProdMulti,N VWrmsNormVectorArray, and

N VWrmsNormMaskVectorArray as support for arrays of reduction vectors is not yet supported in raja.

These function will be added to the nvector raja implementation in the future. Additionally the

vector operations N VGetArrayPointer and N VSetArrayPointer are not implemented by the raja

vector. As such, this vector cannot be used with the sundials Fortran interfaces, nor with the

sundials direct solvers and preconditioners. The nvector raja module provides separate functions

to access data on the host and on the device. It also provides methods for copying data from the

host to the device and vice versa. Usage examples of nvector raja are provided in some example

programs for cvode [31].

The names of vector operations are obtained from those in Tables 7.2,7.3, and 7.4, by append-

ing the suﬃx Raja (e.g. N VDestroy Raja). The module nvector raja provides the following

additional user-callable routines:

7.9 The NVECTOR RAJA implementation 203

N VNew Raja

Single-node usage

Prototype N Vector N VNew Raja(sunindextype length)

Description This function creates and allocates memory for a cuda N Vector. The vector data array

is allocated on both the host and device. In the single-node setting, the only input is

the vector length. This constructor is deﬁned in the header nvector raja.h and the

library to link to is libsundials nveccudaraja.lib.

Distributed-memory parallel usage

Prototype N Vector N VNew Raja(MPI Comm comm, sunindextype local length,

sunindextype global length)

Description This function creates and allocates memory for a cuda N Vector. The vector data

array is allocated on both the host and device. When used in a distributed context

with MPI, the arguments are the MPI communicator, the local vector lenght, and the

global vector length. This constructor is deﬁned in the header nvector mpiraja.h and

the library to link to is libsundials nvecmpicudaraja.lib.

N VNewEmpty Raja

Prototype N Vector N VNewEmpty Raja()

Description This function creates a new nvector wrapper with the pointer to the wrapped raja

vector set to NULL. It is used by the N VNew Raja,N VMake Raja, and N VClone Raja

implementations.

N VMake Raja

Prototype N Vector N VMake Raja(N VectorContent Raja c)

Description This function creates and allocates memory for an nvector raja wrapper around a

user-provided sunrajavec::Vector class. Its only argument is of type

N VectorContent Raja, which is the pointer to the class.

N VCopyToDevice Raja

Prototype realtype *N VCopyToDevice Raja(N Vector v)

Description This function copies host vector data to the device.

N VCopyFromDevice Raja

Prototype realtype *N VCopyFromDevice Raja(N Vector v)

Description This function copies vector data from the device to the host.

N VPrint Raja

Prototype void N VPrint Raja(N Vector v)

Description This function prints the content of a raja vector to stdout.

204 Description of the NVECTOR module

N VPrintFile Raja

Prototype void N VPrintFile Raja(N Vector v, FILE *outfile)

Description This function prints the content of a raja vector to outfile.

By default all fused and vector array operations are disabled in the nvector raja module. The

following additional user-callable routines are provided to enable or disable fused and vector array

operations for a speciﬁc vector. To ensure consistency across vectors it is recommended to ﬁrst

create a vector with N VNew Raja, enable/disable the desired operations for that vector with the

functions below, and create any additional vectors from that vector using N VClone. This guarantees

the new vectors will have the same operations enabled/disabled as cloned vectors inherit the same

enable/disable options as the vector they are cloned from while vectors created with N VNew Raja will

have the default settings for the nvector raja module.

N VEnableFusedOps Raja

Prototype int N VEnableFusedOps Raja(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) all fused and vector array op-

erations in the raja vector. The return value is 0for success and -1 if the input vector

or its ops structure are NULL.

N VEnableLinearCombination Raja

Prototype int N VEnableLinearCombination Raja(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear combination fused

operation in the raja vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableScaleAddMulti Raja

Prototype int N VEnableScaleAddMulti Raja(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale and add a vector to

multiple vectors fused operation in the raja vector. The return value is 0for success

and -1 if the input vector or its ops structure are NULL.

N VEnableLinearSumVectorArray Raja

Prototype int N VEnableLinearSumVectorArray Raja(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear sum operation for

vector arrays in the raja vector. The return value is 0for success and -1 if the input

vector or its ops structure are NULL.

N VEnableScaleVectorArray Raja

Prototype int N VEnableScaleVectorArray Raja(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale operation for vector

arrays in the raja vector. The return value is 0for success and -1 if the input vector

or its ops structure are NULL.

N VEnableConstVectorArray Raja

Prototype int N VEnableConstVectorArray Raja(N Vector v, booleantype tf)

7.10 The NVECTOR OPENMPDEV implementation 205

Description This function enables (SUNTRUE) or disables (SUNFALSE) the const operation for vector

arrays in the raja vector. The return value is 0for success and -1 if the input vector

or its ops structure are NULL.

N VEnableScaleAddMultiVectorArray Raja

Prototype int N VEnableScaleAddMultiVectorArray Raja(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale and add a vector array

to multiple vector arrays operation in the raja vector. The return value is 0for success

and -1 if the input vector or its ops structure are NULL.

N VEnableLinearCombinationVectorArray Raja

Prototype int N VEnableLinearCombinationVectorArray Raja(N Vector v,

booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear combination operation

for vector arrays in the raja vector. The return value is 0for success and -1 if the

input vector or its ops structure are NULL.

Notes

•When there is a need to access components of an N Vector Raja,v, it is recommeded to use

functions N VGetDeviceArrayPointer Raja or N VGetHostArrayPointer Raja.

•To maximize eﬃciency, vector operations in the nvector raja implementation that have more

than one N Vector argument do not check for consistent internal representations of these vectors.

It is the user’s responsibility to ensure that such routines are called with N Vector arguments

that were all created with the same internal representations.

7.10 The NVECTOR OPENMPDEV implementation

In situations where a user has access to a device such as a GPU for oﬄoading computation, sundials

provides an nvector implementation using OpenMP device oﬄoading, called nvector openmpdev.

The nvector openmpdev implementation deﬁnes the content ﬁeld of the N Vector to be a

structure containing the length of the vector, a pointer to the beginning of a contiguous data array

on the host, a pointer to the beginning of a contiguous data array on the device, and a boolean ﬂag

own data which speciﬁes the ownership of host and device data arrays.

struct _N_VectorContent_OpenMPDEV {

sunindextype length;

booleantype own_data;

realtype *host_data;

realtype *dev_data;

};

The header ﬁle to include when using this module is nvector openmpdev.h. The installed module

library to link to is libsundials nvecopenmpdev.lib where .lib is typically .so for shared libraries

and .a for static libraries.

7.10.1 NVECTOR OPENMPDEV accessor macros

The following macros are provided to access the content of an nvector openmpdev vector.

206 Description of the NVECTOR module

•NV CONTENT OMPDEV

This routine gives access to the contents of the nvector openmpdev vector N Vector.

The assignment v cont =NV CONTENT OMPDEV(v) sets v cont to be a pointer to the nvec-

tor openmpdev N Vector content structure.

Implementation:

#define NV_CONTENT_OMPDEV(v) ( (N_VectorContent_OpenMPDEV)(v->content) )

•NV OWN DATA OMPDEV,NV DATA HOST OMPDEV,NV DATA DEV OMPDEV,NV LENGTH OMPDEV

These macros give individual access to the parts of the content of an nvector openmpdev

N Vector.

The assignment v data = NV DATA HOST OMPDEV(v) sets v data to be a pointer to the ﬁrst

component of the data on the host for the N Vector v. The assignment NV DATA HOST OMPDEV(v)

= v data sets the host component array of vto be v data by storing the pointer v data.

The assignment v dev data = NV DATA DEV OMPDEV(v) sets v dev data to be a pointer to the

ﬁrst component of the data on the device for the N Vector v. The assignment NV DATA DEV OMPDEV(v)

= v dev data sets the device component array of vto be v dev data by storing the pointer

v dev data.

The assignment v len = NV LENGTH OMPDEV(v) sets v len to be the length of v. On the other

hand, the call NV LENGTH OMPDEV(v) = len v sets the length of vto be len v.

Implementation:

#define NV_OWN_DATA_OMPDEV(v) ( NV_CONTENT_OMPDEV(v)->own_data )

#define NV_DATA_HOST_OMPDEV(v) ( NV_CONTENT_OMPDEV(v)->host_data )

#define NV_DATA_DEV_OMPDEV(v) ( NV_CONTENT_OMPDEV(v)->dev_data )

#define NV_LENGTH_OMPDEV(v) ( NV_CONTENT_OMPDEV(v)->length )

7.10.2 NVECTOR OPENMPDEV functions

The nvector openmpdev module deﬁnes OpenMP device oﬄoading implementations of all vector

operations listed in Tables 7.2,7.3, and 7.4, except for N VGetArrayPointer and N VSetArrayPointer.

As such, this vector cannot be used with the sundials Fortran interfaces, nor with the sundials direct

solvers and preconditioners. It also provides methods for copying from the host to the device and vice

versa.

The names of vector operations are obtained from those in Tables 7.2,7.3, and 7.4 by appending

the suﬃx OpenMPDEV (e.g. N VDestroy OpenMPDEV). The module nvector openmpdev provides the

following additional user-callable routines:

N VNew OpenMPDEV

Prototype N Vector N VNew OpenMPDEV(sunindextype vec length)

Description This function creates and allocates memory for an nvector openmpdev N Vector.

N VNewEmpty OpenMPDEV

Prototype N Vector N VNewEmpty OpenMPDEV(sunindextype vec length)

Description This function creates a new nvector openmpdev N Vector with an empty (NULL) host

and device data arrays.

7.10 The NVECTOR OPENMPDEV implementation 207

N VMake OpenMPDEV

Prototype N Vector N VMake OpenMPDEV(sunindextype vec length, realtype *h vdata,

realtype *d vdata)

Description This function creates an nvector openmpdev vector with user-supplied vector data

arrays h vdata and d vdata. This function does not allocate memory for data itself.

N VCloneVectorArray OpenMPDEV

Prototype N Vector *N VCloneVectorArray OpenMPDEV(int count, N Vector w)

Description This function creates (by cloning) an array of count nvector openmpdev vectors.

N VCloneVectorArrayEmpty OpenMPDEV

Prototype N Vector *N VCloneVectorArrayEmpty OpenMPDEV(int count, N Vector w)

Description This function creates (by cloning) an array of count nvector openmpdev vectors,

each with an empty (NULL) data array.

N VDestroyVectorArray OpenMPDEV

Prototype void N VDestroyVectorArray OpenMPDEV(N Vector *vs, int count)

Description This function frees memory allocated for the array of count variables of type N Vector

created with N VCloneVectorArray OpenMPDEV or with

N VCloneVectorArrayEmpty OpenMPDEV.

N VGetLength OpenMPDEV

Prototype sunindextype N VGetLength OpenMPDEV(N Vector v)

Description This function returns the number of vector elements.

N VGetHostArrayPointer OpenMPDEV

Prototype realtype *N VGetHostArrayPointer OpenMPDEV(N Vector v)

Description This function returns a pointer to the host data array.

N VGetDeviceArrayPointer OpenMPDEV

Prototype realtype *N VGetDeviceArrayPointer OpenMPDEV(N Vector v)

Description This function returns a pointer to the device data array.

N VPrint OpenMPDEV

Prototype void N VPrint OpenMPDEV(N Vector v)

Description This function prints the content of an nvector openmpdev vector to stdout.

N VPrintFile OpenMPDEV

Prototype void N VPrintFile OpenMPDEV(N Vector v, FILE *outfile)

Description This function prints the content of an nvector openmpdev vector to outfile.

208 Description of the NVECTOR module

N VCopyToDevice OpenMPDEV

Prototype void N VCopyToDevice OpenMPDEV(N Vector v)

Description This function copies the content of an nvector openmpdev vector’s host data array

to the device data array.

N VCopyFromDevice OpenMPDEV

Prototype void N VCopyFromDevice OpenMPDEV(N Vector v)

Description This function copies the content of an nvector openmpdev vector’s device data array

to the host data array.

By default all fused and vector array operations are disabled in the nvector openmpdev module.

The following additional user-callable routines are provided to enable or disable fused and vector

array operations for a speciﬁc vector. To ensure consistency across vectors it is recommended to ﬁrst

create a vector with N VNew OpenMPDEV, enable/disable the desired operations for that vector with the

functions below, and create any additional vectors from that vector using N VClone. This guarantees

the new vectors will have the same operations enabled/disabled as cloned vectors inherit the same

enable/disable options as the vector they are cloned from while vectors created with N VNew OpenMPDEV

will have the default settings for the nvector openmpdev module.

N VEnableFusedOps OpenMPDEV

Prototype int N VEnableFusedOps OpenMPDEV(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) all fused and vector array op-

erations in the nvector openmpdev vector. The return value is 0for success and -1

if the input vector or its ops structure are NULL.

N VEnableLinearCombination OpenMPDEV

Prototype int N VEnableLinearCombination OpenMPDEV(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear combination fused

operation in the nvector openmpdev vector. The return value is 0for success and

-1 if the input vector or its ops structure are NULL.

N VEnableScaleAddMulti OpenMPDEV

Prototype int N VEnableScaleAddMulti OpenMPDEV(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale and add a vector to

multiple vectors fused operation in the nvector openmpdev vector. The return value

is 0for success and -1 if the input vector or its ops structure are NULL.

N VEnableDotProdMulti OpenMPDEV

Prototype int N VEnableDotProdMulti OpenMPDEV(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the multiple dot products fused

operation in the nvector openmpdev vector. The return value is 0for success and

-1 if the input vector or its ops structure are NULL.

N VEnableLinearSumVectorArray OpenMPDEV

Prototype int N VEnableLinearSumVectorArray OpenMPDEV(N Vector v, booleantype tf)

7.10 The NVECTOR OPENMPDEV implementation 209

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear sum operation for

vector arrays in the nvector openmpdev vector. The return value is 0for success

and -1 if the input vector or its ops structure are NULL.

N VEnableScaleVectorArray OpenMPDEV

Prototype int N VEnableScaleVectorArray OpenMPDEV(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale operation for vector

arrays in the nvector openmpdev vector. The return value is 0for success and -1 if

the input vector or its ops structure are NULL.

N VEnableConstVectorArray OpenMPDEV

Prototype int N VEnableConstVectorArray OpenMPDEV(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the const operation for vector

arrays in the nvector openmpdev vector. The return value is 0for success and -1 if

the input vector or its ops structure are NULL.

N VEnableWrmsNormVectorArray OpenMPDEV

Prototype int N VEnableWrmsNormVectorArray OpenMPDEV(N Vector v, booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the WRMS norm operation for

vector arrays in the nvector openmpdev vector. The return value is 0for success

and -1 if the input vector or its ops structure are NULL.

N VEnableWrmsNormMaskVectorArray OpenMPDEV

Prototype int N VEnableWrmsNormMaskVectorArray OpenMPDEV(N Vector v,

booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the masked WRMS norm op-

eration for vector arrays in the nvector openmpdev vector. The return value is 0for

success and -1 if the input vector or its ops structure are NULL.

N VEnableScaleAddMultiVectorArray OpenMPDEV

Prototype int N VEnableScaleAddMultiVectorArray OpenMPDEV(N Vector v,

booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the scale and add a vector array

to multiple vector arrays operation in the nvector openmpdev vector. The return

value is 0for success and -1 if the input vector or its ops structure are NULL.

N VEnableLinearCombinationVectorArray OpenMPDEV

Prototype int N VEnableLinearCombinationVectorArray OpenMPDEV(N Vector v,

booleantype tf)

Description This function enables (SUNTRUE) or disables (SUNFALSE) the linear combination operation

for vector arrays in the nvector openmpdev vector. The return value is 0for success

and -1 if the input vector or its ops structure are NULL.

210 Description of the NVECTOR module

Notes

•When looping over the components of an N Vector v, it is most eﬃcient to ﬁrst obtain the

component array via h data = NV DATA HOST OMPDEV(v) for the host array or

d data = NV DATA DEV OMPDEV(v) for the device array and then access h data[i] or d data[i]

within the loop.

•When accessing individual components of an N Vector v on the host remember to ﬁrst copy the

array back from the device with N VCopyFromDevice OpenMPDEV(v) to ensure the array is up

to date.

•N VNewEmpty OpenMPDEV,N VMake OpenMPDEV, and N VCloneVectorArrayEmpty OpenMPDEV set

the ﬁeld own data =SUNFALSE.N VDestroy OpenMPDEV and N VDestroyVectorArray OpenMPDEV

will not attempt to free the pointer data for any N Vector with own data set to SUNFALSE. In

such a case, it is the user’s responsibility to deallocate the data pointer.

•To maximize eﬃciency, vector operations in the nvector openmpdev implementation that

have more than one N Vector argument do not check for consistent internal representation of

these vectors. It is the user’s responsibility to ensure that such routines are called with N Vector

arguments that were all created with the same internal representations.

7.11 NVECTOR Examples

There are NVector examples that may be installed for the implementations provided with sundials.

Each implementation makes use of the functions in test nvector.c. These example functions show

simple usage of the NVector family of functions. The input to the examples are the vector length,

number of threads (if threaded implementation), and a print timing ﬂag.

The following is a list of the example functions in test nvector.c:

•Test N VClone: Creates clone of vector and checks validity of clone.

•Test N VCloneEmpty: Creates clone of empty vector and checks validity of clone.

•Test N VCloneVectorArray: Creates clone of vector array and checks validity of cloned array.

•Test N VCloneVectorArray: Creates clone of empty vector array and checks validity of cloned

array.

•Test N VGetArrayPointer: Get array pointer.

•Test N VSetArrayPointer: Allocate new vector, set pointer to new vector array, and check

values.

•Test N VLinearSum Case 1a: Test y = x + y

•Test N VLinearSum Case 1b: Test y = -x + y

•Test N VLinearSum Case 1c: Test y = ax + y

•Test N VLinearSum Case 2a: Test x = x + y

•Test N VLinearSum Case 2b: Test x = x - y

•Test N VLinearSum Case 2c: Test x = x + by

•Test N VLinearSum Case 3: Test z = x + y

•Test N VLinearSum Case 4a: Test z = x - y

•Test N VLinearSum Case 4b: Test z = -x + y

7.11 NVECTOR Examples 211

•Test N VLinearSum Case 5a: Test z = x + by

•Test N VLinearSum Case 5b: Test z = ax + y

•Test N VLinearSum Case 6a: Test z = -x + by

•Test N VLinearSum Case 6b: Test z = ax - y

•Test N VLinearSum Case 7: Test z = a(x + y)

•Test N VLinearSum Case 8: Test z = a(x - y)

•Test N VLinearSum Case 9: Test z = ax + by

•Test N VConst: Fill vector with constant and check result.

•Test N VProd: Test vector multiply: z = x * y

•Test N VDiv: Test vector division: z = x / y

•Test N VScale: Case 1: scale: x = cx

•Test N VScale: Case 2: copy: z = x

•Test N VScale: Case 3: negate: z = -x

•Test N VScale: Case 4: combination: z = cx

•Test N VAbs: Create absolute value of vector.

•Test N VAddConst: add constant vector: z = c + x

•Test N VDotProd: Calculate dot product of two vectors.

•Test N VMaxNorm: Create vector with known values, ﬁnd and validate the max norm.

•Test N VWrmsNorm: Create vector of known values, ﬁnd and validate the weighted root mean

square.

•Test N VWrmsNormMask: Create vector of known values, ﬁnd and validate the weighted root

mean square using all elements except one.

•Test N VMin: Create vector, ﬁnd and validate the min.

•Test N VWL2Norm: Create vector, ﬁnd and validate the weighted Euclidean L2 norm.

•Test N VL1Norm: Create vector, ﬁnd and validate the L1 norm.

•Test N VCompare: Compare vector with constant returning and validating comparison vector.

•Test N VInvTest: Test z[i] = 1 / x[i]

•Test N VConstrMask: Test mask of vector x with vector c.

•Test N VMinQuotient: Fill two vectors with known values. Calculate and validate minimum

quotient.

•Test N VLinearCombination Case 1a: Test x = a x

•Test N VLinearCombination Case 1b: Test z = a x

•Test N VLinearCombination Case 2a: Test x = a x + b y

•Test N VLinearCombination Case 2b: Test z = a x + b y

212 Description of the NVECTOR module

•Test N VLinearCombination Case 3a: Test x = x + a y + b z

•Test N VLinearCombination Case 3b: Test x = a x + b y + c z

•Test N VLinearCombination Case 3c: Test w = a x + b y + c z

•Test N VScaleAddMulti Case 1a: y = a x + y

•Test N VScaleAddMulti Case 1b: z = a x + y

•Test N VScaleAddMulti Case 2a: Y[i] = c[i] x + Y[i], i = 1,2,3

•Test N VScaleAddMulti Case 2b: Z[i] = c[i] x + Y[i], i = 1,2,3

•Test N VDotProdMulti Case 1: Calculate the dot product of two vectors

•Test N VDotProdMulti Case 2: Calculate the dot product of one vector with three other vectors

in a vector array.

•Test N VLinearSumVectorArray Case 1: z = a x + b y

•Test N VLinearSumVectorArray Case 2a: Z[i] = a X[i] + b Y[i]

•Test N VLinearSumVectorArray Case 2b: X[i] = a X[i] + b Y[i]

•Test N VLinearSumVectorArray Case 2c: Y[i] = a X[i] + b Y[i]

•Test N VScaleVectorArray Case 1a: y = c y

•Test N VScaleVectorArray Case 1b: z = c y

•Test N VScaleVectorArray Case 2a: Y[i] = c[i] Y[i]

•Test N VScaleVectorArray Case 2b: Z[i] = c[i] Y[i]

•Test N VScaleVectorArray Case 1a: z = c

•Test N VScaleVectorArray Case 1b: Z[i] = c

•Test N VWrmsNormVectorArray Case 1a: Create a vector of know values, ﬁnd and validate the

weighted root mean square norm.

•Test N VWrmsNormVectorArray Case 1b: Create a vector array of three vectors of know values,

ﬁnd and validate the weighted root mean square norm of each.

•Test N VWrmsNormMaskVectorArray Case 1a: Create a vector of know values, ﬁnd and validate

the weighted root mean square norm using all elements except one.

•Test N VWrmsNormMaskVectorArray Case 1b: Create a vector array of three vectors of know

values, ﬁnd and validate the weighted root mean square norm of each using all elements except

one.

•Test N VScaleAddMultiVectorArray Case 1a: y = a x + y

•Test N VScaleAddMultiVectorArray Case 1b: z = a x + y

•Test N VScaleAddMultiVectorArray Case 2a: Y[j][0] = a[j] X[0] + Y[j][0]

•Test N VScaleAddMultiVectorArray Case 2b: Z[j][0] = a[j] X[0] + Y[j][0]

•Test N VScaleAddMultiVectorArray Case 3a: Y[0][i] = a[0] X[i] + Y[0][i]

•Test N VScaleAddMultiVectorArray Case 3b: Z[0][i] = a[0] X[i] + Y[0][i]

7.11 NVECTOR Examples 213

•Test N VScaleAddMultiVectorArray Case 4a: Y[j][i] = a[j] X[i] + Y[j][i]

•Test N VScaleAddMultiVectorArray Case 4b: Z[j][i] = a[j] X[i] + Y[j][i]

•Test N VLinearCombinationVectorArray Case 1a: x = a x

•Test N VLinearCombinationVectorArray Case 1b: z = a x

•Test N VLinearCombinationVectorArray Case 2a: x = a x + b y

•Test N VLinearCombinationVectorArray Case 2b: z = a x + b y

•Test N VLinearCombinationVectorArray Case 3a: x = a x + b y + c z

•Test N VLinearCombinationVectorArray Case 3b: w = a x + b y + c z

•Test N VLinearCombinationVectorArray Case 4a: X[0][i] = c[0] X[0][i]

•Test N VLinearCombinationVectorArray Case 4b: Z[i] = c[0] X[0][i]

•Test N VLinearCombinationVectorArray Case 5a: X[0][i] = c[0] X[0][i] + c[1] X[1][i]

•Test N VLinearCombinationVectorArray Case 5b: Z[i] = c[0] X[0][i] + c[1] X[1][i]

•Test N VLinearCombinationVectorArray Case 6a: X[0][i] = X[0][i] + c[1] X[1][i] + c[2] X[2][i]

•Test N VLinearCombinationVectorArray Case 6b: X[0][i] = c[0] X[0][i] + c[1] X[1][i] + c[2]

X[2][i]

•Test N VLinearCombinationVectorArray Case 6c: Z[i] = c[0] X[0][i] + c[1] X[1][i] + c[2] X[2][i]

214 Description of the NVECTOR module

Table 7.5: List of vector functions usage by idas code modules

idas

idals

idabbdpre

idaa

N VGetVectorID

N VClone X X X X

N VCloneEmpty 1

N VDestroy X X X X

N VCloneVectorArray X X

N VDestroyVectorArray X X

N VSpace X2

N VGetArrayPointer 1X

N VSetArrayPointer 1

N VLinearSum X X X

N VConst X X X

N VProd X

N VDiv X

N VScale X X X X

N VAbs X

N VInv X

N VAddConst X

N VDotProd X

N VMaxNorm X

N VWrmsNorm X X

N VMin X

N VMinQuotient X

N VConstrMask X

N VWrmsNormMask X

N VCompare X

N VLinearCombination X

N VScaleAddMulti X

N VDotProdMulti 3

N VLinearSumVectorArray X

N VScaleVectorArray X

N VConstVectorArray X

N VWrmsNormVectorArray X

N VWrmsNormMaskVectorArray X

N VScaleAddMultiVectorArray X

N VLinearCombinationVectorArray X

Chapter 8

Description of the SUNMatrix

module

For problems that involve direct methods for solving linear systems, the sundials solvers not only op-

erate on generic vectors, but also on generic matrices (of type SUNMatrix), through a set of operations

deﬁned by the particular sunmatrix implementation. Users can provide their own speciﬁc imple-

mentation of the sunmatrix module, particularly in cases where they provide their own nvector

and/or linear solver modules, and require matrices that are compatible with those implementations.

Alternately, we provide three sunmatrix implementations: dense, banded, and sparse. The generic

operations are described below, and descriptions of the implementations provided with sundials

follow.

The generic SUNMatrix type has been modeled after the object-oriented style of the generic

N Vector type. Speciﬁcally, a generic SUNMatrix is a pointer to a structure that has an implementation-

dependent content ﬁeld containing the description and actual data of the matrix, and an ops ﬁeld

pointing to a structure with generic matrix operations. The type SUNMatrix is deﬁned as

typedef struct _generic_SUNMatrix *SUNMatrix;

struct _generic_SUNMatrix {

void *content;

struct _generic_SUNMatrix_Ops *ops;

};

The generic SUNMatrix Ops structure is essentially a list of pointers to the various actual matrix

operations, and is deﬁned as

struct _generic_SUNMatrix_Ops {

SUNMatrix_ID (*getid)(SUNMatrix);

SUNMatrix (*clone)(SUNMatrix);

void (*destroy)(SUNMatrix);

int (*zero)(SUNMatrix);

int (*copy)(SUNMatrix, SUNMatrix);

int (*scaleadd)(realtype, SUNMatrix, SUNMatrix);

int (*scaleaddi)(realtype, SUNMatrix);

int (*matvec)(SUNMatrix, N_Vector, N_Vector);

int (*space)(SUNMatrix, long int*, long int*);

};

The generic sunmatrix module deﬁnes and implements the matrix operations acting on SUNMatrix

objects. These routines are nothing but wrappers for the matrix operations deﬁned by a particular

sunmatrix implementation, which are accessed through the ops ﬁeld of the SUNMatrix structure. To

216 Description of the SUNMatrix module

Table 8.1: Identiﬁers associated with matrix kernels supplied with sundials.

Matrix ID Matrix type ID Value

SUNMATRIX DENSE Dense M×Nmatrix 0

SUNMATRIX BAND Band M×Mmatrix 1

SUNMATRIX SPARSE Sparse (CSR or CSC) M×Nmatrix 2

SUNMATRIX CUSTOM User-provided custom matrix 3

illustrate this point we show below the implementation of a typical matrix operation from the generic

sunmatrix module, namely SUNMatZero, which sets all values of a matrix Ato zero, returning a ﬂag

denoting a successful/failed operation:

int SUNMatZero(SUNMatrix A)

{

return((int) A->ops->zero(A));

}

Table 8.2 contains a complete list of all matrix operations deﬁned by the generic sunmatrix module.

A particular implementation of the sunmatrix module must:

•Specify the content ﬁeld of the SUNMatrix object.

•Deﬁne and implement a minimal subset of the matrix operations. See the documentation for

each sundials solver to determine which sunmatrix operations they require.

Note that the names of these routines should be unique to that implementation in order to

permit using more than one sunmatrix module (each with diﬀerent SUNMatrix internal data

representations) in the same code.

•Deﬁne and implement user-callable constructor and destructor routines to create and free a

SUNMatrix with the new content ﬁeld and with ops pointing to the new matrix operations.

•Optionally, deﬁne and implement additional user-callable routines acting on the newly deﬁned

SUNMatrix (e.g., a routine to print the content for debugging purposes).

•Optionally, provide accessor macros or functions as needed for that particular implementation

to access diﬀerent parts of the content ﬁeld of the newly deﬁned SUNMatrix.

Each sunmatrix implementation included in sundials has a unique identiﬁer speciﬁed in enu-

meration and shown in Table 8.1. It is recommended that a user-supplied sunmatrix implementation

use the SUNMATRIX CUSTOM identiﬁer.

Table 8.2: Description of the SUNMatrix operations

Name Usage and Description

SUNMatGetID id = SUNMatGetID(A);

Returns the type identiﬁer for the matrix A. It is used to determine the ma-

trix implementation type (e.g. dense, banded, sparse,. . . ) from the abstract

SUNMatrix interface. This is used to assess compatibility with sundials-

provided linear solver implementations. Returned values are given in the

Table 8.1.

continued on next page

217

Name Usage and Description

SUNMatClone B = SUNMatClone(A);

Creates a new SUNMatrix of the same type as an existing matrix Aand sets

the ops ﬁeld. It does not copy the matrix, but rather allocates storage for

the new matrix.

SUNMatDestroy SUNMatDestroy(A);

Destroys the SUNMatrix A and frees memory allocated for its internal data.

SUNMatSpace ier = SUNMatSpace(A, &lrw, &liw);

Returns the storage requirements for the matrix A.lrw is a long int con-

taining the number of realtype words and liw is a long int containing

the number of integer words. The return value is an integer ﬂag denoting

success/failure of the operation.

This function is advisory only, for use in determining a user’s total space

requirements; it could be a dummy function in a user-supplied sunmatrix

module if that information is not of interest.

SUNMatZero ier = SUNMatZero(A);

Performs the operation Aij = 0 for all entries of the matrix A. The return

value is an integer ﬂag denoting success/failure of the operation.

SUNMatCopy ier = SUNMatCopy(A,B);

Performs the operation Bij =Ai,j for all entries of the matrices Aand B.

The return value is an integer ﬂag denoting success/failure of the operation.

SUNMatScaleAdd ier = SUNMatScaleAdd(c, A, B);

Performs the operation A=cA +B. The return value is an integer ﬂag

denoting success/failure of the operation.

SUNMatScaleAddI ier = SUNMatScaleAddI(c, A);

Performs the operation A=cA +I. The return value is an integer ﬂag

denoting success/failure of the operation.

SUNMatMatvec ier = SUNMatMatvec(A, x, y);

Performs the matrix-vector product operation, y=Ax. It should only be

called with vectors xand ythat are compatible with the matrix A– both in

storage type and dimensions. The return value is an integer ﬂag denoting

success/failure of the operation.

We note that not all sunmatrix types are compatible with all nvector types provided with

sundials. This is primarily due to the need for compatibility within the SUNMatMatvec routine;

however, compatibility between sunmatrix and nvector implementations is more crucial when

considering their interaction within sunlinsol objects, as will be described in more detail in Chapter

9. More speciﬁcally, in Table 8.3 we show the matrix interfaces available as sunmatrix modules, and

the compatible vector implementations.

Table 8.3: sundials matrix interfaces and vector implementations that can be used for each.

Matrix

Interface

Serial Parallel

(MPI)

OpenMP pThreads hypre

Vec.

petsc

Vec.

cuda raja User

Suppl.

Dense X X X X

continued on next page

218 Description of the SUNMatrix module

Matrix

Interface

Serial Parallel

(MPI)

OpenMP pThreads hypre

Vec.

petsc

Vec.

cuda raja User

Suppl.

Band X X X X

Sparse X X X X

User supplied X X X X X X X X X

8.1 SUNMatrix functions used by IDAS

In Table 8.4, we list the matrix functions in the sunmatrix module used within the idas package.

The table also shows, for each function, which of the code modules uses the function. The main idas

integrator does not call any sunmatrix functions directly, so the table columns are speciﬁc to the

idals interface and the idabbdpre preconditioner module. We further note that the idals interface

only utilizes these routines when supplied with a matrix-based linear solver, i.e., the sunmatrix object

passed to IDASetLinearSolver was not NULL.

At this point, we should emphasize that the idas user does not need to know anything about the

usage of matrix functions by the idas code modules in order to use idas. The information is presented

as an implementation detail for the interested reader.

Table 8.4: List of matrix functions usage by idas code modules

idals

idabbdpre

SUNMatGetID X

SUNMatDestroy X

SUNMatZero X X

SUNMatSpace †

The matrix functions listed in Table 8.2 with a †symbol are optionally used, in that these are only

called if they are implemented in the sunmatrix module that is being used (i.e. their function pointers

are non-NULL). The matrix functions listed in Table 8.2 that are not used by idas are: SUNMatCopy,

SUNMatClone,SUNMatScaleAdd,SUNMatScaleAddI and SUNMatMatvec. Therefore a user-supplied

sunmatrix module for idas could omit these functions.

8.2 The SUNMatrix Dense implementation

The dense implementation of the sunmatrix module provided with sundials,sunmatrix dense,

deﬁnes the content ﬁeld of SUNMatrix to be the following structure:

struct _SUNMatrixContent_Dense {

sunindextype M;

sunindextype N;

realtype *data;

sunindextype ldata;

realtype **cols;

};

These entries of the content ﬁeld contain the following information:

M- number of rows

N- number of columns

8.2 The SUNMatrix Dense implementation 219

data - pointer to a contiguous block of realtype variables. The elements of the dense matrix are

stored columnwise, i.e. the (i,j)-th element of a dense sunmatrix A(with 0 ≤i<Mand 0 ≤

j<N) may be accessed via data[j*M+i].

ldata - length of the data array (= M·N).

cols - array of pointers. cols[j] points to the ﬁrst element of the j-th column of the matrix in the

array data. The (i,j)-th element of a dense sunmatrix A(with 0 ≤i<Mand 0 ≤j<N)

may be accessed via cols[j][i].

The header ﬁle to include when using this module is sunmatrix/sunmatrix dense.h. The sunma-

trix dense module is accessible from all sundials solvers without linking to the

libsundials sunmatrixdense module library.

8.2.1 SUNMatrix Dense accessor macros

The following macros are provided to access the content of a sunmatrix dense matrix. The preﬁx

SM in the names denotes that these macros are for SUNMatrix implementations, and the suﬃx D

denotes that these are speciﬁc to the dense version.

•SM CONTENT D

This macro gives access to the contents of the dense SUNMatrix.

The assignment A cont =SM CONTENT D(A) sets A cont to be a pointer to the dense SUNMatrix

content structure.

Implementation:

#define SM_CONTENT_D(A) ( (SUNMatrixContent_Dense)(A->content) )

•SM ROWS D,SM COLUMNS D, and SM LDATA D

These macros give individual access to various lengths relevant to the content of a dense

SUNMatrix.

These may be used either to retrieve or to set these values. For example, the assignment A rows

= SM ROWS D(A) sets A rows to be the number of rows in the matrix A. Similarly, the assignment

SM COLUMNS D(A) = A cols sets the number of columns in Ato equal A cols.

Implementation:

#define SM_ROWS_D(A) ( SM_CONTENT_D(A)->M )

#define SM_COLUMNS_D(A) ( SM_CONTENT_D(A)->N )

#define SM_LDATA_D(A) ( SM_CONTENT_D(A)->ldata )

•SM DATA D and SM COLS D

These macros give access to the data and cols pointers for the matrix entries.

The assignment A data = SM DATA D(A) sets A data to be a pointer to the ﬁrst component of

the data array for the dense SUNMatrix A. The assignment SM DATA D(A) = A data sets the data

array of Ato be A data by storing the pointer A data.

Similarly, the assignment A cols = SM COLS D(A) sets A cols to be a pointer to the array of

column pointers for the dense SUNMatrix A. The assignment SM COLS D(A) = A cols sets the

column pointer array of Ato be Acols by storing the pointer A cols.

Implementation:

#define SM_DATA_D(A) ( SM_CONTENT_D(A)->data )

#define SM_COLS_D(A) ( SM_CONTENT_D(A)->cols )

220 Description of the SUNMatrix module

•SM COLUMN D and SM ELEMENT D

These macros give access to the individual columns and entries of the data array of a dense

SUNMatrix.

The assignment col j = SM COLUMN D(A,j) sets col j to be a pointer to the ﬁrst entry of

the j-th column of the M×Ndense matrix A(with 0 ≤j<N). The type of the expression

SM COLUMN D(A,j) is realtype *. The pointer returned by the call SM COLUMN D(A,j) can be

treated as an array which is indexed from 0 to M−1.

The assignments SM ELEMENT D(A,i,j) = a ij and a ij = SM ELEMENT D(A,i,j) reference the

(i,j)-th element of the M×Ndense matrix A(with 0 ≤i<Mand 0 ≤j<N).

Implementation:

#define SM_COLUMN_D(A,j) ( (SM_CONTENT_D(A)->cols)[j] )

#define SM_ELEMENT_D(A,i,j) ( (SM_CONTENT_D(A)->cols)[j][i] )

8.2.2 SUNMatrix Dense functions

The sunmatrix dense module deﬁnes dense implementations of all matrix operations listed in Ta-

ble 8.2. Their names are obtained from those in Table 8.2 by appending the suﬃx Dense (e.g.

SUNMatCopy Dense). All the standard matrix operations listed in 8.2 with the suﬃx Dense appended

are callable via the Fortran 2003 interface by prepending an ‘F’ (e.g. FSUNMatCopy Dense).

The module sunmatrix dense provides the following additional user-callable routines:

SUNDenseMatrix

Prototype SUNMatrix SUNDenseMatrix(sunindextype M, sunindextype N)

Description This constructor function creates and allocates memory for a dense SUNMatrix. Its

arguments are the number of rows, M, and columns, N, for the dense matrix.

F2003 Name This function is callable as FSUNDenseMatrix when using the Fortran 2003 interface

module.

SUNDenseMatrix Print

Prototype void SUNDenseMatrix Print(SUNMatrix A, FILE* outfile)

Description This function prints the content of a dense SUNMatrix to the output stream speciﬁed

by outfile. Note: stdout or stderr may be used as arguments for outfile to print

directly to standard output or standard error, respectively.

SUNDenseMatrix Rows

Prototype sunindextype SUNDenseMatrix Rows(SUNMatrix A)

Description This function returns the number of rows in the dense SUNMatrix.

F2003 Name This function is callable as FSUNDenseMatrix Rows when using the Fortran 2003 inter-

face module.

SUNDenseMatrix Columns

Prototype sunindextype SUNDenseMatrix Columns(SUNMatrix A)

Description This function returns the number of columns in the dense SUNMatrix.

F2003 Name This function is callable as FSUNDenseMatrix Columns when using the Fortran 2003

interface module.

8.2 The SUNMatrix Dense implementation 221

SUNDenseMatrix LData

Prototype sunindextype SUNDenseMatrix LData(SUNMatrix A)

Description This function returns the length of the data array for the dense SUNMatrix.

F2003 Name This function is callable as FSUNDenseMatrix LData when using the Fortran 2003 inter-

face module.

SUNDenseMatrix Data

Prototype realtype* SUNDenseMatrix Data(SUNMatrix A)

Description This function returns a pointer to the data array for the dense SUNMatrix.

F2003 Name This function is callable as FSUNDenseMatrix Data when using the Fortran 2003 inter-

face module.

SUNDenseMatrix Cols

Prototype realtype** SUNDenseMatrix Cols(SUNMatrix A)

Description This function returns a pointer to the cols array for the dense SUNMatrix.

SUNDenseMatrix Column

Prototype realtype* SUNDenseMatrix Column(SUNMatrix A, sunindextype j)

Description This function returns a pointer to the ﬁrst entry of the jth column of the dense SUNMatrix.

The resulting pointer should be indexed over the range 0 to M−1.

F2003 Name This function is callable as FSUNDenseMatrix Column when using the Fortran 2003 in-

terface module.

Notes

•When looping over the components of a dense SUNMatrix A, the most eﬃcient approaches are

to:

–First obtain the component array via A data = SM DATA D(A) or

A data = SUNDenseMatrix Data(A) and then access A data[i] within the loop.

–First obtain the array of column pointers via A cols = SM COLS D(A) or

A cols = SUNDenseMatrix Cols(A), and then access A cols[j][i] within the loop.

–Within a loop over the columns, access the column pointer via

A colj = SUNDenseMatrix Column(A,j) and then to access the entries within that column

using A colj[i] within the loop.

All three of these are more eﬃcient than using SM ELEMENT D(A,i,j) within a double loop.

•Within the SUNMatMatvec Dense routine, internal consistency checks are performed to ensure

that the matrix is called with consistent nvector implementations. These are currently limited

to: nvector serial,nvector openmp, and nvector pthreads. As additional compatible

vector implementations are added to sundials, these will be included within this compatibility

check.

8.2.3 SUNMatrix Dense Fortran interfaces

The sunmatrix dense module provides a Fortran 2003 module as well as Fortran 77 style inter-

face functions for use from Fortran applications.

222 Description of the SUNMatrix module

FORTRAN 2003 interface module

The fsunmatrix dense mod Fortran module deﬁnes interfaces to most sunmatrix dense C func-

tions using the intrinsic iso c binding module which provides a standardized mechanism for interop-

erating with C. As noted in the Cfunction descriptions above, the interface functions are named after

the corresponding Cfunction, but with a leading ‘F’. For example, the function SUNDenseMatrix is

interfaced as FSUNDenseMatrix.

The Fortran 2003 sunmatrix dense interface module can be accessed with the use statement,

i.e. use fsunmatrix dense mod, and linking to the library libsundials fsunmatrixdense mod.lib in

addition to the Clibrary. For details on where the library and module ﬁle fsunmatrix dense mod.mod

are installed see Appendix A. We note that the module is accessible from the Fortran 2003 sundials

integrators without separately linking to the libsundials fsunmatrixdense mod library.

FORTRAN 77 interface functions

For solvers that include a Fortran interface module, the sunmatrix dense module also includes the

Fortran-callable function FSUNDenseMatInit(code, M, N, ier) to initialize this sunmatrix dense

module for a given sundials solver. Here code is an integer input solver id (1 for cvode, 2 for ida,

3 for kinsol, 4 for arkode); Mand Nare the corresponding dense matrix construction arguments

(declared to match C type long int); and ier is an error return ﬂag equal to 0 for success and -1

for failure. Both code and ier are declared to match C type int. Additionally, when using arkode

with a non-identity mass matrix, the Fortran-callable function FSUNDenseMassMatInit(M, N, ier)

initializes this sunmatrix dense module for storing the mass matrix.

8.3 The SUNMatrix Band implementation

The banded implementation of the sunmatrix module provided with sundials,sunmatrix band,

deﬁnes the content ﬁeld of SUNMatrix to be the following structure:

struct _SUNMatrixContent_Band {

sunindextype M;

sunindextype N;

sunindextype mu;

sunindextype ml;

sunindextype s_mu;

sunindextype ldim;

realtype *data;

sunindextype ldata;

realtype **cols;

};

A diagram of the underlying data representation in a banded matrix is shown in Figure 8.1. A more

complete description of the parts of this content ﬁeld is given below:

M- number of rows

N- number of columns (N=M)

mu - upper half-bandwidth, 0 ≤mu <N

ml - lower half-bandwidth, 0 ≤ml <N

s mu - storage upper bandwidth, mu ≤s mu <N. The LU decomposition routines in the associated

sunlinsol band and sunlinsol lapackband modules write the LU factors into the storage

for A. The upper triangular factor U, however, may have an upper bandwidth as big as min(N-

1,mu+ml) because of partial pivoting. The s mu ﬁeld holds the upper half-bandwidth allocated

for A.

ldim - leading dimension (ldim ≥s mu+ml+1)

8.3 The SUNMatrix Band implementation 223

size data

mu ml smu

data[0]

data[1]

data[j]

data[j+1]

data[N−1]

data[j][smu−mu]

data[j][smu]

data[j][smu+ml]

mu+ml+1

smu−mu

A(j−mu−1,j)

A(j−mu,j)

A(j,j)

A(j+ml,j)

Figure 8.1: Diagram of the storage for the sunmatrix band module. Here Ais an N×Nband

matrix with upper and lower half-bandwidths mu and ml, respectively. The rows and columns of Aare

numbered from 0 to N−1 and the (i, j)-th element of Ais denoted A(i,j). The greyed out areas of

the underlying component storage are used by the associated sunlinsol band linear solver.

data - pointer to a contiguous block of realtype variables. The elements of the banded matrix are

stored columnwise (i.e. columns are stored one on top of the other in memory). Only elements

within the speciﬁed half-bandwidths are stored. data is a pointer to ldata contiguous locations

which hold the elements within the band of A.

ldata - length of the data array (= ldim·N)

cols - array of pointers. cols[j] is a pointer to the uppermost element within the band in the

j-th column. This pointer may be treated as an array indexed from s mu−mu (to access the

uppermost element within the band in the j-th column) to smu+ml (to access the lowest

element within the band in the j-th column). Indices from 0 to s mu−mu−1 give access to extra

storage elements required by the LU decomposition function. Finally, cols[j][i-j+s mu] is

the (i, j)-th element with j−mu ≤i≤j+ml.

The header ﬁle to include when using this module is sunmatrix/sunmatrix band.h. The sunma-

trix band module is accessible from all sundials solvers without linking to the

libsundials sunmatrixband module library.

224 Description of the SUNMatrix module

8.3.1 SUNMatrix Band accessor macros

The following macros are provided to access the content of a sunmatrix band matrix. The preﬁx

SM in the names denotes that these macros are for SUNMatrix implementations, and the suﬃx B

denotes that these are speciﬁc to the banded version.

•SM CONTENT B

This routine gives access to the contents of the banded SUNMatrix.

The assignment A cont =SM CONTENT B(A) sets A cont to be a pointer to the banded SUNMatrix

content structure.

Implementation:

#define SM_CONTENT_B(A) ( (SUNMatrixContent_Band)(A->content) )

•SM ROWS B,SM COLUMNS B,SM UBAND B,SM LBAND B,SM SUBAND B,SM LDIM B, and SM LDATA B

These macros give individual access to various lengths relevant to the content of a banded

SUNMatrix.

These may be used either to retrieve or to set these values. For example, the assignment A rows

= SM ROWS B(A) sets A rows to be the number of rows in the matrix A. Similarly, the assignment

SM COLUMNS B(A) = A cols sets the number of columns in Ato equal A cols.

Implementation:

#define SM_ROWS_B(A) ( SM_CONTENT_B(A)->M )

#define SM_COLUMNS_B(A) ( SM_CONTENT_B(A)->N )

#define SM_UBAND_B(A) ( SM_CONTENT_B(A)->mu )

#define SM_LBAND_B(A) ( SM_CONTENT_B(A)->ml )

#define SM_SUBAND_B(A) ( SM_CONTENT_B(A)->s_mu )

#define SM_LDIM_B(A) ( SM_CONTENT_B(A)->ldim )

#define SM_LDATA_B(A) ( SM_CONTENT_B(A)->ldata )

•SM DATA B and SM COLS B

These macros give access to the data and cols pointers for the matrix entries.

The assignment A data = SM DATA B(A) sets A data to be a pointer to the ﬁrst component of

the data array for the banded SUNMatrix A. The assignment SM DATA B(A) = A data sets the

data array of Ato be A data by storing the pointer A data.

Similarly, the assignment A cols = SM COLS B(A) sets A cols to be a pointer to the array of

column pointers for the banded SUNMatrix A. The assignment SM COLS B(A) = A cols sets the

column pointer array of Ato be A cols by storing the pointer A cols.

Implementation:

#define SM_DATA_B(A) ( SM_CONTENT_B(A)->data )

#define SM_COLS_B(A) ( SM_CONTENT_B(A)->cols )

•SM COLUMN B,SM COLUMN ELEMENT B, and SM ELEMENT B

These macros give access to the individual columns and entries of the data array of a banded

SUNMatrix.

The assignments SM ELEMENT B(A,i,j) = a ij and a ij = SM ELEMENT B(A,i,j) reference the

(i,j)-th element of the N×Nband matrix A, where 0 ≤i,j≤N−1. The location (i,j) should

further satisfy j−mu ≤i≤j+ml.

The assignment col j = SM COLUMN B(A,j) sets col j to be a pointer to the diagonal element

of the j-th column of the N×Nband matrix A, 0 ≤j≤N−1. The type of the expression

8.3 The SUNMatrix Band implementation 225

SM COLUMN B(A,j) is realtype *. The pointer returned by the call SM COLUMN B(A,j) can be

treated as an array which is indexed from −mu to ml.

The assignments SM COLUMN ELEMENT B(col j,i,j) = a ij and

a ij = SM COLUMN ELEMENT B(col j,i,j) reference the (i,j)-th entry of the band matrix A

when used in conjunction with SM COLUMN B to reference the j-th column through col j. The

index (i,j) should satisfy j−mu ≤i≤j+ml.

Implementation:

#define SM_COLUMN_B(A,j) ( ((SM_CONTENT_B(A)->cols)[j])+SM_SUBAND_B(A) )

#define SM_COLUMN_ELEMENT_B(col_j,i,j) (col_j[(i)-(j)])

#define SM_ELEMENT_B(A,i,j)

( (SM_CONTENT_B(A)->cols)[j][(i)-(j)+SM_SUBAND_B(A)] )

8.3.2 SUNMatrix Band functions

The sunmatrix band module deﬁnes banded implementations of all matrix operations listed in

Table 8.2. Their names are obtained from those in Table 8.2 by appending the suﬃx Band (e.g.

SUNMatCopy Band). All the standard matrix operations listed in 8.2 with the suﬃx Band appended

are callable via the Fortran 2003 interface by prepending an ‘F’ (e.g. FSUNMatCopy Band).

The module sunmatrix band provides the following additional user-callable routines:

SUNBandMatrix

Prototype SUNMatrix SUNBandMatrix(sunindextype N, sunindextype mu, sunindextype ml)

Description This constructor function creates and allocates memory for a banded SUNMatrix. Its

arguments are the matrix size, N, and the upper and lower half-bandwidths of the matrix,

mu and ml. The stored upper bandwidth is set to mu+ml to accommodate subsequent

factorization in the sunlinsol band and sunlinsol lapackband modules.

F2003 Name This function is callable as FSUNBandMatrix when using the Fortran 2003 interface

module.

SUNBandMatrixStorage

Prototype SUNMatrix SUNBandMatrixStorage(sunindextype N, sunindextype mu,

sunindextype ml, sunindextype smu)

Description This constructor function creates and allocates memory for a banded SUNMatrix. Its

arguments are the matrix size, N, the upper and lower half-bandwidths of the matrix,

mu and ml, and the stored upper bandwidth, smu. When creating a band SUNMatrix,

this value should be

•at least min(N-1,mu+ml) if the matrix will be used by the sunlinsol band module;

•exactly equal to mu+ml if the matrix will be used by the sunlinsol lapackband

module;

•at least mu if used in some other manner.

Note: it is strongly recommended that users call the default constructor, SUNBandMatrix,

in all standard use cases. This advanced constructor is used internally within sundials

solvers, and is provided to users who require banded matrices for non-default purposes.

226 Description of the SUNMatrix module

SUNBandMatrix Print

Prototype void SUNBandMatrix Print(SUNMatrix A, FILE* outfile)

Description This function prints the content of a banded SUNMatrix to the output stream speciﬁed

by outfile. Note: stdout or stderr may be used as arguments for outfile to print

directly to standard output or standard error, respectively.

SUNBandMatrix Rows

Prototype sunindextype SUNBandMatrix Rows(SUNMatrix A)

Description This function returns the number of rows in the banded SUNMatrix.

F2003 Name This function is callable as FSUNBandMatrix Rows when using the Fortran 2003 interface

module.

SUNBandMatrix Columns

Prototype sunindextype SUNBandMatrix Columns(SUNMatrix A)

Description This function returns the number of columns in the banded SUNMatrix.

F2003 Name This function is callable as FSUNBandMatrix Columns when using the Fortran 2003 in-

terface module.

SUNBandMatrix LowerBandwidth

Prototype sunindextype SUNBandMatrix LowerBandwidth(SUNMatrix A)

Description This function returns the lower half-bandwidth of the banded SUNMatrix.

F2003 Name This function is callable as FSUNBandMatrix LowerBandwidth when using the Fortran

2003 interface module.

SUNBandMatrix UpperBandwidth

Prototype sunindextype SUNBandMatrix UpperBandwidth(SUNMatrix A)

Description This function returns the upper half-bandwidth of the banded SUNMatrix.

F2003 Name This function is callable as FSUNBandMatrix UpperBandwidth when using the Fortran

2003 interface module.

SUNBandMatrix StoredUpperBandwidth

Prototype sunindextype SUNBandMatrix StoredUpperBandwidth(SUNMatrix A)

Description This function returns the stored upper half-bandwidth of the banded SUNMatrix.

F2003 Name This function is callable as FSUNBandMatrix StoredUpperBandwidth when using the

Fortran 2003 interface module.

SUNBandMatrix LDim

Prototype sunindextype SUNBandMatrix LDim(SUNMatrix A)

Description This function returns the length of the leading dimension of the banded SUNMatrix.

F2003 Name This function is callable as FSUNBandMatrix LDim when using the Fortran 2003 interface

module.

8.3 The SUNMatrix Band implementation 227

SUNBandMatrix Data

Prototype realtype* SUNBandMatrix Data(SUNMatrix A)

Description This function returns a pointer to the data array for the banded SUNMatrix.

F2003 Name This function is callable as FSUNBandMatrix Data when using the Fortran 2003 interface

module.

SUNBandMatrix Cols

Prototype realtype** SUNBandMatrix Cols(SUNMatrix A)

Description This function returns a pointer to the cols array for the banded SUNMatrix.

SUNBandMatrix Column

Prototype realtype* SUNBandMatrix Column(SUNMatrix A, sunindextype j)

Description This function returns a pointer to the diagonal entry of the j-th column of the banded

SUNMatrix. The resulting pointer should be indexed over the range −mu to ml.

F2003 Name This function is callable as FSUNBandMatrix Column when using the Fortran 2003 inter-

face module.

Notes

•When looping over the components of a banded SUNMatrix A, the most eﬃcient approaches are

to:

–First obtain the component array via A data = SM DATA B(A) or

A data = SUNBandMatrix Data(A) and then access A data[i] within the loop.

–First obtain the array of column pointers via A cols = SM COLS B(A) or

A cols = SUNBandMatrix Cols(A), and then access A cols[j][i] within the loop.

–Within a loop over the columns, access the column pointer via

A colj = SUNBandMatrix Column(A,j) and then to access the entries within that column

using SM COLUMN ELEMENT B(A colj,i,j).

All three of these are more eﬃcient than using SM ELEMENT B(A,i,j) within a double loop.

•Within the SUNMatMatvec Band routine, internal consistency checks are performed to ensure

that the matrix is called with consistent nvector implementations. These are currently limited

to: nvector serial,nvector openmp, and nvector pthreads. As additional compatible

vector implementations are added to sundials, these will be included within this compatibility

check.

8.3.3 SUNMatrix Band Fortran interfaces

The sunmatrix band module provides a Fortran 2003 module as well as Fortran 77 style interface

functions for use from Fortran applications.

FORTRAN 2003 interface module

The fsunmatrix band mod Fortran module deﬁnes interfaces to most sunmatrix band C functions

using the intrinsic iso c binding module which provides a standardized mechanism for interoperat-

ing with C. As noted in the Cfunction descriptions above, the interface functions are named after

the corresponding Cfunction, but with a leading ‘F’. For example, the function SUNBandMatrix is

interfaced as FSUNBandMatrix.

The Fortran 2003 sunmatrix band interface module can be accessed with the use statement,

i.e. use fsunmatrix band mod, and linking to the library libsundials fsunmatrixband mod.lib in

228 Description of the SUNMatrix module

addition to the Clibrary. For details on where the library and module ﬁle fsunmatrix band mod.mod

are installed see Appendix A. We note that the module is accessible from the Fortran 2003 sundials

integrators without separately linking to the libsundials fsunmatrixband mod library.

FORTRAN 77 interface functions

For solvers that include a Fortran interface module, the sunmatrix band module also includes

the Fortran-callable function FSUNBandMatInit(code, N, mu, ml, ier) to initialize this sunma-

trix band module for a given sundials solver. Here code is an integer input solver id (1 for cvode,

2 for ida, 3 for kinsol, 4 for arkode); N,mu, and ml are the corresponding band matrix construction

arguments (declared to match C type long int); and ier is an error return ﬂag equal to 0 for success

and -1 for failure. Both code and ier are declared to match C type int. Additionally, when using

arkode with a non-identity mass matrix, the Fortran-callable function FSUNBandMassMatInit(N,

mu, ml, ier) initializes this sunmatrix band module for storing the mass matrix.

8.4 The SUNMatrix Sparse implementation

The sparse implementation of the sunmatrix module provided with sundials,sunmatrix sparse,

is designed to work with either compressed-sparse-column (CSC) or compressed-sparse-row (CSR)

sparse matrix formats. To this end, it deﬁnes the content ﬁeld of SUNMatrix to be the following

structure:

struct _SUNMatrixContent_Sparse {

sunindextype M;

sunindextype N;

sunindextype NNZ;

sunindextype NP;

realtype *data;

int sparsetype;

sunindextype *indexvals;

sunindextype *indexptrs;

/* CSC indices */

sunindextype **rowvals;

sunindextype **colptrs;

/* CSR indices */

sunindextype **colvals;

sunindextype **rowptrs;

};

A diagram of the underlying data representation for a CSC matrix is shown in Figure 8.2 (the CSR

format is similar). A more complete description of the parts of this content ﬁeld is given below:

M- number of rows

N- number of columns

NNZ - maximum number of nonzero entries in the matrix (allocated length of data and

indexvals arrays)

NP - number of index pointers (e.g. number of column pointers for CSC matrix). For CSC

matrices NP =N, and for CSR matrices NP =M. This value is set automatically based

the input for sparsetype.

data - pointer to a contiguous block of realtype variables (of length NNZ), containing the

values of the nonzero entries in the matrix

sparsetype - type of the sparse matrix (CSC MAT or CSR MAT)

indexvals - pointer to a contiguous block of int variables (of length NNZ), containing the row indices

(if CSC) or column indices (if CSR) of each nonzero matrix entry held in data

8.4 The SUNMatrix Sparse implementation 229

indexptrs - pointer to a contiguous block of int variables (of length NP+1). For CSC matrices each

entry provides the index of the ﬁrst column entry into the data and indexvals arrays,

e.g. if indexptr[3]=7, then the ﬁrst nonzero entry in the fourth column of the matrix

is located in data[7], and is located in row indexvals[7] of the matrix. The last entry

contains the total number of nonzero values in the matrix and hence points one past the

end of the active data in the data and indexvals arrays. For CSR matrices, each entry

provides the index of the ﬁrst row entry into the data and indexvals arrays.

The following pointers are added to the SlsMat type for user convenience, to provide a more intuitive

interface to the CSC and CSR sparse matrix data structures. They are set automatically when creating

a sparse sunmatrix, based on the sparse matrix storage type.

rowvals - pointer to indexvals when sparsetype is CSC MAT, otherwise set to NULL.

colptrs - pointer to indexptrs when sparsetype is CSC MAT, otherwise set to NULL.

colvals - pointer to indexvals when sparsetype is CSR MAT, otherwise set to NULL.

rowptrs - pointer to indexptrs when sparsetype is CSR MAT, otherwise set to NULL.

For example, the 5 ×4 CSC matrix







0310

3002

0700

1009

0005







could be stored in this structure as either

M = 5;

N = 4;

NNZ = 8;

NP = N;

data = {3.0, 1.0, 3.0, 7.0, 1.0, 2.0, 9.0, 5.0};

sparsetype = CSC_MAT;

indexvals = {1, 3, 0, 2, 0, 1, 3, 4};

indexptrs = {0, 2, 4, 5, 8};

M = 5;

N = 4;

NNZ = 10;

NP = N;

data = {3.0, 1.0, 3.0, 7.0, 1.0, 2.0, 9.0, 5.0, *, *};

sparsetype = CSC_MAT;

indexvals = {1, 3, 0, 2, 0, 1, 3, 4, *, *};

indexptrs = {0, 2, 4, 5, 8};

where the ﬁrst has no unused space, and the second has additional storage (the entries marked with

*may contain any values). Note in both cases that the ﬁnal value in indexptrs is 8, indicating the

total number of nonzero entries in the matrix.

Similarly, in CSR format, the same matrix could be stored as

M = 5;

N = 4;

NNZ = 8;

NP = N;

data = {3.0, 1.0, 3.0, 2.0, 7.0, 1.0, 9.0, 5.0};

sparsetype = CSR_MAT;

indexvals = {1, 2, 0, 3, 1, 0, 3, 3};

indexptrs = {0, 2, 4, 5, 7, 8};

230 Description of the SUNMatrix module

data

j column 0

unused

storage

rowvals colptrs

indexvals indexptrs

colvals rowptrs

NULL NULL

A(*rowvals[j],1)

A(*rowvals[1],0)

A(*rowvals[0],0)

A(*rowvals[k],NP−1)

A(*rowvals[nz−1],NP−1)

column NP−1

NNZ

sparsetype=CSC_MAT

NNP = N

Figure 8.2: Diagram of the storage for a compressed-sparse-column matrix. Here Ais an M×Nsparse

matrix with storage for up to NNZ nonzero entries (the allocated length of both data and indexvals).

The entries in indexvals may assume values from 0 to M−1, corresponding to the row index (zero-

based) of each nonzero value. The entries in data contain the values of the nonzero entries, with the

row i, column jentry of A(again, zero-based) denoted as A(i,j). The indexptrs array contains N+1

entries; the ﬁrst Ndenote the starting index of each column within the indexvals and data arrays,

while the ﬁnal entry points one past the ﬁnal nonzero entry. Here, although NNZ values are allocated,

only nz are actually ﬁlled in; the greyed-out portions of data and indexvals indicate extra allocated

space.

8.4 The SUNMatrix Sparse implementation 231

The header ﬁle to include when using this module is sunmatrix/sunmatrix sparse.h. The sunma-

trix sparse module is accessible from all sundials solvers without linking to the

libsundials sunmatrixsparse module library.

8.4.1 SUNMatrix Sparse accessor macros

The following macros are provided to access the content of a sunmatrix sparse matrix. The preﬁx

SM in the names denotes that these macros are for SUNMatrix implementations, and the suﬃx S

denotes that these are speciﬁc to the sparse version.

•SM CONTENT S

This routine gives access to the contents of the sparse SUNMatrix.

The assignment A cont =SM CONTENT S(A) sets A cont to be a pointer to the sparse SUNMatrix

content structure.

Implementation:

#define SM_CONTENT_S(A) ( (SUNMatrixContent_Sparse)(A->content) )

•SM ROWS S,SM COLUMNS S,SM NNZ S,SM NP S, and SM SPARSETYPE S

These macros give individual access to various lengths relevant to the content of a sparse

SUNMatrix.

These may be used either to retrieve or to set these values. For example, the assignment A rows

= SM ROWS S(A) sets A rows to be the number of rows in the matrix A. Similarly, the assignment

SM COLUMNS S(A) = A cols sets the number of columns in Ato equal A cols.

Implementation:

#define SM_ROWS_S(A) ( SM_CONTENT_S(A)->M )

#define SM_COLUMNS_S(A) ( SM_CONTENT_S(A)->N )

#define SM_NNZ_S(A) ( SM_CONTENT_S(A)->NNZ )

#define SM_NP_S(A) ( SM_CONTENT_S(A)->NP )

#define SM_SPARSETYPE_S(A) ( SM_CONTENT_S(A)->sparsetype )

•SM DATA S,SM INDEXVALS S, and SM INDEXPTRS S

These macros give access to the data and index arrays for the matrix entries.

The assignment A data = SM DATA S(A) sets A data to be a pointer to the ﬁrst component of

the data array for the sparse SUNMatrix A. The assignment SM DATA S(A) = A data sets the

data array of Ato be A data by storing the pointer A data.

Similarly, the assignment A indexvals = SM INDEXVALS S(A) sets A indexvals to be a pointer

to the array of index values (i.e. row indices for a CSC matrix, or column indices for a CSR

matrix) for the sparse SUNMatrix A. The assignment A indexptrs = SM INDEXPTRS S(A) sets

A indexptrs to be a pointer to the array of index pointers (i.e. the starting indices in the

data/indexvals arrays for each row or column in CSR or CSC formats, respectively).

Implementation:

#define SM_DATA_S(A) ( SM_CONTENT_S(A)->data )

#define SM_INDEXVALS_S(A) ( SM_CONTENT_S(A)->indexvals )

#define SM_INDEXPTRS_S(A) ( SM_CONTENT_S(A)->indexptrs )

232 Description of the SUNMatrix module

8.4.2 SUNMatrix Sparse functions

The sunmatrix sparse module deﬁnes sparse implementations of all matrix operations listed in Ta-

ble 8.2. Their names are obtained from those in Table 8.2 by appending the suﬃx Sparse (e.g.

SUNMatCopy Sparse). All the standard matrix operations listed in 8.2 with the suﬃx Sparse ap-

pended are callable via the Fortran 2003 interface by prepending an ‘F’ (e.g. FSUNMatCopy Sparse).

The module sunmatrix sparse provides the following additional user-callable routines:

SUNSparseMatrix

Prototype SUNMatrix SUNSparseMatrix(sunindextype M, sunindextype N,

sunindextype NNZ, int sparsetype)

Description This function creates and allocates memory for a sparse SUNMatrix. Its arguments

are the number of rows and columns of the matrix, Mand N, the maximum number of

nonzeros to be stored in the matrix, NNZ, and a ﬂag sparsetype indicating whether to

use CSR or CSC format (valid arguments are CSR MAT or CSC MAT).

F2003 Name This function is callable as FSUNSparseMatrix when using the Fortran 2003 interface

module.

SUNSparseFromDenseMatrix

Prototype SUNMatrix SUNSparseFromDenseMatrix(SUNMatrix A, realtype droptol,

int sparsetype);

Description This function creates a new sparse matrix from an existing dense matrix by copying all

values with magnitude larger than droptol into the sparse matrix structure.

Requirements:

•Amust have type SUNMATRIX DENSE;

•droptol must be non-negative;

•sparsetype must be either CSC MAT or CSR MAT.

The function returns NULL if any requirements are violated, or if the matrix storage

request cannot be satisﬁed.

F2003 Name This function is callable as FSUNSparseFromDenseMatrix when using the Fortran 2003

interface module.

SUNSparseFromBandMatrix

Prototype SUNMatrix SUNSparseFromBandMatrix(SUNMatrix A, realtype droptol,

int sparsetype);

Description This function creates a new sparse matrix from an existing band matrix by copying all

values with magnitude larger than droptol into the sparse matrix structure.

Requirements:

•Amust have type SUNMATRIX BAND;

•droptol must be non-negative;

•sparsetype must be either CSC MAT or CSR MAT.

The function returns NULL if any requirements are violated, or if the matrix storage

request cannot be satisﬁed.

F2003 Name This function is callable as FSUNSparseFromBandMatrix when using the Fortran 2003

interface module.

8.4 The SUNMatrix Sparse implementation 233

SUNSparseMatrix Realloc

Prototype int SUNSparseMatrix Realloc(SUNMatrix A)

Description This function reallocates internal storage arrays in a sparse matrix so that the resulting

sparse matrix has no wasted space (i.e. the space allocated for nonzero entries equals

the actual number of nonzeros, indexptrs[NP]). Returns 0 on success and 1 on failure

(e.g. if the input matrix is not sparse).

F2003 Name This function is callable as FSUNSparseMatrix Realloc when using the Fortran 2003

interface module.

SUNSparseMatrix Reallocate

Prototype int SUNSparseMatrix Reallocate(SUNMatrix A, sunindextype NNZ)

Description This function reallocates internal storage arrays in a sparse matrix so that the resulting

sparse matrix has storage for a speciﬁed number of nonzeros. Returns 0 on success and

1 on failure (e.g. if the input matrix is not sparse or if NNZ is negative).

F2003 Name This function is callable as FSUNSparseMatrix Reallocate when using the Fortran 2003

interface module.

SUNSparseMatrix Print

Prototype void SUNSparseMatrix Print(SUNMatrix A, FILE* outfile)

Description This function prints the content of a sparse SUNMatrix to the output stream speciﬁed

by outfile. Note: stdout or stderr may be used as arguments for outfile to print

directly to standard output or standard error, respectively.

SUNSparseMatrix Rows

Prototype sunindextype SUNSparseMatrix Rows(SUNMatrix A)

Description This function returns the number of rows in the sparse SUNMatrix.

F2003 Name This function is callable as FSUNSparseMatrix Rows when using the Fortran 2003 inter-

face module.

SUNSparseMatrix Columns

Prototype sunindextype SUNSparseMatrix Columns(SUNMatrix A)

Description This function returns the number of columns in the sparse SUNMatrix.

F2003 Name This function is callable as FSUNSparseMatrix Columns when using the Fortran 2003

interface module.

SUNSparseMatrix NNZ

Prototype sunindextype SUNSparseMatrix NNZ(SUNMatrix A)

Description This function returns the number of entries allocated for nonzero storage for the sparse

matrix SUNMatrix.

F2003 Name This function is callable as FSUNSparseMatrix NNZ when using the Fortran 2003 inter-

face module.

234 Description of the SUNMatrix module

SUNSparseMatrix NP

Prototype sunindextype SUNSparseMatrix NP(SUNMatrix A)

Description This function returns the number of columns/rows for the sparse SUNMatrix, depending

on whether the matrix uses CSC/CSR format, respectively. The indexptrs array has

NP+1 entries.

F2003 Name This function is callable as FSUNSparseMatrix NP when using the Fortran 2003 interface

module.

SUNSparseMatrix SparseType

Prototype int SUNSparseMatrix SparseType(SUNMatrix A)

Description This function returns the storage type (CSR MAT or CSC MAT) for the sparse SUNMatrix.

F2003 Name This function is callable as FSUNSparseMatrix SparseType when using the Fortran 2003

interface module.

SUNSparseMatrix Data

Prototype realtype* SUNSparseMatrix Data(SUNMatrix A)

Description This function returns a pointer to the data array for the sparse SUNMatrix.

F2003 Name This function is callable as FSUNSparseMatrix Data when using the Fortran 2003 inter-

face module.

SUNSparseMatrix IndexValues

Prototype sunindextype* SUNSparseMatrix IndexValues(SUNMatrix A)

Description This function returns a pointer to index value array for the sparse SUNMatrix: for CSR

format this is the column index for each nonzero entry, for CSC format this is the row

index for each nonzero entry.

F2003 Name This function is callable as FSUNSparseMatrix IndexValues when using the Fortran

2003 interface module.

SUNSparseMatrix IndexPointers

Prototype sunindextype* SUNSparseMatrix IndexPointers(SUNMatrix A)

Description This function returns a pointer to the index pointer array for the sparse SUNMatrix:

for CSR format this is the location of the ﬁrst entry of each row in the data and

indexvalues arrays, for CSC format this is the location of the ﬁrst entry of each column.

F2003 Name This function is callable as FSUNSparseMatrix IndexPointers when using the Fortran

2003 interface module.

Within the SUNMatMatvec Sparse routine, internal consistency checks are performed to ensure that

the matrix is called with consistent nvector implementations. These are currently limited to: nvec-

tor serial,nvector openmp, and nvector pthreads. As additional compatible vector imple-

mentations are added to sundials, these will be included within this compatibility check.

8.4.3 SUNMatrix Sparse Fortran interfaces

The sunmatrix sparse module provides a Fortran 2003 module as well as Fortran 77 style

interface functions for use from Fortran applications.

8.4 The SUNMatrix Sparse implementation 235

FORTRAN 2003 interface module

The fsunmatrix sparse mod Fortran module deﬁnes interfaces to most sunmatrix sparse C func-

tions using the intrinsic iso c binding module which provides a standardized mechanism for interop-

erating with C. As noted in the Cfunction descriptions above, the interface functions are named after

the corresponding Cfunction, but with a leading ‘F’. For example, the function SUNSparseMatrix is

interfaced as FSUNSparseMatrix.

The Fortran 2003 sunmatrix sparse interface module can be accessed with the use statement,

i.e. use fsunmatrix sparse mod, and linking to the library libsundials fsunmatrixsparse mod.lib

in addition to the Clibrary. For details on where the library and module ﬁle fsunmatrix sparse mod.mod

are installed see Appendix A. We note that the module is accessible from the Fortran 2003 sundials

integrators without separately linking to the libsundials fsunmatrixsparse mod library.

FORTRAN 77 interface functions

For solvers that include a Fortran interface module, the sunmatrix sparse module also includes

the Fortran-callable function FSUNSparseMatInit(code, M, N, NNZ, sparsetype, ier) to initial-

ize this sunmatrix sparse module for a given sundials solver. Here code is an integer input for the

solver id (1 for cvode, 2 for ida, 3 for kinsol, 4 for arkode); M,Nand NNZ are the corresponding

sparse matrix construction arguments (declared to match C type long int); sparsetype is an integer

ﬂag indicating the sparse storage type (0 for CSC, 1 for CSR); and ier is an error return ﬂag equal to

0 for success and -1 for failure. Each of code,sparsetype and ier are declared so as to match C type

int. Additionally, when using arkode with a non-identity mass matrix, the Fortran-callable function

FSUNSparseMassMatInit(M, N, NNZ, sparsetype, ier) initializes this sunmatrix sparse mod-

ule for storing the mass matrix.

Chapter 9

Description of the

SUNLinearSolver module

For problems that involve the solution of linear systems of equations, the sundials packages oper-

ate using generic linear solver modules deﬁned through the sunlinsol API. This allows sundials

packages to utilize any valid sunlinsol implementation that provides a set of required functions.

These functions can be divided into three categories. The ﬁrst are the core linear solver functions.

The second group consists of “set” routines to supply the linear solver object with functions provided

by the sundials package, or for modiﬁcation of solver parameters. The last group consists of “get”

routines for retrieving artifacts (statistics, residual vectors, etc.) from the linear solver. All of these

functions are deﬁned in the header ﬁle sundials/sundials linearsolver.h.

The implementations provided with sundials work in coordination with the sundials generic

nvector and sunmatrix modules to provide a set of compatible data structures and solvers for the

solution of linear systems using direct or iterative (matrix-based or matrix-free) methods. Moreover,

advanced users can provide a customized SUNLinearSolver implementation to any sundials package,

particularly in cases where they provide their own nvector and/or sunmatrix modules.

Historically, the sundials packages have been designed to speciﬁcally leverage the use of either

direct linear solvers or matrix-free, scaled, preconditioned, iterative linear solvers. However, matrix-

based iterative linear solvers are also supported.

The iterative linear solvers packaged with sundials leverage scaling and preconditioning, as ap-

plicable, to balance error between solution components and to accelerate convergence of the linear

solver. To this end, instead of solving the linear system Ax =bdirectly, these apply the underlying

iterative algorithm to the transformed system

A˜x=˜

b(9.1)

where

A=S1P−1

1AP −1

2S−1

b=S1P−1

1b, (9.2)

˜x=S2P2x,

and where

•P1is the left preconditioner,

•P2is the right preconditioner,

•S1is a diagonal matrix of scale factors for P−1

1b,

•S2is a diagonal matrix of scale factors for P2x.

238 Description of the SUNLinearSolver module

sundials packages request that iterative linear solvers stop based on the 2-norm of the scaled pre-

conditioned residual meeting a prescribed tolerance



b−˜

A˜x



2<tol.

When provided an iterative sunlinsol implementation that does not support the scaling matrices

S1and S2,sundials’ packages will adjust the value of tol accordingly (see §9.4.2 for more details).

In this case, they instead request that iterative linear solvers stop based on the criteria



P−1

1b−P−1

1Ax

2<tol.

We note that the corresponding adjustments to tol in this case are non-optimal, in that they cannot

balance error between speciﬁc entries of the solution x, only the aggregate error in the overall solution

vector.

We further note that not all of the sundials-provided iterative linear solvers support the full

range of the above options (e.g., separate left/right preconditioning), and that some of the sundials

packages only utilize a subset of these options. Further details on these exceptions are described in

the documentation for each sunlinsol implementation, or for each sundials package.

For users interested in providing their own sunlinsol module, the following section presents

the sunlinsol API and its implementation beginning with the deﬁnition of sunlinsol functions

in sections 9.1.1 –9.1.3. This is followed by the deﬁnition of functions supplied to a linear solver

implementation in section 9.1.4. A table of linear solver return codes is given in section 9.1.5. The

SUNLinearSolver type and the generic sunlinsol module are deﬁned in section 9.1.6. The section 9.2

discusses compatibility between the sundials-provided sunlinsol modules and sunmatrix modules.

Section 9.3 lists the requirements for supplying a custom sunlinsol module and discusses some

intended use cases. Users wishing to supply their own sunlinsol module are encouraged to use

the sunlinsol implementations provided with sundials as a template for supplying custom linear

solver modules. The sunlinsol functions required by this sundials package as well as other package

speciﬁc details are given in section 9.4. The remaining sections of this chapter present the sunlinsol

modules provided with sundials.

9.1 The SUNLinearSolver API

The sunlinsol API deﬁnes several linear solver operations that enable sundials packages to utilize

any sunlinsol implementation that provides the required functions. These functions can be divided

into three categories. The ﬁrst are the core linear solver functions. The second group of functions con-

sists of set routines to supply the linear solver with functions provided by the sundials time integrators

and to modify solver parameters. The ﬁnal group consists of get routines for retrieving linear solver

statistics. All of these functions are deﬁned in the header ﬁle sundials/sundials linearsolver.h.

9.1.1 SUNLinearSolver core functions

The core linear solver functions consist of four required routines to get the linear solver type

(SUNLinSolGetType), initialize the linear solver object once all solver-speciﬁc options have been set

(SUNLinSolInitialize), set up the linear solver object to utilize an updated matrix A

(SUNLinSolSetup), and solve the linear system Ax =b(SUNLinSolSolve). The remaining routine

for destruction of the linear solver object (SUNLinSolFree) is optional.

SUNLinSolGetType

Call type = SUNLinSolGetType(LS);

Description The required function SUNLinSolGetType returns the type identiﬁer for the linear solver

LS. It is used to determine the solver type (direct, iterative, or matrix-iterative) from

the abstract SUNLinearSolver interface.

9.1 The SUNLinearSolver API 239

Arguments LS (SUNLinearSolver) a sunlinsol object.

Return value The return value type (of type int) will be one of the following:

•SUNLINEARSOLVER DIRECT –0, the sunlinsol module requires a matrix, and com-

putes an ‘exact’ solution to the linear system deﬁned by that matrix.

•SUNLINEARSOLVER ITERATIVE –1, the sunlinsol module does not require a matrix

(though one may be provided), and computes an inexact solution to the linear

system using a matrix-free iterative algorithm. That is it solves the linear system

deﬁned by the package-supplied ATimes routine (see SUNLinSolSetATimes below),

even if that linear system diﬀers from the one encoded in the matrix object (if one

is provided). As the solver computes the solution only inexactly (or may diverge),

the linear solver should check for solution convergence/accuracy as appropriate.

•SUNLINEARSOLVER MATRIX ITERATIVE –2, the sunlinsol module requires a ma-

trix, and computes an inexact solution to the linear system deﬁned by that matrix

using an iterative algorithm. That is it solves the linear system deﬁned by the

matrix object even if that linear system diﬀers from that encoded by the package-

supplied ATimes routine. As the solver computes the solution only inexactly (or

may diverge), the linear solver should check for solution convergence/accuracy as

appropriate.

Notes See section 9.3.1 for more information on intended use cases corresponding to the linear

solver type.

SUNLinSolInitialize

Call retval = SUNLinSolInitialize(LS);

Description The required function SUNLinSolInitialize performs linear solver initialization (as-

suming that all solver-speciﬁc options have been set).

Arguments LS (SUNLinearSolver) a sunlinsol object.

Return value This should return zero for a successful call, and a negative value for a failure, ideally

returning one of the generic error codes listed in Table 9.1.

SUNLinSolSetup

Call retval = SUNLinSolSetup(LS, A);

Description The required function SUNLinSolSetup performs any linear solver setup needed, based

on an updated system sunmatrix A. This may be called frequently (e.g., with a full

Newton method) or infrequently (for a modiﬁed Newton method), based on the type of

integrator and/or nonlinear solver requesting the solves.

Arguments LS (SUNLinearSolver) a sunlinsol object.

A(SUNMatrix) a sunmatrix object.

Return value This should return zero for a successful call, a positive value for a recoverable failure

and a negative value for an unrecoverable failure, ideally returning one of the generic

error codes listed in Table 9.1.

SUNLinSolSolve

Call retval = SUNLinSolSolve(LS, A, x, b, tol);

Description The required function SUNLinSolSolve solves a linear system Ax =b.

Arguments LS (SUNLinearSolver) a sunlinsol object.

A(SUNMatrix) a sunmatrix object.

240 Description of the SUNLinearSolver module

x(N Vector) a nvector object containing the initial guess for the solution of the

linear system, and the solution to the linear system upon return.

b(N Vector) a nvector object containing the linear system right-hand side.

tol (realtype) the desired linear solver tolerance.

Return value This should return zero for a successful call, a positive value for a recoverable failure

and a negative value for an unrecoverable failure, ideally returning one of the generic

error codes listed in Table 9.1.

Notes Direct solvers: can ignore the tol argument.

Matrix-free solvers: (those that identify as SUNLINEARSOLVER ITERATIVE) can ignore

the sunmatrix input A, and should instead rely on the matrix-vector product function

supplied through the routine SUNLinSolSetATimes.

Iterative solvers: (those that identify as SUNLINEARSOLVER ITERATIVE or

SUNLINEARSOLVER MATRIX ITERATIVE) should attempt to solve to the speciﬁed toler-

ance tol in a weighted 2-norm. If the solver does not support scaling then it should

just use a 2-norm.

SUNLinSolFree

Call retval = SUNLinSolFree(LS);

Description The optional function SUNLinSolFree frees memory allocated by the linear solver.

Arguments LS (SUNLinearSolver) a sunlinsol object.

Return value This should return zero for a successful call and a negative value for a failure.

9.1.2 SUNLinearSolver set functions

The following set functions are used to supply linear solver modules with functions deﬁned by the

sundials packages and to modify solver parameters. Only the routine for setting the matrix-vector

product routine is required, and that is only for matrix-free linear solver modules. Otherwise, all other

set functions are optional. sunlinsol implementations that do not provide the functionality for any

optional routine should leave the corresponding function pointer NULL instead of supplying a dummy

routine.

SUNLinSolSetATimes

Call retval = SUNLinSolSetATimes(LS, A data, ATimes);

Description The function SUNLinSolSetATimes is required for matrix-free linear solvers; otherwise

it is optional.

This routine provides an ATimesFn function pointer, as well as a void* pointer to a

data structure used by this routine, to a linear solver object. sundials packages will

call this function to set the matrix-vector product function to either a solver-provided

diﬀerence-quotient via vector operations or a user-supplied solver-speciﬁc routine.

Arguments LS (SUNLinearSolver) a sunlinsol object.

A data (void*) data structure passed to ATimes.

ATimes (ATimesFn) function pointer implementing the matrix-vector product routine.

Return value This routine should return zero for a successful call, and a negative value for a failure,

ideally returning one of the generic error codes listed in Table 9.1.

9.1 The SUNLinearSolver API 241

SUNLinSolSetPreconditioner

Call retval = SUNLinSolSetPreconditioner(LS, Pdata, Pset, Psol);

Description The optional function SUNLinSolSetPreconditioner provides PSetupFn and PSolveFn

function pointers that implement the preconditioner solves P−1

1and P−1

2from equations

(9.1)-(9.2). This routine will be called by a sundials package, which will provide

translation between the generic Pset and Psol calls and the package- or user-supplied

routines.

Arguments LS (SUNLinearSolver) a sunlinsol object.

Pdata (void*) data structure passed to both Pset and Psol.

Pset (PSetupFn) function pointer implementing the preconditioner setup.

Psol (PSolveFn) function pointer implementing the preconditioner solve.

Return value This routine should return zero for a successful call, and a negative value for a failure,

ideally returning one of the generic error codes listed in Table 9.1.

SUNLinSolSetScalingVectors

Call retval = SUNLinSolSetScalingVectors(LS, s1, s2);

Description The optional function SUNLinSolSetScalingVectors provides left/right scaling vectors

for the linear system solve. Here, s1 and s2 are nvector of positive scale factors con-

taining the diagonal of the matrices S1and S2from equations (9.1)-(9.2), respectively.

Neither of these vectors need to be tested for positivity, and a NULL argument for either

indicates that the corresponding scaling matrix is the identity.

Arguments LS (SUNLinearSolver) a sunlinsol object.

s1 (N Vector) diagonal of the matrix S1

s2 (N Vector) diagonal of the matrix S2

Return value This routine should return zero for a successful call, and a negative value for a failure,

ideally returning one of the generic error codes listed in Table 9.1.

9.1.3 SUNLinearSolver get functions

The following get functions allow sundials packages to retrieve results from a linear solve. All routines

are optional.

SUNLinSolNumIters

Call its = SUNLinSolNumIters(LS);

Description The optional function SUNLinSolNumIters should return the number of linear iterations

performed in the last ‘solve’ call.

Arguments LS (SUNLinearSolver) a sunlinsol object.

Return value int containing the number of iterations

SUNLinSolResNorm

Call rnorm = SUNLinSolResNorm(LS);

Description The optional function SUNLinSolResNorm should return the ﬁnal residual norm from

the last ‘solve’ call.

Arguments LS (SUNLinearSolver) a sunlinsol object.

Return value realtype containing the ﬁnal residual norm

242 Description of the SUNLinearSolver module

SUNLinSolResid

Call rvec = SUNLinSolResid(LS);

Description If an iterative method computes the preconditioned initial residual and returns with

a successful solve without performing any iterations (i.e., either the initial guess or

the preconditioner is suﬃciently accurate), then this optional routine may be called

by the sundials package. This routine should return the nvector containing the

preconditioned initial residual vector.

Arguments LS (SUNLinearSolver) a sunlinsol object.

Return value N Vector containing the ﬁnal residual vector

Notes Since N Vector is actually a pointer, and the results are not modiﬁed, this routine

should not require additional memory allocation. If the sunlinsol object does not

retain a vector for this purpose, then this function pointer should be set to NULL in the

implementation.

SUNLinSolLastFlag

Call lflag = SUNLinSolLastFlag(LS);

Description The optional function SUNLinSolLastFlag should return the last error ﬂag encountered

within the linear solver. This is not called by the sundials packages directly; it allows

the user to investigate linear solver issues after a failed solve.

Arguments LS (SUNLinearSolver) a sunlinsol object.

Return value long int containing the most recent error ﬂag

SUNLinSolSpace

Call retval = SUNLinSolSpace(LS, &lrw, &liw);

Description The optional function SUNLinSolSpace should return the storage requirements for the

linear solver LS.

Arguments LS (SUNLinearSolver) a sunlinsol object.

lrw (long int*) the number of realtype words stored by the linear solver.

liw (long int*) the number of integer words stored by the linear solver.

Return value This should return zero for a successful call, and a negative value for a failure, ideally

returning one of the generic error codes listed in Table 9.1.

Notes This function is advisory only, for use in determining a user’s total space requirements.

9.1.4 Functions provided by sundials packages

To interface with the sunlinsol modules, the sundials packages supply a variety of routines for

evaluating the matrix-vector product, and setting up and applying the preconditioner. These package-

provided routines translate between the user-supplied ODE, DAE, or nonlinear systems and the generic

interfaces to the linear systems of equations that result in their solution. The types for functions

provided to a sunlinsol module are deﬁned in the header ﬁle sundials/sundials iterative.h,

and are described below.

ATimesFn

Deﬁnition typedef int (*ATimesFn)(void *A data, N Vector v, N Vector z);

Purpose These functions compute the action of a matrix on a vector, performing the operation

z=Av. Memory for zshould already be allocted prior to calling this function. The

vector vshould be left unchanged.

9.1 The SUNLinearSolver API 243

Arguments A data is a pointer to client data, the same as that supplied to SUNLinSolSetATimes.

vis the input vector to multiply.

zis the output vector computed.

Return value This routine should return 0 if successful and a non-zero value if unsuccessful.

PSetupFn

Deﬁnition typedef int (*PSetupFn)(void *P data)

Purpose These functions set up any requisite problem data in preparation for calls to the corre-

sponding PSolveFn.

Arguments P data is a pointer to client data, the same pointer as that supplied to the routine

SUNLinSolSetPreconditioner.

Return value This routine should return 0 if successful and a non-zero value if unsuccessful.

PSolveFn

Deﬁnition typedef int (*PSolveFn)(void *P data, N Vector r, N Vector z,

realtype tol, int lr)

Purpose These functions solve the preconditioner equation P z =rfor the vector z. Memory for

zshould already be allocted prior to calling this function. The parameter P data is a

pointer to any information about Pwhich the function needs in order to do its job (set

up by the corresponding PSetupFn). The parameter lr is input, and indicates whether

Pis to be taken as the left preconditioner or the right preconditioner: lr = 1 for left

and lr = 2 for right. If preconditioning is on one side only, lr can be ignored. If the

preconditioner is iterative, then it should strive to solve the preconditioner equation so

that

kP z −rkwrms < tol

where the weight vector for the WRMS norm may be accessed from the main package

memory structure. The vector rshould not be modiﬁed by the PSolveFn.

Arguments P data is a pointer to client data, the same pointer as that supplied to the routine

SUNLinSolSetPreconditioner.

ris the right-hand side vector for the preconditioner system.

zis the solution vector for the preconditioner system.

tol is the desired tolerance for an iterative preconditioner.

lr is ﬂag indicating whether the routine should perform left (1) or right (2) pre-

conditioning.

Return value This routine should return 0 if successful and a non-zero value if unsuccessful. On a

failure, a negative return value indicates an unrecoverable condition, while a positive

value indicates a recoverable one, in which the calling routine may reattempt the solution

after updating preconditioner data.

9.1.5 SUNLinearSolver return codes

The functions provided to sunlinsol modules by each sundials package, and functions within the

sundials-provided sunlinsol implementations utilize a common set of return codes, shown in Table

9.1. These adhere to a common pattern: 0 indicates success, a postitive value corresponds to a

recoverable failure, and a negative value indicates a non-recoverable failure. Aside from this pattern,

the actual values of each error code are primarily to provide additional information to the user in case

of a linear solver failure.

244 Description of the SUNLinearSolver module

Table 9.1: Description of the SUNLinearSolver error codes

Name Value Description

SUNLS SUCCESS 0 successful call or converged solve

SUNLS MEM NULL -1 the memory argument to the function is NULL

SUNLS ILL INPUT -2 an illegal input has been provided to the function

SUNLS MEM FAIL -3 failed memory access or allocation

SUNLS ATIMES FAIL UNREC -4 an unrecoverable failure occurred in the ATimes routine

SUNLS PSET FAIL UNREC -5 an unrecoverable failure occurred in the Pset routine

SUNLS PSOLVE FAIL UNREC -6 an unrecoverable failure occurred in the Psolve routine

SUNLS PACKAGE FAIL UNREC -7 an unrecoverable failure occurred in an external linear

solver package

SUNLS GS FAIL -8 a failure occurred during Gram-Schmidt orthogonalization

(sunlinsol spgmr/sunlinsol spfgmr)

SUNLS QRSOL FAIL -9 a singular Rmatrix was encountered in a QR factorization

(sunlinsol spgmr/sunlinsol spfgmr)

SUNLS RES REDUCED 1 an iterative solver reduced the residual, but did not con-

verge to the desired tolerance

SUNLS CONV FAIL 2 an iterative solver did not converge (and the residual was

not reduced)

SUNLS ATIMES FAIL REC 3 a recoverable failure occurred in the ATimes routine

SUNLS PSET FAIL REC 4 a recoverable failure occurred in the Pset routine

SUNLS PSOLVE FAIL REC 5 a recoverable failure occurred in the Psolve routine

SUNLS PACKAGE FAIL REC 6 a recoverable failure occurred in an external linear solver

package

SUNLS QRFACT FAIL 7 a singular matrix was encountered during a QR factoriza-

tion (sunlinsol spgmr/sunlinsol spfgmr)

SUNLS LUFACT FAIL 8 a singular matrix was encountered during a LU factorization

(sunlinsol dense/sunlinsol band)

9.1.6 The generic SUNLinearSolver module

sundials packages interact with speciﬁc sunlinsol implementations through the generic sunlinsol

module on which all other sunlinsol iplementations are built. The SUNLinearSolver type is a

pointer to a structure containing an implementation-dependent content ﬁeld, and an ops ﬁeld. The

type SUNLinearSolver is deﬁned as

typedef struct _generic_SUNLinearSolver *SUNLinearSolver;

struct _generic_SUNLinearSolver {

void *content;

struct _generic_SUNLinearSolver_Ops *ops;

};

where the generic SUNLinearSolver Ops structure is a list of pointers to the various actual lin-

ear solver operations provided by a speciﬁc implementation. The generic SUNLinearSolver Ops

structure is deﬁned as

struct _generic_SUNLinearSolver_Ops {

SUNLinearSolver_Type (*gettype)(SUNLinearSolver);

9.2 Compatibility of SUNLinearSolver modules 245

int (*setatimes)(SUNLinearSolver, void*, ATimesFn);

int (*setpreconditioner)(SUNLinearSolver, void*,

PSetupFn, PSolveFn);

int (*setscalingvectors)(SUNLinearSolver,

N_Vector, N_Vector);

int (*initialize)(SUNLinearSolver);

int (*setup)(SUNLinearSolver, SUNMatrix);

int (*solve)(SUNLinearSolver, SUNMatrix, N_Vector,

N_Vector, realtype);

int (*numiters)(SUNLinearSolver);

realtype (*resnorm)(SUNLinearSolver);

long int (*lastflag)(SUNLinearSolver);

int (*space)(SUNLinearSolver, long int*, long int*);

N_Vector (*resid)(SUNLinearSolver);

int (*free)(SUNLinearSolver);

};

The generic sunlinsol module deﬁnes and implements the linear solver operations deﬁned in

Sections 9.1.1-9.1.3. These routines are in fact only wrappers to the linear solver operations de-

ﬁned by a particular sunlinsol implementation, which are accessed through the ops ﬁeld of the

SUNLinearSolver structure. To illustrate this point we show below the implementation of a typical

linear solver operation from the generic sunlinsol module, namely SUNLinSolInitialize, which

initializes a sunlinsol object for use after it has been created and conﬁgured, and returns a ﬂag

denoting a successful/failed operation:

int SUNLinSolInitialize(SUNLinearSolver S)

{

return ((int) S->ops->initialize(S));

}

9.2 Compatibility of SUNLinearSolver modules

We note that not all sunlinsol types are compatible with all sunmatrix and nvector types provided

with sundials. In Table 9.2 we show the matrix-based linear solvers available as sunlinsol modules,

and the compatible matrix implementations. Recall that Table 4.1 shows the compatibility between

all sunlinsol modules and vector implementations.

Table 9.2: sundials matrix-based linear solvers and matrix implementations that can be used for

each. Linear Solver

Interface

Dense

Matrix

Banded

Matrix

Sparse

Matrix

User

Supplied

Dense X X

Band X X

LapackDense X X

LapackBand X X

klu X X

superlumt X X

User supplied X X X X

9.3 Implementing a custom SUNLinearSolver module

A particular implementation of the sunlinsol module must:

246 Description of the SUNLinearSolver module

•Specify the content ﬁeld of the SUNLinearSolver object.

•Deﬁne and implement a minimal subset of the linear solver operations. See the section 9.4 to

determine which sunlinsol operations are required for this sundials package.

Note that the names of these routines should be unique to that implementation in order to

permit using more than one sunlinsol module (each with diﬀerent SUNLinearSolver internal

data representations) in the same code.

•Deﬁne and implement user-callable constructor and destructor routines to create and free a

SUNLinearSolver with the new content ﬁeld and with ops pointing to the new linear solver

operations.

We note that the function pointers for all unsupported optional routines should be set to NULL in

the ops structure. This allows the sundials package that is using the sunlinsol object to know that

the associated functionality is not supported.

Additionally, a sunlinsol implementation may do the following:

•Deﬁne and implement additional user-callable “set” routines acting on the SUNLinearSolver,

e.g., for setting various conﬁguration options to tune the linear solver to a particular problem.

•Provide additional user-callable “get” routines acting on the SUNLinearSolver object, e.g., for

returning various solve statistics.

9.3.1 Intended use cases

The sunlinsol (and sunmatrix) APIs are designed to require a minimal set of routines to ease

interfacing with custom or third-party linear solver libraries. External solvers provide similar routines

with the necessary functionality and thus will require minimal eﬀort to wrap within custom sunmatrix

and sunlinsol implementations. Sections 8.1 and 9.4 include a list of the required set of routines that

compatible sunmatrix and sunlinsol implementations must provide. As sundials packages utilize

generic sunlinsol modules allowing for user-supplied SUNLinearSolver implementations, there exists

a wide range of possible linear solver combinations. Some intended use cases for both the sundials-

provided and user-supplied sunlinsol modules are discussd in the following sections.

Direct linear solvers

Direct linear solver modules require a matrix and compute an ‘exact’ solution to the linear system

deﬁned by the matrix. Multiple matrix formats and associated direct linear solvers are supplied with

sundials through diﬀerent sunmatrix and sunlinsol implementations. sundials packages strive

to amortize the high cost of matrix construction by reusing matrix information for multiple nonlinear

iterations. As a result, each package’s linear solver interface recomputes Jacobian information as

infrequently as possible.

Alternative matrix storage formats and compatible linear solvers that are not currently provided

by, or interfaced with, sundials can leverage this infrastructure with minimal eﬀort. To do so, a user

must implement custom sunmatrix and sunlinsol wrappers for the desired matrix format and/or

linear solver following the APIs described in Chapters 8and 9.This user-supplied sunlinsol module

must then self-identify as having SUNLINEARSOLVER DIRECT type.

Matrix-free iterative linear solvers

Matrix-free iterative linear solver modules do not require a matrix and compute an inexact solution to

the linear system deﬁned by the package-supplied ATimes routine.sundials supplies multiple scaled,

preconditioned iterative linear solver (spils) sunlinsol modules that support scaling to allow users to

handle non-dimensionalization (as best as possible) within each sundials package and retain variables

and deﬁne equations as desired in their applications. For linear solvers that do not support left/right

scaling, the tolerance supplied to the linear solver is adjusted to compensate (see section 9.4.2 for

9.4 IDAS SUNLinearSolver interface 247

more details); however, this use case may be non-optimal and cannot handle situations where the

magnitudes of diﬀerent solution components or equations vary dramatically within a single problem.

To utilize alternative linear solvers that are not currently provided by, or interfaced with, sundi-

als a user must implement a custom sunlinsol wrapper for the linear solver following the API

described in Chapter 9.This user-supplied sunlinsol module must then self-identify as having

SUNLINEARSOLVER ITERATIVE type.

Matrix-based iterative linear solvers (reusing A)

Matrix-based iterative linear solver modules require a matrix and compute an inexact solution to

the linear system deﬁned by the matrix. This matrix will be updated infrequently and resued across

multiple solves to amortize cost of matrix construction. As in the direct linear solver case, only

wrappers for the matrix and linear solver in sunmatrix and sunlinsol implementations need to be

created to utilize a new linear solver. This user-supplied sunlinsol module must then self-identify as

having SUNLINEARSOLVER MATRIX ITERATIVE type.

At present, sundials has one example problem that uses this approach for wrapping a structured-

grid matrix, linear solver, and preconditioner from the hypre library that may be used as a template

for other customized implementations (see examples/arkode/CXX parhyp/ark heat2D hypre.cpp).

Matrix-based iterative linear solvers (current A)

For users who wish to utilize a matrix-based iterative linear solver module where the matrix is purely

for preconditioning and the linear system is deﬁned by the package-supplied ATimes routine, we envision

two current possibilities.

The preferred approach is for users to employ one of the sundials spils sunlinsol implementa-

tions (sunlinsol spgmr,sunlinsol spfgmr,sunlinsol spbcgs,sunlinsol sptfqmr, or sunlin-

sol pcg) as the outer solver. The creation and storage of the preconditioner matrix, and interfacing

with the corresponding linear solver, can be handled through a package’s preconditioner ‘setup’ and

‘solve’ functionality (see §4.5.8.2) without creating sunmatrix and sunlinsol implementations. This

usage mode is recommended primarily because the sundials-provided spils modules support the scal-

ing as described above.

A second approach supported by the linear solver APIs is as follows. If the sunlinsol implemen-

tation is matrix-based, self-identiﬁes as having SUNLINEARSOLVER ITERATIVE type, and also provides

a non-NULL SUNLinSolSetATimes routine, then each sundials package will call that routine to attach

its package-speciﬁc matrix-vector product routine to the sunlinsol object. The sundials package

will then call the sunlinsol-provided SUNLinSolSetup routine (infrequently) to update matrix infor-

mation, but will provide current matrix-vector products to the sunlinsol implementation through

the package-supplied ATimesFn routine.

9.4 IDAS SUNLinearSolver interface

Table 9.3 below lists the sunlinsol module linear solver functions used within the idals interface. As

with the sunmatrix module, we emphasize that the ida user does not need to know detailed usage

of linear solver functions by the ida code modules in order to use ida. The information is presented

as an implementation detail for the interested reader.

The linear solver functions listed below are marked with Xto indicate that they are required, or

with †to indicate that they are only called if they are non-NULL in the sunlinsol implementation

that is being used. Note:

1. Although idals does not call SUNLinSolLastFlag directly, this routine is available for users to

query linear solver issues directly.

2. Although idals does not call SUNLinSolFree directly, this routine should be available for users

to call when cleaning up from a simulation.

248 Description of the SUNLinearSolver module

Table 9.3: List of linear solver function usage in the idals interface

DIRECT

ITERATIVE

MATRIX ITERATIVE

SUNLinSolGetType X X X

SUNLinSolSetATimes †X†

SUNLinSolSetPreconditioner †††

SUNLinSolSetScalingVectors †††

SUNLinSolInitialize X X X

SUNLinSolSetup X X X

SUNLinSolSolve X X X

SUNLinSolNumIters X X

SUNLinSolResid X X

1SUNLinSolLastFlag

2SUNLinSolFree

SUNLinSolSpace †††

Since there are a wide range of potential sunlinsol use cases, the following subsections describe

some details of the idals interface, in the case that interested users wish to develop custom sunlinsol

modules.

9.4.1 Lagged matrix information

If the sunlinsol object self-identiﬁes as having type SUNLINEARSOLVER DIRECT or

SUNLINEARSOLVER MATRIX ITERATIVE, then the sunlinsol object solves a linear system deﬁned by a

sunmatrix object. cvls will update the matrix information infrequently according to the strategies

outlined in §2.1. When solving a linear system J¯x=b, it is likely that the value ¯αused to construct

Jdiﬀers from the current value of αin the BDF method, since Jis updated infrequently. Therefore,

after calling the sunlinsol-provided SUNLinSolSolve routine, we test whether α/¯α6= 1, and if this

is the case we scale the solution ¯xto obtain the desired linear system solution xvia

x=2

1 + α/¯α¯x. (9.3)

For values of α/¯αthat are “close” to 1, this rescaling approximately solves the original linear system.

9.4.2 Iterative linear solver tolerance

If the sunlinsol object self-identiﬁes as having type SUNLINEARSOLVER ITERATIVE or

SUNLINEARSOLVER MATRIX ITERATIVE then idals will set the input tolerance delta as described in

§2.1. However, if the iterative linear solver does not support scaling matrices (i.e., the

SUNLinSolSetScalingVectors routine is NULL), then idals will attempt to adjust the linear solver

tolerance to account for this lack of functionality. To this end, the following assumptions are made:

1. All solution components have similar magnitude; hence the error weight vector Wused in the

WRMS norm (see §2.1) should satisfy the assumption

Wi≈Wmean,for i= 0, . . . , n −1.

9.5 The SUNLinearSolver Dense implementation 249

2. The sunlinsol object uses a standard 2-norm to measure convergence.

Since ida uses identical left and right scaling matrices, S1=S2=S= diag(W), then the linear

solver convergence requirement is converted as follows (using the notation from equations (9.1)-(9.2)):



b−˜

A˜x



2<tol

⇔

SP −1

1b−SP −1

1Ax

2<tol

⇔

n−1

i=0 WiP−1

1(b−Ax)i2<tol2

⇔W2

mean

n−1

i=0 P−1

1(b−Ax)i2<tol2

⇔

n−1

i=0 P−1

1(b−Ax)i2<tol

Wmean 2

⇔

P−1

1(b−Ax)

2<tol

Wmean

Therefore the tolerance scaling factor

Wmean =kWk2/√n

is computed and the scaled tolerance delta= tol/Wmean is supplied to the sunlinsol object.

9.5 The SUNLinearSolver Dense implementation

This section describes the sunlinsol implementation for solving dense linear systems. The sunlin-

sol dense module is designed to be used with the corresponding sunmatrix dense matrix type, and

one of the serial or shared-memory nvector implementations (nvector serial,nvector openmp,

or nvector pthreads).

To access the sunlinsol dense module, include the header ﬁle sunlinsol/sunlinsol dense.h.

We note that the sunlinsol dense module is accessible from sundials packages without separately

linking to the libsundials sunlinsoldense module library.

9.5.1 SUNLinearSolver Dense description

This solver is constructed to perform the following operations:

•The “setup” call performs a LU factorization with partial (row) pivoting (O(N3) cost), P A =

LU, where Pis a permutation matrix, Lis a lower triangular matrix with 1’s on the diago-

nal, and Uis an upper triangular matrix. This factorization is stored in-place on the input

sunmatrix dense object A, with pivoting information encoding Pstored in the pivots array.

•The “solve” call performs pivoting and forward and backward substitution using the stored

pivots array and the LU factors held in the sunmatrix dense object (O(N2) cost).

9.5.2 SUNLinearSolver Dense functions

The sunlinsol dense module provides the following user-callable constructor for creating a

SUNLinearSolver object.

250 Description of the SUNLinearSolver module

SUNLinSol Dense

Call LS = SUNLinSol Dense(y, A);

Description The function SUNLinSol Dense creates and allocates memory for a dense

SUNLinearSolver object.

Arguments y(N Vector) a template for cloning vectors needed within the solver

A(SUNMatrix) a sunmatrix dense matrix template for cloning matrices needed

within the solver

Return value This returns a SUNLinearSolver object. If either Aor yare incompatible then this

routine will return NULL.

Notes This routine will perform consistency checks to ensure that it is called with con-

sistent nvector and sunmatrix implementations. These are currently limited to

the sunmatrix dense matrix type and the nvector serial,nvector openmp,

and nvector pthreads vector types. As additional compatible matrix and vec-

tor implementations are added to sundials, these will be included within this

compatibility check.

Deprecated Name For backward compatibility, the wrapper function SUNDenseLinearSolver with

idential input and output arguments is also provided.

F2003 Name This function is callable as FSUNLinSol Dense when using the Fortran 2003 inter-

face module.

The sunlinsol dense module deﬁnes implementations of all “direct” linear solver operations listed

in Sections 9.1.1 –9.1.3:

•SUNLinSolGetType Dense

•SUNLinSolInitialize Dense – this does nothing, since all consistency checks are performed at

solver creation.

•SUNLinSolSetup Dense – this performs the LU factorization.

•SUNLinSolSolve Dense – this uses the LU factors and pivots array to perform the solve.

•SUNLinSolLastFlag Dense

•SUNLinSolSpace Dense – this only returns information for the storage within the solver object,

i.e. storage for N,last flag, and pivots.

•SUNLinSolFree Dense

All of the listed operations are callable via the Fortran 2003 interface module by prepending an ‘F’

to the function name.

9.5.3 SUNLinearSolver Dense Fortran interfaces

The sunlinsol dense module provides a Fortran 2003 module as well as Fortran 77 style interface

functions for use from Fortran applications.

FORTRAN 2003 interface module

The fsunlinsol dense mod Fortran module deﬁnes interfaces to all sunlinsol dense C functions

using the intrinsic iso c binding module which provides a standardized mechanism for interoperating

with C. As noted in the Cfunction descriptions above, the interface functions are named after the

corresponding Cfunction, but with a leading ‘F’. For example, the function SUNLinSol Dense is

interfaced as FSUNLinSol Dense.

The Fortran 2003 sunlinsol dense interface module can be accessed with the use statement,

i.e. use fsunlinsol dense mod, and linking to the library libsundials fsunlinsoldense mod.lib in

9.5 The SUNLinearSolver Dense implementation 251

addition to the Clibrary. For details on where the library and module ﬁle fsunlinsol dense mod.mod

are installed see Appendix A. We note that the module is accessible from the Fortran 2003 sundials

integrators without separately linking to the libsundials fsunlinsoldense mod library.

FORTRAN 77 interface functions

For solvers that include a Fortran 77 interface module, the sunlinsol dense module also includes

a Fortran-callable function for creating a SUNLinearSolver object.

FSUNDENSELINSOLINIT

Call FSUNDENSELINSOLINIT(code, ier)

Description The function FSUNDENSELINSOLINIT can be called for Fortran programs to create a

dense SUNLinearSolver object.

Arguments code (int*) is an integer input specifying the solver id (1 for cvode, 2 for ida, 3 for

kinsol, and 4 for arkode).

Return value ier is a return completion ﬂag equal to 0for a success return and -1 otherwise. See

printed message for details in case of failure.

Notes This routine must be called after both the nvector and sunmatrix objects have been

initialized.

Additionally, when using arkode with a non-identity mass matrix, the sunlinsol dense module

includes a Fortran-callable function for creating a SUNLinearSolver mass matrix solver object.

FSUNMASSDENSELINSOLINIT

Call FSUNMASSDENSELINSOLINIT(ier)

Description The function FSUNMASSDENSELINSOLINIT can be called for Fortran programs to create

a dense SUNLinearSolver object for mass matrix linear systems.

Arguments None

Return value ier is a int return completion ﬂag equal to 0for a success return and -1 otherwise.

See printed message for details in case of failure.

Notes This routine must be called after both the nvector and sunmatrix mass-matrix

objects have been initialized.

9.5.4 SUNLinearSolver Dense content

The sunlinsol dense module deﬁnes the content ﬁeld of a SUNLinearSolver as the following struc-

ture:

struct _SUNLinearSolverContent_Dense {

sunindextype N;

sunindextype *pivots;

long int last_flag;

};

These entries of the content ﬁeld contain the following information:

N- size of the linear system,

pivots - index array for partial pivoting in LU factorization,

last flag - last error return ﬂag from internal function evaluations.

252 Description of the SUNLinearSolver module

9.6 The SUNLinearSolver Band implementation

This section describes the sunlinsol implementation for solving banded linear systems. The sunlin-

sol band module is designed to be used with the corresponding sunmatrix band matrix type, and

one of the serial or shared-memory nvector implementations (nvector serial,nvector openmp,

or nvector pthreads).

To access the sunlinsol band module, include the header ﬁle sunlinsol/sunlinsol band.h.

We note that the sunlinsol band module is accessible from sundials packages without separately

linking to the libsundials sunlinsolband module library.

9.6.1 SUNLinearSolver Band description

This solver is constructed to perform the following operations:

•The “setup” call performs a LU factorization with partial (row) pivoting, P A =LU, where P

is a permutation matrix, Lis a lower triangular matrix with 1’s on the diagonal, and Uis an

upper triangular matrix. This factorization is stored in-place on the input sunmatrix band

object A, with pivoting information encoding Pstored in the pivots array.

•The “solve” call performs pivoting and forward and backward substitution using the stored

pivots array and the LU factors held in the sunmatrix band object.

•Amust be allocated to accommodate the increase in upper bandwidth that occurs during factor-

ization. More precisely, if Ais a band matrix with upper bandwidth mu and lower bandwidth ml,

then the upper triangular factor Ucan have upper bandwidth as big as smu = MIN(N-1,mu+ml).

The lower triangular factor Lhas lower bandwidth ml.

9.6.2 SUNLinearSolver Band functions

The sunlinsol band module provides the following user-callable constructor for creating a

SUNLinearSolver object.

SUNLinSol Band

Call LS = SUNLinSol Band(y, A);

Description The function SUNLinSol Band creates and allocates memory for a band

SUNLinearSolver object.

Arguments y(N Vector) a template for cloning vectors needed within the solver

A(SUNMatrix) a sunmatrix band matrix template for cloning matrices needed

within the solver

Return value This returns a SUNLinearSolver object. If either Aor yare incompatible then this

routine will return NULL.

Notes This routine will perform consistency checks to ensure that it is called with con-

sistent nvector and sunmatrix implementations. These are currently limited to

the sunmatrix band matrix type and the nvector serial,nvector openmp,

and nvector pthreads vector types. As additional compatible matrix and vec-

tor implementations are added to sundials, these will be included within this

compatibility check.

Additionally, this routine will verify that the input matrix Ais allocated with

appropriate upper bandwidth storage for the LU factorization.

Deprecated Name For backward compatibility, the wrapper function SUNBandLinearSolver with

idential input and output arguments is also provided.

F2003 Name This function is callable as FSUNLinSol Band when using the Fortran 2003 interface

module.

9.6 The SUNLinearSolver Band implementation 253

The sunlinsol band module deﬁnes band implementations of all “direct” linear solver operations

listed in Sections 9.1.1 –9.1.3:

•SUNLinSolGetType Band

•SUNLinSolInitialize Band – this does nothing, since all consistency checks are performed at

solver creation.

•SUNLinSolSetup Band – this performs the LU factorization.

•SUNLinSolSolve Band – this uses the LU factors and pivots array to perform the solve.

•SUNLinSolLastFlag Band

•SUNLinSolSpace Band – this only returns information for the storage within the solver object,

i.e. storage for N,last flag, and pivots.

•SUNLinSolFree Band

All of the listed operations are callable via the Fortran 2003 interface module by prepending an ‘F’

to the function name.

9.6.3 SUNLinearSolver Band Fortran interfaces

The sunlinsol band module provides a Fortran 2003 module as well as Fortran 77 style interface

functions for use from Fortran applications.

FORTRAN 2003 interface module

The fsunlinsol band mod Fortran module deﬁnes interfaces to all sunlinsol band C functions

using the intrinsic iso c binding module which provides a standardized mechanism for interoperat-

ing with C. As noted in the Cfunction descriptions above, the interface functions are named after

the corresponding Cfunction, but with a leading ‘F’. For example, the function SUNLinSol Band is

interfaced as FSUNLinSol Band.

The Fortran 2003 sunlinsol band interface module can be accessed with the use statement,

i.e. use fsunlinsol band mod, and linking to the library libsundials fsunlinsolband mod.lib in

addition to the Clibrary. For details on where the library and module ﬁle fsunlinsol band mod.mod

are installed see Appendix A. We note that the module is accessible from the Fortran 2003 sundials

integrators without separately linking to the libsundials fsunlinsolband mod library.

FORTRAN 77 interface functions

For solvers that include a Fortran 77 interface module, the sunlinsol band module also includes

a Fortran-callable function for creating a SUNLinearSolver object.

FSUNBANDLINSOLINIT

Call FSUNBANDLINSOLINIT(code, ier)

Description The function FSUNBANDLINSOLINIT can be called for Fortran programs to create a band

SUNLinearSolver object.

Arguments code (int*) is an integer input specifying the solver id (1 for cvode, 2 for ida, 3 for

kinsol, and 4 for arkode).

Return value ier is a return completion ﬂag equal to 0for a success return and -1 otherwise. See

printed message for details in case of failure.

Notes This routine must be called after both the nvector and sunmatrix objects have been

initialized.

Additionally, when using arkode with a non-identity mass matrix, the sunlinsol band module

includes a Fortran-callable function for creating a SUNLinearSolver mass matrix solver object.

254 Description of the SUNLinearSolver module

FSUNMASSBANDLINSOLINIT

Call FSUNMASSBANDLINSOLINIT(ier)

Description The function FSUNMASSBANDLINSOLINIT can be called for Fortran programs to create a

band SUNLinearSolver object for mass matrix linear systems.

Arguments None

Return value ier is a int return completion ﬂag equal to 0for a success return and -1 otherwise.

See printed message for details in case of failure.

Notes This routine must be called after both the nvector and sunmatrix mass-matrix

objects have been initialized.

9.6.4 SUNLinearSolver Band content

The sunlinsol band module deﬁnes the content ﬁeld of a SUNLinearSolver as the following struc-

ture:

struct _SUNLinearSolverContent_Band {

sunindextype N;

sunindextype *pivots;

long int last_flag;

};

These entries of the content ﬁeld contain the following information:

N- size of the linear system,

pivots - index array for partial pivoting in LU factorization,

last flag - last error return ﬂag from internal function evaluations.

9.7 The SUNLinearSolver LapackDense implementation

This section describes the sunlinsol implementation for solving dense linear systems with LA-

PACK. The sunlinsol lapackdense module is designed to be used with the corresponding sunma-

trix dense matrix type, and one of the serial or shared-memory nvector implementations (nvec-

tor serial,nvector openmp, or nvector pthreads).

To access the sunlinsol lapackdense module, include the header ﬁle

sunlinsol/sunlinsol lapackdense.h. The installed module library to link to is

libsundials sunlinsollapackdense.lib where .lib is typically .so for shared libraries and .a for

static libraries.

The sunlinsol lapackdense module is a sunlinsol wrapper for the LAPACK dense matrix

factorization and solve routines, *GETRF and *GETRS, where *is either Dor S, depending on whether

sundials was conﬁgured to have realtype set to double or single, respectively (see Section 4.2).

In order to use the sunlinsol lapackdense module it is assumed that LAPACK has been installed

on the system prior to installation of sundials, and that sundials has been conﬁgured appropriately

to link with LAPACK (see Appendix Afor details). We note that since there do not exist 128-bit

ﬂoating-point factorization and solve routines in LAPACK, this interface cannot be compiled when

using extended precision for realtype. Similarly, since there do not exist 64-bit integer LAPACK

routines, the sunlinsol lapackdense module also cannot be compiled when using 64-bit integers

for the sunindextype.

9.7.1 SUNLinearSolver LapackDense description

This solver is constructed to perform the following operations:

9.7 The SUNLinearSolver LapackDense implementation 255

•The “setup” call performs a LU factorization with partial (row) pivoting (O(N3) cost), P A =

LU, where Pis a permutation matrix, Lis a lower triangular matrix with 1’s on the diago-

nal, and Uis an upper triangular matrix. This factorization is stored in-place on the input

sunmatrix dense object A, with pivoting information encoding Pstored in the pivots array.

•The “solve” call performs pivoting and forward and backward substitution using the stored

pivots array and the LU factors held in the sunmatrix dense object (O(N2) cost).

9.7.2 SUNLinearSolver LapackDense functions

The sunlinsol lapackdense module provides the following user-callable constructor for creating a

SUNLinearSolver object.

SUNLinSol LapackDense

Call LS = SUNLinSol LapackDense(y, A);

Description The function SUNLinSol LapackDense creates and allocates memory for a LAPACK-

based, dense SUNLinearSolver object.

Arguments y(N Vector) a template for cloning vectors needed within the solver

A(SUNMatrix) a sunmatrix dense matrix template for cloning matrices needed

within the solver

Return value This returns a SUNLinearSolver object. If either Aor yare incompatible then this

routine will return NULL.

Notes This routine will perform consistency checks to ensure that it is called with con-

sistent nvector and sunmatrix implementations. These are currently limited to

the sunmatrix dense matrix type and the nvector serial,nvector openmp,

and nvector pthreads vector types. As additional compatible matrix and vec-

tor implementations are added to sundials, these will be included within this

compatibility check.

Deprecated Name For backward compatibility, the wrapper function SUNLapackDense with idential

input and output arguments is also provided.

The sunlinsol lapackdense module deﬁnes dense implementations of all “direct” linear solver

operations listed in Sections 9.1.1 –9.1.3:

•SUNLinSolGetType LapackDense

•SUNLinSolInitialize LapackDense – this does nothing, since all consistency checks are per-

formed at solver creation.

•SUNLinSolSetup LapackDense – this calls either DGETRF or SGETRF to perform the LU factor-

ization.

•SUNLinSolSolve LapackDense – this calls either DGETRS or SGETRS to use the LU factors and

pivots array to perform the solve.

•SUNLinSolLastFlag LapackDense

•SUNLinSolSpace LapackDense – this only returns information for the storage within the solver

object, i.e. storage for N,last flag, and pivots.

•SUNLinSolFree LapackDense

9.7.3 SUNLinearSolver LapackDense Fortran interfaces

For solvers that include a Fortran 77 interface module, the sunlinsol lapackdense module also

includes a Fortran-callable function for creating a SUNLinearSolver object.

256 Description of the SUNLinearSolver module

FSUNLAPACKDENSEINIT

Call FSUNLAPACKDENSEINIT(code, ier)

Description The function FSUNLAPACKDENSEINIT can be called for Fortran programs to create a

LAPACK-based dense SUNLinearSolver object.

Arguments code (int*) is an integer input specifying the solver id (1 for cvode, 2 for ida, 3 for

kinsol, and 4 for arkode).

Return value ier is a return completion ﬂag equal to 0for a success return and -1 otherwise. See

printed message for details in case of failure.

Notes This routine must be called after both the nvector and sunmatrix objects have been

initialized.

Additionally, when using arkode with a non-identity mass matrix, the sunlinsol lapackdense

module includes a Fortran-callable function for creating a SUNLinearSolver mass matrix solver ob-

ject.

FSUNMASSLAPACKDENSEINIT

Call FSUNMASSLAPACKDENSEINIT(ier)

Description The function FSUNMASSLAPACKDENSEINIT can be called for Fortran programs to create

a LAPACK-based, dense SUNLinearSolver object for mass matrix linear systems.

Arguments None

Return value ier is a int return completion ﬂag equal to 0for a success return and -1 otherwise.

See printed message for details in case of failure.

Notes This routine must be called after both the nvector and sunmatrix mass-matrix

objects have been initialized.

9.7.4 SUNLinearSolver LapackDense content

The sunlinsol lapackdense module deﬁnes the content ﬁeld of a SUNLinearSolver as the following

structure:

struct _SUNLinearSolverContent_Dense {

sunindextype N;

sunindextype *pivots;

long int last_flag;

};

These entries of the content ﬁeld contain the following information:

N- size of the linear system,

pivots - index array for partial pivoting in LU factorization,

last flag - last error return ﬂag from internal function evaluations.

9.8 The SUNLinearSolver LapackBand implementation

This section describes the sunlinsol implementation for solving banded linear systems with LA-

PACK. The sunlinsol lapackband module is designed to be used with the corresponding sunma-

trix band matrix type, and one of the serial or shared-memory nvector implementations (nvec-

tor serial,nvector openmp, or nvector pthreads).

To access the sunlinsol lapackband module, include the header ﬁle

sunlinsol/sunlinsol lapackband.h. The installed module library to link to is

libsundials sunlinsollapackband.lib where .lib is typically .so for shared libraries and .a for

static libraries.

9.8 The SUNLinearSolver LapackBand implementation 257

The sunlinsol lapackband module is a sunlinsol wrapper for the LAPACK band matrix

factorization and solve routines, *GBTRF and *GBTRS, where *is either Dor S, depending on whether

sundials was conﬁgured to have realtype set to double or single, respectively (see Section 4.2).

In order to use the sunlinsol lapackband module it is assumed that LAPACK has been installed

on the system prior to installation of sundials, and that sundials has been conﬁgured appropriately

to link with LAPACK (see Appendix Afor details). We note that since there do not exist 128-bit

ﬂoating-point factorization and solve routines in LAPACK, this interface cannot be compiled when

using extended precision for realtype. Similarly, since there do not exist 64-bit integer LAPACK

routines, the sunlinsol lapackband module also cannot be compiled when using 64-bit integers for

the sunindextype.

9.8.1 SUNLinearSolver LapackBand description