Introduction
- Philosophy
- Acknowledgments
- Essential literature
- Installation
- What is a fieldstone?
- Why the Finite Element method?
- Notations
List of tutorials
The physical equations
- The heat transport equation - energy conservation equation
- The momentum conservation equations
- The mass conservation equations
- The equations in ASPECT manual
- the Boussinesq approximation: an Incompressible flow
- Stokes equation for elastic medium
- The strain rate tensor in all coordinate systems
- Boundary conditions
  - The Stokes equations
  - The heat transport equation
- Meaningful physical quantities
The building blocks of the Finite Element Method
- Numerical integration
- The mesh
- A bit of FE terminology
- Elements and basis functions in 1D
- Elements and basis functions in 2D
- Elements and basis functions in 3D
  - Linear basis functions in tetrahedra (P1)
  - Triquadratic basis functions in 3D (Q2)
Solving the flow equations with the FEM
- strong and weak forms
- Which velocity-pressure pair for Stokes?
- Other elements
- The penalty approach for viscous flow
- The mixed FEM for viscous flow
  - in three dimensions
  - Going from 3D to 2D
- Solving the elastic equations
- Solving the heat transport equation
Additional techniques and features
- Picard and Newton
- The SUPG formulation for the energy equation
- Tracking materials and/or interfaces
- Dealing with a free surface
- Convergence criterion for nonlinear iterations
- Static condensation
- The method of manufactured solutions
  - Analytical benchmark I
  - Analytical benchmark II
  - Analytical benchmark III
  - Analytical benchmark IV
- Assigning values to quadrature points
- Matrix (Sparse) storage
  - 2D domain - One degree of freedom per node
  - 2D domain - Two degrees of freedom per node
  - in fieldstone
- Mesh generation
- Visco-Plasticity
  - Tensor invariants
  - Scalar viscoplasticity
  - about the yield stress value Y
- Pressure smoothing
- Pressure scaling
- Pressure normalisation
- The choice of solvers
  - The Schur complement approach
- The GMRES approach
- The consistent boundary flux (CBF)
  - applied to the Stokes equation
  - applied to the heat equation
  - implementation - Stokes equation
- The value of the timestep
- mappings
- Exporting data to vtk format
- Runge-Kutta methods
- Am I in or not?
  - Two-dimensional space
  - Three-dimensional space
- Error measurements and convergence rates
- The initial temperature field
  - Single layer with imposed temperature b.c.
- Single layer with imposed heat flux b.c.
- Single layer with imposed heat flux and temperature b.c.
  - Half cooling space
  - Plate model
  - McKenzie slab
- Kinematic boundary conditions
  - In-out flux boundary conditions for lithospheric models
fieldstone_01: simple analytical solution
fieldstone_02: Stokes sphere
fieldstone_03: Convection in a 2D box
fieldstone_04: The lid driven cavity
- the lid driven cavity problem (ldc=0)
- the lid driven cavity problem - regularisation I (ldc=1)
- the lid driven cavity problem - regularisation II (ldc=2)
fieldstone_05: SolCx benchmark
fieldstone_06: SolKz benchmark
fieldstone_07: SolVi benchmark
fieldstone_08: the indentor benchmark
fieldstone_09: the annulus benchmark
fieldstone_10: Stokes sphere (3D) - penalty
fieldstone_11: stokes sphere (3D) - mixed formulation
fieldstone_12: consistent pressure recovery
fieldstone_13: the Particle in Cell technique (1) - the effect of averaging
fieldstone_f14: solving the full saddle point problem
fieldstone_f15: saddle point problem with Schur complement approach - benchmark
fieldstone_f16: saddle point problem with Schur complement approach - Stokes sphere
fieldstone_17: solving the full saddle point problem in 3D
- Constant viscosity
  - Variable viscosity
fieldstone_18: solving the full saddle point problem with Q2Q1 elements
fieldstone_19: solving the full saddle point problem with Q3Q2 elements
fieldstone_20: the Busse benchmark
fieldstone_21: The non-conforming Q1 P0 element
fieldstone_22: The stabilised Q1 Q1 element
fieldstone_23: compressible flow (1) - analytical benchmark
fieldstone_24: compressible flow (2) - convection box
- The physics
- The numerics
- The experimental setup
- Scaling
- Conservation of energy 1
  - under BA and EBA approximations
  - under no approximation at all
- Conservation of energy 2
- The problem of the onset of convection
- results - BA - Ra=104
- results - BA - Ra=105
- results - BA - Ra=106
- results - EBA - Ra=104
- results - EBA - Ra=105
- Onset of convection
fieldstone_25: Rayleigh-Taylor instability (1) - instantaneous
fieldstone_26: Slab detachment benchmark (1) - instantaneous
fieldstone_27: Consistent Boundary Flux
fieldstone_28: convection 2D box - Tosi et al, 2015
- Case 0: Newtonian case, a la Blankenbach et al., 1989
fieldstone_29: open boundary conditions
fieldstone_30: conservative velocity interpolation
- Couette flow
- SolCx
- Streamline flow
fieldstone_31: conservative velocity interpolation 3D
fieldstone_32: 2D analytical sol. from stream function
- Background theory
- A simple application
fieldstone_33: Convection in an annulus
fieldstone_34: the Cartesian geometry elastic aquarium
fieldstone_35: 2D analytical sol. in annulus from stream function
- Linking with our paper
- No slip boundary conditions
- Free slip boundary conditions
fieldstone_36: the annulus geometry elastic aquarium
fieldstone_37: marker advection and population control
fieldstone_38: Critical Rayleigh number
fieldstone: Gravity: buried sphere
Projects for guided research/MSc thesis
Three-dimensional applications

The Finite Element Method in Geodynamics

C. Thieulot

April 8, 2019

WARNING: this is work in progress

1 Introduction

1.1 Philosophy

This document was written with my students in mind, i.e. 3rd and 4th year Geology/Geophysics students at Utrecht University. I have chosen to use jargon as little as possible unless it is a term that is commonly found in the geodynamics literature (methods papers as well as application papers). There is no mathematical proof of any theorem or statement I make. These are to be found in generic Numerical Analysis, Finite Element and Linear Algebra books.

The codes I provide here are by no means optimised as I value code readability over code efficiency. I have also chosen to avoid resorting to multiple code files or even functions to favour a sequential reading of the codes. These codes are not designed to form the basis of a real life application: existing open source highly optimised codes should be preferred, such as ASPECT [79, 65], CITCOM, LAMEM, PTATIN, PYLITH, ...

All kinds of feedback are welcome on the text (grammar, typos, ...) or on the code(s). You will have my eternal gratitude if you wish to contribute an example, a benchmark, a cookbook.

All the python scripts and this document are freely available at

https://github.com/cedrict/fieldstone

1.2 Acknowledgments

I have benefitted from many discussions, lectures, tutorials, coffee machine discussions, debugging sessions, conference poster sessions, etc. over the years. I wish to name these instrumental people in particular and in alphabetic order: Wolfgang Bangerth, Jean Braun, Rens Elbertsen, Philippe Fullsack, Menno Fraters, Anne Glerum, Timo Heister, Robert Myhill, John Naliboff, E. Gerry Puckett, Lukas van de Wiel, Arie van den Berg, Tom Weir, and the whole ASPECT family/team.

1.3 Essential literature

http://www-udc.ig.utexas.edu/external/becker/Geodynamics557.pdf

1.4 Installation

python3.6 -m pip install --user numpy scipy matplotlib

1.5 What is a fieldstone?

Simply put, it is stone collected from the surface of fields where it occurs naturally. It also stands for the bad acronym: finite element deformation of stones, which echoes the primary application of these codes: geodynamic modelling.

1.6 Why the Finite Element method?

There are other methods, such as the Discrete Element Method (DEM) [46, 47, 55], or the Element Free

Galerkin Method (EFGM) [63].

1.7 Notations

Scalars such as temperature, density, pressure, etc. are simply obtained in LaTeX by using the math mode: $T$, $\rho$, $p$. Although it is common to lump vectors and matrices/tensors together by using bold fonts, I have decided in the interest of clarity to distinguish between those: vectors are denoted by an arrow atop the quantity, e.g. $\vec{\nu}$, $\vec{g}$, while matrices and tensors are in bold, e.g. $\boldsymbol{M}$, $\boldsymbol{\sigma}$, etc.

Also I use the $\cdot$ notation between two vectors to denote a dot product $\vec{u}\cdot\vec{v} = u_i v_i$ or a matrix-vector multiplication $\boldsymbol{M}\cdot\vec{a} = M_{ij} a_j$. If there is no $\cdot$ between vectors, it means that the result $\vec{a}\,\vec{b} = a_i b_j$ is a matrix.
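The three products described in this section map onto numpy operations as follows. This is only a small sketch: the arrays `u`, `v` and `M` are arbitrary illustrative values, not quantities taken from any of the fieldstone codes.

```python
import numpy as np

# Arbitrary illustrative vectors and matrix (assumed values).
u = np.array([1.0, 2.0, 3.0])
v = np.array([4.0, 5.0, 6.0])
M = np.arange(9.0).reshape(3, 3)

# Dot product between two vectors: u_i v_i (a scalar).
dot_uv = np.dot(u, v)

# Matrix-vector multiplication: M_ij v_j (a vector).
Mv = M @ v

# No dot between the two vectors: u_i v_j (a matrix).
uv = np.outer(u, v)

print(dot_uv, Mv.shape, uv.shape)
```

Using `np.outer` explicitly for the dyadic product avoids relying on broadcasting rules and keeps the index notation visible in the code.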

2 List of tutorials

A † in the last column marks one of the following properties: temperature, time stepping, nonlinear, compressible, analytical benchmark, numerical benchmark, elastomechanics.

tutorial | element | outer solver | formulation | physical problem | †
1 | Q1×P0 | | penalty | analytical benchmark | †
2 | Q1×P0 | | penalty | Stokes sphere |
3 | Q1×P0 | | penalty | Blankenbach et al., 1989 | † †
4 | Q1×P0 | | penalty | Lid driven cavity |
5 | Q1×P0 | | penalty | SolCx benchmark |
6 | Q1×P0 | | penalty | SolKz benchmark |
7 | Q1×P0 | | penalty | SolVi benchmark |
8 | Q1×P0 | | penalty | Indentor | †
9 | Q1×P0 | | penalty | annulus benchmark |
10 | Q1×P0 | | penalty | Stokes sphere | †
11 | Q1×P0 | full matrix | mixed | Stokes sphere | †
12 | Q1×P0 | | penalty | analytical benchmark + consistent press recovery |
13 | Q1×P0 | | penalty | Stokes sphere + markers averaging |
14 | Q1×P0 | full matrix | mixed | analytical benchmark |
15 | Q1×P0 | Schur comp. CG | mixed | analytical benchmark |
16 | Q1×P0 | Schur comp. PCG | mixed | Stokes sphere |
17 | Q2×Q1 | full matrix | mixed | Burstedde benchmark | †
18 | Q2×Q1 | full matrix | mixed | analytical benchmark |
19 | Q3×Q2 | full matrix | mixed | analytical benchmark |
20 | Q1×P0 | | penalty | Busse et al., 1993 | † † †
21 | Q1×P0 R-T | | penalty | analytical benchmark |
22 | Q1×Q1-stab | full matrix | mixed | analytical benchmark |
23 | Q1×P0 | | mixed | analytical benchmark | †
24 | Q1×P0 | | mixed | convection box | † † †
25 | Q1×P0 | full matrix | mixed | Rayleigh-Taylor instability |
26 | Q1×P0 | full matrix | mixed | Slab detachment | †
27 | Q1×P0 | full matrix | mixed | CBF benchmarks | † †
28 | Q1×P0 | full matrix | mixed | Tosi et al, 2015 | † † † †
29 | Q1×P0 | full matrix | mixed | Open Boundary conditions | †
30 | Q1, Q2 | X | X | Cons. Vel. Interp (cvi) | † †
31 | Q1, Q2 | X | X | Cons. Vel. Interp (cvi) | † † †
32 | Q1×P0 | full matrix | mixed | analytical benchmark | †
33 | Q1×P0 | | penalty | convection in annulus | † † †
34 | Q1 | | | elastic Cartesian aquarium | † † †
36 | Q1 | | | elastic annulus aquarium | † † †
37 | Q1, Q2 | X | X | population control, bmw test | † †
38 | | | | Critical Rayleigh number |

Analytical benchmark means that an analytical solution exists while numerical benchmark means that a comparison with other code(s) has been carried out.

3 The physical equations

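Symbol | meaning | unit
$t$ | time | s
$x, y, z$ | Cartesian coordinates | m
$r, \theta$ | polar coordinates | m, -
$r, \theta, z$ | cylindrical coordinates | m, -, m
$r, \theta, \phi$ | spherical coordinates | m, -, -
$\vec{\nu}$ | velocity vector | m·s⁻¹
$\rho$ | mass density | kg/m³
$\eta$ | dynamic viscosity | Pa·s
$\lambda$ | penalty parameter | Pa·s
$T$ | temperature | K
$\vec{\nabla}$ | gradient operator | m⁻¹
$\vec{\nabla}\cdot$ | divergence operator | m⁻¹
$p$ | pressure | Pa
$\dot{\boldsymbol{\varepsilon}}(\vec{\nu})$ | strain rate tensor | s⁻¹
$\alpha$ | thermal expansion coefficient | K⁻¹
$k$ | thermal conductivity | W/(m·K)
$C_p$ | specific heat capacity | J/(kg·K)
$H$ | intrinsic specific heat production | W/kg
$\beta_T$ | isothermal compressibility | Pa⁻¹
$\boldsymbol{\tau}$ | deviatoric stress tensor | Pa
$\boldsymbol{\sigma}$ | full stress tensor | Pa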

3.1 The heat transport equation - energy conservation equation

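Let us start from the heat transport equation as shown in Schubert, Turcotte and Olson [119]:

$$\rho C_p \frac{DT}{Dt} - \alpha T \frac{Dp}{Dt} = \vec{\nabla}\cdot k\vec{\nabla}T + \Phi + \rho H$$

with $D/Dt$ being the total derivatives so that

$$\frac{DT}{Dt} = \frac{\partial T}{\partial t} + \vec{\nu}\cdot\vec{\nabla}T \qquad \frac{Dp}{Dt} = \frac{\partial p}{\partial t} + \vec{\nu}\cdot\vec{\nabla}p$$

Solving for temperature, this equation is often rewritten as follows:

$$\rho C_p \frac{DT}{Dt} - \vec{\nabla}\cdot k\vec{\nabla}T = \alpha T \frac{Dp}{Dt} + \Phi + \rho H$$

A note on the shear heating term $\Phi$: in many publications, $\Phi$ is given by $\Phi = \tau_{ij}\partial_j u_i = \boldsymbol{\tau}:\vec{\nabla}\vec{\nu}$.

$$\begin{aligned}
\Phi &= \tau_{ij}\,\partial_j u_i \\
&= 2\eta\,\dot{\varepsilon}^d_{ij}\,\partial_j u_i \\
&= 2\eta\,\frac{1}{2}\left(\dot{\varepsilon}^d_{ij}\,\partial_j u_i + \dot{\varepsilon}^d_{ji}\,\partial_i u_j\right) \\
&= 2\eta\,\frac{1}{2}\left(\dot{\varepsilon}^d_{ij}\,\partial_j u_i + \dot{\varepsilon}^d_{ij}\,\partial_i u_j\right) \\
&= 2\eta\,\dot{\varepsilon}^d_{ij}\,\frac{1}{2}\left(\partial_j u_i + \partial_i u_j\right) \\
&= 2\eta\,\dot{\varepsilon}^d_{ij}\,\dot{\varepsilon}_{ij} \\
&= 2\eta\,\dot{\boldsymbol{\varepsilon}}^d:\dot{\boldsymbol{\varepsilon}} \\
&= 2\eta\,\dot{\boldsymbol{\varepsilon}}^d:\left(\dot{\boldsymbol{\varepsilon}}^d + \frac{1}{3}(\vec{\nabla}\cdot\vec{\nu})\boldsymbol{1}\right) \\
&= 2\eta\,\dot{\boldsymbol{\varepsilon}}^d:\dot{\boldsymbol{\varepsilon}}^d + \frac{2}{3}\eta\,(\dot{\boldsymbol{\varepsilon}}^d:\boldsymbol{1})\,(\vec{\nabla}\cdot\vec{\nu}) \\
&= 2\eta\,\dot{\boldsymbol{\varepsilon}}^d:\dot{\boldsymbol{\varepsilon}}^d
\end{aligned} \tag{1}$$

The last step uses the fact that the deviatoric strain rate tensor is traceless, i.e. $\dot{\boldsymbol{\varepsilon}}^d:\boldsymbol{1} = \mathrm{tr}(\dot{\boldsymbol{\varepsilon}}^d) = 0$.

Finally, written out with the two-dimensional components:

$$\Phi = \boldsymbol{\tau}:\vec{\nabla}\vec{\nu} = 2\eta\,\dot{\boldsymbol{\varepsilon}}^d:\dot{\boldsymbol{\varepsilon}}^d = 2\eta\left((\dot{\varepsilon}^d_{xx})^2 + (\dot{\varepsilon}^d_{yy})^2 + 2(\dot{\varepsilon}^d_{xy})^2\right)$$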
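The identity $\boldsymbol{\tau}:\vec{\nabla}\vec{\nu} = 2\eta\,\dot{\boldsymbol{\varepsilon}}^d:\dot{\boldsymbol{\varepsilon}}^d$ can be checked numerically. The sketch below assumes a 3x3 velocity gradient stored as `grad_v[i, j]` = $\partial_j u_i$; the function names, the random gradient and the viscosity value are illustrative assumptions, not part of the fieldstone codes.

```python
import numpy as np

def deviatoric_strain_rate(grad_v):
    """Return the deviatoric strain rate of a 3x3 velocity gradient.

    Assumed convention: grad_v[i, j] = d u_i / d x_j.
    """
    strain_rate = 0.5 * (grad_v + grad_v.T)
    divergence = np.trace(grad_v)
    return strain_rate - divergence / 3.0 * np.eye(3)

def shear_heating(grad_v, eta):
    """Compute Phi in two ways: tau_ij d_j u_i and 2 eta eps^d : eps^d."""
    dev = deviatoric_strain_rate(grad_v)
    tau = 2.0 * eta * dev
    phi_from_tau = np.sum(tau * grad_v)            # tau : grad(v)
    phi_from_dev = 2.0 * eta * np.sum(dev * dev)   # 2 eta eps^d : eps^d
    return phi_from_tau, phi_from_dev

# Arbitrary (compressible) velocity gradient and viscosity, for illustration.
rng = np.random.default_rng(0)
grad_v = rng.standard_normal((3, 3))
eta = 1.0e21
phi_a, phi_b = shear_heating(grad_v, eta)
print(np.isclose(phi_a, phi_b))
```

Both routes agree even when the velocity gradient has a non-zero trace, because the deviatoric tensor is symmetric and traceless.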

3.2 The momentum conservation equations

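Because the Prandtl number is virtually infinite in Earth science applications (inertial terms are negligible), the Navier-Stokes equations reduce to the Stokes equation:

$$\vec{\nabla}\cdot\boldsymbol{\sigma} + \rho\vec{g} = \vec{0}$$

Since

$$\boldsymbol{\sigma} = -p\boldsymbol{1} + \boldsymbol{\tau}$$

it also writes

$$-\vec{\nabla}p + \vec{\nabla}\cdot\boldsymbol{\tau} + \rho\vec{g} = \vec{0}$$

Using the relationship $\boldsymbol{\tau} = 2\eta\dot{\boldsymbol{\varepsilon}}^d$ we arrive at

$$-\vec{\nabla}p + \vec{\nabla}\cdot(2\eta\dot{\boldsymbol{\varepsilon}}^d) + \rho\vec{g} = \vec{0}$$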

3.3 The mass conservation equations

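The mass conservation equation is given by

$$\frac{D\rho}{Dt} + \rho\,\vec{\nabla}\cdot\vec{\nu} = 0$$

or,

$$\frac{\partial\rho}{\partial t} + \vec{\nabla}\cdot(\rho\vec{\nu}) = 0$$

In the case of an incompressible flow, then $\partial\rho/\partial t = 0$ and $\vec{\nabla}\rho = \vec{0}$, i.e. $D\rho/Dt = 0$, and the remaining equation is simply:

$$\vec{\nabla}\cdot\vec{\nu} = 0$$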
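As an illustration of the incompressibility constraint, the following sketch evaluates the divergence of a velocity field on a regular grid with finite differences. The velocity field and the grid resolution are assumptions chosen for this example (the field is divergence-free by construction), and finite differences are used only to keep the check short.

```python
import numpy as np

# Regular grid on the unit square (resolution is an arbitrary choice).
n = 101
x = np.linspace(0.0, 1.0, n)
y = np.linspace(0.0, 1.0, n)
X, Y = np.meshgrid(x, y, indexing="ij")

# Illustrative velocity field, divergence-free by construction.
u = np.sin(np.pi * X) * np.cos(np.pi * Y)
v = -np.cos(np.pi * X) * np.sin(np.pi * Y)

# Finite-difference derivatives: axis 0 is x, axis 1 is y.
du_dx = np.gradient(u, x, axis=0)
dv_dy = np.gradient(v, y, axis=1)
divergence = du_dx + dv_dy

# Ignore the boundary rows/columns, where one-sided differences are used.
max_interior_divergence = np.max(np.abs(divergence[1:-1, 1:-1]))
print(max_interior_divergence)
```

The printed value should be small compared with the magnitude of the individual derivatives, which are of order $\pi$ for this field.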

3.4 The equations in ASPECT manual

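The following is taken from the ASPECT manual. We focus on the system of equations in a $d=2$- or $d=3$-dimensional domain $\Omega$ that describes the motion of a highly viscous fluid driven by differences in the gravitational force due to a density that depends on the temperature. In the following, we largely follow the exposition of this material in Schubert, Turcotte and Olson [119].

Specifically, we consider the following set of equations for velocity $\vec{\nu}$, pressure $p$ and temperature $T$:

$$-\vec{\nabla}\cdot\left[2\eta\left(\dot{\boldsymbol{\varepsilon}}(\vec{\nu}) - \frac{1}{3}(\vec{\nabla}\cdot\vec{\nu})\boldsymbol{1}\right)\right] + \vec{\nabla}p = \rho\vec{g} \quad \text{in } \Omega, \tag{2}$$

$$\vec{\nabla}\cdot(\rho\vec{\nu}) = 0 \quad \text{in } \Omega, \tag{3}$$

$$\begin{aligned}
\rho C_p\left(\frac{\partial T}{\partial t} + \vec{\nu}\cdot\vec{\nabla}T\right) - \vec{\nabla}\cdot k\vec{\nabla}T = {} & \rho H \\
& + 2\eta\left(\dot{\boldsymbol{\varepsilon}}(\vec{\nu}) - \frac{1}{3}(\vec{\nabla}\cdot\vec{\nu})\boldsymbol{1}\right):\left(\dot{\boldsymbol{\varepsilon}}(\vec{\nu}) - \frac{1}{3}(\vec{\nabla}\cdot\vec{\nu})\boldsymbol{1}\right) \\
& + \alpha T\left(\vec{\nu}\cdot\vec{\nabla}p\right) \quad \text{in } \Omega,
\end{aligned} \tag{4}$$

where $\dot{\boldsymbol{\varepsilon}}(\vec{\nu}) = \frac{1}{2}(\vec{\nabla}\vec{\nu} + \vec{\nabla}\vec{\nu}^T)$ is the symmetric gradient of the velocity (often called the strain rate).

In this set of equations, (2) and (3) represent the compressible Stokes equations in which $\vec{\nu} = \vec{\nu}(\vec{x}, t)$ is the velocity field and $p = p(\vec{x}, t)$ the pressure field. Both fields depend on space $\vec{x}$ and time $t$. Fluid flow is driven by the gravity force that acts on the fluid and that is proportional to both the density of the fluid and the strength of the gravitational pull.

Coupled to this Stokes system is equation (4) for the temperature field $T = T(\vec{x}, t)$ that contains heat conduction terms as well as advection with the flow velocity $\vec{\nu}$. The right hand side terms of this equation correspond to

• internal heat production, for example due to radioactive decay;

• friction (shear) heating;

• adiabatic compression of material.

In order to arrive at the set of equations that ASPECT solves, we need to

• neglect the $\partial p/\partial t$ term. WHY?

• neglect the $\partial\rho/\partial t$ term. WHY?

from the equations above.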

—————————————-

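Also, their definition of the shear heating term $\Phi$ is:

$$\Phi = k_B(\vec{\nabla}\cdot\vec{\nu})^2 + 2\eta\,\dot{\boldsymbol{\varepsilon}}^d:\dot{\boldsymbol{\varepsilon}}^d$$

For many fluids the bulk viscosity $k_B$ is very small and is often taken to be zero, an assumption known as the Stokes assumption: $k_B = \lambda + 2\eta/3 = 0$. Note that $\eta$ is the dynamic viscosity and $\lambda$ the second viscosity. Also,

$$\boldsymbol{\tau} = 2\eta\dot{\boldsymbol{\varepsilon}} + \lambda(\vec{\nabla}\cdot\vec{\nu})\boldsymbol{1}$$

but since $k_B = \lambda + 2\eta/3 = 0$, then $\lambda = -2\eta/3$ so

$$\boldsymbol{\tau} = 2\eta\dot{\boldsymbol{\varepsilon}} - \frac{2}{3}\eta(\vec{\nabla}\cdot\vec{\nu})\boldsymbol{1} = 2\eta\dot{\boldsymbol{\varepsilon}}^d$$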
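A short numerical check of the Stokes assumption can make the last equality concrete. In the sketch below the helper name `viscous_stress`, the unit viscosity and the random velocity gradient are assumptions made for illustration; the stress is built from $2\eta\dot{\boldsymbol{\varepsilon}} + \lambda(\vec{\nabla}\cdot\vec{\nu})\boldsymbol{1}$ with $\lambda = -2\eta/3$ and compared with $2\eta\dot{\boldsymbol{\varepsilon}}^d$.

```python
import numpy as np

def viscous_stress(strain_rate, eta, lam):
    """tau = 2 eta eps + lam (div v) 1, using div v = trace(eps) in 3D."""
    div_v = np.trace(strain_rate)
    return 2.0 * eta * strain_rate + lam * div_v * np.eye(3)

eta = 1.0                  # dynamic viscosity (illustrative value)
lam = -2.0 * eta / 3.0     # second viscosity under the Stokes assumption
bulk_viscosity = lam + 2.0 * eta / 3.0

# Arbitrary velocity gradient with non-zero divergence.
rng = np.random.default_rng(1)
grad_v = rng.standard_normal((3, 3))
strain_rate = 0.5 * (grad_v + grad_v.T)
dev = strain_rate - np.trace(strain_rate) / 3.0 * np.eye(3)

tau = viscous_stress(strain_rate, eta, lam)
print(np.allclose(tau, 2.0 * eta * dev), np.trace(tau))
```

With any other value of `lam` the trace of `tau` no longer vanishes, which is the contribution the bulk viscosity term accounts for.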

3.5 the Boussinesq approximation: an Incompressible flow

[from ASPECT manual] The Boussinesq approximation assumes that the density can be considered constant in all occurrences in the equations with the exception of the buoyancy term on the right hand side of (2). The primary result of this assumption is that the continuity equation (3) will now read

$$\vec{\nabla}\cdot\vec{\nu} = 0$$

This implies that the strain rate tensor is deviatoric. Under the Boussinesq approximation, the equations are much simplified:

$$-\vec{\nabla}\cdot\left[2\eta\dot{\boldsymbol{\varepsilon}}(\vec{\nu})\right] + \vec{\nabla}p = \rho\vec{g} \quad \text{in } \Omega, \tag{5}$$

$$\vec{\nabla}\cdot\vec{\nu} = 0 \quad \text{in } \Omega, \tag{6}$$

$$\rho_0 C_p\left(\frac{\partial T}{\partial t} + \vec{\nu}\cdot\vec{\nabla}T\right) - \vec{\nabla}\cdot k\vec{\nabla}T = \rho H \quad \text{in } \Omega \tag{7}$$

Note that all terms on the right hand side of the temperature equation have disappeared, with the exception of the source term.

3.6 Stokes equation for elastic medium

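What follows is mostly borrowed from the Becker & Kaus lecture notes.

The strong form of the PDE that governs force balance in a medium is given by

$$\vec{\nabla}\cdot\boldsymbol{\sigma} + \vec{f} = \vec{0}$$

where $\boldsymbol{\sigma}$ is the stress tensor and $\vec{f}$ is a body force.

The stress tensor is related to the strain tensor through the generalised Hooke's law:

$$\sigma_{ij} = \sum_{kl} C_{ijkl}\,\epsilon_{kl} \tag{8}$$

where $C$ is the fourth-order elastic tensor. In the case of an isotropic material, this relationship simplifies to

$$\sigma_{ij} = \lambda\,\epsilon_{kk}\,\delta_{ij} + 2\mu\,\epsilon_{ij} \quad \text{or,} \quad \boldsymbol{\sigma} = \lambda(\vec{\nabla}\cdot\vec{u})\boldsymbol{1} + 2\mu\boldsymbol{\epsilon} \tag{9}$$

where $\lambda$ is the Lamé parameter and $\mu$ is the shear modulus¹. The term $\vec{\nabla}\cdot\vec{u}$ is the isotropic dilation.

The strain tensor is related to the displacement as follows:

$$\boldsymbol{\epsilon} = \frac{1}{2}\left(\vec{\nabla}\vec{u} + \vec{\nabla}\vec{u}^T\right)$$

The incompressibility (bulk modulus), $K$, is defined as $p = -K\,\vec{\nabla}\cdot\vec{u}$ where $p$ is the pressure with

$$\begin{aligned}
p &= -\frac{1}{3}\mathrm{Tr}(\boldsymbol{\sigma}) \\
&= -\frac{1}{3}\left[\lambda(\vec{\nabla}\cdot\vec{u})\,\mathrm{Tr}[\boldsymbol{1}] + 2\mu\,\mathrm{Tr}[\boldsymbol{\epsilon}]\right] \\
&= -\frac{1}{3}\left[\lambda(\vec{\nabla}\cdot\vec{u})\,3 + 2\mu(\vec{\nabla}\cdot\vec{u})\right] \\
&= -\left[\lambda + \frac{2}{3}\mu\right](\vec{\nabla}\cdot\vec{u})
\end{aligned} \tag{10}$$

so that $K = \lambda + \frac{2}{3}\mu$.

Remark: Eq. (8) and (9) are analogous to the ones that one has to solve in the context of viscous flow using the penalty method. In this case $\lambda$ is the penalty coefficient, $\vec{u}$ is the velocity, and $\mu$ is then the dynamic viscosity.

The Lamé parameter and the shear modulus are also linked to $\nu$, the Poisson ratio, and $E$, Young's modulus:

$$\lambda = \mu\frac{2\nu}{1-2\nu} = \frac{\nu E}{(1+\nu)(1-2\nu)} \quad \text{with} \quad E = 2\mu(1+\nu)$$

The shear modulus, often expressed in GPa, describes the material's response to shear stress. The Poisson ratio describes the response in the direction orthogonal to uniaxial stress. The Young modulus, expressed in GPa, describes the material's strain response to uniaxial stress in the direction of this stress.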
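The relations between the elastic moduli given above can be wrapped in a small helper. This is a sketch only: the function name `lame_from_young_poisson` and the input values (a Young's modulus of 70 GPa and a Poisson ratio of 0.25) are assumptions chosen for illustration.

```python
def lame_from_young_poisson(E, nu):
    """Return (lambda, mu, K) from Young's modulus E and Poisson ratio nu.

    Uses E = 2 mu (1 + nu), lambda = nu E / ((1 + nu)(1 - 2 nu))
    and K = lambda + 2 mu / 3.
    """
    mu = E / (2.0 * (1.0 + nu))
    lam = nu * E / ((1.0 + nu) * (1.0 - 2.0 * nu))
    K = lam + 2.0 * mu / 3.0
    return lam, mu, K

# Illustrative (assumed) values in Pa and dimensionless.
E = 70.0e9
nu = 0.25
lam, mu, K = lame_from_young_poisson(E, nu)
print(lam / 1e9, mu / 1e9, K / 1e9)  # moduli in GPa
```

Note that the expression for $\lambda$ diverges as $\nu$ approaches 0.5, which is the incompressible limit and the reason the penalty analogy mentioned in the remark requires a large $\lambda$.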

¹ It is also sometimes written $G$.

3.7 The strain rate tensor in all coordinate systems

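The strain rate tensor $\dot{\boldsymbol{\varepsilon}}$ is given by

$$\dot{\boldsymbol{\varepsilon}} = \frac{1}{2}\left(\vec{\nabla}\vec{\nu} + \vec{\nabla}\vec{\nu}^T\right) \tag{11}$$

3.7.1 Cartesian coordinates

$$\dot{\varepsilon}_{xx} = \frac{\partial u}{\partial x} \tag{12}$$

$$\dot{\varepsilon}_{yy} = \frac{\partial v}{\partial y} \tag{13}$$

$$\dot{\varepsilon}_{zz} = \frac{\partial w}{\partial z} \tag{14}$$

$$\dot{\varepsilon}_{yx} = \dot{\varepsilon}_{xy} = \frac{1}{2}\left(\frac{\partial u}{\partial y} + \frac{\partial v}{\partial x}\right) \tag{15}$$

$$\dot{\varepsilon}_{zx} = \dot{\varepsilon}_{xz} = \frac{1}{2}\left(\frac{\partial u}{\partial z} + \frac{\partial w}{\partial x}\right) \tag{16}$$

$$\dot{\varepsilon}_{zy} = \dot{\varepsilon}_{yz} = \frac{1}{2}\left(\frac{\partial v}{\partial z} + \frac{\partial w}{\partial y}\right) \tag{17}$$

3.7.2 Polar coordinates

$$\dot{\varepsilon}_{rr} = \frac{\partial v_r}{\partial r} \tag{18}$$

$$\dot{\varepsilon}_{\theta\theta} = \frac{v_r}{r} + \frac{1}{r}\frac{\partial v_\theta}{\partial\theta} \tag{19}$$

$$\dot{\varepsilon}_{\theta r} = \dot{\varepsilon}_{r\theta} = \frac{1}{2}\left(\frac{\partial v_\theta}{\partial r} - \frac{v_\theta}{r} + \frac{1}{r}\frac{\partial v_r}{\partial\theta}\right) \tag{20}$$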

3.7.3 Cylindrical coordinates

http://eml.ou.edu/equation/FLUIDS/STRAIN/STRAIN.HTM

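3.7.4 Spherical coordinates

$$\dot{\varepsilon}_{rr} = \frac{\partial v_r}{\partial r} \tag{21}$$

$$\dot{\varepsilon}_{\theta\theta} = \frac{v_r}{r} + \frac{1}{r}\frac{\partial v_\theta}{\partial\theta} \tag{22}$$

$$\dot{\varepsilon}_{\phi\phi} = \frac{1}{r\sin\theta}\frac{\partial v_\phi}{\partial\phi} + \frac{v_r}{r} + \frac{v_\theta\cot\theta}{r} \tag{23}$$

$$\dot{\varepsilon}_{\theta r} = \dot{\varepsilon}_{r\theta} = \frac{1}{2}\left(r\frac{\partial}{\partial r}\left(\frac{v_\theta}{r}\right) + \frac{1}{r}\frac{\partial v_r}{\partial\theta}\right) \tag{24}$$

$$\dot{\varepsilon}_{\phi r} = \dot{\varepsilon}_{r\phi} = \frac{1}{2}\left(\frac{1}{r\sin\theta}\frac{\partial v_r}{\partial\phi} + r\frac{\partial}{\partial r}\left(\frac{v_\phi}{r}\right)\right) \tag{25}$$

$$\dot{\varepsilon}_{\phi\theta} = \dot{\varepsilon}_{\theta\phi} = \frac{1}{2}\left(\frac{\sin\theta}{r}\frac{\partial}{\partial\theta}\left(\frac{v_\phi}{\sin\theta}\right) + \frac{1}{r\sin\theta}\frac{\partial v_\theta}{\partial\phi}\right) \tag{26}$$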

3.8 Boundary conditions

In mathematics, the Dirichlet (or ﬁrst-type) boundary condition is a type of boundary condition, named

after Peter Gustav Lejeune Dirichlet. When imposed on an ODE or PDE, it speciﬁes the values that a

solution needs to take on along the boundary of the domain. Note that a Dirichlet boundary condition

may also be referred to as a ﬁxed boundary condition.

The Neumann (or second-type) boundary condition is a type of boundary condition, named after Carl Neumann. When imposed on an ordinary or a partial differential equation, the condition specifies the values that the derivative of a solution takes on the boundary of the domain.

It is possible to describe the problem using other boundary conditions: a Dirichlet boundary condition

speciﬁes the values of the solution itself (as opposed to its derivative) on the boundary, whereas the Cauchy

boundary condition, mixed boundary condition and Robin boundary condition are all diﬀerent types of

combinations of the Neumann and Dirichlet boundary conditions.

3.8.1 The Stokes equations

You may ﬁnd the following terms in the computational geodynamics literature:

•free surface: this means that no force is acting on the surface, i.e. $\boldsymbol{\sigma}\cdot\vec{n} = \vec{0}$. It is usually used on the top boundary of the domain and allows for topography evolution.

•free slip: $\vec\nu\cdot\vec{n} = 0$ and $(\boldsymbol{\sigma}\cdot\vec{n})\times\vec{n} = \vec{0}$. This condition ensures a frictionless flow parallel to the boundary where it is prescribed.

•no slip: this means that the velocity (or displacement) is exactly zero on the boundary, i.e. $\vec\nu = \vec{0}$.

•prescribed velocity: $\vec\nu = \vec\nu_{bc}$

•stress b.c.:

•open b.c.: see fieldstone 29.

3.8.2 The heat transport equation

There are two types of boundary conditions for this equation: temperature boundary conditions (Dirichlet

boundary conditions) and heat ﬂux boundary conditions (Neumann boundary conditions).

3.9 Meaningful physical quantities

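•Velocity (m/s):

•Root mean square velocity (m/s):

$$\nu_{rms} = \left(\frac{\int_\Omega |\vec\nu|^2\,d\Omega}{\int_\Omega d\Omega}\right)^{1/2} = \left(\frac{1}{V_\Omega}\int_\Omega |\vec\nu|^2\,d\Omega\right)^{1/2} \tag{27}$$

In Cartesian coordinates, for a cuboid domain of size $L_x \times L_y \times L_z$, the $\nu_{rms}$ is simply given by:

$$\nu_{rms} = \left(\frac{1}{L_xL_yL_z}\int_0^{L_x}\int_0^{L_y}\int_0^{L_z}(u^2+v^2+w^2)\,dx\,dy\,dz\right)^{1/2} \tag{28}$$

In the case of an annulus domain, although calculations are carried out in Cartesian coordinates, it makes sense to look at the radial velocity component $v_r$ and the tangential velocity component $v_\theta$, and their respective root mean square averages:

$$v_r|_{rms} = \left(\frac{1}{V_\Omega}\int_\Omega v_r^2\,d\Omega\right)^{1/2} \tag{29}$$

$$v_\theta|_{rms} = \left(\frac{1}{V_\Omega}\int_\Omega v_\theta^2\,d\Omega\right)^{1/2} \tag{30}$$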

•Pressure (Pa):

•Stress (Pa):

•Strain (X):

•Strain rate (s−1):

•Rayleigh number (X):

•Prandtl number (X):

•Nusselt number (X): the Nusselt number (Nu) is the ratio of convective to conductive heat transfer across (normal to) the boundary. The conductive component is measured under the same conditions as the heat convection but with a (hypothetically) stagnant (or motionless) fluid.

In practice the Nusselt number Nu of a layer (typically the mantle of a planet) is defined as follows:

$$Nu = \frac{q}{q_c} \tag{31}$$

where $q$ is the heat transferred by convection while $q_c = k\Delta T/D$ is the amount of heat that would be conducted through a layer of thickness $D$ with a temperature difference $\Delta T$ across it, with $k$ being the thermal conductivity.

For 2D Cartesian systems of size $(L_x, L_y)$ the Nu is computed as [15]

$$Nu = \frac{\frac{1}{L_x}\int_0^{L_x} k\,\frac{\partial T}{\partial y}(x, y=L_y)\,dx}{-\frac{1}{L_x}\int_0^{L_x} k\,T(x, y=0)/L_y\,dx} = -L_y\,\frac{\int_0^{L_x}\frac{\partial T}{\partial y}(x, y=L_y)\,dx}{\int_0^{L_x} T(x, y=0)\,dx}$$

i.e. it is the mean surface temperature gradient over the mean bottom temperature.

finish. not happy with definition. Look at literature

Note that in the case when no convection takes place then the measured heat ﬂux at the top is the

one obtained from a purely conductive proﬁle which yields Nu=1.

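Note that a relationship $Ra \propto Nu^\alpha$ exists between the Rayleigh number Ra and the Nusselt number Nu in convective systems, see [142] and references therein.

Turning now to cylindrical geometries with inner radius $R_1$ and outer radius $R_2$, we define $f = R_1/R_2$. A small value of $f$ corresponds to a high degree of curvature. We assume now that $R_2 - R_1 = 1$, so that $R_2 = 1/(1-f)$ and $R_1 = f/(1-f)$. Following [72], the Nusselt numbers at the inner and outer boundaries are:

$$Nu_{inner} = \frac{f\ln f}{1-f}\,\frac{1}{2\pi}\int_0^{2\pi}\left(r\frac{\partial T}{\partial r}\right)_{r=R_1} d\theta \,\frac{1}{R_1} = \frac{f\ln f}{1-f}\,\frac{1}{2\pi}\int_0^{2\pi}\left(\frac{\partial T}{\partial r}\right)_{r=R_1} d\theta \tag{32}$$

$$Nu_{outer} = \frac{\ln f}{1-f}\,\frac{1}{2\pi}\int_0^{2\pi}\left(\frac{\partial T}{\partial r}\right)_{r=R_2} d\theta \tag{33}$$

Note that a conductive geotherm in such an annulus between temperatures $T_1$ and $T_2$ (here $T_1 = 1$ at $r = R_1$ and $T_2 = 0$ at $r = R_2$) is given by

$$T_c(r) = \frac{\ln(r/R_2)}{\ln(R_1/R_2)} = \frac{\ln(r(1-f))}{\ln f} \qquad\text{so that}\qquad \frac{\partial T_c}{\partial r} = \frac{1}{r}\frac{1}{\ln f}$$

We then find:

$$Nu_{inner} = \frac{f\ln f}{1-f}\,\frac{1}{2\pi}\int_0^{2\pi}\left(\frac{\partial T_c}{\partial r}\right)_{r=R_1} d\theta = \frac{f\ln f}{1-f}\,\frac{1}{R_1}\frac{1}{\ln f} = 1 \tag{34}$$

$$Nu_{outer} = \frac{\ln f}{1-f}\,\frac{1}{2\pi}\int_0^{2\pi}\left(\frac{\partial T_c}{\partial r}\right)_{r=R_2} d\theta = \frac{\ln f}{1-f}\,\frac{1}{R_2}\frac{1}{\ln f} = 1 \tag{35}$$

As expected, the recovered Nusselt number at both boundaries is exactly 1 when the temperature field is given by a steady state conductive geotherm.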

derive formula for Earth size R1 and R2

•Temperature (K):

•Viscosity (Pa.s):

•Density (kg/m3):

•Heat capacity $c_p$ (J.K$^{-1}$.kg$^{-1}$): it is the measure of the heat energy required to increase the temperature of a unit quantity of a substance by one degree.

•Heat conductivity, or thermal conductivity $k$ (W.m$^{-1}$.K$^{-1}$): it is the property of a material that indicates its ability to conduct heat. It appears primarily in Fourier's Law for heat conduction.

•Heat diffusivity: $\kappa = k/(\rho c_p)$ (m$^2$.s$^{-1}$). Substances with high thermal diffusivity rapidly adjust their temperature to that of their surroundings, because they conduct heat quickly in comparison to their volumetric heat capacity or 'thermal bulk'.

•Thermal expansion $\alpha$ (K$^{-1}$): it is the tendency of matter to change in volume in response to a change in temperature.
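Returning to the Nusselt number defined in the list above, the property Nu=1 for a purely conductive profile is easy to check numerically. The sketch below is an illustration only and is not taken from any fieldstone: the function names, the grid layout `T[j][i]`, and the one-sided finite difference used for the surface gradient are assumptions.

```python
import math


def trapezoid(values, h):
    """Composite trapezoidal rule for equally spaced samples."""
    return h * (0.5 * values[0] + sum(values[1:-1]) + 0.5 * values[-1])


def nusselt_cartesian(T, Lx, Ly):
    """Nu = -Ly * int dT/dy(x, y=Ly) dx / int T(x, y=0) dx.

    T[j][i] is the temperature at x = i*hx, y = j*hy (assumed layout).
    """
    ny, nx = len(T), len(T[0])
    hx, hy = Lx / (nx - 1), Ly / (ny - 1)
    # one-sided difference at the top boundary (assumption of this sketch)
    dTdy_top = [(T[ny - 1][i] - T[ny - 2][i]) / hy for i in range(nx)]
    return -Ly * trapezoid(dTdy_top, hx) / trapezoid(T[0], hx)


def nusselt_annulus_conductive(f):
    """Nu at the inner and outer boundaries for the conductive geotherm T_c(r)."""
    R1, R2 = f / (1.0 - f), 1.0 / (1.0 - f)

    def dTc_dr(r):
        return 1.0 / (r * math.log(f))

    # the integrand does not depend on theta, so its angular mean is its value
    nu_inner = f * math.log(f) / (1.0 - f) * dTc_dr(R1)
    nu_outer = math.log(f) / (1.0 - f) * dTc_dr(R2)
    return nu_inner, nu_outer


if __name__ == "__main__":
    Lx, Ly, nx, ny = 2.0, 1.0, 5, 4
    # conductive profile T = 1 - y/Ly
    T = [[1.0 - (j / (ny - 1)) for _ in range(nx)] for j in range(ny)]
    print(nusselt_cartesian(T, Lx, Ly))
    print(nusselt_annulus_conductive(0.5))
```

Both calls should return values equal to 1 up to round-off, since a linear profile is differentiated exactly by the one-sided difference.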

4 The building blocks of the Finite Element Method

4.1 Numerical integration

As we will see later, using the Finite Element method to solve problems involves computing integrals

which are more often than not too complex to be computed analytically/exactly. We will then need to

compute them numerically.

[wiki] In essence, the basic problem in numerical integration is to compute an approximate solution to a definite integral

$$\int_a^b f(x)\,dx$$

to a given degree of accuracy. This problem has been widely studied [?] and we know that if $f(x)$ is a smooth function, and the domain of integration is bounded, there are many methods for approximating the integral to the desired precision.

There are several reasons for carrying out numerical integration.

•The integrand f(x) may be known only at certain points, such as obtained by sampling. Some

embedded systems and other computer applications may need numerical integration for this reason.

•A formula for the integrand may be known, but it may be difficult or impossible to find an antiderivative that is an elementary function. An example of such an integrand is $f(x) = \exp(-x^2)$, the antiderivative of which (the error function, times a constant) cannot be written in elementary form.

•It may be possible to ﬁnd an antiderivative symbolically, but it may be easier to compute a numerical

approximation than to compute the antiderivative. That may be the case if the antiderivative is

given as an inﬁnite series or product, or if its evaluation requires a special function that is not

available.

4.1.1 in 1D - theory

The simplest method is to approximate the integrand by an interpolating function that is constant (a polynomial of degree zero) and passes through the point $((a+b)/2, f((a+b)/2))$.

This is called the midpoint rule or rectangle rule.

$$\int_a^b f(x)\,dx \simeq (b-a)\,f\!\left(\frac{a+b}{2}\right)$$

insert here figure

The interpolating function may be a straight line (an affine function, i.e. a polynomial of degree 1) passing through the points $(a, f(a))$ and $(b, f(b))$.

This is called the trapezoidal rule.

$$\int_a^b f(x)\,dx \simeq (b-a)\,\frac{f(a) + f(b)}{2}$$

insert here figure

For either one of these rules, we can make a more accurate approximation by breaking up the interval $[a, b]$ into some number $n$ of subintervals, computing an approximation for each subinterval, then adding up all the results. This is called a composite rule, extended rule, or iterated rule. For example, the composite trapezoidal rule can be stated as

$$\int_a^b f(x)\,dx \simeq \frac{b-a}{n}\left(\frac{f(a)}{2} + \sum_{k=1}^{n-1} f\!\left(a + k\frac{b-a}{n}\right) + \frac{f(b)}{2}\right)$$

where the subintervals have the form $[a + kh, a + (k+1)h]$, with $h = (b-a)/n$ and $k = 0, 1, 2, \dots, n-1$.

[Figure, panels a) and b)] The interval $[-2, 2]$ is broken into 16 sub-intervals. The blue lines correspond to the approximation of the red curve by means of a) the midpoint rule, b) the trapezoidal rule.
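The two composite rules described above can be sketched in a few lines. This is a minimal illustration under stated assumptions: the function names are hypothetical and the integrand $f(x)=x^2$ on $[-2,2]$ is chosen only because its exact integral, 16/3, is known (it is not the curve of the figure).

```python
def midpoint_rule(f, a, b, n):
    """Composite midpoint rule on n equal subintervals of [a, b]."""
    h = (b - a) / n
    return h * sum(f(a + (k + 0.5) * h) for k in range(n))


def trapezoidal_rule(f, a, b, n):
    """Composite trapezoidal rule on n equal subintervals of [a, b]."""
    h = (b - a) / n
    interior = sum(f(a + k * h) for k in range(1, n))
    return h * (0.5 * f(a) + interior + 0.5 * f(b))


def square(x):
    # hypothetical integrand; exact integral over [-2, 2] is 16/3
    return x * x


if __name__ == "__main__":
    exact = 16.0 / 3.0
    for n in (4, 16, 64):
        err_mid = midpoint_rule(square, -2.0, 2.0, n) - exact
        err_trap = trapezoidal_rule(square, -2.0, 2.0, n) - exact
        print(n, err_mid, err_trap)
```

For this integrand both errors shrink by a factor of 16 each time $n$ is multiplied by 4, i.e. both rules are second-order accurate in $h$, and the midpoint error is half the trapezoidal error with the opposite sign.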

There are several algorithms for numerical integration (also commonly called 'numerical quadrature', or simply 'quadrature'). Interpolation with polynomials evaluated at equally spaced points in $[a, b]$ yields the Newton-Cotes formulas, of which the rectangle rule and the trapezoidal rule are examples. If we allow the intervals between interpolation points to vary, we find another group of quadrature formulas, such as the Gauss(ian) quadrature formulas. A Gaussian quadrature rule is typically more accurate than a Newton-Cotes rule that requires the same number of function evaluations, if the integrand is smooth (i.e., if it is sufficiently differentiable).

An $n$-point Gaussian quadrature rule, named after Carl Friedrich Gauss, is a quadrature rule constructed to yield an exact result for polynomials of degree $2n-1$ or less by a suitable choice of the points $x_i$ and weights $w_i$ for $i = 1, \dots, n$.

The domain of integration for such a rule is conventionally taken as $[-1, 1]$, so the rule is stated as

$$\int_{-1}^{+1} f(x)\,dx \simeq \sum_{iq=1}^{n} w_{iq}\,f(x_{iq})$$

In this formula the $x_{iq}$ coordinate is the $i$-th root of the Legendre polynomial $P_n(x)$.

It is important to note that a Gaussian quadrature will only produce good results if the function f(x)

is well approximated by a polynomial function within the range [−1,1]. As a consequence, the method

is not, for example, suitable for functions with singularities.

Gauss-Legendre points and their weights.

As shown in the above table, the weight values fulfill the following condition:

$$\sum_{iq=1}^{n} w_{iq} = 2 \tag{36}$$

and it is worth noting that all quadrature point coordinates are symmetrical around the origin.

Since most quadrature formulas are only valid on a specific interval, we now must address the problem of their use outside of such intervals. The solution turns out to be quite simple: one must carry out a change of variables from the interval $[a, b]$ to $[-1, 1]$.

We then consider the reduced coordinate $r \in [-1, 1]$ such that

$$r = \frac{2}{b-a}(x-a) - 1$$

This relationship can be reversed such that when $r$ is known, its equivalent coordinate $x \in [a, b]$ can be computed:

$$x = \frac{b-a}{2}(1+r) + a$$

From this it follows that

$$dx = \frac{b-a}{2}\,dr$$

and then

$$\int_a^b f(x)\,dx = \frac{b-a}{2}\int_{-1}^{+1} f(r)\,dr \simeq \frac{b-a}{2}\sum_{iq=1}^{n} w_{iq}\,f(r_{iq})$$

where $f(r)$ is shorthand for $f(x(r))$.
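The change of variables above translates directly into code. The following sketch is an illustration, not the fieldstone implementation: the names `GAUSS_LEGENDRE` and `gauss_legendre` are hypothetical, and only the standard 1-, 2- and 3-point Gauss-Legendre points and weights are hard-coded.

```python
import math

# Standard Gauss-Legendre points and weights on [-1, 1] for n = 1, 2, 3.
GAUSS_LEGENDRE = {
    1: ([0.0], [2.0]),
    2: ([-1.0 / math.sqrt(3.0), 1.0 / math.sqrt(3.0)], [1.0, 1.0]),
    3: ([-math.sqrt(3.0 / 5.0), 0.0, math.sqrt(3.0 / 5.0)],
        [5.0 / 9.0, 8.0 / 9.0, 5.0 / 9.0]),
}


def gauss_legendre(f, a, b, n):
    """Approximate the integral of f over [a, b] with an n-point rule."""
    points, weights = GAUSS_LEGENDRE[n]
    half = 0.5 * (b - a)
    total = 0.0
    for r, w in zip(points, weights):
        x = half * (1.0 + r) + a  # map r in [-1, 1] to x in [a, b]
        total += w * f(x)
    return half * total


if __name__ == "__main__":
    # x^3 on [0, 2]: a 2-point rule is exact for degree <= 3, exact value 4
    print(gauss_legendre(lambda x: x**3, 0.0, 2.0, 2))
    # x^2 on [-1, 1]: one point is not enough, two points are
    print(gauss_legendre(lambda x: x**2, -1.0, 1.0, 1),
          gauss_legendre(lambda x: x**2, -1.0, 1.0, 2))
```

The second print statement illustrates the degree $2n-1$ exactness limit: the 1-point rule returns 0 for $x^2$, while the 2-point rule recovers 2/3 up to round-off.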

4.1.2 in 1D - examples

example 1 Since we know how to carry out any required change of variables, we choose for simplicity $a = -1$, $b = +1$. Let us take for example $f(x) = \pi$. Then we can compute the integral of this function over the interval $[a, b]$ exactly:

$$I = \int_{-1}^{+1} f(x)\,dx = \pi\int_{-1}^{+1} dx = 2\pi$$

We can now use a Gauss-Legendre formula with $n_q$ quadrature points to compute this same integral:

$$I_{gq} = \int_{-1}^{+1} f(x)\,dx = \sum_{iq=1}^{n_q} w_{iq}\,f(x_{iq}) = \sum_{iq=1}^{n_q} w_{iq}\,\pi = \pi\underbrace{\sum_{iq=1}^{n_q} w_{iq}}_{=2} = 2\pi$$

where we have used the property of the weight values of Eq. (36). Since the actual number of points was never specified, this result is valid for all quadrature rules.

example 2 Let us now take $f(x) = mx + p$ and repeat the same exercise:

$$I = \int_{-1}^{+1} f(x)\,dx = \int_{-1}^{+1}(mx + p)\,dx = \left[\frac{1}{2}mx^2 + px\right]_{-1}^{+1} = 2p$$

$$I_{gq} = \int_{-1}^{+1} f(x)\,dx = \sum_{iq=1}^{n_q} w_{iq}\,f(x_{iq}) = \sum_{iq=1}^{n_q} w_{iq}(mx_{iq} + p) = m\underbrace{\sum_{iq=1}^{n_q} w_{iq}\,x_{iq}}_{=0} + p\underbrace{\sum_{iq=1}^{n_q} w_{iq}}_{=2} = 2p$$

since the quadrature points (and their weights) are symmetric with respect to zero on the x-axis. Once again the quadrature is able to compute the exact value of this integral: this makes sense since an $n$-point rule exactly integrates a polynomial of order $2n-1$, such that a 1-point quadrature exactly integrates a first order polynomial like the one above.

example 3 Let us now take $f(x) = x^2$. We have

$$I = \int_{-1}^{+1} f(x)\,dx = \int_{-1}^{+1} x^2\,dx = \left[\frac{1}{3}x^3\right]_{-1}^{+1} = \frac{2}{3}$$

and

$$I_{gq} = \int_{-1}^{+1} f(x)\,dx = \sum_{iq=1}^{n_q} w_{iq}\,f(x_{iq}) = \sum_{iq=1}^{n_q} w_{iq}\,x_{iq}^2$$

•$n_q = 1$: $x_{iq}^{(1)} = 0$, $w_{iq} = 2$. $I_{gq} = 0$

•$n_q = 2$: $x_q^{(1)} = -1/\sqrt{3}$, $x_q^{(2)} = 1/\sqrt{3}$, $w_q^{(1)} = w_q^{(2)} = 1$. $I_{gq} = \frac{2}{3}$

•It also works $\forall n_q > 2$!

4.1.3 in 2D/3D - theory

Let us now turn to a two-dimensional integral of the form

$$I = \int_{-1}^{+1}\int_{-1}^{+1} f(x, y)\,dx\,dy$$

The equivalent Gaussian quadrature writes:

$$I_{gq} \simeq \sum_{iq=1}^{n_q}\sum_{jq=1}^{n_q} f(x_{iq}, y_{jq})\,w_{iq}\,w_{jq}$$
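The double sum above is simply the 1D rule applied in each direction. A minimal sketch, assuming the standard 2-point Gauss-Legendre rule in both directions (the names `POINTS`, `WEIGHTS` and `gauss_2d` are hypothetical):

```python
import math

# 2-point Gauss-Legendre rule on [-1, 1], used in both directions
POINTS = [-1.0 / math.sqrt(3.0), 1.0 / math.sqrt(3.0)]
WEIGHTS = [1.0, 1.0]


def gauss_2d(f):
    """Tensor-product quadrature of f(x, y) over [-1, 1] x [-1, 1]."""
    total = 0.0
    for x, wx in zip(POINTS, WEIGHTS):
        for y, wy in zip(POINTS, WEIGHTS):
            total += f(x, y) * wx * wy
    return total


if __name__ == "__main__":
    # x^2 y^2 is of degree 2 in each direction: integrated exactly, 4/9
    print(gauss_2d(lambda x, y: x * x * y * y))
```

Because the rule is a tensor product, exactness is governed by the polynomial degree in each direction separately (here degree 3 or less in $x$ and in $y$), not by the total degree.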

4.2 The mesh

4.3 A bit of FE terminology

We introduce here some terminology for eﬃcient element descriptions [60]:

•For triangles/tetrahedra, the designation $P_m \times P_n$ means that each component of the velocity is approximated by continuous piecewise complete Polynomials of degree $m$ and pressure by continuous piecewise complete Polynomials of degree $n$. For example $P_2 \times P_1$ means

$$u \sim a_1 + a_2x + a_3y + a_4xy + a_5x^2 + a_6y^2$$

with similar approximations for $v$, and

$$p \sim b_1 + b_2x + b_3y$$

Both velocity and pressure are continuous across element boundaries, and each triangular element contains 6 velocity nodes and three pressure nodes.

•For the same families, $P_m \times P_{-n}$ is as above, except that pressure is approximated via piecewise discontinuous polynomials of degree $n$. For instance, $P_2 \times P_{-1}$ is the same as $P_2 \times P_1$ except that pressure is now an independent linear function in each element and therefore discontinuous at element boundaries.

•For quadrilaterals/hexahedra, the designation $Q_m \times Q_n$ means that each component of the velocity is approximated by a continuous piecewise polynomial of degree $m$ in each direction on the quadrilateral and likewise for pressure, except that the polynomial is of degree $n$. For instance, $Q_2 \times Q_1$ means

$$u \sim a_1 + a_2x + a_3y + a_4xy + a_5x^2 + a_6y^2 + a_7x^2y + a_8xy^2 + a_9x^2y^2$$

and

$$p \sim b_1 + b_2x + b_3y + b_4xy$$

•For these same families, $Q_m \times Q_{-n}$ is as above, except that the pressure approximation is not continuous at element boundaries.

•Again for the same families, $Q_m \times P_{-n}$ indicates the same velocity approximation with a pressure approximation that is a discontinuous complete piecewise polynomial of degree $n$ (not of degree $n$ in each direction!)

•The designation $P_m^+$ or $Q_m^+$ means that some sort of bubble function was added to the polynomial approximation for the velocity. You may also find the term 'enriched element' in the literature.

•Finally, for $n = 0$, we have piecewise-constant pressure, and we omit the minus sign for simplicity.

Another point which needs to be clarified is the use of so-called 'conforming elements' (or 'non-conforming elements'). Following again [60], conforming velocity elements are those for which the basis functions form a subset of $H^1$ for the continuous problem (the first derivatives and their squares are integrable in $\Omega$). For instance, the rotated $Q_1 \times P_0$ element of Rannacher and Turek (see section ??) is such that the velocity is discontinuous across element edges, so that the derivative does not exist there. Another typical example of a non-conforming element is the Crouzeix-Raviart element [34].

4.4 Elements and basis functions in 1D

4.4.1 Linear basis functions (Q1)

Let $f(r)$ be a $C^1$ function on the interval $[-1, 1]$ with $f(-1) = f_1$ and $f(1) = f_2$.

Let us assume that the function $f(r)$ is to be approximated on $[-1, 1]$ by the first order polynomial

$$f(r) = a + br \tag{37}$$

Then it must fulfill

$$f(r=-1) = a - b = f_1$$

$$f(r=+1) = a + b = f_2$$

This leads to

$$a = \frac{1}{2}(f_1 + f_2) \qquad b = \frac{1}{2}(-f_1 + f_2)$$

and then replacing $a, b$ in Eq. (37) by the above values one gets

$$f(r) = \frac{1}{2}(1-r)\,f_1 + \frac{1}{2}(1+r)\,f_2$$

or,

$$f(r) = \sum_{i=1}^{2} N_i(r)\,f_i$$

with

$$N_1(r) = \frac{1}{2}(1-r) \qquad N_2(r) = \frac{1}{2}(1+r) \tag{38}$$

4.4.2 Quadratic basis functions (Q2)

Let $f(r)$ be a $C^1$ function on the interval $[-1, 1]$ with $f(-1) = f_1$, $f(0) = f_2$ and $f(1) = f_3$.

Let us assume that the function $f(r)$ is to be approximated on $[-1, 1]$ by the second order polynomial

$$f(r) = a + br + cr^2 \tag{39}$$

Then it must fulfill

$$f(r=-1) = a - b + c = f_1$$

$$f(r=0) = a = f_2$$

$$f(r=+1) = a + b + c = f_3$$

This leads to

$$a = f_2 \qquad b = \frac{1}{2}(-f_1 + f_3) \qquad c = \frac{1}{2}(f_1 + f_3 - 2f_2)$$

and then replacing $a, b, c$ in Eq. (39) by the above values one gets

$$f(r) = \frac{1}{2}r(r-1)\,f_1 + (1-r^2)\,f_2 + \frac{1}{2}r(r+1)\,f_3$$

or,

$$f(r) = \sum_{i=1}^{3} N_i(r)\,f_i$$

with

$$N_1(r) = \frac{1}{2}r(r-1) \qquad N_2(r) = (1-r^2) \qquad N_3(r) = \frac{1}{2}r(r+1) \tag{40}$$

4.4.3 Cubic basis functions (Q3)

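The 1D basis polynomial is given by

$$f(r) = a + br + cr^2 + dr^3$$

with the nodes at positions $-1$, $-1/3$, $+1/3$ and $+1$.

$$\begin{aligned}
f(-1) &= a - b + c - d = f_1 \\
f(-1/3) &= a - \frac{b}{3} + \frac{c}{9} - \frac{d}{27} = f_2 \\
f(+1/3) &= a + \frac{b}{3} + \frac{c}{9} + \frac{d}{27} = f_3 \\
f(+1) &= a + b + c + d = f_4
\end{aligned}$$

Adding the first and fourth equations, and the second and third, one arrives at

$$f_1 + f_4 = 2a + 2c \qquad f_2 + f_3 = 2a + \frac{2c}{9}$$

and finally:

$$a = \frac{1}{16}(-f_1 + 9f_2 + 9f_3 - f_4)$$

$$c = \frac{9}{16}(f_1 - f_2 - f_3 + f_4)$$

Combining the original 4 equations in a different way (subtracting instead of adding) yields

$$2b + 2d = f_4 - f_1 \qquad \frac{2b}{3} + \frac{2d}{27} = f_3 - f_2$$

so that

$$b = \frac{1}{16}(f_1 - 27f_2 + 27f_3 - f_4)$$

$$d = \frac{9}{16}(-f_1 + 3f_2 - 3f_3 + f_4)$$

Finally,

$$\begin{aligned}
f(r) &= a + br + cr^2 + dr^3 \\
&= \frac{1}{16}(-1 + r + 9r^2 - 9r^3)\,f_1 \\
&+ \frac{1}{16}(9 - 27r - 9r^2 + 27r^3)\,f_2 \\
&+ \frac{1}{16}(9 + 27r - 9r^2 - 27r^3)\,f_3 \\
&+ \frac{1}{16}(-1 - r + 9r^2 + 9r^3)\,f_4 \\
&= \sum_{i=1}^{4} N_i(r)\,f_i
\end{aligned}$$

where

$$\begin{aligned}
N_1 &= \frac{1}{16}(-1 + r + 9r^2 - 9r^3) \\
N_2 &= \frac{1}{16}(9 - 27r - 9r^2 + 27r^3) \\
N_3 &= \frac{1}{16}(9 + 27r - 9r^2 - 27r^3) \\
N_4 &= \frac{1}{16}(-1 - r + 9r^2 + 9r^3)
\end{aligned}$$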

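Verification:

•Let us assume $f(r) = C$, then

$$\hat{f}(r) = \sum_i N_i(r)\,f_i = \sum_i N_i\,C = C\sum_i N_i = C$$

so that a constant function is exactly reproduced, as expected.

•Let us assume $f(r) = r$, then $f_1 = -1$, $f_2 = -1/3$, $f_3 = 1/3$ and $f_4 = +1$. We then have

$$\begin{aligned}
\hat{f}(r) &= \sum_i N_i(r)\,f_i \\
&= -N_1(r) - \frac{1}{3}N_2(r) + \frac{1}{3}N_3(r) + N_4(r) \\
&= \Big[-(-1 + r + 9r^2 - 9r^3) \\
&\qquad - \frac{1}{3}(9 - 27r - 9r^2 + 27r^3) \\
&\qquad + \frac{1}{3}(9 + 27r - 9r^2 - 27r^3) \\
&\qquad + (-1 - r + 9r^2 + 9r^3)\Big]/16 \\
&= [-r + 9r + 9r - r]/16 + \dots 0 \dots \\
&= r
\end{aligned} \tag{41}$$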

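The derivatives of the basis functions are given by

$$\begin{aligned}
\frac{\partial N_1}{\partial r} &= \frac{1}{16}(1 + 18r - 27r^2) \\
\frac{\partial N_2}{\partial r} &= \frac{1}{16}(-27 - 18r + 81r^2) \\
\frac{\partial N_3}{\partial r} &= \frac{1}{16}(+27 - 18r - 81r^2) \\
\frac{\partial N_4}{\partial r} &= \frac{1}{16}(-1 + 18r + 27r^2)
\end{aligned}$$

Verification:

•Let us assume $f(r) = C$, then

$$\begin{aligned}
\frac{\partial \hat{f}}{\partial r} &= \sum_i \frac{\partial N_i}{\partial r}\,f_i \\
&= C\sum_i \frac{\partial N_i}{\partial r} \\
&= \frac{C}{16}\Big[(1 + 18r - 27r^2) + (-27 - 18r + 81r^2) + (+27 - 18r - 81r^2) + (-1 + 18r + 27r^2)\Big] \\
&= 0
\end{aligned}$$

•Let us assume $f(r) = r$, then $f_1 = -1$, $f_2 = -1/3$, $f_3 = 1/3$ and $f_4 = +1$. We then have

$$\begin{aligned}
\frac{\partial \hat{f}}{\partial r} &= \sum_i \frac{\partial N_i}{\partial r}\,f_i \\
&= \frac{1}{16}\Big[-(1 + 18r - 27r^2) - \frac{1}{3}(-27 - 18r + 81r^2) + \frac{1}{3}(+27 - 18r - 81r^2) + (-1 + 18r + 27r^2)\Big] \\
&= \frac{1}{16}\left[-2 + 18 + 54r^2 - 54r^2\right] \\
&= 1
\end{aligned}$$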
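The verifications above can also be carried out numerically, and the cubic basis functions can be cross-checked against the general Lagrange interpolation formula $N_i(r) = \prod_{j\neq i}(r-r_j)/(r_i-r_j)$. The sketch below is illustrative only; all function names are hypothetical.

```python
NODES = [-1.0, -1.0 / 3.0, 1.0 / 3.0, 1.0]


def lagrange_basis(nodes, i, r):
    """Lagrange polynomial equal to 1 at nodes[i] and 0 at the other nodes."""
    value = 1.0
    for j, rj in enumerate(nodes):
        if j != i:
            value *= (r - rj) / (nodes[i] - rj)
    return value


def cubic_basis(r):
    """N1..N4 as written in the text."""
    return [(-1 + r + 9 * r**2 - 9 * r**3) / 16,
            (9 - 27 * r - 9 * r**2 + 27 * r**3) / 16,
            (9 + 27 * r - 9 * r**2 - 27 * r**3) / 16,
            (-1 - r + 9 * r**2 + 9 * r**3) / 16]


def cubic_basis_derivatives(r):
    """dN1/dr..dN4/dr as written in the text."""
    return [(1 + 18 * r - 27 * r**2) / 16,
            (-27 - 18 * r + 81 * r**2) / 16,
            (27 - 18 * r - 81 * r**2) / 16,
            (-1 + 18 * r + 27 * r**2) / 16]


if __name__ == "__main__":
    for r in (-0.8, -0.1, 0.25, 0.9):
        N = cubic_basis(r)
        dN = cubic_basis_derivatives(r)
        # differences with the Lagrange formula should be at round-off level
        print(max(abs(N[i] - lagrange_basis(NODES, i, r)) for i in range(4)))
        # a constant and a linear field, and their derivatives
        print(sum(N), sum(n * x for n, x in zip(N, NODES)),
              sum(dN), sum(d * x for d, x in zip(dN, NODES)))
```

The same `lagrange_basis` helper reproduces the linear and quadratic 1D basis functions when called with the nodes `[-1, 1]` and `[-1, 0, 1]` respectively.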

4.5 Elements and basis functions in 2D

Let us for a moment consider a single quadrilateral element in the xy-plane, as shown on the following

ﬁgure:

Let us assume that we know the values of a given field $u$ at the vertices. For a given point $M$ inside the element in the plane, what is the value of the field $u$ at this point? It makes sense to postulate that $u_M = u(x_M, y_M)$ will be given by

$$u_M = \phi(u_1, u_2, u_3, u_4, x_M, y_M)$$

where $\phi$ is a function to be determined. Although $\phi$ is not unique, we can decide to express the value $u_M$ as a weighted sum of the values at the vertices $u_i$. One option could be to assign all four vertices the same weight, say 1/4, so that $u_M = (u_1 + u_2 + u_3 + u_4)/4$, i.e. $u_M$ is simply given by the arithmetic mean of the vertex values. This approach suffers from a major drawback as it does not use the location of point $M$ inside the element. For instance, when $(x_M, y_M) \rightarrow (x_2, y_2)$ we expect $u_M \rightarrow u_2$.

In light of this, we could now assume that the weights would depend on the position of $M$ in a continuous fashion:

$$u(x_M, y_M) = \sum_{i=1}^{4} N_i(x_M, y_M)\,u_i$$

where the $N_i$ are continuous ("well behaved") functions which have the property:

$$N_i(x_j, y_j) = \delta_{ij}$$

or, in other words:

N3(x1, y1) = 0 (42)

N3(x2, y2) = 0 (43)

N3(x3, y3) = 1 (44)

N3(x4, y4) = 0 (45)

The functions Niare commonly called basis functions.

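Omitting the $M$ subscripts for any point inside the element, the velocity components $u$ and $v$ are given by:

$$\hat{u}(x, y) = \sum_{i=1}^{4} N_i(x, y)\,u_i \tag{46}$$

$$\hat{v}(x, y) = \sum_{i=1}^{4} N_i(x, y)\,v_i \tag{47}$$

Rather interestingly, one can now easily compute velocity gradients (and therefore the strain rate tensor) since we have assumed the basis functions to be "well behaved" (in this case differentiable):

$$\dot\epsilon_{xx}(x, y) = \frac{\partial u}{\partial x} = \sum_{i=1}^{4} \frac{\partial N_i}{\partial x}\,u_i \tag{48}$$

$$\dot\epsilon_{yy}(x, y) = \frac{\partial v}{\partial y} = \sum_{i=1}^{4} \frac{\partial N_i}{\partial y}\,v_i \tag{49}$$

$$\dot\epsilon_{xy}(x, y) = \frac{1}{2}\frac{\partial u}{\partial y} + \frac{1}{2}\frac{\partial v}{\partial x} = \frac{1}{2}\sum_{i=1}^{4} \frac{\partial N_i}{\partial y}\,u_i + \frac{1}{2}\sum_{i=1}^{4} \frac{\partial N_i}{\partial x}\,v_i \tag{50}$$

How we actually obtain the exact form of the basis functions is explained in the coming section.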

4.5.1 Bilinear basis functions in 2D (Q1)

In this section, we place ourselves in the most favorable case, i.e. the element is a square defined by $-1 < r < 1$, $-1 < s < 1$ in the Cartesian coordinate system $(r, s)$:

3===========2

| | (r_0,s_0)=(-1,-1)

| | (r_1,s_1)=(+1,-1)

| | (r_2,s_2)=(+1,+1)

| | (r_3,s_3)=(-1,+1)

| |

0===========1

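This element is commonly called the reference element. How we go from the $(x, y)$ coordinate system to the $(r, s)$ one and vice versa will be dealt with later on. For now, the basis functions in the above reference element and in the reduced coordinate system $(r, s)$ are given by (with $N_i$ associated with node $i-1$ of the above diagram):

$$\begin{aligned}
N_1(r, s) &= 0.25(1-r)(1-s) \\
N_2(r, s) &= 0.25(1+r)(1-s) \\
N_3(r, s) &= 0.25(1+r)(1+s) \\
N_4(r, s) &= 0.25(1-r)(1+s)
\end{aligned}$$

The partial derivatives of these functions with respect to $r$ and $s$ automatically follow:

$$\begin{aligned}
\frac{\partial N_1}{\partial r}(r, s) &= -0.25(1-s) & \frac{\partial N_1}{\partial s}(r, s) &= -0.25(1-r) \\
\frac{\partial N_2}{\partial r}(r, s) &= +0.25(1-s) & \frac{\partial N_2}{\partial s}(r, s) &= -0.25(1+r) \\
\frac{\partial N_3}{\partial r}(r, s) &= +0.25(1+s) & \frac{\partial N_3}{\partial s}(r, s) &= +0.25(1+r) \\
\frac{\partial N_4}{\partial r}(r, s) &= -0.25(1+s) & \frac{\partial N_4}{\partial s}(r, s) &= +0.25(1-r)
\end{aligned}$$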

Let us go back to Eq. (47), and let us assume that the function $v(r, s) = C$ so that $v_i = C$ for $i = 1, 2, 3, 4$. It then follows that

$$\hat{v}(r, s) = \sum_{i=1}^{4} N_i(r, s)\,v_i = C\sum_{i=1}^{4} N_i(r, s) = C\,[N_1(r, s) + N_2(r, s) + N_3(r, s) + N_4(r, s)] = C$$

This is a very important property: if the $v$ function used to assign values at the vertices is constant, then the value of $\hat{v}$ anywhere in the element is exactly $C$. If we now turn to the derivatives of $v$ with respect to $r$ and $s$:

$$\frac{\partial \hat{v}}{\partial r}(r, s) = \sum_{i=1}^{4} \frac{\partial N_i}{\partial r}(r, s)\,v_i = C\sum_{i=1}^{4} \frac{\partial N_i}{\partial r}(r, s) = C\,[-0.25(1-s) + 0.25(1-s) + 0.25(1+s) - 0.25(1+s)] = 0$$

$$\frac{\partial \hat{v}}{\partial s}(r, s) = \sum_{i=1}^{4} \frac{\partial N_i}{\partial s}(r, s)\,v_i = C\sum_{i=1}^{4} \frac{\partial N_i}{\partial s}(r, s) = C\,[-0.25(1-r) - 0.25(1+r) + 0.25(1+r) + 0.25(1-r)] = 0$$

We reassuringly find that the derivative of a constant field anywhere in the element is exactly zero.

If we now choose $v(r, s) = ar + bs$ with $a$ and $b$ two constant scalars, we find:

\begin{align}
\hat{v}(r, s) &= \sum_{i=1}^{4} N_i(r, s)\,v_i \tag{51} \\
&= \sum_{i=1}^{4} N_i(r, s)(ar_i + bs_i) \tag{52} \\
&= a\underbrace{\sum_{i=1}^{4} N_i(r, s)\,r_i}_{r} + b\underbrace{\sum_{i=1}^{4} N_i(r, s)\,s_i}_{s} \tag{53} \\
&= a\,[0.25(1-r)(1-s)(-1) + 0.25(1+r)(1-s)(+1) + 0.25(1+r)(1+s)(+1) + 0.25(1-r)(1+s)(-1)] \nonumber \\
&+ b\,[0.25(1-r)(1-s)(-1) + 0.25(1+r)(1-s)(-1) + 0.25(1+r)(1+s)(+1) + 0.25(1-r)(1+s)(+1)] \nonumber \\
&= a\,[-0.25(1-r)(1-s) + 0.25(1+r)(1-s) + 0.25(1+r)(1+s) - 0.25(1-r)(1+s)] \nonumber \\
&+ b\,[-0.25(1-r)(1-s) - 0.25(1+r)(1-s) + 0.25(1+r)(1+s) + 0.25(1-r)(1+s)] \nonumber \\
&= ar + bs \tag{54}
\end{align}

This set of bilinear shape functions is therefore capable of exactly representing a linear field. The derivatives are:

\begin{align}
\frac{\partial \hat{v}}{\partial r}(r, s) &= \sum_{i=1}^{4} \frac{\partial N_i}{\partial r}(r, s)\,v_i \tag{55} \\
&= a\sum_{i=1}^{4} \frac{\partial N_i}{\partial r}(r, s)\,r_i + b\sum_{i=1}^{4} \frac{\partial N_i}{\partial r}(r, s)\,s_i \tag{56} \\
&= a\,[-0.25(1-s)(-1) + 0.25(1-s)(+1) + 0.25(1+s)(+1) - 0.25(1+s)(-1)] \nonumber \\
&+ b\,[-0.25(1-s)(-1) + 0.25(1-s)(-1) + 0.25(1+s)(+1) - 0.25(1+s)(+1)] \nonumber \\
&= \frac{a}{4}[(1-s) + (1-s) + (1+s) + (1+s)] \nonumber \\
&+ \frac{b}{4}[(1-s) - (1-s) + (1+s) - (1+s)] \nonumber \\
&= a \tag{57}
\end{align}

Here again, we find that the derivative of the linear field inside the element is exact: $\frac{\partial \hat{v}}{\partial r} = \frac{\partial v}{\partial r}$.

However, following the same methodology as above, one can easily prove that this is no longer true for polynomials of degree strictly higher than 1 in $r$ or in $s$. This fact has serious consequences: if the solution to the problem at hand is for instance a parabola, the $Q_1$ shape functions cannot represent the solution properly, but only by approximating the parabola in each element by a line. As we will see later, $Q_2$ basis functions can remedy this problem since they themselves contain quadratic terms.
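The properties derived above for the $Q_1$ element can be reproduced with a short script. This is a sketch for illustration, with hypothetical function names; it evaluates the four basis functions and their $r$-derivatives on the reference element and interpolates nodal values.

```python
# nodes of the reference element, in the order used for N1..N4
NODES = [(-1.0, -1.0), (1.0, -1.0), (1.0, 1.0), (-1.0, 1.0)]


def q1_basis(r, s):
    return [0.25 * (1 - r) * (1 - s), 0.25 * (1 + r) * (1 - s),
            0.25 * (1 + r) * (1 + s), 0.25 * (1 - r) * (1 + s)]


def q1_basis_dr(r, s):
    return [-0.25 * (1 - s), 0.25 * (1 - s), 0.25 * (1 + s), -0.25 * (1 + s)]


def interpolate(field, r, s):
    """Interpolate at (r, s) the nodal values of field(r, s)."""
    values = [field(rn, sn) for rn, sn in NODES]
    return sum(N * v for N, v in zip(q1_basis(r, s), values))


def interpolate_dr(field, r, s):
    """r-derivative at (r, s) of the interpolated field."""
    values = [field(rn, sn) for rn, sn in NODES]
    return sum(dN * v for dN, v in zip(q1_basis_dr(r, s), values))


if __name__ == "__main__":
    a, b = 2.0, -3.0
    # a linear field and its derivative are reproduced exactly
    print(interpolate(lambda r, s: a * r + b * s, 0.3, -0.6), a * 0.3 + b * (-0.6))
    print(interpolate_dr(lambda r, s: a * r + b * s, 0.3, -0.6), a)
    # a parabola is not: r^2 equals 1 at all four nodes, so the interpolant is 1
    print(interpolate(lambda r, s: r * r, 0.0, 0.0))
```

The last line shows the limitation discussed above: the field $r^2$ vanishes at the element centre, but its $Q_1$ interpolant is equal to 1 there.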

4.5.2 Biquadratic basis functions in 2D (Q2)

This element is part of the so-called Lagrange family.

citation needed

Inside an element the local numbering of the nodes is as follows:

3=====6=====2

| | | (r_0,s_0)=(-1,-1) (r_4,s_4)=( 0,-1)

| | | (r_1,s_1)=(+1,-1) (r_5,s_5)=(+1, 0)

7=====8=====5 (r_2,s_2)=(+1,+1) (r_6,s_6)=( 0,+1)

| | | (r_3,s_3)=(-1,+1) (r_7,s_7)=(-1, 0)

| | | (r_8,s_8)=( 0, 0)

0=====4=====1

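The basis polynomial is then

$$f(r, s) = a + br + cs + drs + er^2 + fs^2 + gr^2s + hrs^2 + ir^2s^2$$

The velocity shape functions are given by:

$$\begin{aligned}
N_0(r, s) &= \frac{1}{2}r(r-1)\,\frac{1}{2}s(s-1) \\
N_1(r, s) &= \frac{1}{2}r(r+1)\,\frac{1}{2}s(s-1) \\
N_2(r, s) &= \frac{1}{2}r(r+1)\,\frac{1}{2}s(s+1) \\
N_3(r, s) &= \frac{1}{2}r(r-1)\,\frac{1}{2}s(s+1) \\
N_4(r, s) &= (1-r^2)\,\frac{1}{2}s(s-1) \\
N_5(r, s) &= \frac{1}{2}r(r+1)\,(1-s^2) \\
N_6(r, s) &= (1-r^2)\,\frac{1}{2}s(s+1) \\
N_7(r, s) &= \frac{1}{2}r(r-1)\,(1-s^2) \\
N_8(r, s) &= (1-r^2)(1-s^2)
\end{aligned}$$

and their derivatives by:

$$\begin{aligned}
\frac{\partial N_0}{\partial r} &= \frac{1}{2}(2r-1)\,\frac{1}{2}s(s-1) & \frac{\partial N_0}{\partial s} &= \frac{1}{2}r(r-1)\,\frac{1}{2}(2s-1) \\
\frac{\partial N_1}{\partial r} &= \frac{1}{2}(2r+1)\,\frac{1}{2}s(s-1) & \frac{\partial N_1}{\partial s} &= \frac{1}{2}r(r+1)\,\frac{1}{2}(2s-1) \\
\frac{\partial N_2}{\partial r} &= \frac{1}{2}(2r+1)\,\frac{1}{2}s(s+1) & \frac{\partial N_2}{\partial s} &= \frac{1}{2}r(r+1)\,\frac{1}{2}(2s+1) \\
\frac{\partial N_3}{\partial r} &= \frac{1}{2}(2r-1)\,\frac{1}{2}s(s+1) & \frac{\partial N_3}{\partial s} &= \frac{1}{2}r(r-1)\,\frac{1}{2}(2s+1) \\
\frac{\partial N_4}{\partial r} &= (-2r)\,\frac{1}{2}s(s-1) & \frac{\partial N_4}{\partial s} &= (1-r^2)\,\frac{1}{2}(2s-1) \\
\frac{\partial N_5}{\partial r} &= \frac{1}{2}(2r+1)\,(1-s^2) & \frac{\partial N_5}{\partial s} &= \frac{1}{2}r(r+1)\,(-2s) \\
\frac{\partial N_6}{\partial r} &= (-2r)\,\frac{1}{2}s(s+1) & \frac{\partial N_6}{\partial s} &= (1-r^2)\,\frac{1}{2}(2s+1) \\
\frac{\partial N_7}{\partial r} &= \frac{1}{2}(2r-1)\,(1-s^2) & \frac{\partial N_7}{\partial s} &= \frac{1}{2}r(r-1)\,(-2s) \\
\frac{\partial N_8}{\partial r} &= (-2r)(1-s^2) & \frac{\partial N_8}{\partial s} &= (1-r^2)(-2s)
\end{aligned}$$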
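Each biquadratic shape function above is the product of a 1D quadratic basis function in $r$ and one in $s$, which suggests building them programmatically. A minimal sketch, assuming the node ordering of the diagram (function and variable names are hypothetical):

```python
def q2_1d(x):
    """1D quadratic basis functions attached to the nodes -1, 0, +1."""
    return [0.5 * x * (x - 1.0), 1.0 - x * x, 0.5 * x * (x + 1.0)]


# (r, s) coordinates of nodes 0..8, as in the element diagram
NODES = [(-1, -1), (1, -1), (1, 1), (-1, 1),
         (0, -1), (1, 0), (0, 1), (-1, 0), (0, 0)]


def q2_basis(r, s):
    """The nine shape functions N0..N8 evaluated at (r, s)."""
    Nr, Ns = q2_1d(r), q2_1d(s)
    # a node coordinate c in {-1, 0, +1} selects the 1D function of index c + 1
    return [Nr[rn + 1] * Ns[sn + 1] for rn, sn in NODES]


if __name__ == "__main__":
    # nodal property: N_i is 1 at node i and 0 at the other nodes
    for i, (rn, sn) in enumerate(NODES):
        print(i, q2_basis(float(rn), float(sn)))
    # the field r^2 s^2 is reproduced exactly inside the element
    r, s = 0.3, -0.7
    values = [(rn * sn) ** 2 for rn, sn in NODES]
    print(sum(N * v for N, v in zip(q2_basis(r, s), values)), (r * s) ** 2)
```

The same construction with three 1D functions per direction in $r$, $s$ and $t$ gives the 27 triquadratic functions of the 3D element.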

4.5.3 Eight node serendipity basis functions in 2D (Qs

Inside an element the local numbering of the nodes is as follows:

3=====6=====2

| | | (r_0,s_0)=(-1,-1) (r_4,s_4)=( 0,-1)

| | | (r_1,s_1)=(+1,-1) (r_5,s_5)=(+1, 0)

7=====+=====5 (r_2,s_2)=(+1,+1) (r_6,s_6)=( 0,+1)

| | | (r_3,s_3)=(-1,+1) (r_7,s_7)=(-1, 0)

|||

0=====4=====1

The main difference with the $Q_2$ element resides in the fact that there is no node in the middle of the element. The basis polynomial is then

$$f(r, s) = a + br + cs + drs + er^2 + fs^2 + gr^2s + hrs^2$$

Note the absence of the $r^2s^2$ term which was previously associated with the center node. We find that

$$N_0(r, s) = \frac{1}{4}(1-r)(1-s)(-r-s-1) \tag{58}$$

$$N_1(r, s) = \frac{1}{4}(1+r)(1-s)(r-s-1) \tag{59}$$

$$N_2(r, s) = \frac{1}{4}(1+r)(1+s)(r+s-1) \tag{60}$$

$$N_3(r, s) = \frac{1}{4}(1-r)(1+s)(-r+s-1) \tag{61}$$

$$N_4(r, s) = \frac{1}{2}(1-r^2)(1-s) \tag{62}$$

$$N_5(r, s) = \frac{1}{2}(1+r)(1-s^2) \tag{63}$$

$$N_6(r, s) = \frac{1}{2}(1-r^2)(1+s) \tag{64}$$

$$N_7(r, s) = \frac{1}{2}(1-r)(1-s^2) \tag{65}$$

The shape functions at the mid-side nodes are products of a second order polynomial parallel to the side and a linear function perpendicular to the side, while the shape functions for the corner nodes are modifications of those of the bilinear quadrilateral element.
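The eight serendipity shape functions of Eqs. (58) to (65) can be checked against the nodal property $N_i(r_j, s_j) = \delta_{ij}$ and against the requirement that they sum to one. The script below is a sketch written for this purpose; the names are hypothetical.

```python
# (r, s) coordinates of nodes 0..7, as in the element diagram
NODES = [(-1.0, -1.0), (1.0, -1.0), (1.0, 1.0), (-1.0, 1.0),
         (0.0, -1.0), (1.0, 0.0), (0.0, 1.0), (-1.0, 0.0)]


def serendipity_basis(r, s):
    """Shape functions N0..N7 of Eqs. (58)-(65) evaluated at (r, s)."""
    return [0.25 * (1 - r) * (1 - s) * (-r - s - 1),
            0.25 * (1 + r) * (1 - s) * (r - s - 1),
            0.25 * (1 + r) * (1 + s) * (r + s - 1),
            0.25 * (1 - r) * (1 + s) * (-r + s - 1),
            0.5 * (1 - r * r) * (1 - s),
            0.5 * (1 + r) * (1 - s * s),
            0.5 * (1 - r * r) * (1 + s),
            0.5 * (1 - r) * (1 - s * s)]


def max_nodal_error():
    """Largest deviation from N_i(node j) = delta_ij over all i, j."""
    worst = 0.0
    for j, (rn, sn) in enumerate(NODES):
        N = serendipity_basis(rn, sn)
        for i in range(8):
            target = 1.0 if i == j else 0.0
            worst = max(worst, abs(N[i] - target))
    return worst


if __name__ == "__main__":
    print(max_nodal_error())
    print(sum(serendipity_basis(0.3, -0.7)))
```

A nodal error of zero together with a unit sum at an arbitrary interior point indicates that the eight functions as written are consistent with the node numbering of the diagram.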

4.5.4 Bicubic basis functions in 2D (Q3)

Inside an element the local numbering of the nodes is as follows:

12===13===14===15 (r,s)_{00}=(-1,-1) (r,s)_{08}=(-1,+1/3)

|| || || || (r,s)_{01}=(-1/3,-1) (r,s)_{09}=(-1/3,+1/3)

08===09===10===11 (r,s)_{02}=(+1/3,-1) (r,s)_{10}=(+1/3,+1/3)

|| || || || (r,s)_{03}=(+1,-1) (r,s)_{11}=(+1,+1/3)

04===05===06===07 (r,s)_{04}=(-1,-1/3) (r,s)_{12}=(-1,+1)

|| || || || (r,s)_{05}=(-1/3,-1/3) (r,s)_{13}=(-1/3,+1)

00===01===02===03 (r,s)_{06}=(+1/3,-1/3) (r,s)_{14}=(+1/3,+1)

(r,s)_{07}=(+1,-1/3) (r,s)_{15}=(+1,+1)

The velocity shape functions are given by the products of the 1D cubic basis functions in each direction:

$$\begin{aligned}
N_1(r) &= (-1 + r + 9r^2 - 9r^3)/16 & N_1(s) &= (-1 + s + 9s^2 - 9s^3)/16 \\
N_2(r) &= (+9 - 27r - 9r^2 + 27r^3)/16 & N_2(s) &= (+9 - 27s - 9s^2 + 27s^3)/16 \\
N_3(r) &= (+9 + 27r - 9r^2 - 27r^3)/16 & N_3(s) &= (+9 + 27s - 9s^2 - 27s^3)/16 \\
N_4(r) &= (-1 - r + 9r^2 + 9r^3)/16 & N_4(s) &= (-1 - s + 9s^2 + 9s^3)/16
\end{aligned}$$

$$\begin{aligned}
N_{01}(r, s) &= N_1(r)N_1(s) = (-1 + r + 9r^2 - 9r^3)/16 * (-1 + s + 9s^2 - 9s^3)/16 \\
N_{02}(r, s) &= N_2(r)N_1(s) = (+9 - 27r - 9r^2 + 27r^3)/16 * (-1 + s + 9s^2 - 9s^3)/16 \\
N_{03}(r, s) &= N_3(r)N_1(s) = (+9 + 27r - 9r^2 - 27r^3)/16 * (-1 + s + 9s^2 - 9s^3)/16 \\
N_{04}(r, s) &= N_4(r)N_1(s) = (-1 - r + 9r^2 + 9r^3)/16 * (-1 + s + 9s^2 - 9s^3)/16 \\
N_{05}(r, s) &= N_1(r)N_2(s) = (-1 + r + 9r^2 - 9r^3)/16 * (9 - 27s - 9s^2 + 27s^3)/16 \\
N_{06}(r, s) &= N_2(r)N_2(s) = (+9 - 27r - 9r^2 + 27r^3)/16 * (9 - 27s - 9s^2 + 27s^3)/16 \\
N_{07}(r, s) &= N_3(r)N_2(s) = (+9 + 27r - 9r^2 - 27r^3)/16 * (9 - 27s - 9s^2 + 27s^3)/16 \\
N_{08}(r, s) &= N_4(r)N_2(s) = (-1 - r + 9r^2 + 9r^3)/16 * (9 - 27s - 9s^2 + 27s^3)/16
\end{aligned}$$

$$N_{09}(r, s) = N_1(r)N_3(s) \tag{66}$$

$$N_{10}(r, s) = N_2(r)N_3(s) \tag{67}$$

$$N_{11}(r, s) = N_3(r)N_3(s) \tag{68}$$

$$N_{12}(r, s) = N_4(r)N_3(s) \tag{69}$$

$$N_{13}(r, s) = N_1(r)N_4(s) \tag{70}$$

$$N_{14}(r, s) = N_2(r)N_4(s) \tag{71}$$

$$N_{15}(r, s) = N_3(r)N_4(s) \tag{72}$$

$$N_{16}(r, s) = N_4(r)N_4(s) \tag{73}$$

4.5.5 Linear basis functions for triangles in 2D (P1)

| \ (r_0,s_0)=(0,0)

| \ (r_1,s_1)=(1,0)

| \ (r_2,s_2)=(0,1)

0=======1

The basis polynomial is then

$$f(r, s) = a + br + cs$$

and the shape functions:

$$N_0(r, s) = 1 - r - s \tag{74}$$

$$N_1(r, s) = r \tag{75}$$

$$N_2(r, s) = s \tag{76}$$

4.5.6 Quadratic basis functions for triangles in 2D (P2)

| \ (r_0,s_0)=(0,0) (r_3,s_3)=(1/2,0)

5 4 (r_1,s_1)=(1,0) (r_4,s_4)=(1/2,1/2)

| \ (r_2,s_2)=(0,1) (r_5,s_5)=(0,1/2)

| \

0===3===1

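The basis polynomial is then

$$f(r, s) = c_1 + c_2r + c_3s + c_4r^2 + c_5rs + c_6s^2$$

We have (with $f_i$ the value at node $i-1$ of the above diagram, taken in the order $(0,0)$, $(1,0)$, $(0,1)$, $(1/2,0)$, $(1/2,1/2)$, $(0,1/2)$)

$$\begin{aligned}
f_1 &= f(r_1, s_1) = c_1 \\
f_2 &= f(r_2, s_2) = c_1 + c_2 + c_4 \\
f_3 &= f(r_3, s_3) = c_1 + c_3 + c_6 \\
f_4 &= f(r_4, s_4) = c_1 + c_2/2 + c_4/4 \\
f_5 &= f(r_5, s_5) = c_1 + c_2/2 + c_3/2 + c_4/4 + c_5/4 + c_6/4 \\
f_6 &= f(r_6, s_6) = c_1 + c_3/2 + c_6/4
\end{aligned}$$

This can be cast as $\vec{f} = \boldsymbol{A}\cdot\vec{c}$ where $\boldsymbol{A}$ is a 6x6 matrix:

$$\boldsymbol{A} = \begin{pmatrix}
1 & 0 & 0 & 0 & 0 & 0 \\
1 & 1 & 0 & 1 & 0 & 0 \\
1 & 0 & 1 & 0 & 0 & 1 \\
1 & 1/2 & 0 & 1/4 & 0 & 0 \\
1 & 1/2 & 1/2 & 1/4 & 1/4 & 1/4 \\
1 & 0 & 1/2 & 0 & 0 & 1/4
\end{pmatrix}$$

It is rather trivial to compute the inverse of this matrix:

$$\boldsymbol{A}^{-1} = \begin{pmatrix}
1 & 0 & 0 & 0 & 0 & 0 \\
-3 & -1 & 0 & 4 & 0 & 0 \\
-3 & 0 & -1 & 0 & 0 & 4 \\
2 & 2 & 0 & -4 & 0 & 0 \\
4 & 0 & 0 & -4 & 4 & -4 \\
2 & 0 & 2 & 0 & 0 & -4
\end{pmatrix}$$

In the end, one obtains:

$$\begin{aligned}
f(r, s) &= f_1 + (-3f_1 - f_2 + 4f_4)\,r + (-3f_1 - f_3 + 4f_6)\,s \\
&+ (2f_1 + 2f_2 - 4f_4)\,r^2 + (4f_1 - 4f_4 + 4f_5 - 4f_6)\,rs \\
&+ (2f_1 + 2f_3 - 4f_6)\,s^2 \\
&= \sum_{i=1}^{6} N_i(r, s)\,f_i
\end{aligned} \tag{77}$$

with

$$\begin{aligned}
N_1(r, s) &= 1 - 3r - 3s + 2r^2 + 4rs + 2s^2 \\
N_2(r, s) &= -r + 2r^2 \\
N_3(r, s) &= -s + 2s^2 \\
N_4(r, s) &= 4r - 4r^2 - 4rs \\
N_5(r, s) &= 4rs \\
N_6(r, s) &= 4s - 4rs - 4s^2
\end{aligned}$$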
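The matrix $\boldsymbol{A}$ and its inverse can be checked with exact rational arithmetic. The following sketch uses Python's `fractions` module; the helper names are hypothetical, and the node ordering is the one used for $f_1,\dots,f_6$ in the text.

```python
from fractions import Fraction as F

# nodes in the order used for f_1..f_6
NODES = [(F(0), F(0)), (F(1), F(0)), (F(0), F(1)),
         (F(1, 2), F(0)), (F(1, 2), F(1, 2)), (F(0), F(1, 2))]


def monomials(r, s):
    """The monomials multiplying c_1..c_6 in the basis polynomial."""
    return [F(1), r, s, r * r, r * s, s * s]


# row i of A holds the monomials evaluated at node i
A = [monomials(r, s) for r, s in NODES]

# inverse as given in the text
A_INV = [[1, 0, 0, 0, 0, 0],
         [-3, -1, 0, 4, 0, 0],
         [-3, 0, -1, 0, 0, 4],
         [2, 2, 0, -4, 0, 0],
         [4, 0, 0, -4, 4, -4],
         [2, 0, 2, 0, 0, -4]]


def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def shape_functions(r, s):
    """N_1..N_6 at (r, s): column j of A_INV weights the monomials."""
    m = monomials(r, s)
    return [sum(m[k] * A_INV[k][j] for k in range(6)) for j in range(6)]


if __name__ == "__main__":
    for row in matmul(A, A_INV):
        print([int(x) for x in row])
    print(shape_functions(F(1, 2), F(1, 2)))
```

The product printed by the script should be the 6x6 identity matrix, and `shape_functions` evaluated at a node returns 1 for the function attached to that node and 0 for the others.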

4.5.7 Cubic basis functions for triangles (P3)

|\ (r_0,s_0)=(0,0) (r_5,s_5)=(2/3,1/3)

| \ (r_1,s_1)=(1,0) (r_6,s_6)=(1/3,2/3)

7 6 (r_2,s_2)=(0,1) (r_7,s_7)=(0,2/3)

| \ (r_3,s_3)=(1/3,0) (r_8,s_8)=(0,1/3)

8 9 5 (r_4,s_4)=(2/3,0) (r_9,s_9)=(1/3,1/3)

| \

0==3==4==1

The basis polynomial is then

$$f(r, s) = c_1 + c_2r + c_3s + c_4r^2 + c_5rs + c_6s^2 + c_7r^3 + c_8r^2s + c_9rs^2 + c_{10}s^3$$

and the shape functions are:

$$N_0(r, s) = \frac{9}{2}(1-r-s)(1/3-r-s)(2/3-r-s) \tag{78}$$

$$N_1(r, s) = \frac{9}{2}r(r-1/3)(r-2/3) \tag{79}$$

$$N_2(r, s) = \frac{9}{2}s(s-1/3)(s-2/3) \tag{80}$$

$$N_3(r, s) = \frac{27}{2}(1-r-s)\,r\,(2/3-r-s) \tag{81}$$

$$N_4(r, s) = \frac{27}{2}(1-r-s)\,r\,(r-1/3) \tag{82}$$

$$N_5(r, s) = \frac{27}{2}rs(r-1/3) \tag{83}$$

$$N_6(r, s) = \frac{27}{2}rs(s-1/3) \tag{84}$$

$$N_7(r, s) = \frac{27}{2}(1-r-s)\,s\,(s-1/3) \tag{85}$$

$$N_8(r, s) = \frac{27}{2}(1-r-s)\,s\,(2/3-r-s) \tag{86}$$

$$N_9(r, s) = 27rs(1-r-s) \tag{87}$$
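The ten cubic shape functions can be verified node by node with exact rational arithmetic. In the sketch below (hypothetical names), $N_6$ is taken as $\frac{27}{2}rs(s-1/3)$, which is the form that satisfies the nodal property at node 6, located at $(1/3, 2/3)$.

```python
from fractions import Fraction as F

# (r, s) coordinates of nodes 0..9, as in the element diagram
NODES = [(F(0), F(0)), (F(1), F(0)), (F(0), F(1)),
         (F(1, 3), F(0)), (F(2, 3), F(0)),
         (F(2, 3), F(1, 3)), (F(1, 3), F(2, 3)),
         (F(0), F(2, 3)), (F(0), F(1, 3)), (F(1, 3), F(1, 3))]


def p3_basis(r, s):
    """Shape functions N0..N9 of the cubic triangle at (r, s)."""
    t = 1 - r - s  # third barycentric coordinate
    third, two_thirds = F(1, 3), F(2, 3)
    return [F(9, 2) * t * (third - r - s) * (two_thirds - r - s),
            F(9, 2) * r * (r - third) * (r - two_thirds),
            F(9, 2) * s * (s - third) * (s - two_thirds),
            F(27, 2) * t * r * (two_thirds - r - s),
            F(27, 2) * t * r * (r - third),
            F(27, 2) * r * s * (r - third),
            F(27, 2) * r * s * (s - third),  # assumed form of N6, see lead-in
            F(27, 2) * t * s * (s - third),
            F(27, 2) * t * s * (two_thirds - r - s),
            27 * r * s * t]


def nodal_property_holds():
    """True if N_i(node j) equals delta_ij for all i, j."""
    for j, (rn, sn) in enumerate(NODES):
        if p3_basis(rn, sn) != [1 if i == j else 0 for i in range(10)]:
            return False
    return True


if __name__ == "__main__":
    print(nodal_property_holds())
    print(sum(p3_basis(F(1, 5), F(1, 7))))
```

With `rs(r-2/3)` in place of the sixth mid-edge function, the value at node 6 would be $-1$ instead of $+1$, so the check above is sensitive to that term.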

4.6 Elements and basis functions in 3D

4.6.1 Linear basis functions in tetrahedra (P1)

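(r_0,s_0,t_0) = (0,0,0)

(r_1,s_1,t_1) = (1,0,0)

(r_2,s_2,t_2) = (0,1,0)

(r_3,s_3,t_3) = (0,0,1)

The basis polynomial is given by

$$f(r, s, t) = c_0 + c_1r + c_2s + c_3t$$

With $f_i$ the value at node $i-1$, we have

$$f_1 = f(r_1, s_1, t_1) = c_0 \tag{88}$$

$$f_2 = f(r_2, s_2, t_2) = c_0 + c_1 \tag{89}$$

$$f_3 = f(r_3, s_3, t_3) = c_0 + c_2 \tag{90}$$

$$f_4 = f(r_4, s_4, t_4) = c_0 + c_3 \tag{91}$$

which yields:

$$c_0 = f_1 \qquad c_1 = f_2 - f_1 \qquad c_2 = f_3 - f_1 \qquad c_3 = f_4 - f_1$$

$$\begin{aligned}
f(r, s, t) &= c_0 + c_1r + c_2s + c_3t \\
&= f_1 + (f_2 - f_1)\,r + (f_3 - f_1)\,s + (f_4 - f_1)\,t \\
&= f_1(1 - r - s - t) + f_2\,r + f_3\,s + f_4\,t \\
&= \sum_{i=1}^{4} N_i(r, s, t)\,f_i
\end{aligned}$$

Finally,

$$\begin{aligned}
N_1(r, s, t) &= 1 - r - s - t \\
N_2(r, s, t) &= r \\
N_3(r, s, t) &= s \\
N_4(r, s, t) &= t
\end{aligned}$$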

4.6.2 Triquadratic basis functions in 3D (Q2)

$$\begin{aligned}
N_1 &= 0.5r(r-1)\;\, 0.5s(s-1)\;\, 0.5t(t-1) \\
N_2 &= 0.5r(r+1)\;\, 0.5s(s-1)\;\, 0.5t(t-1) \\
N_3 &= 0.5r(r+1)\;\, 0.5s(s+1)\;\, 0.5t(t-1) \\
N_4 &= 0.5r(r-1)\;\, 0.5s(s+1)\;\, 0.5t(t-1) \\
N_5 &= 0.5r(r-1)\;\, 0.5s(s-1)\;\, 0.5t(t+1) \\
N_6 &= 0.5r(r+1)\;\, 0.5s(s-1)\;\, 0.5t(t+1) \\
N_7 &= 0.5r(r+1)\;\, 0.5s(s+1)\;\, 0.5t(t+1) \\
N_8 &= 0.5r(r-1)\;\, 0.5s(s+1)\;\, 0.5t(t+1) \\
N_9 &= (1-r^2)\;\, 0.5s(s-1)\;\, 0.5t(t-1) \\
N_{10} &= 0.5r(r+1)\;\, (1-s^2)\;\, 0.5t(t-1) \\
N_{11} &= (1-r^2)\;\, 0.5s(s+1)\;\, 0.5t(t-1) \\
N_{12} &= 0.5r(r-1)\;\, (1-s^2)\;\, 0.5t(t-1) \\
N_{13} &= (1-r^2)\;\, 0.5s(s-1)\;\, 0.5t(t+1) \\
N_{14} &= 0.5r(r+1)\;\, (1-s^2)\;\, 0.5t(t+1) \\
N_{15} &= (1-r^2)\;\, 0.5s(s+1)\;\, 0.5t(t+1) \\
N_{16} &= 0.5r(r-1)\;\, (1-s^2)\;\, 0.5t(t+1) \\
N_{17} &= 0.5r(r-1)\;\, 0.5s(s-1)\;\, (1-t^2) \\
N_{18} &= 0.5r(r+1)\;\, 0.5s(s-1)\;\, (1-t^2) \\
N_{19} &= 0.5r(r+1)\;\, 0.5s(s+1)\;\, (1-t^2) \\
N_{20} &= 0.5r(r-1)\;\, 0.5s(s+1)\;\, (1-t^2) \\
N_{21} &= (1-r^2)\;\, (1-s^2)\;\, 0.5t(t-1) \\
N_{22} &= (1-r^2)\;\, 0.5s(s-1)\;\, (1-t^2) \\
N_{23} &= 0.5r(r+1)\;\, (1-s^2)\;\, (1-t^2) \\
N_{24} &= (1-r^2)\;\, 0.5s(s+1)\;\, (1-t^2) \\
N_{25} &= 0.5r(r-1)\;\, (1-s^2)\;\, (1-t^2) \\
N_{26} &= (1-r^2)\;\, (1-s^2)\;\, 0.5t(t+1) \\
N_{27} &= (1-r^2)\;\, (1-s^2)\;\, (1-t^2)
\end{aligned}$$

5 Solving the ﬂow equations with the FEM

In the case of an incompressible flow, we have seen that the continuity (mass conservation) equation takes the simple form $\vec\nabla\cdot\vec{v} = 0$. In other words, flow takes place under the constraint that the divergence of its velocity field is exactly zero everywhere (solenoidal constraint), i.e. it is divergence free.

We see that the pressure in the momentum equation is then a degree of freedom which is needed to satisfy the incompressibility constraint (and it is not related to any constitutive equation) [42]. In other words the pressure is acting as a Lagrange multiplier of the incompressibility constraint.

Various approaches have been proposed in the literature to deal with the incompressibility constraint but we will only focus on the penalty method (section 5.4) and the so-called mixed finite element method (section 5.5).

5.1 strong and weak forms

The strong form consists of the governing equation and the boundary conditions, i.e. the mass, momentum

and energy conservation equations supplemented with Dirichlet and/or Neumann boundary conditions

on (parts of) the boundary.

To develop the ﬁnite element formulation, the partial diﬀerential equations must be restated in an

integral form called the weak form. In essence the PDEs are ﬁrst multiplied by an arbitrary function and

integrated over the domain.

5.2 Which velocity-pressure pair for Stokes?

The success of a mixed ﬁnite element formulation crucially depends on a proper choice of the local

interpolations of the velocity and the pressure.

5.2.1 The compatibility condition (or LBB condition)

5.2.2 The bi/tri-linear velocity - constant pressure element (Q1×P0)

5.2.3 The bi/tri-quadratic velocity - discontinuous linear pressure element (Q2×P−1)

5.2.4 The bi/tri-quadratic velocity - bi/tri-linear pressure element (Q2×Q1)

This element, implemented in penalised form, is discussed in [12] and the follow-up paper [13]. CHECK

Biquadratic velocities, bilinear pressure. See Hood and Taylor. The element satisﬁes the inf-sup

condition [66]p215.

5.2.5 The stabilised bi/tri-linear velocity - bi/tri-linear pressure element (Q1×Q1-stab)

5.2.6 The MINI triangular element (P1+×P1)

5.2.7 The quadratic velocity - linear pressure triangle (P2×P1)

From [120]. Taylor-Hood elements (Taylor and Hood 1973) are characterized by the fact that the

pressure is continuous in the region Ω. A typical example is the quadratic triangle (P2P1 element).

In this element the velocity is approximated by a quadratic polynomial and the pressure by a linear

polynomial. One can easily verify that both approximations are continuous over the element boundaries.

It can be shown, Segal (1979), that this element is admissible if at least 3 elements are used. The

quadrilateral counterpart of this triangle is the Q2×Q1element.

5.2.8 The Crouzeix-Raviart triangle (P2+×P−1)

From [36]: seven-node Crouzeix-Raviart triangle with quadratic velocity shape functions enhanced by

a cubic bubble function and discontinuous linear interpolation for the pressure ﬁeld [e.g., Cuvelier et al.,

1986]. This element is stable and no additional stabilization techniques are required [Elman et al., 2005].

From [120]. These elements are characterized by a discontinuous pressure; discontinuous on element

boundaries. For output purposes (printing, plotting etc.) these discontinuous pressures are averaged in

vertices for all the adjoining elements.

The most simple Crouzeix-Raviart element is the non-conforming linear triangle with constant pressure

(P1×P0).

5.3 Other elements

P1P0

P1P1

Q2Q2

P2P2

5.4 The penalty approach for viscous ﬂow

In order to impose the incompressibility constraint, two widely used procedures are available, namely

the Lagrange multiplier method and the penalty method [9, 66]. The latter is implemented in elefant,

which allows for the elimination of the pressure variable from the momentum equation (resulting in a

reduction of the matrix size).

Mathematical details on the origin and validity of the penalty approach applied to the Stokes problem

can for instance be found in [35], [108] or [62].

The penalty formulation of the mass conservation equation is based on a relaxation of the incompressibility constraint and writes

$$\vec\nabla\cdot\vec\nu + \frac{p}{\lambda} = 0 \tag{92}$$

where $\lambda$ is the penalty parameter, which can be interpreted as (and has the same dimension as) a bulk viscosity. This is equivalent to saying that the material is weakly compressible. It can be shown that if one chooses $\lambda$ to be a sufficiently large number, the continuity equation $\vec\nabla\cdot\vec\nu = 0$ will be approximately satisfied in the finite element solution. The value of $\lambda$ is often recommended to be 6 to 7 orders of magnitude larger than the shear viscosity [44, 67].

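Equation (92) gives $p = -\lambda\,\vec\nabla\cdot\vec\nu$, which can be used to eliminate the pressure in Eq. (??) so that the mass and momentum conservation equations fuse to become:

$$\vec\nabla\cdot\left(2\eta\,\dot{\varepsilon}(\vec\nu)\right) + \lambda\vec\nabla(\vec\nabla\cdot\vec\nu) + \rho\vec{g} = 0 \tag{93}$$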

[90] have established the equivalence for incompressible problems between the reduced integration of

the penalty term and a mixed Finite Element approach if the pressure nodes coincide with the integration

points of the reduced rule.

In the end, the elimination of the pressure unknown in the Stokes equations replaces the original saddle-point Stokes problem [10] by an elliptic problem, which leads to a symmetric positive definite (SPD) FEM matrix. This is the major benefit of the penalised approach over the full indefinite solver with the velocity-pressure variables. Indeed, the SPD character of the matrix lends itself to efficient solving strategies and is less memory-demanding, since it is sufficient to store only the upper half of the matrix including the diagonal [59].

list codes which use this approach

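The stress tensor $\sigma$ is symmetric (i.e. $\sigma_{ij} = \sigma_{ji}$). For simplicity I will now focus on a Stokes flow in two dimensions.

Since the penalty formulation is only valid for incompressible flows, then $\dot{\varepsilon} = \dot{\varepsilon}^d$ so that the $d$ superscript is omitted in what follows. The stress tensor can also be cast in vector format, using $p = -\lambda(\dot\varepsilon_{xx} + \dot\varepsilon_{yy})$:

$$
\begin{aligned}
\begin{pmatrix} \sigma_{xx} \\ \sigma_{yy} \\ \sigma_{xy} \end{pmatrix}
&= \begin{pmatrix} -p \\ -p \\ 0 \end{pmatrix}
+ 2\eta \begin{pmatrix} \dot\varepsilon_{xx} \\ \dot\varepsilon_{yy} \\ \dot\varepsilon_{xy} \end{pmatrix}
= \lambda \begin{pmatrix} \dot\varepsilon_{xx} + \dot\varepsilon_{yy} \\ \dot\varepsilon_{xx} + \dot\varepsilon_{yy} \\ 0 \end{pmatrix}
+ 2\eta \begin{pmatrix} \dot\varepsilon_{xx} \\ \dot\varepsilon_{yy} \\ \dot\varepsilon_{xy} \end{pmatrix} \\
&= \left[ \lambda \underbrace{\begin{pmatrix} 1 & 1 & 0 \\ 1 & 1 & 0 \\ 0 & 0 & 0 \end{pmatrix}}_{K}
+ \eta \underbrace{\begin{pmatrix} 2 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 1 \end{pmatrix}}_{C} \right]
\cdot
\begin{pmatrix} \frac{\partial u}{\partial x} \\ \frac{\partial v}{\partial y} \\ \frac{\partial u}{\partial y} + \frac{\partial v}{\partial x} \end{pmatrix}
\end{aligned}
$$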

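Remember that

$$
\frac{\partial u}{\partial x} = \sum_{i=1}^{4} \frac{\partial N_i}{\partial x} u_i
\qquad
\frac{\partial v}{\partial y} = \sum_{i=1}^{4} \frac{\partial N_i}{\partial y} v_i
\qquad
\frac{\partial u}{\partial y} + \frac{\partial v}{\partial x} = \sum_{i=1}^{4} \frac{\partial N_i}{\partial y} u_i + \sum_{i=1}^{4} \frac{\partial N_i}{\partial x} v_i
$$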

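so that

$$
\begin{pmatrix} \frac{\partial u}{\partial x} \\ \frac{\partial v}{\partial y} \\ \frac{\partial u}{\partial y} + \frac{\partial v}{\partial x} \end{pmatrix}
=
\underbrace{\begin{pmatrix}
\partial_x N_1 & 0 & \partial_x N_2 & 0 & \partial_x N_3 & 0 & \partial_x N_4 & 0 \\
0 & \partial_y N_1 & 0 & \partial_y N_2 & 0 & \partial_y N_3 & 0 & \partial_y N_4 \\
\partial_y N_1 & \partial_x N_1 & \partial_y N_2 & \partial_x N_2 & \partial_y N_3 & \partial_x N_3 & \partial_y N_4 & \partial_x N_4
\end{pmatrix}}_{B}
\cdot
\underbrace{\begin{pmatrix} u_1 \\ v_1 \\ u_2 \\ v_2 \\ u_3 \\ v_3 \\ u_4 \\ v_4 \end{pmatrix}}_{\vec{V}}
$$

where $\partial_x N_i$ and $\partial_y N_i$ stand for $\partial N_i/\partial x$ and $\partial N_i/\partial y$.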

Finally,

$$
\vec\sigma = \begin{pmatrix} \sigma_{xx} \\ \sigma_{yy} \\ \sigma_{xy} \end{pmatrix} = (\lambda K + \eta C)\cdot B\cdot\vec{V}
$$

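We will now establish the weak form of the momentum conservation equation. We start again from

$$\vec\nabla\cdot\sigma + \vec{b} = \vec{0}$$

For the $N_i$'s 'regular enough', we can write:

$$\int_{\Omega_e} N_i\,\vec\nabla\cdot\sigma\, d\Omega + \int_{\Omega_e} N_i\,\vec{b}\, d\Omega = \vec{0}$$

We can integrate by parts and drop the surface term (see footnote 2):

$$\int_{\Omega_e} \vec\nabla N_i\cdot\sigma\, d\Omega = \int_{\Omega_e} N_i\,\vec{b}\, d\Omega$$

or,

$$
\int_{\Omega_e}
\begin{pmatrix} \partial_x N_i & 0 & \partial_y N_i \\ 0 & \partial_y N_i & \partial_x N_i \end{pmatrix}
\cdot
\begin{pmatrix} \sigma_{xx} \\ \sigma_{yy} \\ \sigma_{xy} \end{pmatrix} d\Omega
= \int_{\Omega_e} N_i\,\vec{b}\, d\Omega
$$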

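Let $i = 1,2,3,4$ and stack the resulting four equations on top of one another:

$$
\int_{\Omega_e}
\begin{pmatrix} \partial_x N_1 & 0 & \partial_y N_1 \\ 0 & \partial_y N_1 & \partial_x N_1 \end{pmatrix}
\cdot
\begin{pmatrix} \sigma_{xx} \\ \sigma_{yy} \\ \sigma_{xy} \end{pmatrix} d\Omega
= \int_{\Omega_e} N_1 \begin{pmatrix} b_x \\ b_y \end{pmatrix} d\Omega \tag{94}
$$

$$
\int_{\Omega_e}
\begin{pmatrix} \partial_x N_2 & 0 & \partial_y N_2 \\ 0 & \partial_y N_2 & \partial_x N_2 \end{pmatrix}
\cdot
\begin{pmatrix} \sigma_{xx} \\ \sigma_{yy} \\ \sigma_{xy} \end{pmatrix} d\Omega
= \int_{\Omega_e} N_2 \begin{pmatrix} b_x \\ b_y \end{pmatrix} d\Omega \tag{95}
$$

$$
\int_{\Omega_e}
\begin{pmatrix} \partial_x N_3 & 0 & \partial_y N_3 \\ 0 & \partial_y N_3 & \partial_x N_3 \end{pmatrix}
\cdot
\begin{pmatrix} \sigma_{xx} \\ \sigma_{yy} \\ \sigma_{xy} \end{pmatrix} d\Omega
= \int_{\Omega_e} N_3 \begin{pmatrix} b_x \\ b_y \end{pmatrix} d\Omega \tag{96}
$$

$$
\int_{\Omega_e}
\begin{pmatrix} \partial_x N_4 & 0 & \partial_y N_4 \\ 0 & \partial_y N_4 & \partial_x N_4 \end{pmatrix}
\cdot
\begin{pmatrix} \sigma_{xx} \\ \sigma_{yy} \\ \sigma_{xy} \end{pmatrix} d\Omega
= \int_{\Omega_e} N_4 \begin{pmatrix} b_x \\ b_y \end{pmatrix} d\Omega \tag{97}
$$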

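We easily recognize $B^T$ inside the integrals! Let us define

$$\vec{N}_b^T = (N_1 b_x,\ N_1 b_y,\ \dots,\ N_4 b_x,\ N_4 b_y)$$

then we can write

$$
\int_{\Omega_e} B^T\cdot
\begin{pmatrix} \sigma_{xx} \\ \sigma_{yy} \\ \sigma_{xy} \end{pmatrix} d\Omega
= \int_{\Omega_e} \vec{N}_b\, d\Omega
$$

and finally:

$$\int_{\Omega_e} B^T\cdot[\lambda K + \eta C]\cdot B\cdot\vec{V}\, d\Omega = \int_{\Omega_e} \vec{N}_b\, d\Omega$$

(Footnote 2: We will come back to this at a later stage.)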

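Since $\vec{V}$ contains the velocities at the corners, it does not depend on the $x$ or $y$ coordinates, so it can be taken outside of the integral:

$$
\underbrace{\left( \int_{\Omega_e} B^T\cdot[\lambda K + \eta C]\cdot B\, d\Omega \right)}_{A_{el}\ (8\times 8)}
\cdot \underbrace{\vec{V}}_{(8\times 1)}
= \underbrace{\int_{\Omega_e} \vec{N}_b\, d\Omega}_{B_{el}\ (8\times 1)}
$$

or,

$$
\Bigg( \underbrace{\int_{\Omega_e} \lambda B^T\cdot K\cdot B\, d\Omega}_{A^{\lambda}_{el}\ (8\times 8)}
+ \underbrace{\int_{\Omega_e} \eta B^T\cdot C\cdot B\, d\Omega}_{A^{\eta}_{el}\ (8\times 8)} \Bigg)
\cdot \underbrace{\vec{V}}_{(8\times 1)}
= \underbrace{\int_{\Omega_e} \vec{N}_b\, d\Omega}_{B_{el}\ (8\times 1)}
$$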

INTEGRATION - MAPPING

reduced integration [67]

1. partition domain Ω into elements Ωe, e = 1, ..., nel.

2. loop over elements and for each element compute Ael, Bel.

3. a node belongs to several elements → need to assemble Ael and Bel in A, B.

4. apply boundary conditions.

5. solve system: x = A^{-1}·B.

6. visualise/analyse x.
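Step 2 of the list above can be sketched for a single element as follows. This is a minimal illustration and not the code of any particular fieldstone: it assumes bilinear (Q1) basis functions on the reference square [-1,1]² with counter-clockwise node numbering, a 2×2 Gauss rule for the viscous term and a one-point (reduced) rule for the penalty term, and the function names are hypothetical.

```python
import numpy as np

K = np.array([[1.0, 1.0, 0.0], [1.0, 1.0, 0.0], [0.0, 0.0, 0.0]])
C = np.array([[2.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 1.0]])

def dn_q1(r, s):
    """Derivatives of the four Q1 basis functions with respect to r and s."""
    dndr = 0.25 * np.array([-(1 - s), (1 - s), (1 + s), -(1 + s)])
    dnds = 0.25 * np.array([-(1 - r), -(1 + r), (1 + r), (1 - r)])
    return dndr, dnds

def b_matrix(r, s, xy):
    """Gradient matrix B (3x8) and Jacobian determinant at the point (r, s)."""
    dndr, dnds = dn_q1(r, s)
    jac = np.array([dndr @ xy, dnds @ xy])           # [[dx/dr, dy/dr], [dx/ds, dy/ds]]
    dndx, dndy = np.linalg.solve(jac, np.array([dndr, dnds]))
    b = np.zeros((3, 8))
    b[0, 0::2] = dndx                                # du/dx
    b[1, 1::2] = dndy                                # dv/dy
    b[2, 0::2] = dndy                                # du/dy + dv/dx
    b[2, 1::2] = dndx
    return b, np.linalg.det(jac)

def elemental_matrix(xy, eta, lam):
    """A_el = A_el^eta (2x2 Gauss points) + A_el^lambda (1 point, reduced)."""
    a_el = np.zeros((8, 8))
    g = 1.0 / np.sqrt(3.0)
    for r in (-g, g):
        for s in (-g, g):                            # all four weights equal 1
            b, detj = b_matrix(r, s, xy)
            a_el += eta * (b.T @ C @ b) * detj
    b, detj = b_matrix(0.0, 0.0, xy)                 # single point, weight 2*2
    a_el += lam * (b.T @ K @ b) * detj * 4.0
    return a_el

xy = np.array([[0.0, 0.0], [1.0, 0.0], [1.0, 1.0], [0.0, 1.0]])   # unit square
a_el = elemental_matrix(xy, eta=1.0, lam=1.0e6)
translation = np.array([1.0, 0.0] * 4)               # rigid motion in x
```

The resulting matrix is symmetric and positive semi-definite; rigid-body motions such as `translation` lie in its null space until boundary conditions are applied.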

5.5 The mixed FEM for viscous ﬂow

5.5.1 in three dimensions

In what follows the ﬂow is assumed to be incompressible, isoviscous and isothermal.

The methodology to derive the discretised equations of the mixed system is quite similar to the one

we have used in the case of the penalty formulation. The big diﬀerence comes from the fact that we are

now solving for both velocity and pressure at the same time, and that we therefore must solve the mass

and momentum conservation equations together. As before, velocity inside an element is given by



νh(r) =

i=1

Nν

i(r)

νi(98)

where Nv

iare the polynomial basis functions for the velocity, and the summation runs over the mvnodes

composing the element. A similar expression is used for pressure:

ph(r) =

i=1

i(r)pi(99)

Note that the velocity is a vector of size while pressure (and temperature) is a scalar. There are then ndofv

velocity degrees of freedom per node and ndofppressure degrees of freedom. It is also very important

to remember that the numbers of velocity nodes and pressure nodes for a given element are more often

than not diﬀerent and that velocity and pressure nodes need not be colocated. Indeed, unless co-called

’stabilised elements’ are used, we have mv> mp, which means that the polynomial order of the velocity

ﬁeld is higher than the polynomial order of the pressure ﬁeld (usually by value 1).

insert here link(s) to manual and literature

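Other notations are sometimes used for Eqs. (98) and (99):

$$u^h(\vec{r}) = \vec{N}^\nu\cdot\vec{u} \qquad v^h(\vec{r}) = \vec{N}^\nu\cdot\vec{v} \qquad w^h(\vec{r}) = \vec{N}^\nu\cdot\vec{w} \qquad p^h(\vec{r}) = \vec{N}^p\cdot\vec{p} \tag{100}$$

where $\vec\nu = (u, v, w)$ and $\vec{N}^\nu$ is the vector containing all basis functions evaluated at location $\vec{r}$:

$$\vec{N}^\nu = \left( N_1^\nu(\vec{r}),\ N_2^\nu(\vec{r}),\ N_3^\nu(\vec{r}),\ \dots,\ N_{m_v}^\nu(\vec{r}) \right) \tag{101}$$

$$\vec{N}^p = \left( N_1^p(\vec{r}),\ N_2^p(\vec{r}),\ N_3^p(\vec{r}),\ \dots,\ N_{m_p}^p(\vec{r}) \right) \tag{102}$$

and with

$$\vec{u} = (u_1, u_2, u_3, \dots, u_{m_v}) \tag{103}$$

$$\vec{v} = (v_1, v_2, v_3, \dots, v_{m_v}) \tag{104}$$

$$\vec{w} = (w_1, w_2, w_3, \dots, w_{m_v}) \tag{105}$$

$$\vec{p} = (p_1, p_2, p_3, \dots, p_{m_p}) \tag{106}$$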

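We will now establish the weak form of the momentum conservation equation. We start again from

$$\vec\nabla\cdot\sigma + \vec{b} = \vec{0} \tag{107}$$

$$\vec\nabla\cdot\vec{v} = 0 \tag{108}$$

For the $N_i^\nu$'s and $N_i^p$'s 'regular enough', we can write:

$$\int_{\Omega_e} N_i^\nu\,\vec\nabla\cdot\sigma\, d\Omega + \int_{\Omega_e} N_i^\nu\,\vec{b}\, d\Omega = \vec{0} \tag{109}$$

$$\int_{\Omega_e} N_i^p\,\vec\nabla\cdot\vec{v}\, d\Omega = 0 \tag{110}$$

We can integrate by parts and drop the surface term (see footnote 3):

$$\int_{\Omega_e} \vec\nabla N_i^\nu\cdot\sigma\, d\Omega = \int_{\Omega_e} N_i^\nu\,\vec{b}\, d\Omega \tag{111}$$

$$\int_{\Omega_e} N_i^p\,\vec\nabla\cdot\vec{v}\, d\Omega = 0 \tag{112}$$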

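or,

$$
\int_{\Omega_e}
\begin{pmatrix}
\partial_x N_i^\nu & 0 & 0 & \partial_y N_i^\nu & \partial_z N_i^\nu & 0 \\
0 & \partial_y N_i^\nu & 0 & \partial_x N_i^\nu & 0 & \partial_z N_i^\nu \\
0 & 0 & \partial_z N_i^\nu & 0 & \partial_x N_i^\nu & \partial_y N_i^\nu
\end{pmatrix}
\cdot
\begin{pmatrix} \sigma_{xx} \\ \sigma_{yy} \\ \sigma_{zz} \\ \sigma_{xy} \\ \sigma_{xz} \\ \sigma_{yz} \end{pmatrix}
d\Omega = \int_{\Omega_e} N_i^\nu\,\vec{b}\, d\Omega \tag{113}
$$

(Footnote 3: We will come back to this at a later stage.)

As before (see section XXX) the above equation can ultimately be written:

$$
\int_{\Omega_e} B^T\cdot
\begin{pmatrix} \sigma_{xx} \\ \sigma_{yy} \\ \sigma_{zz} \\ \sigma_{xy} \\ \sigma_{xz} \\ \sigma_{yz} \end{pmatrix}
d\Omega = \int_{\Omega_e} \vec{N}_b\, d\Omega \tag{114}
$$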

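We have previously established that the strain rate vector $\dot{\vec\varepsilon}$ is:

$$
\dot{\vec\varepsilon} =
\begin{pmatrix}
\frac{\partial u}{\partial x} \\ \frac{\partial v}{\partial y} \\ \frac{\partial w}{\partial z} \\
\frac{\partial u}{\partial y} + \frac{\partial v}{\partial x} \\
\frac{\partial u}{\partial z} + \frac{\partial w}{\partial x} \\
\frac{\partial v}{\partial z} + \frac{\partial w}{\partial y}
\end{pmatrix}
=
\begin{pmatrix}
\sum_i \partial_x N_i^\nu\, u_i \\
\sum_i \partial_y N_i^\nu\, v_i \\
\sum_i \partial_z N_i^\nu\, w_i \\
\sum_i (\partial_y N_i^\nu\, u_i + \partial_x N_i^\nu\, v_i) \\
\sum_i (\partial_z N_i^\nu\, u_i + \partial_x N_i^\nu\, w_i) \\
\sum_i (\partial_z N_i^\nu\, v_i + \partial_y N_i^\nu\, w_i)
\end{pmatrix}
$$

$$
=
\underbrace{\begin{pmatrix}
\partial_x N_1^\nu & 0 & 0 & \cdots & \partial_x N_{m_v}^\nu & 0 & 0 \\
0 & \partial_y N_1^\nu & 0 & \cdots & 0 & \partial_y N_{m_v}^\nu & 0 \\
0 & 0 & \partial_z N_1^\nu & \cdots & 0 & 0 & \partial_z N_{m_v}^\nu \\
\partial_y N_1^\nu & \partial_x N_1^\nu & 0 & \cdots & \partial_y N_{m_v}^\nu & \partial_x N_{m_v}^\nu & 0 \\
\partial_z N_1^\nu & 0 & \partial_x N_1^\nu & \cdots & \partial_z N_{m_v}^\nu & 0 & \partial_x N_{m_v}^\nu \\
0 & \partial_z N_1^\nu & \partial_y N_1^\nu & \cdots & 0 & \partial_z N_{m_v}^\nu & \partial_y N_{m_v}^\nu
\end{pmatrix}}_{B}
\cdot
\underbrace{\begin{pmatrix} u_1 \\ v_1 \\ w_1 \\ \vdots \\ u_{m_v} \\ v_{m_v} \\ w_{m_v} \end{pmatrix}}_{\vec{V}}
\tag{115}
$$

where the sums run over $i = 1, \dots, m_v$.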

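or, $\dot{\vec\varepsilon} = B\cdot\vec{V}$ where $B$ is the gradient matrix and $\vec{V}$ is the vector of all velocity degrees of freedom for the element. The matrix $B$ is then of size $6 \times (ndofv \cdot m_v)$ and the vector $\vec{V}$ is $ndofv \cdot m_v$ long, with $ndofv = 3$ in three dimensions. We have

$$\sigma_{xx} = -p + 2\eta\,\dot\varepsilon^d_{xx} \tag{116}$$

$$\sigma_{yy} = -p + 2\eta\,\dot\varepsilon^d_{yy} \tag{117}$$

$$\sigma_{zz} = -p + 2\eta\,\dot\varepsilon^d_{zz} \tag{118}$$

$$\sigma_{xy} = 2\eta\,\dot\varepsilon^d_{xy} \tag{119}$$

$$\sigma_{xz} = 2\eta\,\dot\varepsilon^d_{xz} \tag{120}$$

$$\sigma_{yz} = 2\eta\,\dot\varepsilon^d_{yz} \tag{121}$$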

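Since we here only consider incompressible flow, we have $\dot{\varepsilon}^d = \dot{\varepsilon}$ so

$$
\vec\sigma = -\begin{pmatrix} 1 \\ 1 \\ 1 \\ 0 \\ 0 \\ 0 \end{pmatrix} p + C\cdot\dot{\vec\varepsilon}
= -\begin{pmatrix} 1 \\ 1 \\ 1 \\ 0 \\ 0 \\ 0 \end{pmatrix} \vec{N}^p\cdot\vec{P} + C\cdot B\cdot\vec{V} \tag{122}
$$

with

$$
C = \eta \begin{pmatrix}
2 & 0 & 0 & 0 & 0 & 0 \\
0 & 2 & 0 & 0 & 0 & 0 \\
0 & 0 & 2 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 0 & 0 & 1
\end{pmatrix}
\qquad
\dot{\vec\varepsilon} = \begin{pmatrix} \dot\varepsilon_{xx} \\ \dot\varepsilon_{yy} \\ \dot\varepsilon_{zz} \\ 2\dot\varepsilon_{xy} \\ 2\dot\varepsilon_{xz} \\ 2\dot\varepsilon_{yz} \end{pmatrix} \tag{123}
$$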

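Let us define the matrix $N_p$ of size $6 \times m_p$:

$$
N_p = \begin{pmatrix} 1 \\ 1 \\ 1 \\ 0 \\ 0 \\ 0 \end{pmatrix} \vec{N}^p
= \begin{pmatrix} \vec{N}^p \\ \vec{N}^p \\ \vec{N}^p \\ 0 \\ 0 \\ 0 \end{pmatrix} \tag{124}
$$

so that

$$\vec\sigma = -N_p\cdot\vec{P} + C\cdot B\cdot\vec{V} \tag{125}$$

finally

$$\int_{\Omega_e} B^T\cdot[-N_p\cdot\vec{P} + C\cdot B\cdot\vec{V}]\, d\Omega = \int_{\Omega_e} \vec{N}_b\, d\Omega \tag{126}$$

or,

$$
\underbrace{\left( -\int_{\Omega_e} B^T\cdot N_p\, d\Omega \right)}_{G}\cdot\vec{P}
+ \underbrace{\left( \int_{\Omega_e} B^T\cdot C\cdot B\, d\Omega \right)}_{K}\cdot\vec{V}
= \underbrace{\int_{\Omega_e} \vec{N}_b\, d\Omega}_{\vec{f}} \tag{127}
$$

where the matrix $K$ is of size $(m_v \cdot ndofv \times m_v \cdot ndofv)$, and the matrix $G$ is of size $(m_v \cdot ndofv \times m_p \cdot ndofp)$.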

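Turning now to the mass conservation equation:

$$
\begin{aligned}
\vec{0} &= \int_{\Omega_e} \vec{N}^p\,\vec\nabla\cdot\vec{v}\, d\Omega \\
&= \int_{\Omega_e} \vec{N}^p \sum_{i=1}^{m_v} \left( \partial_x N_i^\nu\, u_i + \partial_y N_i^\nu\, v_i + \partial_z N_i^\nu\, w_i \right) d\Omega \\
&= \int_{\Omega_e}
\begin{pmatrix}
N_1^p \left( \sum_i \partial_x N_i^\nu\, u_i + \sum_i \partial_y N_i^\nu\, v_i + \sum_i \partial_z N_i^\nu\, w_i \right) \\
N_2^p \left( \sum_i \partial_x N_i^\nu\, u_i + \sum_i \partial_y N_i^\nu\, v_i + \sum_i \partial_z N_i^\nu\, w_i \right) \\
\vdots \\
N_{m_p}^p \left( \sum_i \partial_x N_i^\nu\, u_i + \sum_i \partial_y N_i^\nu\, v_i + \sum_i \partial_z N_i^\nu\, w_i \right)
\end{pmatrix} d\Omega \\
&= \int_{\Omega_e}
\underbrace{\begin{pmatrix}
N_1^p & N_1^p & N_1^p & 0 & 0 & 0 \\
N_2^p & N_2^p & N_2^p & 0 & 0 & 0 \\
\vdots & \vdots & \vdots & \vdots & \vdots & \vdots \\
N_{m_p}^p & N_{m_p}^p & N_{m_p}^p & 0 & 0 & 0
\end{pmatrix}}_{N_p^T}
\cdot\,\dot{\vec\varepsilon}\, d\Omega \\
&= \left( \int_{\Omega_e} N_p^T\cdot B\, d\Omega \right)\cdot\vec{V} \\
&= -G_e^T\cdot\vec{V}
\end{aligned}
\tag{128}
$$

where $\dot{\vec\varepsilon} = B\cdot\vec{V}$ is the strain rate vector of Eq. (115) and the sums run over $i = 1, \dots, m_v$.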

say something about minus sign?

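Ultimately we obtain the following system for each element:

$$
\begin{pmatrix} K_e & G_e \\ -G_e^T & 0 \end{pmatrix}
\cdot
\begin{pmatrix} \vec{V} \\ \vec{P} \end{pmatrix}
=
\begin{pmatrix} \vec{f}_e \\ 0 \end{pmatrix}
$$

Such a matrix is generated for each element and must then be assembled into the global F.E. matrix. Note that in this case the elemental Stokes matrix is not symmetric (its off-diagonal blocks have opposite signs). One can also define the following symmetric modified Stokes matrix:

$$
\begin{pmatrix} K_e & G_e \\ G_e^T & 0 \end{pmatrix}
\cdot
\begin{pmatrix} \vec{V} \\ \vec{P} \end{pmatrix}
=
\begin{pmatrix} \vec{f}_e \\ 0 \end{pmatrix}
$$

This matrix is symmetric, but indefinite. It is non-singular if ker(G^T) = 0, which is the case if the compatibility condition holds.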

CHECK: Matrix K is the viscosity matrix. Its size is (ndofv∗Nv)×(ndofv∗Nv) where ndofv is the number of velocity degrees of freedom per node (typically 1, 2 or 3) and Nv is the number of velocity nodes. The size of matrix G is (ndofv∗Nv)×(ndofp∗Np) where ndofp (= 1) is the number of pressure degrees of freedom per node and Np is the number of pressure nodes. Conversely, the size of matrix G^T is (ndofp∗Np)×(ndofv∗Nv). The size of the global FE matrix is N = ndofv∗Nv + ndofp∗Np. Note that matrix K is analogous to a discrete Laplacian operator, matrix G to a discrete gradient operator, and matrix G^T to a discrete divergence operator.
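The 'symmetric but indefinite' character of the modified Stokes matrix can be illustrated with stand-in blocks. In the sketch below K is a random symmetric positive definite matrix and G a random full-rank matrix; they are assumptions made for the illustration and do not come from an actual discretisation, and the sizes `n_v` and `n_p` are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(0)
n_v, n_p = 8, 3                          # hypothetical numbers of velocity / pressure dofs

R = rng.standard_normal((n_v, n_v))
K = R @ R.T + n_v * np.eye(n_v)          # SPD stand-in for the viscosity block
G = rng.standard_normal((n_v, n_p))      # stand-in gradient block, full column rank

stokes_sym = np.block([[K, G], [G.T, np.zeros((n_p, n_p))]])
eigenvalues = np.linalg.eigvalsh(stokes_sym)
n_pos = int(np.sum(eigenvalues > 0))     # one positive eigenvalue per velocity dof
n_neg = int(np.sum(eigenvalues < 0))     # one negative eigenvalue per pressure dof

# Making two columns of G identical mimics a pressure mode that the velocity
# space cannot control: the block matrix then becomes singular.
G_bad = G.copy()
G_bad[:, 2] = G_bad[:, 1]
stokes_bad = np.block([[K, G_bad], [G_bad.T, np.zeros((n_p, n_p))]])
rank_bad = int(np.linalg.matrix_rank(stokes_bad))
```

Eigenvalues of both signs are what rules out solvers that require a positive definite matrix, such as a plain Cholesky factorisation.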

On the physical dimensions of the Stokes matrix blocks. We start from the Stokes equations:

$$-\vec\nabla p + \vec\nabla\cdot(2\eta\dot{\varepsilon}) + \rho\vec{g} = 0 \tag{129}$$

$$\vec\nabla\cdot\vec\nu = 0 \tag{130}$$

The dimensions of the terms in the first equation are $M L^{-2} T^{-2}$. The blocks $K$ and $G$ stem from the weak form, which is obtained by multiplying the strong form equations by the (dimensionless) basis functions and integrating over the domain, so that it follows that

$$[K\cdot\vec{V}] = [G\cdot\vec{P}] = [\vec{f}] = M L^{-2} T^{-2} L^3 = M L T^{-2}$$

Since $[\vec{V}] = L T^{-1}$ and $[\vec{P}] = M L^{-1} T^{-2}$, we can then easily deduce:

$$[K] = M T^{-1} \qquad [G] = L^2$$

On elemental level mass balance. Note that in what is above no assumption has been made about

whether the pressure basis functions are continuous or discontinuous from one element to another.

Indeed, as mentioned in [60], since the weak formulation of the momentum equation involves inte-

gration by parts of 

∇p, the resulting weak form contains no derivatives of pressure. This introduces the

possibility of approximating it by functions (piecewise polynomials, of course) that are not C0-continuous,

and indeed this has been done and is quite popular/useful.

It is then worth noting that only discontinuous pressure elements ensure an element-level mass balance [60]: if for instance $N_i^p$ is piecewise-constant on element $e$ (of value 1), the elemental weak form of the mass conservation equation is

$$\int_{\Omega_e} N_i^p\,\vec\nabla\cdot\vec\nu\, d\Omega = \int_{\Omega_e} \vec\nabla\cdot\vec\nu\, d\Omega = \int_{\Gamma_e} \vec{n}\cdot\vec\nu\, d\Gamma = 0$$

One potentially unwelcome consequence of using discontinuous pressure elements is that they do not

possess uniquely deﬁned pressure on the element boundaries; they are dual valued there, and often

multi-valued at certain velocity nodes.

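On the C matrix. The relationship between deviatoric stress and deviatoric strain rate tensor is

$$\tau = 2\eta\,\dot{\varepsilon}^d \tag{131}$$

$$= 2\eta \left( \dot{\varepsilon} - \frac{1}{3}(\vec\nabla\cdot\vec{v})\,\mathbf{1} \right) \tag{132}$$

$$
= 2\eta \left[
\begin{pmatrix}
\dot\varepsilon_{xx} & \dot\varepsilon_{xy} & \dot\varepsilon_{xz} \\
\dot\varepsilon_{yx} & \dot\varepsilon_{yy} & \dot\varepsilon_{yz} \\
\dot\varepsilon_{zx} & \dot\varepsilon_{zy} & \dot\varepsilon_{zz}
\end{pmatrix}
- \frac{1}{3}(\dot\varepsilon_{xx} + \dot\varepsilon_{yy} + \dot\varepsilon_{zz})
\begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}
\right] \tag{133}
$$

$$
= \frac{2}{3}\eta
\begin{pmatrix}
2\dot\varepsilon_{xx} - \dot\varepsilon_{yy} - \dot\varepsilon_{zz} & 3\dot\varepsilon_{xy} & 3\dot\varepsilon_{xz} \\
3\dot\varepsilon_{yx} & -\dot\varepsilon_{xx} + 2\dot\varepsilon_{yy} - \dot\varepsilon_{zz} & 3\dot\varepsilon_{yz} \\
3\dot\varepsilon_{zx} & 3\dot\varepsilon_{zy} & -\dot\varepsilon_{xx} - \dot\varepsilon_{yy} + 2\dot\varepsilon_{zz}
\end{pmatrix} \tag{134}
$$

so that

$$
\vec\tau = \frac{2}{3}\eta
\begin{pmatrix}
2\dot\varepsilon_{xx} - \dot\varepsilon_{yy} - \dot\varepsilon_{zz} \\
-\dot\varepsilon_{xx} + 2\dot\varepsilon_{yy} - \dot\varepsilon_{zz} \\
-\dot\varepsilon_{xx} - \dot\varepsilon_{yy} + 2\dot\varepsilon_{zz} \\
3\dot\varepsilon_{xy} \\ 3\dot\varepsilon_{xz} \\ 3\dot\varepsilon_{yz}
\end{pmatrix}
= \underbrace{\frac{\eta}{3}
\begin{pmatrix}
4 & -2 & -2 & 0 & 0 & 0 \\
-2 & 4 & -2 & 0 & 0 & 0 \\
-2 & -2 & 4 & 0 & 0 & 0 \\
0 & 0 & 0 & 3 & 0 & 0 \\
0 & 0 & 0 & 0 & 3 & 0 \\
0 & 0 & 0 & 0 & 0 & 3
\end{pmatrix}}_{C^d}
\cdot
\begin{pmatrix} \dot\varepsilon_{xx} \\ \dot\varepsilon_{yy} \\ \dot\varepsilon_{zz} \\ 2\dot\varepsilon_{xy} \\ 2\dot\varepsilon_{xz} \\ 2\dot\varepsilon_{yz} \end{pmatrix}
= C^d\cdot\dot{\vec\varepsilon} \tag{135}
$$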
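The matrix form can be checked against the tensor definition $\tau = 2\eta\,\dot\varepsilon^d$ on an arbitrary symmetric strain rate tensor. The sketch below takes $C^d$ with the prefactor $\eta/3$ that this definition requires; the helper names are illustrative, and the viscosity value and random tensor are arbitrary.

```python
import numpy as np

eta = 2.5                                      # arbitrary viscosity for the check
C_d = eta / 3.0 * np.array([[ 4, -2, -2, 0, 0, 0],
                            [-2,  4, -2, 0, 0, 0],
                            [-2, -2,  4, 0, 0, 0],
                            [ 0,  0,  0, 3, 0, 0],
                            [ 0,  0,  0, 0, 3, 0],
                            [ 0,  0,  0, 0, 0, 3]], dtype=float)
C = eta * np.diag([2.0, 2.0, 2.0, 1.0, 1.0, 1.0])

def strain_vector(e):
    """(exx, eyy, ezz, 2exy, 2exz, 2eyz) from a symmetric 3x3 tensor."""
    return np.array([e[0, 0], e[1, 1], e[2, 2], 2 * e[0, 1], 2 * e[0, 2], 2 * e[1, 2]])

def stress_vector(t):
    """(txx, tyy, tzz, txy, txz, tyz) from a symmetric 3x3 tensor."""
    return np.array([t[0, 0], t[1, 1], t[2, 2], t[0, 1], t[0, 2], t[1, 2]])

rng = np.random.default_rng(1)
A = rng.standard_normal((3, 3))
eps = 0.5 * (A + A.T)                          # arbitrary (compressible) strain rate
eps_dev = eps - np.trace(eps) / 3.0 * np.eye(3)
tau = 2.0 * eta * eps_dev                      # tensor definition of the deviatoric stress

err_cd = np.abs(C_d @ strain_vector(eps) - stress_vector(tau)).max()
err_c = np.abs(C @ strain_vector(eps_dev) - stress_vector(tau)).max()
```

Both errors are at round-off level: $C^d$ removes the trace itself, while the simpler matrix $C$ gives the same stress only when it is applied to a strain rate that is already trace-free.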

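In the case where we assume incompressible flow from the beginning, i.e. $\dot\varepsilon = \dot\varepsilon^d$, then

$$
\vec\tau = \underbrace{\eta
\begin{pmatrix}
2 & 0 & 0 & 0 & 0 & 0 \\
0 & 2 & 0 & 0 & 0 & 0 \\
0 & 0 & 2 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 0 & 0 & 1
\end{pmatrix}}_{C}
\cdot
\begin{pmatrix} \dot\varepsilon_{xx} \\ \dot\varepsilon_{yy} \\ \dot\varepsilon_{zz} \\ 2\dot\varepsilon_{xy} \\ 2\dot\varepsilon_{xz} \\ 2\dot\varepsilon_{yz} \end{pmatrix}
= C\cdot\dot{\vec\varepsilon} \tag{136}
$$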

On the ’forgotten’ surface terms

5.5.2 Going from 3D to 2D

The world is three-dimensional. However, for many diﬀerent reasons one may wish to solve problems

which are two-dimensional.

Following ASPECT manual, we will think of two-dimensional models in the following way:

•We assume that the domain we want to solve on is a two-dimensional cross section (in the x−y

plane) that extends inﬁnitely far in both negative and positive zdirection.

•We assume that the velocity is zero in the zdirection and that all variables have no variation in

the zdirection.

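As a consequence, two-dimensional models are three-dimensional ones in which the $z$ component of the velocity is zero and so are all $z$ derivatives. This allows us to reduce the momentum conservation equations from 3 equations to 2 equations. However, contrary to what is often seen, the 3D definition of the deviatoric strain rate remains, i.e.:

$$\dot\varepsilon^d = \dot\varepsilon - \frac{1}{3}(\vec\nabla\cdot\vec{v})\,\mathbf{1} \tag{137}$$

with a factor 1/3 and not 1/2. In light of all this, the full strain rate tensor and the deviatoric strain rate tensor in 2D are given by:

$$
\dot\varepsilon =
\begin{pmatrix}
\dot\varepsilon_{xx} & \dot\varepsilon_{xy} & \dot\varepsilon_{xz} \\
\dot\varepsilon_{yx} & \dot\varepsilon_{yy} & \dot\varepsilon_{yz} \\
\dot\varepsilon_{zx} & \dot\varepsilon_{zy} & \dot\varepsilon_{zz}
\end{pmatrix}
=
\begin{pmatrix}
\frac{\partial u}{\partial x} & \frac{1}{2}\left( \frac{\partial u}{\partial y} + \frac{\partial v}{\partial x} \right) & 0 \\
\frac{1}{2}\left( \frac{\partial u}{\partial y} + \frac{\partial v}{\partial x} \right) & \frac{\partial v}{\partial y} & 0 \\
0 & 0 & 0
\end{pmatrix} \tag{138}
$$

$$
\dot\varepsilon^d = \frac{1}{3}
\begin{pmatrix}
2\frac{\partial u}{\partial x} - \frac{\partial v}{\partial y} & \frac{3}{2}\left( \frac{\partial u}{\partial y} + \frac{\partial v}{\partial x} \right) & 0 \\
\frac{3}{2}\left( \frac{\partial u}{\partial y} + \frac{\partial v}{\partial x} \right) & -\frac{\partial u}{\partial x} + 2\frac{\partial v}{\partial y} & 0 \\
0 & 0 & -\frac{\partial u}{\partial x} - \frac{\partial v}{\partial y}
\end{pmatrix} \tag{139}
$$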

Although the bottom right term may be surprising, it is of no consequence when this expression of the

deviatoric strain rate is used in the Stokes equation:



∇ · 2η˙

εd=

FINISH!

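In two dimensions the velocity is then $\vec\nu = (u, v)$ and the FEM building blocks and matrices are simply:

$$
\dot{\vec\varepsilon} =
\begin{pmatrix} \dot\varepsilon_{xx} \\ \dot\varepsilon_{yy} \\ 2\dot\varepsilon_{xy} \end{pmatrix}
=
\begin{pmatrix} \frac{\partial u}{\partial x} \\ \frac{\partial v}{\partial y} \\ \frac{\partial u}{\partial y} + \frac{\partial v}{\partial x} \end{pmatrix}
=
\underbrace{\begin{pmatrix}
\partial_x N_1^\nu & 0 & \partial_x N_2^\nu & 0 & \cdots & \partial_x N_{m_v}^\nu & 0 \\
0 & \partial_y N_1^\nu & 0 & \partial_y N_2^\nu & \cdots & 0 & \partial_y N_{m_v}^\nu \\
\partial_y N_1^\nu & \partial_x N_1^\nu & \partial_y N_2^\nu & \partial_x N_2^\nu & \cdots & \partial_y N_{m_v}^\nu & \partial_x N_{m_v}^\nu
\end{pmatrix}}_{B}
\cdot
\underbrace{\begin{pmatrix} u_1 \\ v_1 \\ u_2 \\ v_2 \\ \vdots \\ u_{m_v} \\ v_{m_v} \end{pmatrix}}_{\vec{V}}
\tag{140}
$$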

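we have

$$\sigma_{xx} = -p + 2\eta\,\dot\varepsilon_{xx} \tag{141}$$

$$\sigma_{yy} = -p + 2\eta\,\dot\varepsilon_{yy} \tag{142}$$

$$\sigma_{xy} = 2\eta\,\dot\varepsilon_{xy} \tag{143}$$

so

$$
\vec\sigma = -\begin{pmatrix} 1 \\ 1 \\ 0 \end{pmatrix} p + C\cdot\dot{\vec\varepsilon}
= -\begin{pmatrix} 1 \\ 1 \\ 0 \end{pmatrix} \vec{N}^p\cdot\vec{P} + C\cdot B\cdot\vec{V} \tag{144}
$$

with

$$
C = \eta \begin{pmatrix} 2 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 1 \end{pmatrix}
\qquad \text{or} \qquad
C = \frac{\eta}{3} \begin{pmatrix} 4 & -2 & 0 \\ -2 & 4 & 0 \\ 0 & 0 & 3 \end{pmatrix} \tag{145}
$$

check the right C

Finally the matrix $N_p$ is of size $3 \times m_p$:

$$
N_p = \begin{pmatrix} 1 \\ 1 \\ 0 \end{pmatrix} \vec{N}^p
= \begin{pmatrix} \vec{N}^p \\ \vec{N}^p \\ 0 \end{pmatrix} \tag{146}
$$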

5.6 Solving the elastic equations

5.7 Solving the heat transport equation

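We start from the 'bare-bones' heat transport equation (source terms are omitted):

$$\rho c_p \left( \frac{\partial T}{\partial t} + \vec\nu\cdot\vec\nabla T \right) = \vec\nabla\cdot k\vec\nabla T \tag{147}$$

In what follows we assume that the velocity field $\vec\nu$ is known so that temperature is the only unknown. Let $N^\theta$ be the temperature basis functions so that the temperature inside an element is given by (see footnote 4):

$$T^h(\vec{r}) = \sum_{i=1}^{m_T} N_i^\theta(\vec{r})\,T_i = \vec{N}^\theta\cdot\vec{T} \tag{148}$$

where $\vec{T}$ is a vector of length $m_T$. The weak form is then

$$\int_\Omega N_i^\theta \rho c_p \left( \frac{\partial T}{\partial t} + \vec\nu\cdot\vec\nabla T \right) d\Omega = \int_\Omega N_i^\theta\,\vec\nabla\cdot k\vec\nabla T\, d\Omega \tag{149}$$

(Footnote 4: the θ superscript has been chosen to denote temperature so as to avoid confusion with the transpose operator.)

$$
\underbrace{\int_\Omega N_i^\theta \rho c_p \frac{\partial T}{\partial t}\, d\Omega}_{I}
+ \underbrace{\int_\Omega N_i^\theta \rho c_p\,\vec\nu\cdot\vec\nabla T\, d\Omega}_{II}
= \underbrace{\int_\Omega N_i^\theta\,\vec\nabla\cdot k\vec\nabla T\, d\Omega}_{III}
\qquad i = 1, \dots, m_T
$$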

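Looking at the first term:

$$\int_\Omega N_i^\theta \rho c_p \frac{\partial T}{\partial t}\, d\Omega = \int_\Omega N_i^\theta \rho c_p\,\vec{N}^\theta\cdot\dot{\vec{T}}\, d\Omega \tag{150}$$

so that when we assemble all contributions for $i = 1, \dots, m_T$ we get:

$$
I = \int_\Omega (\vec{N}^\theta)^T \rho c_p\,\vec{N}^\theta\cdot\dot{\vec{T}}\, d\Omega
= \left( \int_\Omega \rho c_p\,(\vec{N}^\theta)^T \vec{N}^\theta\, d\Omega \right)\cdot\dot{\vec{T}}
= M^\theta\cdot\dot{\vec{T}}
$$

where $M^\theta$ is the mass matrix of the system, of size $(m_T \times m_T)$, with

$$M^\theta_{ij} = \int_\Omega \rho c_p\,N_i^\theta N_j^\theta\, d\Omega$$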

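Turning now to the second term:

$$\int_\Omega N_i^\theta \rho c_p\,\vec\nu\cdot\vec\nabla T\, d\Omega = \int_\Omega N_i^\theta \rho c_p \left( u\frac{\partial T}{\partial x} + v\frac{\partial T}{\partial y} \right) d\Omega \tag{152}$$

$$= \int_\Omega N_i^\theta \rho c_p \left( u\frac{\partial \vec{N}^\theta}{\partial x} + v\frac{\partial \vec{N}^\theta}{\partial y} \right)\cdot\vec{T}\, d\Omega \tag{153}$$

so that when we assemble all contributions for $i = 1, \dots, m_T$ we get:

$$
II = \left( \int_\Omega \rho c_p\,(\vec{N}^\theta)^T \left( u\frac{\partial \vec{N}^\theta}{\partial x} + v\frac{\partial \vec{N}^\theta}{\partial y} \right) d\Omega \right)\cdot\vec{T} = K_a\cdot\vec{T}
$$

where $K_a$ is the advection term matrix of size $(m_T \times m_T)$ with

$$(K_a)_{ij} = \int_\Omega \rho c_p\,N_i^\theta \left( u\frac{\partial N_j^\theta}{\partial x} + v\frac{\partial N_j^\theta}{\partial y} \right) d\Omega$$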

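Now looking at the third term, we carry out an integration by parts and neglect the surface term for now, so that

$$\int_\Omega N_i^\theta\,\vec\nabla\cdot k\vec\nabla T\, d\Omega = -\int_\Omega k\,\vec\nabla N_i^\theta\cdot\vec\nabla T\, d\Omega \tag{155}$$

$$= -\int_\Omega k\,\vec\nabla N_i^\theta\cdot\vec\nabla(\vec{N}^\theta\cdot\vec{T})\, d\Omega \tag{156}$$

with

$$
\vec\nabla\vec{N}^\theta =
\begin{pmatrix}
\partial_x N_1^\theta & \partial_x N_2^\theta & \dots & \partial_x N_{m_T}^\theta \\
\partial_y N_1^\theta & \partial_y N_2^\theta & \dots & \partial_y N_{m_T}^\theta
\end{pmatrix}
$$

so that finally:

$$III = -\left( \int_\Omega k\,(\vec\nabla\vec{N}^\theta)^T\cdot\vec\nabla\vec{N}^\theta\, d\Omega \right)\cdot\vec{T} = -K_d\cdot\vec{T}$$

where $K_d$ is the diffusion term matrix:

$$K_d = \int_\Omega k\,(\vec\nabla\vec{N}^\theta)^T\cdot\vec\nabla\vec{N}^\theta\, d\Omega$$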

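Ultimately terms I, II, III together yield:

$$M^\theta\cdot\dot{\vec{T}} + (K_a + K_d)\cdot\vec{T} = \vec{0}$$

What now remains to be done is to address the time derivative on the temperature vector. The simplest approach is a first-order Euler scheme in which the time derivative is approximated by a backward difference, i.e.:

$$\frac{\partial \vec{T}}{\partial t} = \frac{\vec{T}^{(k)} - \vec{T}^{(k-1)}}{\delta t}$$

where $\vec{T}^{(k)}$ is the temperature field at time step $k$ and $\delta t$ is the time interval between two consecutive time steps. Evaluating the advection and diffusion terms at time step $k$ (which makes the scheme implicit rather than explicit), the discretised heat transport equation is:

$$\left( M^\theta + (K_a + K_d)\,\delta t \right)\cdot\vec{T}^{(k)} = M^\theta\cdot\vec{T}^{(k-1)}$$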
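The time-stepping scheme above can be tried on a small problem. The sketch below is a deliberately reduced setting chosen for illustration: one dimension, linear elements, no advection (K_a = 0), constant ρ c_p = k = 1 and fixed temperatures at both ends. The function names and the way boundary conditions are imposed are assumptions, not the manual's implementation.

```python
import numpy as np

def assemble_1d(n_el, length=1.0, rho_cp=1.0, k=1.0):
    """Mass and diffusion matrices for linear elements on a uniform 1D mesh."""
    n_nodes = n_el + 1
    h = length / n_el
    M = np.zeros((n_nodes, n_nodes))
    Kd = np.zeros((n_nodes, n_nodes))
    m_el = rho_cp * h / 6.0 * np.array([[2.0, 1.0], [1.0, 2.0]])
    k_el = k / h * np.array([[1.0, -1.0], [-1.0, 1.0]])
    for e in range(n_el):
        idx = np.ix_([e, e + 1], [e, e + 1])
        M[idx] += m_el
        Kd[idx] += k_el
    return M, Kd

def step(T_old, M, Kd, dt, T_left, T_right):
    """One step of (M + Kd*dt) . T_new = M . T_old with fixed end temperatures."""
    A = M + Kd * dt
    rhs = M @ T_old
    for node, value in ((0, T_left), (-1, T_right)):
        A[node, :] = 0.0
        A[node, node] = 1.0
        rhs[node] = value
    return np.linalg.solve(A, rhs)

n_el = 10
x = np.linspace(0.0, 1.0, n_el + 1)
M, Kd = assemble_1d(n_el)
T = np.zeros(n_el + 1)
for _ in range(200):                       # march to (numerical) steady state
    T = step(T, M, Kd, dt=0.05, T_left=0.0, T_right=1.0)
```

After enough steps the temperature relaxes to the linear conductive profile T = x, whatever the time step, which is the expected behaviour of an implicit scheme for pure diffusion.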

other time discr schemes!

6 Additional techniques and features

6.1 Picard and Newton

6.2 The SUPG formulation for the energy equation

6.3 Tracking materials and/or interfaces

6.4 Dealing with a free surface

6.5 Convergence criterion for nonlinear iterations

6.6 Static condensation

6.7 The method of manufactured solutions

The method of manufactured solutions is a relatively simple way of carrying out code veriﬁcation. In

essence, one postulates a solution for the PDE at hand (as well as the proper boundary conditions),

inserts it in the PDE and computes the corresponding source term. The same source term and boundary

conditions will then be used in a numerical simulation so that the computed solution can be compared

with the (postulated) true analytical solution.

Examples of this approach are to be found in [42, 25, ?].

python codes/ﬁeldstone

python codes/ﬁeldstone saddlepoint

python codes/ﬁeldstone saddlepoint q2q1

python codes/ﬁeldstone saddlepoint q3q2

python codes/ﬁeldstone burstedde

6.7.1 Analytical benchmark I

Taken from [44]. We consider a two-dimensional problem in the square domain Ω = [0,1]×[0,1], which possesses a closed-form analytical solution. The problem consists of determining the velocity field $\vec\nu = (u, v)$ and the pressure $p$ such that

$$
\begin{aligned}
-\eta\Delta\vec\nu + \vec\nabla p &= \vec{b} && \text{in } \Omega \\
\vec\nabla\cdot\vec\nu &= 0 && \text{in } \Omega \\
\vec\nu &= \vec{0} && \text{on } \Gamma
\end{aligned}
$$

where the fluid viscosity is taken as $\eta = 1$. The components of the body force $\vec{b}$ are prescribed as

$$
\begin{aligned}
b_x &= (12 - 24y)x^4 + (-24 + 48y)x^3 + (-48y + 72y^2 - 48y^3 + 12)x^2 \\
&\quad + (-2 + 24y - 72y^2 + 48y^3)x + 1 - 4y + 12y^2 - 8y^3 \\
b_y &= (8 - 48y + 48y^2)x^3 + (-12 + 72y - 72y^2)x^2 \\
&\quad + (4 - 24y + 48y^2 - 48y^3 + 24y^4)x - 12y^2 + 24y^3 - 12y^4
\end{aligned}
$$

With this prescribed body force, the exact solution is

$$
\begin{aligned}
u(x,y) &= x^2(1-x)^2(2y - 6y^2 + 4y^3) \\
v(x,y) &= -y^2(1-y)^2(2x - 6x^2 + 4x^3) \\
p(x,y) &= x(1-x) - 1/6
\end{aligned}
$$

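Note that the pressure obeys $\int_\Omega p\, d\Omega = 0$. One can turn to the spatial derivatives of the fields:

$$\frac{\partial u}{\partial x} = (2x - 6x^2 + 4x^3)(2y - 6y^2 + 4y^3) \tag{158}$$

$$\frac{\partial v}{\partial y} = -(2x - 6x^2 + 4x^3)(2y - 6y^2 + 4y^3) \tag{159}$$

with of course $\vec\nabla\cdot\vec\nu = 0$ and

$$\frac{\partial p}{\partial x} = 1 - 2x \tag{160}$$

$$\frac{\partial p}{\partial y} = 0 \tag{161}$$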
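A manufactured solution is only useful if the body force really matches it, so it is worth checking before running any FE code. The sketch below evaluates the residual of the momentum equation at a few points, using the sign convention $-\eta\Delta\vec\nu + \vec\nabla p = \vec{b}$ under which the listed body force matches the listed solution. The Laplacians are written out by hand from $u = a(x)a'(y)$ and $v = -a'(x)a(y)$ with $a(x) = x^2(1-x)^2$; all function names are illustrative.

```python
def a(x):  return x**2 * (1 - x)**2          # a(x) = x^2 (1-x)^2
def a1(x): return 2*x - 6*x**2 + 4*x**3      # first derivative of a
def a2(x): return 2 - 12*x + 12*x**2         # second derivative of a
def a3(x): return -12 + 24*x                 # third derivative of a

def velocity(x, y):
    return a(x) * a1(y), -a1(x) * a(y)

def laplacian_velocity(x, y):
    return a2(x) * a1(y) + a(x) * a3(y), -a3(x) * a(y) - a1(x) * a2(y)

def grad_p(x, y):
    return 1 - 2*x, 0.0

def body_force(x, y):
    bx = ((12 - 24*y)*x**4 + (-24 + 48*y)*x**3
          + (-48*y + 72*y**2 - 48*y**3 + 12)*x**2
          + (-2 + 24*y - 72*y**2 + 48*y**3)*x + 1 - 4*y + 12*y**2 - 8*y**3)
    by = ((8 - 48*y + 48*y**2)*x**3 + (-12 + 72*y - 72*y**2)*x**2
          + (4 - 24*y + 48*y**2 - 48*y**3 + 24*y**4)*x
          - 12*y**2 + 24*y**3 - 12*y**4)
    return bx, by

def residual(x, y, eta=1.0):
    """Components of -eta*Laplacian(v) + grad(p) - b, which should vanish."""
    lu, lv = laplacian_velocity(x, y)
    px, py = grad_p(x, y)
    bx, by = body_force(x, y)
    return -eta * lu + px - bx, -eta * lv + py - by

points = [(0.1, 0.2), (0.5, 0.5), (0.9, 0.3), (0.25, 0.75)]
max_residual = max(abs(c) for x, y in points for c in residual(x, y))
```

The same pattern (analytical fields, analytical derivatives, residual at sample points) can be reused for the other benchmarks of this section.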

The velocity and pressure ﬁelds look like:

http://ww2.lacan.upc.edu/huerta/exercises/Incompressible/Incompressible Ex1.htm

As shown in [44], if the LBB condition is not satisfied, spurious oscillations spoil the pressure approximation. The figures below show results obtained with a mesh of 20x20 Q1P0 (left) and P1P1 (right) elements:


http://ww2.lacan.upc.edu/huerta/exercises/Incompressible/Incompressible Ex1.htm

Since the proposed problem has an analytical solution, it is easy to analyse the convergence of the different pairs of elements:

http://ww2.lacan.upc.edu/huerta/exercises/Incompressible/Incompressible Ex1.htm

6.7.2 Analytical benchmark II

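Taken from [41]. It is for a unit square with $\nu = \mu/\rho = 1$ and the smooth exact solution is

$$u(x,y) = x + x^2 - 2xy + x^3 - 3xy^2 + x^2 y \tag{162}$$

$$v(x,y) = -y - 2xy + y^2 - 3x^2 y + y^3 - xy^2 \tag{163}$$

$$p(x,y) = xy + x + y + x^3 y^2 - 4/3 \tag{164}$$

Note that the pressure obeys $\int_\Omega p\, d\Omega = 0$. The body force components are

$$b_x = -(1 + y - 3x^2 y^2) \tag{165}$$

$$b_y = -(1 - 3x - 2x^3 y) \tag{166}$$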

6.7.3 Analytical benchmark III

This benchmark begins by postulating a polynomial solution to the 3D Stokes equation [41]:

$$
\vec{v} = \begin{pmatrix}
x + x^2 + xy + x^3 y \\
y + xy + y^2 + x^2 y^2 \\
-2z - 3xz - 3yz - 5x^2 yz
\end{pmatrix} \tag{167}
$$

and

$$p = xyz + x^3 y^3 z - 5/32 \tag{168}$$

While it is then trivial to verify that this velocity field is divergence-free, the corresponding body force of the Stokes equation can be computed by inserting this solution into the momentum equation with a given viscosity $\mu$ (constant or position/velocity/strain rate dependent). The domain is a unit cube and velocity boundary conditions simply use Eq. (229). Note that the pressure fulfills

$$\int_\Omega p(\vec{r})\, d\Omega = 0.$$

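Constant viscosity. In this case, the right hand side writes:

$$
\vec{f} = -\vec\nabla p + \mu \begin{pmatrix} 2 + 6xy \\ 2 + 2x^2 + 2y^2 \\ -10yz \end{pmatrix}
= -\begin{pmatrix} yz + 3x^2 y^3 z \\ xz + 3x^3 y^2 z \\ xy + x^3 y^3 \end{pmatrix}
+ \mu \begin{pmatrix} 2 + 6xy \\ 2 + 2x^2 + 2y^2 \\ -10yz \end{pmatrix}
$$

We can compute the components of the strain rate tensor:

$$\dot\varepsilon_{xx} = 1 + 2x + y + 3x^2 y \tag{169}$$

$$\dot\varepsilon_{yy} = 1 + x + 2y + 2x^2 y \tag{170}$$

$$\dot\varepsilon_{zz} = -2 - 3x - 3y - 5x^2 y \tag{171}$$

$$\dot\varepsilon_{xy} = \frac{1}{2}(x + y + 2xy^2 + x^3) \tag{172}$$

$$\dot\varepsilon_{xz} = \frac{1}{2}(-3z - 10xyz) \tag{173}$$

$$\dot\varepsilon_{yz} = \frac{1}{2}(-3z - 5x^2 z) \tag{174}$$

Note that we of course have $\dot\varepsilon_{xx} + \dot\varepsilon_{yy} + \dot\varepsilon_{zz} = 0$.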

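Variable viscosity. In this case, the right hand side is obtained through

$$
\vec{f} = -\vec\nabla p + \mu \begin{pmatrix} 2 + 6xy \\ 2 + 2x^2 + 2y^2 \\ -10yz \end{pmatrix}
+ \begin{pmatrix} 2\dot\varepsilon_{xx} \\ 2\dot\varepsilon_{xy} \\ 2\dot\varepsilon_{xz} \end{pmatrix} \frac{\partial\mu}{\partial x}
+ \begin{pmatrix} 2\dot\varepsilon_{xy} \\ 2\dot\varepsilon_{yy} \\ 2\dot\varepsilon_{yz} \end{pmatrix} \frac{\partial\mu}{\partial y}
+ \begin{pmatrix} 2\dot\varepsilon_{xz} \\ 2\dot\varepsilon_{yz} \\ 2\dot\varepsilon_{zz} \end{pmatrix} \frac{\partial\mu}{\partial z} \tag{175}
$$

The viscosity can be chosen to be a smoothly varying function:

$$\mu = \exp\left( 1 - \beta\,(x(1-x) + y(1-y) + z(1-z)) \right) \tag{176}$$

Choosing $\beta = 0$ yields a constant viscosity $\mu = e^1$ (and greatly simplifies the right-hand side). Since $x(1-x) + y(1-y) + z(1-z)$ ranges from 0 (at the corners) to 3/4 (at the centre of the cube), one can easily show that the ratio $\mu^\star$ of the largest to the smallest viscosity in the system follows $\mu^\star = \exp(3\beta/4)$, so that choosing $\beta = 10$ yields $\mu^\star \simeq 1808$ and $\beta = 20$ yields $\mu^\star \simeq 3.269 \times 10^6$. In this case

$$\frac{\partial\mu}{\partial x} = -\beta(1 - 2x)\,\mu(x,y,z) \tag{177}$$

$$\frac{\partial\mu}{\partial y} = -\beta(1 - 2y)\,\mu(x,y,z) \tag{178}$$

$$\frac{\partial\mu}{\partial z} = -\beta(1 - 2z)\,\mu(x,y,z) \tag{179}$$

[25] has carried out this benchmark for $\beta = 4$, i.e.:

$$\mu(x,y,z) = \exp\left( 1 - 4\,(x(1-x) + y(1-y) + z(1-z)) \right)$$

In a unit cube, this yields a variable viscosity such that $0.1353 < \mu < 2.7182$, i.e. a ratio of approx. 20 within the domain. We then have:

$$\frac{\partial\mu}{\partial x} = -4(1 - 2x)\,\mu(x,y,z) \tag{180}$$

$$\frac{\partial\mu}{\partial y} = -4(1 - 2y)\,\mu(x,y,z) \tag{181}$$

$$\frac{\partial\mu}{\partial z} = -4(1 - 2z)\,\mu(x,y,z) \tag{182}$$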
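The numbers quoted for this viscosity field are easy to reproduce. The sketch below evaluates the viscosity law of Eq. (176), its largest-to-smallest ratio over the unit cube, and compares the derivative obtained from the chain rule, $-\beta(1-2x)\mu$, with a centred finite difference. The function names are illustrative only.

```python
import math

def mu(x, y, z, beta):
    """Viscosity law of Eq. (176)."""
    return math.exp(1.0 - beta * (x*(1 - x) + y*(1 - y) + z*(1 - z)))

def dmu_dx(x, y, z, beta):
    """Chain rule applied to Eq. (176)."""
    return -beta * (1.0 - 2.0*x) * mu(x, y, z, beta)

def viscosity_ratio(beta):
    """Largest (cube corner) over smallest (cube centre) viscosity."""
    return mu(0.0, 0.0, 0.0, beta) / mu(0.5, 0.5, 0.5, beta)

ratio_10 = viscosity_ratio(10.0)
ratio_20 = viscosity_ratio(20.0)
mu_min_4 = mu(0.5, 0.5, 0.5, 4.0)
mu_max_4 = mu(0.0, 0.0, 0.0, 4.0)

# Centred finite difference as an independent check of the derivative.
h = 1.0e-6
point = (0.3, 0.6, 0.8)
fd = (mu(point[0] + h, point[1], point[2], 4.0)
      - mu(point[0] - h, point[1], point[2], 4.0)) / (2.0 * h)
derivative_error = abs(fd - dmu_dx(*point, 4.0))
```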

sort out mess wrt Eq 26 of busa13

6.7.4 Analytical benchmark IV

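From [12]. The two-dimensional domain is a unit square. The body forces are:

$$
\begin{aligned}
f_x &= 128\left[ x^2(x-1)^2\,12(2y-1) + 2(y-1)(2y-1)y\,(12x^2 - 12x + 2) \right] \\
f_y &= -128\left[ y^2(y-1)^2\,12(2x-1) + 2(x-1)(2x-1)x\,(12y^2 - 12y + 2) \right]
\end{aligned}
\tag{183}
$$

The solution is

$$
\begin{aligned}
u &= -256\,x^2(x-1)^2\,y(y-1)(2y-1) \\
v &= 256\,y^2(y-1)^2\,x(x-1)(2x-1) \\
p &= 0
\end{aligned}
\tag{184}
$$

Another choice:

$$
\begin{aligned}
f_x &= 128\left[ x^2(x-1)^2\,12(2y-1) + 2(y-1)(2y-1)y\,(12x^2 - 12x + 2) \right] + y - 1/2 \\
f_y &= -128\left[ y^2(y-1)^2\,12(2x-1) + 2(x-1)(2x-1)x\,(12y^2 - 12y + 2) \right] + x - 1/2
\end{aligned}
\tag{185}
$$

The solution is

$$
\begin{aligned}
u &= -256\,x^2(x-1)^2\,y(y-1)(2y-1) \\
v &= 256\,y^2(y-1)^2\,x(x-1)(2x-1) \\
p &= (x - 1/2)(y - 1/2)
\end{aligned}
\tag{186}
$$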

6.8 Assigning values to quadrature points

As we have seen in Section ??, the building of the elemental matrix and rhs requires (at least) to assign

a density and viscosity value to each quadrature point inside the element. Depending on the type of

modelling, this task can prove more complex than one might expect and have large consequences on the

solution accuracy.

Here are several options:

•The simplest way (which is often used for benchmarks) consists in computing the 'real' coordinates (xq, yq, zq) of a given quadrature point based on its reduced coordinates (rq, sq, tq), and passing these coordinates to a function which returns density and/or viscosity at this location. For instance, for the Stokes sphere:

    def rho(x, y):
        # density: 2 inside the sphere of radius 0.123 centred at (0.5, 0.5), 1 outside
        if (x - 0.5)**2 + (y - 0.5)**2 < 0.123**2:
            val = 2.
        else:
            val = 1.
        return val

    def mu(x, y):
        # viscosity: 100 inside the same sphere, 1 outside
        if (x - 0.5)**2 + (y - 0.5)**2 < 0.123**2:
            val = 1.e2
        else:
            val = 1.
        return val

This is very simple, but it has been shown to be potentially problematic. In essence, it can introduce very large contrasts inside a single element and perturb the quadrature. Please read section 3.3 of [65] and/or have a look at the section titled "Averaging material properties" in the ASPECT manual.

•Another similar approach consists in assigning a density and viscosity value to the nodes of the FE mesh first, and then using these nodal values to assign values to the quadrature points. Very often, and quite logically, the shape functions are used to this effect. Indeed we have seen before that for any point (r, s, t) inside an element we have

$$f^h(r,s,t) = \sum_i f_i\,N_i(r,s,t)$$

where the $f_i$ are the nodal values and the $N_i$ the corresponding basis functions.

In the case of linear elements (Q1 basis functions), this is straightforward. In fact, the basis functions $N_i$ can be seen as moving weights: the closer the point is to a node, the higher the weight (basis function value).

However, this is quite another story for quadratic elements (Q2 basis functions). In order to illustrate the problem, let us consider a 1D problem. The basis functions are

$$N_1(r) = \frac{1}{2}r(r-1) \qquad N_2(r) = 1 - r^2 \qquad N_3(r) = \frac{1}{2}r(r+1)$$

Let us further assign: $\rho_1 = \rho_2 = 0$ and $\rho_3 = 1$. Then

$$\rho^h(r) = \sum_i \rho_i N_i(r) = N_3(r)$$

There lies the core of the problem: the $N_3(r)$ basis function is negative for $r \in\ ]-1, 0[$. This means that a quadrature point in this interval will be assigned a negative density, which is nonsensical and numerically problematic!
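The effect is easy to reproduce. The sketch below evaluates the interpolated density at the points of a 3-point Gauss rule (an assumption; any rule with a point inside ]-1,0[ shows the same behaviour) and compares it with a piecewise-linear interpolation over the two half-elements, which is one possible reading of the '2X Q1' remark and is offered only as an illustration. All names are hypothetical.

```python
import math

def q2_basis_1d(r):
    """N1, N2, N3 attached to the nodes r = -1, 0, +1."""
    return 0.5 * r * (r - 1.0), 1.0 - r * r, 0.5 * r * (r + 1.0)

def interpolate_q2(nodal, r):
    return sum(n * f for n, f in zip(q2_basis_1d(r), nodal))

def interpolate_two_q1(nodal, r):
    """Piecewise-linear interpolation on [-1,0] and [0,1] from the same 3 values."""
    left, mid, right = nodal
    if r <= 0.0:
        return left * (-r) + mid * (1.0 + r)
    return mid * (1.0 - r) + right * r

rho_nodes = (0.0, 0.0, 1.0)                              # rho1 = rho2 = 0, rho3 = 1
gauss_points = (-math.sqrt(3.0 / 5.0), 0.0, math.sqrt(3.0 / 5.0))

rho_q2 = [interpolate_q2(rho_nodes, r) for r in gauss_points]
rho_lin = [interpolate_two_q1(rho_nodes, r) for r in gauss_points]
```

With the quadratic functions the first quadrature point receives a density of about -0.087, whereas the piecewise-linear variant stays within the range of the nodal values.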

use 2X Q1. write about it !

The above methods work fine as long as the domain contains a single material. As soon as there are multiple fluids in the domain, a special technique is needed to track either the fluids themselves or their interfaces. Let us start with markers. We are then confronted with the infernal trio (a ménage à trois?) which is present for each element, composed of its nodes, its markers and its quadrature points.

Each marker carries the material information (density and viscosity). This information must ultimately be projected onto the quadrature points. Two main options are possible: either an algorithm projects the marker-based fields onto the quadrature points directly, or the marker fields are first projected onto the FE nodes and then onto the quadrature points using the techniques above.

————————–

At a given time, every element $e$ contains $n_e$ markers. During the FE matrix building process, viscosity and density values are needed at the quadrature points. One therefore needs to project the values carried by the markers onto these locations. Several approaches are currently in use in the community and the topic has been investigated by [40] and [45] for instance.

elefant adopts a simple approach: viscosity and density are considered to be elemental values, i.e. all the markers within a given element contribute to assign a unique constant density and viscosity value to the element by means of an averaging scheme.

While it is common in the literature to treat the so-called arithmetic, geometric and harmonic means as separate averagings, I hereby wish to introduce the notion of generalised mean, which is a family of functions for aggregating sets of numbers that includes as special cases the arithmetic, geometric and harmonic means.

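If $p$ is a non-zero real number, we can define the generalised mean (or power mean) with exponent $p$ of the positive real numbers $a_1, \dots, a_n$ as:

$$M_p(a_1, \dots, a_n) = \left( \frac{1}{n} \sum_{i=1}^{n} a_i^p \right)^{1/p} \tag{187}$$

and it is trivial to verify that we then have the special cases:

$$M_{-\infty} = \lim_{p\to-\infty} M_p = \min(a_1, \dots, a_n) \quad \text{(minimum)} \tag{188}$$

$$M_{-1} = \frac{n}{\frac{1}{a_1} + \frac{1}{a_2} + \dots + \frac{1}{a_n}} \quad \text{(harm. avrg.)} \tag{189}$$

$$M_0 = \lim_{p\to 0} M_p = \left( \prod_{i=1}^{n} a_i \right)^{1/n} \quad \text{(geom. avrg.)} \tag{190}$$

$$M_{+1} = \frac{1}{n} \sum_{i=1}^{n} a_i \quad \text{(arithm. avrg.)} \tag{191}$$

$$M_{+2} = \sqrt{\frac{1}{n} \sum_{i=1}^{n} a_i^2} \quad \text{(root mean square)} \tag{192}$$

$$M_{+\infty} = \lim_{p\to+\infty} M_p = \max(a_1, \dots, a_n) \quad \text{(maximum)} \tag{193}$$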

Note that the proofs of the limit convergence are given in [23].

An interesting property of the generalised mean is as follows: for two real values $p$ and $q$, if $p < q$ then $M_p \le M_q$. This property has for instance been illustrated in Fig. 20 of [117].

One can then for instance look at the generalised mean of a randomly generated set of 1000 viscosity values between $10^{18}$ Pa.s and $10^{23}$ Pa.s for $-5 \le p \le 5$. Results are shown in the figure hereunder and the arithmetic, geometric and harmonic values are indicated too. The function $M_p$ assumes an arctangent-like shape: very low values of $p$ will ultimately yield the minimum viscosity in the array while very high values will yield its maximum. In between, the transition is smooth and occurs essentially for $|p| \le 5$.

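[Figure: generalised mean M(p) as a function of p between -5 and 5 (logarithmic vertical axis from 1e+18 to 1e+23), with the geometric, arithmetic and harmonic averages indicated.]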
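The special cases listed above can be checked numerically. The following is a minimal sketch, not the fieldstone code itself: the function name `generalised_mean` and the rescaling by the largest value (used only to avoid overflow with viscosity-like numbers) are assumptions made for this illustration.

```python
import numpy as np

def generalised_mean(a, p):
    """Power mean M_p of the positive numbers in a (hypothetical helper)."""
    a = np.asarray(a, dtype=np.float64)
    if p == 0:
        # limit p -> 0: geometric mean, computed through logarithms
        return np.exp(np.mean(np.log(a)))
    if np.isinf(p):
        # limits p -> +inf and p -> -inf: maximum and minimum
        return a.max() if p > 0 else a.min()
    # factor out the largest value so that (a/scale)**p stays representable
    scale = a.max()
    return scale * np.mean((a / scale) ** p) ** (1.0 / p)

a = np.array([1.0, 2.0, 4.0])
harmonic = generalised_mean(a, -1)    # 3/(1 + 1/2 + 1/4)
geometric = generalised_mean(a, 0)    # (1*2*4)**(1/3)
arithmetic = generalised_mean(a, 1)   # (1+2+4)/3
means = [generalised_mean(a, p) for p in (-5, -1, 0, 1, 2, 5)]
```

The list `means` is non-decreasing, in line with the property $M_p \le M_q$ for $p < q$.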

python codes/ﬁeldstone markers avrg

6.9 Matrix (Sparse) storage

The FE matrix is the result of the assembly process of all elemental matrices. Its size can become quite

large when the resolution is being increased (from thousands of lines/columns to tens of millions).

One important property of the matrix is its sparsity. Typically less than 1% of the matrix terms are non-zero, which means that the matrix storage can and should be optimised. Clever storage formats were designed early on, since the amount of RAM in computers was the limiting factor 3 or 4 decades ago [112].

There are several standard formats:

•compressed sparse row (CSR) format

•compressed sparse column format (CSC)

•the Coordinate Format (COO)

•Skyline Storage Format

•...

I focus on the CSR format in what follows.

6.9.1 2D domain - One degree of freedom per node

Let us consider again the 3 ×2 element grid which counts 12 nodes.

    8=======9======10======11
    |       |       |       |
    |  (3)  |  (4)  |  (5)  |
    |       |       |       |
    4=======5=======6=======7
    |       |       |       |
    |  (0)  |  (1)  |  (2)  |
    |       |       |       |
    0=======1=======2=======3

In the case there is only a single degree of freedom per node, the assembled FEM matrix will look

like this:







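    X X . . X X . . . . . .
    X X X . X X X . . . . .
    . X X X . X X X . . . .
    . . X X . . X X . . . .
    X X . . X X . . X X . .
    X X X . X X X . X X X .
    . X X X . X X X . X X X
    . . X X . . X X . . X X
    . . . . X X . . X X . .
    . . . . X X X . X X X .
    . . . . . X X X . X X X
    . . . . . . X X . . X X

where X stands for a non-zero term and a dot for a zero (row and column indices run from 0 to 11). This matrix structure stems from the fact that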

•node 0 sees nodes 0,1,4,5

•node 1 sees nodes 0,1,2,4,5,6

•node 2 sees nodes 1,2,3,5,6,7

•...

•node 5 sees nodes 0,1,2,4,5,6,8,9,10

•...

•node 10 sees nodes 5,6,7,9,10,11

•node 11 sees nodes 6,7,10,11

In light thereof, we have

•4 corner nodes which have 4 neighbours (counting themselves)

•2(nnx-2) nodes which have 6 neighbours

•2(nny-2) nodes which have 6 neighbours

•(nnx-2)×(nny-2) nodes which have 9 neighbours

In total, the number of non-zero terms in the matrix is then:

$$NZ = 4 \times 4 + 4 \times 6 + 2 \times 6 + 2 \times 9 = 70$$

In general, we would then have:

$$NZ = 4 \times 4 + [2(nnx-2) + 2(nny-2)] \times 6 + (nnx-2)(nny-2) \times 9$$

Let us temporarily assume $nnx = nny = n$. Then the matrix size (total number of unknowns) is $N = n^2$ and

$$NZ = 16 + 24(n-2) + 9(n-2)^2$$

A full matrix array would contain $N^2 = n^4$ terms. The ratio of $NZ$ (the actual number of reals to store) to the full matrix size (the number of reals a full matrix contains) is then

$$R = \frac{16 + 24(n-2) + 9(n-2)^2}{n^4}$$

It is then obvious that when $n$ is large enough $R \sim 1/n^2$.

CSR stores the nonzeros of the matrix row by row, in a single indexed array A of double precision numbers. Another array COLIND contains the column index of each corresponding entry in the A array. A third integer array RWPTR contains pointers to the beginning of each row, with an additional pointer to the first index following the nonzeros of the matrix A. A and COLIND have length NZ and RWPTR has length N+1.

In the case of the here-above matrix, the arrays COLIND and RWPTR will look like:

COLIND = (0,1,4,5,0,1,2,4,5,6,1,2,3,5,6,7, ..., 6,7,10,11)

RWPTR = (0,4,10,16, ...)
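The COLIND and RWPTR arrays quoted above can be reproduced with a few lines of code. This is only a sketch for a regular grid with row-by-row node numbering and one degree of freedom per node; the function name `csr_pattern` is made up for this example.

```python
import numpy as np

def csr_pattern(nnx, nny):
    """COLIND and RWPTR of the FE matrix on an nnx x nny node grid (1 dof per node)."""
    colind = []
    rwptr = [0]
    for j in range(nny):
        for i in range(nnx):
            # a node sees itself and its neighbours in the surrounding 3x3 patch
            for jj in range(max(j - 1, 0), min(j + 1, nny - 1) + 1):
                for ii in range(max(i - 1, 0), min(i + 1, nnx - 1) + 1):
                    colind.append(ii + jj * nnx)
            rwptr.append(len(colind))   # start of the next row
    return np.array(colind), np.array(rwptr)

colind, rwptr = csr_pattern(4, 3)       # the 3x2 element grid
NZ = len(colind)
```

For the 3x2 element grid this gives NZ=70, and RWPTR has N+1=13 entries whose last value is NZ.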

6.9.2 2D domain - Two degrees of freedom per node

When there are two degrees of freedom per node, such as in the case of the Stokes equations in two dimensions, the size of the K matrix is given by

NfemV = nnp * ndofV

In the case of the small grid above, we have NfemV=24. Elemental matrices are now 8×8 in size.

We still have

•4 corner nodes which have 4 neighbours,

•2(nnx-2) nodes which have 6 neighbours,

•2(nny-2) nodes which have 6 neighbours,

•(nnx-2)×(nny-2) nodes which have 9 neighbours,

but now each degree of freedom of a node sees both degrees of freedom of each of its neighbouring nodes. The number of nonzeros is therefore multiplied by four and the assembled FEM matrix looks like:







[Matrix sparsity pattern: the 24×24 matrix has the same structure as the pattern of Section 6.9.1, each non-zero term X being replaced by a 2×2 block of non-zero terms.]







Note that the degrees of freedom are organised as follows:

$$(u_0, v_0, u_1, v_1, u_2, v_2, \dots, u_{11}, v_{11})$$

In general, we would then have:

$$NZ = 4 \left[ 4 \times 4 + [2(nnx-2) + 2(nny-2)] \times 6 + (nnx-2)(nny-2) \times 9 \right]$$

and in the case of the small grid, the number of non-zero terms in the matrix is then:

$$NZ = 4 \left[ 4 \times 4 + 4 \times 6 + 2 \times 6 + 2 \times 9 \right] = 280$$

In the case of the here-above matrix, the arrays COLIND and RWPTR will look like:

COLIND = (0,1,2,3,8,9,10,11,0,1,2,3,8,9,10,11, ...)

RWPTR = (0,8,16,28, ...)

6.9.3 in ﬁeldstone

The majority of the codes store the FE matrix as a full array

    a_mat = np.zeros((Nfem,Nfem),dtype=np.float64)

and it is converted to CSR format on the fly in the solve phase:

    sol = sps.linalg.spsolve(sps.csr_matrix(a_mat),rhs)

Note that linked-list storage can be used instead (lil_matrix). This yields substantial memory savings but much longer compute times.
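As an illustration of the linked-list alternative, here is a minimal sketch which fills a `scipy.sparse.lil_matrix` entry by entry and converts it to CSR before the solve. The small tridiagonal system is an arbitrary stand-in for a real FE matrix, chosen only because its solution is known.

```python
import numpy as np
import scipy.sparse as sps
import scipy.sparse.linalg

Nfem = 5
a_mat = sps.lil_matrix((Nfem, Nfem), dtype=np.float64)  # only nonzeros are stored
rhs = np.ones(Nfem, dtype=np.float64)

# stand-in for the assembly loop: tridiagonal matrix (-1, 2, -1)
for i in range(Nfem):
    a_mat[i, i] = 2.0
    if i > 0:
        a_mat[i, i - 1] = -1.0
    if i < Nfem - 1:
        a_mat[i, i + 1] = -1.0

# convert to CSR once, right before the solve
sol = sps.linalg.spsolve(a_mat.tocsr(), rhs)
```

The conversion with `tocsr()` is done once after assembly, since CSR is efficient for solving but inefficient for inserting entries one by one.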

6.10 Mesh generation

Before basis functions can be defined and PDEs can be discretised and solved we must first tessellate the domain with polygons, e.g. triangles and quadrilaterals in 2D, tetrahedra, prisms and hexahedra in 3D.

When the domain is itself simple (e.g. a rectangle, a sphere, ...) the mesh (or grid) can be (more or less) easily produced and the connectivity array filled with straightforward algorithms [131]. However, real life applications can involve extremely complex geometries (e.g. a bridge, a human spine, a car chassis and body, etc ...) and dedicated algorithms/software must be used (see [134, 53, 143]).

We usually distinguish between two broad classes of grids: structured grids (with a regular connec-

tivity) and unstructured grids (with an irregular connectivity).

On the following figure are shown various triangle- and tetrahedron-based meshes. These are obviously better suited for simulations of complex geometries:

add more examples coming from geo

Let us now focus on the case of a rectangular computational domain of size Lx×Ly with a regular mesh composed of nelx×nely=nel quadrilaterals. There are then nnx×nny=nnp grid points. The elements are of size hx×hy with hx=Lx/nelx and hy=Ly/nely.

We have no reason to come up with an irregular/illogical node numbering so we can number nodes row by row or column by column as shown on the example hereunder of a 3×2 grid:

    8=======9======10======11      2=======5=======8======11
    |       |       |       |      |       |       |       |
    |  (3)  |  (4)  |  (5)  |      |  (1)  |  (3)  |  (5)  |
    |       |       |       |      |       |       |       |
    4=======5=======6=======7      1=======4=======7======10
    |       |       |       |      |       |       |       |
    |  (0)  |  (1)  |  (2)  |      |  (0)  |  (2)  |  (4)  |
    |       |       |       |      |       |       |       |
    0=======1=======2=======3      0=======3=======6=======9
          "row by row"                 "column by column"

The numbering of the elements themselves could be done in a somewhat chaotic way but we follow the numbering of the nodes for simplicity. The row by row option is the one adopted in fieldstone and the coordinates of the points are computed as follows:

    x = np.empty(nnp, dtype=np.float64)   # x coordinates of the nodes
    y = np.empty(nnp, dtype=np.float64)   # y coordinates of the nodes
    counter = 0
    for j in range(0, nny):
        for i in range(0, nnx):
            x[counter] = i*hx
            y[counter] = j*hy
            counter += 1

The inner loop has i ranging from 0 to nnx-1 first for j=0, then for j=1, ... up to nny-1, which indeed corresponds to the row by row numbering.

We now turn to the connectivity. As mentioned before, this is a structured mesh so that the so-called

connectivity array, named icon in our case, can be ﬁlled easily. For each element we need to store the

node identities of its vertices. Since there are nel elements and m=4 corners, this is a m×nel array. The

algorithm goes as follows:

    # np.int32 rather than np.int16: node numbers above 32767 would overflow int16
    icon = np.zeros((m, nel), dtype=np.int32)
    counter = 0
    for j in range(0, nely):
        for i in range(0, nelx):
            icon[0, counter] = i + j*nnx            # bottom left
            icon[1, counter] = i + 1 + j*nnx        # bottom right
            icon[2, counter] = i + 1 + (j+1)*nnx    # top right
            icon[3, counter] = i + (j+1)*nnx        # top left
            counter += 1

In the case of the 3×2 mesh, the icon is ﬁlled as follows:

    element id →   0   1   2   3   4   5
    node id ↓
        0          0   1   2   4   5   6
        1          1   2   3   5   6   7
        2          5   6   7   9  10  11
        3          4   5   6   8   9  10

It is to be understood as follows: element #4 is composed of nodes 5, 6, 10 and 9. Note that nodes are

always stored in a counter clockwise manner, starting at the bottom left. This is very important since

the corresponding basis functions and their derivatives will be labelled accordingly.
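The two listings above can be gathered into a single self-contained function and checked against the table for the 3×2 mesh. This is a sketch only; the function name `build_mesh` and its signature are assumptions for this example, not part of fieldstone.

```python
import numpy as np

def build_mesh(Lx, Ly, nelx, nely):
    """Coordinates and connectivity of a regular Q1 mesh, row by row numbering."""
    nnx, nny = nelx + 1, nely + 1
    hx, hy = Lx / nelx, Ly / nely
    x = np.empty(nnx * nny, dtype=np.float64)
    y = np.empty(nnx * nny, dtype=np.float64)
    counter = 0
    for j in range(nny):
        for i in range(nnx):
            x[counter] = i * hx
            y[counter] = j * hy
            counter += 1
    icon = np.zeros((4, nelx * nely), dtype=np.int32)
    counter = 0
    for j in range(nely):
        for i in range(nelx):
            # counter clockwise, starting at the bottom left corner
            icon[0, counter] = i + j * nnx
            icon[1, counter] = i + 1 + j * nnx
            icon[2, counter] = i + 1 + (j + 1) * nnx
            icon[3, counter] = i + (j + 1) * nnx
            counter += 1
    return x, y, icon

x, y, icon = build_mesh(1.0, 1.0, 3, 2)
nodes_of_element_4 = icon[:, 4]   # nodes 5, 6, 10 and 9
```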

In three dimensions things are very similar. The mesh now counts nelx×nely×nelz=nel elements

which represent a cuboid of size Lx×Ly×Lz. The position of the nodes is obtained as follows:

    x = np.empty(nnp, dtype=np.float64)   # x coordinates of the nodes
    y = np.empty(nnp, dtype=np.float64)   # y coordinates of the nodes
    z = np.empty(nnp, dtype=np.float64)   # z coordinates of the nodes
    counter = 0
    for i in range(0, nnx):
        for j in range(0, nny):
            for k in range(0, nnz):
                x[counter] = i*hx
                y[counter] = j*hy
                z[counter] = k*hz
                counter += 1

The connectivity array is now of size m×nel with m=8:

    # np.int32 rather than np.int16: node numbers above 32767 would overflow int16
    icon = np.zeros((m, nel), dtype=np.int32)
    counter = 0
    for i in range(0, nelx):
        for j in range(0, nely):
            for k in range(0, nelz):
                icon[0, counter] = nny*nnz*(i)   + nnz*(j)   + k
                icon[1, counter] = nny*nnz*(i+1) + nnz*(j)   + k
                icon[2, counter] = nny*nnz*(i+1) + nnz*(j+1) + k
                icon[3, counter] = nny*nnz*(i)   + nnz*(j+1) + k
                icon[4, counter] = nny*nnz*(i)   + nnz*(j)   + k+1
                icon[5, counter] = nny*nnz*(i+1) + nnz*(j)   + k+1
                icon[6, counter] = nny*nnz*(i+1) + nnz*(j+1) + k+1
                icon[7, counter] = nny*nnz*(i)   + nnz*(j+1) + k+1
                counter += 1

produce drawing of node numbering

6.11 Visco-Plasticity

6.11.1 Tensor invariants

Before we dive into the world of nonlinear rheologies it is necessary to introduce the concept of tensor

invariants since they are needed further on. Unfortunately there are many diﬀerent notations used in the

literature and these can prove to be confusing.

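Given a (symmetric) tensor $T$, one can compute its (moment) invariants as follows:

•first invariant:

$$T_I|_{2D} = Tr[T] = T_{xx} + T_{yy}$$

$$T_I|_{3D} = Tr[T] = T_{xx} + T_{yy} + T_{zz}$$

•second invariant:

$$T_{II}|_{2D} = \frac{1}{2} Tr[T^2] = \frac{1}{2} \sum_{ij} T_{ij} T_{ji} = \frac{1}{2} (T_{xx}^2 + T_{yy}^2) + T_{xy}^2$$

$$T_{II}|_{3D} = \frac{1}{2} Tr[T^2] = \frac{1}{2} \sum_{ij} T_{ij} T_{ji} = \frac{1}{2} (T_{xx}^2 + T_{yy}^2 + T_{zz}^2) + T_{xy}^2 + T_{xz}^2 + T_{yz}^2$$

•third invariant:

$$T_{III} = \frac{1}{3} Tr[T^3] = \frac{1}{3} \sum_{ijk} T_{ij} T_{jk} T_{ki}$$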

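The implementation of the plasticity criteria relies essentially on the second invariants of the (deviatoric) stress $\boldsymbol{\tau}$ and the (deviatoric) strain rate tensors $\dot{\boldsymbol{\varepsilon}}$:

$$\tau_{II}|_{2D} = \frac{1}{2} (\tau_{xx}^2 + \tau_{yy}^2) + \tau_{xy}^2 = \frac{1}{4} (\sigma_{xx} - \sigma_{yy})^2 + \sigma_{xy}^2 = \frac{1}{4} (\sigma_1 - \sigma_2)^2$$

$$\tau_{II}|_{3D} = \frac{1}{2} (\tau_{xx}^2 + \tau_{yy}^2 + \tau_{zz}^2) + \tau_{xy}^2 + \tau_{xz}^2 + \tau_{yz}^2$$
$$= \frac{1}{6} \left[ (\sigma_{xx} - \sigma_{yy})^2 + (\sigma_{yy} - \sigma_{zz})^2 + (\sigma_{xx} - \sigma_{zz})^2 \right] + \sigma_{xy}^2 + \sigma_{xz}^2 + \sigma_{yz}^2$$
$$= \frac{1}{6} \left[ (\sigma_1 - \sigma_2)^2 + (\sigma_2 - \sigma_3)^2 + (\sigma_1 - \sigma_3)^2 \right]$$

$$\dot\varepsilon_{II}|_{2D} = \frac{1}{2} \left[ (\dot\varepsilon^d_{xx})^2 + (\dot\varepsilon^d_{yy})^2 \right] + (\dot\varepsilon^d_{xy})^2 = \frac{1}{2} \left[ \frac{1}{4} (\dot\varepsilon_{xx} - \dot\varepsilon_{yy})^2 + \frac{1}{4} (\dot\varepsilon_{yy} - \dot\varepsilon_{xx})^2 \right] + \dot\varepsilon_{xy}^2 = \frac{1}{4} (\dot\varepsilon_{xx} - \dot\varepsilon_{yy})^2 + \dot\varepsilon_{xy}^2$$

$$\dot\varepsilon_{II}|_{3D} = \frac{1}{2} \left[ (\dot\varepsilon^d_{xx})^2 + (\dot\varepsilon^d_{yy})^2 + (\dot\varepsilon^d_{zz})^2 \right] + (\dot\varepsilon^d_{xy})^2 + (\dot\varepsilon^d_{xz})^2 + (\dot\varepsilon^d_{yz})^2$$
$$= \frac{1}{6} \left[ (\dot\varepsilon_{xx} - \dot\varepsilon_{yy})^2 + (\dot\varepsilon_{yy} - \dot\varepsilon_{zz})^2 + (\dot\varepsilon_{xx} - \dot\varepsilon_{zz})^2 \right] + \dot\varepsilon_{xy}^2 + \dot\varepsilon_{xz}^2 + \dot\varepsilon_{yz}^2$$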

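Note that these (second) invariants are almost always used under a square root, so from here on the same symbols denote the square roots of the quantities above:

$$\tau_{II} \leftarrow \sqrt{\tau_{II}} \qquad \dot\varepsilon_{II} \leftarrow \sqrt{\dot\varepsilon_{II}}$$

Note that these quantities have the same dimensions as their tensor counterparts, i.e. Pa for stresses and s$^{-1}$ for strain rates.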
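The equivalent expressions of the second invariant can be verified numerically on a small example. The sketch below is an illustration only: the helper names `deviator` and `second_invariant` are assumptions, and the 2D deviator removes half of the trace, which is the convention implied by the 2D formulas above.

```python
import numpy as np

def deviator(T):
    """Deviatoric part of a square tensor (trace/n removed from the diagonal)."""
    n = T.shape[0]
    return T - np.trace(T) / n * np.eye(n)

def second_invariant(T):
    """Second moment invariant 1/2 Tr[T^2] (before taking any square root)."""
    return 0.5 * np.trace(T @ T)

sigma = np.array([[3.0, 1.0],
                  [1.0, 1.0]])                 # arbitrary 2D stress tensor
tau = deviator(sigma)

tau_II = second_invariant(tau)
tau_II_components = 0.25 * (sigma[0, 0] - sigma[1, 1])**2 + sigma[0, 1]**2
s2, s1 = np.linalg.eigvalsh(sigma)             # principal stresses, ascending order
tau_II_principal = 0.25 * (s1 - s2)**2

tau_II_sqrt = np.sqrt(tau_II)                  # the quantity used in the rheology
```

All three expressions yield the same value for this tensor.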

6.11.2 Scalar viscoplasticity

This formulation is quite easy to implement. It is widely used, e.g. [141, 132, 121], and relies on the assumption that a scalar quantity $\eta_p$ (the 'effective plastic viscosity') exists such that the deviatoric stress tensor

$$\boldsymbol{\tau} = 2 \eta_p \dot{\boldsymbol{\varepsilon}} \qquad (194)$$

is bounded by some yield stress value $Y$. From Eq. (194) it follows that $\tau_{II} = 2 \eta_p \dot\varepsilon_{II} = Y$ which yields

$$\eta_p = \frac{Y}{2 \dot\varepsilon_{II}}$$

This approach has also been coined the Viscosity Rescaling Method (VRM) [75].

insert here the rederivation 2.1.1 of spmw16

It is at this stage important to realise that (i) in areas where the strainrate is low, the resulting eﬀective

viscosity will be large, and (ii) in areas where the strainrate is high, the resulting eﬀective viscosity will be

low. This is not without consequences since (eﬀective) viscosity contrasts up to 8-10 orders of magnitude

have been observed/obtained with this formulation and it makes the FE matrix very stiﬀ, leading to

(iterative) solver convergence issues. In order to contain these viscosity contrasts one usually resorts to viscosity limiters $\eta_{min}$ and $\eta_{max}$ such that

$$\eta_{min} \le \eta_p \le \eta_{max}$$

Caution must be taken when choosing both values as they may inﬂuence the ﬁnal results.
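The effective plastic viscosity and its limiters can be sketched in a few lines. The function name `plastic_viscosity` and the numerical values are illustrative assumptions only.

```python
import numpy as np

def plastic_viscosity(Y, edot_II, eta_min, eta_max):
    """eta_p = Y/(2 edot_II), limited to the interval [eta_min, eta_max]."""
    eta_p = Y / (2.0 * edot_II)
    return np.clip(eta_p, eta_min, eta_max)

Y = 1.0e8                                     # yield stress (Pa), arbitrary
edot_II = np.array([1e-15, 1e-14, 1e-10])     # strain rate invariants (1/s)
eta = plastic_viscosity(Y, edot_II, eta_min=1e19, eta_max=1e22)
```

The low strain rate value is capped at eta_max, the high strain rate value is raised to eta_min, and the intermediate one (5e21 Pa.s) is left untouched.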

python codes/ﬁeldstone indentor

6.11.3 about the yield stress value Y

In geodynamics the yield stress value is often given as a simple function. It can be constant (in space and time) and in this case we are dealing with a von Mises plasticity yield criterion. We simply assume $Y_{vM} = C$ where $C$ is a constant cohesion independent of pressure, strain rate, deformation history, etc ...

Another model is often used: the Drucker-Prager plasticity model. A friction angle $\phi$ is then introduced and the yield value $Y$ takes the form

$$Y_{DP} = p \sin\phi + C \cos\phi$$

and therefore depends on the pressure $p$. Because $\phi$ is within the range $[0^\circ, 45^\circ]$, $Y$ is found to increase with depth (since the lithostatic pressure often dominates the overpressure).

Note that a slightly modified version of this plasticity model has been used: the total pressure $p$ is then replaced by the lithostatic pressure $p_{lith}$.
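Both yield criteria are easily expressed as functions. In this sketch the names `yield_von_mises` and `yield_drucker_prager` are hypothetical and the friction angle is passed in degrees.

```python
import numpy as np

def yield_von_mises(C):
    """Von Mises: constant yield value equal to the cohesion C."""
    return C

def yield_drucker_prager(p, C, phi_deg):
    """Drucker-Prager: Y = p sin(phi) + C cos(phi), with phi in degrees."""
    phi = np.radians(phi_deg)
    return p * np.sin(phi) + C * np.cos(phi)

C = 2.0e7                                   # cohesion (Pa), arbitrary
p = np.array([0.0, 1.0e8, 2.0e8])           # pressure increasing with depth (Pa)
Y_dp = yield_drucker_prager(p, C, 30.0)
Y_vm = yield_von_mises(C)
```

With a zero friction angle the Drucker-Prager value reduces to the von Mises one, and for a positive angle it grows linearly with pressure.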

6.12 Pressure smoothing

It has been widely documented that the use of the $Q_1 \times P_0$ element is not without problems. Aside from

the consequences it has on the FE matrix properties, we will here focus on another unavoidable side

eﬀect: the spurious pressure checkerboard modes.

These modes have been thoroughly analysed [61, 30, 113, 114]. They can be ﬁltered out [30] or simply

smoothed [81].

On the following figure (a,b), pressure fields for the lid driven cavity experiment are presented for both an even and an odd number of elements. We see that the amplitude of the modes can sometimes be so large that the 'real' pressure is not visible and that something as simple as the number of elements in the domain can determine whether these modes are triggered at all.

[Figure]

a) elemental pressure for a 32x32 grid and for a 33x33 grid;

b) image from [42, p307] for a manufactured solution;

c) elemental pressure and smoothed pressure for the punch experiment [132]

The easiest post-processing step that can be used (especially when a regular grid is used) is explained in [132]: "The element-to-node interpolation is performed by averaging the elemental values from elements common to each node; the node-to-element interpolation is performed by averaging the nodal values element-by-element. This method is not only very efficient but produces a smoothing of the pressure that is adapted to the local density of the octree. Note that these two steps can be repeated until a satisfying level of smoothness (and diffusion) of the pressure field is attained."

In the codes which rely on the $Q_1 \times P_0$ element, the (elemental) pressure is simply defined as

    p = np.zeros(nel, dtype=np.float64)

while the nodal pressure is then defined as

    q = np.zeros(nnp, dtype=np.float64)

The element-to-node algorithm is then simply (in 2D):

    count = np.zeros(nnp, dtype=np.int16)   # number of elements touching each node
    for iel in range(0, nel):
        q[icon[0,iel]] += p[iel]
        q[icon[1,iel]] += p[iel]
        q[icon[2,iel]] += p[iel]
        q[icon[3,iel]] += p[iel]
        count[icon[0,iel]] += 1
        count[icon[1,iel]] += 1
        count[icon[2,iel]] += 1
        count[icon[3,iel]] += 1
    q = q/count

Pressure smoothing is further discussed in [67].
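The two interpolation steps quoted from [132] can be tried on the 3×2 grid with a pure checkerboard pressure. This is a self-contained sketch; the function names `element_to_node` and `node_to_element` are made up for this illustration.

```python
import numpy as np

def element_to_node(p, icon, nnp):
    """Average the elemental values of all elements sharing each node."""
    q = np.zeros(nnp, dtype=np.float64)
    count = np.zeros(nnp, dtype=np.float64)
    for iel in range(icon.shape[1]):
        for k in range(icon.shape[0]):
            q[icon[k, iel]] += p[iel]
            count[icon[k, iel]] += 1
    return q / count

def node_to_element(q, icon):
    """Average the nodal values of each element."""
    return q[icon].mean(axis=0)

# connectivity of the 3x2 element grid (row by row numbering)
nelx, nely = 3, 2
nnx, nny = nelx + 1, nely + 1
icon = np.zeros((4, nelx * nely), dtype=np.int32)
for j in range(nely):
    for i in range(nelx):
        iel = i + j * nelx
        icon[:, iel] = [i + j*nnx, i + 1 + j*nnx, i + 1 + (j+1)*nnx, i + (j+1)*nnx]

p = np.array([1.0, -1.0, 1.0, -1.0, 1.0, -1.0])   # checkerboard mode
q = element_to_node(p, icon, nnx * nny)
p_smooth = node_to_element(q, icon)
```

On interior and edge nodes the checkerboard cancels out; only the corner nodes, which belong to a single element, retain the elemental value.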

produce figure to explain this

link to proto paper

link to least square and nodal derivatives

6.13 Pressure scaling

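As perfectly explained in step 32 of deal.II$^5$, we often need to scale the G term since it is many orders of magnitude smaller than K, which introduces large inaccuracies in the solving process to the point that the solution is nonsensical. This scaling coefficient is $\eta/L$. We start from

$$\begin{pmatrix} K & G \\ G^T & 0 \end{pmatrix} \cdot \begin{pmatrix} V \\ P \end{pmatrix} = \begin{pmatrix} f \\ h \end{pmatrix}$$

and introduce the scaling coefficient as follows (which in fact does not alter the solution at all):

$$\begin{pmatrix} K & \frac{\eta}{L} G \\ \frac{\eta}{L} G^T & 0 \end{pmatrix} \cdot \begin{pmatrix} V \\ \frac{L}{\eta} P \end{pmatrix} = \begin{pmatrix} f \\ \frac{\eta}{L} h \end{pmatrix}$$

We then end up with the modified Stokes system:

$$\begin{pmatrix} K & \underline{G} \\ \underline{G}^T & 0 \end{pmatrix} \cdot \begin{pmatrix} V \\ \underline{P} \end{pmatrix} = \begin{pmatrix} f \\ \underline{h} \end{pmatrix}$$

where

$$\underline{G} = \frac{\eta}{L} G \qquad \underline{P} = \frac{L}{\eta} P \qquad \underline{h} = \frac{\eta}{L} h$$

After the solve phase, we recover the real pressure with $P = \frac{\eta}{L} \underline{P}$.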
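The effect of the scaling can be illustrated on a toy saddle point system. Everything in this sketch is an assumption made for the illustration (the small matrices, the values of eta and L, the helper name `stokes_matrix`); it only shows that the scaled system returns the same pressure while being far better conditioned.

```python
import numpy as np

def stokes_matrix(K, G):
    """Assemble the block matrix [[K, G], [G^T, 0]]."""
    m = G.shape[1]
    return np.block([[K, G], [G.T, np.zeros((m, m))]])

eta, L = 1.0e4, 1.0                                     # illustrative values
K = eta * (2 * np.eye(4) - np.eye(4, k=1) - np.eye(4, k=-1))
G = np.array([[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]])
f = np.array([1.0, 2.0, 3.0, 4.0])
h = np.zeros(2)

# original system
A = stokes_matrix(K, G)
sol = np.linalg.solve(A, np.concatenate([f, h]))
P = sol[4:]

# scaled system: G and h are multiplied by eta/L
c = eta / L
A_scaled = stokes_matrix(K, c * G)
sol_scaled = np.linalg.solve(A_scaled, np.concatenate([f, c * h]))
P_recovered = c * sol_scaled[4:]                        # back to the real pressure

cond_original = np.linalg.cond(A)
cond_scaled = np.linalg.cond(A_scaled)
```

In this example the velocity part of the solution is unchanged and the condition number drops by several orders of magnitude.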

5 https://www.dealii.org/9.0.0/doxygen/deal.II/step_32.html

6.14 Pressure normalisation

When Dirichlet boundary conditions are imposed everywhere on the boundary, pressure is only present by its gradient in the equations. It is thus determined up to an arbitrary constant. In such a case, one commonly imposes that the pressure has a zero average over the whole domain or over a subset of the boundary, i.e.

$$\int_\Omega p \, dV = 0$$

Another possibility is to impose the pressure value at a single node.

Let us assume that we are using Q1×P0elements. Then the pressure is constant inside each element.

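The integral above becomes:

$$\int_\Omega p \, dV = \sum_e \int_{\Omega_e} p \, dV = \sum_e p_e \int_{\Omega_e} dV = \sum_e p_e A_e = 0$$

where the sum runs over all elements $e$ of area $A_e$. This can be rewritten

$$L^T \cdot P = 0$$

and it is a constraint on the pressure. As we have seen before ??, we can associate to it a Lagrange multiplier $\lambda$ so that we must solve the modified Stokes system:

$$\begin{pmatrix} K & G & 0 \\ G^T & 0 & L \\ 0 & L^T & 0 \end{pmatrix} \cdot \begin{pmatrix} V \\ P \\ \lambda \end{pmatrix} = \begin{pmatrix} f \\ h \\ 0 \end{pmatrix}$$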



When higher order spaces are used for pressure (continuous or discontinuous) one must then carry out

the above integration numerically by means of (usually) a Gauss-Legendre quadrature.
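For elemental (constant per element) pressures, the zero-average constraint can also be enforced a posteriori by subtracting the area-weighted average. Note that this sketch shows this simpler post-processing alternative and not the Lagrange multiplier system above; the function name `normalise_pressure` is hypothetical.

```python
import numpy as np

def normalise_pressure(p, area):
    """Shift elemental pressures so that sum_e p_e A_e = 0."""
    p_avrg = np.sum(p * area) / np.sum(area)
    return p - p_avrg

p = np.array([1.0, 2.0, 3.0, 6.0])      # elemental pressures, arbitrary
area = np.array([1.0, 1.0, 2.0, 4.0])   # element areas A_e, arbitrary
p_norm = normalise_pressure(p, area)
constraint = np.dot(area, p_norm)       # discrete version of L^T P
```

Since only a constant is subtracted, the pressure gradient (and therefore the velocity solution) is left unchanged.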

python codes/ﬁeldstone stokes sphere 3D saddlepoint/

python codes/ﬁeldstone burstedde/

python codes/ﬁeldstone busse/

python codes/ﬁeldstone RTinstability1/

python codes/ﬁeldstone saddlepoint q2q1/

python codes/ﬁeldstone compressible2/

python codes/ﬁeldstone compressible1/

python codes/ﬁeldstone saddlepoint q3q2/

6.15 The choice of solvers

Let us first look at the penalty formulation. In this case we are only solving for velocity since pressure is recovered in a post-processing step. We also know that the penalty factor is many orders of magnitude higher than the viscosity and, in combination with the use of the $Q_1 \times P_0$ element, the resulting matrix condition number is very high so that the use of iterative solvers is precluded. Indeed codes such as SOPALE [54], DOUAR [17], or FANTOM [130] relying on the penalty formulation all use direct solvers (BLKFCT, MUMPS, WSMP).

The main advantage of direct solvers is exploited in this case: they can solve ill-conditioned systems. However, memory requirements for the storage of the nonzeros of the Cholesky factor grow very fast as the number of equations/grid size increases, especially in 3D, to the point that even modern computers with tens of GB of RAM cannot deal with a $100^3$ element mesh. This explains why direct solvers are often used for 2D problems and rarely in 3D, with notable exceptions [132, 144, 18, 88, 4, 5, 6, 140, 98].

In light of all this, let us start again from the (full) Stokes system:

$$\begin{pmatrix} K & G \\ G^T & 0 \end{pmatrix} \cdot \begin{pmatrix} V \\ P \end{pmatrix} = \begin{pmatrix} f \\ h \end{pmatrix}$$

We need to solve this system in order to obtain the solution, i.e. the V and P vectors. But how?

Unfortunately, this question is not simple to answer and the 'right' answer depends on many parameters. How big is the matrix? What is the condition number of the matrix K? Is it symmetric (i.e. how are compressible terms handled)?

6.15.1 The Schur complement approach

Let us write the above system as two equations:

$$K V + G P = f \qquad (195)$$

$$G^T V = h \qquad (196)$$

The first line can be re-written $V = K^{-1}(f - G P)$ and can be inserted in the second:

$$G^T V = G^T [K^{-1} (f - G P)] = h$$

or,

$$G^T K^{-1} G P = G^T K^{-1} f - h$$

The matrix $S = G^T K^{-1} G$ is called the Schur complement. It is Symmetric (since K is symmetric) and Positive-Definite$^6$ (SPD) if $Ker(G) = \{0\}$. look in donea-huerta book for details

Having solved this equation (we have obtained P), the velocity can be recovered by solving $K V = f - G P$. For now, let us assume that we have built the S matrix and the right hand side $\tilde{f} = G^T K^{-1} f - h$. We must solve $S P = \tilde{f}$.

Since S is SPD, the Conjugate Gradient (CG) method is very appropriate to solve this system. Indeed,

looking at the deﬁnition of Wikipedia: In mathematics, the conjugate gradient method is an algorithm for

the numerical solution of particular systems of linear equations, namely those whose matrix is symmetric

and positive-deﬁnite. The conjugate gradient method is often implemented as an iterative algorithm,

applicable to sparse systems that are too large to be handled by a direct implementation or other direct

methods such as the Cholesky decomposition. Large sparse systems often arise when numerically solving

partial diﬀerential equations or optimization problems.

The Conjugate Gradient algorithm is as follows:

6 $M$ positive definite $\iff x^T M x > 0 \quad \forall x \in \mathbb{R}^n \setminus \{0\}$

Algorithm obtained from https://en.wikipedia.org/wiki/Conjugate_gradient_method

Let us look at this algorithm up close. The parts which may prove to be somewhat tricky are those involving the matrix (in our case the Schur complement). We start the iterations with a guess pressure $P_0$ (and an initial guess velocity which could be obtained by solving $K V_0 = f - G P_0$).

$$r_0 = \tilde{f} - S P_0 \qquad (197)$$
$$= G^T K^{-1} f - h - (G^T K^{-1} G) P_0 \qquad (198)$$
$$= G^T K^{-1} (f - G P_0) - h \qquad (199)$$
$$= G^T K^{-1} K V_0 - h \qquad (200)$$
$$= G^T V_0 - h \qquad (201)$$

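We now turn to the $\alpha_k$ coefficient:

$$\alpha_k = \frac{r_k^T r_k}{p_k^T S p_k} = \frac{r_k^T r_k}{p_k^T G^T K^{-1} G p_k} = \frac{r_k^T r_k}{(G p_k)^T K^{-1} (G p_k)}$$

We then define $\tilde{p}_k = G p_k$, so that $\alpha_k$ can be computed as follows:

1. compute $\tilde{p}_k = G p_k$

2. solve $K d_k = \tilde{p}_k$

3. compute $\alpha_k = (r_k^T r_k)/(\tilde{p}_k^T d_k)$

Then we need to look at the term $S p_k$:

$$S p_k = G^T K^{-1} G p_k = G^T K^{-1} \tilde{p}_k = G^T d_k$$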

We can then rewrite the CG algorithm as follows [149]:

•$r_0 = G^T V_0 - h$

•if $r_0$ is sufficiently small, then return $(V_0, P_0)$ as the result

•$p_0 = r_0$

•$k = 0$

•repeat

–compute $\tilde{p}_k = G p_k$

–solve $K d_k = \tilde{p}_k$

–compute $\alpha_k = (r_k^T r_k)/(\tilde{p}_k^T d_k)$

–$P_{k+1} = P_k + \alpha_k p_k$

–$r_{k+1} = r_k - \alpha_k G^T d_k$

–if $r_{k+1}$ is sufficiently small, then exit loop

–$\beta_k = (r_{k+1}^T r_{k+1})/(r_k^T r_k)$

–$p_{k+1} = r_{k+1} + \beta_k p_k$

–$k = k + 1$

•return $P_{k+1}$ as result

We see that we have managed to solve the Schur complement equation with the Conjugate Gradient method without ever building the matrix S. Having obtained the pressure solution, we can easily recover the corresponding velocity with $K V_{k+1} = f - G P_{k+1}$. However, this is rather unfortunate because it requires yet another solve with the K matrix. As it turns out, we can slightly alter the above algorithm to have it update the velocity as well so that this last solve is unnecessary.

We have

$$V_{k+1} = K^{-1}(f - G P_{k+1}) = K^{-1}(f - G (P_k + \alpha_k p_k)) = K^{-1}(f - G P_k) - \alpha_k K^{-1} G p_k = V_k - \alpha_k K^{-1} \tilde{p}_k = V_k - \alpha_k d_k$$

and we can insert this minor extra calculation inside the algorithm and get the velocity solution nearly for free. The final CG algorithm is then

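solver cg:

•compute $V_0 = K^{-1}(f - G P_0)$

•$r_0 = G^T V_0 - h$

•if $r_0$ is sufficiently small, then return $(V_0, P_0)$ as the result

•$p_0 = r_0$

•$k = 0$

•repeat

–compute $\tilde{p}_k = G p_k$

–solve $K d_k = \tilde{p}_k$

–compute $\alpha_k = (r_k^T r_k)/(\tilde{p}_k^T d_k)$

–$P_{k+1} = P_k + \alpha_k p_k$

–$V_{k+1} = V_k - \alpha_k d_k$

–$r_{k+1} = r_k - \alpha_k G^T d_k$

–if $r_{k+1}$ is sufficiently small ($|r_{k+1}|_2 / |r_0|_2 < tol$), then exit loop

–$\beta_k = (r_{k+1}^T r_{k+1})/(r_k^T r_k)$

–$p_{k+1} = r_{k+1} + \beta_k p_k$

–$k = k + 1$

•return $P_{k+1}$ as result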
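The solver cg steps translate almost line by line into code. The following is a minimal dense sketch, assuming small matrices so that `np.linalg.solve` can stand in for whatever method is used for the inner solves with K; the function name `schur_cg` and the test problem are made up for this example. It returns both the velocity and the pressure.

```python
import numpy as np

def schur_cg(K, G, f, h, P0, tol=1e-10, maxiter=100):
    """Conjugate Gradient on the Schur complement equation, S never being built."""
    P = P0.copy()
    V = np.linalg.solve(K, f - G @ P)          # V_0
    r = G.T @ V - h                            # r_0
    r0_norm = np.linalg.norm(r)
    if r0_norm == 0.0:
        return V, P
    p = r.copy()
    for k in range(maxiter):
        ptilde = G @ p
        d = np.linalg.solve(K, ptilde)         # inner solve with K
        alpha = (r @ r) / (ptilde @ d)
        P = P + alpha * p
        V = V - alpha * d
        r_new = r - alpha * (G.T @ d)
        if np.linalg.norm(r_new) / r0_norm < tol:
            break
        beta = (r_new @ r_new) / (r @ r)
        p = r_new + beta * p
        r = r_new
    return V, P

# small arbitrary test problem: K is SPD, G has full column rank
K = 2 * np.eye(4) - np.eye(4, k=1) - np.eye(4, k=-1)
G = np.array([[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]])
f = np.array([1.0, 2.0, 3.0, 4.0])
h = np.array([0.1, -0.2])
V, P = schur_cg(K, G, f, h, P0=np.zeros(2))

# reference: direct solve of the full saddle point system
full = np.block([[K, G], [G.T, np.zeros((2, 2))]])
ref = np.linalg.solve(full, np.concatenate([f, h]))
```

Each iteration costs exactly one solve with K, as noted in the text; in a real code this inner solve would itself be a sparse direct or iterative solve.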

This iterative algorithm will converge to the solution with a rate which depends on the condition number of the S matrix, which is not easily obtainable since S is never built. Also, we know that large viscosity contrasts in the domain will influence this too. Finally, we notice that this algorithm requires one solve with matrix K per iteration but says nothing about the method employed to do so.

One thing we know improves the convergence of any iterative solver is the use of a preconditioner matrix, and we therefore use the Preconditioned Conjugate Gradient method:

Algorithm obtained from https://en.wikipedia.org/wiki/Conjugate_gradient_method

In the algorithm above the preconditioner matrix M has to be symmetric positive-definite and fixed, i.e., cannot change from iteration to iteration.

We see that this algorithm introduces an additional vector z and a solve with the matrix M at each iteration, which means that M must be such that solving $M x = f$, where $f$ is a given rhs vector, must be cheap. Ultimately, the PCG algorithm applied to the Schur complement equation takes the form:

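solver pcg:

•compute $V_0 = K^{-1}(f - G P_0)$

•$r_0 = G^T V_0 - h$

•if $r_0$ is sufficiently small, then return $(V_0, P_0)$ as the result

•$z_0 = M^{-1} r_0$

•$p_0 = z_0$

•$k = 0$

•repeat

–compute $\tilde{p}_k = G p_k$

–solve $K d_k = \tilde{p}_k$

–compute $\alpha_k = (r_k^T z_k)/(\tilde{p}_k^T d_k)$

–$P_{k+1} = P_k + \alpha_k p_k$

–$V_{k+1} = V_k - \alpha_k d_k$

–$r_{k+1} = r_k - \alpha_k G^T d_k$

–if $r_{k+1}$ is sufficiently small ($|r_{k+1}|_2 / |r_0|_2 < tol$), then exit loop

–$z_{k+1} = M^{-1} r_{k+1}$

–$\beta_k = (z_{k+1}^T r_{k+1})/(z_k^T r_k)$

–$p_{k+1} = z_{k+1} + \beta_k p_k$

–$k = k + 1$

•return $P_{k+1}$ as result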
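The solver pcg steps can be sketched in the same way. Since the text leaves the choice of M open, the preconditioner used here, the diagonal of $G^T diag(K)^{-1} G$, is purely an assumption for the sake of a runnable example, as are the function name `schur_pcg` and the dense inner solves.

```python
import numpy as np

def schur_pcg(K, G, f, h, M, P0, tol=1e-10, maxiter=100):
    """Preconditioned CG on the Schur complement equation (M is SPD and fixed)."""
    P = P0.copy()
    V = np.linalg.solve(K, f - G @ P)          # V_0
    r = G.T @ V - h                            # r_0
    r0_norm = np.linalg.norm(r)
    if r0_norm == 0.0:
        return V, P
    z = np.linalg.solve(M, r)                  # z_0 = M^{-1} r_0
    p = z.copy()
    for k in range(maxiter):
        ptilde = G @ p
        d = np.linalg.solve(K, ptilde)         # inner solve with K
        alpha = (r @ z) / (ptilde @ d)
        P = P + alpha * p
        V = V - alpha * d
        r_new = r - alpha * (G.T @ d)
        if np.linalg.norm(r_new) / r0_norm < tol:
            break
        z_new = np.linalg.solve(M, r_new)      # must be cheap in practice
        beta = (z_new @ r_new) / (z @ r)
        p = z_new + beta * p
        r, z = r_new, z_new
    return V, P

K = 2 * np.eye(4) - np.eye(4, k=1) - np.eye(4, k=-1)
G = np.array([[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]])
f = np.array([1.0, 2.0, 3.0, 4.0])
h = np.array([0.1, -0.2])

# assumed preconditioner: diagonal of G^T diag(K)^{-1} G
M = np.diag(np.diag(G.T @ np.diag(1.0 / np.diag(K)) @ G))
V, P = schur_pcg(K, G, f, h, M, P0=np.zeros(2))

full = np.block([[K, G], [G.T, np.zeros((2, 2))]])
ref = np.linalg.solve(full, np.concatenate([f, h]))
```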

how to compute M for the Schur complement ?

6.16 The GMRES approach

6.17 The consistent boundary ﬂux (CBF)

The Consistent Boundary Flux technique was devised to alleviate the problem of the accuracy of primary variables derivatives (mainly velocity and temperature) on boundaries, where basis function (nodal) derivatives do not exist. These derivatives are important since they are needed to compute the heat flux (and therefore the Nusselt number) or dynamic topography and geoid.

The idea was ﬁrst introduced in [94] and later used in geodynamics [147]. It was ﬁnally implemented

in the CitcomS code [148] and more recently in the ASPECT code (dynamic topography postprocessor).

Note that the CBF should be seen as a post-processor step as it does not alter the primary variables

values.

The CBF method is implemented and used in ??.

6.17.1 applied to the Stokes equation

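We start from the strong form:

$$\nabla \cdot \boldsymbol{\sigma} = \boldsymbol{b}$$

and then write the weak form:

$$\int_\Omega N \nabla \cdot \boldsymbol{\sigma} \, dV = \int_\Omega N \boldsymbol{b} \, dV$$

where $N$ is any test function. We then use the two equations:

$$\nabla \cdot (N \boldsymbol{\sigma}) = N \nabla \cdot \boldsymbol{\sigma} + \nabla N \cdot \boldsymbol{\sigma} \quad \text{(chain rule)}$$

$$\int_\Omega (\nabla \cdot \boldsymbol{f}) \, dV = \int_\Gamma \boldsymbol{f} \cdot \boldsymbol{n} \, dS \quad \text{(divergence theorem)}$$

Integrating the first equation over $\Omega$ and using the second, we can write:

$$\int_\Gamma N \boldsymbol{\sigma} \cdot \boldsymbol{n} \, dS - \int_\Omega \nabla N \cdot \boldsymbol{\sigma} \, dV = \int_\Omega N \boldsymbol{b} \, dV$$

On $\Gamma$, the traction vector is given by $\boldsymbol{t} = \boldsymbol{\sigma} \cdot \boldsymbol{n}$:

$$\int_\Gamma N \boldsymbol{t} \, dS = \int_\Omega \nabla N \cdot \boldsymbol{\sigma} \, dV + \int_\Omega N \boldsymbol{b} \, dV$$

Considering the traction vector as an unknown living on the nodes on the boundary, we can expand (for $Q_1$ elements)

$$t_x = \sum_{i=1}^{2} t_{x|i} N_i \qquad t_y = \sum_{i=1}^{2} t_{y|i} N_i$$

on the boundary so that the left hand term yields a mass matrix $M'$. Finally, using our previous experience of discretising the weak form, we can write:

$$M' \cdot \mathcal{T} = -K V - G P + f$$

where $\mathcal{T}$ is the vector of assembled tractions which we want to compute and $V$ and $P$ are the solutions of the Stokes problem. Note that the assembly only takes place on the elements along the boundary.

Note that the assembled mass matrix is tri-diagonal and can be easily solved with a Conjugate Gradient method. With a trapezoidal integration rule (i.e. Gauss-Lobatto) the matrix can even be diagonalised and the resulting matrix is simply diagonal, which results in a very cheap solve [147].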

6.17.2 applied to the heat equation

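We start from the strong form of the heat transfer equation (without the source terms for simplicity):

$$\rho c_p \left( \frac{\partial T}{\partial t} + \boldsymbol{v} \cdot \nabla T \right) = \nabla \cdot k \nabla T$$

The weak form then writes:

$$\int_\Omega N \rho c_p \frac{\partial T}{\partial t} dV + \rho c_p \int_\Omega N \boldsymbol{v} \cdot \nabla T \, dV = \int_\Omega N \nabla \cdot k \nabla T \, dV$$

Using once again integration by parts and the divergence theorem:

$$\int_\Omega N \rho c_p \frac{\partial T}{\partial t} dV + \rho c_p \int_\Omega N \boldsymbol{v} \cdot \nabla T \, dV = \int_\Gamma N k \nabla T \cdot \boldsymbol{n} \, d\Gamma - \int_\Omega \nabla N \cdot k \nabla T \, dV$$

On the boundary we are interested in the heat flux $\boldsymbol{q} = -k \nabla T$:

$$\int_\Omega N \rho c_p \frac{\partial T}{\partial t} dV + \rho c_p \int_\Omega N \boldsymbol{v} \cdot \nabla T \, dV = -\int_\Gamma N \boldsymbol{q} \cdot \boldsymbol{n} \, d\Gamma - \int_\Omega \nabla N \cdot k \nabla T \, dV$$

or,

$$\int_\Gamma N \boldsymbol{q} \cdot \boldsymbol{n} \, d\Gamma = -\int_\Omega N \rho c_p \frac{\partial T}{\partial t} dV - \rho c_p \int_\Omega N \boldsymbol{v} \cdot \nabla T \, dV - \int_\Omega \nabla N \cdot k \nabla T \, dV$$

Considering the normal heat flux $q_n = \boldsymbol{q} \cdot \boldsymbol{n}$ as an unknown living on the nodes on the boundary,

$$q_n = \sum_{i=1}^{2} q_{n|i} N_i$$

so that the left hand term becomes a mass matrix for the shape functions living on the boundary. We have already covered the right hand side terms when building the FE system to solve the heat transport equation, so that in the end

$$M' \cdot \mathcal{Q}_n = -M \cdot \frac{\partial T}{\partial t} - K_a \cdot T - K_d \cdot T$$

where $\mathcal{Q}_n$ is the assembled vector of normal heat flux components. Note that in all terms the assembly only takes place over the elements along the boundary.

Note that the resulting matrix is symmetric.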

6.17.3 implementation - Stokes equation

Let us start with a small example, a 3x2 element FE grid:

[Figure: the 3x2 element grid with the element numbers (0 to 5), the node numbers (0 to 11) and the velocity degree of freedom numbers (0 to 23).]

Red color corresponds to the dofs in the x direction, blue color indicates a dof in the y direction.

We have nnp=12, nel=6, NfemV=24. Let us assume that free slip boundary conditions are applied. The bc_fix array is then:

bc_fix=[ TTTTTTTTTTTTTTTTTTTTTTTT]

Note that since corners belong to two edges, we effectively prescribe no-slip boundary conditions on those.

We wish to compute the tractions on the boundaries, and more precisely for the dofs for which a Dirichlet velocity boundary condition has been prescribed. The number of (traction) unknowns NfemTr is then the number of T in the bc_fix array. In our specific case, we have NfemTr= . This means that for each targeted dof we need to be able to find its identity/number between 0 and NfemTr-1. We therefore create the array bc_nb which is filled as follows:

bc_nb=[TTTTTTTTTTTTTTTTTTTTTTTT]

This translates as follows in the code:

    NfemTr = np.sum(bc_fix)                     # number of traction unknowns
    bc_nb = np.zeros(NfemV, dtype=np.int32)     # traction number of each target dof
    counter = 0
    for i in range(0, NfemV):
        if bc_fix[i]:
            bc_nb[i] = counter
            counter += 1
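To make the numbering concrete, the sketch below builds a bc_fix array for the 3x2 element grid and derives bc_nb from it. It rests on assumptions that are not spelled out in the text: free slip is taken to mean u=0 on the left and right boundaries and v=0 on the bottom and top boundaries, dofs are ordered (u0, v0, u1, v1, ...), and non-target dofs are flagged with -1 rather than 0 so that they cannot be confused with traction unknown number 0.

```python
import numpy as np

nnx, nny, ndofV = 4, 3, 2
nnp = nnx * nny
NfemV = nnp * ndofV

# assumed free slip: normal velocity component fixed on each boundary
bc_fix = np.zeros(NfemV, dtype=bool)
for j in range(nny):
    for i in range(nnx):
        node = i + j * nnx
        if i == 0 or i == nnx - 1:
            bc_fix[2 * node] = True        # u = 0 on left and right boundaries
        if j == 0 or j == nny - 1:
            bc_fix[2 * node + 1] = True    # v = 0 on bottom and top boundaries

NfemTr = int(np.sum(bc_fix))

# traction unknown number of each target dof, -1 elsewhere (assumed convention)
bc_nb = np.full(NfemV, -1, dtype=np.int32)
counter = 0
for i in range(NfemV):
    if bc_fix[i]:
        bc_nb[i] = counter
        counter += 1
```

Under these assumptions the four corner nodes have both dofs fixed, the two interior nodes have none, and the targeted dofs are numbered consecutively from 0 to NfemTr-1.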


The algorithm is then as follows:

A Prepare two arrays to store the matrix $M_{cbf}$ and its right hand side $rhs_{cbf}$

B Loop over all elements

C For each element touching a boundary, compute the residual vector $R_{el} = -f_{el} + K_{el} V_{el} + G_{el} P_{el}$

D Loop over the four edges of the element using the connectivity array

E For each edge loop over the number of degrees of freedom (2 in 2D)

F For each edge assess whether the dofs on both ends are target dofs.

G If so, compute the mass matrix $M_{edge}$ for this edge

H Extract the 2 values off the element residual vector and assemble these in $rhs_{cbf}$

I Assemble $M_{edge}$ into the NfemTr×NfemTr matrix using bc_nb

    M_cbf = np.zeros((NfemTr,NfemTr), np.float64)              # A
    rhs_cbf = np.zeros(NfemTr, np.float64)
    for iel in range(0, nel):                                  # B
        # ... compute elemental residual res_el ...            # C
        # boundary 0-1                                         # D
        for i in range(0, ndofV):                              # E
            idof0 = 2*icon[0,iel]+i
            idof1 = 2*icon[1,iel]+i
            if bc_fix[idof0] and bc_fix[idof1]:                # F
                idofTr0 = bc_nb[idof0]
                idofTr1 = bc_nb[idof1]
                rhs_cbf[idofTr0] += res_el[0+i]                # H
                rhs_cbf[idofTr1] += res_el[2+i]
                M_cbf[idofTr0,idofTr0] += M_edge[0,0]          #
                M_cbf[idofTr0,idofTr1] += M_edge[0,1]          # I
                M_cbf[idofTr1,idofTr0] += M_edge[1,0]          #
                M_cbf[idofTr1,idofTr1] += M_edge[1,1]          #
        # boundary 1-2                                         # D
        # ...
        # boundary 2-3                                         # D
        # ...
        # boundary 3-0                                         # D
        # ...

6.18 The value of the timestep

The time step $\delta t$ used for time integration is chosen to comply with the Courant-Friedrichs-Lewy (CFL) condition [7]:

$$\delta t = C \min\left( \frac{h}{\max |v|}, \frac{h^2}{\kappa} \right)$$

where $h$ is a measure of the element size, $\kappa = k/(\rho c_p)$ is the thermal diffusivity and $C$ is the so-called CFL number, chosen in $[0,1[$.

In essence the CFL condition arises when solving hyperbolic PDEs. It limits the time step in many explicit time-marching computer simulations so that the simulation does not produce incorrect results.

This condition is not needed when solving the Stokes equation but it is mandatory when solving the heat transport equation or any kind of advection-diffusion equation. Note that any increase of grid resolution (i.e. $h$ becomes smaller) yields an automatic decrease of the time step value.
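The time step formula above can be sketched in a few lines of Python. This is a minimal illustration, not the fieldstone implementation: the function name `compute_timestep` and the numerical values are assumptions chosen for the example.

```python
import numpy as np

def compute_timestep(CFL, h, vmax, kappa):
    """Return dt = CFL * min(h/vmax, h^2/kappa) (hypothetical helper)."""
    dt_adv = h / vmax if vmax > 0.0 else np.inf        # advective limit
    dt_diff = h**2 / kappa if kappa > 0.0 else np.inf  # diffusive limit
    return CFL * min(dt_adv, dt_diff)

# illustrative values only: halving h divides the diffusive limit by 4
dt_coarse = compute_timestep(CFL=0.5, h=0.1, vmax=2.0, kappa=1.0)
dt_fine = compute_timestep(CFL=0.5, h=0.05, vmax=2.0, kappa=1.0)
print(dt_coarse, dt_fine)
```

With these values the diffusive term $h^2/\kappa$ is the smaller of the two, so refining the mesh by a factor 2 reduces the time step by a factor 4.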

6.19 mappings

The name isoparametric derives from the fact that the same (’iso’) functions are used as basis functions

and for the mapping to the reference element.

6.20 Exporting data to vtk format

This format seems to be the universally accepted format for 2D and 3D visualisation in Computational Geodynamics. Such files can be opened with free software such as ParaView^7, MayaVi^8 or VisIt^9.

Unfortunately it is my experience that no simple tutorial exists about how to build such files. There is an official document which describes the vtk format^10 but it delivers the information in a convoluted way. I therefore describe hereafter how fieldstone builds the vtk files.

I show hereunder the vtk file corresponding to the 3x2 grid presented earlier (6.10). In this particular example there are:

• 12 nodes and 6 elements

• 1 elemental field (the pressure p)

• 2 nodal fields: 1 scalar (the smoothed pressure q), 1 vector (the velocity field u,v,0)

Note that vtk files are inherently 3D so that even in the case of a 2D simulation the z-coordinate of the points and for instance their z-velocity component must be provided. The file, usually called solution.vtu, starts with a header:

We then proceed to write the node coordinates as follows:

0.000000e+00 0.000000e+00 0.000000e+00
3.333333e-01 0.000000e+00 0.000000e+00
6.666667e-01 0.000000e+00 0.000000e+00
1.000000e+00 0.000000e+00 0.000000e+00
0.000000e+00 5.000000e-01 0.000000e+00
3.333333e-01 5.000000e-01 0.000000e+00
6.666667e-01 5.000000e-01 0.000000e+00
1.000000e+00 5.000000e-01 0.000000e+00
0.000000e+00 1.000000e+00 0.000000e+00
3.333333e-01 1.000000e+00 0.000000e+00
6.666667e-01 1.000000e+00 0.000000e+00
1.000000e+00 1.000000e+00 0.000000e+00
</DataArray>
</Points>

These are followed by the elemental ﬁeld(s):

-1.333333e+00
-3.104414e-10
1.333333e+00
-1.333333e+00
8.278417e-17
1.333333e+00
</DataArray>
</CellData>

Nodal quantities are written next:

0.000000e+00 0.000000e+00 0.000000e+00
0.000000e+00 0.000000e+00 0.000000e+00
8.888885e-08 -8.278405e-24 0.000000e+00
8.888885e-08 1.655682e-23 0.000000e+00
0.000000e+00 0.000000e+00 0.000000e+00
1.000000e+00 0.000000e+00 0.000000e+00
</DataArray>
-1.333333e+00
-6.666664e-01
6.666664e-01
1.333333e+00
-1.333333e+00
-6.666664e-01
6.666664e-01
1.333333e+00
-1.333333e+00
-6.666664e-01
6.666664e-01
1.333333e+00
</DataArray>
</PointData>

7 https://www.paraview.org/
8 https://docs.enthought.com/mayavi/mayavi/
9 https://wci.llnl.gov/simulation/computer-codes/visit/
10 https://www.vtk.org/wp-content/uploads/2015/04/file-formats.pdf

To this information we must append 3 more datasets. The first one is the connectivity, the second one is the offsets and the third one is the type. The first one is trivial since the connectivity is needed for the Finite Elements anyway. The second must be understood as follows: when reading the connectivity information in a linear manner, the offset values indicate the beginning of each element (omitting the zero value). The third is simply the type of element as given in the vtk format document (9 corresponds to a generic quadrilateral with an internal numbering consistent with ours).

<Cells>

0 1 5 4

1 2 6 5

2 3 7 6

4 5 9 8

5 6 10 9

6 7 11 10

</DataArray>

</DataArray>

</DataArray>

</Cells>

The file is then closed with

</Piece>

</UnstructuredGrid>

</VTKFile>

The solution.vtu file can then be opened with ParaView, MayaVi or VisIt and the reader is advised to find tutorials online on how to install and use this software.
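The whole file layout described in this section can be sketched as a single Python function. This is a hedged illustration rather than the exact fieldstone routine: the function name `write_vtu`, the attribute values of the XML tags and the zero-valued fields are assumptions made for the example, and the file is written to a temporary directory.

```python
import os
import tempfile
import numpy as np

def write_vtu(fname, x, y, icon, p, q, u, v):
    """Write a 2D mesh of 4-node quadrilaterals and its fields to an ASCII .vtu file."""
    nnp, nel = x.size, icon.shape[1]
    with open(fname, "w") as f:
        f.write("<VTKFile type='UnstructuredGrid' version='0.1' byte_order='BigEndian'>\n")
        f.write("<UnstructuredGrid>\n")
        f.write("<Piece NumberOfPoints='%d' NumberOfCells='%d'>\n" % (nnp, nel))
        # node coordinates (z is always written, even in 2D)
        f.write("<Points>\n<DataArray type='Float32' NumberOfComponents='3' format='ascii'>\n")
        for i in range(nnp):
            f.write("%e %e %e\n" % (x[i], y[i], 0.0))
        f.write("</DataArray>\n</Points>\n")
        # elemental field(s)
        f.write("<CellData Scalars='scalars'>\n<DataArray type='Float32' Name='p' format='ascii'>\n")
        for iel in range(nel):
            f.write("%e\n" % p[iel])
        f.write("</DataArray>\n</CellData>\n")
        # nodal fields: velocity vector, then smoothed pressure
        f.write("<PointData Scalars='scalars'>\n")
        f.write("<DataArray type='Float32' NumberOfComponents='3' Name='velocity' format='ascii'>\n")
        for i in range(nnp):
            f.write("%e %e %e\n" % (u[i], v[i], 0.0))
        f.write("</DataArray>\n<DataArray type='Float32' Name='q' format='ascii'>\n")
        for i in range(nnp):
            f.write("%e\n" % q[i])
        f.write("</DataArray>\n</PointData>\n")
        # connectivity, offsets and types
        f.write("<Cells>\n<DataArray type='Int32' Name='connectivity' format='ascii'>\n")
        for iel in range(nel):
            f.write("%d %d %d %d\n" % tuple(icon[:, iel]))
        f.write("</DataArray>\n<DataArray type='Int32' Name='offsets' format='ascii'>\n")
        for iel in range(nel):
            f.write("%d\n" % ((iel + 1) * 4))
        f.write("</DataArray>\n<DataArray type='Int32' Name='types' format='ascii'>\n")
        for iel in range(nel):
            f.write("9\n")  # 9 = VTK_QUAD
        f.write("</DataArray>\n</Cells>\n")
        f.write("</Piece>\n</UnstructuredGrid>\n</VTKFile>\n")

# 3x2 grid on the unit square, numbered row by row
nelx, nely = 3, 2
nnx, nny = nelx + 1, nely + 1
x = np.array([i / nelx for j in range(nny) for i in range(nnx)])
y = np.array([j / nely for j in range(nny) for i in range(nnx)])
icon = np.zeros((4, nelx * nely), dtype=np.int32)
iel = 0
for j in range(nely):
    for i in range(nelx):
        icon[:, iel] = [i + j*nnx, i + 1 + j*nnx, i + 1 + (j+1)*nnx, i + (j+1)*nnx]
        iel += 1

# placeholder fields (zeros), for illustration only
p = np.zeros(nelx * nely)
q, u, v = np.zeros(nnx * nny), np.zeros(nnx * nny), np.zeros(nnx * nny)
fname = os.path.join(tempfile.gettempdir(), "solution.vtu")
write_vtu(fname, x, y, icon, p, q, u, v)
```

The offsets are simply 4, 8, 12, ... since every element has four nodes, and the connectivity rows reproduce the element list shown above (0 1 5 4, 1 2 6 5, ...).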

6.21 Runge-Kutta methods

These methods were developed around 1900 by the German mathematicians Carl Runge and Martin

Kutta. The RK methods are methods for the numerical integration of ODEs. These methods are well

documented in any numerical analysis textbook and the reader is referred to [57, 69].

Any Runge-Kutta method is uniquely identiﬁed by its Butcher tableau.

The following method is called the Runge-Kutta-Fehlberg method and is commonly abbreviated

RKF45. Its Butcher tableau is as follows:

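$$
\begin{array}{c|cccccc}
0 & & & & & & \\
1/4 & 1/4 & & & & & \\
3/8 & 3/32 & 9/32 & & & & \\
12/13 & 1932/2197 & -7200/2197 & 7296/2197 & & & \\
1 & 439/216 & -8 & 3680/513 & -845/4104 & & \\
1/2 & -8/27 & 2 & -3544/2565 & 1859/4104 & -11/40 & \\
\hline
 & 16/135 & 0 & 6656/12825 & 28561/56430 & -9/50 & 2/55 \\
 & 25/216 & 0 & 1408/2565 & 2197/4104 & -1/5 & 0
\end{array}
$$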

The ﬁrst row of coeﬃcients at the bottom of the table gives the ﬁfth-order accurate method, and the

second row gives the fourth-order accurate method.
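As an illustration of how the tableau is used, the following sketch performs one RKF45 step for a scalar ODE $y' = f(t,y)$. The function name `rkf45_step` and the test problem $y'=y$ are assumptions made for this example; the coefficients are those of the tableau above.

```python
import numpy as np

# Butcher tableau of the RKF45 method (nodes c, matrix a, weights b5 and b4)
c = np.array([0, 1/4, 3/8, 12/13, 1, 1/2])
a = np.array([
    [0, 0, 0, 0, 0],
    [1/4, 0, 0, 0, 0],
    [3/32, 9/32, 0, 0, 0],
    [1932/2197, -7200/2197, 7296/2197, 0, 0],
    [439/216, -8, 3680/513, -845/4104, 0],
    [-8/27, 2, -3544/2565, 1859/4104, -11/40]])
b5 = np.array([16/135, 0, 6656/12825, 28561/56430, -9/50, 2/55])
b4 = np.array([25/216, 0, 1408/2565, 2197/4104, -1/5, 0])

def rkf45_step(f, t, y, h):
    """One step of size h; returns the 5th- and 4th-order solutions
    and their difference, which serves as an error estimate."""
    k = np.zeros(6)
    for i in range(6):
        yi = y + h * np.dot(a[i, :i], k[:i])  # stage value from previous stages
        k[i] = f(t + c[i] * h, yi)
    y5 = y + h * np.dot(b5, k)
    y4 = y + h * np.dot(b4, k)
    return y5, y4, abs(y5 - y4)

# test problem y' = y, y(0) = 1, whose exact solution is exp(t)
y5, y4, err = rkf45_step(lambda t, y: y, 0.0, 1.0, 0.1)
print(y5, y4, err, np.exp(0.1))
```

In an adaptive integrator the difference between the two solutions would be used to accept or reject the step and to adjust $h$; that logic is left out here.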

6.22 Am I in or not?

It is quite common that at some point one must answer the question: ”Given a mesh and its connectivity

on the one hand, and the coordinates of a point on the other, how do I accurately and quickly determine

in which element the point resides?”

One typical occurrence of such a problem is linked to the use of the Particle-In-Cell technique: particles are advected and move through the mesh, and need to be localised at every time step. This question could also arise in the context of a benchmark where certain quantities need to be measured at specific locations inside the domain.

6.22.1 Two-dimensional space

We shall ﬁrst focus on quadrilaterals. There are many kinds of quadrilaterals as shown hereunder:

I wish to arrive at a single algorithm which is applicable to all quadrilaterals and therefore choose an irregular quadrilateral. For simplicity, let us consider a $Q_1$ element, with a single node at each corner.

Several rather simple options exist:

• We could subdivide the quadrilateral into two triangles and check whether point $M$ is inside any of them (as it turns out, this problem is rather straightforward for triangles. Simply google it.)

• We could check that point $M$ is always on the left side of segments 0→1, 1→2, 2→3, 3→0.

•...

Any of these approaches will work although some might be faster than others. In three-dimensions

all will however become cumbersome to implement and might not even work at all. Fortunately, there is

an elegant way to answer the question, as detailed in the following subsection.
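The second option listed above (point $M$ on the left of all four edges) can be sketched as follows. This is a minimal example which assumes a convex quadrilateral whose corners are numbered counterclockwise; the function names and the test coordinates are hypothetical.

```python
def is_left(xa, ya, xb, yb, xm, ym):
    """True if M is on the left of (or on) the oriented segment A->B.
    Uses the sign of the z-component of the cross product (B-A) x (M-A)."""
    return (xb - xa) * (ym - ya) - (yb - ya) * (xm - xa) >= 0.0

def point_in_quad(xq, yq, xm, ym):
    """xq, yq: corner coordinates, counterclockwise, convex quadrilateral."""
    for i in range(4):
        j = (i + 1) % 4  # edges 0->1, 1->2, 2->3, 3->0
        if not is_left(xq[i], yq[i], xq[j], yq[j], xm, ym):
            return False
    return True

# an irregular (but convex) quadrilateral, for illustration only
xq = [0.0, 2.0, 1.8, -0.1]
yq = [0.0, 0.2, 1.5, 1.0]
print(point_in_quad(xq, yq, 1.0, 0.5))   # well inside
print(point_in_quad(xq, yq, 3.0, 0.5))   # to the right of the element
print(point_in_quad(xq, yq, 1.0, 0.05))  # just below edge 0->1
```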

6.22.2 Three-dimensional space

If point $M$ is inside the element, there exists a set of reduced coordinates $(r,s,t) \in [-1,1]^3$ such that

$$\sum_i N_i(r,s,t)\, x_i = x_M$$

$$\sum_i N_i(r,s,t)\, y_i = y_M$$

$$\sum_i N_i(r,s,t)\, z_i = z_M$$

where the sums run over the nodes of the element. This can be cast as a system of three equations and three unknowns. Unfortunately, each shape function $N_i$ contains a term $rst$ (as well as $rs$, $rt$, and $st$) so that it is not a linear system and standard techniques are not applicable. We must then use an iterative technique: the algorithm starts with a guess for the values of $r,s,t$ and improves on them iteration after iteration.

The classical way of solving nonlinear systems of equations is Newton's method. We can rewrite the equations above as $F(r,s,t) = 0$:

$$\sum_i N_i(r,s,t)\, x_i - x_M = 0$$

$$\sum_i N_i(r,s,t)\, y_i - y_M = 0$$

$$\sum_i N_i(r,s,t)\, z_i - z_M = 0 \qquad (203)$$

or,

$$F_r(r,s,t) = 0$$

$$F_s(r,s,t) = 0$$

$$F_t(r,s,t) = 0$$

so that we now have to find the zeroes of a continuously differentiable function $F: \mathbb{R}^3 \rightarrow \mathbb{R}^3$. The recursion is simply:





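$$
\begin{pmatrix} r_{k+1} \\ s_{k+1} \\ t_{k+1} \end{pmatrix}
=
\begin{pmatrix} r_k \\ s_k \\ t_k \end{pmatrix}
- J_F(r_k,s_k,t_k)^{-1}
\begin{pmatrix} F_r(r_k,s_k,t_k) \\ F_s(r_k,s_k,t_k) \\ F_t(r_k,s_k,t_k) \end{pmatrix}
$$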



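where $J_F$ is the Jacobian matrix:

$$
J_F(r_k,s_k,t_k) =
\begin{pmatrix}
\frac{\partial F_r}{\partial r} & \frac{\partial F_r}{\partial s} & \frac{\partial F_r}{\partial t} \\
\frac{\partial F_s}{\partial r} & \frac{\partial F_s}{\partial s} & \frac{\partial F_s}{\partial t} \\
\frac{\partial F_t}{\partial r} & \frac{\partial F_t}{\partial s} & \frac{\partial F_t}{\partial t}
\end{pmatrix}
=
\begin{pmatrix}
\sum_i \frac{\partial N_i}{\partial r} x_i & \sum_i \frac{\partial N_i}{\partial s} x_i & \sum_i \frac{\partial N_i}{\partial t} x_i \\
\sum_i \frac{\partial N_i}{\partial r} y_i & \sum_i \frac{\partial N_i}{\partial s} y_i & \sum_i \frac{\partial N_i}{\partial t} y_i \\
\sum_i \frac{\partial N_i}{\partial r} z_i & \sum_i \frac{\partial N_i}{\partial s} z_i & \sum_i \frac{\partial N_i}{\partial t} z_i
\end{pmatrix}
$$

with all partial derivatives evaluated at $(r_k,s_k,t_k)$.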







In practice, we solve the following system:

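$$
J_F(r_k,s_k,t_k)
\left[
\begin{pmatrix} r_{k+1} \\ s_{k+1} \\ t_{k+1} \end{pmatrix}
-
\begin{pmatrix} r_k \\ s_k \\ t_k \end{pmatrix}
\right]
= -
\begin{pmatrix} F_r(r_k,s_k,t_k) \\ F_s(r_k,s_k,t_k) \\ F_t(r_k,s_k,t_k) \end{pmatrix}
$$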



Finally, the algorithm goes as follows:

• set guess values for $r, s, t$ (typically 0)

• loop over $k = 0, \dots$

• compute rhs $= -F(r_k, s_k, t_k)$

• compute matrix $J_F(r_k, s_k, t_k)$

• solve system for $(dr_k, ds_k, dt_k)$

• update $r_{k+1} = r_k + dr_k$, $s_{k+1} = s_k + ds_k$, $t_{k+1} = t_k + dt_k$

• stop iterations when $(dr_k, ds_k, dt_k)$ is small

• if $(r_k, s_k, t_k) \in [-1,1]^3$ then $M$ is inside.

This method converges quickly but involves iterations, and multiple solves of $3 \times 3$ systems which, when carried out for each marker and at each time step, can prove to be expensive. A simple modification can be added to the above algorithm: iterations should be carried out only when the point $M$ is inside the cuboid $[\min_i x_i, \max_i x_i] \times [\min_i y_i, \max_i y_i] \times [\min_i z_i, \max_i z_i]$, where the minima and maxima are taken over the vertices of the element. In 2D this translates as follows: only carry out Newton iterations when $M$ is inside the red rectangle!

Note that the algorithm above extends to high degree elements such as $Q_2$ and higher, even with curved sides.
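The algorithm above, including the bounding box test, can be sketched for a trilinear hexahedron as follows. This is an illustration under stated assumptions: the element is taken to be an 8-node $Q_1$ hexahedron with the node ordering given by the array `RST`, and the names `N`, `dN`, `locate`, the tolerance and the test elements are all hypothetical.

```python
import numpy as np

# reference coordinates of the 8 corners of the Q1 hexahedron (assumed ordering)
RST = np.array([[-1, -1, -1], [1, -1, -1], [1, 1, -1], [-1, 1, -1],
                [-1, -1, 1], [1, -1, 1], [1, 1, 1], [-1, 1, 1]], dtype=np.float64)

def N(r, s, t):
    """Trilinear shape functions, array of length 8."""
    return 0.125 * (1 + RST[:, 0]*r) * (1 + RST[:, 1]*s) * (1 + RST[:, 2]*t)

def dN(r, s, t):
    """Derivatives of the shape functions, shape (3, 8): rows are d/dr, d/ds, d/dt."""
    a, b, c = 1 + RST[:, 0]*r, 1 + RST[:, 1]*s, 1 + RST[:, 2]*t
    return 0.125 * np.array([RST[:, 0]*b*c, a*RST[:, 1]*c, a*b*RST[:, 2]])

def locate(xyz, M, tol=1e-10, niter=20):
    """xyz: (8,3) node coordinates, M: (3,) point. Returns (inside, rst)."""
    # cheap rejection: skip Newton iterations if M is outside the bounding box
    if np.any(M < xyz.min(axis=0)) or np.any(M > xyz.max(axis=0)):
        return False, None
    rst = np.zeros(3)                      # initial guess r = s = t = 0
    for k in range(niter):
        F = N(*rst) @ xyz - M              # residual F(r_k, s_k, t_k)
        J = (dN(*rst) @ xyz).T             # J[a, b] = dF_a / d(r, s, t)_b
        d = np.linalg.solve(J, -F)         # (dr_k, ds_k, dt_k)
        rst += d
        if np.linalg.norm(d) < tol:
            break
    return bool(np.all(np.abs(rst) <= 1 + 1e-12)), rst

# a box [0,2]x[0,4]x[0,6] and a distorted version of it, for illustration
cube = 0.5 * (RST + 1) * np.array([2.0, 4.0, 6.0])
inside1, rst1 = locate(cube, np.array([0.5, 1.0, 4.5]))
inside2, rst2 = locate(cube, np.array([3.0, 1.0, 1.0]))
distorted = cube.copy()
distorted[6] += [0.3, 0.2, 0.1]
M3 = N(0.3, -0.2, 0.7) @ distorted
inside3, rst3 = locate(distorted, M3)
print(inside1, rst1, inside2, inside3, rst3)
```

For the undistorted box the mapping is linear, so Newton's method converges in one iteration; the distorted element needs a few more.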

6.23 Error measurements and convergence rates

What follows is written in the case of a two-dimensional model. Generalisation to 3D is trivial. What

follows is mostly borrowed from [129].

When measuring the order of accuracy of the primitive variables $v$ and $p$, it is standard to report errors in both the $L_1$ and the $L_2$ norm. For a scalar quantity $\Psi$, the $L_1$ and $L_2$ norms are computed as

$$||\Psi||_1 = \int_V |\Psi|\, dV \qquad ||\Psi||_2 = \sqrt{\int_V \Psi^2\, dV}$$

For a vector quantity $k = (k_x, k_y)$, the $L_1$ and $L_2$ norms are defined as:

$$||k||_1 = \int_V (|k_x| + |k_y|)\, dV \qquad ||k||_2 = \sqrt{\int_V (k_x^2 + k_y^2)\, dV}$$

To compute the respective norms the integrals in the above norms can be approximated by splitting them into their element-wise contributions. The element volume integral can then be easily computed by numerical integration using Gauss-Legendre quadrature.

The respective $L_1$ and $L_2$ norms for the pressure error can be evaluated via

$$||e_p||_1 = \sum_{i=1}^{n_e} \sum_{q=1}^{n_q} |e_p(r_q)|\, w_q |J_q| \qquad ||e_p||_2 = \sqrt{\sum_{i=1}^{n_e} \sum_{q=1}^{n_q} |e_p(r_q)|^2\, w_q |J_q|}$$

where

$$e_p(r_q) = p(r_q) - p_{th}(r_q)$$

is the pressure error evaluated at the $q$th quadrature point associated with the $i$th element. $n_e$ and $n_q$ refer to the number of elements and the number of quadrature points per element. $w_q$ and $J_q$ are the quadrature weight and the Jacobian associated with point $q$.

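The velocity error $e_v$ is evaluated using the following two norms

$$||e_v||_1 = \sum_{i=1}^{n_e} \sum_{q=1}^{n_q} \left[ |e_u(r_q)| + |e_v(r_q)| \right] w_q |J_q| \qquad ||e_v||_2 = \sqrt{\sum_{i=1}^{n_e} \sum_{q=1}^{n_q} \left[ |e_u(r_q)|^2 + |e_v(r_q)|^2 \right] w_q |J_q|}$$

where

$$e_u(r_q) = u(r_q) - u_{th}(r_q) \qquad e_v(r_q) = v(r_q) - v_{th}(r_q)$$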

We compute the different error norms for $e_p$ and $e_v$ for a set of numerical experiments with varying resolution $h$. We expect the error norms to follow the following relationships:

$$||e_v||_1 = C h^{r_v} \qquad ||e_v||_2 = C h^{r'_v}$$

$$||e_p||_1 = C h^{r_p} \qquad ||e_p||_2 = C h^{r'_p}$$

where $C$ is a resolution-independent constant and $r_p, r'_p$ and $r_v, r'_v$ are the convergence rates for pressure and velocity, respectively. Using linear regression on the logarithm of the respective error norm and the resolution, we compute the convergence rates of the numerical solutions.

• For $Q_1P_0$, the theoretical lower bound for $r'_v$ is 2 and for $r'_p$ it is 1

• For $Q_2P_{-1}$, the theoretical lower bound for $r'_v$ is 3 and for $r'_p$ it is 2

We note that when using discontinuous pressure space (e.g., P0,P−1), these bounds remain valid even

when the viscosity is discontinuous provided that the element boundaries conform to the discontinuity.
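The linear regression mentioned above can be sketched with `numpy.polyfit`. The error values below are synthetic (constructed to follow $C h^2$ and $C h^1$ exactly) and only serve to show the procedure; the function name `convergence_rate` is an assumption.

```python
import numpy as np

def convergence_rate(h, err):
    """Least-squares fit of log(err) = r*log(h) + log(C); returns (r, C)."""
    r, logC = np.polyfit(np.log(h), np.log(err), 1)
    return r, np.exp(logC)

# synthetic errors, not measurements: velocity ~ h^2, pressure ~ h^1
h = np.array([1/8, 1/16, 1/32, 1/64])
err_v = 0.7 * h**2
err_p = 2.5 * h
rate_v, C_v = convergence_rate(h, err_v)
rate_p, C_p = convergence_rate(h, err_p)
print(rate_v, C_v, rate_p, C_p)
```

With real measurements the coarsest resolutions are often outside the asymptotic regime, so one may want to restrict the fit to the finest meshes.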

6.24 The initial temperature ﬁeld

6.24.1 Single layer with imposed temperature b.c.

Let us take a single layer of material characterised by a heat capacity cp, a heat conductivity kand a

heat production term H.

The heat transport equation reads

$$\rho c_p \left( \frac{\partial T}{\partial t} + v \cdot \nabla T \right) = \nabla \cdot (k \nabla T) + \rho H$$

At steady state and in the absence of a velocity field, and assuming the material properties to be independent of time and space, and that there is no heat production ($H = 0$), this equation simplifies to

$$\Delta T = 0$$

Assuming the layer to be parallel to the x-axis, this yields

$$T(x,y) = T(y) = \alpha y + \beta$$

In order to specify the constants $\alpha$ and $\beta$, we need two constraints.

At the bottom of the layer $y = y_b$ a temperature $T_b$ is prescribed while a temperature $T_t$ is prescribed at the top with $y = y_t$. This ultimately yields a temperature field in the layer given by

$$T(y) = \frac{T_t - T_b}{y_t - y_b} (y - y_b) + T_b$$

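If now the heat production coefficient is not zero, the differential equation reads

$$k \Delta T + H = 0$$

The temperature field is then expected to be of the form

$$T(y) = -\frac{H}{2k} y^2 + \alpha y + \beta$$

Supplied again with the same boundary conditions, this leads to

$$\beta = T_b + \frac{H}{2k} y_b^2 - \alpha y_b$$

i.e.,

$$T(y) = -\frac{H}{2k} (y^2 - y_b^2) + \alpha (y - y_b) + T_b$$

and finally

$$\alpha = \frac{T_t - T_b}{y_t - y_b} + \frac{H}{2k} (y_b + y_t)$$

or,

$$T(y) = -\frac{H}{2k} (y^2 - y_b^2) + \left[ \frac{T_t - T_b}{y_t - y_b} + \frac{H}{2k} (y_b + y_t) \right] (y - y_b) + T_b$$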

Taking H= 0 in this equation obviously yields the temperature ﬁeld obtained previously. Taking

k= 2.25, Tt= 0C,Tb= 550C,yt= 660km,yb= 630km yields the following temperature proﬁles and

heat ﬂuxes when the heat production Hvaries:

[Figure: temperature (C) for H=0.0e-6, 0.8e-6, 1.6e-6 and heat flux (W/m2) for H=0.0e-6, 0.4e-6, 0.8e-6, 1.0e-6, 1.2e-6, 1.6e-6, plotted between 625000 and 660000 on the horizontal axis.]

Looking at the values at the top, which are estimated to be about 55-65 mW/m$^2$ [73, table 8.6], one sees that the value H=0.8e-6 yields a very acceptable heat flux. Looking at the bottom, the heat flux is then about 0.03 W/m$^2$, which is somewhat problematic since the heat flux at the Moho is reported to be somewhere between 10 and 20 mW/m$^2$ in [73, table 7.1].
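The numbers quoted above can be checked with a short script. It is a sketch which assumes the quadratic temperature profile of this section (with $\alpha = (T_t-T_b)/(y_t-y_b) + H(y_b+y_t)/(2k)$, as obtained by imposing $T(y_t)=T_t$) and defines the heat flux as $q = -k\, dT/dy$, positive upwards; the function names are hypothetical.

```python
def alpha_coeff(yb, yt, Tb, Tt, k, H):
    """Integration constant alpha fixed by T(yb)=Tb and T(yt)=Tt."""
    return (Tt - Tb) / (yt - yb) + H / (2 * k) * (yb + yt)

def temperature(y, yb, yt, Tb, Tt, k, H):
    a = alpha_coeff(yb, yt, Tb, Tt, k, H)
    return -H / (2 * k) * (y**2 - yb**2) + a * (y - yb) + Tb

def heat_flux(y, yb, yt, Tb, Tt, k, H):
    """q = -k dT/dy with dT/dy = -H*y/k + alpha."""
    a = alpha_coeff(yb, yt, Tb, Tt, k, H)
    return -k * (-H * y / k + a)

# parameter values quoted in the text
k, Tt, Tb, yt, yb, H = 2.25, 0.0, 550.0, 660e3, 630e3, 0.8e-6
q_top = heat_flux(yt, yb, yt, Tb, Tt, k, H)
q_bot = heat_flux(yb, yb, yt, Tb, Tt, k, H)
print("T(yb)=%.3f T(yt)=%.3f" % (temperature(yb, yb, yt, Tb, Tt, k, H),
                                 temperature(yt, yb, yt, Tb, Tt, k, H)))
print("q_top=%.5f W/m2  q_bot=%.5f W/m2" % (q_top, q_bot))
```

Under these assumptions the script gives a surface heat flux of about 53 mW/m$^2$ and a basal heat flux of about 29 mW/m$^2$, consistent with the values discussed above.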

6.25 Single layer with imposed heat ﬂux b.c.

Let us now assume that heat ﬂuxes are imposed at the top and bottom of the layer:

We start again from the ODE

k∆T+H= 0

but only integrate it once:

$$k \frac{dT}{dy} + Hy + \alpha = 0$$

At the bottom $q = k (dT/dy)|_{y=y_b} = q_b$ and at the top $q = k (dT/dy)|_{y=y_t} = q_t$ so that

to finish

6.26 Single layer with imposed heat ﬂux and temperature b.c.

to finish

6.26.1 Half cooling space

6.26.2 Plate model

6.26.3 McKenzie slab

6.27 Kinematic boundary conditions

6.27.1 In-out ﬂux boundary conditions for lithospheric models

The velocity on the side is given by

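$$u(y) = v_{ext} \qquad y < y_1$$

$$u(y) = \frac{v_{in} - v_{ext}}{y_2 - y_1} (y - y_1) + v_{ext} \qquad y_1 < y < y_2$$

$$u(y) = v_{in} \qquad y > y_2$$

The requirement for volume conservation is:

$$\Phi = \int_0^{L_y} u(y)\, dy = 0$$

Having chosen $v_{in}$ (the velocity of the plate), one can then compute $v_{ext}$ as a function of $y_1$ and $y_2$.

$$\Phi = \int_0^{y_1} u(y)\, dy + \int_{y_1}^{y_2} u(y)\, dy + \int_{y_2}^{L_y} u(y)\, dy$$

$$= v_{ext}\, y_1 + \frac{1}{2} (v_{in} + v_{ext})(y_2 - y_1) + (L_y - y_2)\, v_{in}$$

$$= v_{ext} \left[ y_1 + \frac{1}{2}(y_2 - y_1) \right] + v_{in} \left[ \frac{1}{2}(y_2 - y_1) + (L_y - y_2) \right]$$

$$= v_{ext}\, \frac{1}{2}(y_1 + y_2) + v_{in} \left[ L_y - \frac{1}{2}(y_1 + y_2) \right]$$

and finally

$$v_{ext} = -v_{in}\, \frac{L_y - \frac{1}{2}(y_1 + y_2)}{\frac{1}{2}(y_1 + y_2)}$$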
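The expression for $v_{ext}$ can be verified numerically by integrating the velocity profile over the side of the domain. The following sketch assumes the side extends from $y=0$ to $y=L_y$; the function names and the parameter values are illustrative only.

```python
def v_ext(vin, y1, y2, Ly):
    """Outflow velocity which balances the inflow vin (zero net flux)."""
    ymid = 0.5 * (y1 + y2)
    return -vin * (Ly - ymid) / ymid

def u_side(y, vin, vext, y1, y2):
    """Piecewise-linear velocity profile prescribed on the side."""
    if y < y1:
        return vext
    if y > y2:
        return vin
    return (vin - vext) / (y2 - y1) * (y - y1) + vext

# illustrative values only
Ly, y1, y2, vin = 100.0, 30.0, 50.0, 1.0
vext = v_ext(vin, y1, y2, Ly)

# midpoint rule, exact here since the profile is piecewise linear
n = 1000
dy = Ly / n
flux = sum(u_side((i + 0.5) * dy, vin, vext, y1, y2) for i in range(n)) * dy
print(vext, flux)
```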

7 fieldstone 01: simple analytical solution

This ﬁeldstone was developed in collaboration with Job Mos.

From [42]. In order to illustrate the behavior of selected mixed finite elements in the solution of stationary Stokes flow, we consider a two-dimensional problem in the square domain $\Omega = [0,1] \times [0,1]$, which possesses a closed-form analytical solution. The problem consists of determining the velocity field $v = (u, v)$ and the pressure $p$ such that

$$-\nu \Delta v + \nabla p = b \quad \text{in } \Omega$$

$$\nabla \cdot v = 0 \quad \text{in } \Omega$$

$$v = 0 \quad \text{on } \Gamma$$

where the fluid viscosity is taken as $\nu = 1$.

features

•Q1×P0element

•incompressible ﬂow

•penalty formulation

•Dirichlet boundary conditions (no-slip)

•direct solver

•isothermal

•isoviscous

•analytical solution

[Figure: velocity and pressure errors (log-log plot).]

Quadratic convergence for velocity error, linear convergence for pressure error, as expected.

8 fieldstone 02: Stokes sphere

Viscosity and density directly computed at the quadrature points.

features

•Q1×P0element

•incompressible ﬂow

•penalty formulation

•Dirichlet boundary conditions (free-slip)

•isothermal

•non-isoviscous

•buoyancy-driven ﬂow

•Stokes sphere

9 fieldstone 03: Convection in a 2D box

This benchmark deals with the 2-D thermal convection of a fluid of infinite Prandtl number in a rectangular closed cell. In what follows, I carry out the case 1a, 1b, and 1c experiments as shown in [15]: steady convection with constant viscosity in a square box.

The temperature is fixed to zero on top and to $\Delta T$ at the bottom, with reflecting symmetry at the sidewalls (i.e. $\partial_x T = 0$) and there are no internal heat sources. Free-slip conditions are implemented on all boundaries.

The Rayleigh number is given by

$$Ra = \frac{\alpha g_y \Delta T h^3}{\kappa \nu} = \frac{\alpha g_y \Delta T h^3 \rho^2 c_p}{k \mu} \qquad (204)$$

In what follows, I use the following parameter values: $L_x = L_y = 1$, $\rho_0 = c_P = k = \mu = 1$, $T_0 = 0$, $\alpha = 10^{-2}$, $g = 10^2 Ra$ and I run the model with $Ra = 10^4, 10^5$ and $10^6$.

The initial temperature ﬁeld is given by

T(x, y) = (1 −y)−0.01 cos(πx) sin(πy) (205)

The perturbation in the initial temperature ﬁelds leads to a perturbation of the density ﬁeld and sets the

ﬂuid in motion.

Depending on the initial Rayleigh number, the system ultimately reaches a steady state after some

time.

The Nusselt number (i.e. the mean surface temperature gradient over mean bottom temperature) is computed as follows [15]:

$$Nu = L_y \frac{\int \frac{\partial T}{\partial y}(y = L_y)\, dx}{\int T(y = 0)\, dx} \qquad (206)$$

Note that in our case the denominator is equal to 1 since $L_x = 1$ and the temperature at the bottom is prescribed to be 1.

Finally, the steady state root mean square velocity and Nusselt number measurements are indicated in

Table ?? alongside those of [15] and [125]. (Note that this benchmark was also carried out and published

in other publications [136, 3, 57, 37, 83] but since they did not provide a complete set of measurement

values, they are not included in the table.)

                   Blankenbach et al         Tackley [125]
Ra = 10^4   Vrms   42.864947 ± 0.000020      42.775
            Nu     4.884409 ± 0.000010       4.878
Ra = 10^5   Vrms   193.21454 ± 0.00010       193.11
            Nu     10.534095 ± 0.000010      10.531
Ra = 10^6   Vrms   833.98977 ± 0.00020       833.55
            Nu     21.972465 ± 0.000020      21.998

Steady state Nusselt number Nu and Vrms measurements as reported in the literature.

features

•Q1×P0element

•incompressible ﬂow

•penalty formulation

•Dirichlet boundary conditions (free-slip)

•Boussinesq approximation

•direct solver

•non-isothermal

•buoyancy-driven ﬂow

•isoviscous

•CFL-condition

ToDo:

implement steady state criterion

reach steady state

do Ra=1e4, 1e5, 1e6

plot against blankenbach paper and aspect

look at critical Ra number

10 fieldstone 04: The lid driven cavity

The lid driven cavity is a famous Computational Fluid Dynamics test case and has been studied in

countless publications with a wealth of numerical techniques (see [50] for a succinct review) and also in

the laboratory [78].

It models a plane ﬂow of an isothermal isoviscous ﬂuid in a rectangular (usually square) lid-driven

cavity. The boundary conditions are indicated in the Fig. ??a. The gravity is set to zero.

10.1 the lid driven cavity problem (ldc=0)

In the standard case, the upper side of the cavity moves in its own plane at unit speed, while the other sides are fixed. This introduces a discontinuity in the boundary conditions at the two upper corners of the cavity and yields an uncertainty as to which boundary (side or top) the corner points belong to. In this version of the code the top corner nodes are considered to be part of the lid. If these are excluded the recovered pressure showcases an extremely large checkerboard pattern.

This benchmark is usually discussed in the context of low to very high Reynolds number with the full Navier-Stokes equations being solved (with the notable exception of [113, 114, 30, 48] which focus on the Stokes equation). In the case of the incompressible Stokes flow, the absence of inertia renders this problem instantaneous so that only one time step is needed.

10.2 the lid driven cavity problem - regularisation I (ldc=1)

We avoid the top corner nodes issue altogether by prescribing the horizontal velocity of the lid as follows:

$$u(x) = x^2 (1 - x)^2. \qquad (207)$$

In this case the velocity and its first derivative are continuous at the corners. This is the so-called regularised lid-driven cavity problem [102].

10.3 the lid driven cavity problem - regularisation II (ldc=2)

Another regularisation was presented in [39]. Here, a regularized lid driven cavity is studied which is

consistent in the sense that ∇·v= 0 holds also at the corners of the domain. There are no-slip conditions

at the boundaries x= 0, x= 1, and y= 0.

The velocity at y= 1 is given by

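$$u(x) = 1 - \frac{1}{4} \left[ 1 - \cos\left( \frac{x_1 - x}{x_1} \pi \right) \right]^2 \qquad x \in [0, x_1]$$

$$u(x) = 1 \qquad x \in [x_1, 1 - x_1]$$

$$u(x) = 1 - \frac{1}{4} \left[ 1 - \cos\left( \frac{x - (1 - x_1)}{x_1} \pi \right) \right]^2 \qquad x \in [1 - x_1, 1] \qquad (208)$$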

Results are obtained with x1= 0.1.
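The three lid velocity profiles selected by the ldc parameter can be sketched as a single function. This is an illustration and not the code of the fieldstone itself: the function name `lid_velocity` is hypothetical, and for ldc=2 it is assumed that the argument of the cosine is scaled by $x_1$, so that the velocity vanishes at both corners and reaches 1 at $x = x_1$ and $x = 1 - x_1$.

```python
import math

def lid_velocity(x, ldc, x1=0.1):
    """Horizontal velocity prescribed on the lid y=1 (hypothetical helper)."""
    if ldc == 0:   # standard case: unit velocity everywhere on the lid
        return 1.0
    if ldc == 1:   # regularisation I
        return x**2 * (1 - x)**2
    if ldc == 2:   # regularisation II, assuming the cosine argument is scaled by x1
        if x < x1:
            return 1 - 0.25 * (1 - math.cos((x1 - x) / x1 * math.pi))**2
        if x > 1 - x1:
            return 1 - 0.25 * (1 - math.cos((x - (1 - x1)) / x1 * math.pi))**2
        return 1.0
    raise ValueError("ldc must be 0, 1 or 2")

for ldc in (0, 1, 2):
    print(ldc, [round(lid_velocity(x, ldc), 4) for x in (0.0, 0.05, 0.5, 0.95, 1.0)])
```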

features

•Q1×P0element

•incompressible ﬂow

•penalty formulation

•isothermal

•isoviscous

A 100x100 element grid is used. No-slip boundary conditions are prescribed on sides and bottom. A

zero vertical velocity is prescribed at the top and the exact form of the prescribed horizontal velocity is

controlled by the ldc parameter.

[Figure: pressure profiles (horizontal axis from 0 to 1) for ldc0, ldc1 and ldc2.]

11 fieldstone 05: SolCx benchmark

The SolCx benchmark is intended to test the accuracy of the solution to a problem that has a large jump

in the viscosity along a line through the domain. Such situations are common in geophysics: for example,

the viscosity in a cold, subducting slab is much larger than in the surrounding, relatively hot mantle

material.

The SolCx benchmark computes the Stokes flow field of a fluid driven by spatial density variations, subject to a spatially variable viscosity. Specifically, the domain is $\Omega = [0,1]^2$, gravity is $g = (0,-1)^T$ and the density is given by

$$\rho(x, y) = \sin(\pi y) \cos(\pi x) \qquad (209)$$

Boundary conditions are free slip on all of the sides of the domain and the temperature plays no role in this benchmark. The viscosity is prescribed as follows:

$$\mu(x, y) = \begin{cases} 1 & \text{for } x < 0.5 \\ 10^6 & \text{for } x > 0.5 \end{cases} \qquad (210)$$

Note the strongly discontinuous viscosity ﬁeld yields a stagnant ﬂow in the right half of the domain and

thereby yields a pressure discontinuity along the interface.

The SolCx benchmark was previously used in [45] (references to earlier uses of the benchmark are

available there) and its analytic solution is given in [146]. It has been carried out in [79] and [58].

Note that the source code which evaluates the velocity and pressure ﬁelds for both SolCx and SolKz is

distributed as part of the open source package Underworld ([97], http://underworldproject.org).

In this particular example, the viscosity is computed analytically at the quadrature points (i.e. tracers are not used to attribute a viscosity to the element). If the number of elements is even in any direction, all elements (and their associated quadrature points) have a constant viscosity (1 or $10^6$). If it is odd, then the elements situated at the viscosity jump have half their integration points with $\mu = 1$ and half with $\mu = 10^6$ (which is a pathological case since the quadrature rule used inside elements cannot accurately represent such a jump).
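The even/odd effect described above can be illustrated by listing the columns of elements whose quadrature points see both viscosities. The sketch below assumes a regular grid on the unit square and 2 Gauss-Legendre points per direction; the function names are hypothetical.

```python
import numpy as np

def viscosity(x, y):
    """SolCx viscosity evaluated analytically at a point."""
    return 1.0 if x < 0.5 else 1.0e6

def mixed_columns(nelx):
    """Indices of the element columns containing both viscosities."""
    hx = 1.0 / nelx
    rq = np.array([-1.0, 1.0]) / np.sqrt(3.0)    # 2-point Gauss-Legendre abscissae
    mixed = []
    for i in range(nelx):
        xq = (i + 0.5) * hx + 0.5 * hx * rq      # x-coordinates of the quadrature points
        mu = [viscosity(x, 0.0) for x in xq]
        if min(mu) != max(mu):
            mixed.append(i)
    return mixed

print(mixed_columns(4))  # even number of elements: no element straddles x=0.5
print(mixed_columns(5))  # odd number: the middle column straddles the jump
```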

features

•Q1×P0element

•incompressible ﬂow

•penalty formulation

•Dirichlet boundary conditions (free-slip)

•direct solver

•isothermal

•non-isoviscous

•analytical solution

What we learn from this


12 fieldstone 06: SolKz benchmark

The SolKz benchmark [109] is similar to the SolCx benchmark, but the viscosity is now a function of the space coordinates:

$$\mu(y) = \exp(By) \quad \text{with } B = 13.8155 \qquad (211)$$

It is however not a discontinuous function but grows exponentially with the vertical coordinate so that its overall variation is again $10^6$. The forcing is again chosen by imposing a spatially variable density variation as follows:

$$\rho(x, y) = \sin(2y) \cos(3\pi x) \qquad (212)$$

Free slip boundary conditions are imposed on all sides of the domain. This benchmark is presented in

[146] as well and is studied in [45] and [58].

[Figure: velocity and pressure errors (log-log plot).]

13 fieldstone 07: SolVi benchmark

Following SolCx and SolKz, the SolVi inclusion benchmark solves a problem with a discontinuous viscosity field, but in this case the viscosity field is chosen in such a way that the discontinuity is along a circle. Given the regular nature of the grid used by a majority of codes and the present one, this ensures that the discontinuity in the viscosity never aligns to cell boundaries. This in turn leads to almost discontinuous pressures along the interface which are difficult to represent accurately. [118] derived a simple analytic solution for the pressure and velocity fields for a circular inclusion under simple shear and it was used in [40], [124], [45], [79] and [58].

Because of the symmetry of the problem, we only have to solve over the top right quarter of the

domain (see Fig. ??a).

The analytical solution requires a strain rate boundary condition (e.g., pure shear) to be applied far

away from the inclusion. In order to avoid using very large domains and/or dealing with this type of

boundary condition altogether, the analytical solution is evaluated and imposed on the boundaries of

the domain. By doing so, the truncation error introduced while discretizing the strain rate boundary

condition is removed.

A characteristic of the analytic solution is that the pressure is zero inside the inclusion, while outside it follows the relation

$$p_m = 4 \dot{\epsilon}\, \frac{\mu_m (\mu_i - \mu_m)}{\mu_i + \mu_m}\, \frac{r_i^2}{r^2} \cos(2\theta) \qquad (213)$$

where $\mu_i = 10^3$ is the viscosity of the inclusion and $\mu_m = 1$ is the viscosity of the background media, $r_i$ is the radius of the inclusion, $\theta = \tan^{-1}(y/x)$, and $\dot{\epsilon} = 1$ is the applied strain rate.

[40] thoroughly investigated this problem with various numerical methods (FEM, FDM), with and

without tracers, and conclusively showed how various averagings lead to diﬀerent results. [45] obtained a

ﬁrst order convergence for both pressure and velocity, while [79] and [58] showed that the use of adaptive

mesh reﬁnement in respectively the FEM and FDM yields convergence rates which depend on reﬁnement

strategies.

[Figure: velocity and pressure errors (log-log plot).]

14 fieldstone 08: the indentor benchmark

The punch benchmark is one of the few boundary value problems involving plastic solids for which there

exists an exact solution. Such solutions are usually either for highly simpliﬁed geometries (spherical or

axial symmetry, for instance) or simpliﬁed material models (such as rigid plastic solids) [75].

In this experiment, a rigid punch indents a rigid plastic half space; the slip line ﬁeld theory gives exact

solutions as shown in Fig. ??a. The plane strain formulation of the equations and the detailed solution

to the problem were derived in the Appendix of [132] and are also presented in [56].

The two dimensional punch problem has been extensively studied numerically for the past 40 years

[153, 152, 33, 32, 68, 145, 22, 107] and has been used to draw a parallel with the tectonics of eastern

China in the context of the India-Eurasia collision [127, 95]. It is also worth noting that it has been

carried out in one form or another in series of analogue modelling articles concerning the same region,

with a rigid indenter colliding with a rheologically stratiﬁed lithosphere [101, 38, 74].

Numerically, the one-time step punch experiment is performed on a two-dimensional domain of purely

plastic von Mises material. Given that the von Mises rheology yield criterion does not depend on pressure,

the density of the material and/or the gravity vector is set to zero. Sides are set to free slip boundary

conditions, the bottom to no slip, while a vertical velocity (0,−vp) is prescribed at the top boundary for

nodes whose xcoordinate is within [Lx/2−δ/2, Lx/2 + δ/2].

The following parameters are used: $L_x = 1$, $L_y = 0.5$, $\mu_{min} = 10^{-3}$, $\mu_{max} = 10^3$, $v_p = 1$, $\delta = 0.123456789$ and the yield value of the material is set to $k = 1$.

The analytical solution predicts that the angle of the shear bands stemming from the sides of the punch is $\pi/4$, that the pressure right under the punch is $1 + \pi$, and that the velocity of the rigid blocks on each side of the punch is $v_p/\sqrt{2}$ (this is simply explained by invoking conservation of mass).


ToDo: smooth punch

features

•Q1×P0element

•incompressible ﬂow

•penalty formulation

•Dirichlet boundary conditions (no-slip)

•isothermal

•non-isoviscous

•nonlinear rheology


15 fieldstone 09: the annulus benchmark

This ﬁeldstone was developed in collaboration with Prof. E.G.P. Puckett.

This benchmark is based on Thieulot & Puckett [Subm.] in which an analytical solution to the

isoviscous incompressible Stokes equations is derived in an annulus geometry. The velocity and pressure

ﬁelds are as follows:

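$$v_r(r, \theta) = g(r)\, k \sin(k\theta), \qquad (214)$$

$$v_\theta(r, \theta) = f(r) \cos(k\theta), \qquad (215)$$

$$p(r, \theta) = k\, h(r) \sin(k\theta), \qquad (216)$$

$$\rho(r, \theta) = \aleph(r)\, k \sin(k\theta), \qquad (217)$$

with

$$f(r) = Ar + B/r, \qquad (218)$$

$$g(r) = \frac{A}{2} r + \frac{B}{r} \ln r + \frac{C}{r}, \qquad (219)$$

$$h(r) = \frac{2 g(r) - f(r)}{r}, \qquad (220)$$

$$\aleph(r) = g'' - \frac{g'}{r} - \frac{g}{r^2} (k^2 - 1) + \frac{f}{r^2} + \frac{f'}{r}, \qquad (221)$$

$$A = -C\, \frac{2 (\ln R_1 - \ln R_2)}{R_2^2 \ln R_1 - R_1^2 \ln R_2}, \qquad (222)$$

$$B = -C\, \frac{R_2^2 - R_1^2}{R_2^2 \ln R_1 - R_1^2 \ln R_2}. \qquad (223)$$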

The parameters $A$ and $B$ are chosen so that $v_r(R_1) = v_r(R_2) = 0$, i.e. the velocity is tangential to both inner and outer surfaces. The gravity vector is radial and of unit length. In the present case, we set $R_1 = 1$, $R_2 = 2$ and $C = -1$.
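The statement that $A$ and $B$ make the radial velocity vanish on both boundaries can be checked numerically. The sketch below evaluates the coefficients with $A = -2C(\ln R_1 - \ln R_2)/(R_2^2 \ln R_1 - R_1^2 \ln R_2)$ and $B = -C(R_2^2 - R_1^2)/(R_2^2 \ln R_1 - R_1^2 \ln R_2)$, together with the functions $f$, $g$ and $h$ of this section; the function name `velocity` and the choice $k=4$ are assumptions made for the example.

```python
import numpy as np

R1, R2, C = 1.0, 2.0, -1.0
denom = R2**2 * np.log(R1) - R1**2 * np.log(R2)
A = -C * 2 * (np.log(R1) - np.log(R2)) / denom
B = -C * (R2**2 - R1**2) / denom

def f(r):
    return A * r + B / r

def g(r):
    return 0.5 * A * r + B / r * np.log(r) + C / r

def h(r):
    return (2 * g(r) - f(r)) / r

def velocity(r, theta, k):
    """Return (v_r, v_theta) of the analytical solution."""
    return g(r) * k * np.sin(k * theta), f(r) * np.cos(k * theta)

k = 4  # illustrative value of the angular wavenumber
vr_inner, _ = velocity(R1, 0.3, k)
vr_outer, _ = velocity(R2, 0.3, k)
print(A, B, vr_inner, vr_outer)
```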

features

•Q1×P0element

•incompressible ﬂow

•penalty formulation

•Dirichlet boundary conditions

•direct solver

•isothermal

•isoviscous

•analytical solution

•annulus geometry

•elemental boundary conditions

[Figure: velocity and pressure errors (log-log plot).]

16 fieldstone 10: Stokes sphere (3D) - penalty

features

•Q1×P0element

•incompressible ﬂow

•penalty formulation

•Dirichlet boundary conditions (free-slip)

•direct solver

•isothermal

•non-isoviscous

•3D

•elemental b.c.

•buoyancy driven

resolution is 24x24x24


17 fieldstone 11: stokes sphere (3D) - mixed formulation

This is the same setup as Section 16.

features

•Q1×P0element

•incompressible ﬂow

•mixed formulation

•Dirichlet boundary conditions (free-slip)

•direct solver

•isothermal

•non-isoviscous

•3D

•elemental b.c.

•buoyancy driven


18 fieldstone 12: consistent pressure recovery

What follows is presented in [151]. The second part of their paper wishes to establish a simple and

eﬀective numerical method to calculate variables eliminated by the penalisation process. The method

involves an additional ﬁnite element solution for the nodal pressures using the same ﬁnite element basis

and numerical quadrature as used for the velocity.

Let us start with:

$$p = -\lambda \nabla \cdot v$$

which leads to

$$(q, p) = -\lambda (q, \nabla \cdot v)$$

and then

$$\int N N\, d\Omega \cdot P = -\lambda \int N \nabla N\, d\Omega \cdot V$$

or,

$$M \cdot P = -D \cdot V$$

and finally

$$P = -M^{-1} \cdot D \cdot V$$

with $M$ of size $(np \times np)$, $D$ of size $(np \times np*ndof)$ and $V$ of size $(np*ndof)$. The vector $P$ contains the $np$ nodal pressure values directly, with no need for a smoothing scheme. The mass matrix $M$ is to be evaluated at the full integration points, while the constraint part (the right hand side of the equation) is to be evaluated at the reduced integration point.

As noted by [151], it is interesting to note that when linear elements are used and the lumped matrices

are used for the Mthe resulting algebraic equation is identical to the smoothing scheme based on the

averaging method only if the uniform square ﬁnite element mesh is used. In this respect this method is

expected to yield diﬀerent results when elements are not square or even rectangular.

——-

$q_1$ is the smoothed pressure obtained with the center-to-node approach.

$q_2$ is the recovered pressure obtained with [151].

All three fulfill the zero average condition: $\int p\, d\Omega = 0$.

[Figure: errors for velocity, pressure (el), pressure (q1) and pressure (q2) (log-log plot), with an x^1.5 reference line.]

In terms of pressure error, $q_2$ is better than $q_1$ which is better than elemental.

QUESTION: why are the averages exactly zero ?!

TODO:

•add randomness to internal node positions.

•look at elefant algorithms


19 fieldstone 13: the Particle in Cell technique (1) - the effect of averaging

This ﬁeldstone is being developed in collaboration with BSc student Eric Hoogen.

features

•Q1×P0element

•incompressible ﬂow

•penalty formulation

•Dirichlet boundary conditions (no-slip)

•isothermal

•non-isoviscous

•particle-in-cell

After the initial setup of the grid, markers can be generated and placed in the domain. One could simply generate the marker positions randomly in the whole domain, but unless a very large number of markers is used, there is a chance that some element does not contain any marker, which will prove problematic. In order to get better control over the spatial distribution of the markers, one usually generates the markers per element, so that the total number of markers in the domain is the product of the number of elements and the user-chosen initial number of markers per element.

Our next concern is how to actually place the markers inside an element. Two methods come to mind: on a regular grid, or in a random manner, as shown on the following figure:

In both cases we make use of the basis functions: we first generate the positions of the markers (random or regular) in the reference element, $(r_{im}, s_{im})$, and then map those out to the real element as follows:

\[ x_{im} = \sum_i N_i(r_{im}, s_{im}) \, x_i \qquad\qquad y_{im} = \sum_i N_i(r_{im}, s_{im}) \, y_i \tag{224} \]

where $x_i, y_i$ are the coordinates of the vertices of the element.
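A minimal sketch of this mapping is given below. It assumes bilinear Q1 basis functions on the reference square [-1,1]×[-1,1] with vertices ordered counter-clockwise from (-1,-1); the function names and the example element are made up for illustration and are not taken from the fieldstone code.

```python
import random


def q1_basis(r, s):
    """Bilinear basis functions, vertices ordered counter-clockwise from (-1,-1)."""
    return [0.25 * (1 - r) * (1 - s),
            0.25 * (1 + r) * (1 - s),
            0.25 * (1 + r) * (1 + s),
            0.25 * (1 - r) * (1 + s)]


def reference_positions(n_side, mode="regular", rng=None):
    """Return n_side**2 marker positions (r, s) in the reference element."""
    rng = rng or random.Random(0)
    points = []
    for j in range(n_side):
        for i in range(n_side):
            if mode == "regular":
                # centres of an n_side x n_side sub-grid of the reference element
                r = -1 + (2 * i + 1) / n_side
                s = -1 + (2 * j + 1) / n_side
            else:
                r = rng.uniform(-1, 1)
                s = rng.uniform(-1, 1)
            points.append((r, s))
    return points


def map_to_element(points, xv, yv):
    """Apply x = sum_i N_i(r,s) x_i and y = sum_i N_i(r,s) y_i to every marker."""
    mapped = []
    for r, s in points:
        N = q1_basis(r, s)
        mapped.append((sum(Ni * xi for Ni, xi in zip(N, xv)),
                       sum(Ni * yi for Ni, yi in zip(N, yv))))
    return mapped


# a sheared quadrilateral element (hypothetical coordinates)
xv = [0.0, 2.0, 2.5, 0.5]
yv = [0.0, 0.0, 1.0, 1.0]
markers = map_to_element(reference_positions(4), xv, yv)
random_markers = map_to_element(reference_positions(4, mode="random"), xv, yv)
```

Because the positions are generated in the reference element, the same routine works for any (non-degenerate) quadrilateral, not only for rectangles.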

A third option consists in the use of the so-called Poisson-disc sampling which produces points that

are tightly-packed, but no closer to each other than a speciﬁed minimum distance, resulting in a more

natural pattern 11. Note that the Poisson-disc algorithm ﬁlls the whole domain at once, not element after

element.

say smthg about avrg dist

insert here theory and link about Poisson disc

11https://en.wikipedia.org/wiki/Supersampling


Left: regular distribution, middle: random, right: Poisson disc.

16384 markers (32x32 grid, 16 markers per element).

When using active markers, one is faced with the problem of transferring the properties they carry to the mesh on which the PDEs are to be solved. As we have seen, building the FE matrix involves a loop over all elements, so one simple approach consists of assigning each element a single property computed as the average of the values carried by the markers in that element. In colloquial language "average" often refers to the arithmetic mean:

\[ \langle \phi \rangle_{am} = \frac{1}{n} \sum_{i=1}^{n} \phi_i \tag{225} \]

where $\langle \phi \rangle_{am}$ is the arithmetic average of the $n$ numbers $\phi_i$. However, in mathematics other means are commonly used, such as the geometric mean:

\[ \langle \phi \rangle_{gm} = \left( \prod_{i=1}^{n} \phi_i \right)^{1/n} \tag{226} \]

and the harmonic mean:

\[ \langle \phi \rangle_{hm} = \left( \frac{1}{n} \sum_{i=1}^{n} \frac{1}{\phi_i} \right)^{-1} \tag{227} \]

Furthermore, there is a well known inequality for any set of positive numbers,

\[ \langle \phi \rangle_{am} \geq \langle \phi \rangle_{gm} \geq \langle \phi \rangle_{hm} \tag{228} \]

which will prove to be important later on.
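As a quick illustration of Eqs. (225) to (228), the following sketch computes the three means for a hypothetical element containing 12 markers of viscosity 1 and 4 markers of viscosity 1000 (these counts are made up for the example).

```python
import math


def arithmetic_mean(values):
    return sum(values) / len(values)


def geometric_mean(values):
    # computed through logarithms to avoid overflow of the product
    return math.exp(sum(math.log(v) for v in values) / len(values))


def harmonic_mean(values):
    return len(values) / sum(1.0 / v for v in values)


viscosities = [1.0] * 12 + [1000.0] * 4
am = arithmetic_mean(viscosities)
gm = geometric_mean(viscosities)
hm = harmonic_mean(viscosities)
print(am, gm, hm)
```

The arithmetic mean is dominated by the strong material and the harmonic mean by the weak one, which is why the choice of averaging matters in elements cut by a material interface.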

Let us now turn to a simple concrete example: the 2D Stokes sphere. There are two materials in the domain, so that markers carry the label "mat=1" or "mat=2". For each element an average density and viscosity need to be computed. The majority of elements contain markers with a single material label, so that the choice of averaging does not matter (it is trivial to verify that if $\phi_i = \phi_0$ then $\langle \phi \rangle_{am} = \langle \phi \rangle_{gm} = \langle \phi \rangle_{hm} = \phi_0$). What remains are the elements crossed by the interface between the two materials: they contain markers of both materials, and the average density and viscosity inside those depend on 1) the total number of markers inside the element, 2) the ratio of markers 1 to markers 2, 3) the type of averaging.

This averaging problem has been studied and documented in the literature [117, 40, 129, 104].

Nodal projection. Left: all markers inside the elements to which the green node belongs are taken into account. Right: only the markers closest to the green node count.


Let $k$ be the green node of the figures above. Let $(r, s)$ denote the coordinates of a marker inside its element. For clarity, we define the following three nodal averaging schemes:

• nodal type A:

\[ f_k = \frac{\text{sum of values carried by markers in 4 neighbour elements}}{\text{number of markers in 4 neighbour elements}} \]

• nodal type B:

\[ f_k = \frac{\text{sum of values carried by markers inside dashed line}}{\text{number of markers in area delimited by the dashed line}} \]

• nodal type C:

\[ f_k = \frac{\text{sum of (value carried by marker} \ast N_k(r, s)\text{) over markers in 4 neighbour elements}}{\text{sum of } N_k(r, s) \text{ over the same markers}} \]

where $N_k$ is the $Q_1$ basis function corresponding to node $k$, defined on each element and evaluated at the marker position. Since these functions are 1 on node $k$ and then linearly decrease and become zero on the neighbouring nodes, this effectively gives more weight to those markers closest to node $k$.

This strategy is adopted in [2, 93] (although it is used to interpolate onto the nodes of $Q_2P_{-1}$ elements). It is formulated as follows:

"We assume that an arbitrary material point property $f$, is discretized via $f(x) \simeq \delta(x - x_p) f_p$. We then utilize an approximate local $L_2$ projection of $f_p$ onto a continuous $Q_1$ finite element space. The corner vertices of each $Q_2$ finite element define the mesh $f_p$ is projected onto. The local reconstruction for a node $i$ is defined by

\[ f_i = \frac{\int_{\Omega_i} N_i(x) f(x)}{\int_{\Omega_i} N_i(x)} \simeq \frac{\sum_p N_i(x_p) f_p}{\sum_p N_i(x_p)} \]

where the summation over $p$ includes all material points contained within the support $\Omega_i$ of the trilinear interpolant $N_i$".
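The difference between nodal types A and C can be sketched as follows. The example assumes a uniform grid of spacing h, for which the Q1 basis function of a node over its four neighbour elements is the product of two 1D hat functions; the function names and marker values are hypothetical.

```python
def hat(x, y, xk, yk, h):
    """Q1 basis function of the node located at (xk, yk) on a uniform grid of spacing h."""
    wx = max(0.0, 1.0 - abs(x - xk) / h)
    wy = max(0.0, 1.0 - abs(y - yk) / h)
    return wx * wy


def nodal_average_A(markers, xk, yk, h):
    """Plain mean over all markers located in the 4 elements around the node."""
    values = [f for x, y, f in markers if abs(x - xk) <= h and abs(y - yk) <= h]
    return sum(values) / len(values)


def nodal_average_C(markers, xk, yk, h):
    """Mean weighted by the basis function of the node evaluated at each marker."""
    num = 0.0
    den = 0.0
    for x, y, f in markers:
        w = hat(x, y, xk, yk, h)
        num += w * f
        den += w
    return num / den


# two markers (x, y, value): a weak one close to the node, a strong one far from it
markers = [(0.1, 0.1, 1.0), (0.9, 0.9, 1000.0)]
fA = nodal_average_A(markers, 0.0, 0.0, 1.0)
fC = nodal_average_C(markers, 0.0, 0.0, 1.0)
print(fA, fC)
```

Type A treats both markers equally, whereas type C is dominated by the marker closest to the node. Note that the weighted version needs at least one marker with a non-zero weight around every node.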

The setup is identical to the Stokes sphere experiment. The bash script script runall runs the code for many resolutions, the different initial marker distributions and all four averaging types. The viscosity of the sphere has been set to $10^3$ while the viscosity of the surrounding fluid is 1. The average density is always computed with an arithmetic mean.

Conclusions:

• With increasing resolution (h → 0) vrms values seem to converge towards a single value, irrespective of the number of markers.

• At low resolution, say 32x32 (i.e. h=0.03125), vrms values for the three averagings differ by about 10%. At higher resolution, say 128x128, vrms values are still not converged.

• The number of markers per element plays a role at low resolution, but less and less with increasing resolution.

• Results for random and regular marker distributions are not identical but follow a similar trend and seem to converge to the same value.

• Elemental values yield better results (especially at low resolutions).

• The harmonic mean yields overall the best results.

Root mean square velocity results are shown hereunder:

[Figure: twelve panels showing vrms for the three marker distributions (random, random PD, regular) and the four projections (elemental, nodal A, nodal B, nodal C). Each panel shows arithmetic (am), geometric (gm) and harmonic (hm) averaging for 9, 16, 25, 36, 48 and 64 markers per element.]

Left column: random markers, middle column: Poisson disc, right column: regular markers. First row: elemental projection, second row: nodal A projection, third row: nodal B projection, fourth row: nodal C projection.

[Figure: three panels showing vrms with 64 markers per element, comparing regular (reg), random (rand) and Poisson disc (pd) markers for the elemental (el) and nodal (nodA, nodB, nodC) projections.]

Left to right: arithmetic, geometric, harmonic averaging for viscosity.

20 fieldstone f14: solving the full saddle point problem

The details of the numerical setup are presented in Section ??.

The main difference is that we no longer use the penalty formulation and therefore keep both velocity and pressure as unknowns. We end up having to solve the following system:

\[ \begin{pmatrix} K & G \\ G^T & 0 \end{pmatrix} \cdot \begin{pmatrix} V \\ P \end{pmatrix} = \begin{pmatrix} f \\ h \end{pmatrix} \qquad \text{or,} \qquad A \cdot X = rhs \]

Each block K, G and each vector f, h is built separately in the code and assembled into the matrix A and vector rhs afterwards. A and rhs are then passed to the solver. We will see later that there are alternative ways to solve this system which do not require building the full Stokes matrix A.

Each element has m = 4 vertices, so in total ndofV × m = 8 velocity dofs, and a single pressure dof, commonly situated in the center of the element. The total number of velocity dofs is therefore NfemV = nnp × ndofV while the total number of pressure dofs is NfemP = nel. The total number of dofs is then Nfem = NfemV + NfemP.

As a consequence, matrix K has size NfemV × NfemV and matrix G has size NfemV × NfemP. Vector f is of size NfemV and vector h is of size NfemP.
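The bookkeeping above can be sketched in a few lines. This is only an illustration with plain Python lists (the actual code presumably relies on array libraries); `stokes_dof_counts` and `assemble_saddle_point` are hypothetical names, and the tiny K, G, f, h used in the example are arbitrary numbers.

```python
def stokes_dof_counts(nelx, nely, ndofV=2):
    """Number of velocity, pressure and total dofs for a Q1xP0 mesh of nelx x nely elements."""
    nnp = (nelx + 1) * (nely + 1)   # number of nodes
    nel = nelx * nely               # number of elements
    NfemV = nnp * ndofV
    NfemP = nel                     # one pressure dof per element
    return NfemV, NfemP, NfemV + NfemP


def assemble_saddle_point(K, G, f, h):
    """Build A = [[K, G], [G^T, 0]] and rhs = [f, h] from the separate blocks."""
    NfemV = len(K)
    NfemP = len(h)
    Nfem = NfemV + NfemP
    A = [[0.0] * Nfem for _ in range(Nfem)]
    rhs = [0.0] * Nfem
    for i in range(NfemV):
        for j in range(NfemV):
            A[i][j] = K[i][j]
        for j in range(NfemP):
            A[i][NfemV + j] = G[i][j]   # upper right block G
            A[NfemV + j][i] = G[i][j]   # lower left block G^T
        rhs[i] = f[i]
    for j in range(NfemP):
        rhs[NfemV + j] = h[j]
    return A, rhs


print(stokes_dof_counts(3, 2))

K = [[2.0, 1.0], [1.0, 3.0]]
G = [[1.0], [-1.0]]
A, rhs = assemble_saddle_point(K, G, [1.0, 2.0], [0.0])
```

The zero block in the lower right corner of A is what makes this a saddle point problem.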

features

•Q1×P0element

•incompressible ﬂow

•mixed formulation

•Dirichlet boundary conditions (no-slip)

•direct solver (?)

•isothermal

•isoviscous

•analytical solution

•pressure smoothing


Unlike the results obtained with the penalty formulation (see Section ??), the pressure showcases a very strong checkerboard pattern, similar to the one in [43].

Left: pressure solution as shown in [43]; Right: pressure solution obtained with ﬁeldstone.

Rather interestingly, the nodal pressure (obtained with a simple center-to-node algorithm) fails to

recover a correct pressure at the four corners.


21 fieldstone f15: saddle point problem with Schur complement approach - benchmark

The details of the numerical setup are presented in Section ??. The main diﬀerence resides in the Schur

complement approach to solve the Stokes system, as presented in Section 6.15 (see solver cg). This

iterative solver is very easy to implement once the blocks Kand G, as well as the rhs vectors fand h

have been built.

Rather interestingly the pressure checkerboard modes are not nearly as present as in Section ?? which

uses a full matrix approach.

Looking at the discretisation errors for velocity and pressure, we of course recover the same rates and

values as in the full matrix case.

[Figure: velocity and pressure errors on logarithmic axes.]

Finally, for each experiment the normalised residual (see solver cg) was recorded. We see that, all other things being equal, the resolution has a strong influence on the number of iterations the solver must perform to reach the required tolerance. This is one of the manifestations of the fact that the Q1×P0 element is not a stable element: the condition number of the matrix increases with resolution. We will see that this is not the case for stable elements such as Q2×Q1.

[Figure: normalised residual as a function of the iteration number for resolutions 8x8, 12x12, 16x16, 24x24, 32x32, 48x48, 64x64 and 128x128; the 1e-8 tolerance is indicated.]

features

•Q1×P0element

•incompressible ﬂow

•mixed formulation

•Schur complement approach

•isothermal

•isoviscous

•analytical solution

build S and have python compute its smallest and largest eigenvalues as a function of resolution?


22 fieldstone f16: saddle point problem with Schur complement approach - Stokes sphere

We are revisiting the 2D Stokes sphere problem, but this time we use the Schur complement approach to solve the Stokes system. Because there are viscosity contrasts in the domain, it is advisable to use the Preconditioned Conjugate Gradient as presented in Section 6.15 (see solver pcg).

The normalised residual (see solver pcg) was recorded. We see that, all other things being equal, the resolution has a strong influence on the number of iterations the solver must perform to reach the required tolerance. However, we see that the use of the preconditioner can substantially reduce the number of iterations inside the Stokes solver. At resolution 128x128, this number is halved.

[Figure: normalised residual as a function of the iteration number for resolutions 32x32, 64x64 and 128x128, with and without preconditioner; the 1e-8 tolerance is indicated.]

features

•Q1×P0element

•incompressible ﬂow

•mixed formulation

•Schur complement approach

•isothermal

•non-isoviscous

•Stokes sphere


23 fieldstone 17: solving the full saddle point problem in 3D

When using Q1×P0 elements, this benchmark fails because of the Dirichlet b.c. on all 6 sides and all three components. However, as we will see, it does work well with Q2×Q1 elements.

This benchmark begins by postulating a polynomial solution to the 3D Stokes equation [41]:

\[ \vec{v} = \begin{pmatrix} x + x^2 + xy + x^3 y \\ y + xy + y^2 + x^2 y^2 \\ -2z - 3xz - 3yz - 5x^2 yz \end{pmatrix} \tag{229} \]

and

\[ p = xyz + x^3 y^3 z - 5/32 \tag{230} \]

While it is then trivial to verify that this velocity field is divergence-free, the corresponding body force of the Stokes equation can be computed by inserting this solution into the momentum equation with a given viscosity $\mu$ (constant or position/velocity/strain rate dependent). The domain is a unit cube and velocity boundary conditions simply use Eq. (229). Following [25], the viscosity is given by the smoothly varying function

\[ \mu = \exp(1 - \beta(x(1-x) + y(1-y) + z(1-z))) \tag{231} \]

One can easily show that the ratio of the smallest to the largest viscosity in the system is $\mu^\star = \exp(-3\beta/4)$, so that choosing $\beta = 10$ yields a viscosity contrast $1/\mu^\star \simeq 1808$ and $\beta = 20$ yields $1/\mu^\star \simeq 3.269 \times 10^6$.
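The quoted contrasts can be checked directly from Eq. (231): the viscosity is largest at the corners of the unit cube and smallest at its centre. The helper names below are chosen for this sketch only.

```python
import math


def viscosity(x, y, z, beta):
    """Smoothly varying viscosity of Eq. (231)."""
    return math.exp(1.0 - beta * (x * (1 - x) + y * (1 - y) + z * (1 - z)))


def viscosity_contrast(beta):
    """Largest over smallest viscosity in the unit cube: corner value over centre value."""
    return viscosity(0.0, 0.0, 0.0, beta) / viscosity(0.5, 0.5, 0.5, beta)


for beta in (10, 20):
    print(beta, viscosity_contrast(beta))
```

The contrast equals exp(3β/4), i.e. the inverse of exp(−3β/4).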

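We start from the momentum conservation equation:

\[ -\nabla p + \nabla \cdot (2 \mu \dot{\varepsilon}) = \vec{f} \]

The x-component of this equation writes

\begin{align}
f_x &= -\frac{\partial p}{\partial x} + \frac{\partial}{\partial x}(2\mu \dot{\varepsilon}_{xx}) + \frac{\partial}{\partial y}(2\mu \dot{\varepsilon}_{xy}) + \frac{\partial}{\partial z}(2\mu \dot{\varepsilon}_{xz}) \tag{232} \\
&= -\frac{\partial p}{\partial x} + 2\mu \frac{\partial}{\partial x}\dot{\varepsilon}_{xx} + 2\mu \frac{\partial}{\partial y}\dot{\varepsilon}_{xy} + 2\mu \frac{\partial}{\partial z}\dot{\varepsilon}_{xz} + 2\frac{\partial \mu}{\partial x}\dot{\varepsilon}_{xx} + 2\frac{\partial \mu}{\partial y}\dot{\varepsilon}_{xy} + 2\frac{\partial \mu}{\partial z}\dot{\varepsilon}_{xz} \tag{233}
\end{align}

Let us compute all the terms separately:

\begin{align*}
\dot{\varepsilon}_{xx} &= 1 + 2x + y + 3x^2 y \\
\dot{\varepsilon}_{yy} &= 1 + x + 2y + 2x^2 y \\
\dot{\varepsilon}_{zz} &= -2 - 3x - 3y - 5x^2 y \\
2\dot{\varepsilon}_{xy} &= (x + x^3) + (y + 2xy^2) = x + y + 2xy^2 + x^3 \\
2\dot{\varepsilon}_{xz} &= (0) + (-3z - 10xyz) = -3z - 10xyz \\
2\dot{\varepsilon}_{yz} &= (0) + (-3z - 5x^2 z) = -3z - 5x^2 z
\end{align*}

In passing, one can verify that $\dot{\varepsilon}_{xx} + \dot{\varepsilon}_{yy} + \dot{\varepsilon}_{zz} = 0$. We further have

\begin{align*}
\frac{\partial}{\partial x} 2\dot{\varepsilon}_{xx} &= 2(2 + 6xy) &
\frac{\partial}{\partial y} 2\dot{\varepsilon}_{xy} &= 1 + 4xy &
\frac{\partial}{\partial z} 2\dot{\varepsilon}_{xz} &= -3 - 10xy \\
\frac{\partial}{\partial x} 2\dot{\varepsilon}_{xy} &= 1 + 2y^2 + 3x^2 &
\frac{\partial}{\partial y} 2\dot{\varepsilon}_{yy} &= 2(2 + 2x^2) &
\frac{\partial}{\partial z} 2\dot{\varepsilon}_{yz} &= -3 - 5x^2 \\
\frac{\partial}{\partial x} 2\dot{\varepsilon}_{xz} &= -10yz &
\frac{\partial}{\partial y} 2\dot{\varepsilon}_{yz} &= 0 &
\frac{\partial}{\partial z} 2\dot{\varepsilon}_{zz} &= 2(0)
\end{align*}

\begin{align}
\frac{\partial p}{\partial x} &= yz + 3x^2 y^3 z \tag{234} \\
\frac{\partial p}{\partial y} &= xz + 3x^3 y^2 z \tag{235} \\
\frac{\partial p}{\partial z} &= xy + x^3 y^3 \tag{236}
\end{align}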

Pressure normalisation. Here again, because Dirichlet boundary conditions are prescribed on all sides, the pressure is known up to an arbitrary constant. This constant can be determined by (arbitrarily) choosing to normalise the pressure field as follows:

\[ \int_\Omega p \, d\Omega = 0 \tag{237} \]

This is a single constraint associated to a single Lagrange multiplier $\lambda$ and the global Stokes system takes the form

\[ \begin{pmatrix} K & G & 0 \\ G^T & 0 & C \\ 0 & C^T & 0 \end{pmatrix} \begin{pmatrix} V \\ P \\ \lambda \end{pmatrix} = \begin{pmatrix} f \\ h \\ 0 \end{pmatrix} \]

In this particular case the constraint matrix C is a vector and it only acts on the pressure degrees of freedom because of Eq. (237). Its exact expression is as follows:

\[ \int_\Omega p \, d\Omega = \sum_e \int_{\Omega_e} p \, d\Omega = \sum_e \int_{\Omega_e} \sum_i N_i^p p_i \, d\Omega = \sum_e \sum_i \left( \int_{\Omega_e} N_i^p \, d\Omega \right) p_i = \sum_e C_e \cdot p_e \]

where $p_e$ is the list of pressure dofs of element $e$. The elemental constraint vector contains the corresponding pressure basis functions integrated over the element. These elemental constraints are then assembled into the vector C.

23.0.1 Constant viscosity

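Choosing $\beta = 0$ yields a constant viscosity $\mu(x, y, z) = \exp(1) \simeq 2.718$ (and greatly simplifies the right-hand side) so that

\begin{align}
\frac{\partial}{\partial x} \mu(x, y, z) &= 0 \tag{238} \\
\frac{\partial}{\partial y} \mu(x, y, z) &= 0 \tag{239} \\
\frac{\partial}{\partial z} \mu(x, y, z) &= 0 \tag{240}
\end{align}

and

\begin{align*}
f_x &= -\frac{\partial p}{\partial x} + 2\mu \frac{\partial}{\partial x}\dot{\varepsilon}_{xx} + 2\mu \frac{\partial}{\partial y}\dot{\varepsilon}_{xy} + 2\mu \frac{\partial}{\partial z}\dot{\varepsilon}_{xz} \\
&= -(yz + 3x^2 y^3 z) + \mu \, 2(2 + 6xy) + \mu (1 + 4xy) + \mu (-3 - 10xy) \\
&= -(yz + 3x^2 y^3 z) + \mu (2 + 6xy) \\
f_y &= -\frac{\partial p}{\partial y} + 2\mu \frac{\partial}{\partial x}\dot{\varepsilon}_{xy} + 2\mu \frac{\partial}{\partial y}\dot{\varepsilon}_{yy} + 2\mu \frac{\partial}{\partial z}\dot{\varepsilon}_{yz} \\
&= -(xz + 3x^3 y^2 z) + \mu (1 + 2y^2 + 3x^2) + \mu \, 2(2 + 2x^2) + \mu (-3 - 5x^2) \\
&= -(xz + 3x^3 y^2 z) + \mu (2 + 2x^2 + 2y^2) \\
f_z &= -\frac{\partial p}{\partial z} + 2\mu \frac{\partial}{\partial x}\dot{\varepsilon}_{xz} + 2\mu \frac{\partial}{\partial y}\dot{\varepsilon}_{yz} + 2\mu \frac{\partial}{\partial z}\dot{\varepsilon}_{zz} \\
&= -(xy + x^3 y^3) + \mu (-10yz) + 0 + 0 \\
&= -(xy + x^3 y^3) + \mu (-10yz)
\end{align*}

Finally

\[ \vec{f} = - \begin{pmatrix} yz + 3x^2 y^3 z \\ xz + 3x^3 y^2 z \\ xy + x^3 y^3 \end{pmatrix} + \mu \begin{pmatrix} 2 + 6xy \\ 2 + 2x^2 + 2y^2 \\ -10yz \end{pmatrix} \]

Note that there seems to be a sign problem with Eq.(26) in [25].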

[Figure: velocity and pressure errors on logarithmic axes.]

23.0.2 Variable viscosity

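The spatial derivatives of the viscosity are then given by

\begin{align*}
\frac{\partial}{\partial x} \mu(x, y, z) &= -(1 - 2x) \beta \mu(x, y, z) \\
\frac{\partial}{\partial y} \mu(x, y, z) &= -(1 - 2y) \beta \mu(x, y, z) \\
\frac{\partial}{\partial z} \mu(x, y, z) &= -(1 - 2z) \beta \mu(x, y, z)
\end{align*}

and the right-hand side by

\begin{align}
\vec{f} &= - \begin{pmatrix} yz + 3x^2 y^3 z \\ xz + 3x^3 y^2 z \\ xy + x^3 y^3 \end{pmatrix} + \mu \begin{pmatrix} 2 + 6xy \\ 2 + 2x^2 + 2y^2 \\ -10yz \end{pmatrix} \tag{241} \\
&\quad - (1 - 2x) \beta \mu \begin{pmatrix} 2\dot{\varepsilon}_{xx} \\ 2\dot{\varepsilon}_{xy} \\ 2\dot{\varepsilon}_{xz} \end{pmatrix} - (1 - 2y) \beta \mu \begin{pmatrix} 2\dot{\varepsilon}_{xy} \\ 2\dot{\varepsilon}_{yy} \\ 2\dot{\varepsilon}_{yz} \end{pmatrix} - (1 - 2z) \beta \mu \begin{pmatrix} 2\dot{\varepsilon}_{xz} \\ 2\dot{\varepsilon}_{yz} \\ 2\dot{\varepsilon}_{zz} \end{pmatrix} \nonumber \\
&= - \begin{pmatrix} yz + 3x^2 y^3 z \\ xz + 3x^3 y^2 z \\ xy + x^3 y^3 \end{pmatrix} + \mu \begin{pmatrix} 2 + 6xy \\ 2 + 2x^2 + 2y^2 \\ -10yz \end{pmatrix} \nonumber \\
&\quad - (1 - 2x) \beta \mu \begin{pmatrix} 2 + 4x + 2y + 6x^2 y \\ x + y + 2xy^2 + x^3 \\ -3z - 10xyz \end{pmatrix} - (1 - 2y) \beta \mu \begin{pmatrix} x + y + 2xy^2 + x^3 \\ 2 + 2x + 4y + 4x^2 y \\ -3z - 5x^2 z \end{pmatrix} - (1 - 2z) \beta \mu \begin{pmatrix} -3z - 10xyz \\ -3z - 5x^2 z \\ -4 - 6x - 6y - 10x^2 y \end{pmatrix} \nonumber
\end{align}

Note that at $(x, y, z) = (0, 0, 0)$, $\mu = \exp(1)$, and at $(x, y, z) = (0.5, 0.5, 0.5)$, $\mu = \exp(1 - 3\beta/4)$, so that the ratio of the smallest to the largest viscosity is given by

\[ \mu^\star = \frac{\exp(1 - 3\beta/4)}{\exp(1)} = \exp(-3\beta/4) \]

By varying $\beta$ between 1 and 22 we can get up to 7 orders of magnitude viscosity difference.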
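The expanded right-hand side is easy to get wrong, so a numerical cross-check is useful. The sketch below evaluates the analytical body force of Eq. (241) and compares it with a central finite difference of the stress σ = −p 1 + 2µε̇ built from the same manufactured fields. It is a verification aid written for this text, not part of the fieldstone code; all function names and the step size d are assumptions.

```python
import math


def mu(x, y, z, beta):
    return math.exp(1.0 - beta * (x * (1 - x) + y * (1 - y) + z * (1 - z)))


def pressure(x, y, z):
    return x * y * z + x**3 * y**3 * z - 5.0 / 32.0


def two_strain_rate(x, y, z):
    """Symmetric tensor 2*strain rate of the polynomial velocity field (Eq. 229)."""
    exx = 2 + 4 * x + 2 * y + 6 * x * x * y
    eyy = 2 + 2 * x + 4 * y + 4 * x * x * y
    ezz = -4 - 6 * x - 6 * y - 10 * x * x * y
    exy = x + y + 2 * x * y * y + x**3
    exz = -3 * z - 10 * x * y * z
    eyz = -3 * z - 5 * x * x * z
    return [[exx, exy, exz], [exy, eyy, eyz], [exz, eyz, ezz]]


def body_force(x, y, z, beta):
    """Analytical right-hand side, Eq. (241)."""
    m = mu(x, y, z, beta)
    grad_p = [y * z + 3 * x * x * y**3 * z, x * z + 3 * x**3 * y * y * z, x * y + x**3 * y**3]
    div_2e = [2 + 6 * x * y, 2 + 2 * x * x + 2 * y * y, -10 * y * z]
    grad_mu = [-(1 - 2 * x) * beta * m, -(1 - 2 * y) * beta * m, -(1 - 2 * z) * beta * m]
    e2 = two_strain_rate(x, y, z)
    return [-grad_p[i] + m * div_2e[i] + sum(grad_mu[j] * e2[i][j] for j in range(3))
            for i in range(3)]


def body_force_fd(x, y, z, beta, d=1e-4):
    """Same quantity from central differences of sigma_ij = 2 mu eps_ij - p delta_ij."""
    def sigma(px, py, pz):
        m = mu(px, py, pz, beta)
        e2 = two_strain_rate(px, py, pz)
        p = pressure(px, py, pz)
        return [[m * e2[i][j] - (p if i == j else 0.0) for j in range(3)] for i in range(3)]

    f = [0.0, 0.0, 0.0]
    for j in range(3):
        plus = [x, y, z]
        minus = [x, y, z]
        plus[j] += d
        minus[j] -= d
        sp = sigma(*plus)
        sm = sigma(*minus)
        for i in range(3):
            f[i] += (sp[i][j] - sm[i][j]) / (2 * d)
    return f


print(body_force(0.3, 0.6, 0.2, 1.0))
print(body_force_fd(0.3, 0.6, 0.2, 1.0))
```

Setting beta to zero recovers the constant viscosity case.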

features

•Q1×P0element

•incompressible ﬂow

•saddle point system

•Dirichlet boundary conditions (free-slip)

•direct solver

•isothermal

•non-isoviscous

•3D

•elemental b.c.

•analytical solution


24 fieldstone 18: solving the full saddle point problem with Q2×Q1 elements

The details of the numerical setup are presented in Section ??.

Each element has mV = 9 velocity nodes, so in total ndofV × mV = 18 velocity dofs, and ndofP ∗ mP = 4 pressure dofs. The total number of velocity dofs is therefore NfemV = nnp × ndofV, while the total number of pressure dofs is NfemP = (nelx + 1) × (nely + 1) × ndofP, since the pressure nodes are the corners of the elements and are shared between neighbouring elements. The total number of dofs is then Nfem = NfemV + NfemP.

As a consequence, matrix K has size NfemV × NfemV and matrix G has size NfemV × NfemP. Vector f is of size NfemV and vector h is of size NfemP.

renumber all nodes to start at zero!! Also internal numbering does not work here

features

•Q2×Q1element

•incompressible ﬂow

•mixed formulation

•Dirichlet boundary conditions (no-slip)

•isothermal

•isoviscous

•analytical solution

[Figure: velocity and pressure errors on logarithmic axes.]

25 fieldstone 19: solving the full saddle point problem with Q3×Q2 elements

The details of the numerical setup are presented in Section ??.

Each element has mV = 16 velocity nodes, so in total ndofV × mV = 32 velocity dofs, and ndofP ∗ mP = 9 pressure dofs. The total number of velocity dofs is therefore NfemV = nnp × ndofV, while the total number of pressure dofs is NfemP = (2 nelx + 1) × (2 nely + 1) × ndofP, since the pressure nodes located on element edges and corners are shared between neighbouring elements. The total number of dofs is then Nfem = NfemV + NfemP.

As a consequence, matrix K has size NfemV × NfemV and matrix G has size NfemV × NfemP. Vector f is of size NfemV and vector h is of size NfemP.

60===61===62===63===64===65===66===67===68===69
||             ||             ||             ||
50   51   52   53   54   55   56   57   58   59
||             ||             ||             ||
40   41   42   43   44   45   46   47   48   49
||             ||             ||             ||
30===31===32===33===34===35===36===37===38===39
||             ||             ||             ||
20   21   22   23   24   25   26   27   28   29
||             ||             ||             ||
10   11   12   13   14   15   16   17   18   19
||             ||             ||             ||
00===01===02===03===04===05===06===07===08===09

Example of 3x2 mesh. nnx=10, nny=7, nnp=70, nelx=3, nely=2, nel=6

12===13===14===15      06=====07=====08
||   ||   ||   ||      ||     ||     ||
08===09===10===11      ||     ||     ||
||   ||   ||   ||      03=====04=====05
04===05===06===07      ||     ||     ||
||   ||   ||   ||      ||     ||     ||
00===01===02===03      00=====01=====02

Velocity (Q3)               Pressure (Q2)
(r,s)_{00}=(-1,-1)          (r,s)_{00}=(-1,-1)
(r,s)_{01}=(-1/3,-1)        (r,s)_{01}=(0,-1)
(r,s)_{02}=(+1/3,-1)        (r,s)_{02}=(+1,-1)
(r,s)_{03}=(+1,-1)          (r,s)_{03}=(-1,0)
(r,s)_{04}=(-1,-1/3)        (r,s)_{04}=(0,0)
(r,s)_{05}=(-1/3,-1/3)      (r,s)_{05}=(+1,0)
(r,s)_{06}=(+1/3,-1/3)      (r,s)_{06}=(-1,+1)
(r,s)_{07}=(+1,-1/3)        (r,s)_{07}=(0,+1)
(r,s)_{08}=(-1,+1/3)        (r,s)_{08}=(+1,+1)
(r,s)_{09}=(-1/3,+1/3)
(r,s)_{10}=(+1/3,+1/3)
(r,s)_{11}=(+1,+1/3)
(r,s)_{12}=(-1,+1)
(r,s)_{13}=(-1/3,+1)
(r,s)_{14}=(+1/3,+1)
(r,s)_{15}=(+1,+1)
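The node counts of the example mesh and the reference coordinates listed above follow a simple pattern, sketched here. The function names are hypothetical; the pressure node count assumes that the Q2 pressure nodes are shared between neighbouring elements (continuous pressure), which is an assumption of this sketch.

```python
from fractions import Fraction


def q3q2_counts(nelx, nely):
    """Node and element counts for a Q3xQ2 mesh of nelx x nely elements."""
    nnx = 3 * nelx + 1          # velocity nodes per row
    nny = 3 * nely + 1          # velocity nodes per column
    return {"nnx": nnx, "nny": nny, "nnp": nnx * nny, "nel": nelx * nely,
            "pressure_nodes": (2 * nelx + 1) * (2 * nely + 1)}


def reference_nodes(order):
    """(r, s) of the (order+1)**2 equidistant nodes, numbered row by row from (-1,-1)."""
    coords = [Fraction(-1) + Fraction(2 * i, order) for i in range(order + 1)]
    return [(r, s) for s in coords for r in coords]


counts = q3q2_counts(3, 2)
print(counts)

velocity_nodes = reference_nodes(3)   # Q3: 16 nodes at -1, -1/3, +1/3, +1
pressure_nodes = reference_nodes(2)   # Q2: 9 nodes at -1, 0, +1
for k, (r, s) in enumerate(velocity_nodes):
    print(k, r, s)
```

Exact fractions are used only to make the comparison with the table above unambiguous; floating point values would be used in practice.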

Write about 4 point quadrature.


features

•Q3×Q2element

•incompressible ﬂow

•mixed formulation

•isothermal

•isoviscous

•analytical solution

[Figure: velocity and pressure errors on logarithmic axes.]

The velocity error rate is cubic; the pressure is superconvergent since the pressure field is quadratic and therefore lies in the Q2 space.

26 fieldstone 20: the Busse benchmark

This three-dimensional benchmark was first proposed by [26]. It has been subsequently presented in [125, 136, 3, 100, 37, 79]. We here focus on Case 1 of [26]: an isoviscous bimodal convection experiment at $Ra = 3 \times 10^4$.

The domain is of size $a \times b \times h$ with $a = 1.0079h$, $b = 0.6283h$ and $h = 2700$km. It is filled with a Newtonian fluid characterised by $\rho_0 = 3300$kg.m$^{-3}$, $\alpha = 10^{-5}$K$^{-1}$, $\mu = 8.0198 \times 10^{23}$Pa.s, $k = 3.564$W.m$^{-1}$.K$^{-1}$, $c_p = 1080$J.K$^{-1}$.kg$^{-1}$. The gravity vector is set to $\vec{g} = (0, 0, -10)^T$. The temperature is imposed at the bottom ($T = 3700^\circ$C) and at the top ($T = 0^\circ$C).
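As a consistency check on these parameters, the Rayleigh number can be recomputed. The sketch assumes the usual definition Ra = ρ0 g α ΔT h³ / (κ µ) with κ = k / (ρ0 cp), which is not spelled out in this section; with the values listed above it evaluates to roughly 3 × 10^4.

```python
rho0 = 3300.0      # kg/m^3
alpha = 1e-5       # 1/K
mu = 8.0198e23     # Pa.s
k = 3.564          # W/m/K
cp = 1080.0        # J/K/kg
g = 10.0           # m/s^2
h = 2700e3         # m
dT = 3700.0        # K, temperature difference between bottom and top

kappa = k / (rho0 * cp)                          # thermal diffusivity
Ra = rho0 * g * alpha * dT * h**3 / (kappa * mu)
reference_time = h**2 / kappa                    # diffusion time used for adimensionalisation

print(kappa, Ra, reference_time)
```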

The various measurements presented in [26] are listed hereafter:

• The Nusselt number $Nu$ computed at the top surface following Eq. (206):

\[ Nu = L_z \frac{\int\int_{z=L_z} \frac{\partial T}{\partial z} \, dx \, dy}{\int\int_{z=0} T \, dx \, dy} \]

• the root mean square velocity $v_{rms}$ and the root mean square temperature $T_{rms}$

• The vertical velocity $w$ and temperature $T$ at points $x_1 = (0, 0, L_z/2)$, $x_2 = (L_x, 0, L_z/2)$, $x_3 = (0, L_y, L_z/2)$ and $x_4 = (L_x, L_y, L_z/2)$;

• the vertical component of the heat flux $Q$ at the top surface at all four corners.

The values plotted hereunder are adimensionalised by means of a reference temperature (3700K), a reference lengthscale 2700km, and a reference time $L_z^2/\kappa$.

[Figures: time series (time in years) of vrms, <T>, w at Lz/2, and the heat flux.]

features

•Q1×P0element

•incompressible ﬂow

•mixed formulation

•Dirichlet boundary conditions (free-slip)

•direct solver

•isothermal

•non-isoviscous

•3D

•elemental b.c.

•buoyancy driven

ToDo: look at energy conservation. run to steady state and make sure the expected values are

retrieved.


27 fieldstone 21: The non-conforming Q1×P0element

features

•Non-conforming Q1×P0element

•incompressible ﬂow

•mixed formulation

•isothermal

•non-isoviscous

•analytical solution

•pressure smoothing

try Q1 mapping instead of isoparametric.


28 fieldstone 22: The stabilised Q1×Q1element

The details of the numerical setup are presented in Section ??.

We will here focus on the Q1×Q1 element, which, unless stabilised, violates the LBB stability condition and therefore is unusable. Stabilisation can be of two types: least-squares (see for example [44]) [128, 76, ?], or by means of an additional term in the weak form as first introduced in [41, 16]. It is further analysed in [99, 84]. Note that an equal-order velocity-pressure formulation that does not exhibit spurious pressure modes (without stabilisation) has been presented in [110].

This element corresponds to bilinear velocities and bilinear pressure (equal order interpolation for both velocity and pressure), which is very convenient in terms of data structures since all dofs are colocated.

In geodynamics, it is used in the Rhea code [122, 25] and in Gale. It is also used in [83] in its stabilised form, in conjunction with AMR.

The stabilisation term C enters the Stokes matrix in the (2,2) position:

\[ \begin{pmatrix} K & G \\ G^T & C \end{pmatrix} \cdot \begin{pmatrix} V \\ P \end{pmatrix} = \begin{pmatrix} f \\ h \end{pmatrix} \]

Let us take a simple example: a 3x2 element grid.

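[Figure: the 3x2 element grid, with elements numbered 0 to 5 (3 per row), nodes numbered 0 to 11 (4 per row), and velocity dofs 2i, 2i+1 attached to node i (0,1 for node 0 up to 22,23 for node 11).]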

The K matrix is of size NfemV × NfemV with NfemV = ndofV × nnp = 2 × 12 = 24. The G matrix is of size NfemV × NfemP with NfemP = ndofP × nnp = 1 × 12 = 12. The C matrix is of size NfemP × NfemP.

A corner pdof sees 8 vdofs (those of 4 nodes), a side pdof sees 12 vdofs and an inside pdof sees 18 vdofs, so that the total number of nonzeros in G can be computed as follows:

\[ NZ_G = \underbrace{4 \ast 8}_{\text{corners}} + \underbrace{2(nnx - 2) \ast 12}_{\text{2 hor. sides}} + \underbrace{2(nny - 2) \ast 12}_{\text{2 vert. sides}} + \underbrace{(nnx - 2)(nny - 2) \ast 18}_{\text{inside nodes}} \]

Concretely,

•pdof #0 sees vdofs 0,1,2,3,8,9,10,11

•pdof #1 sees vdofs 0,1,2,3,4,5,8,9,10,11,12,13

•pdof #5 sees vdofs 0,1,2,3,4,5,8,9,10,11,12,13,16,17,18,19,20,21

so that the GTmatrix non-zero structure then is as follows:

[Figure: non-zero structure of the G^T matrix, with one column per velocity dof (0 to 23).]
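The sparsity pattern can also be obtained by brute force, which provides a check on the count of nonzeros. The sketch below assumes row-by-row node numbering with nnx nodes per row and two velocity dofs per node (2i and 2i+1 for node i), as in the example above, and counts 8 velocity dofs for each corner pressure dof, consistent with the list given for pdof #0. The function names are hypothetical.

```python
def pdof_to_vdofs(nnx, nny, ndofV=2):
    """For each pressure dof (one per node), the set of velocity dofs it is coupled to."""
    seen = {p: set() for p in range(nnx * nny)}
    for j in range(nny - 1):
        for i in range(nnx - 1):
            n0 = i + j * nnx
            nodes = (n0, n0 + 1, n0 + nnx + 1, n0 + nnx)   # the 4 nodes of the element
            for p in nodes:
                for q in nodes:
                    for k in range(ndofV):
                        seen[p].add(ndofV * q + k)
    return seen


def nz_formula(nnx, nny):
    """Closed form: corners, horizontal sides, vertical sides, inside nodes."""
    return 4 * 8 + 2 * (nnx - 2) * 12 + 2 * (nny - 2) * 12 + (nnx - 2) * (nny - 2) * 18


seen = pdof_to_vdofs(4, 3)                 # the 3x2 element grid has 4x3 nodes
nz_counted = sum(len(v) for v in seen.values())
print(sorted(seen[0]))
print(nz_counted, nz_formula(4, 3))
```

Knowing the number of nonzeros beforehand is what allows the sparse storage arrays to be allocated with the right size.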

We will here use the formulation of [41], which is very appealing since it does not require any free (user-chosen) parameter. Furthermore the matrix C is rather simple to compute as we will see hereafter.

We also impose $\int p \, dV = 0$, which means that the following constraint is added to the Stokes matrix:

\[ \begin{pmatrix} K & G & 0 \\ G^T & C & L \\ 0 & L^T & 0 \end{pmatrix} \cdot \begin{pmatrix} V \\ P \\ \lambda \end{pmatrix} = \begin{pmatrix} f \\ h \\ 0 \end{pmatrix} \]

As in [41] we solve the benchmark problem presented in Section 6.7.2.

[Figure: velocity and pressure errors on logarithmic axes, with an x^1.5 reference line.]

29 fieldstone 23: compressible ﬂow (1) - analytical benchmark

This work is part of the MSc thesis of T. Weir (2018).

We first start with an isothermal Stokes flow, so that we disregard the heat transport equation and the equations we wish to solve are simply:

\begin{align}
-\nabla \cdot \left[ 2\eta \left( \dot{\varepsilon}(\vec{v}) - \frac{1}{3} (\nabla \cdot \vec{v}) \mathbf{1} \right) \right] + \nabla p &= \rho \vec{g} \quad \text{in } \Omega, \tag{242} \\
\nabla \cdot (\rho \vec{v}) &= 0 \quad \text{in } \Omega \tag{243}
\end{align}

The second equation can be rewritten $\nabla \cdot (\rho \vec{v}) = \rho \nabla \cdot \vec{v} + \vec{v} \cdot \nabla \rho = 0$ or,

\[ \nabla \cdot \vec{v} + \frac{1}{\rho} \vec{v} \cdot \nabla \rho = 0 \]

Note that this presupposes that the density is not zero anywhere in the domain.

We use a mixed formulation and therefore keep both velocity and pressure as unknowns. We end up having to solve the following system:

\[ \begin{pmatrix} K & G \\ G^T + Z & 0 \end{pmatrix} \cdot \begin{pmatrix} V \\ P \end{pmatrix} = \begin{pmatrix} f \\ h \end{pmatrix} \qquad \text{or,} \qquad A \cdot X = rhs \]

where K is the stiffness matrix, G is the discrete gradient operator, G^T is the discrete divergence operator, V the velocity vector, P the pressure vector. Note that the term ZV derives from the term $\vec{v} \cdot \nabla \rho$ in the continuity equation.

Each block K, G, Z and vectors f and h are built separately in the code and assembled into the matrix A and vector rhs afterwards. A and rhs are then passed to the solver. We will see later that there are alternative ways to solve this system which do not require building the full Stokes matrix A.

Remark: the term ZV is often put in the rhs (i.e. added to h) so that the matrix A retains the same structure as in the incompressible case. This is indeed how it is implemented in ASPECT. This however requires more work since the rhs depends on the solution and some form of iterations is needed.

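In the case of a compressible flow the strain rate tensor and the deviatoric strain rate tensor are no longer equal (since $\nabla \cdot \vec{v} \neq 0$). The deviatoric strain rate tensor is given by$^{12}$

\[ \dot{\varepsilon}^d(\vec{v}) = \dot{\varepsilon}(\vec{v}) - \frac{1}{3} Tr(\dot{\varepsilon}) \mathbf{1} = \dot{\varepsilon}(\vec{v}) - \frac{1}{3} (\nabla \cdot \vec{v}) \mathbf{1} \]

In that case:

\begin{align}
\dot{\varepsilon}^d_{xx} &= \frac{\partial u}{\partial x} - \frac{1}{3} \left( \frac{\partial u}{\partial x} + \frac{\partial v}{\partial y} \right) = \frac{2}{3} \frac{\partial u}{\partial x} - \frac{1}{3} \frac{\partial v}{\partial y} \tag{244} \\
\dot{\varepsilon}^d_{yy} &= \frac{\partial v}{\partial y} - \frac{1}{3} \left( \frac{\partial u}{\partial x} + \frac{\partial v}{\partial y} \right) = -\frac{1}{3} \frac{\partial u}{\partial x} + \frac{2}{3} \frac{\partial v}{\partial y} \tag{245} \\
2 \dot{\varepsilon}^d_{xy} &= \frac{\partial u}{\partial y} + \frac{\partial v}{\partial x} \tag{246}
\end{align}

and then

\[ \dot{\varepsilon}^d(\vec{v}) = \begin{pmatrix} \frac{2}{3} \frac{\partial u}{\partial x} - \frac{1}{3} \frac{\partial v}{\partial y} & \frac{1}{2} \frac{\partial u}{\partial y} + \frac{1}{2} \frac{\partial v}{\partial x} \\ \frac{1}{2} \frac{\partial u}{\partial y} + \frac{1}{2} \frac{\partial v}{\partial x} & -\frac{1}{3} \frac{\partial u}{\partial x} + \frac{2}{3} \frac{\partial v}{\partial y} \end{pmatrix} \]

From $\tau = 2\eta \dot{\varepsilon}^d$ we arrive at:

\[ \begin{pmatrix} \tau_{xx} \\ \tau_{yy} \\ \tau_{xy} \end{pmatrix} = 2\eta \begin{pmatrix} \dot{\varepsilon}^d_{xx} \\ \dot{\varepsilon}^d_{yy} \\ \dot{\varepsilon}^d_{xy} \end{pmatrix} = 2\eta \begin{pmatrix} 2/3 & -1/3 & 0 \\ -1/3 & 2/3 & 0 \\ 0 & 0 & 1/2 \end{pmatrix} \cdot \begin{pmatrix} \frac{\partial u}{\partial x} \\ \frac{\partial v}{\partial y} \\ \frac{\partial u}{\partial y} + \frac{\partial v}{\partial x} \end{pmatrix} = \eta \begin{pmatrix} 4/3 & -2/3 & 0 \\ -2/3 & 4/3 & 0 \\ 0 & 0 & 1 \end{pmatrix} \cdot \begin{pmatrix} \frac{\partial u}{\partial x} \\ \frac{\partial v}{\partial y} \\ \frac{\partial u}{\partial y} + \frac{\partial v}{\partial x} \end{pmatrix} \]

or,

\[ \tau = C_\eta B V \]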

12 See the ASPECT manual for a justification of the 3 value in the denominator in 2D and 3D.
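The matrix form of the deviatoric stress can be compared with the direct tensor expression. In the sketch below the velocity gradient components are passed in directly (in a finite element code they would come from the product B V); the function names and the sample numbers are assumptions made for illustration.

```python
def deviatoric_stress_matrix(dudx, dudy, dvdx, dvdy, eta):
    """(tau_xx, tau_yy, tau_xy) from eta * C . (du/dx, dv/dy, du/dy + dv/dx)."""
    C = [[4.0 / 3.0, -2.0 / 3.0, 0.0],
         [-2.0 / 3.0, 4.0 / 3.0, 0.0],
         [0.0, 0.0, 1.0]]
    strain = [dudx, dvdy, dudy + dvdx]
    return [eta * sum(C[i][j] * strain[j] for j in range(3)) for i in range(3)]


def deviatoric_stress_direct(dudx, dudy, dvdx, dvdy, eta):
    """Same quantity from tau = 2 eta (strain rate - (div v)/3 * identity)."""
    div = dudx + dvdy
    return [2.0 * eta * (dudx - div / 3.0),
            2.0 * eta * (dvdy - div / 3.0),
            eta * (dudy + dvdx)]


tau_a = deviatoric_stress_matrix(0.7, -0.2, 0.5, 0.1, eta=3.0)
tau_b = deviatoric_stress_direct(0.7, -0.2, 0.5, 0.1, eta=3.0)
print(tau_a)
print(tau_b)
```

When the flow is incompressible (du/dx = −dv/dy) both reduce to the familiar τxx = 2η du/dx.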

In order to test our implementation we have created a few manufactured solutions:

• benchmark #1 (ibench=1): Starting from a density profile of:

\[ \rho(x,y) = xy \tag{247} \]

We derive a velocity given by:

\[ v_x(x,y) = \frac{C_x}{x}, \qquad v_y(x,y) = \frac{C_y}{y} \tag{248} \]

With $g_x(x,y) = 1/x$ and $g_y(x,y) = 1/y$, this leads us to a pressure profile:

\[ p = -\eta\left(\frac{4C_x}{3x^2} + \frac{4C_y}{3y^2}\right) + xy + C_0 \tag{249} \]

This gives us a strain rate:

\[ \dot{\varepsilon}_{xx} = -\frac{C_x}{x^2}, \qquad \dot{\varepsilon}_{yy} = -\frac{C_y}{y^2}, \qquad \dot{\varepsilon}_{xy} = 0 \]

In what follows, we choose $\eta = 1$ and $C_x = C_y = 1$, and for a unit square domain $[1:2]\times[1:2]$ we compute $C_0$ so that the pressure is normalised to zero over the whole domain and obtain $C_0 = -1$.

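• benchmark #2 (ibench=2): Starting from a density profile of:

\[ \rho = \cos(x)\cos(y) \tag{250} \]

We derive a velocity given by:

\[ v_x = \frac{C_x}{\cos(x)}, \qquad v_y = \frac{C_y}{\cos(y)} \tag{251} \]

With $g_x = 1/\cos(y)$ and $g_y = 1/\cos(x)$, this leads us to a pressure profile:

\[ p = \eta\left(\frac{4C_x\sin(x)}{3\cos^2(x)} + \frac{4C_y\sin(y)}{3\cos^2(y)}\right) + (\sin(x) + \sin(y)) + C_0 \tag{252} \]

\[ \dot{\varepsilon}_{xx} = C_x\frac{\sin(x)}{\cos^2(x)}, \qquad \dot{\varepsilon}_{yy} = C_y\frac{\sin(y)}{\cos^2(y)}, \qquad \dot{\varepsilon}_{xy} = 0 \]

We choose $\eta = 1$ and $C_x = C_y = 1$. The domain is the unit square $[0:1]\times[0:1]$ and we compute $C_0$ as before and obtain

\[ C_0 = 2 - 2\cos(1) + \frac{8}{3}\left(\frac{1}{\cos(1)} - 1\right) \simeq 3.18823730 \]

(thank you WolframAlpha)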
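As a quick sanity check of benchmark #2, the sketch below evaluates the closed-form expression quoted above for $C_0$ and verifies numerically that the velocity field satisfies $\nabla\cdot(\rho v) = 0$. It is only an illustration: the function names, the finite-difference step and the sample point are assumptions, not part of the fieldstone code.

```python
import math


def rho(x, y):
    return math.cos(x) * math.cos(y)


def vx(x, y, cx=1.0):
    return cx / math.cos(x)


def vy(x, y, cy=1.0):
    return cy / math.cos(y)


def div_rho_v(x, y, h=1e-5):
    """Central finite-difference estimate of div(rho * v) at (x, y)."""
    ddx = (rho(x + h, y) * vx(x + h, y) - rho(x - h, y) * vx(x - h, y)) / (2 * h)
    ddy = (rho(x, y + h) * vy(x, y + h) - rho(x, y - h) * vy(x, y - h)) / (2 * h)
    return ddx + ddy


# numerical value of the expression given in the text for C0
c0 = 2 - 2 * math.cos(1) + 8 / 3 * (1 / math.cos(1) - 1)
mass_residual = div_rho_v(0.3, 0.7)  # sample point inside the unit square
```

The product $\rho v_x = C_x\cos(y)$ does not depend on $x$ (and likewise $\rho v_y$ does not depend on $y$), which is why the residual vanishes up to round-off.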

•benchmark #3 (ibench=3)

•benchmark #4 (ibench=4)

•benchmark #5 (ibench=5)

features

• Q1×P0 element

• incompressible flow

• mixed formulation

• Dirichlet boundary conditions (no-slip)

• isothermal

• isoviscous

• analytical solution

• pressure smoothing

ToDo:

• problems with odd vs even number of elements

• q is 'fine' everywhere except in the corners - revisit pressure smoothing paper?

• redo A. v.d. Berg benchmark (see Tom Weir thesis)

30 fieldstone 24: compressible ﬂow (2) - convection box

This work is part of the MSc thesis of T. Weir (2018).

30.1 The physics

Let us start with some thermodynamics. Every material has an equation of state. The equilibrium thermodynamic state of any material can be constrained if any two state variables are specified. Examples of state variables include the pressure $p$ and specific volume $\nu = 1/\rho$, as well as the temperature $T$.

After linearisation, the density depends on temperature and pressure as follows:

\[ \rho(T,p) = \rho_0\left(1 - \alpha(T - T_0) + \beta_T p\right) \]

where $\alpha$ is the coefficient of thermal expansion, also called thermal expansivity:

\[ \alpha = -\frac{1}{\rho}\left(\frac{\partial\rho}{\partial T}\right)_p \]

$\alpha$ is the relative increase in volume of a material per degree of temperature increase; the subscript $p$ means that the pressure is held fixed.

$\beta_T$ is the isothermal compressibility of the fluid, which is given by

\[ \beta_T = \frac{1}{K} = \frac{1}{\rho}\left(\frac{\partial\rho}{\partial P}\right)_T \]

with $K$ the bulk modulus. $\beta_T$ is the relative increase in density per unit change in pressure at constant temperature. Values of $\beta_T = 10^{-12} - 10^{-11}$ Pa$^{-1}$ are reasonable for Earth's mantle, with values decreasing by about a factor of 5 between the shallow lithosphere and the core-mantle boundary. Both the coefficient of thermal expansion and the isothermal compressibility can be obtained from the equation of state.
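The linearised equation of state translates directly into a small function. This is a sketch only: the function name is hypothetical, the defaults for $\rho_0$ and $T_0$ are those quoted later in the experimental setup, and the values of $\alpha$ and $\beta_T$ are illustrative choices within the ranges mentioned in the text.

```python
def density(T, p, rho0=3000.0, alpha=3e-5, beta_T=1e-11, T0=273.15):
    """Linearised equation of state rho(T, p) = rho0 * (1 - alpha*(T - T0) + beta_T*p).

    T in K, p in Pa; alpha in 1/K and beta_T in 1/Pa are assumed constant.
    """
    return rho0 * (1.0 - alpha * (T - T0) + beta_T * p)


rho_ref = density(273.15, 0.0)           # reference state
rho_hot = density(273.15 + 1000.0, 0.0)  # heating lowers the density
rho_deep = density(273.15, 1e10)         # compression raises the density
```

Setting `beta_T=0` gives the density used in the incompressible (Boussinesq-type) buoyancy term.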

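The full set of equations we wish to solve is given by

\begin{align}
-\nabla\cdot\left[2\eta\dot{\varepsilon}^d(v)\right] + \nabla p &= \rho_0\left(1 - \alpha(T - T_0) + \beta_T p\right) g \quad \text{in } \Omega \tag{253} \\
\nabla\cdot v + \frac{1}{\rho} v\cdot\nabla\rho &= 0 \quad \text{in } \Omega \tag{254} \\
\rho C_p\left(\frac{\partial T}{\partial t} + v\cdot\nabla T\right) - \nabla\cdot k\nabla T &= \rho H + 2\eta\dot{\varepsilon}^d:\dot{\varepsilon}^d + \alpha T\left(\frac{\partial p}{\partial t} + v\cdot\nabla p\right) \quad \text{in } \Omega \tag{255}
\end{align}

Note that this presupposes that the density is not zero anywhere in the domain.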

30.2 The numerics

We use a mixed formulation and therefore keep both velocity and pressure as unknowns. We end up having to solve the following system:

\[ \begin{pmatrix} K & G + W \\ G^T + Z & 0 \end{pmatrix} \cdot \begin{pmatrix} V \\ P \end{pmatrix} = \begin{pmatrix} f \\ h \end{pmatrix} \qquad \text{or,} \qquad A\cdot X = rhs \]

where K is the stiffness matrix, G is the discrete gradient operator, $G^T$ is the discrete divergence operator, V the velocity vector and P the pressure vector. Note that the term ZV derives from the term $v\cdot\nabla\rho$ in the continuity equation.

As explained in step-32 of deal.II (see footnote 13), we need to scale the G term since it is many orders of magnitude smaller than K, which introduces large inaccuracies in the solving process to the point that the solution is nonsensical. This scaling coefficient is $\eta/L$. After building the G block, it is then scaled as follows: $G' = \frac{\eta}{L} G$, so that we now solve

\[ \begin{pmatrix} K & G' + W \\ G'^T + Z & 0 \end{pmatrix} \cdot \begin{pmatrix} V \\ P' \end{pmatrix} = \begin{pmatrix} f \\ h \end{pmatrix} \]

After the solve phase, we recover the real pressure with $P = \frac{\eta}{L} P'$.

Footnote 13: https://www.dealii.org/9.0.0/doxygen/deal.II/step_32.html
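The effect of the scaling can be illustrated on a tiny system. The sketch below is not the fieldstone implementation: it uses a hypothetical 2x2 K block and 2x1 G block, sets W and Z to zero, and assumes that h is multiplied by the same factor $\eta/L$ so that the scaled system stays equivalent to the original one. All numerical values are made up for the illustration.

```python
def solve(a, b):
    """Gaussian elimination with partial pivoting for a small dense system."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for c in range(n):
        piv = max(range(c, n), key=lambda r: abs(m[r][c]))
        m[c], m[piv] = m[piv], m[c]
        for r in range(c + 1, n):
            fac = m[r][c] / m[c][c]
            for k in range(c, n + 1):
                m[r][k] -= fac * m[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (m[r][n] - sum(m[r][k] * x[k] for k in range(r + 1, n))) / m[r][r]
    return x


def stokes_matrix(K, G):
    """Assemble [[K, G], [G^T, 0]] for 2 velocity dofs and 1 pressure dof."""
    return [K[0] + [G[0][0]], K[1] + [G[1][0]], [G[0][0], G[1][0], 0.0]]


eta, L = 1e21, 1e6            # hypothetical viscosity (Pa.s) and length scale (m)
s = eta / L                   # scaling coefficient eta/L
K = [[4 * eta, 1 * eta], [1 * eta, 3 * eta]]
G = [[1e4], [2e4]]
f, h = [1e10, -2e10], [5.0]

# reference: unscaled system
X_ref = solve(stokes_matrix(K, G), f + h)
V_ref, P_ref = X_ref[:2], X_ref[2]

# scaled system: G' = s*G, h' = s*h, then P = s*P'
G_scaled = [[s * G[0][0]], [s * G[1][0]]]
X = solve(stokes_matrix(K, G_scaled), f + [s * h[0]])
V, P = X[:2], s * X[2]
```

In this sketch the scaled G block has entries of order 1e19, much closer to those of K (order 1e21) than the original 1e4, while the recovered velocity and pressure are unchanged.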

ToDo: adapt these notes, since W and Z should be scaled too; h should be scaled too!

Each block K, G, Z and the vectors f and h are built separately in the code and assembled into the matrix A and the vector rhs afterwards. A and rhs are then passed to the solver. We will see later that there are alternative approaches which do not require building the full Stokes matrix A.

Remark 1: the terms ZV and WP are often put in the rhs (i.e. added to h) so that the matrix A retains the same structure as in the incompressible case. This is indeed how it is implemented in ASPECT, see also appendix A of [82]. This however requires more work since the rhs then depends on the solution, so that some form of iteration is needed.

Remark 2: Very often the adiabatic heating term $\alpha T(v\cdot\nabla p)$ is simplified as follows: if you assume the vertical component of the gradient of the dynamic pressure to be small compared to the gradient of the total pressure (in other words, the gradient is dominated by the gradient of the hydrostatic pressure), then $-\rho g \simeq \nabla p$ and then $\alpha T(v\cdot\nabla p) \simeq -\alpha\rho T\, v\cdot g$. We will however not be using this approximation in what follows.

We have already established that

\[ \tau = C_\eta B V \]

The following measurements are carried out:

• The root mean square velocity (vrms):

\[ v_{rms} = \sqrt{\frac{1}{V}\int_V v^2\, dV} \]

• The average temperature (Tavrg):

\[ \langle T \rangle = \frac{1}{V}\int_V T\, dV \]

• The total mass (mass):

\[ M = \int_V \rho\, dV \]

• The Nusselt number (Nu):

\[ Nu = -\frac{1}{\Delta T}\int_0^{L_x} \frac{\partial T(x, y = L_y)}{\partial y}\, dx \]

• The kinetic energy (EK):

\[ E_K = \int_V \frac{1}{2}\rho v^2\, dV \]

• The work done against gravity:

\[ \langle W \rangle = -\int_V \rho g_y v_y\, dV \]

• The total viscous dissipation (visc diss):

\[ \langle\Phi\rangle = \int \Phi\, dV = \frac{1}{V}\int 2\eta\,\dot{\varepsilon}:\dot{\varepsilon}\, dV \]

• The gravitational potential energy (EG):

\[ E_G = \int_V \rho g_y (L_y - y)\, dV \]

• The internal thermal energy (ET):

\[ E_T = \int_V \rho_{(0)} C_p T\, dV \]

Remark 3: Measuring the total mass can be misleading: indeed because $\rho = \rho_0(1 - \alpha T)$, then measuring the total mass amounts to measuring a constant minus the volume-integrated temperature, and there is no reason why the latter should be zero, so that there is no reason why the total mass should be zero...!
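The volume integrals above reduce to weighted sums once the fields are known per element. The following sketch assumes one value per element (a midpoint rule), which is an assumption made for brevity; the quadrature actually used in the code may differ, and the function names are hypothetical.

```python
import math


def volume_integral(values, areas):
    """Midpoint-rule approximation of the integral of a per-element field."""
    return sum(v * a for v, a in zip(values, areas))


def vrms(u, v, areas):
    """Root mean square velocity sqrt(1/V * int (u^2 + v^2) dV)."""
    vol = sum(areas)
    speed2 = [ui * ui + vi * vi for ui, vi in zip(u, v)]
    return math.sqrt(volume_integral(speed2, areas) / vol)


def average(field, areas):
    """Volume average, e.g. <T> = 1/V * int T dV."""
    return volume_integral(field, areas) / sum(areas)


# four equal elements tiling a unit square, with uniform hypothetical fields
areas = [0.25] * 4
u, v = [3.0] * 4, [4.0] * 4
rho = [3000.0] * 4
vrms_value = vrms(u, v, areas)
total_mass = volume_integral(rho, areas)
kinetic_energy = volume_integral([0.5 * r * (a * a + b * b) for r, a, b in zip(rho, u, v)], areas)
```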

30.3 The experimental setup

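The setup is as follows: the domain is $L_x = L_y = 3000$ km. Free slip boundary conditions are imposed on all four sides. The initial temperature is given by:

\[ T(x,y) = \left(\frac{L_y - y}{L_y} - 0.01\cos\left(\frac{\pi x}{L_x}\right)\sin\left(\frac{\pi y}{L_y}\right)\right)\Delta T + T_{surf} \]

with $\Delta T = 4000$ K, $T_{surf} = T_0 = 273.15$ K. The temperature is set to $\Delta T + T_{surf}$ at the bottom and $T_{surf}$ at the top. We also set $k = 3$, $C_p = 1250$, $|g| = 10$, $\rho_0 = 3000$ and we keep the Rayleigh number Ra and dissipation number Di as input parameters:

\[ Ra = \frac{\alpha g \Delta T L^3 \rho_0^2 C_p}{\eta k}, \qquad Di = \frac{\alpha g L}{C_p} \]

From the second equation we get $\alpha = \frac{Di\, C_p}{gL}$, which we can insert in the first one:

\[ Ra = \frac{Di\, C_p^2 \Delta T L^2 \rho_0^2}{\eta k} \qquad \text{or,} \qquad \eta = \frac{Di\, C_p^2 \Delta T L^2 \rho_0^2}{Ra\, k} \]

For instance, for $Ra = 10^4$ and $Di = 0.75$, we obtain $\alpha \simeq 3\cdot 10^{-5}$ and $\eta \simeq 10^{25}$, which are quite reasonable values.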

30.4 Scaling

Following [77], we non-dimensionalize the equations using the reference values for density $\rho_r$, thermal expansivity $\alpha_r$, temperature contrast $\Delta T_r$ (refTemp), thermal conductivity $k_r$, heat capacity $C_p$, depth of the fluid layer $L$ and viscosity $\eta_r$. The reference values for velocity $u_r$, pressure $p_r$ and time $t_r$ become

\[ u_r = \frac{k_r}{\rho_r C_p L} \quad \text{(refvel)} \]

\[ p_r = \frac{\eta_r k_r}{\rho_r C_p L^2} \quad \text{(refpress)} \]

\[ t_r = \frac{\rho_r C_p L^2}{k_r} \quad \text{(reftime)} \]

In the case of the setup described above, and when choosing $Ra = 10^4$ and $Di = 0.5$, we get:

alphaT 2.083333e-05

eta 8.437500e+24

reftime 1.125000e+19

refvel 2.666667e-13

refPress 7.500000e+05
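The values listed above follow from the formulas of this section and of the experimental setup. The sketch below recomputes them; the function name and the dictionary keys are chosen here for illustration only, and the default arguments are the setup values quoted in the text (SI units).

```python
def reference_values(Ra, Di, L=3000e3, DeltaT=4000.0, k=3.0, Cp=1250.0,
                     g=10.0, rho0=3000.0):
    """Derive alpha and eta from (Ra, Di), then the reference scales."""
    alpha = Di * Cp / (g * L)                                      # from Di = alpha*g*L/Cp
    eta = Di * Cp ** 2 * DeltaT * L ** 2 * rho0 ** 2 / (Ra * k)    # from Ra
    return {
        "alphaT": alpha,
        "eta": eta,
        "reftime": rho0 * Cp * L ** 2 / k,
        "refvel": k / (rho0 * Cp * L),
        "refPress": eta * k / (rho0 * Cp * L ** 2),
    }


values = reference_values(Ra=1e4, Di=0.5)
for name, value in values.items():
    print(f"{name} {value:e}")
```

Calling `reference_values(Ra=1e4, Di=0.75)` instead gives the thermal expansivity and viscosity quoted in the experimental setup, about 3e-5 and 1e25 respectively.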


30.5 Conservation of energy 1

30.5.1 under BA and EBA approximations

Following [82], we take the dot product of the momentum equation with the velocity $v$ and integrate over the whole volume (see footnote 14):

\[ \int_V \left[-\nabla\cdot\tau + \nabla p\right]\cdot v\, dV = \int_V \rho g\cdot v\, dV \]

or,

\[ -\int_V (\nabla\cdot\tau)\cdot v\, dV + \int_V \nabla p\cdot v\, dV = \int_V \rho g\cdot v\, dV \]

Let us look at each block separately:

\[ -\int_V (\nabla\cdot\tau)\cdot v\, dV = -\int_S \underbrace{\tau\, v\cdot n}_{=0\ \text{(b.c.)}}\, dS + \int_V \tau:\nabla v\, dV = \int_V \tau:\dot{\varepsilon}\, dV = \int_V \Phi\, dV \]

which is the volume integral of the shear heating. Then,

\[ \int_V \nabla p\cdot v\, dV = \int_S p\, \underbrace{v\cdot n}_{=0\ \text{(b.c.)}}\, dS - \int_V \underbrace{\nabla\cdot v}_{=0\ \text{(incomp.)}}\, p\, dV = 0 \]

which is then zero in the case of an incompressible flow. And finally

\[ \int_V \rho g\cdot v\, dV = W \]

which is the work against gravity.

Conclusion for an incompressible fluid: we should have

\[ \int_V \Phi\, dV = \int_V \rho g\cdot v\, dV \tag{256} \]

This formula is hugely problematic: indeed, the term $\rho$ in the rhs is the full density. We know that to the value of $\rho_0$ corresponds a lithostatic pressure gradient $p_L = \rho_0 g y$. In this case one can write $\rho = \rho_0 + \rho'$ and $p = p_L + p'$ so that we also have

\[ \int_V \left[-\nabla\cdot\tau + \nabla p'\right]\cdot v\, dV = \int_V \rho' g\cdot v\, dV \]

which will ultimately yield

\[ \int_V \Phi\, dV = \int_V \rho' g\cdot v\, dV = \int_V (\rho - \rho_0) g\cdot v\, dV \tag{257} \]

Obviously Eqs. (256) and (257) cannot be true at the same time. The problem comes from the nature of the (E)BA approximation: $\rho = \rho_0$ in the mass conservation equation but it is not constant in the momentum conservation equation, which is of course inconsistent. Since the mass conservation equation is $\nabla\cdot v = 0$ under this approximation, the term $\int_V \nabla p\cdot v\, dV$ is always zero for any pressure (full pressure $p$, or overpressure $p - p_L$), hence the paradox. This paradox will be lifted when a consistent set of equations is used (compressible formulation). On a practical note, Eq. (256) is not verified by the code, while Eq. (257) is.

In the end:

\[ \underbrace{\int_V \Phi\, dV}_{\text{visc diss}} = \underbrace{\int_V (\rho - \rho_0) g\cdot v\, dV}_{\text{work grav}} \tag{258} \]

Footnote 14: Check: this is akin to looking at the power, force*velocity, says Arie.

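30.5.2 under no approximation at all

\begin{align}
\int_V \nabla p\cdot v\, dV &= \int_S p\, \underbrace{v\cdot n}_{=0\ \text{(b.c.)}}\, dS - \int_V \nabla\cdot v\; p\, dV = 0 \tag{259} \\
&= \int_V \frac{1}{\rho}\, v\cdot\nabla\rho\; p\, dV = 0 \tag{260}
\end{align}

ToDo: see section 3 of [82] where this is carried out with the Adams-Williamson eos.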

30.6 Conservation of energy 2

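Also, following the Reynolds transport theorem ([91], p. 210), we have for a property $A$ (per unit mass)

\[ \frac{d}{dt}\int_V A\rho\, dV = \int_V \frac{\partial}{\partial t}(A\rho)\, dV + \int_S A\rho\, v\cdot n\, dS \]

Let us apply this to $A = C_p T$ and compute the time derivative of the internal energy:

\[ \frac{d}{dt}\int_V \rho C_p T\, dV = \int_V \frac{\partial}{\partial t}(\rho C_p T)\, dV + \int_S A\rho\, \underbrace{v\cdot n}_{=0\ \text{(b.c.)}}\, dS = \underbrace{\int_V C_p T\frac{\partial\rho}{\partial t}\, dV}_{I} + \underbrace{\int_V \rho C_p\frac{\partial T}{\partial t}\, dV}_{II} \tag{262} \]

In order to expand I, the mass conservation equation will be used, while the heat transport equation will be used for II:

\begin{align}
I &= \int_V C_p T\frac{\partial\rho}{\partial t}\, dV = -\int_V C_p T\,\nabla\cdot(\rho v)\, dV = -\int_S C_p T\rho\, \underbrace{v\cdot n}_{=0\ \text{(b.c.)}}\, dS + \int_V \rho C_p \nabla T\cdot v\, dV \tag{263} \\
II &= \int_V \rho C_p\frac{\partial T}{\partial t}\, dV = \int_V \left[-\rho C_p v\cdot\nabla T + \nabla\cdot k\nabla T + \rho H + \Phi + \alpha T\left(\frac{\partial p}{\partial t} + v\cdot\nabla p\right)\right] dV \tag{264} \\
&= \int_V \left[-\rho C_p v\cdot\nabla T + \rho H + \Phi + \alpha T\left(\frac{\partial p}{\partial t} + v\cdot\nabla p\right)\right] dV + \int_V \nabla\cdot k\nabla T\, dV \tag{265} \\
&= \int_V \left[-\rho C_p v\cdot\nabla T + \rho H + \Phi + \alpha T\left(\frac{\partial p}{\partial t} + v\cdot\nabla p\right)\right] dV + \int_S k\nabla T\cdot n\, dS \tag{266} \\
&= \int_V \left[-\rho C_p v\cdot\nabla T + \rho H + \Phi + \alpha T\left(\frac{\partial p}{\partial t} + v\cdot\nabla p\right)\right] dV - \int_S q\cdot n\, dS \tag{267}
\end{align}

Finally:

\begin{align}
I + II = \frac{d}{dt}\int_V \rho C_p T\, dV &= \int_V \left[\rho H + \Phi + \alpha T\left(\frac{\partial p}{\partial t} + v\cdot\nabla p\right)\right] dV - \int_S q\cdot n\, dS \tag{268} \\
&= \int_V \rho H\, dV + \underbrace{\int_V \Phi\, dV}_{\text{visc diss}} + \underbrace{\int_V \alpha T\frac{\partial p}{\partial t}\, dV}_{\text{extra}} + \underbrace{\int_V \alpha T\, v\cdot\nabla p\, dV}_{\text{adiab heating}} - \underbrace{\int_S q\cdot n\, dS}_{\text{heatflux boundary}} \tag{269}
\end{align}

This was of course needlessly complicated as the term $\partial\rho/\partial t$ is always taken to be zero, so that $I = 0$ automatically. The mass conservation equation is then simply $\nabla\cdot(\rho v) = 0$. Then it follows that

\begin{align}
0 = -\int_V C_p T\,\nabla\cdot(\rho v)\, dV &= -\int_S C_p T\rho\, \underbrace{v\cdot n}_{=0\ \text{(b.c.)}}\, dS + \int_V \rho C_p\nabla T\cdot v\, dV \tag{270} \\
&= \int_V \rho C_p\nabla T\cdot v\, dV \tag{271}
\end{align}

so that the same term in Eq. (267) vanishes too, and then Eq. (269) is always valid, although one should be careful when computing $E_T$ in the BA and EBA cases as it should use $\rho_0$ and not $\rho$.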

30.7 The problem of the onset of convection

[wiki] In geophysics, the Rayleigh number is of fundamental importance: it indicates the presence and strength of convection within a fluid body such as the Earth's mantle. The mantle is a solid that behaves as a fluid over geological time scales.

The Rayleigh number essentially is an indicator of the type of heat transport mechanism. At low Rayleigh numbers conduction processes dominate over convection ones. At high Rayleigh numbers it is the other way around. There is a so-called critical value of the number which delineates the transition from one regime to the other.

This problem has been studied both theoretically and numerically [e.g. 137] and it was found that the critical Rayleigh number $Ra_c$ is

\[ Ra_c = (27/4)\pi^4 \simeq 657.5 \]

in setups similar to ours.

VERY BIG PROBLEM

The temperature setup is built as follows: $T_{surf}$ is prescribed at the top, $T_{surf} + \Delta T$ is prescribed at the bottom. The initial temperature profile is linear between these two values. In the case of BA, the actual value of $T_{surf}$ is of no consequence. However, for the EBA the full temperature is present in the adiabatic heating term on the rhs of the heat transport equation, and the value of $T_{surf}$ will therefore influence the solution greatly. This is very problematic as there is no real way to arrive at the surface temperature from the King paper. On top of this, the density uses a reference temperature $T_0$ which too will influence the solution without being present in the controlling Ra and Di numbers!

In light thereof, it will be very difficult to recover the values of King et al. for EBA!

features

• Q1×P0 element

• compressible flow

• mixed formulation

• Dirichlet boundary conditions (no-slip)

• isoviscous

• analytical solution

• pressure smoothing

Relevant literature: [11, 70, 126, 82, 77, 83, 87, 65]

ToDo:

• heat flux is at the moment elemental, so the Nusselt number and boundary heat flux measurements are not as accurate as they could be.

• implement steady state detection

• do $Ra = 10^5$ and $Ra = 10^6$

• velocity average at surface

• non dimensional heat flux at corners [15]

• depth-dependent viscosity (case 2 of [15])

30.8 results - BA - $Ra = 10^4$

These results were obtained with a 64x64 resolution, and CFL number of 1. Steady state was reached

after about 1250 timesteps.

[Figure, panels (a) to (m): time series as a function of time (Myr). Labelled panels: (d) <T>; (e) min(T) and max(T); (f) min and max of the velocity components u and v; (g) adiabatic heating; (h) viscous dissipation; (i) work against gravity; (j) heat flux; (k) vrms (cm/yr); (l) dETdt. The curves themselves and the labels of panels (a) to (c) and (m) are not recoverable from the extracted text.]

AH: adiabatic heating, VD: viscous dissipation, HF: heat ﬂux, WG: work against gravity

Eq.(269) is veriﬁed by (l) and Eq.(258) is veriﬁed by (m).

[Figure, panels (a) to (i): images not recoverable from the extracted text.]

30.9 results - BA - $Ra = 10^5$

These results were obtained with a 64x64 resolution, and CFL number of 1. Steady state was reached

after about 1250 timesteps.

[Figure, panels (a) to (m): time series as a function of time (Myr). Labelled panels: (d) <T>; (e) min(T) and max(T); (f) min and max of the velocity components u and v; (g) adiabatic heating; (h) viscous dissipation; (i) work against gravity; (j) heat flux; (k) vrms (cm/yr); (l) dETdt. The curves themselves and the labels of panels (a) to (c) and (m) are not recoverable from the extracted text.]

AH: adiabatic heating, VD: viscous dissipation, HF: heat ﬂux, WG: work against gravity

Eq.(269) is veriﬁed by (l) and Eq.(258) is veriﬁed by (m).

30.10 results - BA - $Ra = 10^6$

30.11 results - EBA - $Ra = 10^4$

These results were obtained with a 64x64 resolution, and CFL number of 1. Steady state was reached after about 2500 timesteps.

[Figure, panels (a) to (m): time series as a function of time (Myr). Labelled panels: (d) <T>; (e) min(T) and max(T); (f) min and max of the velocity components u and v; (g) adiabatic heating; (h) viscous dissipation; (i) work against gravity; (j) heat flux; (k) vrms (cm/yr); (l) AH+VD+HF and dETdt. The curves themselves and the labels of panels (a) to (c) and (m) are not recoverable from the extracted text.]

AH: adiabatic heating, VD: viscous dissipation, HF: heat ﬂux, WG: work against gravity

Eq.(269) is veriﬁed by (l) and Eq.(258) is veriﬁed by (m).

[Figure, panels a) to n): images not recoverable from the extracted text.]

30.12 results - EBA - $Ra = 10^5$

These results were obtained with a 64x64 resolution, and CFL number of 1. Simulation was stopped after

about 4300 timesteps.

[Figure, panels (a) to (m): time series as a function of time (Myr). Labelled panels: (d) <T>; (e) min(T) and max(T); (f) min and max of the velocity components u and v; (g) adiabatic heating; (h) viscous dissipation; (i) work against gravity; (j) heat flux; (k) vrms (cm/yr); (l) AH+VD+HF and dETdt. The curves themselves and the labels of panels (a) to (c) and (m) are not recoverable from the extracted text.]

AH: adiabatic heating, VD: viscous dissipation, HF: heat ﬂux, WG: work against gravity


30.13 Onset of convection

The code can be run for values of Ra between 500 and 1000, at various resolutions for the BA formulation. The value vrms(t) − vrms(0) is plotted as a function of Ra for the first 10 timesteps. If the vrms is found to decrease, then the Rayleigh number is not high enough to allow for convection and the initial temperature perturbation relaxes by diffusion (and then vrms(t) − vrms(0) < 0). If the vrms is found to increase, then vrms(t) − vrms(0) > 0 and the system is going to showcase convection. The zero value of vrms(t) − vrms(0) gives us the critical Rayleigh number, which is found between 775 and 790.
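Locating the zero of vrms(t) − vrms(0) as a function of Ra can be automated with a linear interpolation between the two bracketing runs. The sketch below is illustrative only: the function name is hypothetical and the sample data are made-up numbers of the same order of magnitude as the plotted trends, not results from the code.

```python
def critical_rayleigh(ra_values, trends):
    """Ra at which the trend vrms(t) - vrms(0) changes sign.

    ra_values must be sorted in increasing order; trends holds the
    corresponding vrms(t) - vrms(0). Returns None if no sign change is found.
    """
    pairs = list(zip(ra_values, trends))
    for (ra0, t0), (ra1, t1) in zip(pairs, pairs[1:]):
        if t0 <= 0.0 <= t1 and t1 != t0:
            # linear interpolation between the two bracketing runs
            return ra0 - t0 * (ra1 - ra0) / (t1 - t0)
    return None


# made-up trend values, for illustration only
ra = [770.0, 775.0, 780.0, 785.0, 790.0]
trend = [-3.0e-16, -2.0e-16, -1.0e-16, 0.5e-16, 1.5e-16]
ra_c = critical_rayleigh(ra, trend)
```

With these sample numbers the sign change falls between Ra = 780 and Ra = 785; refining the Ra sampling around the bracket narrows the estimate.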

[Figure: vrms trend, i.e. vrms(t) − vrms(0), as a function of Ra for resolutions 16x16, 32x32, 48x48 and 64x64; a second panel zooms on Ra between 770 and 790.]

Appendix: Looking for the right combination of parameters for the King benchmark.

I run a quadruple do loop over $L$, $\Delta T$, $\rho_0$ and $\eta_0$ between plausible values (see code targets.py) and write in a file only the combinations which yield the required Rayleigh and dissipation number values (down to 1% accuracy).

alpha=3e-5

g=10

hcapa=1250

hcond=3

DTmin=1000 ; DTmax=4000 ; DTnpts=251

Lmin=1e6 ; Lmax=3e6 ; Lnpts=251

rhomin=3000 ; rhomax=3500 ; rhonpts=41

etamin=19 ; etamax=25 ; etanpts=100

On the following plots the ’winning’ combinations of these four parameters are shown:

[Figure: scatter plots of the 'winning' parameter combinations; recoverable axis ranges are 0.24 to 0.26 against 9600 to 10400, 1x10^6 to 1.05x10^6 against 500 to 4500, and eta (1x10^23 to 9x10^23) against rho (2800 to 4200).]

We see that:

• the parameter $L$ (being to the 3rd power in the Ra number) cannot vary too much. Although it is varied between 1000 and 3000 km there seems to be a 'right' value at about 1040 km. (why?)

• viscosities are within $10^{23}$ and $10^{24}$, which are plausible values (although a bit high?).

• densities can be chosen freely between 3000 and 3500

• $\Delta T$ seems to be the most problematic value since it can range from 1000 to 4000 K ...
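A possible explanation for the narrow range of $L$: with $\alpha$, $g$ and $C_p$ fixed as in the parameter list above, the dissipation number $Di = \alpha g L / C_p$ depends on $L$ only, so the 1% tolerance on Di pins $L$ down regardless of the other three parameters. The sketch below assumes a target value Di = 0.25, which is inferred here from the 0.24 to 0.26 axis range of the plots and is not stated explicitly in the text.

```python
alpha, g, hcapa = 3e-5, 10.0, 1250.0   # fixed values from the parameter list
Di_target = 0.25                        # assumption, inferred from the plot axis

# Di = alpha*g*L/hcapa  =>  L = Di*hcapa/(alpha*g)
L = Di_target * hcapa / (alpha * g)
L_min, L_max = 0.99 * L, 1.01 * L       # 1% tolerance on Di

print(f"L = {L / 1e3:.1f} km, allowed range {L_min / 1e3:.1f} to {L_max / 1e3:.1f} km")
```

Under this assumption the admissible layer thickness is close to 1040 km, in line with the first observation in the list.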


31 fieldstone 25: Rayleigh-Taylor instability (1) - instantaneous

This numerical experiment was first presented in [138]. It consists of an isothermal Rayleigh-Taylor instability in a two-dimensional box of size $L_x = 0.9142$ and $L_y = 1$. Two Newtonian fluids are present in the system: the buoyant layer is placed at the bottom of the box and the interface between both fluids is given by $y(x) = 0.2 + 0.02\cos\left(\frac{\pi x}{L_x}\right)$. The bottom fluid is parametrised by its mass density $\rho_1$ and its viscosity $\mu_1$, while the layer above is parametrised by $\rho_2$ and $\mu_2$.

No-slip boundary conditions are applied at the bottom and at the top of the box while free-slip boundary conditions are applied on the sides.

In the original benchmark the system is run over 2000 units of dimensionless time and the timing and position of various upwellings/downwellings is monitored. In the present experiment only the root mean square velocity is measured at $t = 0$: the code is not yet equipped with any algorithm capable of tracking deformation.

The approach taken here differs from the ones presented in the extensive literature which showcases results of this benchmark: the mesh is initially fitted to the interface between the fluids and the resolution is progressively increased. This results in the following figure:

[Figure: vrms on logarithmic axes; the curves are not recoverable from the extracted text.]

The green line indicates results obtained with my code ELEFANT with grids up to 2000x2000 with the exact same methodology.

features

• Q1×P0 element

• incompressible flow

• mixed formulation

• isothermal

• numerical benchmark

32 fieldstone 26: Slab detachment benchmark (1) - instantaneous

As in [116], the computational domain is 1000 km × 660 km. No-slip boundary conditions are imposed on the sides of the system while free-slip boundary conditions are imposed at the top and bottom. Two materials are present in the domain: the lithosphere (mat.1) and the mantle (mat.2). The overriding plate (mat.1) is 80 km thick and is placed at the top of the domain. An already subducted slab (mat.1) of 250 km length hangs vertically under this plate. The mantle occupies the rest of the domain.

The mantle has a constant viscosity $\eta_0 = 10^{21}$ Pa.s and a density $\rho = 3150$ kg/m$^3$. The slab has a density $\rho = 3300$ kg/m$^3$ and is characterised by a power-law flow law so that its effective viscosity depends on the second invariant of the strain rate $I_2$ as follows:

\[ \eta_{eff} = \frac{1}{2} A^{-1/n_s} I_2^{1/n_s - 1} = \frac{1}{2}\left[(2\times 4.75\times 10^{11})^{-n_s}\right]^{-1/n_s} I_2^{1/n_s - 1} = 4.75\times 10^{11}\, I_2^{1/n_s - 1} = \eta_0\, I_2^{1/n_s - 1} \tag{272} \]

with $n_s = 4$ and $A = (2\times 4.75\times 10^{11})^{-n_s}$, or $\eta_0 = 4.75\times 10^{11}$.
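Equation (272) is easy to evaluate directly. The function below is a minimal sketch with a hypothetical name and signature; it only encodes the formula with the values of $n_s$ and $A$ given above, and leaves out any viscosity cut-offs a production code might apply.

```python
def slab_effective_viscosity(I2, ns=4.0, A=(2 * 4.75e11) ** (-4)):
    """Power-law effective viscosity eta_eff = 0.5 * A**(-1/ns) * I2**(1/ns - 1).

    I2 is the second invariant of the strain rate (must be > 0).
    """
    return 0.5 * A ** (-1.0 / ns) * I2 ** (1.0 / ns - 1.0)


eta_unit = slab_effective_viscosity(1.0)      # prefactor eta_0 of Eq. (272)
eta_slow = slab_effective_viscosity(1e-16)    # slower deformation, stiffer slab
eta_fast = slab_effective_viscosity(1e-14)    # faster deformation, weaker slab
```

Since the exponent $1/n_s - 1 = -3/4$ is negative, the effective viscosity decreases as the strain rate increases (strain-rate weakening).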

Fields at convergence for 151x99 grid.

[Figure: viscosity (Pa.s) as a function of x (km), over the full width and zoomed on 400 to 600 km, and as a function of y (km); normalised nonlinear residual with the 1e-7 level indicated; all for nelx = 32, 48, 51, 64, 100, 128 and 151.]

[Figure: exx, eyy and exy as a function of x (km), for the same resolutions.]

Along the horizontal line

[Figure: exx, eyy and exy as a function of y (km), for the same resolutions.]

Along the vertical line

features

• Q1×P0 element

• incompressible flow

• mixed formulation

• isothermal

• nonlinear rheology

• nonlinear residual

ToDo: nonlinear mantle, pressure normalisation

33 fieldstone 27: Consistent Boundary Flux

In what follows we will be re-doing the numerical experiments presented in Zhong et al. [147].

The first benchmark showcases a unit square domain with free slip boundary conditions prescribed on all sides. The resolution is fixed to 64 × 64 Q1×P0 elements. The flow is isoviscous and the buoyancy force $f$ is given by

\[ f_x = 0, \qquad f_y = \rho_0\alpha T(x,y) \]

with the temperature field given by

\[ T(x,y) = \cos(kx)\,\delta(y - y_0) \]

where $k = 2\pi/\lambda$ and $\lambda$ is a wavelength, and $y_0$ represents the location of the buoyancy strip. We set $g_y = -1$ and prescribe $\rho(x,y) = \rho_0\alpha\cos(kx)\,\delta(y - y_0)$ on the nodes of the mesh.

One can prove ([147] and refs. therein) that there is an analytic solution for the surface stress $\sigma_{yy}$ (see footnote 15):

\[ \frac{\sigma_{yy}}{\rho\alpha g h} = \frac{\cos(kx)}{\sinh^2(k)}\left[k(1 - y_0)\sinh(k)\cosh(k y_0) - k\sinh(k(1 - y_0)) + \sinh(k)\sinh(k y_0)\right] \]

We choose $\rho_0\alpha = 64$ and $\eta = 1$. Note that in this case the normalising coefficient of the stress is exactly 1 (since $h = L_x/nelx = 1/64$), so it is not implemented in the code. $\lambda$ is set to 1 and we explore $y_0 = 63/64$, $62/64$, $59/64$ and $32/64$. Under these assumptions the density field for $y_0 = 59/64$ is:

[Figure not recoverable from the extracted text.]

We can recover the stress at the boundary by computing the yy component of the stress tensor in the top row of elements:

\[ \sigma_{yy} = -p + 2\eta\dot{\varepsilon}_{yy} \]

Note that pressure is by definition elemental, and that strain rate components are then also computed in the middle of each element.

These elemental quantities can be projected onto the nodes (see section ??) by means of the C→N algorithm or a least square algorithm (LS).

[Figure: σyy along the top boundary (analytical, elemental, C-N, cbf) and the corresponding σyy error (elemental, C-N, cbf), for y0 = 32/64, 59/64, 62/64 and 63/64.]

Footnote 15: Note that in the paper the authors use ραg, which does not have the dimensions of a stress.

The consistent boundary flux (CBF) method allows us to compute traction vectors $t = \sigma\cdot n$ on the boundary of the domain. On the top boundary, $n = (0,1)$ so that $t = (\sigma_{xy}, \sigma_{yy})^T$ and $t_y$ is the quantity we need to consider and compare to other results.

In the following table are shown the results presented in [147] alongside the results obtained with Fieldstone:

Method                   | y0 = 63/64          | y0 = 62/64         | y0 = 59/64 (16)    | y0 = 32/64
Analytic solution        | 0.995476            | 0.983053           | 0.912506           | 0.178136
Pressure smoothing [147] | 1.15974             | 1.06498            | 0.911109           | n.a.
CBF [147]                | 0.994236            | 0.982116           | 0.912157           | n.a.
fieldstone: elemental    | 0.824554 (-17.17 %) | 0.978744 (-0.44 %) | 0.909574 (-0.32 %) | 0.177771 (-0.20 %)
fieldstone: nodal (C→N)  | 0.824554 (-17.17 %) | 0.978744 (-0.44 %) | 0.909574 (-0.32 %) | 0.177771 (-0.20 %)
fieldstone: LS           | 1.165321 (17.06 %)  | 1.070105 (8.86 %)  | 0.915496 (0.33 %)  | 0.178182 (0.03 %)
fieldstone: CBF          | 0.994236 (-0.13 %)  | 0.982116 (-0.10 %) | 0.912157 (-0.04 %) | 0.177998 (-0.08 %)

We see that we recover the published results with the same exact accuracy, thereby validating our implementation.

On the following ﬁgures are shown the velocity, pressure and traction ﬁelds for two cases y0= 32/64

and y0= 63/64.


Here lies the superiority of our approach over the one presented in the original article: our code

computes all traction vectors on all boundaries at once.

explain how Medge is arrived at!

compare with ASPECT ??!

gauss-lobatto integration?

pressure average on surface instead of volume ?

features

• $Q_1 \times P_0$ element

• incompressible flow

• mixed formulation

• isothermal

• isoviscous

• analytical solution

• pressure smoothing

• consistent boundary flux

34 fieldstone 28: convection 2D box - Tosi et al, 2015

This ﬁeldstone was developed in collaboration with Rens Elbertsen.

The viscosity field $\mu$ is calculated as the harmonic average between a linear part $\mu_{lin}$ that depends on temperature only or on temperature and depth $d$, and a non-linear, plastic part $\mu_{plast}$ that depends on the strain rate:

$$
\mu(T,z,\dot{\varepsilon}) = 2 \left( \frac{1}{\mu_{lin}(T,z)} + \frac{1}{\mu_{plast}(\dot{\varepsilon})} \right)^{-1}. \tag{273}
$$

The linear part is given by the linearized Arrhenius law (the so-called Frank-Kamenetskii approximation [?]):

$$
\mu_{lin}(T,z) = \exp(-\gamma_T T + \gamma_z z), \tag{274}
$$

where $\gamma_T = \ln(\Delta\mu_T)$ and $\gamma_z = \ln(\Delta\mu_z)$ are parameters controlling the total viscosity contrast due to temperature ($\Delta\mu_T$) and pressure ($\Delta\mu_z$). The non-linear part is given by [?]:

$$
\mu_{plast}(\dot{\varepsilon}) = \mu^* + \frac{\sigma_Y}{\sqrt{\dot{\varepsilon}:\dot{\varepsilon}}}, \tag{275}
$$

where $\mu^*$ is a constant representing the effective viscosity at high stresses [?] and $\sigma_Y$ is the yield stress, also assumed to be constant. In 2-D, the denominator in the second term of equation (275) is given explicitly by

$$
\sqrt{\dot{\varepsilon}:\dot{\varepsilon}} = \sqrt{\dot{\varepsilon}_{ij}\dot{\varepsilon}_{ij}} = \sqrt{ \left(\frac{\partial u_x}{\partial x}\right)^2 + \frac{1}{2}\left(\frac{\partial u_x}{\partial y} + \frac{\partial u_y}{\partial x}\right)^2 + \left(\frac{\partial u_y}{\partial y}\right)^2 }. \tag{276}
$$

The viscoplastic flow law (equation (273)) leads to linear viscous deformation at low stresses (equation (274)) and to plastic deformation for stresses that exceed $\sigma_Y$ (equation (275)), with the decrease in viscosity limited by the choice of $\mu^*$ [?].
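As an illustration, equations (273) to (276) can be evaluated pointwise as in the minimal sketch below. The function names, the argument list and the default parameter values (taken from the Case 4 row of the benchmark table) are assumptions made for this example and are not taken from the fieldstone code. The coordinate `z` is whatever coordinate enters equation (274). For a vanishing strain rate the plastic viscosity is infinite, so the sketch returns the limit $2\mu_{lin}$ of equation (273).

```python
import math

def strain_rate_norm(dux_dx, dux_dy, duy_dx, duy_dy):
    """sqrt(eps:eps) in 2-D from the velocity gradient components, equation (276)."""
    return math.sqrt(dux_dx**2 + 0.5 * (dux_dy + duy_dx)**2 + duy_dy**2)

def viscosity(T, z, e_norm, delta_mu_T=1e5, delta_mu_z=10.0, mu_star=1e-3, sigma_Y=1.0):
    """Effective viscosity of equation (273); all names here are hypothetical."""
    gamma_T = math.log(delta_mu_T)
    gamma_z = math.log(delta_mu_z)
    mu_lin = math.exp(-gamma_T * T + gamma_z * z)      # equation (274)
    if e_norm == 0.0:
        # mu_plast -> infinity, so equation (273) tends to 2*mu_lin
        return 2.0 * mu_lin
    mu_plast = mu_star + sigma_Y / e_norm              # equation (275)
    return 2.0 / (1.0 / mu_lin + 1.0 / mu_plast)       # equation (273)

if __name__ == "__main__":
    e = strain_rate_norm(1.0, 0.0, 0.0, -1.0)  # pure shear, divergence-free
    print(e, viscosity(T=0.5, z=0.5, e_norm=e))
```

Because of the harmonic average, the smaller of the two viscosities dominates the result, which is why the plastic branch only matters where the strain rate is large.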

In all cases that we present, the domain is a two-dimensional square box. The mechanical boundary conditions are for all boundaries free-slip with no flux across, i.e. $\tau_{xy} = \tau_{yx} = 0$ and $\vec{u}\cdot\vec{n} = 0$, where $\vec{n}$ denotes the outward normal to the boundary. Concerning the energy equation, the bottom and top boundaries are isothermal, with the temperature $T$ set to 1 and 0, respectively, while side-walls are assumed to be insulating, i.e. $\partial T/\partial x = 0$. The initial distribution of the temperature field is prescribed as follows:

$$
T(x,y) = (1-y) + A\cos(\pi x)\sin(\pi y), \tag{277}
$$

where $A = 0.01$ is the amplitude of the initial perturbation.
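The initial temperature of equation (277) is simple to code. The sketch below is only an illustration (the function name is hypothetical); it also shows that the perturbation vanishes on the top and bottom boundaries, so the isothermal boundary values 1 and 0 are respected at startup.

```python
import math

def initial_temperature(x, y, A=0.01):
    """Initial temperature of equation (277) in the unit square."""
    return (1.0 - y) + A * math.cos(math.pi * x) * math.sin(math.pi * y)

if __name__ == "__main__":
    for yy in (0.0, 0.5, 1.0):
        print(yy, initial_temperature(0.25, yy))
```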

The following table lists the benchmark cases and the parameters used.

Case   Ra     ∆µT    ∆µy   µ*      σY      Convective regime
1      10^2   10^5   1     –       –       Stagnant lid
2      10^2   10^5   1     10^-3   1       Mobile lid
3      10^2   10^5   10    –       –       Stagnant lid
4      10^2   10^5   10    10^-3   1       Mobile lid
5a     10^2   10^5   10    10^-3   4       Periodic
5b     10^2   10^5   10    10^-3   3 – 5   Mobile lid – Periodic – Stagnant lid

Benchmark cases and corresponding parameters.

In Cases 1 and 3 the viscosity is directly calculated from equation (274), while for Cases 2, 4, 5a, and 5b, we used equation (273). For a given mesh resolution, Case 5b requires running simulations with a yield stress varying between 3 and 5.

In all tests, the reference Rayleigh number is set at the surface ($y = 1$) to $10^2$, and the viscosity contrast due to temperature $\Delta\mu_T$ is $10^5$, implying a maximum effective Rayleigh number of $10^7$ for $T = 1$. Cases 3, 4, 5a, and 5b employ in addition a depth-dependent rheology with viscosity contrast $\Delta\mu_z = 10$. Cases 1 and 3 assume a linear viscous rheology that leads to a stagnant lid regime. Cases 2 and 4 assume a viscoplastic rheology that leads instead to a mobile lid regime. Case 5a also assumes a viscoplastic rheology but a higher yield stress, which ultimately causes the emergence of a strictly periodic regime. The setup of Case 5b is identical to that of Case 5a but the test consists in running several simulations using different yield stresses. Specifically, we varied $\sigma_Y$ between 3 and 5 in increments of 0.1 in order to identify the values of the yield stress corresponding to the transition from mobile to periodic and from periodic to stagnant lid regime.

34.0.1 Case 0: Newtonian case, a la Blankenbach et al., 1989

[Figures: time series for Case 0 (recovered axis labels: vrms, time).]

34.0.2 Case 1

In this case $\mu^* = 0$ and $\sigma_Y = 0$ so that $\mu_{plast}$ can be discarded. The CFL number is set to 0.5 and the viscosity is given by $\mu(T,z,\dot{\varepsilon}) = \mu_{lin}(T,z)$. Since $\Delta\mu_z = 1$, we have $\gamma_z = 0$ so that $\mu_{lin}(T,z) = \exp(-\gamma_T T)$.

[Figures for Case 1: vrms versus time for ELEFANT at resolutions 32x32, 48x48, 64x64 and 100x100 (annotated value 243.872); a second time series (annotated value 3.3987), also plotted against vrms; <T> versus time (annotated value 0.77368); profiles of viscosity and velocity.]

34.0.3 Case 2

[Figures for Case 2: vrms versus time (annotated value 141.518); a second time series (annotated value 8.17154), also plotted against vrms; <T> versus time (annotated value 0.60521); profiles of viscosity and velocity.]

34.0.4 Case 3

[Figures for Case 3: vrms versus time (annotated value 100.018); a second time series (annotated value 3.03472), also plotted against vrms; <T> versus time (annotated value 0.7277); profiles of viscosity and velocity.]

34.0.5 Case 4

[Figures for Case 4: vrms versus time (annotated value 79.4746); a second time series (annotated value 6.40359), also plotted against vrms; <T> versus time (annotated value 0.528634); profiles of viscosity and velocity.]

34.0.6 Case 5

[Figures for Case 5: vrms versus time (annotated values 41.61 and 98.183); a second time series (annotated values 2.6912 and 7.0763), also plotted against vrms; <T> versus time (annotated values 0.66971 and 0.65206); min(u) and max(u) versus time; min(v) and max(v) versus time.]

35 fieldstone 29: open boundary conditions

In what follows we will investigate the use of the so-called open boundary conditions in the very simple

context of a 2D Stokes sphere experiment.

We start with a domain without the sphere. Essentially, it is what people would call an aquarium.

Free slip boundary conditions are prescribed on the sides and no-slip conditions at the bottom. The top

surface is left free. The ﬂuid has a density ρ0= 1 and a viscosity η0= 1. In the absence of any density

diﬀerence in the domain there is no active buoyancy force so that we expect a zero velocity ﬁeld and a

lithostatic pressure ﬁeld. This is indeed what we recover:

If we now implement a sphere parametrised by its density $\rho_s = \rho_0 + 1$, its viscosity $\eta_s = 10^3$ and its radius $R_s = 0.123$ in the middle of the domain, we see a clear velocity field which logically shows the sphere falling downward and a symmetric return flow of the fluid on each side:

Unfortunately it has been widely documented that the presence of free-slip boundary conditions

aﬀects the evolution of subduction [31], even when these are placed rather far from the subduction zone.

A proposed solution to this problem is the use of ’open boundary conditions’ which are in fact stress

boundary conditions. The main idea is to prescribe a stress on the lateral boundaries (instead of free

slip) so that it balances out exactly the existing lithostatic pressure inside the domain along the side

walls. Only pressure deviations with respect to the lithostatic are responsible for ﬂow and such boundary

conditions allow ﬂow across the boundaries.

We need the lithostatic pressure and compute it beforehand (which is trivial in our case but can prove to be more tedious in real-life situations, for instance when density varies in the domain as a function of temperature and/or pressure).

    # lithostatic pressure at each of the nnp nodes (zero at the top y=Ly)
    plith = np.zeros(nnp, dtype=np.float64)
    for i in range(0, nnp):
        plith[i] = (Ly - y[i]) * rho0 * abs(gy)

Let us start with a somewhat pathological case: even in the absence of the sphere, what happens when

no boundary conditions are prescribed on the sides? The answer is simple: think about an aquarium

without side walls, or a broken dam. The velocity ﬁeld indeed shows a complete collapse of the ﬂuid left

and right of the bottom.


Let us then continue (still with no sphere) but let us now switch on the open boundary conditions.

Since the side boundary conditions match the lithostatic pressure we expect no ﬂow at all in the absence

of any density perturbation in the system. This is indeed what is recovered:

Finally, let us reintroduce the sphere. This time ﬂow is allowed through the left and right side

boundaries:

Finally, although horizontal velocity Dirichlet boundary conditions and open boundary conditions

are not compatible, the same is not true for the vertical component of the velocity: the open b.c.

implementation acts on the horizontal velocity dofs only, so that one can ﬁx the vertical component to

zero, as is shown hereunder:

We indeed see that the in/outﬂow on the sides is perpendicular to the boundaries.

Turning now to the actual implementation, we see that it is quite trivial, since all element edges on the side boundaries are vertical and all have the same vertical dimension hy. Since we use a Q0 approximation for the pressure we need to prescribe a single pressure value in the middle of the element. Finally, because of the sign of the normal vector projection onto the x-axis, we obtain:

    if open_bc_left and x[icon[0,iel]] < eps:  # left side
        # mean lithostatic pressure along the left edge (nodes 0 and 3)
        pmid = 0.5 * (plith[icon[0,iel]] + plith[icon[3,iel]])
        f_el[0] += 0.5 * hy * pmid
        f_el[6] += 0.5 * hy * pmid
    if open_bc_right and x[icon[1,iel]] > Lx - eps:  # right side
        # mean lithostatic pressure along the right edge (nodes 1 and 2)
        pmid = 0.5 * (plith[icon[1,iel]] + plith[icon[2,iel]])
        f_el[2] -= 0.5 * hy * pmid
        f_el[4] -= 0.5 * hy * pmid

These few lines of code are added after the elemental matrices and rhs are built, and before the application of other Dirichlet boundary conditions, and assembly.

features

• $Q_1 \times P_0$ element

• incompressible flow

• mixed formulation

• open boundary conditions

• isoviscous

36 fieldstone 30: conservative velocity interpolation

In this fieldstone the Stokes equations are not solved. It is a 2D implementation of the cvi algorithm introduced in [139], which deals with the advection of markers. $Q_1$ and $Q_2$ basis functions are used and in both cases the cvi algorithm can be toggled on/off. Markers can be distributed regularly or randomly at startup.

Three velocity fields are prescribed on the mesh:

• the so-called Couette flow of [139]

• the SolCx solution [1]

• a flow created by means of a stream function (see fieldstone 32)

36.1 Couette ﬂow

36.2 SolCx

36.3 Streamline ﬂow

[Figure: nmarker versus time. Curves: Q1, no cvi, RK1; Q1, cvi, RK1; Q2, no cvi, RK1; Q2, cvi, RK1; Q1, no cvi, RK1, **.]

[Figure: nmarker versus time. Curves: Q1, no cvi, RK2; Q1, cvi, RK2; Q1, no cvi, RK4; Q1, no cvi, RK5; Q2, no cvi, RK2; Q2, no cvi, RK4.]

In this case the RK order seems to be more important than cvi.

Explore why ?!

features

• $Q_1 \times P_0$ element

• incompressible flow

• penalty formulation

• Dirichlet boundary conditions (free-slip)

• direct solver

• isothermal

• non-isoviscous

• analytical solution

37 fieldstone 31: conservative velocity interpolation 3D


38 fieldstone 32: 2D analytical sol. from stream function

38.1 Background theory

The stream function is a function of coordinates and time of an inviscid liquid. It allows one to determine the components of velocity by differentiating the stream function with respect to the space coordinates. A family of curves $\Psi = const$ represents streamlines, i.e. the stream function remains constant along a streamline. Although also valid in 3D, this approach is mostly used in 2D because of its relative simplicity [REFERENCES].

In two dimensions the velocity is obtained as follows:

$$
\vec{v} = (u,v) = \left( \frac{\partial\Psi}{\partial y}, -\frac{\partial\Psi}{\partial x} \right) \tag{278}
$$

Provided the function $\Psi$ is smooth enough, this automatically ensures that the flow is incompressible:

$$
\vec\nabla\cdot\vec{v} = \frac{\partial u}{\partial x} + \frac{\partial v}{\partial y} = \frac{\partial^2\Psi}{\partial x\partial y} - \frac{\partial^2\Psi}{\partial x\partial y} = 0 \tag{279}
$$

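Assuming constant viscosity, the Stokes equation writes:

$$
-\vec\nabla p + \mu\Delta\vec{v} + \rho\vec{g} = \vec{0} \tag{280}
$$

Let us introduce the vector $\vec{W}$ for convenience such that in each dimension:

$$
W_x = -\frac{\partial p}{\partial x} + \mu\left( \frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2} \right) = -\rho g_x \tag{281}
$$

$$
W_y = -\frac{\partial p}{\partial y} + \mu\left( \frac{\partial^2 v}{\partial x^2} + \frac{\partial^2 v}{\partial y^2} \right) = -\rho g_y \tag{282}
$$

Taking the curl of the vector $\vec{W}$ and only considering the component perpendicular to the $xy$-plane:

$$
\frac{\partial W_y}{\partial x} - \frac{\partial W_x}{\partial y} = -\frac{\partial \rho g_y}{\partial x} + \frac{\partial \rho g_x}{\partial y} \tag{283}
$$

The advantage of this approach is that the pressure terms cancel out (the curl of a gradient is always zero), so that:

$$
\frac{\partial}{\partial x}\left[ \mu\left( \frac{\partial^2 v}{\partial x^2} + \frac{\partial^2 v}{\partial y^2} \right) \right] - \frac{\partial}{\partial y}\left[ \mu\left( \frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2} \right) \right] = -\frac{\partial \rho g_y}{\partial x} + \frac{\partial \rho g_x}{\partial y} \tag{284}
$$

Replacing $u, v$ by their stream function derivatives turns the left-hand side into $-\mu\nabla^4\Psi$, which yields (for a constant viscosity):

$$
\mu\left( \frac{\partial^4\Psi}{\partial x^4} + \frac{\partial^4\Psi}{\partial y^4} + 2\frac{\partial^4\Psi}{\partial x^2\partial y^2} \right) = \frac{\partial \rho g_y}{\partial x} - \frac{\partial \rho g_x}{\partial y} \tag{285}
$$

or,

$$
\mu\nabla^4\Psi = \mu\left( \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} \right)\left( \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} \right)\Psi = \frac{\partial \rho g_y}{\partial x} - \frac{\partial \rho g_x}{\partial y} \tag{286}
$$

These equations are also to be found in the geodynamics literature, see Eq. 1.43 of Tackley book, p 70-71 of Gerya book.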

38.2 A simple application

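I wish to arrive at an analytical formulation for a 2D incompressible flow in the square domain $[-1:1]\times[-1:1]$. The fluid has constant viscosity $\mu = 1$ and is subject to free slip boundary conditions on all sides. For reasons that will become clear in what follows I postulate the following stream function:

$$
\Psi(x,y) = \sin(m\pi x)\sin(n\pi y) \tag{287}
$$

We have the velocity being defined as:

$$
\vec{v} = (u,v) = \left( \frac{\partial\Psi}{\partial y}, -\frac{\partial\Psi}{\partial x} \right) = \left( n\pi\sin(m\pi x)\cos(n\pi y), -m\pi\cos(m\pi x)\sin(n\pi y) \right) \tag{288}
$$

The strain rate components are then:

\begin{align}
\dot{\varepsilon}_{xx} &= \frac{\partial u}{\partial x} = mn\pi^2\cos(m\pi x)\cos(n\pi y) \tag{289} \\
\dot{\varepsilon}_{yy} &= \frac{\partial v}{\partial y} = -mn\pi^2\cos(m\pi x)\cos(n\pi y) \tag{290} \\
2\dot{\varepsilon}_{xy} &= \frac{\partial u}{\partial y} + \frac{\partial v}{\partial x} \tag{291} \\
&= \frac{\partial^2\Psi}{\partial y^2} - \frac{\partial^2\Psi}{\partial x^2} \tag{292} \\
&= -n^2\pi^2\Psi + m^2\pi^2\Psi \tag{293} \\
&= (m^2-n^2)\pi^2\sin(m\pi x)\sin(n\pi y) \tag{294}
\end{align}

Note that if $m = n$ the last term is identically zero, which is not desirable (the flow is too 'simple'), so in what follows we will prefer $m \neq n$.

It is also easy to verify that $u = 0$ on the sides and $v = 0$ at the top and bottom and that the term $\dot{\varepsilon}_{xy}$ is zero on all four sides, thereby guaranteeing free slip.

Our choice of stream function yields:

$$
\nabla^4\Psi = \frac{\partial^4\Psi}{\partial x^4} + \frac{\partial^4\Psi}{\partial y^4} + 2\frac{\partial^4\Psi}{\partial x^2\partial y^2} = \pi^4(m^4\Psi + n^4\Psi + 2m^2n^2\Psi) = (m^4+n^4+2m^2n^2)\pi^4\Psi
$$

We assume $g_x = 0$ and $g_y = -1$ so that we simply have

$$
(m^4+n^4+2m^2n^2)\pi^4\Psi = -\frac{\partial\rho}{\partial x} \tag{295}
$$

so that (assuming the integration constant to be zero):

$$
\rho(x,y) = \frac{m^4+n^4+2m^2n^2}{m}\pi^3\cos(m\pi x)\sin(n\pi y)
$$

The $x$-component of the momentum equation is

$$
-\frac{\partial p}{\partial x} + \frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2} = -\frac{\partial p}{\partial x} - m^2n\pi^3\sin(m\pi x)\cos(n\pi y) - n^3\pi^3\sin(m\pi x)\cos(n\pi y) = 0
$$

so

$$
\frac{\partial p}{\partial x} = -(m^2n+n^3)\pi^3\sin(m\pi x)\cos(n\pi y)
$$

and the pressure field is then (once again neglecting the integration constant):

$$
p(x,y) = \frac{m^2n+n^3}{m}\pi^2\cos(m\pi x)\cos(n\pi y)
$$

Note that in this case $\int p\, dV = 0$ so that volume normalisation of the pressure is turned on (when free slip boundary conditions are prescribed on all sides the pressure is known up to a constant and this indeterminacy can be lifted by adding an additional constraint to the pressure field).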
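A quick way to gain confidence in such a manufactured solution is to plug it back into the equations numerically. The sketch below is an illustration only: the function names and the finite-difference step are choices made here. It assumes the Stokes equation in the form $-\vec\nabla p + \Delta\vec{v} + \rho\vec{g} = \vec{0}$ with $\mu = 1$, $g_x = 0$, $g_y = -1$, and takes the pressure as $\frac{m^2n+n^3}{m}\pi^2\cos(m\pi x)\cos(n\pi y)$, which is what integrating $\partial p/\partial x$ gives. The three residuals should then be small, of the order of the truncation error of the central differences.

```python
import math

def fields(x, y, m=1, n=2):
    """Return (u, v, p, rho) of the stream-function solution at (x, y)."""
    pi = math.pi
    u = n * pi * math.sin(m * pi * x) * math.cos(n * pi * y)
    v = -m * pi * math.cos(m * pi * x) * math.sin(n * pi * y)
    p = (m * m * n + n**3) / m * pi**2 * math.cos(m * pi * x) * math.cos(n * pi * y)
    rho = (m * m + n * n)**2 / m * pi**3 * math.cos(m * pi * x) * math.sin(n * pi * y)
    return u, v, p, rho

def residuals(x, y, m=1, n=2, h=1e-3):
    """Central-difference residuals of continuity and of both momentum equations."""
    def f(k, xx, yy):
        return fields(xx, yy, m, n)[k]
    def ddx(k):
        return (f(k, x + h, y) - f(k, x - h, y)) / (2 * h)
    def ddy(k):
        return (f(k, x, y + h) - f(k, x, y - h)) / (2 * h)
    def lap(k):
        return (f(k, x + h, y) + f(k, x - h, y) + f(k, x, y + h)
                + f(k, x, y - h) - 4 * f(k, x, y)) / h**2
    rho = f(3, x, y)
    div = ddx(0) + ddy(1)              # incompressibility
    mom_x = -ddx(2) + lap(0)           # gx = 0
    mom_y = -ddy(2) + lap(1) - rho     # rho * gy = -rho
    return div, mom_x, mom_y

if __name__ == "__main__":
    print(residuals(0.3, -0.2, m=1, n=2))
```

The residuals are to be compared with the size of the individual terms, which is of order $\pi^3 (m^2+n^2)^2/m$.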

Top to bottom: velocity components $u$ and $v$, pressure $p$, density $\rho$ and strain rate $\dot{\varepsilon}_{xy}$. From left to right: $(m,n) = (1,1)$, $(m,n) = (1,2)$, $(m,n) = (2,1)$, $(m,n) = (2,2)$.

Errors for velocity and pressure for $(m,n) = (1,1), (1,2), (2,1), (2,2)$.

Velocity arrows for $(m,n) = (2,1)$.

39 fieldstone 33: Convection in an annulus

This ﬁeldstone was developed in collaboration with Rens Elbertsen.

This is based on the community benchmark for viscoplastic thermal convection in a 2D square box [135] as already carried out in section 34.

In this experiment the geometry is an annulus of inner radius $R_1 = 1.22$ and outer radius $R_2 = 2.22$. The rheology and buoyancy forces are identical to those of the box experiment. The initial temperature is now given by:

$$
T(r,\theta) = T_c(r) + A\, s(1-s)\cos(N_0\theta), \qquad s = \frac{R_2 - r}{R_2 - R_1} \in [0,1]
$$

where $s$ is the normalised depth, $A$ is the amplitude of the perturbation and $N_0$ the number of lobes. In this equation $T_c(r)$ stands for the steady state purely conductive temperature solution, which is obtained by solving Laplace's equation in polar coordinates (all terms in $\theta$ are dropped because of radial symmetry) supplemented with two boundary conditions:

$$
\Delta T_c = \frac{1}{r}\frac{\partial}{\partial r}\left( r\frac{\partial T_c}{\partial r} \right) = 0, \qquad T_c(r=R_1) = T_1 = 1, \qquad T_c(r=R_2) = T_2 = 0
$$

We obtain

$$
T_c(r) = \frac{\log(r/R_2)}{\log(R_1/R_2)}
$$

Note that this profile differs from the straight line that is used in [135] and in section 34.
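The initial temperature field in the annulus can be sketched as follows. The function names are hypothetical, and the default amplitude `A=0.01` is borrowed from the box experiment since no value is stated for the annulus; `N0=3` is one of the lobe counts shown in the figure.

```python
import math

R1, R2 = 1.22, 2.22  # inner and outer radii used in this experiment

def conductive_temperature(r):
    """Steady-state conductive profile Tc(r) with Tc(R1)=1 and Tc(R2)=0."""
    return math.log(r / R2) / math.log(R1 / R2)

def initial_temperature(r, theta, A=0.01, N0=3):
    """Conductive profile plus a perturbation with N0 lobes (A, N0 are assumed defaults)."""
    s = (R2 - r) / (R2 - R1)  # normalised depth, 0 at the surface, 1 at the bottom
    return conductive_temperature(r) + A * s * (1.0 - s) * math.cos(N0 * theta)

if __name__ == "__main__":
    for r in (R1, 0.5 * (R1 + R2), R2):
        print(r, initial_temperature(r, 0.0))
```

Since the factor $s(1-s)$ vanishes on both boundaries, the perturbation does not disturb the prescribed boundary temperatures.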

Examples of initial temperature fields for $N_0 = 3, 5, 11$.

Boundary conditions can be either no-slip or free-slip on both inner and outer boundary. However,

when free-slip is used on both a velocity null space exists and must be ﬁltered out. In other words, the

solver may be able to come up with a solution to the Stokes operator, but that solution plus an arbitrary

rotation is also an equally valid solution. This additional velocity ﬁeld can be problematic since it is used

for advecting temperature (and/or compositions) and it also essentially determines the time step value

for a chosen mesh size (CFL condition).

For these reasons the nullspace must be removed from the obtained solution after every timestep.

There are two types of nullspace removal: removing net angular momentum, and removing net rotations.
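As a rough illustration of such a filter, the sketch below removes a rigid-body rotation from a discrete 2D velocity field. Everything in it is an assumption made for the example: the function name, the use of plain lists, and the nodal weights `w` (for instance nodal areas). With weights that do not include density it computes the angular velocity $\omega$ of the best-fitting rigid rotation about the origin and subtracts it; including density in the weights would give the angular momentum variant instead.

```python
def remove_rigid_rotation(x, y, u, v, w):
    """Subtract the rigid rotation about the origin from the velocity (u, v).

    x, y : nodal coordinates; u, v : nodal velocities; w : nodal weights.
    Returns the corrected velocities and the angular velocity that was removed.
    """
    # discrete analogue of the integral of (r x v) and of r^2 over the domain
    L = sum(wi * (xi * vi - yi * ui) for xi, yi, ui, vi, wi in zip(x, y, u, v, w))
    I = sum(wi * (xi * xi + yi * yi) for xi, yi, wi in zip(x, y, w))
    omega = L / I
    # a rigid rotation has velocity (-omega*y, omega*x)
    u_new = [ui + omega * yi for ui, yi in zip(u, y)]
    v_new = [vi - omega * xi for vi, xi in zip(v, x)]
    return u_new, v_new, omega

if __name__ == "__main__":
    xs = [1.5, 0.0, -1.5, 0.0]
    ys = [0.0, 1.5, 0.0, -1.5]
    # radial flow plus a rotation with angular velocity 2
    us = [xi - 2.0 * yi for xi, yi in zip(xs, ys)]
    vs = [yi + 2.0 * xi for xi, yi in zip(xs, ys)]
    print(remove_rigid_rotation(xs, ys, us, vs, [1.0] * 4))
```

After the correction the remaining field carries no net rotation in the weighted sense used above, so it no longer affects the advection or the CFL time step.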

We calculate the following output parameters:

• the average temperature $\langle T \rangle$

$$
\langle T \rangle = \frac{\int_\Omega T\, d\Omega}{\int_\Omega d\Omega} = \frac{1}{V_\Omega}\int_\Omega T\, d\Omega \tag{296}
$$

• the root mean square velocity $v_{rms}$ as given by equation (27).

• the root mean square of the radial and tangential velocity components as given by equations (29) and (30).

• the heat transfer through both boundaries $Q$:

$$
Q_{inner,outer} = \int_{\Gamma_{i,o}} \vec{q}\cdot\vec{n}\, d\Gamma \tag{297}
$$

• the Nusselt number at both boundaries $Nu$ as given by equations (32) and (33).

• the power spectrum of the temperature field:

$$
PS_n(T) = \int_\Omega T(r,\theta)\, e^{in\theta}\, d\Omega. \tag{298}
$$

features

• $Q_1 \times P_0$ element

• incompressible flow

• penalty formulation

• Dirichlet boundary conditions

• non-isothermal

• non-isoviscous

• annulus geometry

40 fieldstone 34: the Cartesian geometry elastic aquarium

This ﬁeldstone was developed in collaboration with Lukas van de Wiel.

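The setup is as follows: a 2D square of elastic material of size $L$ is subjected to the following boundary conditions: free slip on the sides, no slip at the bottom and free at the top. It has a density $\rho$ and is placed in a gravity field $\vec{g} = -g\vec{e}_y$. For an isotropic elastic medium the stress tensor is given by:

$$
\sigma = \lambda(\vec\nabla\cdot\vec{u})\mathbf{1} + 2\mu\varepsilon
$$

where $\lambda$ is the Lamé parameter and $\mu$ is the shear modulus. The displacement field is $\vec{u} = (0, u_y(y))$ because of symmetry reasons (we do not expect any of the dynamic quantities to depend on the $x$ coordinate and also expect the horizontal displacement to be exactly zero). The displacement divergence is then $\vec\nabla\cdot\vec{u} = \partial u_y/\partial y$ and the strain tensor:

$$
\varepsilon = \begin{pmatrix} 0 & 0 \\ 0 & \frac{\partial u_y}{\partial y} \end{pmatrix}
$$

so that the stress tensor is:

$$
\sigma = \begin{pmatrix} \lambda\frac{\partial u_y}{\partial y} & 0 \\ 0 & (\lambda+2\mu)\frac{\partial u_y}{\partial y} \end{pmatrix}
$$

$$
\vec\nabla\cdot\sigma = (\partial_x \;\; \partial_y)\cdot\begin{pmatrix} \lambda\frac{\partial u_y}{\partial y} & 0 \\ 0 & (\lambda+2\mu)\frac{\partial u_y}{\partial y} \end{pmatrix} = \begin{pmatrix} 0 \\ (\lambda+2\mu)\frac{\partial^2 u_y}{\partial y^2} \end{pmatrix} = \begin{pmatrix} 0 \\ \rho g \end{pmatrix}
$$

so that the vertical displacement is then given by:

$$
u_y(y) = \frac{1}{2}\frac{\rho g}{\lambda+2\mu}y^2 + \alpha y + \beta
$$

where $\alpha$ and $\beta$ are two integration constants. We now need to use the two boundary conditions: the first one states that the displacement is zero at the bottom, i.e. $u_y(y=0) = 0$, which immediately implies $\beta = 0$. The second states that the stress at the top is zero (free surface), which implies that $\partial u_y/\partial y\,(y=L) = 0$, which allows us to compute $\alpha = -\rho g L/(\lambda+2\mu)$. Finally:

$$
u_y(y) = \frac{\rho g}{\lambda+2\mu}\left( \frac{y^2}{2} - Ly \right)
$$

The pressure is given by

$$
p = -\left(\lambda+\frac{2}{3}\mu\right)\vec\nabla\cdot\vec{u} = \left(\lambda+\frac{2}{3}\mu\right)\frac{\rho g}{\lambda+2\mu}(L-y) = \frac{\lambda+\frac{2}{3}\mu}{\lambda+2\mu}\rho g(L-y) = \frac{1+\frac{2\mu}{3\lambda}}{1+2\mu/\lambda}\rho g(L-y)
$$

In the incompressible limit, the Poisson ratio is $\nu \sim 0.5$. Materials are characterised by a finite Young's modulus $E$, which is related to $\nu$ and $\lambda$:

$$
\lambda = \frac{E\nu}{(1+\nu)(1-2\nu)}, \qquad \mu = \frac{E}{2(1+\nu)}
$$

It is then clear that for incompressible parameters $\lambda$ becomes infinite while $\mu$ remains finite. In that case the pressure then logically converges to the well known formula:

$$
p = \rho g(L-y)
$$

In what follows we set $L = 1000$ m, $\rho = 2800$, $\nu = 0.25$, $E = 6\cdot 10^{10}$, $g = 9.81$.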
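The analytical solution is easy to evaluate for the parameter values listed above. The sketch below is an illustration only; the function names are made up for the example, and SI units are assumed for the quantities that are given without units.

```python
L, rho, nu, E, g = 1000.0, 2800.0, 0.25, 6e10, 9.81  # SI units assumed

# Lame parameter and shear modulus from Young's modulus and Poisson ratio
lam = E * nu / ((1.0 + nu) * (1.0 - 2.0 * nu))
mu = E / (2.0 * (1.0 + nu))

def displacement(y):
    """Vertical displacement u_y(y): zero at the bottom, zero slope at the top."""
    return rho * g / (lam + 2.0 * mu) * (0.5 * y * y - L * y)

def pressure(y):
    """Pressure p(y) = -(lambda + 2 mu / 3) * div(u)."""
    return (lam + 2.0 * mu / 3.0) / (lam + 2.0 * mu) * rho * g * (L - y)

if __name__ == "__main__":
    print("lambda =", lam, "mu =", mu)
    print("surface displacement =", displacement(L))
    print("bottom pressure =", pressure(0.0), "lithostatic =", rho * g * L)
```

For $\nu = 0.25$ the two elastic parameters are equal, so the bottom pressure is a fraction $5/9$ of the lithostatic value $\rho g L$; the ratio only tends to one as $\nu \to 0.5$.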

[Figure: velocity and pressure errors.]

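41 fieldstone 35: 2D analytical sol. in annulus from stream function

We seek an exact solution to the incompressible Stokes equations for an isoviscous, isothermal fluid in an annulus. Given the geometry of the problem, we work in polar coordinates. We denote the orthonormal basis vectors by $\vec{e}_r$ and $\vec{e}_\theta$, the inner radius of the annulus by $R_1$ and the outer radius by $R_2$. Further, we assume that the viscosity $\mu$ is constant, which we set to $\mu = 1$, and we set the gravity vector to $\vec{g} = -g_r\vec{e}_r$ with $g_r = 1$.

Given these assumptions, the incompressible Stokes equations in the annulus are [119]

\begin{align}
A_r &= \frac{\partial^2 v_r}{\partial r^2} + \frac{1}{r}\frac{\partial v_r}{\partial r} + \frac{1}{r^2}\frac{\partial^2 v_r}{\partial\theta^2} - \frac{v_r}{r^2} - \frac{2}{r^2}\frac{\partial v_\theta}{\partial\theta} - \frac{\partial p}{\partial r} = \rho g_r \tag{299} \\
A_\theta &= \frac{\partial^2 v_\theta}{\partial r^2} + \frac{1}{r}\frac{\partial v_\theta}{\partial r} + \frac{1}{r^2}\frac{\partial^2 v_\theta}{\partial\theta^2} + \frac{2}{r^2}\frac{\partial v_r}{\partial\theta} - \frac{v_\theta}{r^2} - \frac{1}{r}\frac{\partial p}{\partial\theta} = 0 \tag{300} \\
& \frac{1}{r}\frac{\partial (r v_r)}{\partial r} + \frac{1}{r}\frac{\partial v_\theta}{\partial\theta} = 0 \tag{301}
\end{align}

Equations (299) and (300) are the momentum equations in polar coordinates while Equation (301) is the incompressibility constraint. The components of the velocity are obtained from the stream function as follows:

$$
v_r = \frac{1}{r}\frac{\partial\Psi}{\partial\theta}, \qquad v_\theta = -\frac{\partial\Psi}{\partial r}
$$

where $v_r$ is the radial component and $v_\theta$ is the tangential component of the velocity vector.

The stream function is defined for incompressible (divergence-free) flows in 2D (as well as in 3D with axisymmetry). The stream function can be used to plot streamlines, which represent the trajectories of particles in a steady flow. From calculus it is known that the gradient vector $\vec\nabla\Psi$ is normal to the curve $\Psi = C$. It can be shown that everywhere $\vec{u}\cdot\vec\nabla\Psi = 0$ using the formula for $\vec{u}$ in terms of $\Psi$, which proves that level curves of $\Psi$ are streamlines:

$$
\vec{u}\cdot\vec\nabla\Psi = v_r\frac{\partial\Psi}{\partial r} + v_\theta\frac{1}{r}\frac{\partial\Psi}{\partial\theta} = \frac{1}{r}\frac{\partial\Psi}{\partial\theta}\frac{\partial\Psi}{\partial r} - \frac{\partial\Psi}{\partial r}\frac{1}{r}\frac{\partial\Psi}{\partial\theta} = 0
$$

In polar coordinates the curl of a vector $\vec{A}$ is:

$$
\vec\nabla\times\vec{A} = \frac{1}{r}\left( \frac{\partial (rA_\theta)}{\partial r} - \frac{\partial A_r}{\partial\theta} \right)
$$

Taking the curl of vector $\vec{A}$ yields:

$$
\frac{1}{r}\left( \frac{\partial (rA_\theta)}{\partial r} - \frac{\partial A_r}{\partial\theta} \right) = \frac{1}{r}\left( -\frac{\partial (\rho g_r)}{\partial\theta} \right)
$$

Multiplying each side by $r$:

$$
\frac{\partial (rA_\theta)}{\partial r} - \frac{\partial A_r}{\partial\theta} = -\frac{\partial \rho g_r}{\partial\theta}
$$

If we now replace $A_r$ and $A_\theta$ by their expressions as a function of velocity and pressure, we see that the pressure terms cancel out, and assuming the viscosity and the gravity vector to be constant we are left with an equation for the velocity only.

Let us assume the following separation of variables $\Psi(r,\theta) = \phi(r)\xi(\theta)$. Then

$$
v_r = \frac{1}{r}\frac{\partial\Psi}{\partial\theta} = \frac{\phi\xi'}{r}, \qquad v_\theta = -\frac{\partial\Psi}{\partial r} = -\phi'\xi
$$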

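Let us first express $A_r$ and $A_\theta$ (without their pressure terms) as functions of $\phi$ and $\xi$:

\begin{align}
A_r &= \frac{\partial^2 v_r}{\partial r^2} + \frac{1}{r}\frac{\partial v_r}{\partial r} + \frac{1}{r^2}\frac{\partial^2 v_r}{\partial\theta^2} - \frac{v_r}{r^2} - \frac{2}{r^2}\frac{\partial v_\theta}{\partial\theta} \tag{302} \\
&= \frac{\partial^2}{\partial r^2}\left(\frac{\phi\xi'}{r}\right) + \frac{1}{r}\frac{\partial}{\partial r}\left(\frac{\phi\xi'}{r}\right) + \frac{1}{r^2}\frac{\partial^2}{\partial\theta^2}\left(\frac{\phi\xi'}{r}\right) - \frac{1}{r^2}\left(\frac{\phi\xi'}{r}\right) - \frac{2}{r^2}\frac{\partial}{\partial\theta}(-\phi'\xi) \tag{303} \\
&= \left(\frac{\phi''}{r} - 2\frac{\phi'}{r^2} + 2\frac{\phi}{r^3}\right)\xi' + \left(\frac{\phi'}{r^2} - \frac{\phi}{r^3}\right)\xi' + \frac{\phi}{r^3}\xi''' - \frac{\phi\xi'}{r^3} + \frac{2}{r^2}\phi'\xi' \tag{304} \\
&= \frac{\phi''\xi'}{r} + \frac{\phi'\xi'}{r^2} + \frac{\phi\xi'''}{r^3} \tag{305} \\
\frac{\partial A_r}{\partial\theta} &= \frac{\phi''\xi''}{r} + \frac{\phi'\xi''}{r^2} + \frac{\phi\xi''''}{r^3} \tag{306}
\end{align}

\begin{align}
A_\theta &= \frac{\partial^2 v_\theta}{\partial r^2} + \frac{1}{r}\frac{\partial v_\theta}{\partial r} + \frac{1}{r^2}\frac{\partial^2 v_\theta}{\partial\theta^2} + \frac{2}{r^2}\frac{\partial v_r}{\partial\theta} - \frac{v_\theta}{r^2} \tag{308} \\
&= \frac{\partial^2}{\partial r^2}(-\phi'\xi) + \frac{1}{r}\frac{\partial}{\partial r}(-\phi'\xi) + \frac{1}{r^2}\frac{\partial^2}{\partial\theta^2}(-\phi'\xi) + \frac{2}{r^2}\frac{\partial}{\partial\theta}\left(\frac{\phi\xi'}{r}\right) - \frac{1}{r^2}(-\phi'\xi) \tag{309} \\
&= -\phi'''\xi - \frac{\phi''\xi}{r} - \frac{\phi'\xi''}{r^2} + \frac{2\phi\xi''}{r^3} + \frac{\phi'\xi}{r^2} \tag{310} \\
\frac{\partial (rA_\theta)}{\partial r} &= \tag{311}
\end{align}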

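WRONG:

\begin{align}
\frac{\partial (r\Delta v)}{\partial r} &= \frac{\partial}{\partial r}\left[ \frac{\partial}{\partial r}\left(r\frac{\partial v}{\partial r}\right) + \frac{1}{r}\frac{\partial^2 v}{\partial\theta^2} \right] \tag{312} \\
&= \frac{\partial^2}{\partial r^2}\left(r\frac{\partial v}{\partial r}\right) + \frac{\partial}{\partial r}\left(\frac{1}{r}\frac{\partial^2 v}{\partial\theta^2}\right) \tag{313} \\
&= \frac{\partial^2}{\partial r^2}\left(r\frac{\partial}{\partial r}\left(-\frac{\partial\Psi}{\partial r}\right)\right) + \frac{\partial}{\partial r}\left(\frac{1}{r}\frac{\partial^2}{\partial\theta^2}\left(-\frac{\partial\Psi}{\partial r}\right)\right) \tag{314} \\
&= -\frac{\partial^2}{\partial r^2}\left(r\frac{\partial^2\Psi}{\partial r^2}\right) - \frac{\partial}{\partial r}\left(\frac{1}{r}\frac{\partial^3\Psi}{\partial\theta^2\partial r}\right) \tag{315} \\
&= -2\frac{\partial^3\Psi}{\partial r^3} - r\frac{\partial^4\Psi}{\partial r^4} + \frac{1}{r^2}\frac{\partial^3\Psi}{\partial\theta^2\partial r} - \frac{1}{r}\frac{\partial^4\Psi}{\partial\theta^2\partial r^2} \tag{316}
\end{align}

\begin{align}
\frac{\partial \Delta u}{\partial\theta} &= \frac{\partial}{\partial\theta}\left[ \frac{1}{r}\frac{\partial}{\partial r}\left(r\frac{\partial u}{\partial r}\right) + \frac{1}{r^2}\frac{\partial^2 u}{\partial\theta^2} \right] \tag{317} \\
&= \frac{\partial}{\partial\theta}\left[ \frac{1}{r}\frac{\partial}{\partial r}\left(r\frac{\partial u}{\partial r}\right) \right] + \frac{1}{r^2}\frac{\partial^3 u}{\partial\theta^3} \tag{318} \\
&= \frac{\partial}{\partial\theta}\left[ \frac{1}{r}\frac{\partial}{\partial r}\left(r\frac{\partial}{\partial r}\left(\frac{1}{r}\frac{\partial\Psi}{\partial\theta}\right)\right) \right] + \frac{1}{r^2}\frac{\partial^3}{\partial\theta^3}\left(\frac{1}{r}\frac{\partial\Psi}{\partial\theta}\right) \tag{319} \\
&= \frac{1}{r^3}\frac{\partial^2\Psi}{\partial\theta^2} - \frac{1}{r^2}\frac{\partial^3\Psi}{\partial r\partial\theta^2} + \frac{1}{r}\frac{\partial^4\Psi}{\partial r^2\partial\theta^2} + \frac{1}{r^3}\frac{\partial^4\Psi}{\partial\theta^4} \tag{320}
\end{align}

Assuming the following separation of variables $\Psi(r,\theta) = \phi(r)\xi(\theta)$:

\begin{align}
\frac{\partial (r\Delta v)}{\partial r} &= -2\phi'''\xi - r\phi''''\xi + \frac{1}{r^2}\phi'\xi'' - \frac{1}{r}\phi''\xi'' \tag{321} \\
\frac{\partial \Delta u}{\partial\theta} &= \frac{1}{r^3}\phi\xi'' - \frac{1}{r^2}\phi'\xi'' + \frac{1}{r}\phi''\xi'' + \frac{1}{r^3}\phi\xi'''' \tag{322}
\end{align}

so that

$$
\frac{\partial (r\Delta v)}{\partial r} - \frac{\partial \Delta u}{\partial\theta} = -2\phi'''\xi - r\phi''''\xi + \frac{1}{r^2}\phi'\xi'' - \frac{1}{r}\phi''\xi'' - \frac{1}{r^3}\phi\xi'' + \frac{1}{r^2}\phi'\xi'' - \frac{1}{r}\phi''\xi'' - \frac{1}{r^3}\phi\xi''''
$$

Further assuming $\xi(\theta) = \cos(k\theta)$, then $\xi'' = -k^2\xi$ and $\xi'''' = k^4\xi$, so that

$$
\frac{\partial (r\Delta v)}{\partial r} - \frac{\partial \Delta u}{\partial\theta} = -2\phi'''\xi - r\phi''''\xi - k^2\frac{1}{r^2}\phi'\xi + k^2\frac{1}{r}\phi''\xi + k^2\frac{1}{r^3}\phi\xi - k^2\frac{1}{r^2}\phi'\xi + k^2\frac{1}{r}\phi''\xi - k^4\frac{1}{r^3}\phi\xi
$$

By choosing $\rho$ such that $\rho = \lambda(r)\Upsilon(\theta)$ and such that $\partial_\theta\Upsilon = \xi = \cos(k\theta)$ we then have

$$
-2\phi'''\xi - r\phi''''\xi - k^2\frac{1}{r^2}\phi'\xi + k^2\frac{1}{r}\phi''\xi + k^2\frac{1}{r^3}\phi\xi - k^2\frac{1}{r^2}\phi'\xi + k^2\frac{1}{r}\phi''\xi - k^4\frac{1}{r^3}\phi\xi = -\frac{1}{\eta}\lambda\xi g_r
$$

and then dividing by $\xi$: (IS THIS OK ?)

$$
-2\phi''' - r\phi'''' - k^2\frac{1}{r^2}\phi' + k^2\frac{1}{r}\phi'' + k^2\frac{1}{r^3}\phi - k^2\frac{1}{r^2}\phi' + k^2\frac{1}{r}\phi'' - k^4\frac{1}{r^3}\phi = -\frac{1}{\eta}\lambda g_r
$$

$$
-2\phi''' - r\phi'''' - 2k^2\frac{1}{r^2}\phi' + 2k^2\frac{1}{r}\phi'' + (k^2-k^4)\frac{1}{r^3}\phi = -\frac{1}{\eta}\lambda g_r
$$

$$
\lambda(r) = \frac{\eta}{g_r}\left( 2\phi''' + r\phi'''' + 2k^2\frac{1}{r^2}\phi' - 2k^2\frac{1}{r}\phi'' - (k^2-k^4)\frac{1}{r^3}\phi \right)
$$

Also do not forget $\Upsilon = \frac{1}{k}\sin(k\theta)$.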

41.1 Linking with our paper

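We have

\begin{align}
\phi(r) &= -r g(r) \tag{323} \\
\phi'(r) &= -g(r) - r g'(r) = -f(r) \tag{324} \\
\phi''(r) &= -f'(r) \tag{325} \\
\phi'''(r) &= -f''(r) \tag{326} \\
\phi''''(r) &= -f'''(r) \tag{327}
\end{align}

$$
f(r) = \frac{\eta_0}{g_0}\left( 2f''(r) + r f'''(r) + 2k^2\frac{1}{r^2}f(r) - 2k^2\frac{1}{r}f'(r) + (k^2-k^4)\frac{1}{r^2}g(r) \right)
$$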

41.2 No slip boundary conditions

No-slip boundary conditions inside and outside impose that all components of the velocity must be zero on both boundaries, i.e.

$$
\vec{v}(r=R_1) = \vec{v}(r=R_2) = \vec{0}
$$

Due to the separation of variables, and since $\xi(\theta) = \cos(k\theta)$, we have

$$
u(r,\theta) = \frac{1}{r}\frac{\partial\Psi}{\partial\theta} = \frac{1}{r}\phi\xi' = -\frac{1}{r}\phi(r)\,k\sin(k\theta), \qquad v(r,\theta) = -\frac{\partial\Psi}{\partial r} = -\phi'(r)\xi = -\phi'(r)\cos(k\theta)
$$

It is obvious that the only way to ensure no-slip boundary conditions is to have

$$
\phi(R_1) = \phi(R_2) = \phi'(R_1) = \phi'(R_2) = 0
$$

We could then choose

\begin{align}
\phi(r) &= (r-R_1)^2(r-R_2)^2 F(r) \tag{328} \\
\phi'(r) &= 2(r-R_1)(r-R_2)^2F(r) + 2(r-R_1)^2(r-R_2)F(r) + (r-R_1)^2(r-R_2)^2F'(r) \tag{329}
\end{align}

which are indeed identically zero on both boundaries. Here $F(r)$ is any (smooth enough) function of $r$. We would then have

$$
\Psi(r,\theta) = (r-R_1)^2(r-R_2)^2F(r)\cos(k\theta)
$$

In what follows we will take $F(r) = 1$ for simplicity.
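The no-slip property of this choice can be checked directly. The sketch below is an illustration with $F(r) = 1$; the radii and the wave number `k` are arbitrary values picked for the example, and the function names are hypothetical.

```python
import math

R1, R2, k = 1.0, 2.0, 4  # assumed values, for illustration only

def phi(r):
    """Radial part of the stream function, equation (328) with F(r) = 1."""
    return (r - R1)**2 * (r - R2)**2

def dphi(r):
    """Derivative of phi, equation (329) with F(r) = 1 and F'(r) = 0."""
    return 2.0 * (r - R1) * (r - R2)**2 + 2.0 * (r - R1)**2 * (r - R2)

def velocity(r, theta):
    """Radial and tangential velocity derived from Psi = phi(r) cos(k theta)."""
    v_r = -phi(r) * k * math.sin(k * theta) / r
    v_theta = -dphi(r) * math.cos(k * theta)
    return v_r, v_theta

if __name__ == "__main__":
    for r in (R1, 1.3, R2):
        print(r, velocity(r, 0.3))
```

Both components vanish on the two boundaries for every $\theta$ because $\phi$ has double roots at $R_1$ and $R_2$.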

COMPUTE $f$ from $\phi$ and then the pressure.

41.3 Free slip boundary conditions

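Before postulating the form of $\phi(r)$, let us now turn to the boundary conditions that the flow must fulfill, i.e. free-slip on both boundaries. Two conditions must be met:

• $\vec{v}\cdot\vec{n} = 0$ (no flow through the boundaries), which yields $u(r=R_1) = 0$ and $u(r=R_2) = 0$:

$$
\frac{\partial\Psi}{\partial\theta}(r=R_1,R_2) = 0 \qquad \forall\theta
$$

which gives us the first constraint since $\Psi(r,\theta) = \phi(r)\xi(\theta)$:

$$
\phi(r=R_1) = \phi(r=R_2) = 0
$$

• $(\sigma\cdot\vec{n})\times\vec{n} = \vec{0}$ (the tangential stress at the boundary is zero), which imposes $\sigma_{\theta r} = 0$, with

$$
\sigma_{\theta r} = 2\eta\cdot\frac{1}{2}\left( \frac{\partial v}{\partial r} - \frac{v}{r} + \frac{1}{r}\frac{\partial u}{\partial\theta} \right) = \eta\left( \frac{\partial}{\partial r}\left(-\frac{\partial\Psi}{\partial r}\right) - \frac{1}{r}\left(-\frac{\partial\Psi}{\partial r}\right) + \frac{1}{r}\frac{\partial}{\partial\theta}\left(\frac{1}{r}\frac{\partial\Psi}{\partial\theta}\right) \right)
$$

Finally $\Psi$ must fulfill (on the boundaries!):

$$
-\frac{\partial^2\Psi}{\partial r^2} + \frac{1}{r}\frac{\partial\Psi}{\partial r} + \frac{1}{r^2}\frac{\partial^2\Psi}{\partial\theta^2} = 0
$$

$$
-\phi''\xi + \frac{1}{r}\phi'\xi + \frac{1}{r^2}\phi\xi'' = 0
$$

or,

$$
-\phi'' + \frac{1}{r}\phi' - k^2\frac{1}{r^2}\phi = 0
$$

Note that this equation is a so-called Euler Differential Equation (footnote 17). Since we are looking for a solution $\phi$ such that $\phi(R_1) = \phi(R_2) = 0$, the third term of the equation above is by definition zero on the boundaries. We have to ensure the following equality on the boundary:

$$
-\phi'' + \frac{1}{r}\phi' = 0 \qquad \text{for } r = R_1, R_2
$$

The solution of this ODE is of the form $\phi(r) = ar^2 + b$ and it becomes evident that, apart from the trivial case $a = b = 0$, it cannot satisfy $\phi(r=R_1) = \phi(r=R_2) = 0$.

Separation of variables leads to solutions which cannot fulfill the free slip boundary conditions.

17 http://mathworld.wolfram.com/EulerDifferentialEquation.html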

42 fieldstone 36: the annulus geometry elastic aquarium

This ﬁeldstone was developed in collaboration with Lukas van de Wiel.

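The domain is an annulus with inner radius $R_1$ and outer radius $R_2$. It is filled with a single elastic material characterised by a Young's modulus $E$, a Poisson ratio $\nu$ and a density $\rho_0$. The gravity $\vec{g} = -g_0\vec{e}_r$ is pointing towards the center of the domain.

The problem at hand is axisymmetric so that the tangential component of the displacement vector $v_\theta$ is assumed to be zero, as well as all terms containing $\partial_\theta$. The components of the strain tensor are

\begin{align}
\varepsilon_{rr} &= \frac{\partial v_r}{\partial r} \tag{330} \\
\varepsilon_{\theta\theta} &= \frac{v_r}{r} + \frac{1}{r}\frac{\partial v_\theta}{\partial\theta} = \frac{v_r}{r} \tag{331} \\
\varepsilon_{r\theta} &= \frac{1}{2}\left( \frac{\partial v_\theta}{\partial r} - \frac{v_\theta}{r} + \frac{1}{r}\frac{\partial v_r}{\partial\theta} \right) = 0 \tag{332}
\end{align}

so that the tensor simply is

$$
\varepsilon = \begin{pmatrix} \varepsilon_{rr} & \varepsilon_{r\theta} \\ \varepsilon_{r\theta} & \varepsilon_{\theta\theta} \end{pmatrix} = \begin{pmatrix} \frac{\partial v_r}{\partial r} & 0 \\ 0 & \frac{v_r}{r} \end{pmatrix} \tag{333}
$$

The pressure is

$$
p = -\lambda\vec\nabla\cdot\vec{v} = -\lambda\frac{1}{r}\frac{\partial (rv_r)}{\partial r} \tag{334}
$$

and finally the stress tensor:

$$
\sigma = -p\mathbf{1} + 2\mu\varepsilon = \begin{pmatrix} \lambda\frac{1}{r}\frac{\partial (rv_r)}{\partial r} + 2\mu\frac{\partial v_r}{\partial r} & 0 \\ 0 & \lambda\frac{1}{r}\frac{\partial (rv_r)}{\partial r} + 2\mu\frac{v_r}{r} \end{pmatrix} \tag{335}
$$

The divergence of the stress tensor is given by [119]:

$$
\vec\nabla\cdot\sigma = \begin{pmatrix} \frac{\partial\sigma_{rr}}{\partial r} + \frac{\sigma_{rr}-\sigma_{\theta\theta}}{r} + \frac{1}{r}\frac{\partial\sigma_{\theta r}}{\partial\theta} \\ \frac{\partial\sigma_{r\theta}}{\partial r} + \frac{1}{r}\frac{\partial\sigma_{\theta\theta}}{\partial\theta} + \frac{\sigma_{r\theta}+\sigma_{\theta r}}{r} \end{pmatrix} \tag{336}
$$

Given the diagonal nature of the stress tensor this simplifies to (also remember that $\partial_\theta = 0$):

$$
\vec\nabla\cdot\sigma = \begin{pmatrix} \frac{\partial\sigma_{rr}}{\partial r} + \frac{\sigma_{rr}-\sigma_{\theta\theta}}{r} \\ 0 \end{pmatrix} \tag{337}
$$

Focusing on the $r$-component of the stress divergence:

\begin{align}
(\vec\nabla\cdot\sigma)_r &= \frac{\partial\sigma_{rr}}{\partial r} + \frac{\sigma_{rr}-\sigma_{\theta\theta}}{r} \tag{338} \\
&= \frac{\partial}{\partial r}\left[ \lambda\frac{1}{r}\frac{\partial (rv_r)}{\partial r} + 2\mu\frac{\partial v_r}{\partial r} \right] + \frac{1}{r}\left[ \lambda\frac{1}{r}\frac{\partial (rv_r)}{\partial r} + 2\mu\frac{\partial v_r}{\partial r} - \lambda\frac{1}{r}\frac{\partial (rv_r)}{\partial r} - 2\mu\frac{v_r}{r} \right] \tag{339} \\
&= \lambda\frac{\partial}{\partial r}\left(\frac{1}{r}\frac{\partial (rv_r)}{\partial r}\right) + 2\mu\frac{\partial^2 v_r}{\partial r^2} + \lambda\frac{1}{r^2}\frac{\partial (rv_r)}{\partial r} + \frac{2\mu}{r}\frac{\partial v_r}{\partial r} - \lambda\frac{1}{r^2}\frac{\partial (rv_r)}{\partial r} - 2\mu\frac{v_r}{r^2} \tag{340} \\
&= \lambda\left( -\frac{v_r}{r^2} + \frac{1}{r}\frac{\partial v_r}{\partial r} + \frac{\partial^2 v_r}{\partial r^2} \right) + 2\mu\frac{\partial^2 v_r}{\partial r^2} + \frac{2\mu}{r}\frac{\partial v_r}{\partial r} - 2\mu\frac{v_r}{r^2} \tag{341} \\
&= -(2\mu+\lambda)\frac{v_r}{r^2} + (2\mu+\lambda)\frac{1}{r}\frac{\partial v_r}{\partial r} + (2\mu+\lambda)\frac{\partial^2 v_r}{\partial r^2} \tag{342}
\end{align}

So the momentum conservation in the $r$ direction is

$$
(\vec\nabla\cdot\sigma + \rho_0\vec{g})_r = -(2\mu+\lambda)\frac{v_r}{r^2} + (2\mu+\lambda)\frac{1}{r}\frac{\partial v_r}{\partial r} + (2\mu+\lambda)\frac{\partial^2 v_r}{\partial r^2} - \rho_0 g_0 = 0 \tag{343}
$$

or,

$$
\frac{\partial^2 v_r}{\partial r^2} + \frac{1}{r}\frac{\partial v_r}{\partial r} - \frac{v_r}{r^2} = \frac{\rho_0 g_0}{\lambda+2\mu} \tag{344}
$$

We now look at the boundary conditions. On the inner boundary we prescribe $v_r(r=R_1) = 0$ while free surface boundary conditions are prescribed on the outer boundary, i.e. $\sigma\cdot\vec{n} = \vec{0}$ (there is no force applied on the surface).

The general form of the solution can then be obtained:

$$
v_r(r) = C_1r^2 + C_2r + \frac{C_3}{r} \tag{345}
$$

with

$$
C_1 = \frac{\rho_0 g_0}{3(\lambda+2\mu)}, \qquad C_2 = -C_1R_1 - \frac{C_3}{R_1^2}, \qquad C_3 = \frac{k_1+k_2}{(R_1^2+R_2^2)(2\mu+\lambda) + (R_2^2-R_1^2)\lambda} \tag{346}
$$

and

$$
k_1 = (2\mu+\lambda)C_1(2R_1^2R_2^3 - R_1^3R_2^2), \qquad k_2 = \lambda C_1(R_1^2R_2^3 - R_1^3R_2^2) \tag{347}
$$

The expression for $C_2$ follows from $v_r(R_1) = 0$ and the one for $C_3$ from $\sigma_{rr}(R_2) = 0$.

Pressure can then be computed as follows:

$$
p = -\lambda\vec\nabla\cdot\vec{v} = -\lambda\frac{1}{r}\frac{\partial (rv_r)}{\partial r} = -\lambda\frac{1}{r}(3C_1r^2 + 2C_2r) = -\lambda(3C_1r + 2C_2) \tag{348}
$$

We choose $R_1 = 2890$ km, $R_2 = 6371$ km, $g_0 = 9.81$ m s$^{-2}$, $\rho_0 = 3300$, $E = 6\cdot 10^{10}$, $\nu = 0.49$.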
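The constants and the two boundary conditions can be checked numerically for the parameter values above. The sketch below is an illustration; the function names are hypothetical, SI units are assumed, and $C_2$ is taken as $-C_1R_1 - C_3/R_1^2$, which is the value that makes $v_r$ vanish at $r = R_1$. The conversion from $(E,\nu)$ to $(\lambda,\mu)$ is the standard one.

```python
R1, R2 = 2890e3, 6371e3                        # radii in metres
g0, rho0, E, nu = 9.81, 3300.0, 6e10, 0.49     # SI units assumed

lam = E * nu / ((1.0 + nu) * (1.0 - 2.0 * nu))
mu = E / (2.0 * (1.0 + nu))

C1 = rho0 * g0 / (3.0 * (lam + 2.0 * mu))
k1 = (2.0 * mu + lam) * C1 * (2.0 * R1**2 * R2**3 - R1**3 * R2**2)
k2 = lam * C1 * (R1**2 * R2**3 - R1**3 * R2**2)
C3 = (k1 + k2) / ((R1**2 + R2**2) * (2.0 * mu + lam) + (R2**2 - R1**2) * lam)
C2 = -C1 * R1 - C3 / R1**2

def v_r(r):
    """Radial displacement, equation (345)."""
    return C1 * r * r + C2 * r + C3 / r

def dv_r(r):
    """Radial derivative of the displacement."""
    return 2.0 * C1 * r + C2 - C3 / r**2

def pressure(r):
    """Pressure, equation (348)."""
    return -lam * (3.0 * C1 * r + 2.0 * C2)

def sigma_rr(r):
    """Radial normal stress, first diagonal entry of equation (335)."""
    return -pressure(r) + 2.0 * mu * dv_r(r)

if __name__ == "__main__":
    print("v_r(R1) =", v_r(R1), " sigma_rr(R2) =", sigma_rr(R2))
    print("v_r(R2) =", v_r(R2), " p(R1) =", pressure(R1))
```

Both boundary values are zero up to round-off, which is a useful sanity check before comparing a numerical solution against these profiles.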

[Figure: radial profiles of the displacement and pressure fields.]

[Figure: displacement and pressure fields in the domain.]

[Figure: velocity and pressure errors.]

43 fieldstone 37: marker advection and population control

The domain is a unit square. The Stokes equations are not solved; the velocity is prescribed everywhere in the domain as follows:

\begin{align}
u &= -(z - 0.5) \tag{349} \\
v &= 0 \tag{350} \\
w &= (x - 0.5) \tag{351}
\end{align}

At the moment, velocity is computed on the marker itself (rk0 algorithm). When markers are advected outside the domain, they are arbitrarily placed at location (-0.0123,-0.0123).
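A minimal sketch of one advection step is given below. It interprets the scheme described above as a forward Euler update with the prescribed velocity evaluated at the marker position, in the $xz$-plane; this interpretation, the function names and the handling of already parked markers are assumptions made for the example, not the fieldstone implementation.

```python
PARKED = (-0.0123, -0.0123)  # location given to markers that left the domain

def velocity(x, z):
    """Prescribed velocity (u, w) of equations (349) and (351)."""
    return -(z - 0.5), (x - 0.5)

def advect(markers, dt):
    """Advance a list of (x, z) marker positions by one time step dt."""
    new_markers = []
    for (x, z) in markers:
        if (x, z) == PARKED:
            # markers already removed from the domain are left where they are
            new_markers.append(PARKED)
            continue
        u, w = velocity(x, z)
        x_new, z_new = x + u * dt, z + w * dt
        if not (0.0 <= x_new <= 1.0 and 0.0 <= z_new <= 1.0):
            x_new, z_new = PARKED
        new_markers.append((x_new, z_new))
    return new_markers

if __name__ == "__main__":
    print(advect([(0.5, 0.5), (0.75, 0.5), (1.0, 0.0)], dt=0.1))
```

The velocity field is a rigid rotation about the centre of the square, so markers near the corners are the first to leave the domain.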

in construction.


44 fieldstone 38: Critical Rayleigh number

This ﬁeldstone was developed in collaboration with Arie van den Berg.

The system is a layer of ﬂuid between y= 0 and y= 1, with boundary conditions T(x, y = 0) = 1

and T(x, y = 1) = 0, characterized by ρ,cp,k,η0. The Rayleigh number of the system is

Ra = ρ0g0α∆T h3

η0κ

We have ∆T= 1, h= 1 and choose κ= 1 so that the Rayleigh number simpliﬁes to Ra = ρ0g0α/η0.

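The Stokes equation is $\vec\nabla\cdot\boldsymbol{\sigma} + \vec{b} = \vec{0}$ with $\vec{b} = \rho\vec{g}$. The components of this equation along the $x$- and $y$-axes are then:

$$ (\vec\nabla\cdot\boldsymbol{\sigma})_x = -\rho\,\vec{g}\cdot\vec{e}_x = 0 \tag{352} $$
$$ (\vec\nabla\cdot\boldsymbol{\sigma})_y = -\rho\,\vec{g}\cdot\vec{e}_y = \rho g_0 \tag{353} $$

since $\vec{g}$ and $\vec{e}_y$ are in opposite directions ($\vec{g} = -g_0\vec{e}_y$, with $g_0>0$). The stream function formulation of the incompressible isoviscous Stokes equation is then

$$ \nabla^4\Psi = \frac{g_0}{\eta_0}\frac{\partial\rho}{\partial x} $$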

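Assuming a density field linearised with respect to temperature, $\rho(T) = \rho_0(1-\alpha T)$, we have

$$ \frac{\partial\rho}{\partial x} = -\rho_0\alpha\frac{\partial T}{\partial x} $$

and then

$$ \nabla^4\Psi = -\frac{\rho_0 g_0 \alpha}{\eta_0}\frac{\partial T}{\partial x} = -Ra\,\frac{\partial T}{\partial x} \tag{354} $$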

For small perturbations of the conductive state $T_0(y) = 1-y$ we define the temperature perturbation $T_1(x,y)$ such that

$$ T(x,y) = T_0(y) + T_1(x,y) $$

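The temperature perturbation $T_1$ satisfies the homogeneous boundary conditions $T_1(x,y=0)=0$ and $T_1(x,y=1)=0$. The temperature equation is

$$
\rho c_p \frac{DT}{Dt}
= \rho c_p\left(\frac{\partial T}{\partial t} + \vec{v}\cdot\vec\nabla T\right)
= \rho c_p\left(\frac{\partial (T_0+T_1)}{\partial t} + \vec{v}\cdot\vec\nabla (T_0+T_1)\right)
= k\Delta(T_0+T_1)
$$

and can be simplified as follows:

$$ \rho c_p\left(\frac{\partial T_1}{\partial t} + \vec{v}\cdot\vec\nabla T_0\right) = k\Delta T_1 $$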

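since $T_0$ does not depend on time, $\Delta T_0 = 0$, and we assume the nonlinear term $\vec{v}\cdot\vec\nabla T_1$ to be second order (temperature perturbations and the coupled velocity changes are assumed to be small). Using the relationship between velocity and stream function, $v_y = -\partial_x\Psi$, we have $\vec{v}\cdot\vec\nabla T_0 = -v_y = \partial_x\Psi$, and since $\kappa = k/\rho c_p = 1$ we get

$$ \frac{\partial T_1}{\partial t} - \kappa\Delta T_1 = -\frac{\partial\Psi}{\partial x} \tag{355} $$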

Looking at these equations, a separation of variables approach suggests itself. Both equations involve the Laplace operator $\Delta$, and the eigenfunctions of the biharmonic operator and of the Laplace operator are the same. We then pose that $\Psi$ and $T_1$ can be written:

$$ \Psi(x,y,t) = A_\Psi \exp(pt)\exp(\pm i k_x x)\exp(\pm i k_y y) = A_\Psi E_\Psi(x,y,t) \tag{356} $$
$$ T_1(x,y,t) = A_T \exp(pt)\exp(\pm i k_x x)\exp(\pm i k_y y) = A_T E_T(x,y,t) \tag{357} $$

where $k_x$ and $k_y$ are the horizontal and vertical wave numbers respectively. Note that we then have

$$ \nabla^2\Psi = -(k_x^2+k_y^2)\Psi \qquad \nabla^2 T_1 = -(k_x^2+k_y^2)T_1 $$


The boundary conditions on $T_1$, coupled with the choice of a real function for the $x$ dependence, yield:

$$ E_T(x,y,t) = \exp(pt)\cos(k_x x)\sin(n\pi y) $$

from here onwards check for minus signs!

The velocity boundary conditions are $v_y(x,y=0)=0$ and $v_y(x,y=1)=0$, which impose conditions on $\partial\Psi/\partial x$, and we find that we can use the same $y$ dependence as for $T_1$. Choosing again a real function for the $x$ dependence yields:

$$ E_\Psi(x,y,t) = \exp(pt)\sin(k_x x)\sin(n\pi y) $$

We then have

$$ \Psi(x,y,t) = A_\Psi \exp(pt)\sin(k_x x)\sin(n\pi y) = A_\Psi E_\Psi(x,y,t) \tag{358} $$
$$ T_1(x,y,t) = A_T \exp(pt)\cos(k_x x)\sin(n\pi y) = A_T E_T(x,y,t) \tag{359} $$
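The two properties used here, namely that the real modes vanish on $y=0$ and $y=1$ and that they are eigenfunctions of the Laplace operator with eigenvalue $-(k_x^2+n^2\pi^2)$, can be checked numerically. The sketch below is only an illustration: it writes the vertical coordinate as $y$, evaluates the modes at $t=0$, and uses a centred finite-difference Laplacian. All function names are hypothetical.

```python
import math


def psi_mode(x, y, k, n):
    """Spatial part of E_Psi at t = 0: sin(k x) sin(n pi y)."""
    return math.sin(k * x) * math.sin(n * math.pi * y)


def temp_mode(x, y, k, n):
    """Spatial part of E_T at t = 0: cos(k x) sin(n pi y)."""
    return math.cos(k * x) * math.sin(n * math.pi * y)


def laplacian(f, x, y, k, n, h=1e-3):
    """Second-order centred finite-difference Laplacian of f at (x, y)."""
    return (f(x + h, y, k, n) + f(x - h, y, k, n)
            + f(x, y + h, k, n) + f(x, y - h, k, n)
            - 4.0 * f(x, y, k, n)) / h**2


def eigenvalue_error(f, x, y, k, n):
    """Relative mismatch between Laplacian(f) and -(k^2 + n^2 pi^2) f."""
    expected = -(k**2 + (n * math.pi)**2) * f(x, y, k, n)
    return abs(laplacian(f, x, y, k, n) - expected) / abs(expected)


if __name__ == "__main__":
    print(eigenvalue_error(psi_mode, 0.3, 0.4, 2.0, 1))
    print(eigenvalue_error(temp_mode, 0.3, 0.4, 2.0, 1))
```

The mismatch is controlled by the finite-difference step `h` and shrinks as `h` is reduced, as long as round-off does not dominate.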

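In what follows we simplify notations: $k = k_x$. Then the two PDEs become:

$$ p T_1 + \kappa(k^2+n^2\pi^2) T_1 - k A_\Psi \exp(pt)\cos(kx)\sin(n\pi y) = \left[p + \kappa(k^2+n^2\pi^2)\right] A_T E_T - k A_\Psi E_T = 0 \tag{360} $$
$$ -Ra\,k A_T \exp(pt)\sin(kx)\sin(n\pi y) + (k^2+n^2\pi^2)^2\,\Psi = -Ra\,k A_T E_\Psi + (k^2+n^2\pi^2)^2 A_\Psi E_\Psi = 0 \tag{361} $$

These equations must then hold for all ..., which (with $\kappa = 1$) leads to:

$$
\begin{pmatrix}
p + (k^2+n^2\pi^2) & -k \\
-Ra\,k & (k^2+n^2\pi^2)^2
\end{pmatrix}
\begin{pmatrix} A_T \\ A_\Psi \end{pmatrix}
=
\begin{pmatrix} 0 \\ 0 \end{pmatrix}
$$

The determinant of this system must be zero, otherwise the only solution is the trivial one, i.e. $A_T = 0$ and $A_\Psi = 0$, which is not helpful:

$$ D = \left[p + (k^2+n^2\pi^2)\right](k^2+n^2\pi^2)^2 - Ra\,k^2 = 0 $$

or,

$$ p = \frac{Ra\,k^2 - (k^2+n^2\pi^2)^3}{(k^2+n^2\pi^2)^2} $$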

The coefficient $p$ determines the stability of the system: if it is negative, the system is stable and both $\Psi$ and $T_1$ decay to zero (return to the conductive state). If $p=0$ the system is marginally stable, and if $p>0$ the system is unstable and the perturbations grow. The threshold is then $p=0$, for which the expression above gives $Ra = (k^2+n^2\pi^2)^3/k^2$.
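The growth rate and the neutral curve $p=0$ are easy to explore numerically. The following Python sketch assumes the expression for $p$ given above with $\kappa = 1$, and locates the minimum of the neutral curve by brute force. The function names are illustrative. For $n=1$ the minimum is expected at $k=\pi/\sqrt{2}$ with $Ra = 27\pi^4/4$, the classical value for this kind of sinusoidal mode.

```python
import math


def growth_rate(Ra, k, n=1):
    """Growth rate p(Ra, k, n) from the dispersion relation above."""
    K = k**2 + (n * math.pi)**2
    return (Ra * k**2 - K**3) / K**2


def neutral_rayleigh(k, n=1):
    """Rayleigh number for which p = 0 at wave number k."""
    K = k**2 + (n * math.pi)**2
    return K**3 / k**2


def critical_rayleigh(n=1, k_min=0.1, k_max=10.0, samples=100000):
    """Brute-force minimum of the neutral curve over a uniform grid in k."""
    best_k, best_Ra = k_min, neutral_rayleigh(k_min, n)
    for i in range(1, samples + 1):
        k = k_min + (k_max - k_min) * i / samples
        Ra = neutral_rayleigh(k, n)
        if Ra < best_Ra:
            best_k, best_Ra = k, Ra
    return best_k, best_Ra


if __name__ == "__main__":
    k_c, Ra_c = critical_rayleigh()
    print("critical wave number:", k_c)
    print("critical Rayleigh number:", Ra_c)
```

A grid search is used instead of the closed form so that the same function can be reused for other values of $n$ or for a modified dispersion relation.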


45 fieldstone: Gravity: buried sphere

Before you proceed further, please read :

http://en.wikipedia.org/wiki/Gravity_anomaly

http://en.wikipedia.org/wiki/Gravimeter

Let us consider a vertical domain $L_x \times L_y$ where $L_x = 1000$ km and $L_y = 500$ km. This domain is discretised by means of a grid which counts nnp = nnx × nny nodes. This grid then counts nel = nelx × nely = (nnx − 1) × (nny − 1) cells. The horizontal spacing between nodes is hx and the vertical spacing is hy.

Assume that this domain is filled with a rock type whose mass density is $\rho_{medium} = 3000$ kg/m$^3$, and that there is a circular inclusion of another rock type ($\rho_{sphere} = 3200$ kg/m$^3$) of radius $r_{sphere}$ at location $(x_{sphere}, y_{sphere})$. The density in the system is then given by

$$
\rho(x,y) =
\begin{cases}
\rho_{sphere} & \text{inside the circle} \\
\rho_{medium} & \text{outside the circle}
\end{cases}
$$

Let us now assume that we place nsurf gravimeters at the surface of the model. These are placed equidistantly between coordinates $x=0$ and $x=L_x$. We will use the arrays xsurf and ysurf to store the coordinates of these locations. The spacing between the gravimeters is $\delta_x = L_x/(nsurf-1)$.
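The setup described so far (grid counts, cell densities and gravimeter coordinates) can be sketched as follows. This is a minimal illustration in plain Python. The node counts, the inclusion position and radius, and the choice of placing the surface at $y = L_y$ are assumptions made for the example, not values from the text.

```python
import math

Lx, Ly = 1000e3, 500e3          # domain size (m), from the text
nnx, nny = 201, 101             # illustrative node counts (not from the text)
nelx, nely = nnx - 1, nny - 1
nnp, nel = nnx * nny, nelx * nely
hx, hy = Lx / nelx, Ly / nely

rho_medium, rho_sphere = 3000.0, 3200.0          # kg/m^3, from the text
xsphere, ysphere, rsphere = 500e3, 250e3, 50e3   # hypothetical inclusion

# Cell centres and one density value per cell.
xc, yc, rho = [], [], []
for j in range(nely):
    for i in range(nelx):
        x, y = (i + 0.5) * hx, (j + 0.5) * hy
        inside = (x - xsphere)**2 + (y - ysphere)**2 < rsphere**2
        xc.append(x)
        yc.append(y)
        rho.append(rho_sphere if inside else rho_medium)

# Gravimeters, equidistant along the top of the model (assumed at y = Ly).
nsurf = 11
dx = Lx / (nsurf - 1)
xsurf = [i * dx for i in range(nsurf)]
ysurf = [Ly] * nsurf

if __name__ == "__main__":
    print("nnp =", nnp, " nel =", nel, " hx =", hx, " hy =", hy)
```

Assigning the density from the position of the cell centre means the inclusion is represented as a staircase. Its discrete area only approaches $\pi r_{sphere}^2$ as the grid is refined.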

At any given point $(x_i, y_i)$ in a 2D space, one can show that the gravity anomaly due to the presence of a circular inclusion can be computed as follows:

$$ g(x_i,y_i) = 2\pi G (\rho_{sphere}-\rho_0)\, r_{sphere}^2\, \frac{y_i - y_{sphere}}{(x_i-x_{sphere})^2 + (y_i-y_{sphere})^2} \tag{362} $$

where $r_{sphere}$ is the radius of the inclusion, $(x_{sphere}, y_{sphere})$ are the coordinates of the center of the inclusion, and $\rho_0$ is a reference density.

However, the general formula to compute the gravity anomaly at a given point $(x_i, y_i)$ in space due to a density anomaly of any shape is given by:

$$ g(x_i,y_i) = 2G \iint_\Omega \frac{\Delta\rho(x,y)\,(y-y_i)}{(x-x_i)^2+(y-y_i)^2}\, dx\,dy \tag{363} $$

where $\Omega$ is the area of the domain on which the integration is to be carried out. Furthermore, the density anomaly can be written $\Delta\rho(x,y) = \rho(x,y)-\rho_0$. We can then carry out the integration for each cell and sum their contributions:

$$ g(x_i,y_i) = 2G \sum_{ic=1}^{nel} \iint_{\Omega_e} \frac{(\rho(x,y)-\rho_0)\,(y-y_i)}{(x-x_i)^2+(y-y_i)^2}\, dx\,dy \tag{364} $$

where $\Omega_e$ is now the area of a single cell. Finally, one can assume the density to be constant within each cell, so that $\rho(x,y)\rightarrow\rho(ic)$ and $\iint_{\Omega_e} dx\,dy \rightarrow h_x \times h_y$, and then

$$ g(x_i,y_i) = 2G \sum_{ic=1}^{nel} \frac{(\rho(ic)-\rho_0)\,(y(ic)-y_i)}{(x(ic)-x_i)^2+(y(ic)-y_i)^2}\, h_x h_y \tag{365} $$

We will then use the array gsurf to store the value of the gravity anomaly measured at each gravimeter at the surface.
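The cell sum (365) and the closed-form expression (362) can be compared with a short script. The sketch below is written in plain Python and rests on several assumptions: $(x(ic), y(ic))$ is taken to be the cell centre, the gravimeters sit at $y = L_y$, $\rho_0 = \rho_{medium}$, and the resolution and inclusion geometry are illustrative. As written, (362) and (365) use opposite signs for the vertical offset ($y_i - y_{sphere}$ versus $y(ic) - y_i$), so the two results are compared up to that sign.

```python
import math

G = 6.674e-11  # gravitational constant (m^3 kg^-1 s^-2)


def gravity_cells(xi, yi, xc, yc, rho, rho0, hx, hy):
    """Cell sum of equation (365), evaluated at the cell centres."""
    total = 0.0
    for x, y, r in zip(xc, yc, rho):
        if r == rho0:
            continue  # no density anomaly, no contribution
        total += (r - rho0) * (y - yi) / ((x - xi)**2 + (y - yi)**2)
    return 2.0 * G * total * hx * hy


def gravity_analytic(xi, yi, xs, ys, rs, drho):
    """Closed-form expression of equation (362)."""
    return 2.0 * math.pi * G * drho * rs**2 * (yi - ys) / ((xi - xs)**2 + (yi - ys)**2)


# Illustrative setup (resolution and inclusion geometry are assumptions).
Lx, Ly, nelx, nely = 1000e3, 500e3, 200, 100
hx, hy = Lx / nelx, Ly / nely
rho_medium, rho_sphere = 3000.0, 3200.0
xs, ys, rs = 500e3, 250e3, 50e3
rho0 = rho_medium

xc, yc, rho = [], [], []
for j in range(nely):
    for i in range(nelx):
        x, y = (i + 0.5) * hx, (j + 0.5) * hy
        xc.append(x)
        yc.append(y)
        rho.append(rho_sphere if (x - xs)**2 + (y - ys)**2 < rs**2 else rho_medium)

nsurf = 11
xsurf = [i * Lx / (nsurf - 1) for i in range(nsurf)]
ysurf = [Ly] * nsurf
gsurf = [gravity_cells(x, y, xc, yc, rho, rho0, hx, hy) for x, y in zip(xsurf, ysurf)]
gref = [gravity_analytic(x, y, xs, ys, rs, rho_sphere - rho0) for x, y in zip(xsurf, ysurf)]

if __name__ == "__main__":
    for x, g, a in zip(xsurf, gsurf, gref):
        print(f"x = {x:9.1f} m  cell sum = {g: .4e}  analytic = {a: .4e}")
```

The remaining mismatch between the two columns comes mostly from the staircase representation of the inclusion, so it is a useful quantity to monitor when exploring the effect of grid resolution.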


To go further

•explore the effect of the size of the inclusion on the gravity profile.

•explore the effect of the $\rho_0$ value.

•explore the effect of the grid resolution.

•measure the time that is required to compute the gravity. How does this time vary with nsurf? How does it vary when the grid resolution is doubled?

•assume now that $\rho_2 < \rho_1$. What does the gravity profile look like?

•what happens when the gravimeters are no longer at the surface of the Earth but in a satellite?

•if you feel brave, redo the whole exercise in 3D...


46 Projects for guided research/MSc thesis

•Darcy flow. redo WAFLE (see http://cedricthieulot.net/wafle.html)

•carry out critical Rayleigh experiments for various geometries/aspect ratios. Use Arie’s notes.

•Newton solver

•Corner ﬂow

•elasticity with markers

•Indentor/punch with stress b.c. ?

•chunk grid

•read in crust 1.0 in 2D on chunk

•compute gravity based on tetrahedra

•compare Q2 with Q2-serendipity

•NS a la http://ww2.lacan.upc.edu/huerta/exercises/Incompressible/Incompressible Ex2.htm

•produce example of mckenzie slab temperature

•write about impose bc on el matrix

•constraints

•discontinuous galerkin

•formatting of code style

•navier-stokes ? (LUKAS) use dohu matlab code

•nonlinear poiseuille

•compositions, marker chain

•free-slip bc on annulus and sphere . See for example p540 Gresho and Sani book.

•non-linear rheologies (two layer brick spmw16, tosn15)

•Picard vs Newton

•periodic boundary conditions

•free surface

•newton method to localise markers

•zaleski disk advection

•including phase changes (w. R. Myhill)

•compute strainrate in middle of element or at quad point for punch?

•GEO1442 code

•GEO1442 indenter setup in plane ?

•in/out ﬂow on sides for lith modelling

•Fehlberg RK advection

•redo puth17 2 layer experiment


https://peterkovesi.com/projects/colourmaps/ velocity: CET-D1A pressure: CET-D1 divv: CET-L1

density: CET-D3 strainrate: CET-R2

Problems to solve:

colorscale

better yet simple matrix storage ?

write Scott about matching compressible2 setup with his paper

deal with large matrices.


A Three-dimensional applications

In the following table I list many papers which showcase high-resolution models of geodynamical processes (subduction, rifting, mantle flow, plume transport, ...). Given the yearly output of our community and the long list of journals in which research can be disseminated, this list cannot be exhaustive.

Author(s) topic resolution
[8] Small-scale sublithospheric convection in the Pacific 448 × 56 × 64
[123] Migration and morphology of subducted slabs in the upper mantle 50 × 50 × 25
[106] Subduction scissor across the South Island of New Zealand 17 × 9 × 9
[92] Influence of a buoyant oceanic plateau on subduction zones 80 × 40 × 80
[28] Subduction dynamics, origin of Andean orogeny and the Bolivian orocline 96 × 96 × 64
[49] Feedback between rifting and diapirism, ultrahigh-pressure rocks exhumation 100 × 64 × 20
[4] Numerical modeling of upper crustal extensional systems 160 × 160 × 12
[5] Rift interaction in brittle-ductile coupled systems 160 × 160 × 23
[103] Kinematic interpretation of the 3D shapes of metamorphic core complexes 67 × 67 × 33
[71] Role of rheology and slab shape on rapid mantle flow: the Alaska slab edge 960 × 648 × 160
[27] Complex mantle flow around heterogeneous subducting oceanic plates 96 × 96 × 64
[20] Oblique rifting and continental break-up 150 × 50 × 30
[14] Influence of mantle plume head on dynamics of a retreating subduction zone 80 × 40 × 80
[19] Rift to break-up evolution of the Gulf of Aden 83 × 83 × 40
[21] Thermo-mechanical impact of plume arrival on continental break-up 100 × 70 × 20
[29] Subduction and slab breakoff controls on Asian indentation tectonics 96 × 96 × 64
[52] Modeling of upper mantle deformation and SKS splitting calculations 96 × 64 × 96
[115] Backarc extension/shortening, slab induced toroidal/poloidal mantle flow 352 × 80 × 64
[89] Sediment transports in the context of oblique subduction modelling 500 × 164 × 100
[150] Crustal growth at active continental margins 404 × 164 × 100
[80] Dynamics of India-Asia collision 257 × 257 × 33
[140] Strain-partitioning in the Himalaya 256 × 256 × 40
[85] Collision of continental corner from 3-D numerical modeling 500 × 340 × 164
[105] Dependence of mid-ocean ridge morphology on spreading rate 196 × 196 × 100
[51] Mid mantle seismic anisotropy around subduction zones 197 × 197 × 53
[64] Oblique rifting of the Equatorial Atlantic 120 × 80 × 20
[96] Dynamics of continental accretion 256 × 96 × 96
[111] Thrust wedges: infl. of decollement strength on transfer zones 309 × 85 × 149
[24] Asymmetric three-dimensional topography over mantle plumes 500 × 500 × 217
[133] Modelled crustal systems undergoing orogeny and subjected to surface processes 96 × 32 × 14
[86] Thermo-mechanical modeling of continental rifting and seafloor spreading 197 × 197 × 197


References

[1]

[2] pTatin3D: High-Performance Methods for Long-Term Lithospheric Dynamics, 2014.

[3] M. Albers. A local mesh reﬁnement multigrid method for 3D convection problems with strongly

variable viscosity. J. Comp. Phys., 160:126–150, 2000.

[4] V. Allken, R. Huismans, and C. Thieulot. Three dimensional numerical modelling of upper crustal

extensional systems. J. Geophys. Res., 116:B10409, 2011.

[5] V. Allken, R. Huismans, and C. Thieulot. Factors controlling the mode of rift interaction in brittle-

ductile coupled systems: a 3d numerical study. Geochem. Geophys. Geosyst., 13(5):Q05010, 2012.

[6] V. Allken, R.S. Huismans, H. Fossen, and C. Thieulot. 3D numerical modelling of graben interaction

and linkage: a case study of the Canyonlands grabens, Utah. Basin Research, 25:1–14, 2013.

[7] J.D. Anderson. Computational Fluid Dynamics. McGraw-Hill, 1995.

[8] M.D. Ballmer, G. Ito, J. van Hunen, and P.J. Tackley. Small-scale sublithospheric convection reconciles geochemistry and geochronology of 'Superplume' volcanism in the western and south Pacific. Earth Planet. Sci. Lett., 290:224–232, 2010.

[9] K.-J. Bathe. Finite Element Procedures in Engineering Analysis. Prentice-Hall, 1982.

[10] M. Benzi, G.H. Golub, and J. Liesen. Numerical solution of saddle point problems. Acta Numerica,

14:1–137, 2005.

[11] D. Bercovici, G. Schubert, and G.A. Glatzmaier. Three-dimensional convection of an inﬁnite

Prandtl-number compressible ﬂuid in a basally heated spherical shell. J. Fluid Mech., 239:683–

719, 1992.

[12] M. Bercovier and M. Engelman. A ﬁnite-element for the numerical solution of viscous incompressible

ﬂows. J. Comp. Phys., 30:181–201, 1979.

[13] M. Bercovier and M. Engelman. A ﬁnite-element method for incompressible Non-Newtonian ﬂows.

J. Comp. Phys., 36:313–326, 1980.

[14] P.G. Betts, W.G. Mason, and L. Moresi. The inﬂuence of a mantle plume head on the dynamics of

a retreating subduction zone. Geology, 40(8):739–742, 2012.

[15] B. Blankenbach, F. Busse, U. Christensen, L. Cserepes, D. Gunkel, U. Hansen, H. Harder, G. Jarvis,

M. Koch, G. Marquart, D. Moore, P. Olson, H. Schmeling, and T. Schnaubelt. A benchmark

comparison for mantle convection codes. Geophys. J. Int., 98:23–38, 1989.

[16] P. B. Bochev, C. R. Dohrmann, and M. D. Gunzburger. Stabilization of low-order mixed finite elements for the Stokes equations. SIAM J. Numer. Anal., 44(1):82–101, 2006.

[17] J. Braun, C. Thieulot, P. Fullsack, M. DeKool, and R.S. Huismans. DOUAR: a new three-

dimensional creeping ﬂow model for the solution of geological problems. Phys. Earth. Planet. Inter.,

171:76–91, 2008.

[18] J. Braun and P. Yamato. Structural evolution of a three-dimensional, ﬁnite-width crustal wedge.

Tectonophysics, 484:181–192, doi:10.1016/j.tecto.2009.08.032, 2009.

[19] S. Brune and J. Autin. The rift to break-up evolution of the Gulf of Aden: Insights from 3D

numerical lithospheric-scale modelling. Tectonophysics, 607:65–79, 2013.

[20] S. Brune, A.A. Popov, and S. Sobolev. Modeling suggests that oblique extension facilitates rifting

and continental break-up. J. Geophys. Res., 117(B08402), 2012.

[21] S. Brune, A.A. Popov, and S. Sobolev. Quantifying the thermo-mechanical impact of plume arrival

on continental break-up. Tectonophysics, 604:51–59, 2013.


[22] H.H. Bui, R. Fukugawa, K. Sako, and S. Ohno. Lagrangian meshfree particles method (SPH) for large deformation and failure flows of geomaterial using elastic-plastic soil constitutive model. Int. J. Numer. Anal. Geomech., 32(12):1537–1570, 2008.

[23] P.S. Bullen. Handbook of Means and Their Inequalities. Springer; 2nd edition, 2003.

[24] E. Burov and T. Gerya. Asymmetric three-dimensional topography over mantle plumes. Nature,

513:doi:10.1038/nature13703, 2014.

[25] C. Burstedde, G. Stadler, L. Alisic, L.C. Wilcox, E. Tan, M. Gurnis, and O. Ghattas. Large-scale

adaptive mantle convection simulation. Geophy. J. Int., 192:889–906, 2013.

[26] F.H. Busse, U. Christensen, R. Clever, L. Cserepes, C. Gable, E. Giannandrea, L. Guillou, G. Houseman, H.-C. Nataf, M. Ogawa, M. Parmentier, C. Sotin, and B. Travis. 3D convection at infinite Prandtl number in Cartesian geometry - a benchmark comparison. Geophys. Astrophys. Fluid Dynamics, 75:39–59, 1993.

[27] F.A. Capitanio and M. Faccenda. Complex mantle ﬂow around heterogeneous subducting oceanic

plates. Earth Planet. Sci. Lett., 353-354:29–37, 2012.

[28] F.A. Capitanio, C. Faccenna, S. Zlotnik, and D.R. Stegman. Subduction dynamics and the origin

of Andean orogeny and the Bolivian orocline. Nature, 480:doi:10.1038/nature10596, 2011.

[29] F.A. Capitanio and A. Replumaz. Subduction and slab breakoff controls on Asian indentation tectonics and Himalayan western syntaxis formation. Geochem. Geophys. Geosyst., 14(9):doi:10.1002/ggge.20171, 2013.

[30] J.S. Chen, C. Pan, and T.Y.P. Chang. On the control of pressure oscillation in bilinear-displacement

constant-pressure element. Comput. Methods Appl. Mech. Engrg., 128:137–152, 1995.

[31] M.V. Chertova, T. Geenen, A. van den Berg, and W. Spakman. Using open sidewalls for modelling self-consistent lithosphere subduction dynamics. Solid Earth, 3:313–326, 2012.

[32] Edmund Christiansen and Knud D. Andersen. Computation of collapse states with von Mises type yield condition. International Journal for Numerical Methods in Engineering, 46:1185–1202, 1999.

[33] Edmund Christiansen and Ole S. Pedersen. Automatic mesh refinement in limit analysis. International Journal for Numerical Methods in Engineering, 50:1331–1346, 2001.

[34] M. Crouzeix and P.A. Raviart. Conforming and non-conforming finite element methods for solving the stationary Stokes equations. RAIRO, 7:33–76, 1973.

[35] C. Cuvelier, A. Segal, and A.A. van Steenhoven. Finite Element Methods and Navier-Stokes Equations. D. Reidel Publishing Company, 1986.

[36] M. Dabrowski, M. Krotkiewski, and D.W. Schmid. Milamin: Matlab based ﬁnite element solver

for large problems. Geochem. Geophys. Geosyst., 9(4):Q04030, doi:10.1029/2007GC001719, 2008.

[37] D.R. Davies, C.R. Wilson, and S.C. Kramer. Fluidity: A fully unstructured anisotropic adaptive

mesh computational modeling framework for geodynamics. Geochem. Geophys. Geosyst., 12(6),

2011.

[38] P. Davy and P. Cobbold. Indentation tectonics in nature and experiment. 1. experiments scaled for

gravity. Bulletin of the Geological Institutions of Uppsala, 14:129–141, 1988.

[39] J. de Frutos, V. John, and J. Novo. Projection methods for incompressible flow problems with WENO finite difference schemes. J. Comp. Phys., 309:368–386, 2016.

[40] Y. Deubelbeiss and B.J.P. Kaus. Comparison of Eulerian and Lagrangian numerical techniques for

the Stokes equations in the presence of strongly varying viscosity. Phys. Earth Planet. Interiors,

171:92–111, 2008.


[41] C.R. Dohrmann and P.B. Bochev. A stabilized ﬁnite element method for the Stokes problem based

on polynomial pressure projections. Int. J. Num. Meth. Fluids, 46:183–201, 2004.

[42] J. Donea and A. Huerta. Finite Element Methods for Flow Problems. 2003.

[43] J. Donea and A. Huerta. Finite Element Methods for Flow Problems. John Wiley & Sons, 2003.

[44] Jean Donea and Antonio Huerta. Finite Element Methods for Flow Problems. John Wiley & Sons,

2003.

[45] T. Duretz, D.A. May, T.V. Gerya, and P.J. Tackley. Discretization errors and free surface stabilisation in the finite difference and marker-in-cell method for applied geodynamics: A numerical study. Geochem. Geophys. Geosyst., 12(Q07004), 2011.

[46] David L. Egholm. A new strategy for discrete element numerical models: 1. Theory. J. Geophys. Res., 112:B05203, doi:10.1029/2006JB004557, 2007.

[47] David L. Egholm, Mike Sandiford, Ole R. Clausen, and Søren B. Nielsen. A new strategy

for discrete element numerical models: 2. Sandbox applications. J. Geophys. Res., 112:B05204,

doi:10.1029/2006JB004558, 2007.

[48] R. Eid. Higher order isoparametric finite element solution of Stokes flow. Applied Mathematics and Computation, 162:1083–1101, 2005.

[49] S.M. Ellis, T.A. Little, L.M. Wallace, B.R. Hacker, and S.J.H. Buiter. Feedback between rifting

and diapirism can exhume ultrahigh-pressure rocks. Earth Planet. Sci. Lett., 311:427–438, 2011.

[50] E. Erturk. Discussions on Driven Cavity Flow. Int. J. Num. Meth. Fluids, 60:275–294, 2009.

[51] M. Faccenda. Mid mantle seismic anisotropy around subduction zones. Phys. Earth. Planet. Inter.,

227:1–19, 2014.

[52] M. Faccenda and F.A. Capitanio. Seismic anisotropy around subduction zones: Insights from three-dimensional modeling of upper mantle deformation and SKS splitting calculations. Geochem. Geophys. Geosyst., 14(1):doi:10.1029/2012GC004451, 2013.

[53] P.J. Frey and P.-L. George. Mesh generation. Hermes Science, 2000.

[54] P. Fullsack. An arbitrary Lagrangian-Eulerian formulation for creeping ﬂows and its application in

tectonic models. Geophy. J. Int., 120:1–23, 1995.

[55] M. Furuichi and D. Nishiura. Robust coupled ﬂuid-particle simulation scheme in Stokes-ﬂow

regime: Toward the geodynamic simulation including granular media. Geochem. Geophys. Geosyst.,

15:2865–2882, 2014.

[56] M. Gerbault, A.N.B. Poliakov, and M. Daignieres. Prediction of faulting from the theories of

elasticity and plasticity: what are the limits? Journal of Structural Geology, 20:301–320, 1998.

[57] Taras Gerya. Numerical Geodynamic Modelling. Cambridge University Press, 2010.

[58] T.V. Gerya, D.A. May, and T. Duretz. An adaptive staggered grid ﬁnite diﬀerence method for

modeling geodynamic Stokes ﬂows with strongly variable viscosity. Geochem. Geophys. Geosyst.,

14(4), 2013.

[59] G.H. Golub and C.F. van Loan. Matrix Computations, 4th edition. Johns Hopkins University Press, 2013.

[60] P.M. Gresho and R.L. Sani. Incompressible ﬂow and the Finite Element Method, vol II. John Wiley

and Sons, Ltd, 2000.

[61] D. Griffiths and D. Silvester. Unstable modes of the Q1-P0 element. Technical Report 257, University of Manchester/UMIST, 1994.


[62] M. Gunzburger. Finite Element Methods for Viscous Incompressible Flows: A Guide to Theory,

Practice and Algorithms. Academic, Boston, 1989.

[63] D.L. Hansen. A meshless formulation for geodynamic modeling. J. Geophys. Res.,

108:doi:10.1029/2003JB002460, 2003.

[64] C. Heine and S. Brune. Oblique rifting of the Equatorial Atlantic: Why there is no Saharan Atlantic

Ocean. Geology, 42(3):211–214, 2014.

[65] T. Heister, J. Dannberg, R. Gassmöller, and W. Bangerth. High Accuracy Mantle Convection Simulation through Modern Numerical Methods. II: Realistic Models and Problems. Geophy. J. Int., 210(2):833–851, 2017.

[66] T.J.R. Hughes. The Finite Element Method. Linear Static and Dynamic Finite Element Analysis.

Dover Publications, Inc., 2000.

[67] T.J.R. Hughes, W.K. Liu, and A. Brooks. Finite element analysis of incompressible viscous ﬂows

by the penalty function formulation. J. Comp. Phys., 30:1–60, 1979.

[68] Hoon Huh, Choong Ho Lee, and Wei H. Yang. A general algorithm for plastic ﬂow simulation by

ﬁnite element limit analysis. International Journal of Solids and Structures, 36:1193–1207, 1999.

[69] Alik Ismail-Zadeh and Paul Tackley. Computational Methods for Geodynamics. Cambridge University Press, 2010.

[70] J. Ita and S.D. King. Sensitivity of convection with an endothermic phase change to the form of governing equations, initial conditions, boundary conditions, and equation of state. J. Geophys. Res., 99(B8):15,919–15,938, 1994.

[71] M.A. Jadamec and M.I. Billen. The role of rheology and slab shape on rapid mantle ﬂow: Three-

dimensional numerical models of the Alaska slab edge. J. Geophys. Res., 117(B02304), 2012.

[72] G.T. Jarvis. Eﬀects of curvature on two-dimensional models of mantle convection: cylindrical polar

coordinates. J. Geophys. Res., 98(B3):4477–4485, 1993.

[73] C. Jaupart and J.-C. Mareschal. Heat Generation and Transport in the Earth. Cambridge, 2011.

[74] L. Jolivet, P. Davy, and P. Cobbold. Right-lateral shear along the Northwest Paciﬁc margin and

the India-Eurasia collision. Tectonics, 9(6):1409–1419, 1990.

[75] L.M. Kachanov. Fundamentals of the Theory of Plasticity. Dover Publications, Inc., 2004.

[76] M. Kimmritz and M. Braack. Discretization of the hydrostatic Stokes system by stabilized finite elements of equal order.

[77] S. King, C. Lee, P. van Keken, W. Leng, S. Zhong, E. Tan, N. Tosi, and M. Kameyama. A community benchmark for 2D Cartesian compressible convection in the Earth's mantle. Geophy. J. Int., 180:73–87, 2010.

[78] J.R. Koseff and R.L. Street. The Lid-Driven Cavity Flow: A Synthesis of Qualitative and Quantitative Observations. J. Fluids Eng, 106:390–398, 1984.

[79] M. Kronbichler, T. Heister, and W. Bangerth. High accuracy mantle convection simulation through modern numerical methods. Geophy. J. Int., 191:12–29, 2012.

[80] S.M. Lechmann, S.M. Schmalholz, G. Hetenyi, D.A. May, and B.J.P. Kaus. Quantifying the impact

of mechanical layering and underthrusting on the dynamics of the modern India-Asia collisional

system with 3-D numerical models. J. Geophys. Res., 119:doi:10.1002/2012JB009748, 2014.

[81] R. Lee, P. Gresho, and R. Sani. Smoothing techniques for certain primitive variable solutions of the Navier-Stokes equations. Int. Journal for Numerical Methods in Engineering, 14:1785–1804, 1979.


[82] W. Leng and S. Zhong. Viscous heating, adiabatic heating and energetic consistency in compressible

mantle convection. Geophy. J. Int., 173:693–702, 2008.

[83] W. Leng and S. Zhong. Implementation and application of adaptive mesh refinement for thermochemical mantle convection studies. Geochem. Geophys. Geosyst., 12(4), 2011.

[84] J. Li, Y. He, and Z. Chen. Performance of several stabilized ﬁnite element methods for the Stokes

equations based on the lowest equal-order pairs. Computing, 86:37–51, 2009.

[85] Z.-H. Li, Z. Xu, T. Gerya, and J.-P. Burg. Collision of continental corner from 3-D numerical

modeling. Earth Planet. Sci. Lett., 380:98–111, 2013.

[86] J. Liao and T. Gerya. From continental rifting to seaﬂoor spreading: Insight from 3D thermo-

mechanical modeling. Gondwana Research, 2014.

[87] X. Liu and S. Zhong. Analyses of marginal stability, heat transfer and boundary layer properties for

thermal convection in a compressible ﬂuid with inﬁnite Prandtl number. Geophy. J. Int., 194:125–

144, 2013.

[88] C. Loiselet, J. Braun, L. Husson, C. Le Carlier de Veslud, C. Thieulot, P. Yamato, and

D. Grujic. Subducting slabs: Jellyﬁshes in the Earth’s mantle. Geochem. Geophys. Geosyst.,

11(8):doi:10.1029/2010GC003172, 2010.

[89] C. Malatesta, T. Gerya, L. Crispini, L. Federico, and G. Capponi. Oblique subduction modelling

indicates along-trench tectonic transport of sediments. Nature Communications, 4, 2013.

[90] D.S. Malkus and T.J.R. Hughes. Mixed ﬁnite element methods - reduced and selective integration

techniques: a uniﬁcation of concepts. Comput. Meth. Appl. Mech. Eng., 15:63–81, 1978.

[91] L.E. Malvern. Introduction to the mechanics of a continuous medium. Prentice-Hall, Inc., 1969.

[92] W.G. Mason, L. Moresi, P.G. Betts, and M.S. Miller. Three-dimensional numerical models of the

inﬂuence of a buoyant oceanic plateau on subduction zones. Tectonophysics, 483:71–79, 2010.

[93] D.A. May, J. Brown, and L. Le Pourhiet. A scalable, matrix-free multigrid preconditioner for ﬁnite

element discretizations of heterogeneous Stokes ﬂow. Computer Methods in Applied Mechanics and

Engineering, 290:496–523, 2015.

[94] A. Mizukami. A mixed ﬁnite element method for boundary ﬂux computation. Computer Methods

in Applied Mechanics and Engineering, 57:239–243, 1986.

[95] P. Molnar and P. Tapponnier. Relation of the tectonics of eastern China to the India-Eurasia

collision: Application of the slip-line ﬁeld theory to large-scale continental tectonics. Geology,

5:212–216, 1977.

[96] L. Moresi, P.G. Betts, M.S. Miller, and R.A. Cayley. Dynamics of continental accretion. Nature,

2014.

[97] L. Moresi, S. Quenette, V. Lemiale, C. Mériaux, B. Appelbe, and H.-B. Mühlhaus. Computational approaches to studying non-linear dynamics of the crust and mantle. Phys. Earth. Planet. Inter., 163:69–82, 2007.

[98] M. Nettesheim, T.A. Ehlers, D.M. Whipp, and A. Koptev. The inﬂuence of upper-plate advance

and erosion on overriding plate deformation in orogen syntaxes. Solid Earth, 9:1207–1224, 2018.

[99] S. Norburn and D. Silvester. Fourier analysis of stabilized Q1-Q1 mixed finite element approximation. SIAM J. Numer. Anal., 39:817–833, 2001.

[100] C. O'Neill, L. Moresi, D. Müller, R. Albert, and F. Dufour. Ellipsis 3D: a particle-in-cell finite element hybrid code for modelling mantle convection and lithospheric deformation. Computers and Geosciences, 32:1769–1779, 2006.


[101] G. Peltzer and P. Tapponnier. Formation and evolution of strike-slip faults, rifts, and basins during the India-Asia collision: an experimental approach. J. Geophys. Res., 93(B12):15085–15177, 1988.

[102] A. Pinelli and A. Vacca. Chebyshev collocation method and multidomain decomposition for the

incompressible Navier-Stokes equations. International Journal for numerical methods in ﬂuids,

18:781–799, 1994.

[103] L. Le Pourhiet, B. Huet, D.A. May, L. Labrousse, and L. Jolivet. Kinematic interpretation of the

3D shapes of metamorphic core complexes. Geochem. Geophys. Geosyst., 13(Q09002), 2012.

[104] A.E. Pusok, B.J.P. Kaus, and A.A. Popov. On the Quality of Velocity Interpolation Schemes for Marker-in-Cell Method and Staggered Grids. Pure and Applied Geophysics, doi:10.1007/s00024-016-1431-8, 2016.

[105] C. Püthe and T. Gerya. Dependence of mid-ocean ridge morphology on spreading rate in numerical 3-D models. Gondwana Research, 25:270–283, 2014.

[106] R.N. Pysklywec, S.M. Ellis, and A.R. Gorman. Three-dimensional mantle lithosphere deformation

at collisional plate boundaries: A subduction scissor across the South Island of New Zealand.

Earth Planet. Sci. Lett., 289:334–346, 2010.

[107] T. Rabczuk, P.M.A. Areias, and T. Belytschko. A simplified mesh-free method for shear bands with cohesive surfaces. Int. J. Num. Meth. Eng., 69:993–1021, 2007.

[108] J.N. Reddy. On penalty function methods in the ﬁnite element analysis of ﬂow problems.

Int. J. Num. Meth. Fluids, 2:151–171, 1982.

[109] J. Revenaugh and B. Parsons. Dynamic topography and gravity anomalies for ﬂuid layers whose

viscosity varies exponentially with depth. Geophysical Journal of the Royal Astronomical Society,

90(2):349–368, 1987.

[110] J.G. Rice and R.J. Schnipke. An equal-order velocity-pressure formulation that does not exhibit

spurious pressure modes. Computer Methods in Applied Mechanics and Engineering, 58:135–149,

1986.

[111] J.B. Ruh, T. Gerya, and J.-P. Burg. High-resolution 3D numerical modeling of thrust wedges: Influence of décollement strength on transfer zones. Geochem. Geophys. Geosyst., 14(4):1131–1155, 2014.

[112] Y. Saad. Iterative methods for sparse linear systems. SIAM, 2003.

[113] R.L. Sani, P.M. Gresho, R.L. Lee, and D.F. Griﬃths. The cause and cure (?) of the spurious

pressures generated by certain FEM solutions of the incompressible Navier-Stokes equations: part

1. Int. J. Num. Meth. Fluids, 1:17–43, 1981.

[114] R.L. Sani, P.M. Gresho, R.L. Lee, D.F. Griffiths, and M. Engelman. The cause and cure (?) of the spurious pressures generated by certain FEM solutions of the incompressible Navier-Stokes equations: part 2. Int. J. Num. Meth. Fluids, 1:171–204, 1981.

[115] W.P. Schellart and L. Moresi. A new driving mechanism for backarc extension and backarc shortening through slab sinking induced toroidal and poloidal mantle flow: Results from dynamic subduction models with an overriding plate. J. Geophys. Res., 118:1–28, 2013.

[116] S.M. Schmalholz. A simple analytical solution for slab detachment. Earth Planet. Sci. Lett.,

304:45–54, 2011.

[117] H. Schmeling, A.Y. Babeyko, A. Enns, C. Faccenna, F. Funiciello, T. Gerya, G.J. Golabek,

S. Grigull, B.J.P. Kaus, G. Morra, S.M. Schmalholz, and J. van Hunen. A benchmark comparison of

spontaneous subduction models - Towards a free surface. Phys. Earth. Planet. Inter., 171:198–223,

2008.


[118] D.W. Schmid and Y.Y. Podladchikov. Analytical solutions for deformable elliptical inclusions in general shear. Geophy. J. Int., 155:269–288, 2003.

[119] G. Schubert, D.L. Turcotte, and P. Olson. Mantle Convection in the Earth and Planets. Cambridge

University Press, 2001.

[120] A. Segal. Finite element methods for the incompressible Navier-Stokes equations. 2012.

[121] M. Spiegelman, D.A. May, and C. Wilson. On the solvability of incompressible Stokes with viscoplastic rheologies in geodynamics. Geochem. Geophys. Geosyst., 17:2213–2238, 2016.

[122] G. Stadler, M. Gurnis, C. Burstedde, L.C. Wilcox, L. Alisic, and O. Ghattas. The dynamics of

plate tectonics and mantle ﬂow: from local to global scales. Science, 329:1033–1038, 2010.

[123] D.R. Stegman, W.P. Schellart, and J. Freeman. Competing inﬂuences of plate width and far-ﬁeld

boundary conditions on trench migration and morphology of subducted slabs in the upper mantle.

Tectonophysics, 483:46–57, 2010.

[124] J. Suckale, J.-C. Nave, and B.H. Hager. It takes three to tango: 1. Simulating buoyancy-driven

ﬂow in the presence of large viscosity contrasts. J. Geophys. Res., 115(B07409), 2010.

[125] P. Tackley. Three-dimensional models of mantle convection: Inﬂuence of phase transitions and

temperature-dependent viscosity. PhD thesis, California Institute of Technology, 1994.

[126] E. Tan and M. Gurnis. Compressible thermochemical convection and application to lower mantle

structures. J. Geophys. Res., 112(B06304), 2007.

[127] P. Tapponnier and P. Molnar. Slip-line field theory and large-scale continental tectonics.

Nature, 264:319–324, November 1976.

[128] T.E. Tezduyar, S. Mittal, S.E. Ray, and R. Shih. Incompressible flow computations with stabilized

bilinear and linear equal-order-interpolation velocity-pressure elements. Comput. Methods Appl.

Mech. Engrg., 95:221–242, 1992.

[129] M. Thielmann, D.A. May, and B.J.P. Kaus. Discretization errors in the Hybrid Finite Element

Particle-In-Cell Method. Pure and Applied Geophysics, 2014.

[130] C. Thieulot. FANTOM: two- and three-dimensional numerical modelling of creeping flows for the

solution of geological problems. Phys. Earth. Planet. Inter., 188:47–68, 2011.

[131] C. Thieulot. GHOST: Geoscientific Hollow Sphere Tessellation. Solid Earth, 9(1–9), 2018.

[132] C. Thieulot, P. Fullsack, and J. Braun. Adaptive octree-based finite element analysis of two- and

three-dimensional indentation problems. J. Geophys. Res., 113:B12207, 2008.

[133] C. Thieulot, P. Steer, and R.S. Huismans. Three-dimensional numerical simulations of crustal

systems undergoing orogeny and subjected to surface processes. Geochem. Geophys. Geosyst.,

15:doi:10.1002/2014GC005490, 2014.

[134] J.F. Thompson, B.K. Soni, and N.P. Weatherill. Handbook of grid generation. CRC press, 1998.

[135] N. Tosi, C. Stein, L. Noack, C. Huettig, P. Maierova, H. Samuel, D.R. Davies, C.R. Wilson,

S.C. Kramer, C. Thieulot, A. Glerum, M. Fraters, W. Spakman, A. Rozel, and P.J. Tackley. A

community benchmark for viscoplastic thermal convection in a 2-D square box.

Geochem. Geophys. Geosyst., 16(7):2175–2196, 2015.

[136] R.A. Trompert and U. Hansen. On the Rayleigh number dependence of convection with a strongly

temperature-dependent viscosity. Physics of Fluids, 10(2):351–360, 1998.

[137] D.L. Turcotte and G. Schubert. Geodynamics, 2nd edition. Cambridge, 2012.

[138] P.E. van Keken, S.D. King, H. Schmeling, U.R. Christensen, D. Neumeister, and M.-P. Doin.

A comparison of methods for the modeling of thermochemical convection. J. Geophys. Res.,

102(B10):22,477–22,495, 1997.


[139] H. Wang, R. Agrusta, and J. van Hunen. Advantages of a conservative velocity interpolation

(CVI) scheme for particle-in-cell methods with application in geodynamic modeling.

Geochem. Geophys. Geosyst., 16:doi:10.1002/2015GC005824, 2015.

[140] D.M. Whipp, C. Beaumont, and J. Braun. Feeding the aneurysm: Orogen-parallel mass

transport into Nanga Parbat and the western Himalayan syntaxis. J. Geophys. Res.,

119:doi:10.1002/2013JB010929, 2014.

[141] S.D. Willett. Dynamic and kinematic growth and change of a coulomb wedge. In K.R. McClay,

editor, Thrust Tectonics, pages 19–31. Chapman and Hall, 1992.

[142] M. Wolstencroft, J.H. Davies, and D.R. Davies. Nusselt–Rayleigh number scaling for spherical shell

Earth mantle simulation up to a Rayleigh number of 10^9. Phys. Earth. Planet. Inter., 176:132–141,

2009.

[143] H. Xing, W. Yu, and J. Zhang. In Advances in Geocomputing, Lecture Notes in Earth Sciences.

Springer-Verlag, Berlin Heidelberg, 2009.

[144] P. Yamato, L. Husson, J. Braun, C. Loiselet, and C. Thieulot. Influence of surrounding plates on

3D subduction dynamics. Geophys. Res. Lett., 36(L07303):doi:10.1029/2008GL036942, 2009.

[145] X. Yu and F. Tin-Loi. A simple mixed finite element for static limit analysis. Computers and

Structures, 84:1906–1917, 2006.

[146] S. Zhong. Analytic solutions for Stokes flow with lateral variations in viscosity. Geophys. J. Int.,

124:18–28, 1996.

[147] S. Zhong, M. Gurnis, and G. Hulbert. Accurate determination of surface normal stress in viscous

flow from a consistent boundary flux method. Phys. Earth. Planet. Inter., 78:1–8, 1993.

[148] S. Zhong, A. McNamara, E. Tan, L. Moresi, and M. Gurnis. A benchmark study on mantle

convection in a 3-D spherical shell using CITCOMS. Geochem. Geophys. Geosyst., 9(10), 2008.

[149] S.J. Zhong, D.A. Yuen, and L.N. Moresi. Treatise on Geophysics Volume 7: Mantle Dynamics.

Elsevier B.V., 2007.

[150] G. Zhu, T.V. Gerya, P.J. Tackley, and E. Kissling. Four-dimensional numerical modeling of crustal

growth at active continental margins. J. Geophys. Res., 118:4682–4698, 2013.

[151] O. Zienkiewicz and S. Nakazawa. The penalty function method and its application to the numerical

solution of boundary value problems. The American Society of Mechanical Engineers, 51, 1982.

[152] O.C. Zienkiewicz, M. Huang, and M. Pastor. Localization problems in plasticity using finite

elements with adaptive remeshing. International Journal for Numerical and Analytical Methods in

Geomechanics, 19:127–148, 1995.

[153] O.C. Zienkiewicz, C. Humpheson, and R.W. Lewis. Associated and non-associated visco-plasticity

and plasticity in soil mechanics. Géotechnique, 25(4):671–689, 1975.


Index

P1, 32, 34

P2, 32

P3, 33

Pm×Pn, 22

Pm×P−n, 22

Q1×P0, 91, 97, 99, 105, 112, 116, 120, 122, 138,

145, 157, 159, 163, 178, 179

Q1, 23, 27

Q2×Q1, 22, 123, 128

Q2, 23, 29, 35

2, 30

Q3×Q2, 131

Q3, 24, 31

Qm×P−n, 22

Qm×Qn, 22

Qm×Q−n, 22

analytical solution, 91, 99, 116, 120, 128, 131, 134,

138, 145, 163, 179

arithmetic mean, 113

basis functions, 27

Boussinesq, 13

bubble function, 22

bulk modulus, 14, 140

bulk viscosity, 12

buoyancy-driven flow, 93

CBF, 163

CG, 70

checkerboard, 66

cohesion, 65

Compressed Sparse Column, 58

Compressed Sparse Row, 58

compressibility, 140

compressible flow, 145

conforming element, 22

conjugate gradient, 70

connectivity array, 62

convex polygon, 61

CSC, 58

CSR, 58

Dirichlet boundary condition, 16

divergence free, 36

Drucker-Prager, 65

dynamic viscosity, 12

Gauss quadrature, 20

geometric mean, 113

harmonic mean, 113

hyperbolic PDE, 79

incompressible flow, 99, 105, 112, 116, 120, 122,

128, 131, 134, 138, 157, 159, 163, 178, 179

isoparametric, 80

isothermal, 91, 97, 99, 105, 112, 116, 120, 122, 128,

131, 134, 138, 157, 159, 163, 179

isoviscous, 91, 97, 116, 120, 128, 131, 138, 145, 163,

178

Lamé parameter, 14

Legendre polynomial, 20

method of manufactured solutions, 51

midpoint rule, 19

mixed formulation, 116, 120, 122, 128, 131, 134,

138, 145, 157, 159, 163, 178

MMS, 51

Neumann boundary condition, 16

Newton’s method, 85

Newton-Cotes, 20

non-conforming element, 22

non-isoviscous, 99, 105, 112, 122, 134, 179

nonconforming Q1×P0, 134

nonlinear, 159

nonlinear rheology, 105

numerical benchmark, 157

open boundary conditions, 178

particle-in-cell, 112

penalty formulation, 38, 91, 93, 97, 99, 105, 112,

179

piecewise, 22

Poisson ratio, 14

preconditioned conjugate gradient, 72

pressure normalisation, 124

pressure scaling, 68

pressure smoothing, 66, 116, 134, 138, 145, 163

quadrature, 20

rectangle rule, 19

Rens Elbertsen, 164, 185

Schur complement, 70

Schur complement approach, 120, 122

second viscosity, 12

shear modulus, 14

SPD, 70

static condensation, 50

Stokes sphere, 93, 122

strain tensor, 14

strong form, 36

structured grid, 61


tensor invariant, 64

thermal expansion, 140

trapezoidal rule, 19

unstructured grid, 61

Viscosity Rescaling Method, 65

von Mises, 65

VRM, 65

weak form, 36, 39

work against gravity, 143

Young’s modulus, 14


Notes

o finish. not happy with definition. Look at literature . . . 17

o derive formula for Earth size R1 and R2 . . . 18

o insert here figure . . . 19

o citation needed . . . 29

o verify those . . . 31

o verify those . . . 34

o list codes which use this approach . . . 38

o insert here link(s) to manual and literature . . . 41

o say something about minus sign? . . . 44

o other time discr schemes! . . . 49

o sort out mess wrt Eq 26 of busa13 . . . 54

o add more examples coming from geo . . . 61

o produce drawing of node numbering . . . 63

o insert here the rederivation 2.1.1 of spmw16 . . . 65

o produce figure to explain this . . . 67

o link to proto paper . . . 67

o link to least square and nodal derivatives . . . 67

o how to compute M for the Schur complement? . . . 73

o to finish . . . 89

o build S and have python compute its smallest and largest eigenvalues as a function of resolution? . . . 120

o explain how Medge is arrived at! . . . 162

o compare with ASPECT??! . . . 162

o gauss-lobatto integration? . . . 162

o pressure average on surface instead of volume? . . . 162
