Manual SPECFEM3D Cartesian

User Manual:

Open the PDF directly: View PDF .
Page Count: 118 [warning: Documents this large are best viewed by clicking the View PDF Link!]

Contents
1 Introduction
- 1.1 Citation
- 1.2 Support
2 Getting Started
- 2.1 Adding OpenMP support in addition to MPI
- 2.2 Compiling on an IBM BlueGene
- 2.3 Visualizing the subroutine calling tree of the source code
- 2.4 Using the ADIOS library for I/O
- 2.5 Becoming a developer of the code, or making small modifications in the source code
3 Mesh Generation
- 3.1 Meshing with CUBIT
- 3.2 Meshing with xmeshfem3D
4 Creating the Distributed Databases
- 4.1 Main parameter file Par_file
- 4.2 Choosing the time step DT
5 Running the Solver xspecfem3D
- 5.1 Note on the viscoelastic model used
6 Kinematic and dynamic fault sources
- 6.1 Mesh Generation with Split Nodes
- 6.2 CUBIT-Python Scripts for Faults
- 6.3 Examples
- 6.4 Sign Convention for Fault Quantities
- 6.5 Input Files
- 6.6 Setting the Kelvin-Voigt Damping Parameter
- 6.7 Output Files
- 6.8 Post-processing and Visualization
7 Adjoint Simulations
- 7.1 Adjoint Simulations for Sources Only (not for the Model)
- 7.2 Adjoint Simulations for Finite-Frequency Kernels (Kernel Simulation)
8 Doing tomography based on the sensitivity kernels obtained
- 8.1 Tomographic full waveform inversion (FWI) / imaging using the sensitivity kernels obtained
  - 8.1.1 Principle
  - 8.1.2 Computation of the gradient based on the adjoint method
- 8.2 Tomographic tools
- 8.3 OLD VERSION OF THE SECTION, WILL SOON BE IMPROVED AND ENRICHED
9 Performing full waveform inversion (FWI) or source inversions
10 Noise Cross-correlation Simulations
- 10.1 Input Parameter Files
- 10.2 Noise Simulations: Step by Step
- 10.3 Example
11 Gravity integral calculations for the gravity field of an Earth model
12 Graphics
- 12.1 Meshes
- 12.2 Movies
  - 12.2.1 Movie Surface and Shakemaps
  - 12.2.2 Movie Volume
- 12.3 Finite-Frequency Kernels
13 Running through a Scheduler
14 Changing the Model
- 14.1 Using external tomographic Earth models
- 14.2 External (an)elastic Models
- 14.3 Anisotropic Models
- 14.4 Using external SEP models
15 Post-Processing Scripts
- 15.1 Process Data and Synthetics
- 15.2 Collect Synthetic Seismograms
- 15.3 Clean Local Database
- 15.4 Plot Movie Snapshots and Synthetic Shakemaps
  - 15.4.1 Script movie2gif.gmt.pl
  - 15.4.2 Script plot_shakemap.gmt.pl
- 15.5 Map Local Database
16 Information for developers of the code, and for people who want to learn how the technique works
Notes & Acknowledgments
Copyright
Bibliography
A Reference Frame Convention
B Channel Codes of Seismograms
C Troubleshooting
D License

User Manual

COMPUTATIONAL INFRASTRUCTURE FOR GEODYNAMICS (CIG)

Version 3.0

CNRS (FRANCE)

PRINCETON UNIVERSITY (USA)

ETH ZÜRICH (SWITZERLAND)

SPECFEM 3D

Cartesian

→

SPECFEM3D

Cartesian

User Manual

and ETH Zürich (Switzerland)

Version 3.0

June 15, 2018

Authors

The SPECFEM3D package was ﬁrst developed by Dimitri Komatitsch and Jean-Pierre Vilotte at Institut de Physique

du Globe (IPGP) in Paris, France from 1995 to 1997 and then by Dimitri Komatitsch and Jeroen Tromp at Harvard

University and Caltech, USA, starting in 1998. The story started on April 4, 1995, when Prof. Yvon Maday from

CNRS and University of Paris, France, gave a lecture to Dimitri Komatitsch and Jean-Pierre Vilotte at IPG about

the nice properties of the Legendre spectral-element method with diagonal mass matrix that he had used for other

equations. We are deeply indebted and thankful to him for that. That followed a visit by Dimitri Komatitsch to OGS

(Istituto Nazionale di Oceanograﬁa e di Geoﬁsica Sperimentale) in Trieste, Italy, in February 1995 to meet with Géza

Seriani and Enrico Priolo, who introduced him to their 2D Chebyshev version of the spectral-element method with a

non-diagonal mass matrix. We are deeply indebted and thankful to them for that.

Since then it has been developed and maintained by a development team: in alphabetical order, Michael Afanasiev,

Jean-Paul (Pablo) Ampuero, Étienne Bachmann, Kangchen Bai, Piero Basini, Céline Blitz, Alexis Bottero, Ebru Boz-

da˘

g, Emanuele Casarotti, Joseph Charles, Min Chen, Paul Cristini, Clément Durochat, Percy Galvez, Hom Nath

Gharti, Dominik Göddeke, Vala Hjörleifsdóttir, Sue Kientz, Dimitri Komatitsch, Jesús Labarta, Nicolas Le Goff,

Pieyre Le Loher, Matthieu Lefebvre, Qinya Liu, Youshan Liu, David Luet, Yang Luo, Alessia Maggi, Federica

Magnoni, Roland Martin, René Matzen, Dennis McRitchie, Matthias Meschede, Peter Messmer, David Michéa, Vadim

Monteiller, Surendra Nadh Somala, Masaru Nagaso, Tarje Nissen-Meyer, Daniel Peter, Kevin Pouget, Max Rietmann,

Elliott Sales de Andrade, Brian Savage, Bernhard Schuberth, Anne Sieminski, James Smith, Leif Strand, Carl Tape,

Jeroen Tromp, Brice Videau, Jean-Pierre Vilotte, Zhinan Xie, Chang-Hua Zhang, Hejun Zhu.

The cover graphic of the manual was created by Santiago Lombeyda from Caltech’s Center for Advanced Com-

puting Research (CACR), USA, with free satellite clipart pictures from http://www.4vector.com and http:

//www.clker.com added to it.

The code is released open-source under the GNU version 3 license, see the license at the end of this manual.

Current and past main participants or main sponsors of the SPECFEM project

(in no particular order)

Contents

Contents 3

1 Introduction 6

1.1 Citation .................................................. 8

1.2 Support .................................................. 10

2 Getting Started 11

2.1 Adding OpenMP support in addition to MPI ............................... 14

2.2 Compiling on an IBM BlueGene ..................................... 14

2.3 Visualizing the subroutine calling tree of the source code ........................ 15

2.4 Using the ADIOS library for I/O ..................................... 16

2.5 Becoming a developer of the code, or making small modiﬁcations in the source code ......... 17

3 Mesh Generation 18

3.1 Meshing with CUBIT ........................................... 18

3.1.1 Creating the Mesh with CUBIT ................................. 19

3.1.2 Exporting the Mesh with run_cubit2specfem3d.py ................... 20

3.1.3 Partitioning the Mesh with xdecompose_mesh ........................ 24

3.2 Meshing with xmeshfem3D ....................................... 25

4 Creating the Distributed Databases 30

4.1 Main parameter ﬁle Par_file ..................................... 30

4.2 Choosing the time step DT ........................................ 36

5 Running the Solver xspecfem3D 39

5.1 Note on the viscoelastic model used ................................... 45

6 Kinematic and dynamic fault sources 47

6.1 Mesh Generation with Split Nodes .................................... 47

6.2 CUBIT-Python Scripts for Faults ..................................... 49

6.3 Examples ................................................. 50

6.4 Sign Convention for Fault Quantities ................................... 50

6.5 Input Files ................................................. 51

6.6 Setting the Kelvin-Voigt Damping Parameter .............................. 53

6.7 Output Files ............................................... 54

6.8 Post-processing and Visualization .................................... 55

7 Adjoint Simulations 56

7.1 Adjoint Simulations for Sources Only (not for the Model) ........................ 56

7.2 Adjoint Simulations for Finite-Frequency Kernels (Kernel Simulation) ................. 57

CONTENTS 4

8 Doing tomography based on the sensitivity kernels obtained 59

8.1 Tomographic full waveform inversion (FWI) / imaging using the sensitivity kernels obtained ..... 59

8.1.1 Principle ............................................. 59

8.1.2 Computation of the gradient based on the adjoint method .................... 60

8.2 Tomographic tools ............................................ 62

8.2.1 Summing kernels ......................................... 63

8.2.2 Smoothing and post-processing ................................. 65

8.2.3 Model updating .......................................... 66

8.3 OLD VERSION OF THE SECTION, WILL SOON BE IMPROVED AND ENRICHED ....... 67

9 Performing full waveform inversion (FWI) or source inversions 68

10 Noise Cross-correlation Simulations 69

10.1 Input Parameter Files ........................................... 69

10.2 Noise Simulations: Step by Step ..................................... 70

10.2.1 Pre-simulation .......................................... 70

10.2.2 Simulations ............................................ 71

10.2.3 Post-simulation .......................................... 72

10.3 Example .................................................. 72

11 Gravity integral calculations for the gravity ﬁeld of an Earth model 73

12 Graphics 74

12.1 Meshes .................................................. 74

12.2 Movies .................................................. 74

12.2.1 Movie Surface and Shakemaps .................................. 75

12.2.2 Movie Volume .......................................... 76

12.3 Finite-Frequency Kernels ......................................... 77

13 Running through a Scheduler 80

14 Changing the Model 81

14.1 Using external tomographic Earth models ................................ 81

14.2 External (an)elastic Models ........................................ 83

14.3 Anisotropic Models ............................................ 84

14.4 Using external SEP models ........................................ 84

15 Post-Processing Scripts 85

15.1 Process Data and Synthetics ....................................... 85

15.1.1 Data processing script process_data.pl .......................... 85

15.1.2 Synthetics processing script process_syn.pl ........................ 86

15.1.3 Script rotate.pl ....................................... 86

15.2 Collect Synthetic Seismograms ...................................... 86

15.3 Clean Local Database ........................................... 87

15.4 Plot Movie Snapshots and Synthetic Shakemaps ............................. 87

15.4.1 Script movie2gif.gmt.pl .................................. 87

15.4.2 Script plot_shakemap.gmt.pl ............................... 87

15.5 Map Local Database ........................................... 87

16 Information for developers of the code, and for people who want to learn how the technique works 88

Notes & Acknowledgments 90

Bibliography 92

CONTENTS 5

A Reference Frame Convention 101

B Channel Codes of Seismograms 103

C Troubleshooting 105

Chapter 1

Introduction

FIRST ANNOUNCEMENT: SPECFEM3D can now perform full waveform inversion (FWI), i.e. invert for

models in an iterative fashion, and it can also perform source inversions in a constant model; please refer to

the two new directories, inverse_problem_for_model and inverse_problem_for_source, and the

README ﬁles they contain. For FWI inversions for the model, also refer to the new examples provided in

directory EXAMPLES.

SECOND ANNOUNCEMENT: SPECFEM3D can now perform coupling with an external code (DSM,

AxiSEM or FK) based on a database of displacement vectors and traction vectors on the outer edges of the

mesh created once and for all (see Monteiller et al. [2013,2015], Wang et al. [2016], Tong et al. [2014a,b], and if

you use that feature of the code please cite at least one of these papers).

To use coupling with FK, just use the set of parameters that is in the DATA/Par_ﬁle input ﬁle of the code:

#———————————————————–

# Coupling with an injection technique (DSM, AxiSEM, or FK)

#———————————————————–

COUPLE_WITH_INJECTION_TECHNIQUE = .false.

INJECTION_TECHNIQUE_TYPE = 3 # 1 = DSM, 2 = AxiSEM, 3 = FK

MESH_A_CHUNK_OF_THE_EARTH = .false.

TRACTION_PATH = ./DATA/AxiSEM_tractions/3/

FKMODEL_FILE = FKmodel

RECIPROCITY_AND_KH_INTEGRAL = .false. # does not work yet

That part (coupling with FK) is actively maintained and works ﬁne. See e.g. http://komatitsch.free.fr/

preprints/GRL_Ping_Tong_2014.pdf for some examples. There is also an example that is provided with

the code: specfem3d/EXAMPLES/small_example_coupling_FK_specfem.

Regarding coupling with DSM, that part is not actively maintained any more, but it is still included in the code, you may

have to test it again and make minor adjustments if needed. The necessary tools are in directory specfem3d/EXTERNAL_PACKAGES_coupled_with_SPECFEM3D/DSM_for_SPECFEM3D

, and there is a README ﬁle in specfem3d/EXTERNAL_PACKAGES_coupled_with_SPECFEM3D that should be

more or less up-to-date (there are about four steps to follow in total, the ﬁrst one being creating the database of DSM

tractions and displacements on the edges of the coupling box).

See e.g. http://komatitsch.free.fr/preprints/GJI1_Vadim_2013.pdf and http://komatitsch.

free.fr/preprints/GJI2_Vadim_2015.pdf for some examples.

THIRD ANNOUNCEMENT: SPECFEM3D can now perform gravity ﬁeld calculations in addition (or in-

stead of) seismic wave propagation only. See ﬂag GRAVITY_INTEGRALS in ﬁle setup/constants.h.in.

Please also refer to http://komatitsch.free.fr/preprints/GJI_Martin_gravimetry_2017.pdf.

And yes, that is the reason why I added a falling apple on the cover of the manual :-). Note that SPECFEM3D

can also model transient gravity perturbations induced by earthquake rupture, as developed and explained in

Harms et al. [2015]. These are two different things, and both are implemented and avaible in SPECFEM3D. To

CHAPTER 1. INTRODUCTION 7

use the second feature, please refer to doc/how_to/README_gravity_perturbation.txt.

The software package SPECFEM3D Cartesian simulates seismic wave propagation at the local or regional scale

and performs full waveform imaging (FWI) or adjoint tomography based upon the spectral-element method (SEM).

The SEM is a continuous Galerkin technique [Tromp et al.,2008,Peter et al.,2011], which can easily be made discon-

tinuous [Bernardi et al.,1994,Chaljub,2000,Kopriva et al.,2002,Chaljub et al.,2003,Legay et al.,2005,Kopriva,

2006,Wilcox et al.,2010,Acosta Minolia and Kopriva,2011]; it is then close to a particular case of the discontinuous

Galerkin technique [Reed and Hill,1973,Lesaint and Raviart,1974,Arnold,1982,Johnson and Pitkäranta,1986,

Bourdel et al.,1991,Falk and Richter,1999,Hu et al.,1999,Cockburn et al.,2000,Giraldo et al.,2002,Rivière and

Wheeler,2003,Monk and Richter,2005,Grote et al.,2006,Ainsworth et al.,2006,Bernacki et al.,2006,Dumbser and

Käser,2006,De Basabe et al.,2008,de la Puente et al.,2009,Wilcox et al.,2010,De Basabe and Sen,2010,Étienne

et al.,2010], with optimized efﬁciency because of its tensorized basis functions [Wilcox et al.,2010,Acosta Minolia

and Kopriva,2011]. In particular, it can accurately handle very distorted mesh elements [Oliveira and Seriani,2011].

In ﬂuids, when gravity is turned off, SPECFEM3D uses the classical linearized Euler equation; thus if you

have sharp local variations of density in the ﬂuid (highly heterogeneous ﬂuids in terms of density) or if density

becomes extremely small in some regions of your model (e.g. for upper-atmosphere studies), before using the

code please make sure the linearized Euler equation is a valid approximation in the case you want to study,

and/or see if you should turn gravity on. For more details on that see e.g. Jensen et al. [2011].

It has very good accuracy and convergence properties [Maday and Patera,1989,Seriani and Priolo,1994,Deville

et al.,2002,Cohen,2002,De Basabe and Sen,2007,Seriani and Oliveira,2008,Ainsworth and Wajid,2009,2010,

Melvin et al.,2012]. The spectral element approach admits spectral rates of convergence and allows exploiting hp-

convergence schemes. It is also very well suited to parallel implementation on very large supercomputers [Komatitsch

et al.,2003,Tsuboi et al.,2003,Komatitsch et al.,2008,Carrington et al.,2008,Komatitsch et al.,2010b] as well as

on clusters of GPU accelerating graphics cards [Komatitsch,2011,Michéa and Komatitsch,2010,Komatitsch et al.,

2009,2010a]. Tensor products inside each element can be optimized to reach very high efﬁciency [Deville et al.,2002],

and mesh point and element numbering can be optimized to reduce processor cache misses and improve cache reuse

[Komatitsch et al.,2008]. The SEM can also handle triangular (in 2D) or tetrahedral (in 3D) elements [Wingate and

Boyd,1996,Taylor and Wingate,2000,Komatitsch et al.,2001,Cohen,2002,Mercerat et al.,2006] as well as mixed

meshes, although with increased cost and reduced accuracy in these elements, as in the discontinuous Galerkin method.

Note that in many geological models in the context of seismic wave propagation studies (except for instance for

fault dynamic rupture studies, in which very high frequencies or supershear rupture need to be modeled near the fault,

see e.g. Benjemaa et al. [2007,2009], de la Puente et al. [2009], Tago et al. [2010]) a continuous formulation is

sufﬁcient because material property contrasts are not drastic and thus conforming mesh doubling bricks can efﬁciently

handle mesh size variations [Komatitsch and Tromp,2002a,Komatitsch et al.,2004,Lee et al.,2008,2009a,b].

For a detailed introduction to the SEM as applied to regional seismic wave propagation, please consult Peter et al.

[2011], Tromp et al. [2008], Komatitsch and Vilotte [1998], Komatitsch and Tromp [1999], Chaljub et al. [2007] and

in particular Lee et al. [2009b,a,2008], Godinho et al. [2009], van Wijk et al. [2004], Komatitsch et al. [2004]. A

detailed theoretical analysis of the dispersion and stability properties of the SEM is available in Cohen [2002], De

Basabe and Sen [2007], Seriani and Oliveira [2007], Seriani and Oliveira [2008] and Melvin et al. [2012].

Effects due to lateral variations in compressional-wave speed, shear-wave speed, density, a 3D crustal model,

topography and bathymetry are included. The package can accommodate full 21-parameter anisotropy (see Chen

and Tromp [2007]) as well as lateral variations in attenuation [Savage et al.,2010]. Adjoint capabilities and ﬁnite-

frequency kernel simulations are included [Tromp et al.,2008,Peter et al.,2011,Liu and Tromp,2006,Fichtner et al.,

2009a,Virieux and Operto,2009].

The SEM was originally developed in computational ﬂuid dynamics [Patera,1984,Maday and Patera,1989] and

has been successfully adapted to address problems in seismic wave propagation. Early seismic wave propagation ap-

plications of the SEM, utilizing Legendre basis functions and a perfectly diagonal mass matrix, include Cohen et al.

[1993], Komatitsch [1997], Faccioli et al. [1997], Casadei and Gabellini [1997], Komatitsch and Vilotte [1998] and

CHAPTER 1. INTRODUCTION 8

Komatitsch and Tromp [1999], whereas applications involving Chebyshev basis functions and a non-diagonal mass

matrix include Seriani and Priolo [1994], Priolo et al. [1994] and Seriani et al. [1995]. In the Legendre version that

we use in SPECFEM the mass matrix is purposely slightly inexact but diagonal (but can be made exact if needed, see

Teukolsky [2015]), while in the Chebyshev version it is exact but non diagonal.

Beware that, in a spectral-element method, some spurious modes (that have some similarities with classical

so-called "Hourglass modes" in ﬁnite-element techniques, although in the SEM they are not zero-energy modes)

can appear in some (but not all) cases in the spectral element in which the source is located. Fortunately, they

do not propagate away from the source element. However, this means that if you put a receiver in the same

spectral element as a source, the recorded signals may in some cases be wrong, typically exhibiting some spu-

rious oscillations, which are often even non causal. If that is the case, an easy option is to slightly change the

mesh in the source region in order to get rid of these Hourglass-like spurious modes, as explained in Duczek

et al. [2014], in which this phenomenon is described in details, and in which practical solutions to avoid it are

suggested.

All SPECFEM3D software is written in Fortran2003 with full portability in mind, and conforms strictly to the

Fortran2003 standard. It uses no obsolete or obsolescent features of Fortran. The package uses parallel programming

based upon the Message Passing Interface (MPI) [Gropp et al.,1994,Pacheco,1997].

SPECFEM3D won the Gordon Bell award for best performance at the SuperComputing 2003 conference in

Phoenix, Arizona (USA) (see Komatitsch et al. [2003] and www.sc-conference.org/sc2003/nrfinalaward.

html). It was a ﬁnalist again in 2008 for a run at 0.16 petaﬂops (sustained) on 149,784 processors of the ‘Jaguar’

Cray XT5 system at Oak Ridge National Laboratories (USA) [Carrington et al.,2008]. It also won the BULL Joseph

Fourier supercomputing award in 2010.

It reached the sustained one petaﬂop performance level for the ﬁrst time in February 2013 on the Blue Waters Cray

supercomputer at the National Center for Supercomputing Applications (NCSA), located at the University of Illinois

at Urbana-Champaign (USA).

This new release of the code includes Convolution or Auxiliary Differential Equation Perfectly Matched absorbing

Layers (C-PML or ADE-PML) [Martin et al.,2008b,c,Martin and Komatitsch,2009,Martin et al.,2010,Komatitsch

and Martin,2007]. It also includes support for GPU graphics card acceleration [Komatitsch,2011,Michéa and Ko-

matitsch,2010,Komatitsch et al.,2009,2010a].

The next release of the code will use the PT-SCOTCH parallel and threaded version of SCOTCH for mesh parti-

tioning instead of the serial version.

SPECFEM3D Cartesian includes coupled ﬂuid-solid domains and adjoint capabilities, which enables one to ad-

dress seismological inverse problems, but for linear rheologies only so far. To accommodate visco-plastic or non-linear

rheologies, the reader can refer to the GeoELSEsoftware package [Casadei and Gabellini,1997,Stupazzini et al.,

2009].

1.1 Citation

You can ﬁnd all the references below in BIBT

EXformat in ﬁle doc/USER_MANUAL/bibliography.bib.

If you use SPECFEM3D Cartesian for your own research, please cite at least one of the following articles:

Numerical simulations in general

Forward and adjoint simulations are described in detail in Tromp et al. [2008], Peter et al. [2011], Vai et al.

[1999], Komatitsch et al. [2009,2010a], Chaljub et al. [2007], Madec et al. [2009], Komatitsch et al. [2010b],

Carrington et al. [2008], Tromp et al. [2010a], Komatitsch et al. [2002], Komatitsch and Tromp [2002a,b,1999]

or Komatitsch and Vilotte [1998]. Additional aspects of adjoint simulations are described in Tromp et al. [2005],

Liu and Tromp [2006], Tromp et al. [2008], Liu and Tromp [2008], Tromp et al. [2010a], Peter et al. [2011].

CHAPTER 1. INTRODUCTION 9

Domain decomposition is explained in detail in Martin et al. [2008a], and excellent scaling up to 150,000

processor cores is shown for instance in Carrington et al. [2008], Komatitsch et al. [2008], Martin et al. [2008a],

Komatitsch et al. [2010a], Komatitsch [2011],

GPU computing

Computing on GPU graphics cards for acoustic or seismic wave propagation applications is described in detail

in Komatitsch [2011], Michéa and Komatitsch [2010], Komatitsch et al. [2009,2010a].

If you use this new version, which has non blocking MPI for much better performance for medium or large runs,

please cite at least one of these six articles, in which results of non blocking MPI runs are presented: Peter et al.

[2011], Komatitsch et al. [2010a,b], Komatitsch [2011], Carrington et al. [2008], Martin et al. [2008a].

If you use the C-PML absorbing layer capabilities of the code, please cite at least one article written by the

developers of the package, for instance:

•Xie et al. [2014],

•Xie et al. [2016].

If you use the UNDO_ATTENUATION option of the code in order to produce full anelastic/viscoelastic sensitivity

kernels, please cite at least one article written by the developers of the package, for instance (and in particular):

•Komatitsch et al. [2016].

More generally, if you use the attenuation (anelastic/viscoelastic) capabilities of the code, please cite at least one article

written by the developers of the package, for instance:

•Komatitsch et al. [2016],

•Blanc et al. [2016].

If you use the kernel capabilities of the code, please cite at least one article written by the developers of the package,

for instance:

•Tromp et al. [2008],

•Peter et al. [2011],

•Liu and Tromp [2006],

•Morency et al. [2009].

If you work on geophysical applications, you may be interested in citing some of these application articles as well,

among others:

Southern California simulations

Komatitsch et al. [2004], Krishnan et al. [2006a,b].

If you use the 3D southern California model, please cite Süss and Shaw [2003] (Los Angeles model), Lovely

et al. [2006] (Salton Trough), and Hauksson [2000] (southern California). The Moho map was determined by

Zhu and Kanamori [2000]. The 1D SoCal model was developed by Dreger and Helmberger [1990].

Anisotropy

Chen and Tromp [2007], Ji et al. [2005], Chevrot et al. [2004], Favier et al. [2004], Ritsema et al. [2002], Tromp

and Komatitsch [2000].

Attenuation

Savage et al. [2010], Komatitsch and Tromp [2002a,1999].

Topography

Lee et al. [2009b,a,2008], Godinho et al. [2009], van Wijk et al. [2004].

The corresponding BibT

EX entries may be found in ﬁle doc/USER_MANUAL/bibliography.bib.

CHAPTER 1. INTRODUCTION 10

1.2 Support

This material is based upon work supported by the USA National Science Foundation under Grants No. EAR-0406751

and EAR-0711177, by the French CNRS, French INRIA Sud-Ouest MAGIQUE-3D, French ANR NUMASIS under

Grant No. ANR-05-CIGC-002, and European FP6 Marie Curie International Reintegration Grant No. MIRG-CT-

2005-017461. Any opinions, ﬁndings, and conclusions or recommendations expressed in this material are those of the

authors and do not necessarily reﬂect the views of the USA National Science Foundation, CNRS, INRIA, ANR or the

European Marie Curie program.

Chapter 2

Getting Started

To download the SPECFEM3D_Cartesian software package, type this:

git clone --recursive --branch devel https://github.com/geodynamics/specfem3d.git

Note: for people who would like to run the package on Windows rather than on Unix machines, you can in-

stall Docker or VirtualBox (installing a Linux in VirtualBox in that latter case) and run it easily from inside that.

We recommend that you add ulimit -S -s unlimited to your .bash_profile ﬁle and/or limit stacksize

unlimited to your .cshrc ﬁle to suppress any potential limit to the size of the Unix stack.

Then, to conﬁgure the software for your system, run the configure shell script. This script will attempt to guess

the appropriate conﬁguration values for your system. However, at a minimum, it is recommended that you explicitly

specify the appropriate command names for your Fortran compiler (another option is to deﬁne FC, CC and MPIF90 in

your .bash_proﬁle or your .cshrc ﬁle):

./configure FC=gfortran CC=gcc

If you want to run in parallel, i.e., using more than one processor core, then you would type

./configure FC=gfortran CC=gcc MPIFC=mpif90 --with-mpi

You can replace the GNU compilers above (gfortran and gcc) with other compilers if you want to; for instance for

Intel ifort and icc use FC=ifort CC=icc instead.

Note that MPI must be installed with MPI-IO enabled because parts of SPECFEM3D perform I/Os through MPI-

IO.

Before running the configure script, you should probably edit ﬁle flags.guess to make sure that it contains

the best compiler options for your system. Known issues or things to check are:

Intel ifort compiler See if you need to add -assume byterecl for your machine. In the case of that

compiler, we have noticed that initial release versions sometimes have bugs or issues that can lead to wrong

results when running the code, thus we strongly recommend using a version for which at least one service

pack or update has been installed. In particular, for version 17 of that compiler, users have reported

problems (making the code crash at run time) with the -assume buffered_io option; if you notice

problems, remove that option from ﬁle flags.guess or change it to -assume nobuffered_io and

try again.

IBM compiler See if you need to add -qsave or -qnosave for your machine.

Mac OS You will probably need to install XCODE.

When compiling on an IBM machine with the xlf and xlc compilers, we suggest running the configure script

with the following options:

./configure FC=xlf90_r MPIFC=mpif90 CC=xlc_r CFLAGS="-O3 -q64" FCFLAGS="-O3 -q64" -with-scotch-dir=...

CHAPTER 2. GETTING STARTED 12

If you have problems conﬁguring the code on a Cray machine, i.e. for instance if you get an error message

from the configure script, try exporting these two variables: MPI_INC=$CRAY_MPICH2_DIR/include and

FCLIBS=" ", and for more details if needed you can refer to the utils/Cray_compiler_information

directory.

On SGI systems, flags.guess automatically informs configure to insert ‘‘TRAP_FPE=OFF” into the gen-

erated Makefile in order to turn underﬂow trapping off.

You can add -enable-vectorization to the conﬁguration options to speed up the code in the ﬂuid (acous-

tic) and elastic parts. This works ﬁne if (and only if) your computer always allocates a contiguous memory block

for each allocatable array; this is the case for most machines and most compilers, but not all. To disable this feature,

use option -disable-vectorization. For more details see github.com/geodynamics/specfem3d/issues/81 . To

check if that option works ﬁne on your machine, run the code with and without it for an acoustic/elastic model and

make sure the seismograms are identical.

Note that we use CUBIT (now called Trelis) to create meshes of hexahedra, but other packages can be used as well,

for instance GiD from http://gid.cimne.upc.es or Gmsh from http://geuz.org/gmsh [Geuzaine and

Remacle,2009]. Even mesh creation packages that generate tetrahedra, for instance TetGen from http://tetgen.

berlios.de, can be used because each tetrahedron can then easily be decomposed into four hexahedra as shown

in the picture of the TetGen logo at http://tetgen.berlios.de/figs/Delaunay-Voronoi-3D.gif;

while this approach does not generate hexahedra of optimal quality, it can ease mesh creation in some situations and

it has been shown that the spectral-element method can very accurately handle distorted mesh elements [Oliveira and

Seriani,2011].

The SPECFEM3D Cartesian software package relies on the SCOTCH library to partition meshes created with

CUBIT. METIS [Karypis and Kumar,1998a,c,b] can also be used instead of SCOTCH if you prefer, by editing

ﬁle src/decompose_mesh/decompose_mesh.F90 and uncommenting the ﬂag USE_METIS_INSTEAD_OF_SCOTCH.

You will also then need to install and compile Metis version 4.0 (do *NOT* install Metis version 5.0, which has in-

compatible function calls) and edit src/decompose_mesh/Makefile.in and uncomment the METIS link ﬂag

in that ﬁle before running configure.

The SCOTCH library [Pellegrini and Roman,1996] provides efﬁcient static mapping, graph and mesh partitioning

routines. SCOTCH is a free software package developed by François Pellegrini et al. from LaBRI and INRIA in

Bordeaux, France, downloadable from the web page https://gforge.inria.fr/projects/scotch/. In

case no SCOTCH libraries can be found on the system, the conﬁguration will bundle the version provided with the

source code for compilation. The path to an existing SCOTCH installation can to be set explicitly with the option

--with-scotch-dir. Just as an example:

./configure FC=ifort MPIFC=mpif90 --with-scotch-dir=/opt/scotch

If you use the Intel ifort compiler to compile the code, we recommend that you use the Intel icc C compiler to compile

Scotch, i.e., use:

./configure CC=icc FC=ifort MPIFC=mpif90

When compiling for GPU cards i.e. using CUDA, you can use:

./configure -with-cuda

for CUDA version 4, or

./configure -with-cuda=cuda5

for CUDA version 5, or

./configure -with-cuda=cuda6

for CUDA version 6, or

./configure -with-cuda=cuda8

for CUDA version 8.

Before CUDA version 5, one version supported basically one new architecture and needed a different kind of

compilation. Since version 5, the compilation has stayed the same, but newer versions supported newer architectures.

However at the moment, we still have one version linked to one speciﬁc architecture:

- CUDA 4 for Fermi, - CUDA 5 for Tesla K20 - CUDA 6 for Tesla K80 - CUDA 8 for Pascal P100.

CHAPTER 2. GETTING STARTED 13

So even if you have the new CUDA toolkit version 8, but you want to run on say a K20 GPU, then you would still

conﬁgure with:

./configure -with-cuda=cuda5

The compilation should work and the cuda5 setting chooses the right architecture (-gencode=arch=compute_35,code=sm_35

for K20 cards).

So, type

./configure -with-cuda=cuda8

for Pascal P100 boards.

When compiling the SCOTCH source code, if you get a message such as: "ld: cannot ﬁnd -lz", the Zlib com-

pression development library is probably missing on your machine and you will need to install it or ask your system

administrator to do so. On Linux machines the package is often called "zlib1g-dev" or similar. (thus "sudo apt-get

install zlib1g-dev" would install it)

To compile a serial version of the code for small meshes that ﬁts on one compute node and can therefore be run

serially, run configure with the --without-mpi option to suppress all calls to MPI.

A summary of the most important conﬁguration variables follows.

F90 Path to the Fortran compiler.

MPIF90 Path to MPI Fortran.

MPI_FLAGS Some systems require this ﬂag to link to MPI libraries.

FLAGS_CHECK Compiler ﬂags.

The conﬁguration script automatically creates for each executable a corresponding Makefile in the src/ subdi-

rectory. The Makefile contains a number of suggested entries for various compilers, e.g., Portland, Intel, Absoft,

NAG, and Lahey. The software has run on a wide variety of compute platforms, e.g., various PC clusters and machines

from Sun, SGI, IBM, Compaq, and NEC. Select the compiler you wish to use on your system and choose the related

optimization ﬂags. Note that the default ﬂags in the Makefile are undoubtedly not optimal for your system, so we

encourage you to experiment with these ﬂags and to solicit advice from your systems administrator. Selecting the right

compiler and optimization ﬂags can make a tremendous difference in terms of performance. We welcome feedback

on your experience with various compilers and ﬂags.

Now that you have set the compiler information, you need to select a number of ﬂags in the constants.h ﬁle

depending on your system:

LOCAL_PATH_IS_ALSO_GLOBAL Set to .false. on most cluster applications. For reasons of speed, the (par-

allel) distributed database generator typically writes a (parallel) database for the solver on the local disks of the

compute nodes. Some systems have no local disks, e.g., BlueGene or the Earth Simulator, and other systems

have a fast parallel ﬁle system, in which case this ﬂag should be set to .true.. Note that this ﬂag is not used

by the database generator or the solver; it is only used for some of the post-processing.

The package can run either in single or in double precision mode. The default is single precision because for almost

all calculations performed using the spectral-element method using single precision is sufﬁcient and gives the same

results (i.e. the same seismograms); and the single precision code is faster and requires exactly half as much memory.

Select your preference by selecting the appropriate setting in the constants.h ﬁle:

CUSTOM_REAL Set to SIZE_REAL for single precision and SIZE_DOUBLE for double precision.

In the precision.h ﬁle:

CUSTOM_MPI_TYPE Set to MPI_REAL for single precision and MPI_DOUBLE_PRECISION for double precision.

On many current processors (e.g., Intel, AMD, IBM Power), single precision calculations are signiﬁcantly faster; the

difference can typically be 10% to 25%. It is therefore better to use single precision. What you can do once for the

physical problem you want to study is run the same calculation in single precision and in double precision on your

system and compare the seismograms. If they are identical (and in most cases they will), you can select single preci-

sion for your future runs.

If your compiler has problems with the use mpi statements that are used in the code, use the script called

replace_use_mpi_with_include_mpif_dot_h.pl in the root directory to replace all of them with include

’mpif.h’ automatically.

CHAPTER 2. GETTING STARTED 14

2.1 Adding OpenMP support in addition to MPI

NOTE FROM JULY 2013: OpenMP support is maybe / probably not maintained any more. Thus the section below is

maybe obsolete.

OpenMP support can be enabled in addition to MPI. However, in many cases performance will not improve because

our pure MPI implementation is already heavily optimized and thus the resulting code will in fact be slightly slower.

A possible exception could be IBM BlueGene-type architectures.

To enable OpenMP, uncomment the OpenMP compiler option in two lines in ﬁle src/specfem3D/Makefile.in

(before running configure) and also uncomment the #define USE_OPENMP statement in ﬁle

src/specfem3D/specfem3D.F90.

The DO-loop using OpenMP threads has a SCHEDULE property. The OMP_SCHEDULE environment variable can

set the scheduling policy of that DO-loop. Tests performed by Marcin Zielinski at SARA (The Netherlands) showed

that often the best scheduling policy is DYNAMIC with the size of the chunk equal to the number of OpenMP threads,

but most preferably being twice as the number of OpenMP threads (thus chunk size = 8 for 4 OpenMP threads etc).

If OMP_SCHEDULE is not set or is empty, the DO-loop will assume generic scheduling policy, which will slow down

the job quite a bit.

2.2 Compiling on an IBM BlueGene

More recent installation instruction for IBM BlueGene, from April 2013:

Edit ﬁle flags.guess and put this for FLAGS_CHECK:

-g -qfullpath -O2 -qsave -qstrict -qtune=qp -qarch=qp -qcache=auto -qhalt=w

-qfree=f90 -qsuffix=f=f90 -qlanglvl=95pure -Q -Q+rank,swap_all -Wl,-relax

The most relevant are the -qarch and -qtune ﬂags, otherwise if these ﬂags are set to “auto” then they are wrongly

assigned to the architecture of the frond-end node, which is different from that on the compute nodes. You will need

to set these ﬂags to the right architecture for your BlueGene compute nodes, which is not necessarily “qp”; ask your

system administrator. On some machines if is necessary to use -O2 in these ﬂags instead of -O3 due to a compiler bug

of the XLF version installed. We thus suggest to ﬁrst try -O3, and then if the code does not compile or does not run

ﬁne then switch back to -O2. The debug ﬂags (-g, -qfullpath) do not inﬂuence performance but are useful to get at

least some insights in case of problems.

Before running configure, select the XL Fortran compiler by typing module load bgq-xl/1.0 or module

load bgq-xl (another, less efﬁcient option is to load the GNU compilers using module load bgq-gnu/4.4.6

or similar).

Then, to conﬁgure the code, type this:

./configure FC=bgxlf90_r MPIFC=mpixlf90_r CC=bgxlc_r LOCAL_PATH_IS_ALSO_GLOBAL=true

In order for the SCOTCH domain decomposer to compile, on some (but not all) Blue Gene systems you may need

to run configure with CC=gcc instead of CC=bgxlc_r.

Older installation instruction for IBM BlueGene, from 2011:

To compile the code on an IBM BlueGene, Laurent Léger from IDRIS, France, suggests the following: compile the

code with

FLAGS_CHECK="-O3 -qsave -qstrict -qtune=auto -qarch=450d -qcache=auto \

-qfree=f90 -qsuffix=f=f90 -g -qlanglvl=95pure -qhalt=w -Q \

-Q+rank,swap_all -Wl,-relax"

Option "-Wl,-relax" must be added on many (but not all) BlueGene systems to be able to link the binaries xmeshfem3D

and xspecfem3D because the ﬁnal link step is done by the GNU ld linker even if one uses FC=bgxlf90_r,

CHAPTER 2. GETTING STARTED 15

MPIFC=mpixlf90_r and CC=bgxlc_r to create all the object ﬁles. On the contrary, on some BlueGene systems

that use the native AIX linker option "-Wl,-relax" can lead to problems and must be suppressed from flags.guess.

Also, AR=ar, ARFLAGS=cru and RANLIB=ranlib are hardwired in all Makefile.in ﬁles by default, but to

cross-compile on BlueGene/P one needs to change these values to AR=bgar, ARFLAGS=cru and RANLIB=bgranlib.

Thus the easiest thing to do is to modify all Makefile.in ﬁles and the configure script to set them automatically

by configure. One then just needs to pass the right commands to the configure script:

./configure --prefix=/path/to/SPECFEM3DG_SP --host=Babel --build=BGP \

FC=bgxlf90_r MPIFC=mpixlf90_r CC=bgxlc_r AR=bgar ARFLAGS=cru \

RANLIB=bgranlib LOCAL_PATH_IS_ALSO_GLOBAL=false

This trick can be useful for all hosts on which one needs to cross-compile.

On BlueGene, one also needs to run the xcreate_header_file binary ﬁle manually rather than in the Makeﬁle:

bgrun -np 1 -mode VN -exe ./bin/xcreate_header_file

2.3 Visualizing the subroutine calling tree of the source code

Packages such as Doxywizard can be used to visualize the calling tree of the subroutines of the source code.

Doxywizard is a GUI front-end for conﬁguring and running Doxygen.

To do your own call graphs, you can follow these simple steps below.

0. Install Doxygen and graphviz (the two are usually in the package manager of classic Linux distribution).

1. Run in the terminal : doxygen -g, which creates a Doxyfile that tells doxygen what you want it to do.

2. Edit the Doxyﬁle. Two Doxyﬁle-type ﬁles have been already committed in the directory

specfem3d/doc/Call_trees:

•Doxyfile_truncated_call_tree will generate call graphs with maximum 3 or 4 levels of tree

structure,

•Doxyfile_complete_call_tree will generate call graphs with complete tree structure.

The important entries in the Doxyﬁle are:

PROJECT_NAME

OPTIMIZE_FOR_FORTRAN Set to YES

EXTRACT_ALL Set to YES

EXTRACT_PRIVATE Set to YES

EXTRACT_STATIC Set to YES

INPUT From the directory specfem3d/doc/Call_trees, it is "../../src/"

FILE_PATTERNS In SPECFEM case, it is *.f90* *.F90* *.c* *.cu* *.h*

HAVE_DOT Set to YES

CALL_GRAPH Set to YES

CALLER_GRAPH Set to YES

DOT_PATH The path where is located the dot program graphviz (if it is not in your $PATH)

RECURSIVE This tag can be used to turn specify whether or not subdirectories should be searched for input

ﬁles as well. In the case of SPECFEM, set to YES.

EXCLUDE Here, you can exclude:

CHAPTER 2. GETTING STARTED 16

../../src/specfem3D/older_not_maintained_partial_OpenMP_port

../../src/decompose_mesh/scotch

../../src/decompose_mesh/scotch_5.1.12b

DOT_GRAPH_MAX_NODES to set the maximum number of nodes that will be shown in the graph. If the

number of nodes in a graph becomes larger than this value, doxygen will truncate the graph, which is

visualized by representing a node as a red box. Minimum value: 0, maximum value: 10000, default value:

50.

MAX_DOT_GRAPH_DEPTH to set the maximum depth of the graphs generated by dot. A depth value of 3

means that only nodes reachable from the root by following a path via at most 3 edges will be shown.

Using a depth of 0 means no depth restriction. Minimum value: 0, maximum value: 1000, default value:

3. Run : doxygen Doxyfile, HTML and LaTeX ﬁles created by default in html and latex subdirectories.

4. To see the call trees, you have to open the ﬁle html/index.html in your browser. You will have many

informations about each subroutines of SPECFEM (not only call graphs), you can click on every boxes / sub-

routines. It show you the call, and, the caller graph of each subroutine : the subroutines called by the concerned

subroutine, and the previous subroutines who call this subroutine (the previous path), respectively. In the case

of a truncated calling tree, the boxes with a red border indicates a node that has more arrows than are shown (in

other words: the graph is truncated with respect to this node).

Finally, some useful links:

• a good and short summary for the basic utilisation of Doxygen:

http://www.softeng.rl.ac.uk/blog/2010/jan/30/callgraph-fortran-doxygen/,

• to conﬁgure the diagrams :

http://www.stack.nl/~dimitri/doxygen/manual/diagrams.html,

• the complete alphabetical index of the tags in Doxyﬁle:

http://www.stack.nl/~dimitri/doxygen/manual/config.html,

• more generally, the Doxygen manual:

http://www.stack.nl/~dimitri/doxygen/manual/index.html.

2.4 Using the ADIOS library for I/O

Regular POSIX I/O can be problematic when dealing with large simulations one large clusters (typically more than

10,000 processes). SPECFEM3D use the ADIOS library Liu et al. [2013] to deal transparently take advantage of

advanced parallel ﬁle system features. To enable ADIOS, the following steps should be done:

1. Install ADIOS (available from https://www.olcf.ornl.gov/center-projects/adios/). Make

sure that your environment variables reference it.

2. You may want to change ADIOS related values in the constants.h.in ﬁle. The default values probably

suit most cases.

3. Conﬁgure using the -with-adios ﬂag.

ADIOS is currently only usable for meshfem3D generated mesh (i.e. not for meshes generated with CUBTI). Addi-

tional control parameters are discussed in section 4.1.

CHAPTER 2. GETTING STARTED 17

2.5 Becoming a developer of the code, or making small modiﬁcations in the

source code

If you want to develop new features in the code, and/or if you want to make small changes, improvements, or bug

ﬁxes, you are very welcome to contribute. To do so, i.e. to access the development branch of the source code with

read/write access (in a safe way, no need to worry too much about breaking the package, there is a robot called

BuildBot that is in charge of checking and validating all new contributions and changes), please visit this Web page:

https://github.com/geodynamics/specfem3d/wiki/Using-Hub.

To visualize the call tree (calling tree) of the source code, you can see the Doxygen tool available in directory

doc/call_trees_of_the_source_code.

Chapter 3

Mesh Generation

The ﬁrst step in running a spectral-element simulation consists of constructing a high-quality mesh for the region

under consideration. We provide two possibilities to do so: (1) relying on the external, hexahedral mesher CUBIT, or

(2) using the provided, internal mesher xmeshfem3D. In the following, we explain these two approaches.

3.1 Meshing with CUBIT

CUBIT is a meshing tool suite for the creation of ﬁnite-element meshes for arbitrarily shaped models. It has been

developed and maintained at Sandia National Laboratories and can be purchased for a small academic institutional

fee at http://cubit.sandia.gov. Our experience showed that using CUBIT greatly facilitates and speeds up

the generation and preparation of hexahedral, conforming meshes for a variety of geophysical models with increasing

complexity.

Figure 3.1: Example of the graphical user interface of CUBIT. The hexahedral mesh shown in the main display consists

of a hexahedral discretization of a single volume with topography.

The basic steps in creating a load-balanced, partitioned mesh with CUBIT are:

1. setting up a hexahedral mesh with CUBIT,

CHAPTER 3. MESH GENERATION 19

2. exporting the CUBIT mesh into a SPECFEM3D Cartesian ﬁle format and

3. partitioning the SPECFEM3D Cartesian mesh ﬁles for a chosen number of cores.

Examples are provided in the SPECFEM3D Cartesian package in the subdirectory EXAMPLES/. We strongly encour-

age you to contribute your own example to this package by contacting the CIG Computational Seismology Mailing

List (cig-seismo@geodynamics.org).

3.1.1 Creating the Mesh with CUBIT

For the installation and handling of the CUBIT meshing tool suite, please refer to the CUBIT user manual and docu-

mentation. In order to give you a basic understanding of how to use CUBIT for our purposes, examples are provided

in the SPECFEM3D Cartesian package in the subdirectory EXAMPLES/:

homogeneous_halfspace Creates a single block model and assigns elastic material parameters.

layered_halfspace Combines two different, elastic material volumes and creates a reﬁnement layer between

the two. This example can be compared for validation against the solutions provided in subdirectory

VALIDATION_3D_SEM_SIMPLER_LAYER_SOURCE_DEPTH/.

waterlayered_halfspace Combines an acoustic and elastic material volume as in a schematic marine survey

example.

tomographic_model Creates a single block model whose material properties will have to be read in from a

tomographic model ﬁle during the databases creation by xgenerate_databases.

Figure 3.2: Screenshots of the CUBIT examples provided in subdirectory EXAMPLES/: homogeneous halfspace

(top-left), layered halfspace (top-right), water layered halfspace (bottom-left) and tomographic model (bottom-right).

In each example subdirectory you will ﬁnd a README ﬁle, which explains in a step-by-step tutorial the workﬂow

for the example. Please feel free to contribute your own example to this package by contacting the CIG Computational

CHAPTER 3. MESH GENERATION 20

Seismology Mailing List (cig-seismo@geodynamics.org).

In some cases, to re-create the meshes for the examples given, just type

claro ./create_mesh.py

or similar from the command line (claro is the command to run CUBIT from the command line).

IMPORTANT: In order to correctly set up GEOCUBIT and run the examples, please read the ﬁle called EXAMPLES/README;

in particular, please make sure you correctly set up the Python paths as indicated in that ﬁle.

Concerning the script create_mesh.py, you may ﬁnd the option use_explicit which chooses how to

assign the material properties for the volume (or domain):

(1) one way is to explicitly assign block attributes to the block of the corresponding volume, by commands like:

cubit.cmd(’block ’+str(id_block)+’ attribute index 2 2800’) # vp

This is done when using use_explicit = 1 in the script. The ﬁnal command:

cubit2specfem3d.export2SPECFEM3D(’MESH/’)

will then create the corresponding MESH/nummaterial_velocity_file for all such deﬁned block vol-

umes/domains.

(2) the other option with use_explicit = 0 is to let GEOCUBIT deal with a dummy entry and then overwrite

the nummaterial_velocity_file at the end with a corresponding command section:

f = open(nummaterial_velocity_file,’w’)

This second way is chosen by default because GEOCUBIT can handle partitions for several processes and

glues everything together automatically. Thus, it adds some more sophistication when the model gets more

complicated than just this single volume example.

You will ﬁnd out by experimenting what is easier for your case.

3.1.2 Exporting the Mesh with run_cubit2specfem3d.py

Once the geometric model volumes in CUBIT are meshed, you prepare the model for exportation with the deﬁnition

of material blocks and boundary surfaces. Thus, prior to exporting the mesh, you need to deﬁne blocks specify-

ing the materials and absorbing boundaries in CUBIT. This process could be done automatically using the script

run_cubit2specfem3d.py if the mesh meets some conditions or manually, following the block convention:

material_name Each material should have a speciﬁc block deﬁned by a unique name. The name convention of the

material is to start with either ’elastic’ or ’acoustic’. It must be then followed by a unique identiﬁer, e.g. ’elastic

1’,’elastic 2’, etc. The additional attributes to the block deﬁne the material description.

For an elastic material:

material_id An integer value which is unique for this material.

Vp P-wave speed of the material (given in m/s).

Vs S-wave speed of the material (given in m/s).

rho density of the material (given in kg/m3).

Qquality factor to use in case of a simulation with attenuation turned on. It should be between 1 and 9000.

In case no attenuation information is available, it can be set to zero. You can either specify a single

Q value, in which case it will be assumed to be pure shear attenuation Qµ, or two separate values for

bulk and shear attenuation, Qκand Qµrespectively. Note that Qmu is always equal to Qs, but Qkappa

is in general not equal to Qp. To convert one to the other see doc/note_on_Qkappa_versus_Qp.pdf and

utils/attenuation/conversion_from_Qkappa_Qmu_to_Qp_Qs_from_Dahlen_Tromp_959_960.f90.

CHAPTER 3. MESH GENERATION 21

Please note that your Vp- and Vs-speeds are given for a reference frequency. To change this refer-

ence frequency, you change the value of ATTENUATION_f0_REFERENCE in the main constants ﬁle

constants.h found in subdirectory src/shared/. The code uses a constant Qquality factor,

write(IMAIN,*) "but approximated based on a series of Zener standard linear solids (SLS). The approxima-

tion is thus performed in a given frequency band determined based on that ATTENUATION_f0_REFERENCE

reference frequency.

anisotropic_ﬂag Flag describing the anisotropic model to use in case an anisotropic simulation should be con-

ducted. See the ﬁle model_aniso.f90 in subdirectory src/generate_databases/ for an im-

plementation of the anisotropic models. In case no anisotropy is available, it can be set to zero.

Note that this material block has to be deﬁned using all the volumes which belong to this elastic material. For

volumes belonging to another, different material, you will need to deﬁne a new material block.

For an acoustic material:

material_id An integer value which is unique for this material.

Vp P-wave speed of the material (given in m/s).

0S-wave speed of the material is ignored.

rho density of the material (given in kg/m3).

face_topo Block deﬁnition for the surface which deﬁnes the free surface (which can have topography). The name of

this block must be ’face_topo’, the block has to be deﬁned using all the surfaces which constitute the complete

free surface of the model.

face_abs_xmin Block deﬁnition for the faces on the absorbing boundaries, one block for each surface with x=Xmin.

face_abs_xmax Block deﬁnition for the faces on the absorbing boundaries, one block for each surface with x=Xmax.

face_abs_ymin Block deﬁnition for the faces on the absorbing boundaries, one block for each surface with y=Ymin.

face_abs_ymax Block deﬁnition for the faces on the absorbing boundaries, one block for each surface with y=Ymax.

face_abs_bottom Block deﬁnition for the faces on the absorbing boundaries, one block for each surface with z=bottom.

Figure 3.3: Example of the block deﬁnitions for the free surface ’face_topo’ (left) and the absorbing boundaries,

deﬁned in a single block ’face_abs_xmin’ (right) in CUBIT.

Optionally, instead of specifying for each surface at the boundaries a single block like mentioned above, you

can also specify a single block for all boundary surfaces and name it as one of the absorbing blocks above, e.g.

’face_abs_xmin’.

CHAPTER 3. MESH GENERATION 22

After the block deﬁnitions are done, you export the mesh using the script cubit2specfem3d.py provided in

each of the example directories (linked to the common script CUBIT_GEOCUBIT/cubit2specfem3d.py). If the

export was successful, you should ﬁnd the following ﬁles in a subdirectory MESH/:

absorbing_cpml_ﬁle (only needed in case of C-PML absorbing conditions) Contains on the ﬁrst line the total

number of C-PML spectral elements in the mesh, and then on the following line the list of all these C-PML

elements with two numbers per line: ﬁrst the spectral element number, and then a C-PML ﬂag indicating to

which C-PML layer(s) that element belongs, according to the following convention:

• Flag = 1 : element belongs to a X CPML layer only (either in Xmin or in Xmax),

• Flag = 2 : element belongs to a Y CPML layer only (either in Ymin or in Ymax),

• Flag = 3 : element belongs to a Z CPML layer only (either in Zmin or in Zmax),

• Flag = 4 : element belongs to a X CPML layer and also to a Y CPML layer,

• Flag = 5 : element belongs to a X CPML layer and also to a Z CPML layer,

• Flag = 6 : element belongs to a Y CPML layer and also to a Z CPML layer,

• Flag = 7 : element belongs to a X, to a Y and to a Z CPML layer, i.e., it belongs to a CPML corner.

Note that it does not matter whether an element belongs to a Xmin or to a Xmax CPML, the ﬂag is the same in

both cases; the same is true for Ymin and Ymax, and also Zmin and Zmax.

When you have an existing CUBIT (or similar) mesh stored in SPECFEM3D format, i.e., if you have exist-

ing nodes_coords_file and mesh_file ﬁles but do not know how to assign CPML ﬂags to them, we

have created a small serial Fortran program that will do that automatically for you, i.e., which will create the

absorbing_cpml_file for you. That program is:

utils/CPML/convert_external_layers_of_a_given_mesh_to_CPML_layers.f90,

and a small Makeﬁle is provided in that directory (utils/CPML).

IMPORTANT: it is your responsibility to make sure that in the input CUBIT (or similar) mesh that this code will

read in SPECFEM3D format from ﬁles nodes_coords_file and mesh_file you have created layers of

elements that constitute a layer of constant thickness aligned with the coordinate grid axes (X, Y and/or Z), so

that this code can assign CPML ﬂags to them. This code does NOT check that (because it cannot, in any easy

way). The mesh inside these CPML layers does not need to be structured nor regular, any non-structured mesh

is ﬁne as long as it has ﬂat PML inner and outer faces, parallel to the axes, and thus of a constant thickness. The

thickness can be different for the X, Y and Z sides. But for X it must not vary, for Y it must not vary, and for Z

it must not vary. If you do not know the exact thickness, you can use a slightly LARGER value in this code (say

2% to 5% more) and this code will ﬁx that and will adjust it; never use a SMALLER value otherwise this code

will miss some CPML elements.

Note: in a future release we will remove the constraint of having CPML layers aligned with the coordinate

axes; we will allow for meshes that are titled by any constant angle in the horizontal plane. However this is not

implemented yet.

Note: in the case of ﬂuid-solid layers in contact, the ﬂuid-solid interface currently needs to be ﬂat and horizontal

inside the CPML layer (i.e., bathymetry should become ﬂat and horizontal when entering the CPML); this

(small) constraint will probably remain in the code for a while because it makes ﬂuid-solid matching inside the

CPML much easier.

materials_ﬁle Contains the material associations for each element. The format is:

element_ID material_ID

where element_ID is the element identiﬁer and material_ID is a unique identiﬁer, positive (for materials

taken from this list of materials, i.e. for which each spectral element has constant material properties taken from

this list) or negative (for tomographic models, i.e. for spectral element whose real velocities and density will be

assigned later by calling an external function to deﬁne model variations, for instance in the case of tomographic

models; in such a case, material properties can vary inside each spectral element, i.e. be different at each of its

Gauss-Lobatto-Legendre grid points).

CHAPTER 3. MESH GENERATION 23

nummaterial_velocity_ﬁle Deﬁnes the material properties.

• For classical materials (i.e., spectral elements for which the velocity and density model will not be assigned

by calling an external function to deﬁne for instance a tomographic model), the format is:

domain_ID material_ID rho vp vs Qkappa Qmu anisotropy_flag

where domain_ID is 1 for acoustic and 2 for elastic or viscoelastic materials, material_ID a

unique identiﬁer, rho the density in kg m−3,vp the P-wave speed in m s−1,vs the S-wave speed in

m s−1,Qthe quality factor and anisotropy_flag an identiﬁer for anisotropic models. Note that

both Qkappa and Qmu are ignored by the code unless ATTENUATION is set. If you want a model

with no Qmu attenuation, both set ATTENUATION to .false. in the Par_file and set Qmu to

9999 here. If you want a model with no Qkappa attenuation, set Qkappa to 9999 here. Note that

Qmu is always equal to Qs, but Qkappa is in general not equal to Qp. To convert one to the other see

doc/note_on_Qkappa_versus_Qp.pdf and utils/attenuation/conversion_from_Qkappa_Qmu_to_Qp_Qs_from_Dahlen_Tromp_959_960.f90.

• For tomographic velocity models, please read Chapter 14 and Section 14.1 ‘Using external tomographic

Earth models’ for further details.

nodes_coords_ﬁle Contains the point locations in Cartesian coordinates of the mesh element corners.

mesh_ﬁle Contains the mesh element connectivity. The hexahedral elements can have 8 or 27 nodes.

See picture doc/mesh_numbering_convention/numbering_convention_27_nodes.jpg to see

in which (standard) order the points must be cited. In the case of 8 nodes, just include the ﬁrst 8 points.

free_or_absorbing_surface_ﬁle_zmax Contains the free surface connectivity or

the surface connectivity of the absorbing boundary surface at the top (Zmax),

depending on whether the top surface is deﬁned as free or absorbing (STACEY_INSTEAD_OF_FREE_SURFACE

in DATA/Par_file).

You should put both the surface of acoustic regions and of elastic regions in that ﬁle; that is, list all the element

faces that constitute the surface of the model in that ﬁle.

absorbing_surface_ﬁle_xmax Contains the surface connectivity of the absorbing boundary surface at Xmax

(also needed in the case of C-PML absorbing conditions, in order for the code to be able to impose Dirichlet