AMD64 Architecture Programmer’s Manual, Volume 3: General Purpose And System Instructions EN Programmer's Manual 3

AMD64 Architecture Programmer's Manual Volume 3 General-Purpose and System Instructions manual pdf -FilePursuit

EN%20-%20AMD64%20Architecture%20Programmer's%20Manual%20Volume%203%20General-Purpose%20and%20System%20Instructions

User Manual: Pdf

Open the PDF directly: View PDF .
Page Count: 474 [warning: Documents this large are best viewed by clicking the View PDF Link!]

Contents
Figures
Tables
Revision History
Preface
1 Instruction Formats
2 Instruction Overview
3 General-Purpose Instruction Reference
4 System Instruction Reference
- ARPL
- CLGI
- CLI
- CLTS
- HLT
- INT 3
- INVD
- INVLPG
- INVLPGA
- IRET IRETD IRETQ
- LAR
- LGDT
- LIDT
- LLDT
- LMSW
- LSL
- LTR
- MONITOR
- MOV (CRn)
- MOV(DRn)
- MWAIT
- RDMSR
- RDPMC
- RDTSC
- RDTSCP
- RSM
- SGDT
- SIDT
- SKINIT
- SLDT
- SMSW
- STI
- STGI
- STR
- SWAPGS
- SYSCALL
- SYSENTER
- SYSEXIT
- SYSRET
- UD2
- VERR
- VERW
- VMLOAD
- VMMCALL
- VMRUN
- VMSAVE
- WBINVD
- WRMSR
Appendix A Opcode and Operand Encodings
Appendix B General-Purpose Instructions in 64-Bit Mode
Appendix C Differences Between Long Mode and Legacy Mode
Appendix D Instruction Subsets and CPUID Feature Sets
Appendix E Instruction Effects on RFLAGS
Index

Advanced Micro Devices

AMD64 Technology

AMD64 Architecture

Programmer’s Manual

Volume 3:

General-Purpose and

System Instructions

Publication No. Revision Date

24594 3.14 September 2007

AMD64 Technology 24594—Rev. 3.14—September 2007

Trademarks

AMD, the AMD arrow logo, AMD Athlon, and AMD Opteron, and combinations thereof, and 3DNow! are trademarks,

and AMD-K6 is a registered trademark of Advanced Micro Devices, Inc.

MMX is a trademark and Pentium is a registered trademark of Intel Corporation.

Windows NT is a registered trademark of Microsoft Corporation.

Other product names used in this publication are for identification purposes only and may be trademarks of their

respective companies.

The contents of this document are provided in connection with Advanced Micro

Devices, Inc. (“AMD”) products. AMD makes no representations or warranties with

respect to the accuracy or completeness of the contents of this publication and

reserves the right to make changes to specifications and product descriptions at

any time without notice. The information contained herein may be of a preliminary

or advance nature and is subject to change without notice. No license, whether

express, implied, arising by estoppel or otherwise, to any intellectual property rights

is granted by this publication. Except as set forth in AMD’s Standard Terms and

Conditions of Sale, AMD assumes no liability whatsoever, and disclaims any

express or implied warranty, relating to its products including, but not limited to, the

implied warranty of merchantability, fitness for a particular purpose, or infringement

of any intellectual property right.

AMD’s products are not designed, intended, authorized or warranted for use as

components in systems intended for surgical implant into the body, or in other appli-

cations intended to support or sustain life, or in any other application in which the

failure of AMD’s product could create a situation where personal injury, death, or

severe property or environmental damage may occur. AMD reserves the right to

discontinue or make changes to its products at any time without notice.

Contents i

24594—Rev. 3.14—September 2007 AMD64 Technology

Contents

Revision History. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii

Preface. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xv

About This Book. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xv

Audience . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xv

Organization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xv

Definitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xvi

Related Documents . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xxvi

1 Instruction Formats. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1

1.1 Instruction Byte Order . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1

1.2 Instruction Prefixes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3

Summary of Legacy Prefixes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3

Operand-Size Override Prefix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .4

Address-Size Override Prefix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .6

Segment-Override Prefixes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .8

Lock Prefix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

Repeat Prefixes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9

REX Prefixes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11

1.3 Opcode. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17

1.4 ModRM and SIB Bytes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .17

1.5 Displacement Bytes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19

1.6 Immediate Bytes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19

1.7 RIP-Relative Addressing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19

Encoding . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20

REX Prefix and RIP-Relative Addressing. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20

Address-Size Prefix and RIP-Relative Addressing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20

2 Instruction Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .21

2.1 Instruction Subsets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21

2.2 Reference-Page Format . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22

2.3 Summary of Registers and Data Types . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

General-Purpose Instructions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

System Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27

128-Bit Media Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .29

64-Bit Media Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .32

x87 Floating-Point Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34

2.4 Summary of Exceptions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35

2.5 Notation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37

Mnemonic Syntax . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37

Opcode Syntax. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39

Pseudocode Definitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .41

ii Contents

AMD64 Technology 24594—Rev. 3.14—September 2007

3 General-Purpose Instruction Reference . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .51

AAA. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53

AAD. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54

AAM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55

AAS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56

ADC. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57

ADD. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59

AND. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61

BOUND . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63

BSF . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65

BSR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66

BSWAP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67

BT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68

BTC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70

BTR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72

BTS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74

CALL (Near) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76

CALL (Far) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78

CBW

CWDE

CDQE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84

CWD

CDQ

CQO. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85

CLC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86

CLD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87

CLFLUSH . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88

CMC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90

CMOVcc . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91

CMP. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94

CMPS

CMPSB

CMPSW

CMPSD

CMPSQ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97

CMPXCHG . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99

CMPXCHG8B

CMPXCHG16B. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101

CPUID . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103

DAA. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105

DAS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 106

DEC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107

DIV . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109

ENTER . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111

IDIV. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113

IMUL . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115

IN . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117

Contents iii

24594—Rev. 3.14—September 2007 AMD64 Technology

INC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 118

INS

INSB

INSW

INSD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120

INT. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 122

INTO . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129

Jcc . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130

JCXZ

JECXZ

JRCXZ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 134

JMP (Near). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 135

JMP (Far) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137

LAHF. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 142

LDS

LES

LFS

LGS

LSS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 143

LEA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 145

LEAVE. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147

LFENCE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 148

LODS

LODSB

LODSW

LODSD

LODSQ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 149

LOOP

LOOPE

LOOPNE

LOOPNZ

LOOPZ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 151

LZCNT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 153

MFENCE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 155

MOV . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 156

MOVD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 159

MOVMSKPD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 162

MOVMSKPS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 164

MOVNTI . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 166

MOVS

MOVSB

MOVSW

MOVSD

MOVSQ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 168

MOVSX . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 170

MOVSXD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171

MOVZX. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 172

iv Contents

AMD64 Technology 24594—Rev. 3.14—September 2007

MUL . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 173

NEG . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 175

NOP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 177

NOT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 178

OR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 179

OUT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181

OUTS

OUTSB

OUTSW

OUTSD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 182

PAUSE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 184

POP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 185

POPA

POPAD. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 187

POPCNT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 188

POPF

POPFD

POPFQ. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 190

PREFETCH

PREFETCHW . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 193

PREFETCHlevel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 195

PUSH . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 197

PUSHA

PUSHAD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 199

PUSHF

PUSHFD

PUSHFQ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 200

RCL . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 202

RCR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 204

RET (Near) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 206

RET (Far). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 207

ROL . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 211

ROR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 213

SAHF . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 215

SAL

SHL . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 216

SAR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 219

SBB . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 221

SCAS

SCASB

SCASW

SCASD

SCASQ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 223

SETcc. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 225

SFENCE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 227

SHL . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 228

SHLD. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 229

Contents v

24594—Rev. 3.14—September 2007 AMD64 Technology

SHR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 231

SHRD. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 233

STC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 235

STD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 236

STOS

STOSB

STOSW

STOSD

STOSQ. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 237

SUB . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 239

TEST . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 241

XADD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 243

XCHG . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 245

XLAT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 247

XLATB . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 247

XOR. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 248

4 System Instruction Reference. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .251

ARPL . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 252

CLGI . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 254

CLI. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 255

CLTS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 257

HLT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 258

INT 3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 259

INVD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 262

INVLPG. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 263

INVLPGA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 264

IRET

IRETD

IRETQ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 265

LAR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 271

LGDT. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 273

LIDT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 275

LLDT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 277

LMSW . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 279

LSL . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 280

LTR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 282

MONITOR. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 284

MOV (CRn) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 286

MOV(DRn) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 288

MWAIT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 290

RDMSR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 292

RDPMC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 293

RDTSC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 294

RDTSCP . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 295

RSM. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 297

SGDT. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 299

SIDT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 300

vi Contents

AMD64 Technology 24594—Rev. 3.14—September 2007

SKINIT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 301

SLDT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 303

SMSW . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 304

STI . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 305

STGI . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 307

STR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 308

SWAPGS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 309

SYSCALL . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 311

SYSENTER . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 315

SYSEXIT. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 317

SYSRET . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 319

UD2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 323

VERR. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 324

VERW . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 326

VMLOAD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 327

VMMCALL. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 329

VMRUN. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 330

VMSAVE. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 335

WBINVD. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 337

WRMSR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 338

Appendix A Opcode and Operand Encodings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .339

A.1 Opcode-Syntax Notation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 339

A.2 Opcode Encodings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 340

One-Byte Opcodes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 340

Two-Byte Opcodes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 343

rFLAGS Condition Codes for Two-Byte Opcodes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 348

ModRM Extensions to One-Byte and Two-Byte Opcodes . . . . . . . . . . . . . . . . . . . . . . . . . . 348

ModRM Extensions to Opcodes 0F 01 and 0F AE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 351

3DNow!™ Opcodes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 351

x87 Encodings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 354

rFLAGS Condition Codes for x87 Opcodes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 363

A.3 Operand Encodings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 363

ModRM Operand References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 363

SIB Operand References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 369

Appendix B General-Purpose Instructions in 64-Bit Mode . . . . . . . . . . . . . . . . . . . . . . . .373

B.1 General Rules for 64-Bit Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 373

B.2 Operation and Operand Size in 64-Bit Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 374

B.3 Invalid and Reassigned Instructions in 64-Bit Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 399

B.4 Instructions with 64-Bit Default Operand Size . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 400

B.5 Single-Byte INC and DEC Instructions in 64-Bit Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . 401

B.6 NOP in 64-Bit Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 401

B.7 Segment Override Prefixes in 64-Bit Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 402

Appendix C Differences Between Long Mode and Legacy Mode. . . . . . . . . . . . . . . . . . . .403

Contents vii

24594—Rev. 3.14—September 2007 AMD64 Technology

Appendix D Instruction Subsets and CPUID Feature Sets. . . . . . . . . . . . . . . . . . . . . . . . .405

D.1 Instruction Subsets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 405

D.2 CPUID Feature Sets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 407

D.3 Instruction List. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 409

Appendix E Instruction Effects on RFLAGS. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .435

Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 439

viii Contents

AMD64 Technology 24594—Rev. 3.14—September 2007

Figures ix

24594—Rev. 3.14—September 2007 AMD64 Technology

Figures

Figure 1-1. Instruction Byte-Order . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1

Figure 1-2. Little-Endian Byte-Order of Instruction Stored in Memory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2

Figure 1-3. Encoding Examples of REX-Prefix R, X, and B Bits. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15

Figure 1-4. ModRM-Byte Format . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18

Figure 1-5. SIB-Byte Format. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18

Figure 2-1. Format of Instruction-Detail Pages. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23

Figure 2-2. General Registers in Legacy and Compatibility Modes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

Figure 2-3. General Registers in 64-Bit Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25

Figure 2-4. Segment Registers. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26

Figure 2-5. General-Purpose Data Types . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27

Figure 2-6. System Registers. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28

Figure 2-7. System Data Structures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29

Figure 2-8. 128-Bit Media Registers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30

Figure 2-9. 128-Bit Media Data Types . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31

Figure 2-10. 64-Bit Media Registers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32

Figure 2-11. 64-Bit Media Data Types . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33

Figure 2-12. x87 Registers. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34

Figure 2-13. x87 Data Types . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35

Figure 2-14. Syntax for Typical Two-Operand Instruction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37

Figure 3-1. MOVD Instruction Operation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 160

Figure A-1. ModRM-Byte Fields . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 348

Figure A-2. ModRM-Byte Format . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 364

Figure A-3. SIB Byte Format . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 370

Figure D-1. Instruction Subsets vs. CPUID Feature Sets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 406

xFigures

AMD64 Technology 24594—Rev. 3.14—September 2007

Tables xi

24594—Rev. 3.14—September 2007 AMD64 Technology

Tables

Table 1-1. Legacy Instruction Prefixes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4

Table 1-2. Operand-Size Overrides . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

Table 1-3. Address-Size Overrides. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6

Table 1-4. Pointer and Count Registers and the Address-Size Prefix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

Table 1-5. Segment-Override Prefixes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

Table 1-6. REP Prefix Opcodes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9

Table 1-7. REPE and REPZ Prefix Opcodes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10

Table 1-8. REPNE and REPNZ Prefix Opcodes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .11

Table 1-9. REX Instruction Prefixes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12

Table 1-10. Instructions Not Requiring REX Size Prefix in 64-Bit Mode . . . . . . . . . . . . . . . . . . . . . . . . . . 12

Table 1-11. REX Prefix-Byte Fields . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13

Table 1-12. Special REX Encodings for Registers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16

Table 1-13. Encoding for RIP-Relative Addressing. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20

Table 2-1. Interrupt-Vector Source and Cause. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36

Table 2-2. +rb, +rw, +rd, and +rq Register Value . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40

Table 3-1. Instruction Support Indicated by CPUID Feature Bits . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51

Table 3-2. Processor Vendor Return Values . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104

Table 3-3. Locality References for the Prefetch Instructions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 195

Table A-1. One-Byte Opcodes, Low Nibble 0–7h . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 341

Table A-2. One-Byte Opcodes, Low Nibble 8–Fh . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 342

Table A-3. Second Byte of Two-Byte Opcodes, Low Nibble 0–7h . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 343

Table A-4. Second Byte of Two-Byte Opcodes, Low Nibble 8–Fh . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 345

Table A-5. rFLAGS Condition Codes for CMOVcc, Jcc, and SETcc . . . . . . . . . . . . . . . . . . . . . . . . . . . . 348

Table A-6. One-Byte and Two-Byte Opcode ModRM Extensions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 349

Table A-7. Opcode 0F 01 and 0F AE ModRM Extensions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 351

Table A-8. Immediate Byte for 3DNow!™ Opcodes, Low Nibble 0–7h . . . . . . . . . . . . . . . . . . . . . . . . . . 352

Table A-9. Immediate Byte for 3DNow!™ Opcodes, Low Nibble 8–Fh. . . . . . . . . . . . . . . . . . . . . . . . . . 353

Table A-10. x87 Opcodes and ModRM Extensions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 355

Table A-11. rFLAGS Condition Codes for FCMOVcc . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 363

Table A-12. ModRM Register References, 16-Bit Addressing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 364

Table A-13. ModRM Memory References, 16-Bit Addressing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 365

Table A-14. ModRM Register References, 32-Bit and 64-Bit Addressing . . . . . . . . . . . . . . . . . . . . . . . . . 367

Table A-15. ModRM Memory References, 32-Bit and 64-Bit Addressing . . . . . . . . . . . . . . . . . . . . . . . . . 368

Table A-16. SIB base Field References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 370

xii Tables

AMD64 Technology 24594—Rev. 3.14—September 2007

Table A-17. SIB Memory References. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 371

Table B-1. Operations and Operands in 64-Bit Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 374

Table B-2. Invalid Instructions in 64-Bit Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 399

Table B-3. Reassigned Instructions in 64-Bit Mode. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 400

Table B-4. Invalid Instructions in Long Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 400

Table B-5. Instructions Defaulting to 64-Bit Operand Size . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 400

Table C-1. Differences Between Long Mode and Legacy Mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 403

Table D-1. Instruction Subsets and CPUID Feature Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 409

Table E-1. Instruction Effects on RFLAGS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 435

Revision History xiii

24594—Rev. 3.14—September 2007 AMD64 Technology

Revision History

Date Revision Description

September

2007 3.14 Added minor clarifications and corrected typographical and

formatting errors.

July 2007 3.13

Added the following instructions: “LZCNT” on page 153, “POPCNT”

on page 188, “MONITOR” on page 284, and “MWAIT” on page 290.

Reformatted information on instruction support indicated by CPUID

feature bits into Table 3-1.

Added minor clarifications and corrected typographical and

formatting errors.

September

2006 3.12 Added minor clarifications and corrected typographical and

formatting errors.

December

2005 3.11 Added SVM instructions; added PAUSE instructions; made factual

changes.

January

2005 3.10

Clarified CPUID information in exception tables on instruction pages.

Added information under “CPUID” on page 103. Made numerous

small corrections.

September

2003 3.09

Corrected table of valid descriptor types for LAR and LSL instructions

and made several minor formatting, stylistic and factual corrections.

Clarified several technical definitions.

April 2003 3.08

Corrected description of the operation of flags for RCL, RCR, ROL,

and ROR instructions. Clarified description of the MOVSXD and

IMUL instructions. Corrected operand specification for the STOS

instruction. Corrected opcode of SETcc, Jcc, instructions. Added

thermal control and thermal monitoring bits to CPUID instruction.

Corrected exception tables for POPF, SFENCE, SUB, XLAT, IRET,

LSL, MOV(CRn), SGDT/SIDT, SMSW, and STI instructions.

Corrected many small typos and incorporated branding terminology.

xiv Revision History

AMD64 Technology 24594—Rev. 3.14—September 2007

Preface xv

24594—Rev. 3.14—September 2007 AMD64 Technology

Preface

About This Book

This book is part of a multivolume work entitled the AMD64 Architecture Programmer’s Manual. This

table lists each volume and its order number.

Audience

This volume (Volume 3) is intended for all programmers writing application or system software for a

processor that implements the AMD64 architecture. Descriptions of general-purpose instructions

assume an understanding of the application-level programming topics described in Volume 1.

Descriptions of system instructions assume an understanding of the system-level programming topics

described in Volume 2.

Organization

Volumes 3, 4, and 5 describe the AMD64 architecture’s instruction set in detail. Together, they cover

each instruction’s mnemonic syntax, opcodes, functions, affected flags, and possible exceptions.

The AMD64 instruction set is divided into five subsets:

•General-purpose instructions

•System instructions

•128-bit media instructions

•64-bit media instructions

•x87 floating-point instructions

Several instructions belong to—and are described identically in—multiple instruction subsets.

This volume describes the general-purpose and system instructions. The index at the end cross-

references topics within this volume. For other topics relating to the AMD64 architecture, and for

Title Order No.

Volume 1: Application Programming 24592

Volume 2: System Programming 24593

Volume 3: General-Purpose and System Instructions 24594

Volume 4: 128-Bit Media Instructions 26568

Volume 5: 64-Bit Media and x87 Floating-Point Instructions 26569

xvi Preface

AMD64 Technology 24594—Rev. 3.14—September 2007

information on instructions in other subsets, see the tables of contents and indexes of the other

volumes.

Definitions

Many of the following definitions assume an in-depth knowledge of the legacy x86 architecture. See

“Related Documents” on page xxvi for descriptions of the legacy x86 architecture.

Terms and Notation

In addition to the notation described below, “Opcode-Syntax Notation” on page 339 describes notation

relating specifically to opcodes.

1011b

A binary value—in this example, a 4-bit value.

F0EAh

A hexadecimal value—in this example a 2-byte value.

[1,2)

A range that includes the left-most value (in this case, 1) but excludes the right-most value (in this

case, 2).

7–4

A bit range, from bit 7 to 4, inclusive. The high-order bit is shown first.

128-bit media instructions

Instructions that use the 128-bit XMM registers. These are a combination of the SSE and SSE2

instruction sets.

64-bit media instructions

Instructions that use the 64-bit MMX registers. These are primarily a combination of MMX™ and

3DNow!™ instruction sets, with some additional instructions from the SSE and SSE2 instruction

sets.

16-bit mode

Legacy mode or compatibility mode in which a 16-bit address size is active. See legacy mode and

compatibility mode.

32-bit mode

Legacy mode or compatibility mode in which a 32-bit address size is active. See legacy mode and

compatibility mode.

Preface xvii

24594—Rev. 3.14—September 2007 AMD64 Technology

64-bit mode

A submode of long mode. In 64-bit mode, the default address size is 64 bits and new features, such

as register extensions, are supported for system and application software.

#GP(0)

Notation indicating a general-protection exception (#GP) with error code of 0.

absolute

Said of a displacement that references the base of a code segment rather than an instruction pointer.

Contrast with relative.

biased exponent

The sum of a floating-point value’s exponent and a constant bias for a particular floating-point data

type. The bias makes the range of the biased exponent always positive, which allows reciprocation

without overflow.

byte

Eight bits.

clear

To write a bit value of 0. Compare set.

compatibility mode

A submode of long mode. In compatibility mode, the default address size is 32 bits, and legacy 16-

bit and 32-bit applications run without modification.

commit

To irreversibly write, in program order, an instruction’s result to software-visible storage, such as a

CPL

Current privilege level.

CR0–CR4

A register range, from register CR0 through CR4, inclusive, with the low-order register first.

CR0.PE = 1

Notation indicating that the PE bit of the CR0 register has a value of 1.

direct

Referencing a memory location whose address is included in the instruction’s syntax as an

immediate operand. The address may be an absolute or relative address. Compare indirect.

dirty data

Data held in the processor’s caches or internal buffers that is more recent than the copy held in

main memory.

xviii Preface

AMD64 Technology 24594—Rev. 3.14—September 2007

displacement

A signed value that is added to the base of a segment (absolute addressing) or an instruction pointer

(relative addressing). Same as offset.

doubleword

Two words, or four bytes, or 32 bits.

double quadword

Eight words, or 16 bytes, or 128 bits. Also called octword.

DS:rSI

The contents of a memory location whose segment address is in the DS register and whose offset

relative to that segment is in the rSI register.

EFER.LME = 0

Notation indicating that the LME bit of the EFER register has a value of 0.

effective address size

The address size for the current instruction after accounting for the default address size and any

address-size override prefix.

effective operand size

The operand size for the current instruction after accounting for the default operand size and any

operand-size override prefix.

element

See vector.

exception

An abnormal condition that occurs as the result of executing an instruction. The processor’s

response to an exception depends on the type of the exception. For all exceptions except 128-bit

media SIMD floating-point exceptions and x87 floating-point exceptions, control is transferred to

the handler (or service routine) for that exception, as defined by the exception’s vector. For

floating-point exceptions defined by the IEEE 754 standard, there are both masked and unmasked

responses. When unmasked, the exception handler is called, and when masked, a default response

is provided instead of calling the handler.

FF /0

Notation indicating that FF is the first byte of an opcode, and a subopcode in the ModR/M byte has

a value of 0.

flush

An often ambiguous term meaning (1) writeback, if modified, and invalidate, as in “flush the cache

line,” or (2) invalidate, as in “flush the pipeline,” or (3) change a value, as in “flush to zero.”

Preface xix

24594—Rev. 3.14—September 2007 AMD64 Technology

GDT

Global descriptor table.

IDT

Interrupt descriptor table.

IGN

Ignore. Field is ignored.

indirect

Referencing a memory location whose address is in a register or other memory location. The

address may be an absolute or relative address. Compare direct.

IRB

The virtual-8086 mode interrupt-redirection bitmap.

IST

The long-mode interrupt-stack table.

IVT

The real-address mode interrupt-vector table.

LDT

Local descriptor table.

legacy x86

The legacy x86 architecture. See “Related Documents” on page xxvi for descriptions of the legacy

x86 architecture.

legacy mode

An operating mode of the AMD64 architecture in which existing 16-bit and 32-bit applications and

operating systems run without modification. A processor implementation of the AMD64

architecture can run in either long mode or legacy mode. Legacy mode has three submodes, real

mode, protected mode, and virtual-8086 mode.

long mode

An operating mode unique to the AMD64 architecture. A processor implementation of the

AMD64 architecture can run in either long mode or legacy mode. Long mode has two submodes,

64-bit mode and compatibility mode.

lsb

Least-significant bit.

LSB

Least-significant byte.

xx Preface

AMD64 Technology 24594—Rev. 3.14—September 2007

main memory

Physical memory, such as RAM and ROM (but not cache memory) that is installed in a particular

computer system.

mask

(1) A control bit that prevents the occurrence of a floating-point exception from invoking an

exception-handling routine. (2) A field of bits used for a control purpose.

MBZ

Must be zero. If software attempts to set an MBZ bit to 1, a general-protection exception (#GP)

occurs.

memory

Unless otherwise specified, main memory.

ModRM

A byte following an instruction opcode that specifies address calculation based on mode (Mod),

moffset

A 16, 32, or 64-bit offset that specifies a memory operand directly, without using a ModRM or SIB

byte.

msb

Most-significant bit.

MSB

Most-significant byte.

multimedia instructions

A combination of 128-bit media instructions and 64-bit media instructions.

octword

Same as double quadword.

offset

Same as displacement.

overflow

The condition in which a floating-point number is larger in magnitude than the largest, finite,

positive or negative number that can be represented in the data-type format being used.

packed

See vector.

Preface xxi

24594—Rev. 3.14—September 2007 AMD64 Technology

PAE

Physical-address extensions.

physical memory

Actual memory, consisting of main memory and cache.

probe

A check for an address in a processor’s caches or internal buffers. External probes originate

outside the processor, and internal probes originate within the processor.

protected mode

A submode of legacy mode.

quadword

Four words, or eight bytes, or 64 bits.

RAZ

Read as zero (0), regardless of what is written.

real-address mode

See real mode.

real mode

A short name for real-address mode, a submode of legacy mode.

relative

Referencing with a displacement (also called offset) from an instruction pointer rather than the

base of a code segment. Contrast with absolute.

reserved

Fields marked as reserved may be used at some future time.

To preserve compatibility with future processors, reserved fields require special handling when

read or written by software.

Reserved fields may be further qualified as MBZ, RAZ, SBZ or IGN (see definitions).

Software must not depend on the state of a reserved field, nor upon the ability of such fields to

return to a previously written state.

If a reserved field is not marked with one of the above qualifiers, software must not change the state

of that field; it must reload that field with the same values returned from a prior read.

REX

An instruction prefix that specifies a 64-bit operand size and provides access to additional

registers.

RIP-relative addressing

Addressing relative to the 64-bit RIP instruction pointer.

xxii Preface

AMD64 Technology 24594—Rev. 3.14—September 2007

set

To write a bit value of 1. Compare clear.

SIB

A byte following an instruction opcode that specifies address calculation based on scale (S), index

(I), and base (B).

SIMD

Single instruction, multiple data. See vector.

SSE

Streaming SIMD extensions instruction set. See 128-bit media instructions and 64-bit media

instructions.

SSE2

Extensions to the SSE instruction set. See 128-bit media instructions and 64-bit media

instructions.

SSE3

Further extensions to the SSE instruction set. See 128-bit media instructions.

sticky bit

A bit that is set or cleared by hardware and that remains in that state until explicitly changed by

software.

TOP

The x87 top-of-stack pointer.

TPR

Task-priority register (CR8).

TSS

Task-state segment.

underflow

The condition in which a floating-point number is smaller in magnitude than the smallest nonzero,

positive or negative number that can be represented in the data-type format being used.

vector

(1) A set of integer or floating-point values, called elements, that are packed into a single operand.

Most of the 128-bit and 64-bit media instructions use vectors as operands. Vectors are also called

packed or SIMD (single-instruction multiple-data) operands.

(2) An index into an interrupt descriptor table (IDT), used to access exception handlers. Compare

exception.

Preface xxiii

24594—Rev. 3.14—September 2007 AMD64 Technology

virtual-8086 mode

A submode of legacy mode.

word

Two bytes, or 16 bits.

x86

See legacy x86.

Registers

In the following list of registers, the names are used to refer either to a given register or to the contents

of that register:

AH–DH

The high 8-bit AH, BH, CH, and DH registers. Compare AL–DL.

AL–DL

The low 8-bit AL, BL, CL, and DL registers. Compare AH–DH.

AL–r15B

The low 8-bit AL, BL, CL, DL, SIL, DIL, BPL, SPL, and R8B–R15B registers, available in 64-bit

mode.

Base pointer register.

CRn

Control register number n.

Code segment register.

eAX–eSP

The 16-bit AX, BX, CX, DX, DI, SI, BP, and SP registers or the 32-bit EAX, EBX, ECX, EDX,

EDI, ESI, EBP, and ESP registers. Compare rAX–rSP.

EFER

Extended features enable register.

eFLAGS

16-bit or 32-bit flags register. Compare rFLAGS.

EFLAGS

32-bit (extended) flags register.

xxiv Preface

AMD64 Technology 24594—Rev. 3.14—September 2007

eIP

16-bit or 32-bit instruction-pointer register. Compare rIP.

EIP

32-bit (extended) instruction-pointer register.

FLAGS

16-bit flags register.

GDTR

Global descriptor table register.

GPRs

General-purpose registers. For the 16-bit data size, these are AX, BX, CX, DX, DI, SI, BP, and SP.

For the 32-bit data size, these are EAX, EBX, ECX, EDX, EDI, ESI, EBP, and ESP. For the 64-bit

data size, these include RAX, RBX, RCX, RDX, RDI, RSI, RBP, RSP, and R8–R15.

IDTR

Interrupt descriptor table register.

16-bit instruction-pointer register.

LDTR

Local descriptor table register.

MSR

Model-specific register.

r8–r15

The 8-bit R8B–R15B registers, or the 16-bit R8W–R15W registers, or the 32-bit R8D–R15D

registers, or the 64-bit R8–R15 registers.

rAX–rSP

The 16-bit AX, BX, CX, DX, DI, SI, BP, and SP registers, or the 32-bit EAX, EBX, ECX, EDX,

EDI, ESI, EBP, and ESP registers, or the 64-bit RAX, RBX, RCX, RDX, RDI, RSI, RBP, and RSP

registers. Replace the placeholder r with nothing for 16-bit size, “E” for 32-bit size, or “R” for 64-

bit size.

RAX

64-bit version of the EAX register.

RBP

64-bit version of the EBP register.

Preface xxv

24594—Rev. 3.14—September 2007 AMD64 Technology

RBX

64-bit version of the EBX register.

RCX

64-bit version of the ECX register.

RDI

64-bit version of the EDI register.

RDX

64-bit version of the EDX register.

rFLAGS

16-bit, 32-bit, or 64-bit flags register. Compare RFLAGS.

RFLAGS

64-bit flags register. Compare rFLAGS.

rIP

16-bit, 32-bit, or 64-bit instruction-pointer register. Compare RIP.

RIP

64-bit instruction-pointer register.

RSI

64-bit version of the ESI register.

RSP

64-bit version of the ESP register.

Stack pointer register.

Stack segment register.

TPR

Task priority register, a new register introduced in the AMD64 architecture to speed interrupt

management.

Task register.

xxvi Preface

AMD64 Technology 24594—Rev. 3.14—September 2007

Endian Order

The x86 and AMD64 architectures address memory using little-endian byte-ordering. Multibyte

values are stored with their least-significant byte at the lowest byte address, and they are illustrated

with their least significant byte at the right side. Strings are illustrated in reverse order, because the

addresses of their bytes increase from right to left.

Related Documents

•Peter Abel, IBM PC Assembly Language and Programming, Prentice-Hall, Englewood Cliffs, NJ,

1995.

•Rakesh Agarwal, 80x86 Architecture & Programming: Volume II, Prentice-Hall, Englewood

Cliffs, NJ, 1991.

•AMD, AMD-K6™ MMX™ Enhanced Processor Multimedia Technology, Sunnyvale, CA, 2000.

•AMD, 3DNow!™ Technology Manual, Sunnyvale, CA, 2000.

•AMD, AMD Extensions to the 3DNow!™ and MMX™ Instruction Sets, Sunnyvale, CA, 2000.

•Don Anderson and Tom Shanley, Pentium Processor System Architecture, Addison-Wesley, New

York, 1995.

•Nabajyoti Barkakati and Randall Hyde, Microsoft Macro Assembler Bible, Sams, Carmel, Indiana,

1992.

•Barry B. Brey, 8086/8088, 80286, 80386, and 80486 Assembly Language Programming,

Macmillan Publishing Co., New York, 1994.

•Barry B. Brey, Programming the 80286, 80386, 80486, and Pentium Based Personal Computer,

Prentice-Hall, Englewood Cliffs, NJ, 1995.

•Ralf Brown and Jim Kyle, PC Interrupts, Addison-Wesley, New York, 1994.

•Penn Brumm and Don Brumm, 80386/80486 Assembly Language Programming, Windcrest

McGraw-Hill, 1993.

•Geoff Chappell, DOS Internals, Addison-Wesley, New York, 1994.

•Chips and Technologies, Inc. Super386 DX Programmer’s Reference Manual, Chips and

Technologies, Inc., San Jose, 1992.

•John Crawford and Patrick Gelsinger, Programming the 80386, Sybex, San Francisco, 1987.

•Cyrix Corporation, 5x86 Processor BIOS Writer's Guide, Cyrix Corporation, Richardson, TX,

1995.

•Cyrix Corporation, M1 Processor Data Book, Cyrix Corporation, Richardson, TX, 1996.

•Cyrix Corporation, MX Processor MMX Extension Opcode Table, Cyrix Corporation, Richardson,

TX, 1996.

•Cyrix Corporation, MX Processor Data Book, Cyrix Corporation, Richardson, TX, 1997.

•Ray Duncan, Extending DOS: A Programmer's Guide to Protected-Mode DOS, Addison Wesley,

NY, 1991.

Preface xxvii

24594—Rev. 3.14—September 2007 AMD64 Technology

•William B. Giles, Assembly Language Programming for the Intel 80xxx Family, Macmillan, New

York, 1991.

•Frank van Gilluwe, The Undocumented PC, Addison-Wesley, New York, 1994.

•John L. Hennessy and David A. Patterson, Computer Architecture, Morgan Kaufmann Publishers,

San Mateo, CA, 1996.

•Thom Hogan, The Programmer’s PC Sourcebook, Microsoft Press, Redmond, WA, 1991.

•Hal Katircioglu, Inside the 486, Pentium, and Pentium Pro, Peer-to-Peer Communications, Menlo

Park, CA, 1997.

•IBM Corporation, 486SLC Microprocessor Data Sheet, IBM Corporation, Essex Junction, VT,

1993.

•IBM Corporation, 486SLC2 Microprocessor Data Sheet, IBM Corporation, Essex Junction, VT,

1993.

•IBM Corporation, 80486DX2 Processor Floating Point Instructions, IBM Corporation, Essex

Junction, VT, 1995.

•IBM Corporation, 80486DX2 Processor BIOS Writer's Guide, IBM Corporation, Essex Junction,

VT, 1995.

•IBM Corporation, Blue Lightning 486DX2 Data Book, IBM Corporation, Essex Junction, VT,

1994.

•Institute of Electrical and Electronics Engineers, IEEE Standard for Binary Floating-Point

Arithmetic, ANSI/IEEE Std 754-1985.

•Institute of Electrical and Electronics Engineers, IEEE Standard for Radix-Independent Floating-

Point Arithmetic, ANSI/IEEE Std 854-1987.

•Muhammad Ali Mazidi and Janice Gillispie Mazidi, 80X86 IBM PC and Compatible Computers,

Prentice-Hall, Englewood Cliffs, NJ, 1997.

•Hans-Peter Messmer, The Indispensable Pentium Book, Addison-Wesley, New York, 1995.

•Karen Miller, An Assembly Language Introduction to Computer Architecture: Using the Intel

Pentium, Oxford University Press, New York, 1999.

•Stephen Morse, Eric Isaacson, and Douglas Albert, The 80386/387 Architecture, John Wiley &

Sons, New York, 1987.

•NexGen Inc., Nx586 Processor Data Book, NexGen Inc., Milpitas, CA, 1993.

•NexGen Inc., Nx686 Processor Data Book, NexGen Inc., Milpitas, CA, 1994.

•Bipin Patwardhan, Introduction to the Streaming SIMD Extensions in the Pentium III,

www.x86.org/articles/sse_pt1/ simd1.htm, June, 2000.

•Peter Norton, Peter Aitken, and Richard Wilton, PC Programmer’s Bible, Microsoft Press,

Redmond, WA, 1993.

•PharLap 386|ASM Reference Manual, Pharlap, Cambridge MA, 1993.

•PharLap TNT DOS-Extender Reference Manual, Pharlap, Cambridge MA, 1995.

xxviii Preface

AMD64 Technology 24594—Rev. 3.14—September 2007

•Sen-Cuo Ro and Sheau-Chuen Her, i386/i486 Advanced Programming, Van Nostrand Reinhold,

New York, 1993.

•Jeffrey P. Royer, Introduction to Protected Mode Programming, course materials for an onsite

class, 1992.

•Tom Shanley, Protected Mode System Architecture, Addison Wesley, NY, 1996.

•SGS-Thomson Corporation, 80486DX Processor SMM Programming Manual, SGS-Thomson

Corporation, 1995.

•Walter A. Triebel, The 80386DX Microprocessor, Prentice-Hall, Englewood Cliffs, NJ, 1992.

•John Wharton, The Complete x86, MicroDesign Resources, Sebastopol, California, 1994.

•Web sites and newsgroups:

- www.amd.com

- news.comp.arch

- news.comp.lang.asm.x86

- news.intel.microprocessors

- news.microsoft

Instruction Formats 1

24594—Rev. 3.14—September 2007 AMD64 Technology

1 Instruction Formats

The format of an instruction encodes its operation, as well as the locations of the instruction’s initial

operands and the result of the operation. This section describes the general format and parameters used

by all instructions. For information on the specific format(s) for each instruction, see:

•Chapter 3, “General-Purpose Instruction Reference.”

•Chapter 4, “System Instruction Reference.”

•“128-Bit Media Instruction Reference” in Volume 4.

•“64-Bit Media Instruction Reference” in Volume 5.

•“x87 Floating-Point Instruction Reference” in Volume 5.

1.1 Instruction Byte Order

An instruction can be between one and 15 bytes in length. Figure 1-1 shows the byte order of the

instruction format.

Figure 1-1. Instruction Byte-Order

Instructions are stored in memory in little-endian order. The least-significant byte of an instruction is

stored at its lowest memory address, as shown in Figure 1-2 on page 2.

Legacy

Prefix

REX

Prefix

Opcode

(1 or 2 bytes) ModRM SIB

Displacement

(1, 2, 4, or 8 bytes)

Immediate

(1, 2, 4, or 8 bytes)

Instruction Length ≤ 15 Bytes

2Instruction Formats

AMD64 Technology 24594—Rev. 3.14—September 2007

Figure 1-2. Little-Endian Byte-Order of Instruction Stored in Memory

The basic operation of an instruction is specified by an opcode. The opcode is one or two bytes long, as

described in “Opcode” on page 17. An opcode can be preceded by any number of legacy prefixes.

These prefixes can be classified as belonging to any of the five groups of prefixes described in

“Instruction Prefixes” on page 3. The legacy prefixes modify an instruction’s default address size,

operand size, or segment, or they invoke a special function such as modification of the opcode, atomic

bus-locking, or repetition. The REX prefix can be used in 64-bit mode to access the register extensions

illustrated in “Application-Programming Register Set” in Volume 1. If a REX prefix is used, it must

immediately precede the first opcode byte.

An instruction’s opcode consists of one or two bytes. In several 128-bit and 64-bit media instructions,

a legacy operand-size or repeat prefix byte is used in a special-purpose way to modify the opcode. The

opcode can be followed by a mode-register-memory (ModRM) byte, which further describes the

operation and/or operands. The opcode, or the opcode and ModRM byte, can also be followed by a

scale-index-base (SIB) byte, which describes the scale, index, and base forms of memory addressing.

The ModRM and SIB bytes are described in “ModRM and SIB Bytes” on page 17, but their legacy

functions can be modified by the REX prefix (“Instruction Prefixes” on page 3).

The 15-byte instruction-length limit can only be exceeded by using redundant prefixes. If the limit is

exceeded, a general-protection exception occurs.

513-304.eps

Legacy Prefix

REX Prefix

SIB

Displacement

* optional, depending on the instruction

+ optional, with most instructions

Immediate

Opcode

ModRM

Least-significant

(lowest) address

+ (available only in 64-bit mode)

Most-significant

(highest) address

Immediate *

Displacement *

Opcode

≤ 15 Bytes

(all two-byte opcodes have 0Fh as their first byte)

Instruction Formats 3

24594—Rev. 3.14—September 2007 AMD64 Technology

1.2 Instruction Prefixes

The instruction prefixes shown in Figure 1-1 on page 1 are of two types: legacy prefixes and REX

prefixes. Each of the legacy prefixes has a unique byte value. By contrast, the REX prefixes, which

enable use of the AMD64 register extensions in 64-bit mode, are organized as a group of byte values in

which the value of the prefix indicates the combination of register-extension features to be enabled.

1.2.1 Summary of Legacy Prefixes

Table 1-1 on page 4 shows the legacy prefixes—that is, all prefixes except the REX prefixes, which are

described on page 11. The legacy prefixes are organized into five groups, as shown in the left-most

column of Table 1-1. A single instruction should include a maximum of one prefix from each of the

five groups. The legacy prefixes can appear in any order within the position shown in Figure 1-1 for

legacy prefixes. The result of using multiple prefixes from a single group is unpredictable.

Some of the restrictions on legacy prefixes are:

•Operand-Size Override—This prefix affects only general-purpose instructions and a few x87

instructions. When used with 128-bit and 64-bit media instructions, this prefix acts in a special

way to modify the opcode.

•Address-Size Override—This prefix affects only memory operands.

•Segment Override—In 64-bit mode, the CS, DS, ES, and SS segment override prefixes are ignored.

•LOCK Prefix—This prefix is allowed only with certain instructions that modify memory.

•Repeat Prefixes—These prefixes affect only certain string instructions. When used with 128-bit

and 64-bit media instructions, these prefixes act in a special way to modify the opcode.

4Instruction Formats

AMD64 Technology 24594—Rev. 3.14—September 2007

1.2.2 Operand-Size Override Prefix

The default operand size for an instruction is determined by a combination of its opcode, the D

(default) bit in the current code-segment descriptor, and the current operating mode, as shown in

Table 1-2. The operand-size override prefix (66h) selects the non-default operand size. The prefix can

Table 1-1. Legacy Instruction Prefixes

Prefix Group1Mnemonic Prefix

Byte (Hex) Description

Operand-Size Override none 662Changes the default operand size of a memory or

Address-Size Override none 673Changes the default address size of a memory operand,

as shown in Table 1-3 on page 6.

Segment Override

CS 2E4Forces use of the current CS segment for memory

operands.

DS 3E4Forces use of the current DS segment for memory

operands.

ES 264Forces use of the current ES segment for memory

operands.

FS 64 Forces use of the current FS segment for memory

operands.

GS 65 Forces use of the current GS segment for memory

operands.

SS 364Forces use of the current SS segment for memory

operands.

Lock LOCK F05Causes certain kinds of memory read-modify-write

instructions to occur atomically.

Repeat

REP

F36

Repeats a string operation (INS, MOVS, OUTS, LODS,

and STOS) until the rCX register equals 0.

REPE or

REPZ

Repeats a compare-string or scan-string operation

(CMPSx and SCASx) until the rCX register equals 0 or

the zero flag (ZF) is cleared to 0.

REPNE or

REPNZ F26

Repeats a compare-string or scan-string operation

(CMPSx and SCASx) until the rCX register equals 0 or

the zero flag (ZF) is set to 1.

Note:

1. A single instruction should include a maximum of one prefix from each of the five groups.

2. When used with 128-bit and 64-bit media instructions, this prefix acts in a special way to modify the opcode. The

prefix is ignored by 64-bit media floating-point (3DNow!™) instructions. See “Instructions that Cannot Use the Oper-

and-Size Prefix” on page 5.

3. This prefix also changes the size of the RCX register when used as an implied count register.

4. In 64-bit mode, the CS, DS, ES, and SS segment overrides are ignored.

5. The LOCK prefix should not be used for instructions other than those listed in “Lock Prefix” on page 8.

6. This prefix should be used only with compare-string and scan-string instructions. When used with 128-bit and 64-

bit media instructions, the prefix acts in a special way to modify the opcode.

Instruction Formats 5

24594—Rev. 3.14—September 2007 AMD64 Technology

be used with any general-purpose instruction that accesses non-fixed-size operands in memory or

general-purpose registers (GPRs), and it can also be used with the x87 FLDENV, FNSTENV,

FNSAVE, and FRSTOR instructions.

In 64-bit mode, the prefix allows mixing of 16-bit, 32-bit, and 64-bit data on an instruction-by-

instruction basis. In compatibility and legacy modes, the prefix allows mixing of 16-bit and 32-bit

operands on an instruction-by-instruction basis.

In 64-bit mode, most instructions default to a 32-bit operand size. For these instructions, a REX prefix

(page 13) can specify a 64-bit operand size, and a 66h prefix specifies a 16-bit operand size. The REX

prefix takes precedence over the 66h prefix. However, if an instruction defaults to a 64-bit operand

size, it does not need a REX prefix and it can only be overridden to a 16-bit operand size. It cannot be

overridden to a 32-bit operand size, because there is no 32-bit operand-size override prefix in 64-bit

mode. Two groups of instructions have a default 64-bit operand size in 64-bit mode:

•Near branches. For details, see “Near Branches in 64-Bit Mode” in Volume 1.

•All instructions, except far branches, that implicitly reference the RSP. For details, see “Stack

Operation” in Volume 1.

Instructions that Cannot Use the Operand-Size Prefix. The operand-size prefix should be used

only with general-purpose instructions and the x87 FLDENV, FNSTENV, FNSAVE, and FRSTOR

Table 1-2. Operand-Size Overrides

Operating Mode

Default

Operand

Size (Bits)

Effective

Operand

Size

(Bits)

Instruction Prefix1

66h REX.W3

Long

Mode

64-Bit

Mode 322

64 don’t care yes

32 no no

16 yes no

Compatibility

Mode

32 32 no

Not Appli-

cable

16 yes

16 32 yes

16 no

Legacy Mode

(Protected, Virtual-8086,

or Real Mode)

32 32 no

16 yes

16 32 yes

16 no

Note:

1. A “no’ indicates that the default operand size is used.

2. This is the typical default, although some instructions default to other operand

sizes. See Appendix B, “General-Purpose Instructions in 64-Bit Mode,” for details.

3. See “REX Prefixes” on page 11.

6Instruction Formats

AMD64 Technology 24594—Rev. 3.14—September 2007

instructions, in which the prefix selects between 16-bit and 32-bit operand size. The prefix is ignored

by all other x87 instructions and by 64-bit media floating-point (3DNow!™) instructions.

When used with 64-bit media integer instructions, the 66h prefix acts in a special way to modify the

opcode. This modification typically causes an access to an XMM register or 128-bit memory operand

and thereby converts the 64-bit media instruction into its comparable 128-bit media instruction. The

result of using an F2h or F3h repeat prefix along with a 66h prefix in 128-bit or 64-bit media

instructions is unpredictable.

Operand-Size and REX Prefixes. The REX operand-size prefix takes precedence over the 66h

prefix. See “REX.W: Operand Width” on page 13 for details.

1.2.3 Address-Size Override Prefix

The default address size for instructions that access non-stack memory is determined by the current

operating mode, as shown in Table 1-3. The address-size override prefix (67h) selects the non-default

address size. Depending on the operating mode, this prefix allows mixing of 16-bit and 32-bit, or of

32-bit and 64-bit addresses, on an instruction-by-instruction basis. The prefix changes the address size

for memory operands. It also changes the size of the RCX register for instructions that use RCX

implicitly.

For instructions that implicitly access the stack segment (SS), the address size for stack accesses is

determined by the D (default) bit in the stack-segment descriptor. In 64-bit mode, the D bit is ignored,

and all stack references have a 64-bit address size. However, if an instruction accesses both stack and

non-stack memory, the address size of the non-stack access is determined as shown in Table 1-3.

Table 1-3. Address-Size Overrides

Operating Mode

Default

Address

Size (Bits)

Effective

Address Size

(Bits)

Address-

Size Prefix

(67h)1

Required?

Long Mode

64-Bit

Mode 64 64 no

32 yes

Compatibility

Mode

32 32 no

16 yes

16 32 yes

16 no

Legacy Mode

(Protected, Virtual-8086, or Real

Mode)

32 32 no

16 yes

16 32 yes

16 no

Note:

1. A “no” indicates that the default address size is used.

Instruction Formats 7

24594—Rev. 3.14—September 2007 AMD64 Technology

As Table 1-3 shows, the default address size is 64 bits in 64-bit mode. The size can be overridden to 32

bits, but 16-bit addresses are not supported in 64-bit mode. In compatibility and legacy modes, the

default address size is 16 bits or 32 bits, depending on the operating mode (see “Processor

Initialization and Long Mode Activation” in Volume 2 for details). In these modes, the address-size

prefix selects the non-default size, but the 64-bit address size is not available.

Certain instructions reference pointer registers or count registers implicitly, rather than explicitly. In

such instructions, the address-size prefix affects the size of such addressing and count registers, just as

it does when such registers are explicitly referenced. Table 1-4 lists all such instructions and the

registers referenced using the three possible address sizes.

Table 1-4. Pointer and Count Registers and the Address-Size Prefix

Instruction

Pointer or Count Register

16-Bit

Address Size

32-Bit

Address Size

64-Bit

Address Size

CMPS, CMPSB, CMPSW,

CMPSD, CMPSQ—Compare

Strings

SI, DI, CX ESI, EDI, ECX RSI, RDI, RCX

INS, INSB, INSW, INSD—

Input String DI, CX EDI, ECX RDI, RCX

JCXZ, JECXZ, JRCXZ—Jump

on CX/ECX/RCX Zero CX ECX RCX

LODS, LODSB, LODSW,

LODSD, LODSQ—Load

String

SI, CX ESI, ECX RSI, RCX

LOOP, LOOPE, LOOPNZ,

LOOPNE, LOOPZ—Loop CX ECX RCX

MOVS, MOVSB, MOVSW,

MOVSD, MOVSQ—Move

String

SI, DI, CX ESI, EDI, ECX RSI, RDI, RCX

OUTS, OUTSB, OUTSW,

OUTSD—Output String SI, CX ESI, ECX RSI, RCX

REP, REPE, REPNE, REPNZ,

REPZ—Repeat Prefixes CX ECX RCX

SCAS, SCASB, SCASW,

SCASD, SCASQ—Scan

String

DI, CX EDI, ECX RDI, RCX

STOS, STOSB, STOSW,

STOSD, STOSQ—Store

String

DI, CX EDI, ECX RDI, RCX

XLAT, XLATB—Table Look-up

Translation BX EBX RBX

8Instruction Formats

AMD64 Technology 24594—Rev. 3.14—September 2007

1.2.4 Segment-Override Prefixes

Segment overrides can be used only with instructions that reference non-stack memory. Most

instructions that reference memory are encoded with a ModRM byte (page 17). The default segment

for such memory-referencing instructions is implied by the base register indicated in its ModRM byte,

as follows:

•Instructions that Reference a Non-Stack Segment—If an instruction encoding references any base

is the data segment (DS). These instructions can use the segment-override prefix to select one of

the non-default segments, as shown in Table 1-5.

•String Instructions—String instructions reference two memory operands. By default, they

reference both the DS and ES segments (DS:rSI and ES:rDI). These instructions can override their

DS-segment reference, as shown in Table 1-5, but they cannot override their ES-segment

reference.

•Instructions that Reference the Stack Segment—If an instruction’s encoding references the rBP or

rSP base register, the default segment is the stack segment (SS). All instructions that reference the

stack (push, pop, call, interrupt, return from interrupt) use SS by default. These instructions cannot

use the segment-override prefix.

Segment Overrides in 64-Bit Mode. In 64-bit mode, the CS, DS, ES, and SS segment-override

prefixes have no effect. These four prefixes are not treated as segment-override prefixes for the

purposes of multiple-prefix rules. Instead, they are treated as null prefixes.

The FS and GS segment-override prefixes are treated as true segment-override prefixes in 64-bit mode.

Use of the FS or GS prefix causes their respective segment bases to be added to the effective address

calculation. See “FS and GS Registers in 64-Bit Mode” in Volume 2 for details.

1.2.5 Lock Prefix

The LOCK prefix causes certain kinds of memory read-modify-write instructions to occur atomically.

The mechanism for doing so is implementation-dependent (for example, the mechanism may involve

Table 1-5. Segment-Override Prefixes

Mnemonic Prefix Byte

(Hex) Description

CS12E Forces use of current CS segment for memory operands.

DS13E Forces use of current DS segment for memory operands.

ES126 Forces use of current ES segment for memory operands.

FS 64 Forces use of current FS segment for memory operands.

GS 65 Forces use of current GS segment for memory operands.

SS136 Forces use of current SS segment for memory operands.

Note:

1. In 64-bit mode, the CS, DS, ES, and SS segment overrides are ignored.

Instruction Formats 9

24594—Rev. 3.14—September 2007 AMD64 Technology

bus signaling or packet messaging between the processor and a memory controller). The prefix is

intended to give the processor exclusive use of shared memory in a multiprocessor system.

The LOCK prefix can only be used with forms of the following instructions that write a memory

operand: ADC, ADD, AND, BTC, BTR, BTS, CMPXCHG, CMPXCHG8B, CMPXCHG16B, DEC,

INC, NEG, NOT, OR, SBB, SUB, XADD, XCHG, and XOR. An invalid-opcode exception occurs if

the LOCK prefix is used with any other instruction.

1.2.6 Repeat Prefixes

The repeat prefixes cause repetition of certain instructions that load, store, move, input, or output

strings. The prefixes should only be used with such string instructions. Two pairs of repeat prefixes,

REPE/REPZ and REPNE/REPNZ, perform the same repeat functions for certain compare-string and

scan-string instructions. The repeat function uses rCX as a count register. The size of rCX is based on

address size, as shown in Table 1-4 on page 7.

REP. The REP prefix repeats its associated string instruction the number of times specified in the

counter register (rCX). It terminates the repetition when the value in rCX reaches 0. The prefix can be

used with the INS, LODS, MOVS, OUTS, and STOS instructions. Table 1-6 shows the valid REP

prefix opcodes.

Table 1-6. REP Prefix Opcodes

Mnemonic Opcode

REP INS reg/mem8, DX

REP INSB F3 6C

REP INS reg/mem16/32, DX

REP INSW

REP INSD

F3 6D

REP LODS mem8

REP LODSB F3 AC

REP LODS mem16/32/64

REP LODSW

REP LODSD

REP LODSQ

F3 AD

REP MOVS mem8, mem8

REP MOVSB F3 A4

REP MOVS mem16/32/64, mem16/32/64

REP MOVSW

REP MOVSD

REP MOVSQ

F3 A5

REP OUTS DX, reg/mem8

REP OUTSB F3 6E

10 Instruction Formats

AMD64 Technology 24594—Rev. 3.14—September 2007

REPE and REPZ. REPE and REPZ are synonyms and have identical opcodes. These prefixes repeat

their associated string instruction the number of times specified in the counter register (rCX). The

repetition terminates when the value in rCX reaches 0 or when the zero flag (ZF) is cleared to 0. The

REPE and REPZ prefixes can be used with the CMPS, CMPSB, CMPSD, CMPSW, SCAS, SCASB,

SCASD, and SCASW instructions. Table 1-7 shows the valid REPE and REPZ prefix opcodes.

REPNE and REPNZ. REPNE and REPNZ are synonyms and have identical opcodes. These prefixes

repeat their associated string instruction the number of times specified in the counter register (rCX).

The repetition terminates when the value in rCX reaches 0 or when the zero flag (ZF) is set to 1. The

REPNE and REPNZ prefixes can be used with the CMPS, CMPSB, CMPSD, CMPSW, SCAS,

SCASB, SCASD, and SCASW instructions. Table 1-8 on page 11 shows the valid REPNE and

REPNZ prefix opcodes.

REP OUTS DX, reg/mem16/32

REP OUTSW

REP OUTSD

F3 6F

REP STOS mem8

REP STOSB F3 AA

REP STOS mem16/32/64

REP STOSW

REP STOSD

REP STOSQ

F3 AB

Table 1-7. REPE and REPZ Prefix Opcodes

Mnemonic Opcode

REPx CMPS mem8, mem8

REPx CMPSB F3 A6

REPx CMPS mem16/32/64, mem16/32/64

REPx CMPSW

REPx CMPSD

REPx CMPSQ

F3 A7

REPx SCAS mem8

REPx SCASB F3 AE

REPx SCAS mem16/32/64

REPx SCASW

REPx SCASD

REPx SCASQ

F3 AF

Table 1-6. REP Prefix Opcodes (continued)

Mnemonic Opcode

Instruction Formats 11

24594—Rev. 3.14—September 2007 AMD64 Technology

Instructions that Cannot Use Repeat Prefixes. In general, the repeat prefixes should only be used

in the string instructions listed in tables 1-6, 1-7, and 1-8, and in 128-bit or 64-bit media instructions.

When used in media instructions, the F2h and F3h prefixes act in a special way to modify the opcode

rather than cause a repeat operation. The result of using a 66h operand-size prefix along with an F2h or

F3h prefix in 128-bit or 64-bit media instructions is unpredictable.

Optimization of Repeats. Depending on the hardware implementation, the repeat prefixes can have a

setup overhead. If the repeated count is variable, the overhead can sometimes be avoided by substituting

a simple loop to move or store the data. Repeated string instructions can be expanded into equivalent

sequences of inline loads and stores or a sequence of stores can be used to emulate a REP STOS.

For repeated string moves, performance can be maximized by moving the largest possible operand

size. For example, use REP MOVSD rather than REP MOVSW and REP MOVSW rather than REP

MOVSB. Use REP STOSD rather than REP STOSW and REP STOSW rather than REP MOVSB.

Depending on the hardware implementation, string moves with the direction flag (DF) cleared to 0

(up) may be faster than string moves with DF set to 1 (down). DF = 1 is only needed for certain cases

of overlapping REP MOVS, such as when the source and the destination overlap.

1.2.7 REX Prefixes

REX prefixes are a group of instruction-prefix bytes that can be used only in 64-bit mode. They enable

access to the AMD64 register extensions. Figure 1-1 on page 1 and Figure 1-2 on page 2 show how a

REX prefix fits within the byte order of instructions. REX prefixes enable the following features in 64-

bit mode:

•Use of the extended GPR (Figure 2-3 on page 25) or XMM registers (Figure 2-8 on page 30).

•Use of the 64-bit operand size when accessing GPRs.

Table 1-8. REPNE and REPNZ Prefix Opcodes

Mnemonic Opcode

REPNx CMPS mem8, mem8

REPNx CMPSB F2 A6

REPNx CMPS mem16/32/64, mem16/32/64

REPNx CMPSW

REPNx CMPSD

REPNx CMPSQ

F2 A7

REPNx SCAS mem8

REPNx SCASB F2 AE

REPNx SCAS mem16/32/64

REPNx SCASW

REPNx SCASD

REPNx SCASQ

F2 AF

12 Instruction Formats

AMD64 Technology 24594—Rev. 3.14—September 2007

•Use of the extended control and debug registers, as described in “64-Bit-Mode Extended Control

Registers” in Volume 2 and “64-Bit-Mode Extended Debug Registers” in Volume 2.

•Use of the uniform byte registers (AL–R15).

Table 1-9 shows the REX prefixes. The value of a REX prefix is in the range 40h through 4Fh,

depending on the particular combination of AMD64 register extensions desired.

A REX prefix is normally required with an instruction that accesses a 64-bit GPR or one of the

extended GPR or XMM registers. Only a few instructions have an operand size that defaults to (or is

fixed at) 64 bits in 64-bit mode, and thus do not need a REX prefix. These exceptions to the normal

rule are listed in Table 1-10.

An instruction can have only one REX prefix, although the prefix can express several extension

features. If a REX prefix is used, it must immediately precede the first opcode byte in the instruction

format. Any other placement of a REX prefix, or any use of a REX prefix in an instruction that does

Table 1-9. REX Instruction Prefixes

Prefix Type Mnemonic Prefix Code

(Hex) Description

REX.W

401

through

4F1

Access an AMD64 register

extension.

REX.R

REX.X

REX.B

Note:

1. See Table 1-11 for encoding of REX prefixes.

Table 1-10. Instructions Not Requiring REX Size Prefix in 64-Bit Mode

CALL (Near) POP reg/mem

ENTER POP reg

Jcc POP FS

JrCXZ POP GS

JMP (Near) POPFQ

LEAVE PUSH imm8

LGDT PUSH imm32

LIDT PUSH reg/mem

LLDT PUSH reg

LOOP PUSH FS

LOOPcc PUSH GS

LTR PUSHFQ

MOV CR(n)RET (Near)

MOV DR(n)

Instruction Formats 13

24594—Rev. 3.14—September 2007 AMD64 Technology

not access an extended register, is ignored. The legacy instruction-size limit of 15 bytes still applies to

instructions that contain a REX prefix.

REX prefixes are a set of sixteen values that span one row of the main opcode map and occupy entries

40h through 4Fh. Table 1-11 and Figure 1-3 on page 15 show the prefix fields and their uses.

REX.W: Operand Width. Setting the REX.W bit to 1 specifies a 64-bit operand size. Like the

existing 66h operand-size prefix, the REX 64-bit operand-size override has no effect on byte

operations. For non-byte operations, the REX operand-size override takes precedence over the 66h

prefix. If a 66h prefix is used together with a REX prefix that has the REX.W bit set to 1, the 66h

prefix is ignored. However, if a 66h prefix is used together with a REX prefix that has the REX.W bit

cleared to 0, the 66h prefix is not ignored and the operand size becomes 16 bits.

REX.R: Register. The REX.R bit adds a 1-bit (high) extension to the ModRM reg field (page 17)

when that field encodes a GPR, XMM, control, or debug register. REX.R does not modify ModRM reg

when that field specifies other registers or opcodes. REX.R is ignored in such cases.

REX.X: Index. The REX.X bit adds a 1-bit (high) extension to the SIB index field (page 17).

REX.B: Base. The REX.B bit adds a 1-bit (high) extension to either the ModRM r/m field to specify

a GPR or XMM register, or to the SIB base field to specify a GPR. (See Table 2-2 on page 40 for more

about the REX.B bit.)

Encoding Examples. Figure 1-3 on page 15 shows four examples of how the R, X, and B bits of

REX prefixes are concatenated with fields from the ModRM byte, SIB byte, and opcode to specify

Table 1-11. REX Prefix-Byte Fields

Mnemonic Bit Position Definition

— 7–4 0100

REX.W 3 0 = Default operand size

1 = 64-bit operand size

REX.R 2

1-bit (high) extension of the ModRM reg

field1, thus permitting access to 16

registers.

REX.X 1 1-bit (high) extension of the SIB index field1,

thus permitting access to 16 registers.

REX.B 0

1-bit (high) extension of the ModRM r/m

field1, SIB base field1, or opcode reg field,

thus permitting access to 16 registers.

Note:

1. For a description of the ModRM and SIB bytes, see “ModRM and SIB Bytes” on

page 17.

14 Instruction Formats

AMD64 Technology 24594—Rev. 3.14—September 2007

Byte-Register Addressing. In the legacy architecture, the byte registers (AH, AL, BH, BL, CH, CL,

DH, and DL, shown in Figure 2-2 on page 24) are encoded in the ModRM reg or r/m field or in the

opcode reg field as registers 0 through 7. The REX prefix provides an additional byte-register

addressing capability that makes the least-significant byte of any GPR available for byte operations

(Figure 2-3 on page 25). This provides a uniform set of byte, word, doubleword, and quadword

registers better suited for register allocation by compilers.

Special Encodings for Registers. Readers who need to know the details of instruction encodings

should be aware that certain combinations of the ModRM and SIB fields have special meaning for

are not decoded (treated as don’t cares), thereby creating aliases of these encodings in the extended

registers. Table 1-12 on page 16 describes how each of these cases behaves.

Implications for INC and DEC Instructions. The REX prefix values are taken from the 16 single-

byte INC and DEC instructions, one for each of the eight GPRs. Therefore, these single-byte opcodes

for INC and DEC are not available in 64-bit mode, although they are available in legacy and

compatibility modes. The functionality of these INC and DEC instructions is still available in 64-bit

mode, however, using the ModRM forms of those instructions (opcodes FF /0 and FF /1).

Instruction Formats 15

24594—Rev. 3.14—September 2007 AMD64 Technology

Figure 1-3. Encoding Examples of REX-Prefix R, X, and B Bits

513-302.eps

REX Prefix

Case 1: Register-Register Addressing (No Memory Operand)

REX.X is not used

4WRXB

Opcode

ModRM Byte

mod regr/m

rrr

11 bbb

REX Prefix

Case 2: Memory Addressing Without an SIB Byte

Rrrr

Rrrr Bbbb

4WRXB

Opcode

ModRM Byte

rrr

!11 bbb

Bbbb

REX Prefix

Case 3: Memory Addressing With an SIB Byte

Rrrr

4WRXB

Opcode

ModRM Byte

rrr

!11 100

SIB Byte

scale index base

xxx

bb bbb

BbbbXxxx

REX.R is not used

REX.X is not used

REX Prefix

Case 4: Register Operand Coded in Opcode Byte

Bbbb

4WRXB

Opcode Byte

op reg

bbb

ModRM reg field != 100

16 Instruction Formats

AMD64 Technology 24594—Rev. 3.14—September 2007

Table 1-12. Special REX Encodings for Registers

ModRM and SIB

Encodings2

Meaning in Legacy and

Compatibility Modes

Implications in Legacy

and Compatibility

Modes

Additional REX

Implications

ModRM Byte:

•mod≠11

•r/m

1= 100 (ESP)

SIB byte is present. SIB byte is required for

ESP-based addressing.

REX prefix adds a fourth

bit (b), which is decoded

and modifies the base

Therefore, the SIB byte is

also required for R12-

based addressing.

ModRM Byte:

•mod=00

•r/m

1= x101 (EBP)

Base register is not used.

Using EBP without a

displacement must be

done by setting mod = 01

with a displacement of 0

(with or without an index

REX prefix adds a fourth

bit (x), which is not

decoded (don’t care).

Therefore, using RBP or

R13 without a

displacement must be

done via mod = 01 with a

displacement of 0.

SIB Byte:

• index1= x100 (ESP) Index register is not used. ESP cannot be used as

an index register.

REX prefix adds a fourth

bit (x), which is decoded.

Therefore, there are no

additional implications.

The expanded index field

is used to distinguish RSP

from R12, allowing R12 to

be used as an index.

SIB Byte:

• base = b101 (EBP)

• ModRM.mod = 00

Base register is not used

if ModRM.mod = 00.

Base register depends on

mod encoding. Using

EBP with a scaled index

and without a

displacement must be

done by setting mod = 01

with a displacement of 0.

REX prefix adds a fourth

bit (b), which is not

decoded (don’t care).

Therefore, using RBP or

R13 without a

displacement must be

done via mod = 01 with a

displacement of 0 (with or

without an index register).

Note:

1. The REX-prefix bit is shown in the fourth (most-significant) bit position of the encodings for the ModRM r/m, SIB

index, and SIB base fields. The lower-case “x” for ModRM r/m (rather than the upper-case “B” shown in Figure 1-3

on page 15) indicates that the REX-prefix bit is not decoded (don’t care).

2. For a description of the ModRM and SIB bytes, see “ModRM and SIB Bytes” on page 17.

Instruction Formats 17

24594—Rev. 3.14—September 2007 AMD64 Technology

1.3 Opcode

Each instruction has a unique opcode, although assemblers can support multiple mnemonics for a

single instruction opcode. The opcode specifies the operation that the instruction performs and, in

certain cases, the kinds of operands it uses. An opcode consists of one or two bytes, but certain 128-bit

media instructions also use a prefix byte in a special way to modify the opcode. The 3-bit reg field of

the ModRM byte (“ModRM and SIB Bytes” on page 17) is also used in certain instructions either for

three additional opcode bits or for a register specification.

128-Bit and 64-Bit Media Instruction Opcodes. Many 128-bit and 64-bit media instructions

include a 66h, F2h, or F3h prefix byte in a special way to modify the opcode. These same byte values

can be used in certain general-purpose and x87 instructions to modify operand size (66h) or repeat the

operation (F2h, F3h). In 128-bit and 64-bit media instructions, however, such prefix bytes modify the

opcode. If a 128-bit or 64-bit media instruction uses one of these three prefixes, and also includes any

other prefix in the 66h, F2h, and F3h group, the result is unpredictable.

All opcodes for 64-bit media instructions begin with a 0Fh byte. In the case of 64-bit floating-point

(3DNow!) instructions, the 0Fh byte is followed by a second 0Fh opcode byte. A third opcode byte

occupies the same position at the end of a 3DNow! instruction as would an immediate byte. The value

of the immediate byte is shown as the third opcode byte-value in the syntax for each instruction in

“64-Bit Media Instruction Reference” in Volume 5. The format is:

0Fh 0Fh ModRM [SIB] [displacement]

3DNow!_third_opcode_byte

For details on opcode encoding, see Appendix A, “Opcode and Operand Encodings.”

1.4 ModRM and SIB Bytes

The ModRM byte is used in certain instruction encodings to:

•Define a register reference.

•Define a memory reference.

•Provide additional opcode bits with which to define the instruction’s function.

ModRM bytes have three fields—mod, reg, and r/m. The reg field provides additional opcode bits with

which to define the function of the instruction or one of its operands. The mod and r/m fields are used

together with each other and, in 64-bit mode, with the REX.R and REX.B bits of the REX prefix

(page 11), to specify the location of an instruction’s operands and certain of the possible addressing

modes (specifically, the non-complex modes).

Figure 1-4 on page 18 shows the format of a ModRM byte.

18 Instruction Formats

AMD64 Technology 24594—Rev. 3.14—September 2007

Figure 1-4. ModRM-Byte Format

In some instructions, the ModRM byte is followed by an SIB byte, which defines memory addressing

for the complex-addressing modes described in “Effective Addresses” in Volume 1. The SIB byte has

three fields—scale, index, and base—that define the scale factor, index-register number, and base-

REX.X bits extend the encoding of the SIB byte’s base and index fields.

Figure 1-5 shows the format of an SIB byte.

Figure 1-5. SIB-Byte Format

The encodings of ModRM and SIB bytes not only define memory-addressing modes, but they also

specify operand registers. The encodings do this by using 3-bit fields in the ModRM and SIB bytes,

depending on the format:

•ModRM: the reg and r/m fields of the ModRM byte. (Case 1 in Figure 1-3 on page 15 shows an

example of this).

•ModRM with SIB: the reg field of the ModRM byte and the base and index fields of the SIB byte.

(Case 3 in Figure 1-3 on page 15 shows an example of this).

513-305.eps

mod

REX.R bit of REX prefix can

extend this field to 4 bits

REX.B bit of REX prefix can

extend this field to 4 bits

regr/mModRM

01234567Bits:

513-306.eps

Bits:

scale index base SIB

01234567

REX.X bit of REX prefix can

extend this field to 4 bits

REX.B bit of REX prefix can

extend this field to 4 bits

Instruction Formats 19

24594—Rev. 3.14—September 2007 AMD64 Technology

•Instructions without ModRM: the reg field of the opcode. (Case 4 in Figure 1-3 on page 15 shows

an example of this).

In 64-bit mode, the bits needed to extend each field for accessing the additional registers are provided

by the REX prefixes, as shown in Figure 1-4 and Figure 1-5 on page 18.

For details on opcode encoding, see Appendix A, “Opcode and Operand Encodings.”

1.5 Displacement Bytes

A displacement (also called an offset) is a signed value that is added to the base of a code segment

(absolute addressing) or to an instruction pointer (relative addressing), depending on the addressing

mode. The size of a displacement is 1, 2, or 4 bytes. If an addressing mode requires a displacement, the

bytes (1, 2, or 4) for the displacement follow the opcode, ModRM, or SIB byte (whichever comes last)

in the instruction encoding.

In 64-bit mode, the same ModRM and SIB encodings are used to specify displacement sizes as those

used in legacy and compatibility modes. However, the displacement is sign-extended to 64 bits during

effective-address calculations. Also, in 64-bit mode, support is provided for some 64-bit displacement

and immediate forms of the MOV instruction. See “Immediate Operand Size” in Volume 1 for more

information on this.

1.6 Immediate Bytes

An immediate is a value—typically an operand value—encoded directly into the instruction.

Depending on the opcode and the operating mode, the size of an immediate operand can be 1, 2, 4, or 8

bytes. 64-bit immediates are allowed in 64-bit mode on MOV instructions that load GPRs, otherwise

they are limited to 4 bytes. See “Immediate Operand Size” in Volume 1 for more information.

If an instruction takes an immediate operand, the bytes (1, 2, 4, or 8) for the immediate follow the

opcode, ModRM, SIB, or displacement bytes (whichever come last) in the instruction encoding. Some

128-bit media instructions use the immediate byte as a condition code.

1.7 RIP-Relative Addressing

In 64-bit mode, addressing relative to the contents of the 64-bit instruction pointer (program

counter)—called RIP-relative addressing or PC-relative addressing—is implemented for certain

instructions. In such cases, the effective address is formed by adding the displacement to the 64-bit RIP

of the next instruction.

In the legacy x86 architecture, addressing relative to the instruction pointer is available only in control-

transfer instructions. In the 64-bit mode, any instruction that uses ModRM addressing can use RIP-

relative addressing. This feature is particularly useful for addressing data in position-independent code

and for code that addresses global data.

20 Instruction Formats

AMD64 Technology 24594—Rev. 3.14—September 2007

Without RIP-relative addressing, ModRM instructions address memory relative to zero. With RIP-

relative addressing, ModRM instructions can address memory relative to the 64-bit RIP using a signed

32-bit displacement. This provides an offset range of ±2 Gbytes from the RIP.

Programs usually have many references to data, especially global data, that are not register-based. To

load such a program, the loader typically selects a location for the program in memory and then adjusts

program references to global data based on the load location. RIP-relative addressing of data makes

this adjustment unnecessary.

1.7.1 Encoding

Table 1-13 shows the ModRM and SIB encodings for RIP-relative addressing. Redundant forms of 32-

bit displacement-only addressing exist in the current ModRM and SIB encodings. There is one

ModRM encoding with several SIB encodings. RIP-relative addressing is encoded using one of the

redundant forms. In 64-bit mode, the ModRM Disp32 (32-bit displacement) encoding is redefined to

be RIP + Disp32 rather than displacement-only.

1.7.2 REX Prefix and RIP-Relative Addressing

ModRM encoding for RIP-relative addressing does not depend on a REX prefix. In particular, the r/m

encoding of 101, used to select RIP-relative addressing, is not affected by the REX prefix. For

example, selecting R13 (REX.B = 1, r/m = 101) with mod = 00 still results in RIP-relative addressing.

The four-bit r/m field of ModRM is not fully decoded. Therefore, in order to address R13 with no

displacement, software must encode it as R13 + 0 using a one-byte displacement of zero.

1.7.3 Address-Size Prefix and RIP-Relative Addressing

RIP-relative addressing is enabled by 64-bit mode, not by a 64-bit address-size. Conversely, use of the

address-size prefix (“Address-Size Override Prefix” on page 6) does not disable RIP-relative

addressing. The effect of the address-size prefix is to truncate and zero-extend the computed effective

address to 32 bits, like any other addressing mode.

Table 1-13. Encoding for RIP-Relative Addressing

ModRM and SIB

Encodings

Meaning in Legacy and

Compatibility Modes Meaning in 64-bit Mode Additional 64-bit

Implications

ModRM Byte:

•mod=00

• r/m = 101 (none)

Disp32 RIP + Disp32

Zero-based (normal)

displacement addressing

must use SIB form (see

next row).

SIB Byte:

• base = 101 (none)

• index = 100 (none)

• scale = 1, 2, 4,8

If mod = 00, Disp32 Same as Legacy None

Instruction Overview 21

24594—Rev. 3.14—September 2007 AMD64 Technology

2 Instruction Overview

2.1 Instruction Subsets

For easier reference, the instruction descriptions are divided into five instruction subsets. The

following sections describe the function, mnemonic syntax, opcodes, affected flags, and possible

exceptions generated by all instructions in the AMD64 architecture:

•Chapter 3, “General-Purpose Instruction Reference”—The general-purpose instructions are used

in basic software execution. Most of these load, store, or operate on data in the general-purpose

registers (GPRs), in memory, or in both. Other instructions are used to alter sequential program

flow by branching to other locations within the program or to entirely different programs.

•Chapter 4, “System Instruction Reference”—The system instructions establish the processor

operating mode, access processor resources, handle program and system errors, and manage

memory.

•“128-Bit Media Instruction Reference” in Volume 4—The 128-bit media instructions load, store,

or operate on data located in the 128-bit XMM registers. These instructions define both vector and

scalar operations on floating-point and integer data types. They include the SSE and SSE2

instructions that operate on the XMM registers. Some of these instructions convert source

operands in XMM registers to destination operands in GPR, MMX, or x87 registers or otherwise

affect XMM state.

•“64-Bit Media Instruction Reference” in Volume 5—The 64-bit media instructions load, store, or

operate on data located in the 64-bit MMX registers. These instructions define both vector and

scalar operations on integer and floating-point data types. They include the legacy MMX™

instructions, the 3DNow!™ instructions, and the AMD extensions to the MMX and 3DNow!

instruction sets. Some of these instructions convert source operands in MMX registers to

destination operands in GPR, XMM, or x87 registers or otherwise affect MMX state.

•“x87 Floating-Point Instruction Reference” in Volume 5—The x87 instructions are used in legacy

floating-point applications. Most of these instructions load, store, or operate on data located in the

x87 ST(0)–ST(7) stack registers (the FPR0–FPR7 physical registers). The remaining instructions

within this category are used to manage the x87 floating-point environment.

The description of each instruction covers its behavior in all operating modes, including legacy mode

(real, virtual-8086, and protected modes) and long mode (compatibility and 64-bit modes). Details of

certain kinds of complex behavior—such as control-flow changes in CALL, INT, or FXSAVE

instructions—have cross-references in the instruction-detail pages to detailed descriptions in volumes

1 and 2.

Two instructions—CMPSD and MOVSD—use the same mnemonic for different instructions.

Assemblers can distinguish them on the basis of the number and type of operands with which they are

used.

22 Instruction Overview

AMD64 Technology 24594—Rev. 3.14—September 2007

2.2 Reference-Page Format

Figure 2-1 on page 23 shows the format of an instruction-detail page. The instruction mnemonic is

shown in bold at the top-left, along with its name. In this example, POPFD is the mnemonic and POP

to EFLAGS Doubleword is the name. Next, there is a general description of the instruction’s operation.

Many descriptions have cross-references to more detail in other parts of the manual.

Beneath the general description, the mnemonic is shown again, together with the related opcode(s) and

a description summary. Related instructions are listed below this, followed by a table showing the flags

that the instruction can affect. Finally, each instruction has a summary of the possible exceptions that

can occur when executing the instruction. The columns labeled “Real” and “Virtual-8086” apply only

to execution in legacy mode. The column labeled “Protected” applies both to legacy mode and long

mode, because long mode is a superset of legacy protected mode.

The 128-bit and 64-bit media instructions also have diagrams illustrating the operation. A few

instructions have examples or pseudocode describing the action.

Instruction Overview 23

24594—Rev. 3.14—September 2007 AMD64 Technology

Figure 2-1. Format of Instruction-Detail Pages

24594 Rev. 3.07 September 2003 AMD64 Technology

AAM 63

Converts the value in the AL register from binary to two unpacked BCD digits in the

AH (most significant) and AL (least significant) registers using the following formula:

AH = (AL/10d)

AL = (AL mod 10d).

In most modern assemblers, the AAM instruction adjusts to base-10 values. However,

by coding the instruction directly in binary, it can adjust to any base specified by the

immediate byte value (ib) suffixed onto the D4h opcode. For example, code D408h for

octal, D40Ah for decimal, and D40Ch for duodecimal (base 12).

Using this instruction in 64-bit mode generates an invalid-opcode exception.

Related Instructions

AAA, AAD, AAS

rFLAGS Affected

Exceptions

AAM ASCII Adjust After Multiply

Mnemonic Opcode Description

AAM D4 0A Create a pair of unpacked BCD values in AH and AL.

(Invalid in 64-bit mode.)

(None) D4 ib Create a pair of unpacked values to the immediate byte base.

(Invalid in 64-bit mode.)

ID VIP VIF AC VM RF NT IOPL OF DF IF TF SF ZF AF PF CF

U MMUMU

21201918171614 13–12 11109876420

Note: Bits 31–22, 15, 5, 3, and 1 are reserved. A flag set to 1 or cleared to 0 is M. Unaffected flags are blank. Undefined flags are U.

Exception Real

Virtual

8086 Protected Cause of Exception

Divide by zero, #DE X X X 8-bit immediate value was 0.

Invalid opcode, #UD X This instruction was executed in 64-bit mode.

Mnemonic and any operands Opcode Description of operation

“M” means the flag is either set or

cleared, depending on the result.

Possible exceptions

and causes, by mode of

operation

“Protected” column

covers both legacy

and long mode

Alphabetic mnemonic locator

24 Instruction Overview

AMD64 Technology 24594—Rev. 3.14—September 2007

2.3 Summary of Registers and Data Types

This section summarizes the registers available to software using the five instruction subsets described

in “Instruction Subsets” on page 21. For details on the organization and use of these registers, see their

respective chapters in volumes 1 and 2.

2.3.1 General-Purpose Instructions

Registers. The size and number of general-purpose registers (GPRs) depends on the operating mode,

as do the size of the flags and instruction-pointer registers. Figure 2-2 shows the registers available in

legacy and compatibility modes.

Figure 2-2. General Registers in Legacy and Compatibility Modes

Figure 2-3 on page 25 shows the registers accessible in 64-bit mode. Compared with legacy mode,

registers become 64 bits wide, eight new data registers (R8–R15) are added and the low byte of all 16

GPRs is available for byte operations, and the four high-byte registers of legacy mode (AH, BH, CH,

and DH) are not available if the REX prefix is used. The high 32 bits of doubleword operands are zero-

extended to 64 bits, but the high bits of word and byte operands are not modified by operations in 64-

513-311.eps

31 15 016

EAX

EBX

ECX

EDX

ESI

EDI

EBP

ESP

16-bit

low

8-bit

high

8-bit32-bit

AH (4)

BH (7)

CH (5)

DH (6)

FLAGS

31 0

FLAGS

EFLAGS

EIP

encoding

Instruction Overview 25

24594—Rev. 3.14—September 2007 AMD64 Technology

bit mode. The RFLAGS register is 64 bits wide, but the high 32 bits are reserved. They can be written

with anything but they read as zeros (RAZ).

Figure 2-3. General Registers in 64-Bit Mode

For most instructions running in 64-bit mode, access to the extended GPRs requires a REX instruction

prefix (page 11).

513-309.eps

6331157081632

R10

R11

R12

R13

R14

R15

R8W

R9W

R10W

R11W

R12W

R13W

R14W

R15W

R8D

R9D

R10D

R11D

R12D

R13D

R14D

R15D

EAX

EBX

ECX

EDX

ESI

EDI

EBP

ESP

RAX

RBX

RCX

RDX

RSI

RDI

RBP

RSP

AH*

BH*

CH*

DH*

16-bit32-bit64-bit

encoding

zero-extended

for 32-bit operands

not modified for 8-bit operands

not modified for 16-bit operands

6331032

RFLAGS

RIP

low

8-bit

R8B

R9B

R10B

R11B

R12B

R13B

R14B

R15B

SIL**

DIL**

BPL**

SPL**

*Not addressable when

a REX prefix is used.

** Only addressable when

a REX

refix is used.

26 Instruction Overview

AMD64 Technology 24594—Rev. 3.14—September 2007

Figure 2-4 shows the segment registers which, like the instruction pointer, are used by all instructions.

In legacy and compatibility modes, all segments are accessible. In 64-bit mode, which uses the flat

(non-segmented) memory model, only the CS, FS, and GS segments are recognized, whereas the

contents of the DS, ES, and SS segment registers are ignored (the base for each of these segments is

assumed to be zero, and neither their segment limit nor attributes are checked). For details, see

“Segmented Virtual Memory” in Volume 2.

Figure 2-4. Segment Registers

Data Types. Figure 2-5 on page 27 shows the general-purpose data types. They are all scalar, integer

data types. The 64-bit (quadword) data types are only available in 64-bit mode, and for most

instructions they require a REX instruction prefix.

513-312.eps

15 0

(Base only)

(Attributes only)

Legacy Mode and

Compatibility Mode

64-Bit

Mode

ignored

Instruction Overview 27

24594—Rev. 3.14—September 2007 AMD64 Technology

Figure 2-5. General-Purpose Data Types

2.3.2 System Instructions

Registers. The system instructions use several specialized registers shown in Figure 2-6 on page 28.

System software uses these registers to, among other things, manage the processor’s operating

environment, define system resource characteristics, and monitor software execution. With the

exception of the RFLAGS register, system registers can be read and written only from privileged

software.

All system registers are 64 bits wide, except for the descriptor-table registers and the task register,

which include 64-bit base-address fields and other fields.

513-326.eps

127

Quadword

Double

Quadword

Doubleword

Word

Byte

Quadword

Unsigned Integer

Signed Integer

Doubleword

Word

Byte

Bit

8 bytes (64-bit mode only)

s16 bytes (64-bit mode only)

127

Double

Quadword

16 bytes (64-bit mode only)

4 bytes

2 bytes

Packed BCD

BCD Digit

8 bytes (64-bit mode only)

4 bytes

2 bytes

28 Instruction Overview

AMD64 Technology 24594—Rev. 3.14—September 2007

Figure 2-6. System Registers

Data Structures. Figure 2-7 on page 29 shows the system data structures. These are created and

maintained by system software for use in protected mode. A processor running in protected mode uses

these data structures to manage memory and protection, and to store program-state information when

an interrupt or task switch occurs.

Control Registers

CR0

CR2

CR3

CR4

CR8

System-Flags Register

RFLAGS

Debug Registers

DR0

DR1

DR2

DR3

DR6

DR7

513-260.eps

Memory-Typing Registers

MTRRcap

MTRRdefType

MTRRphysBasen

MTRRphysMaskn

MTRRfixn

PAT

TOP_MEM

TOP_MEM2

Machine-Check Registers

MCG_CAP

MCG_STAT

MCG_CTL

MCi_CTL

MCi_STATUS

MCi_ADDR

MCi_MISC

Performance-Monitoring Registers

TSC

PerfEvtSeln

PerfCtrn

Model-Specific Registers

Descriptor-Table Registers

GDTR

IDTR

LDTR

Task Register

Extended-Feature-Enable Register

EFER

Debug-Extension Registers

DebugCtlMSR

LastBranchFromIP

LastBranchToIP

LastIntFromIP

LastIntToIP

System-Configuration Register

SYSCFG

System-Linkage Registers

STAR

LSTAR

CSTAR

FS.base

GS.base

KernelGSbase

SYSENTER_CS

SYSENTER_ESP

SYSENTER_EIP

SFMASK

Instruction Overview 29

24594—Rev. 3.14—September 2007 AMD64 Technology

Figure 2-7. System Data Structures

2.3.3 128-Bit Media Instructions

Registers. The 128-bit media instructions use the 128-bit XMM registers. The number of available

XMM data registers depends on the operating mode, as shown in Figure 2-8 on page 30. In legacy and

compatibility modes, the eight legacy XMM data registers (XMM0–XMM7) are available. In 64-bit

mode, eight additional XMM data registers (XMM8–XMM15) are available when a REX instruction

prefix is used.

The MXCSR register contains floating-point and other control and status flags used by the 128-bit

media instructions. Some 128-bit media instructions also use the GPR (Figure 2-2 and Figure 2-3) and

513-261.eps

Segment Descriptors (Contained in Descriptor Tables)

Code

Stack

Data

Gate

Task-State Segment

Local-Descriptor Table

Task-State Segment

Page-Translation Tables

Page-Map Level-4 Page TablePage DirectoryPage-Directory Pointer

Global-Descriptor Table

Descriptor

. . .

Descriptor

Interrupt-Descriptor Table

Gate Descriptor

. . .

Gate Descriptor

Local-Descriptor Table

Descriptor

. . .

Descriptor

Descriptor Tables

30 Instruction Overview

AMD64 Technology 24594—Rev. 3.14—September 2007

the MMX registers (Figure 2-10 on page 32) or set or clear flags in the rFLAGS register (see

Figure 2-2 and Figure 2-3).

Figure 2-8. 128-Bit Media Registers

Data Types. Figure 2-9 on page 31 shows the 128-bit media data types. They include floating-point

and integer vectors and floating-point scalars. The floating-point data types include IEEE-754 single

precision and double precision types.

513-314.eps

XMM Data Registers

127 0

xmm0

xmm1

xmm2

xmm3

xmm4

xmm5

xmm6

xmm7

xmm8

xmm9

xmm10

xmm11

xmm12

xmm13

xmm14

xmm15

Available in all modes

Available only in 64-bit mode

31 0

MXCSR

128-Bit Media Control and Status Register

Instruction Overview 31

24594—Rev. 3.14—September 2007 AMD64 Technology

Figure 2-9. 128-Bit Media Data Types

ssss

ssssssss

ssss

ssssssssssssssss

ssssssss

ssss

ssssssssssssssss

513-316.eps

71523313947556371798795103111119127 0

bytebytebytebytebytebytebytebytebytebytebytebytebytebytebytebyte

31 22635495 86127 118 0

Vector (Packed) Floating-Point Double Precision and Single Precision

Vector (Packed) Unsigned Integer Quadword, Doubleword, Word, Byte

71523313947556371798795103111119127 0

quadwordquadword

doubleworddoubleworddoubleworddoubleword

wordwordwordwordwordwordwordword

quadwordquadword

doubleworddoubleworddoubleworddoubleword

wordwordwordwordwordwordwordword

127 0

Scalar Unsigned Integers

127

double quadword

bytebytebytebytebytebytebytebytebytebytebytebytebytebytebytebyte

Vector (Packed) Signed Integer Quadword, Doubleword, Word, Byte

significand

expsignificand

06351127 115

expsignificand

expsignificandexpsignificandexpsignificandexp

31 22 0

Scalar Floating-Point Double Precision and Single Precision

significand

expsignificand

6351 exp

quadword

doubleword

word

byte

bit

32 Instruction Overview

AMD64 Technology 24594—Rev. 3.14—September 2007

2.3.4 64-Bit Media Instructions

Registers. The 64-bit media instructions use the eight 64-bit MMX registers, as shown in

Figure 2-10. These registers are mapped onto the x87 floating-point registers, and 64-bit media

instructions write the x87 tag word in a way that prevents an x87 instruction from using MMX data.

Some 64-bit media instructions also use the GPR (Figure 2-2 and Figure 2-3) and the XMM registers

(Figure 2-8).

Figure 2-10. 64-Bit Media Registers

Data Types. Figure 2-11 on page 33 shows the 64-bit media data types. They include floating-point

and integer vectors and integer scalars. The floating-point data type, used by 3DNow! instructions,

consists of a packed vector or two IEEE-754 32-bit single-precision data types. Unlike other kinds of

floating-point instructions, however, the 3DNow!™ instructions do not generate floating-point

exceptions. For this reason, there is no register for reporting or controlling the status of exceptions in

the 64-bit-media instruction subset.

513-327.eps

MMX Data Registers

630

mmx0

mmx1

mmx2

mmx3

mmx4

mmx5

mmx6

mmx7

Instruction Overview 33

24594—Rev. 3.14—September 2007 AMD64 Technology

Figure 2-11. 64-Bit Media Data Types

ss ss

ssss

ssssssss

ssss

ssssssss

513-319.eps

7152331394755630

bytebytebytebytebytebytebytebyte

31 226354 0

Vector (Packed) Single-Precision Floating-Point

Vector (Packed) Unsigned Integers

7152331394755630

doubleworddoubleword

wordwordwordword

doubleworddoubleword

wordwordwordword

bytebytebytebytebytebytebytebyte

Vector (Packed) Signed Integers

significandexpsignificandexp

Unsigned Integers

Signed Integers

quadword

doubleword

word

byte

quadword

doubleword

word

byte

34 Instruction Overview

AMD64 Technology 24594—Rev. 3.14—September 2007

2.3.5 x87 Floating-Point Instructions

Registers. The x87 floating-point instructions use the x87 registers shown in Figure 2-12. There are

eight 80-bit data registers, three 16-bit registers that hold the x87 control word, status word, and tag

word, and three registers (last instruction pointer, last opcode, last data pointer) that hold information

about the last x87 operation.

The physical data registers are named FPR0–FPR7, although x87 software references these registers as

a stack of registers, named ST(0)–ST(7). The x87 instructions store operands only in their own 80-bit

floating-point registers or in memory. They do not access the GPR or XMM registers.

Figure 2-12. x87 Registers

Data Types. Figure 2-13 on page 35 shows all x87 data types. They include three floating-point

formats (80-bit double-extended precision, 64-bit double precision, and 32-bit single precision), three

signed-integer formats (quadword, doubleword, and word), and an 80-bit packed binary-coded

decimal (BCD) format.

Tag Word

Status Word

Control Word

513-321.eps

x87 Data Registers

79 0

fpr0

fpr1

fpr2

fpr3

fpr4

fpr5

fpr6

fpr7

015

010

Instruction Pointer (rIP)

Data Pointer (rDP)

Tag Word

Status Word

Control Word

Opcode

Instruction Overview 35

24594—Rev. 3.14—September 2007 AMD64 Technology

Figure 2-13. x87 Data Types

2.4 Summary of Exceptions

Table 2-1 on page 36 lists all possible exceptions. The table shows the interrupt-vector numbers,

names, mnemonics, source, and possible causes. Exceptions that apply to specific instructions are

documented with each instruction in the instruction-detail pages that follow.

513-317.eps

15 0

Quadword

Doubleword

Word

Signed Integer

Binary-Coded Decimal (BCD)

Floating-Point

8 bytes

4 bytes

Double Precision

Single Precision

2 bytes

079 71

Double-Extended

Precision

Packed Decimal

s i

significand

expsignificand

exp

36 Instruction Overview

AMD64 Technology 24594—Rev. 3.14—September 2007

Table 2-1. Interrupt-Vector Source and Cause

Vector Interrupt (Exception) Mnemonic Source Cause

0 Divide-By-Zero-Error #DE Software DIV, IDIV, AAM instructions

1 Debug #DB Internal Instruction accesses and data accesses

2 Non-Maskable-Interrupt #NMI External External NMI signal

3 Breakpoint #BP Software INT3 instruction

4 Overflow #OF Software INTO instruction

5 Bound-Range #BR Software BOUND instruction

6 Invalid-Opcode #UD Internal Invalid instructions

7 Device-Not-Available #NM Internal x87 instructions

8 Double-Fault #DF Internal Interrupt during an interrupt

9Coprocessor-Segment-Overrun —External Unsupported (reserved)

10 Invalid-TSS #TS Internal Task-state segment access and task

switch

11 Segment-Not-Present #NP Internal Segment access through a descriptor

12 Stack #SS Internal SS register loads and stack references

13 General-Protection #GP Internal Memory accesses and protection

checks

14 Page-Fault #PF Internal Memory accesses when paging

enabled

15 Reserved —

16 Floating-Point Exception-

Pending #MF Software x87 floating-point and 64-bit media

floating-point instructions

17 Alignment-Check #AC Internal Memory accesses

18 Machine-Check #MC Internal

External Model specific

19 SIMD Floating-Point #XF Internal 128-bit media floating-point instructions

20—29 Reserved (Internal and External) —

30 SVM Security Exception #SX External Security-Sensitive Events

31 Reserved (Internal and External) —

0—255 External Interrupts (Maskable) #INTR External External interrupt signal

0—255 Software Interrupts — Software INTn instruction

Instruction Overview 37

24594—Rev. 3.14—September 2007 AMD64 Technology

2.5 Notation

2.5.1 Mnemonic Syntax

Each instruction has a syntax that includes the mnemonic and any operands that the instruction can

take. Figure 2-14 shows an example of a syntax in which the instruction takes two operands. In most

instructions that take two operands, the first (left-most) operand is both a source operand (the first

source operand) and the destination operand. The second (right-most) operand serves only as a source,

not a destination.

Figure 2-14. Syntax for Typical Two-Operand Instruction

The following notation is used to denote the size and type of source and destination operands:

•cReg—Control register.

•dReg—Debug register.

•imm8—Byte (8-bit) immediate.

•imm16—Word (16-bit) immediate.

•imm16/32—Word (16-bit) or doubleword (32-bit) immediate.

•imm32—Doubleword (32-bit) immediate.

•imm32/64—Doubleword (32-bit) or quadword (64-bit) immediate.

•imm64—Quadword (64-bit) immediate.

•mem—An operand of unspecified size in memory.

•mem8—Byte (8-bit) operand in memory.

•mem16—Word (16-bit) operand in memory.

•mem16/32—Word (16-bit) or doubleword (32-bit) operand in memory.

•mem32—Doubleword (32-bit) operand in memory.

•mem32/48—Doubleword (32-bit) or 48-bit operand in memory.

•mem48—48-bit operand in memory.

513-322.eps

Mnemonic

First Source Operand

and Destination Operand

Second Source Operand

ADDPD xmm1, xmm2/mem128

38 Instruction Overview

AMD64 Technology 24594—Rev. 3.14—September 2007

•mem64—Quadword (64-bit) operand in memory.

•mem128—Double quadword (128-bit) operand in memory.

•mem16:16—Two sequential word (16-bit) operands in memory.

•mem16:32—A doubleword (32-bit) operand followed by a word (16-bit) operand in memory.

•mem32real—Single-precision (32-bit) floating-point operand in memory.

•mem32int—Doubleword (32-bit) integer operand in memory.

•mem64real—Double-precision (64-bit) floating-point operand in memory.

•mem64int—Quadword (64-bit) integer operand in memory.

•mem80real—Double-extended-precision (80-bit) floating-point operand in memory.

•mem80dec—80-bit packed BCD operand in memory, containing 18 4-bit BCD digits.

•mem2env—16-bit x87 control word or x87 status word.

•mem14/28env—14-byte or 28-byte x87 environment. The x87 environment consists of the x87

control word, x87 status word, x87 tag word, last non-control instruction pointer, last data pointer,

and opcode of the last non-control instruction completed.

•mem94/108env—94-byte or 108-byte x87 environment and register stack.

•mem512env—512-byte environment for 128-bit media, 64-bit media, and x87 instructions.

•mmx—Quadword (64-bit) operand in an MMX register.

•mmx1—Quadword (64-bit) operand in an MMX register, specified as the left-most (first) operand

in the instruction syntax.

•mmx2—Quadword (64-bit) operand in an MMX register, specified as the right-most (second)

operand in the instruction syntax.

•mmx/mem32—Doubleword (32-bit) operand in an MMX register or memory.

•mmx/mem64—Quadword (64-bit) operand in an MMX register or memory.

•mmx1/mem64—Quadword (64-bit) operand in an MMX register or memory, specified as the left-

most (first) operand in the instruction syntax.

•mmx2/mem64—Quadword (64-bit) operand in an MMX register or memory, specified as the right-

most (second) operand in the instruction syntax.

•moffset—Direct memory offset that specifies an operand in memory.

•moffset8—Direct memory offset that specifies a byte (8-bit) operand in memory.

•moffset16—Direct memory offset that specifies a word (16-bit) operand in memory.

•moffset32—Direct memory offset that specifies a doubleword (32-bit) operand in memory.

•moffset64—Direct memory offset that specifies a quadword (64-bit) operand in memory.

•pntr16:16—Far pointer with 16-bit selector and 16-bit offset.

•pntr16:32—Far pointer with 16-bit selector and 32-bit offset.

•reg—Operand of unspecified size in a GPR register.

•reg8—Byte (8-bit) operand in a GPR register.

Instruction Overview 39

24594—Rev. 3.14—September 2007 AMD64 Technology

•reg16—Word (16-bit) operand in a GPR register.

•reg16/32—Word (16-bit) or doubleword (32-bit) operand in a GPR register.

•reg32—Doubleword (32-bit) operand in a GPR register.

•reg64—Quadword (64-bit) operand in a GPR register.

•reg/mem8—Byte (8-bit) operand in a GPR register or memory.

•reg/mem16—Word (16-bit) operand in a GPR register or memory.

•reg/mem32—Doubleword (32-bit) operand in a GPR register or memory.

•reg/mem64—Quadword (64-bit) operand in a GPR register or memory.

•rel8off—Signed 8-bit offset relative to the instruction pointer.

•rel16off—Signed 16-bit offset relative to the instruction pointer.

•rel32off—Signed 32-bit offset relative to the instruction pointer.

•segReg or sReg—Word (16-bit) operand in a segment register.

•ST(0)—x87 stack register 0.

•ST(i)—x87 stack register i, where i is between 0 and 7.

•xmm—Double quadword (128-bit) operand in an XMM register.

•xmm1—Double quadword (128-bit) operand in an XMM register, specified as the left-most (first)

operand in the instruction syntax.

•xmm2—Double quadword (128-bit) operand in an XMM register, specified as the right-most

(second) operand in the instruction syntax.

•xmm/mem64—Quadword (64-bit) operand in a 128-bit XMM register or memory.

•xmm/mem128—Double quadword (128-bit) operand in an XMM register or memory.

•xmm1/mem128—Double quadword (128-bit) operand in an XMM register or memory, specified as

the left-most (first) operand in the instruction syntax.

•xmm2/mem128—Double quadword (128-bit) operand in an XMM register or memory, specified as

the right-most (second) operand in the instruction syntax.

2.5.2 Opcode Syntax

In addition to the notation shown above in “Mnemonic Syntax” on page 37, the following notation

indicates the size and type of operands in the syntax of an instruction opcode:

•/digit—Indicates that the ModRM byte specifies only one register or memory (r/m) operand. The

digit is specified by the ModRM reg field and is used as an instruction-opcode extension. Valid

digit values range from 0 to 7.

•/r—Indicates that the ModRM byte specifies both a register operand and a reg/mem (register or

memory) operand.

•cb, cw, cd, cp—Specifies a code-offset value and possibly a new code-segment register value. The

value following the opcode is either one byte (cb), two bytes (cw), four bytes (cd), or six bytes (cp).

40 Instruction Overview

AMD64 Technology 24594—Rev. 3.14—September 2007

•ib, iw, id, iq—Specifies an immediate-operand value. The opcode determines whether the value is

signed or unsigned. The value following the opcode, ModRM, or SIB byte is either one byte (ib),

two bytes (iw), or four bytes (id). Word and doubleword values start with the low-order byte.

•+rb, +rw, +rd, +rq—Specifies a register value that is added to the hexadecimal byte on the left,

forming a one-byte opcode. The result is an instruction that operates on the register specified by the

•m64—Specifies a quadword (64-bit) operand in memory.

•+i—Specifies an x87 floating-point stack operand, ST(i). The value is used only with x87 floating-

point instructions. It is added to the hexadecimal byte on the left, forming a one-byte opcode. Valid

values range from 0 to 7.

Table 2-2. +rb, +rw, +rd, and +rq Register Value

REX.B

Bit1Value Specified Register

+rb +rw +rd +rq

or no REX

Prefix

0ALAXEAXRAX

1 CL CX ECX RCX

2 DL DX EDX RDX

3BLBXEBXRBX

4AH, SPL

1SP ESP RSP

5 CH, BPL1BP EBP RBP

6DH, SIL

1SI ESI RSI

7 BH, DIL1DI EDI RDI

0 R8B R8W R8D R8

1 R9B R9W R9D R9

2 R10B R10W R10D R10

3 R11B R11W R11D R11

4 R12B R12W R12D R12

5 R13B R13W R13D R13

6 R14B R14W R14D R14

7 R15B R15W R15D R15

1. See “REX Prefixes” on page 11.

Instruction Overview 41

24594—Rev. 3.14—September 2007 AMD64 Technology

2.5.3 Pseudocode Definitions

Pseudocode examples are given for the actions of several complex instructions (for example, see

“CALL (Near)” on page 76). The following definitions apply to all such pseudocode examples:

/////////////////////////////////////////////////////////////////////////////////

// Basic Definitions

/////////////////////////////////////////////////////////////////////////////////

// All comments start with these double slashes.

REAL_MODE = (cr0.pe=0)

PROTECTED_MODE = ((cr0.pe=1) && (rflags.vm=0))

VIRTUAL_MODE = ((cr0.pe=1) && (rflags.vm=1))

LEGACY_MODE = (efer.lma=0)

LONG_MODE = (efer.lma=1)

64BIT_MODE = ((efer.lma=1) && (cs.L=1) && (cs.d=0))

COMPATIBILITY_MODE = (efer.lma=1) && (cs.L=0)

PAGING_ENABLED = (cr0.pg=1)

ALIGNMENT_CHECK_ENABLED = ((cr0.am=1) && (eflags.ac=1) && (cpl=3))

CPL = the current privilege level (0-3)

OPERAND_SIZE = 16, 32, or 64 (depending on current code and 66h/rex prefixes)

ADDRESS_SIZE = 16, 32, or 64 (depending on current code and 67h prefixes)

STACK_SIZE = 16, 32, or 64 (depending on current code and SS.attr.B)

old_RIP = RIP at the start of current instruction

old_RSP = RSP at the start of current instruction

old_RFLAGS = RFLAGS at the start of the instruction

old_CS = CS selector at the start of current instruction

old_DS = DS selector at the start of current instruction

old_ES = ES selector at the start of current instruction

old_FS = FS selector at the start of current instruction

old_GS = GS selector at the start of current instruction

old_SS = SS selector at the start of current instruction

RIP = the current RIP register

RSP = the current RSP register

RBP = the current RBP register

RFLAGS = the current RFLAGS register

next_RIP = RIP at start of next instruction

CS = the current CS descriptor, including the subfields:

sel base limit attr

SS = the current SS descriptor, including the subfields:

sel base limit attr

SRC = the instruction’s Source operand

DEST = the instruction’s Destination operand

temp_* // 64-bit temporary register

42 Instruction Overview

AMD64 Technology 24594—Rev. 3.14—September 2007

temp_*_desc // temporary descriptor, with subfields:

// if it points to a block of memory: sel base limit attr

// if it’s a gate descriptor: sel offset segment attr

NULL = 0x0000 // null selector is all zeros

// V,Z,A,S are integer variables, assigned a value when an instruction begins

// executing (they can be assigned a different value in the middle of an

// instruction, if needed)

V = 2 if OPERAND_SIZE=16

4 if OPERAND_SIZE=32

8 if OPERAND_SIZE=64

Z = 2 if OPERAND_SIZE=16

4 if OPERAND_SIZE=32

4 if OPERAND_SIZE=64

A = 2 if ADDRESS_SIZE=16

4 if ADDRESS_SIZE=32

8 if ADDRESS_SIZE=64

S = 2 if STACK_SIZE=16

4 if STACK_SIZE=32

8 if STACK_SIZE=64

/////////////////////////////////////////////////////////////////////////////////

// Bit Range Inside a Register

/////////////////////////////////////////////////////////////////////////////////

temp_data.[X:Y] // Bit X through Y in temp_data, with the other bits

// in the register masked off.

/////////////////////////////////////////////////////////////////////////////////

// Moving Data From One Register To Another

/////////////////////////////////////////////////////////////////////////////////

temp_dest.b = temp_src // 1-byte move (copies lower 8 bits of temp_src to

// temp_dest, preserving the upper 56 bits of temp_dest)

temp_dest.w = temp_src // 2-byte move (copies lower 16 bits of temp_src to

// temp_dest, preserving the upper 48 bits of temp_dest)

temp_dest.d = temp_src // 4-byte move (copies lower 32 bits of temp_src to

// temp_dest, and zeros out the upper 32 bits of temp_dest)

temp_dest.q = temp_src // 8-byte move (copies all 64 bits of temp_src to

// temp_dest)

temp_dest.v = temp_src // 2-byte move if V=2,

// 4-byte move if V=4,

// 8-byte move if V=8

Instruction Overview 43

24594—Rev. 3.14—September 2007 AMD64 Technology

temp_dest.z = temp_src // 2-byte move if Z=2,

// 4-byte move if Z=4

temp_dest.a = temp_src // 2-byte move if A=2,

// 4-byte move if A=4,

// 8-byte move if A=8

temp_dest.s = temp_src // 2-byte move if S=2,

// 4-byte move if S=4,

// 8-byte move if S=8

/////////////////////////////////////////////////////////////////////////////////

// Bitwise Operations

/////////////////////////////////////////////////////////////////////////////////

temp = a AND b

temp = a OR b

temp = a XOR b

temp = NOT a

temp = a SHL b

temp = a SHR b

/////////////////////////////////////////////////////////////////////////////////

// Logical Operations

/////////////////////////////////////////////////////////////////////////////////

IF (FOO && BAR)

IF (FOO || BAR)

IF (FOO = BAR)

IF (FOO != BAR)

IF (FOO > BAR)

IF (FOO < BAR)

IF (FOO >= BAR)

IF (FOO <= BAR)

/////////////////////////////////////////////////////////////////////////////////

// IF-THEN-ELSE

/////////////////////////////////////////////////////////////////////////////////

IF (FOO)

...

IF (FOO)

...

ELSIF (BAR)

...

ELSE

44 Instruction Overview

AMD64 Technology 24594—Rev. 3.14—September 2007

...

IF ((FOO && BAR) || (CONE && HEAD))

...

/////////////////////////////////////////////////////////////////////////////////

// Exceptions

/////////////////////////////////////////////////////////////////////////////////

EXCEPTION [#GP(0)] // error code in parenthesis

EXCEPTION [#UD] // if no error code

possible exception types:

#DE // Divide-By-Zero-Error Exception (Vector 0)

#DB // Debug Exception (Vector 1)

#BP // INT3 Breakpoint Exception (Vector 3)

#OF // INTO Overflow Exception (Vector 4)

#BR // Bound-Range Exception (Vector 5)

#UD // Invalid-Opcode Exception (Vector 6)

#NM // Device-Not-Available Exception (Vector 7)

#DF // Double-Fault Exception (Vector 8)

#TS // Invalid-TSS Exception (Vector 10)

#NP // Segment-Not-Present Exception (Vector 11)

#SS // Stack Exception (Vector 12)

#GP // General-Protection Exception (Vector 13)

#PF // Page-Fault Exception (Vector 14)

#MF // x87 Floating-Point Exception-Pending (Vector 16)

#AC // Alignment-Check Exception (Vector 17)

#MC // Machine-Check Exception (Vector 18)

#XF // SIMD Floating-Point Exception (Vector 19)

/////////////////////////////////////////////////////////////////////////////////

// READ_MEM

// General memory read. This zero-extends the data to 64 bits and returns it.

/////////////////////////////////////////////////////////////////////////////////

usage:

temp = READ_MEM.x [seg:offset] // where x is one of {v, z, b, w, d, q}

// and denotes the size of the memory read

definition:

IF ((seg AND 0xFFFC) = NULL) // GP fault for using a null segment to

// reference memory

EXCEPTION [#GP(0)]

IF ((seg=CS) || (seg=DS) || (seg=ES) || (seg=FS) || (seg=GS))

// CS,DS,ES,FS,GS check for segment limit or canonical

Instruction Overview 45

24594—Rev. 3.14—September 2007 AMD64 Technology

IF ((!64BIT_MODE) && (offset is outside seg’s limit))

EXCEPTION [#GP(0)]

// #GP fault for segment limit violation in non-64-bit mode

IF ((64BIT_MODE) && (offset is non-canonical))

EXCEPTION [#GP(0)]

// #GP fault for non-canonical address in 64-bit mode

ELSIF (seg=SS) // SS checks for segment limit or canonical

IF ((!64BIT_MODE) && (offset is outside seg’s limit))

EXCEPTION [#SS(0)]

// stack fault for segment limit violation in non-64-bit mode

IF ((64BIT_MODE) && (offset is non-canonical))

EXCEPTION [#SS(0)]

// stack fault for non-canonical address in 64-bit mode

ELSE // ((seg=GDT) || (seg=LDT) || (seg=IDT) || (seg=TSS))

// GDT,LDT,IDT,TSS check for segment limit and canonical

IF (offset > seg.limit)

EXCEPTION [#GP(0)] // #GP fault for segment limit violation

// in all modes

IF ((LONG_MODE) && (offset is non-canonical))

EXCEPTION [#GP(0)] // #GP fault for non-canonical address in long mode

IF ((ALIGNMENT_CHECK_ENABLED) && (offset misaligned, considering its

size and alignment))

EXCEPTION [#AC(0)]

IF ((64_bit_mode) && ((seg=CS) || (seg=DS) || (seg=ES) || (seg=SS))

temp_linear = offset

ELSE

temp_linear = seg.base + offset

IF ((PAGING_ENABLED) && (virtual-to-physical translation for temp_linear

results in a page-protection violation))

EXCEPTION [#PF(error_code)] // page fault for page-protection violation

// (U/S violation, Reserved bit violation)

IF ((PAGING_ENABLED) && (temp_linear is on a not-present page))

EXCEPTION [#PF(error_code)] // page fault for not-present page

temp_data = memory [temp_linear].x // zero-extends the data to 64

// bits, and saves it in temp_data

RETURN (temp_data) // return the zero-extended data

/////////////////////////////////////////////////////////////////////////////////

// WRITE_MEM // General memory write

/////////////////////////////////////////////////////////////////////////////////

usage:

WRITE_MEM.x [seg:offset] = temp.x // where <X> is one of these:

// {V, Z, B, W, D, Q} and denotes the

46 Instruction Overview

AMD64 Technology 24594—Rev. 3.14—September 2007

// size of the memory write

definition:

IF ((seg & 0xFFFC)= NULL) // GP fault for using a null segment

// to reference memory

EXCEPTION [#GP(0)]

IF (seg isn’t writable) // GP fault for writing to a read-only segment

EXCEPTION [#GP(0)]

IF ((seg=CS) || (seg=DS) || (seg=ES) || (seg=FS) || (seg=GS))

// CS,DS,ES,FS,GS check for segment limit or canonical

IF ((!64BIT_MODE) && (offset is outside seg’s limit))

EXCEPTION [#GP(0)]

// #GP fault for segment limit violation in non-64-bit mode

IF ((64BIT_MODE) && (offset is non-canonical))

EXCEPTION [#GP(0)]

// #GP fault for non-canonical address in 64-bit mode

ELSIF (seg=SS) // SS checks for segment limit or canonical

IF ((!64BIT_MODE) && (offset is outside seg’s limit))

EXCEPTION [#SS(0)]

// stack fault for segment limit violation in non-64-bit mode

IF ((64BIT_MODE) && (offset is non-canonical))

EXCEPTION [#SS(0)]

// stack fault for non-canonical address in 64-bit mode

ELSE // ((seg=GDT) || (seg=LDT) || (seg=IDT) || (seg=TSS))

// GDT,LDT,IDT,TSS check for segment limit and canonical

IF (offset > seg.limit)

EXCEPTION [#GP(0)]

// #GP fault for segment limit violation in all modes

IF ((LONG_MODE) && (offset is non-canonical))

EXCEPTION [#GP(0)]

// #GP fault for non-canonical address in long mode

IF ((ALIGNMENT_CHECK_ENABLED) && (offset is misaligned, considering

its size and alignment))

EXCEPTION [#AC(0)]

IF ((64_bit_mode) && ((seg=CS) || (seg=DS) || (seg=ES) || (seg=SS))

temp_linear = offset

ELSE

temp_linear = seg.base + offset

IF ((PAGING_ENABLED) && (the virtual-to-physical translation for

temp_linear results in a page-protection violation))

{

EXCEPTION [#PF(error_code)]

// page fault for page-protection violation

// (U/S violation, Reserved bit violation)

}

Instruction Overview 47

24594—Rev. 3.14—September 2007 AMD64 Technology

IF ((PAGING_ENABLED) && (temp_linear is on a not-present page))

EXCEPTION [#PF(error_code)] // page fault for not-present page

memory [temp_linear].x = temp.x // write the bytes to memory

/////////////////////////////////////////////////////////////////////////////////

// PUSH // Write data to the stack

/////////////////////////////////////////////////////////////////////////////////

usage:

PUSH.x temp // where x is one of these: {v, z, b, w, d, q} and

// denotes the size of the push

definition:

WRITE_MEM.x [SS:RSP.s - X] = temp.x // write to the stack

RSP.s = RSP - X // point rsp to the data just written

/////////////////////////////////////////////////////////////////////////////////

// POP // Read data from the stack, zero-extend it to 64 bits

/////////////////////////////////////////////////////////////////////////////////

usage:

POP.x temp // where x is one of these: {v, z, b, w, d, q} and

// denotes the size of the pop

definition:

temp = READ_MEM.x [SS:RSP.s] // read from the stack

RSP.s = RSP + X // point rsp above the data just written

/////////////////////////////////////////////////////////////////////////////////

// READ_DESCRIPTOR // Read 8-byte descriptor from GDT/LDT, return the descriptor

/////////////////////////////////////////////////////////////////////////////////

usage:

temp_descriptor = READ_DESCRIPTOR (selector, chktype)

// chktype field is one of the following:

// cs_chk used for far call and far jump

// clg_chk used when reading CS for far call or far jump through call gate

// ss_chk used when reading SS

// iret_chk used when reading CS for IRET or RETF

// intcs_chk used when readin the CS for interrupts and exceptions

definition:

temp_offset = selector AND 0xfff8 // upper 13 bits give an offset

48 Instruction Overview

AMD64 Technology 24594—Rev. 3.14—September 2007

// in the descriptor table

IF (selector.TI = 0) // read 8 bytes from the gdt, split it into

// (base,limit,attr) if the type bits

temp_desc = READ_MEM.q [gdt:temp_offset]

// indicate a block of memory, or split

// it into (segment,offset,attr)

// if the type bits indicate

// a gate, and save the result in temp_desc

ELSE

temp_desc = READ_MEM.q [ldt:temp_offset]

// read 8 bytes from the ldt, split it into

// (base,limit,attr) if the type bits

// indicate a block of memory, or split

// it into (segment,offset,attr) if the type

// bits indicate a gate, and save the result

// in temp_desc

IF (selector.rpl or temp_desc.attr.dpl is illegal for the current mode/cpl)

EXCEPTION [#GP(selector)]

IF (temp_desc.attr.type is illegal for the current mode/chktype)

EXCEPTION [#GP(selector)]

IF (temp_desc.attr.p=0)

EXCEPTION [#NP(selector)]

RETURN (temp_desc)

/////////////////////////////////////////////////////////////////////////////////

// READ_IDT // Read an 8-byte descriptor from the IDT, return the descriptor

/////////////////////////////////////////////////////////////////////////////////

usage:

temp_idt_desc = READ_IDT (vector)

// "vector" is the interrupt vector number

definition:

IF (LONG_MODE) // long-mode idt descriptors are 16 bytes long

temp_offset = vector*16

ELSE // (LEGACY_MODE) legacy-protected-mode idt descriptors are 8 bytes long

temp_offset = vector*8

temp_desc = READ_MEM.q [idt:temp_offset]

// read 8 bytes from the idt, split it into

// (segment,offset,attr), and save it in temp_desc

IF (temp_desc.attr.dpl is illegal for the current mode/cpl)

// exception, with error code that indicates this idt gate

Instruction Overview 49

24594—Rev. 3.14—September 2007 AMD64 Technology

EXCEPTION [#GP(vector*8+2)]

IF (temp_desc.attr.type is illegal for the current mode)

// exception, with error code that indicates this idt gate

EXCEPTION [#GP(vector*8+2)]

IF (temp_desc.attr.p=0)

EXCEPTION [#NP(vector*8+2)]

// segment-not-present exception, with an error code that

// indicates this idt gate

RETURN (temp_desc)

/////////////////////////////////////////////////////////////////////////////////

// READ_INNER_LEVEL_STACK_POINTER

// Read a new stack pointer (rsp or ss:esp) from the tss

/////////////////////////////////////////////////////////////////////////////////

usage:

temp_SS_desc:temp_RSP = READ_INNER_LEVEL_STACK_POINTER (new_cpl, ist_index)

definition:

IF (LONG_MODE)

{

IF (ist_index>0)

// if IST is selected, read an ISTn stack pointer from the tss

temp_RSP = READ_MEM.q [tss:ist_index*8+28]

ELSE // (ist_index=0)

// otherwise read an RSPn stack pointer from the tss

temp_RSP = READ_MEM.q [tss:new_cpl*8+4]

temp_SS_desc.sel = NULL + new_cpl

// in long mode, changing to lower cpl sets SS.sel to

// NULL+new_cpl

}

ELSE // (LEGACY_MODE)

{

temp_RSP = READ_MEM.d [tss:new_cpl*8+4] // read ESPn from the tss

temp_sel = READ_MEM.d [tss:new_cpl*8+8] // read SSn from the tss

temp_SS_desc = READ_DESCRIPTOR (temp_sel, ss_chk)

}

return (temp_RSP:temp_SS_desc)

50 Instruction Overview

AMD64 Technology 24594—Rev. 3.14—September 2007

/////////////////////////////////////////////////////////////////////////////////

// READ_BIT_ARRAY // Read 1 bit from a bit array in memory

/////////////////////////////////////////////////////////////////////////////////

usage:

temp_value = READ_BIT_ARRAY ([mem], bit_number)

definition:

temp_BYTE = READ_MEM.b [mem + (bit_number SHR 3)]

// read the byte containing the bit

temp_BIT = temp_BYTE SHR (bit_number & 7)

// shift the requested bit position into bit 0

return (temp_BIT & 0x01) // return ’0’ or ’1’

Instruction Reference 51

24594—Rev. 3.14—September 2007 AMD64 Technology

3 General-Purpose Instruction Reference

This chapter describes the function, mnemonic syntax, opcodes, affected flags, and possible

exceptions generated by the general-purpose instructions. General-purpose instructions are used in

basic software execution. Most of these instructions load, store, or operate on data located in the

general-purpose registers (GPRs), in memory, or in both. The remaining instructions are used to alter

the sequential flow of the program by branching to other locations within the program, or to entirely

different programs. With the exception of the MOVD, MOVMSKPD and MOVMSKPS instructions,

which operate on MMX/XMM registers, the instructions within the category of general-purpose

instructions do not operate on any other register set.

Most general-purpose instructions are supported in all hardware implementations of the AMD64

architecture, however it may be necessary to use the CPUID instruction to test for support for a small

set of general-purpose instructions. These instructions are listed in Table 3-1, along with the CPUID

function, the register and bit used to test for the presence of the instruction.

The general-purpose instructions can be used in legacy mode or 64-bit long mode. Compilation of

general-purpose programs for execution in 64-bit long mode offers three primary advantages: access to

the eight extended, 64-bit general-purpose registers (for a register set consisting of GPR0–GPR15),

access to the 64-bit virtual address space, and access to the RIP-relative addressing mode.

For further information about the general-purpose instructions and register resources, see:

Table 3-1. Instruction Support Indicated by CPUID Feature Bits

Instruction Register[Bit] Feature Mnemonic CPUID Function(s)

CMPXCHG8B EDX[8] CMPXCHG8B 0000_0001h, 8000_0001h

CMPXCHG16B ECX[13] CMPXCHG16B 0000_0001h

CMOVcc (Conditional Moves) EDX[15] CMOV 0000_0001h, 8000_0001h

CLFLUSH EDX[19] CLFSH 0000_0001h

LZCNT ECX[5] Advanced Bit

Manipulation (ABM) 8000_0001h

Long Mode instructions EDX[29] Long Mode (LM) 8000_0001h

MFENCE, LFENCE EDX[26] SSE2 0000_0001h

MOVD EDX[25] SSE 0000_0001h

EDX[26] SSE2

MOVNTI EDX[26] SSE2 0000_0001h

POPCNT ECX[23] POPCNT 0000_0001h

PREFETCH/W

ECX[8] 3DNow!™ Prefetch

8000_0001hEDX[29] LM

EDX[31] 3DNow!™

SFENCE EDX[25] FXSR 0000_0001h

52 Instruction Reference

AMD64 Technology 24594—Rev. 3.14—September 2007

•“General-Purpose Programming” in Volume 1.

•“Summary of Registers and Data Types” on page 24.

•“Notation” on page 37.

•“Instruction Prefixes” on page 3.

•Appendix B, “General-Purpose Instructions in 64-Bit Mode.” In particular, see “General Rules for

64-Bit Mode” on page 373.

Instruction Reference AAA 53

24594—Rev. 3.14—September 2007 AMD64 Technology

Adjusts the value in the AL register to an unpacked BCD value. Use the AAA instruction after using

the ADD instruction to add two unpacked BCD numbers.

If the value in the lower nibble of AL is greater than 9 or the AF flag is set to 1, the instruction

increments the AH register, adds 6 to the AL register, and sets the CF and AF flags to 1. Otherwise, it

does not change the AH register and clears the CF and AF flags to 0. In either case, AAA clears bits

7–4 of the AL register, leaving the correct decimal digit in bits 3–0.

This instruction also makes it possible to add ASCII numbers without having to mask off the upper

nibble ‘3’.

MXCSR Flags Affected

Using this instruction in 64-bit mode generates an invalid-opcode exception.

Related Instructions

AAD, AAM, AAS

rFLAGS Affected

Exceptions

AAA ASCII Adjust After Addition

Mnemonic Opcode Description

AAA 37 Create an unpacked BCD number.

(Invalid in 64-bit mode.)

ID VIP VIF AC VM RF NT IOPL OF DF IF TF SF ZF AF PF CF

U UUMUM

2120191817161413–1211109876420

Note: Bits 31–22, 15, 5, 3, and 1 are reserved. A flag set to 1 or cleared to 0 is M (modified). Unaffected flags are

blank. Undefined flags are U.

Exception Real

Virtual

8086 Protected Cause of Exception

Invalid opcode, #UD X This instruction was executed in 64-bit mode.

54 AAD Instruction Reference

AMD64 Technology 24594—Rev. 3.14—September 2007

Converts two unpacked BCD digits in the AL (least significant) and AH (most significant) registers to

a single binary value in the AL register using the following formula:

AL = ((10d * AH) + (AL))

After the conversion, AH is cleared to 00h.

In most modern assemblers, the AAD instruction adjusts from base-10 values. However, by coding the

instruction directly in binary, it can adjust from any base specified by the immediate byte value (ib)

suffixed onto the D5h opcode. For example, code D508h for octal, D50Ah for decimal, and D50Ch for

duodecimal (base 12).

Using this instruction in 64-bit mode generates an invalid-opcode exception.

Related Instructions

AAA, AAM, AAS

rFLAGS Affected

Exceptions

AAD ASCII Adjust Before Division

Mnemonic Opcode Description

AAD D5 0A Adjust two BCD digits in AL and AH.

(Invalid in 64-bit mode.)

(None) D5 ib Adjust two BCD digits to the immediate byte base.

(Invalid in 64-bit mode.)

ID VIP VIF AC VM RF NT IOPL OF DF IF TF SF ZF AF PF CF

U MMUMU

2120191817161413–1211109876420

Note: Bits 31–22, 15, 5, 3, and 1 are reserved. A flag set to 1 or cleared to 0 is M (modified). Unaffected flags are blank.

Undefined flags are U.

Exception Real

Virtual

8086 Protected Cause of Exception

Invalid opcode, #UD X This instruction was executed in 64-bit mode.

Instruction Reference AAM 55

24594—Rev. 3.14—September 2007 AMD64 Technology

Converts the value in the AL register from binary to two unpacked BCD digits in the AH (most

significant) and AL (least significant) registers using the following formula:

AH = (AL/10d)

AL = (AL mod 10d)

In most modern assemblers, the AAM instruction adjusts to base-10 values. However, by coding the

instruction directly in binary, it can adjust to any base specified by the immediate byte value (ib)

suffixed onto the D4h opcode. For example, code D408h for octal, D40Ah for decimal, and D40Ch for

duodecimal (base 12).

Using this instruction in 64-bit mode generates an invalid-opcode exception.

Related Instructions

AAA, AAD, AAS

rFLAGS Affected

Exceptions

AAM ASCII Adjust After Multiply

Mnemonic Opcode Description

AAM D4 0A Create a pair of unpacked BCD values in AH and AL.

(Invalid in 64-bit mode.)

(None) D4 ib

Create a pair of unpacked values to the immediate byte

base.

(Invalid in 64-bit mode.)

ID VIP VIF AC VM RF NT IOPL OF DF IF TF SF ZF AF PF CF

U MMUMU

2120191817161413–1211109876420

Note: Bits 31–22, 15, 5, 3, and 1 are reserved. A flag set to 1 or cleared to 0 is M. Unaffected flags are blank. Undefined

flags are U.

Exception Real

Virtual

8086 Protected Cause of Exception

Divide by zero, #DE X X X 8-bit immediate value was 0.

Invalid opcode, #UD X This instruction was executed in 64-bit mode.

56 AAS Instruction Reference

AMD64 Technology 24594—Rev. 3.14—September 2007

Adjusts the value in the AL register to an unpacked BCD value. Use the AAS instruction after using

the SUB instruction to subtract two unpacked BCD numbers.

If the value in AL is greater than 9 or the AF flag is set to 1, the instruction decrements the value in AH,

subtracts 6 from the AL register, and sets the CF and AF flags to 1. Otherwise, it clears the CF and AF

flags and the AH register is unchanged. In either case, the instruction clears bits 7–4 of the AL register,

leaving the correct decimal digit in bits 3–0.

Using this instruction in 64-bit mode generates an invalid-opcode exception.

Related Instructions

AAA, AAD, AAM

rFLAGS Affected

Exceptions

AAS ASCII Adjust After Subtraction

Mnemonic Opcode Description

AAS 3F

Create an unpacked BCD number from the contents of

the AL register.

(Invalid in 64-bit mode.)

ID VIP VIF AC VM RF NT IOPL OF DF IF TF SF ZF AF PF CF

U UUMUM

2120191817161413–1211109876420

Note: Bits 31–22, 15, 5, 3, and 1 are reserved. A flag set to 1 or cleared to 0 is M (modified). Unaffected flags are

blank. Undefined flags are U.

Exception Real

Virtual

8086 Protected Cause of Exception

Invalid opcode, #UD X This instruction was executed in 64-bit mode.

Instruction Reference ADC 57

24594—Rev. 3.14—September 2007 AMD64 Technology

Adds the carry flag (CF), the value in a register or memory location (first operand), and an immediate

value or the value in a register or memory location (second operand), and stores the result in the first

operand location. The instruction cannot add two memory operands. The CF flag indicates a pending

carry from a previous addition operation. The instruction sign-extends an immediate value to the

length of the destination register or memory location.

This instruction evaluates the result for both signed and unsigned data types and sets the OF and CF

flags to indicate a carry in a signed or unsigned result, respectively. It sets the SF flag to indicate the

sign of a signed result.

Use the ADC instruction after an ADD instruction as part of a multibyte or multiword addition.

The forms of the ADC instruction that write to memory support the LOCK prefix. For details about the

LOCK prefix, see “Lock Prefix” on page 8.

ADC Add with Carry

Mnemonic Opcode Description

ADC AL, imm8 14 ib Add imm8 to AL + CF.

ADC AX, imm16 15 iw Add imm16 to AX + CF.

ADC EAX, imm32 15 id Add imm32 to EAX + CF.

ADC RAX, imm32 15 id Add sign-extended imm32 to RAX + CF.

ADC reg/mem8, imm8 80 /2 ib Add imm8 to reg/mem8 + CF.

ADC reg/mem16, imm16 81 /2 iw Add imm16 to reg/mem16 + CF.

ADC reg/mem32, imm32 81 /2 id Add imm32 to reg/mem32 + CF.

ADC reg/mem64, imm32 81 /2 id Add sign-extended imm32 to reg/mem64 + CF.

ADC reg/mem16, imm8 83 /2 ib Add sign-extended imm8 to reg/mem16 + CF.

ADC reg/mem32, imm8 83 /2 ib Add sign-extended imm8 to reg/mem32 + CF.

ADC reg/mem64, imm8 83 /2 ib Add sign-extended imm8 to reg/mem64 + CF.

ADC reg/mem8, reg8 10 /r Add reg8 to reg/mem8 + CF

ADC reg/mem16, reg16 11 /r Add reg16 to reg/mem16 + CF.

ADC reg/mem32, reg32 11 /r Add reg32 to reg/mem32 + CF.

ADC reg/mem64, reg64 11 /r Add reg64 to reg/mem64 + CF.

ADC reg8, reg/mem8 12 /r Add reg/mem8 to reg8 + CF.

ADC reg16, reg/mem16 13 /r Add reg/mem16 to reg16 + CF.

ADC reg32, reg/mem32 13 /r Add reg/mem32 to reg32 + CF.

ADC reg64, reg/mem64 13 /r Add reg/mem64 to reg64 + CF.

58 ADC Instruction Reference

AMD64 Technology 24594—Rev. 3.14—September 2007

Related Instructions

ADD, SBB, SUB

rFLAGS Affected

Exceptions

ID VIP VIF AC VM RF NT IOPL OF DF IF TF SF ZF AF PF CF

M MMMMM

2120191817161413–1211109876420

Note: Bits 31–22, 15, 5, 3, and 1 are reserved. A flag set to 1 or cleared to 0 is M (modified). Unaffected flags are blank.

Undefined flags are U.

Exception Real

Virtual

8086 Protected Cause of Exception

Stack, #SS X X X A memory address exceeded the stack segment limit or was

non-canonical.

General protection,

#GP

XX XA memory address exceeded a data segment limit or was non-

canonical.

X The destination operand was in a non-writable segment.

X A null data segment was used to reference memory.

Page fault, #PF X X A page fault resulted from the execution of the instruction.

Alignment check,

#AC XX

An unaligned memory reference was performed while

alignment checking was enabled.

Instruction Reference ADD 59

24594—Rev. 3.14—September 2007 AMD64 Technology

Adds the value in a register or memory location (first operand) and an immediate value or the value in

a register or memory location (second operand), and stores the result in the first operand location. The

instruction cannot add two memory operands. The instruction sign-extends an immediate value to the

length of the destination register or memory operand.

This instruction evaluates the result for both signed and unsigned data types and sets the OF and CF

flags to indicate a carry in a signed or unsigned result, respectively. It sets the SF flag to indicate the

sign of a signed result.

The forms of the ADD instruction that write to memory support the LOCK prefix. For details about the

LOCK prefix, see “Lock Prefix” on page 8.

Related Instructions

ADC, SBB, SUB

ADD Signed or Unsigned Add

Mnemonic Opcode Description

ADD AL, imm8 04 ib Add imm8 to AL.

ADD AX, imm16 05 iw Add imm16 to AX.

ADD EAX, imm32 05 id Add imm32 to EAX.

ADD RAX, imm32 05 id Add sign-extended imm32 to RAX.

ADD reg/mem8, imm8 80 /0 ib Add imm8 to reg/mem8.

ADD reg/mem16, imm16 81 /0 iw Add imm16 to reg/mem16

ADD reg/mem32, imm32 81 /0 id Add imm32 to reg/mem32.

ADD reg/mem64, imm32 81 /0 id Add sign-extended imm32 to reg/mem64.

ADD reg/mem16, imm8 83 /0 ib Add sign-extended imm8 to reg/mem16

ADD reg/mem32, imm8 83 /0 ib Add sign-extended imm8 to reg/mem32.

ADD reg/mem64, imm8 83 /0 ib Add sign-extended imm8 to reg/mem64.

ADD reg/mem8, reg8 00 /r Add reg8 to reg/mem8.

ADD reg/mem16, reg16 01 /r Add reg16 to reg/mem16.

ADD reg/mem32, reg32 01 /r Add reg32 to reg/mem32.

ADD reg/mem64, reg64 01 /r Add reg64 to reg/mem64.

ADD reg8, reg/mem8 02 /r Add reg/mem8 to reg8.

ADD reg16, reg/mem16 03 /r Add reg/mem16 to reg16.

ADD reg32, reg/mem32 03 /r Add reg/mem32 to reg32.

ADD reg64, reg/mem64 03 /r Add reg/mem64 to reg64.

60 ADD Instruction Reference

AMD64 Technology 24594—Rev. 3.14—September 2007

rFLAGS Affected

Exceptions

ID VIP VIF AC VM RF NT IOPL OF DF IF TF SF ZF AF PF CF

M MMMMM

2120191817161413–1211109876420

Note: Bits 31–22, 15, 5, 3, and 1 are reserved. A flag set to 1 or cleared to 0 is M (modified). Unaffected flags are

blank. Undefined flags are U.

Exception Real

Virtual

8086 Protected Cause of Exception

Stack, #SS X X X A memory address exceeded the stack segment limit or was

non-canonical.

General protection,

#GP

XX XA memory address exceeded a data segment limit or was non-

canonical.

X The destination operand was in a non-writable segment.

X A null data segment was used to reference memory.

Page fault, #PF X X A page fault resulted from the execution of the instruction.

Alignment check,

#AC XX

An unaligned memory reference was performed while

alignment checking was enabled.

Instruction Reference AND 61

24594—Rev. 3.14—September 2007 AMD64 Technology

Performs a bitwise AND operation on the value in a register or memory location (first operand) and an

immediate value or the value in a register or memory location (second operand), and stores the result in

the first operand location. The instruction cannot AND two memory operands.

The instruction sets each bit of the result to 1 if the corresponding bit of both operands is set;

otherwise, it clears the bit to 0. The following table shows the truth table for the AND operation:

The forms of the AND instruction that write to memory support the LOCK prefix. For details about the

LOCK prefix, see “Lock Prefix” on page 8.

AND Logical AND

X Y X AND Y

000

010

100

111

Mnemonic Opcode Description

AND AL, imm8 24 ib AND the contents of AL with an immediate 8-bit value

and store the result in AL.

AND AX, imm16 25 iw AND the contents of AX with an immediate 16-bit value

and store the result in AX.

AND EAX, imm32 25 id AND the contents of EAX with an immediate 32-bit

value and store the result in EAX.

AND RAX, imm32 25 id AND the contents of RAX with a sign-extended

immediate 32-bit value and store the result in RAX.

AND reg/mem8, imm8 80 /4 ib AND the contents of reg/mem8 with imm8.

AND reg/mem16, imm16 81 /4 iw AND the contents of reg/mem16 with imm16.

AND reg/mem32, imm32 81 /4 id AND the contents of reg/mem32 with imm32.

AND reg/mem64, imm32 81 /4 id AND the contents of reg/mem64 with sign-extended

imm32.

AND reg/mem16, imm8 83 /4 ib AND the contents of reg/mem16 with a sign-extended

8-bit value.

AND reg/mem32, imm8 83 /4 ib AND the contents of reg/mem32 with a sign-extended

8-bit value.

AND reg/mem64, imm8 83 /4 ib AND the contents of reg/mem64 with a sign-extended

8-bit value.

AND reg/mem8, reg8 20 /r AND the contents of an 8-bit register or memory

location with the contents of an 8-bit register.

62 AND Instruction Reference

AMD64 Technology 24594—Rev. 3.14—September 2007

Related Instructions

TEST, OR, NOT, NEG, XOR

rFLAGS Affected

Exceptions

AND reg/mem16, reg16 21 /r AND the contents of a 16-bit register or memory

location with the contents of a 16-bit register.

AND reg/mem32, reg32 21 /r AND the contents of a 32-bit register or memory

location with the contents of a 32-bit register.

AND reg/mem64, reg64 21 /r AND the contents of a 64-bit register or memory

location with the contents of a 64-bit register.

AND reg8, reg/mem8 22 /r AND the contents of an 8-bit register with the contents

of an 8-bit memory location or register.

AND reg16, reg/mem16 23 /r AND the contents of a 16-bit register with the contents

of a 16-bit memory location or register.

AND reg32, reg/mem32 23 /r AND the contents of a 32-bit register with the contents

of a 32-bit memory location or register.

AND reg64, reg/mem64 23 /r AND the contents of a 64-bit register with the contents

of a 64-bit memory location or register.

ID VIP VIF AC VM RF NT IOPL OF DF IF TF SF ZF AF PF CF

0MMUM0

2120191817161413–1211109876420

Note: Bits 31–22, 15, 5, 3, and 1 are reserved. A flag set to 1 or cleared to 0 is M (modified). Unaffected flags are

blank. Undefined flags are U.

Exception Real

Virtual

8086 Protected Cause of Exception

Stack, #SS X X X A memory address exceeded the stack segment limit or was

non-canonical.

General protection,

#GP

XX XA memory address exceeded a data segment limit or was non-

canonical.

X The destination operand was in a non-writable segment.

X A null data segment was used to reference memory.

Page fault, #PF X X A page fault resulted from the execution of the instruction.

Alignment check,

#AC XX

An unaligned memory reference was performed while

alignment checking was enabled.

Mnemonic Opcode Description

Instruction Reference BOUND 63

24594—Rev. 3.14—September 2007 AMD64 Technology

Checks whether an array index (first operand) is within the bounds of an array (second operand). The

array index is a signed integer in the specified register. If the operand-size attribute is 16, the array

operand is a memory location containing a pair of signed word-integers; if the operand-size attribute is

32, the array operand is a pair of signed doubleword-integers. The first word or doubleword specifies

the lower bound of the array and the second word or doubleword specifies the upper bound.

The array index must be greater than or equal to the lower bound and less than or equal to the upper

bound. If the index is not within the specified bounds, the processor generates a BOUND range-

exceeded exception (#BR).

The bounds of an array, consisting of two words or doublewords containing the lower and upper limits

of the array, usually reside in a data structure just before the array itself, making the limits addressable

through a constant offset from the beginning of the array. With the address of the array in a register,

this practice reduces the number of bus cycles required to determine the effective address of the array

bounds.

Using this instruction in 64-bit mode generates an invalid-opcode exception.

Related Instructions

INT, INT3, INTO

rFLAGS Affected

None

Exceptions

BOUND Check Array Bound

Mnemonic Opcode Description

BOUND reg16, mem16&mem16 62 /r

Test whether a 16-bit array index is within the bounds

specified by the two 16-bit values in mem16&mem16.

(Invalid in 64-bit mode.)

BOUND reg32, mem32&mem32 62 /r

Test whether a 32-bit array index is within the bounds

specified by the two 32-bit values in mem32&mem32.

(Invalid in 64-bit mode.)

Exception Real

Virtual

8086 Protected Cause of Exception

Bound range, #BR X X X The bound range was exceeded.

Invalid opcode, #UD X X X The source operand was a register.

X Instruction was executed in 64-bit mode.

Stack, #SS X X X A memory address exceeded the stack segment limit

General protection,

#GP

X X X A memory address exceeded a data segment limit.

X A null data segment was used to reference memory.

64 BOUND Instruction Reference

AMD64 Technology 24594—Rev. 3.14—September 2007

Page fault, #PF X X A page fault resulted from the execution of the instruction.

Alignment check,

#AC XX

An unaligned memory reference was performed while

alignment checking was enabled.

Exception Real

Virtual

8086 Protected Cause of Exception

Instruction Reference BSF 65

24594—Rev. 3.14—September 2007 AMD64 Technology

Searches the value in a register or a memory location (second operand) for the least-significant set bit.

If a set bit is found, the instruction clears the zero flag (ZF) and stores the index of the least-significant

set bit in a destination register (first operand). If the second operand contains 0, the instruction sets ZF

to 1 and does not change the contents of the destination register. The bit index is an unsigned offset

from bit 0 of the searched value.

Related Instructions

BSR

rFLAGS Affected

Exceptions

BSF Bit Scan Forward

Mnemonic Opcode Description

BSF reg16, reg/mem16 0F BC /r Bit scan forward on the contents of reg/mem16.

BSF reg32, reg/mem32 0F BC /r Bit scan forward on the contents of reg/mem32.

BSF reg64, reg/mem64 0F BC /r Bit scan forward on the contents of reg/mem64

ID VIP VIF AC VM RF NT IOPL OF DF IF TF SF ZF AF PF CF

U UMUUU

2120191817161413–1211109876420

Note: Bits 31–22, 15, 5, 3, and 1 are reserved. A flag set to 1 or cleared to 0 is M (modified). Unaffected flags are

blank. Undefined flags are U.

Exception Real

Virtual

8086 Protected Cause of Exception

Stack, #SS X X X A memory address exceeded the stack segment limit or was

non-canonical.

General protection,

#GP

XX XA memory address exceeded a data segment limit or was non-

canonical.

X A null data segment was used to reference memory.

Page fault, #PF X X A page fault resulted from the execution of the instruction.

Alignment check,

#AC XX

An unaligned memory reference was performed while

alignment checking was enabled.

66 BSR Instruction Reference

AMD64 Technology 24594—Rev. 3.14—September 2007

Searches the value in a register or a memory location (second operand) for the most-significant set bit.

If a set bit is found, the instruction clears the zero flag (ZF) and stores the index of the most-significant

set bit in a destination register (first operand). If the second operand contains 0, the instruction sets ZF

to 1 and does not change the contents of the destination register. The bit index is an unsigned offset

from bit 0 of the searched value.

Related Instructions

BSF

rFLAGS Affected

Exceptions

BSR Bit Scan Reverse

Mnemonic Opcode Description

BSR reg16, reg/mem16 0F BD /r Bit scan reverse on the contents of reg/mem16.

BSR reg32, reg/mem32 0F BD /r Bit scan reverse on the contents of reg/mem32.

BSR reg64, reg/mem64 0F BD /r Bit scan reverse on the contents of reg/mem64.

ID VIP VIF AC VM RF NT IOPL OF DF IF TF SF ZF AF PF CF

U UMUUU

2120191817161413–1211109876420

Note: Bits 31–22, 15, 5, 3, and 1 are reserved. A flag set to 1 or cleared to 0 is M (modified). Unaffected flags are

blank. Undefined flags are U.

Exception Real

Virtual

8086 Protected Cause of Exception

Stack, #SS X X X A memory address exceeded the stack segment limit or was

non-canonical.

General protection,

#GP

XX XA memory address exceeded the data segment limit or was

non-canonical.

X A null data segment was used to reference memory.

Page fault, #PF X X A page fault resulted from the execution of the instruction.

Alignment check,

#AC XX

An unaligned memory reference was performed while

alignment checking was enabled.

Instruction Reference BSWAP 67

24594—Rev. 3.14—September 2007 AMD64 Technology

Reverses the byte order of the specified register. This action converts the contents of the register from

little endian to big endian or vice versa. In a doubleword, bits 7–0 are exchanged with bits 31–24, and

bits 15–8 are exchanged with bits 23–16. In a quadword, bits 7–0 are exchanged with bits 63–56, bits

15–8 with bits 55–48, bits 23–16 with bits 47–40, and bits 31–24 with bits 39–32. A subsequent use of

the BSWAP instruction with the same operand restores the original value of the operand.

The result of applying the BSWAP instruction to a 16-bit register is undefined. To swap the bytes of a

16-bit register, use the XCHG instruction and specify the respective byte halves of the 16-bit register

as the two operands. For example, to swap the bytes of AX, use XCHG AL, AH.

Related Instructions

XCHG

rFLAGS Affected

None

Exceptions

None

BSWAP Byte Swap

Mnemonic Opcode Description

BSWAP reg32 0F C8 +rd Reverse the byte order of reg32.

BSWAP reg64 0F C8 +rq Reverse the byte order of reg64.

68 BT Instruction Reference

AMD64 Technology 24594—Rev. 3.14—September 2007

Copies a bit, specified by a bit index in a register or 8-bit immediate value (second operand), from a bit

string (first operand), also called the bit base, to the carry flag (CF) of the rFLAGS register.

If the bit base operand is a register, the instruction uses the modulo 16, 32, or 64 (depending on the

operand size) of the bit index to select a bit in the register.

If the bit base operand is a memory location, bit 0 of the byte at the specified address is the bit base of

the bit string. If the bit index is in a register, the instruction selects a bit position relative to the bit base

in the range –263 to +263 – 1 if the operand size is 64, –231 to +231 – 1, if the operand size is 32, and

–215 to +215 – 1 if the operand size is 16. If the bit index is in an immediate value, the bit selected is

that value modulo 16, 32, or 64, depending on operand size.

When the instruction attempts to copy a bit from memory, it accesses 2, 4, or 8 bytes starting from the

specified memory address for 16-bit, 32-bit, or 64-bit operand sizes, respectively, using the following

formula:

Effective Address + (NumBytesi * (BitOffset DIV NumBitsi*8))

When using this bit addressing mechanism, avoid referencing areas of memory close to address space

holes, such as references to memory-mapped I/O registers. Instead, use a MOV instruction to load a

Related Instructions

BTC, BTR, BTS

BT Bit Test

Mnemonic Opcode Description

BT reg/mem16, reg16 0F A3 /r Copy the value of the selected bit to the carry flag.

BT reg/mem32, reg32 0F A3 /r Copy the value of the selected bit to the carry flag.

BT reg/mem64, reg64 0F A3 /r Copy the value of the selected bit to the carry flag.

BT reg/mem16, imm8 0F BA /4 ib Copy the value of the selected bit to the carry flag.

BT reg/mem32, imm8 0F BA /4 ib Copy the value of the selected bit to the carry flag.

BT reg/mem64, imm8 0F BA /4 ib Copy the value of the selected bit to the carry flag.

Instruction Reference BT 69

24594—Rev. 3.14—September 2007 AMD64 Technology

rFLAGS Affected

Exceptions

ID VIP VIF AC VM RF NT IOPL OF DF IF TF SF ZF AF PF CF

U UUUUM

2120191817161413–1211109876420

Note: Bits 31–22, 15, 5, 3, and 1 are reserved. A flag set to 1 or cleared to 0 is M (modified). Unaffected flags are

blank. Undefined flags are U.

Exception Real

Virtual

8086 Protected Cause of Exception

Stack, #SS X X X A memory address exceeded the stack segment limit or was

non-canonical.

General protection,

#GP

XX XA memory address exceeded a data segment limit or was non-

canonical.

X A null data segment was used to reference memory.

Page fault, #PF X X A page fault resulted from the execution of the instruction.

Alignment check,

#AC XX

An unaligned memory reference was performed while

alignment checking was enabled.

70 BTC Instruction Reference

AMD64 Technology 24594—Rev. 3.14—September 2007

Copies a bit, specified by a bit index in a register or 8-bit immediate value (second operand), from a bit

string (first operand), also called the bit base, to the carry flag (CF) of the rFLAGS register, and then

complements (toggles) the bit in the bit string.

If the bit base operand is a register, the instruction uses the modulo 16, 32, or 64 (depending on the

operand size) of the bit index to select a bit in the register.

If the bit base operand is a memory location, bit 0 of the byte at the specified address is the bit base of

the bit string. If the bit index is in a register, the instruction selects a bit position relative to the bit base

in the range –263 to +263 – 1 if the operand size is 64, –231 to +231 – 1, if the operand size is 32, and

–215 to +215 – 1 if the operand size is 16. If the bit index is in an immediate value, the bit selected is

that value modulo 16, 32, or 64, depending the operand size.

This instruction is useful for implementing semaphores in concurrent operating systems. Such an

application should precede this instruction with the LOCK prefix. For details about the LOCK prefix,

see “Lock Prefix” on page 8.

Related Instructions

BT, BTR, BTS

BTC Bit Test and Complement

Mnemonic Opcode Description

BTC reg/mem16, reg16 0F BB /r Copy the value of the selected bit to the carry flag, then

complement the selected bit.

BTC reg/mem32, reg32 0F BB /r Copy the value of the selected bit to the carry flag, then

complement the selected bit.

BTC reg/mem64, reg64 0F BB /r Copy the value of the selected bit to the carry flag, then

complement the selected bit.

BTC reg/mem16, imm8 0F BA /7 ib Copy the value of the selected bit to the carry flag, then

complement the selected bit.

BTC reg/mem32, imm8 0F BA /7 ib Copy the value of the selected bit to the carry flag, then

complement the selected bit.

BTC reg/mem64, imm8 0F BA /7 ib Copy the value of the selected bit to the carry flag, then

complement the selected bit.

Instruction Reference BTC 71

24594—Rev. 3.14—September 2007 AMD64 Technology

rFLAGS Affected

Exceptions

ID VIP VIF AC VM RF NT IOPL OF DF IF TF SF ZF AF PF CF

U UUUUM

2120191817161413–1211109876420

Note: Bits 31–22, 15, 5, 3, and 1 are reserved. A flag set to 1 or cleared to 0 is M (modified). Unaffected flags are

blank. Undefined flags are U.

Exception Real

Virtual

8086 Protected Cause of Exception

Stack, #SS X X X A memory address exceeded the stack segment limit or was

non-canonical.

General protection,

#GP

XX XA memory address exceeded a data segment limit or was non-

canonical.

X The destination operand was in a non-writable segment.

X A null data segment was used to reference memory.

Page fault, #PF X X A page fault resulted from the execution of the instruction.

Alignment check,

#AC XX

An unaligned memory reference was performed while

alignment checking was enabled.

72 BTR Instruction Reference

AMD64 Technology 24594—Rev. 3.14—September 2007

Copies a bit, specified by a bit index in a register or 8-bit immediate value (second operand), from a bit

string (first operand), also called the bit base, to the carry flag (CF) of the rFLAGS register, and then

clears the bit in the bit string to 0.

If the bit base operand is a register, the instruction uses the modulo 16, 32, or 64 (depending on the

operand size) of the bit index to select a bit in the register.

If the bit base operand is a memory location, bit 0 of the byte at the specified address is the bit base of

the bit string. If the bit index is in a register, the instruction selects a bit position relative to the bit base

in the range –263 to +263 – 1 if the operand size is 64, –231 to +231 – 1, if the operand size is 32, and

–215 to +215 – 1 if the operand size is 16. If the bit index is in an immediate value, the bit selected is

that value modulo 16, 32, or 64, depending on the operand size.

This instruction is useful for implementing semaphores in concurrent operating systems. Such

applications should precede this instruction with the LOCK prefix. For details about the LOCK prefix,

see “Lock Prefix” on page 8.

Related Instructions

BT, BTC, BTS

BTR Bit Test and Reset

Mnemonic Opcode Description

BTR reg/mem16, reg16 0F B3 /r Copy the value of the selected bit to the carry flag, then

clear the selected bit.

BTR reg/mem32, reg32 0F B3 /r Copy the value of the selected bit to the carry flag, then

clear the selected bit.

BTR reg/mem64, reg64 0F B3 /r Copy the value of the selected bit to the carry flag, then

clear the selected bit.

BTR reg/mem16, imm8 0F BA /6 ib Copy the value of the selected bit to the carry flag, then

clear the selected bit.

BTR reg/mem32, imm8 0F BA /6 ib Copy the value of the selected bit to the carry flag, then

clear the selected bit.

BTR reg/mem64, imm8 0F BA /6 ib Copy the value of the selected bit to the carry flag, then

clear the selected bit.

Instruction Reference BTR 73

24594—Rev. 3.14—September 2007 AMD64 Technology

rFLAGS Affected

Exceptions

ID VIP VIF AC VM RF NT IOPL OF DF IF TF SF ZF AF PF CF

U UUUUM

2120191817161413–1211109876420

Note: Bits 31–22, 15, 5, 3, and 1 are reserved. A flag set to 1 or cleared to 0 is M (modified). Unaffected flags are

blank. Undefined flags are U.

Exception Real

Virtual

8086 Protected Cause of Exception

Stack, #SS X X X A memory address exceeded the stack segment limit or was

non-canonical.

General protection,

#GP

XX XA memory address exceeded a data segment limit or was non-

canonical.

X The destination operand was in a non-writable segment.

X A null data segment was used to reference memory.

Page fault, #PF X X A page fault resulted from the execution of the instruction.

Alignment check,

#AC XX

An unaligned memory reference was performed while

alignment checking was enabled.

74 BTS Instruction Reference

AMD64 Technology 24594—Rev. 3.14—September 2007

Copies a bit, specified by bit index in a register or 8-bit immediate value (second operand), from a bit

string (first operand), also called the bit base, to the carry flag (CF) of the rFLAGS register, and then

sets the bit in the bit string to 1.