AgensGraph Developer Manual En

User Manual:

Open the PDF directly: View PDF .
Page Count: 342 [warning: Documents this large are best viewed by clicking the View PDF Link!]

AgensGraph Developer Manual

Restricted Rights Legend

Permission to use, copy, modify, and distribute this software and its documentation for any purpose, without fee,

and without a written agreement is hereby granted, provided that the above copyright notice and this paragraph

and the following two paragraphs appear in all copies.

IN NO EVENT SHALL THE UNIVERSITY OF CALIFORNIA BE LIABLE TO ANY PARTY FOR DIRECT,

INDIRECT, SPECIAL, INCIDENTAL, OR CONSEQUENTIAL DAMAGES, INCLUDING LOST PROFITS,

ARISING OUT OF THE USE OF THIS SOFTWARE AND ITS DOCUMENTATION, EVEN IF THE

UNIVERSITY OF CALIFORNIA HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

THE UNIVERSITY OF CALIFORNIA SPECIFICALLY DISCLAIMS ANY WARRANTIES, INCLUDING,

BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A

PARTICULAR PURPOSE. THE SOFTWARE PROVIDED HEREUNDER IS ON AN ”AS-IS” BASIS, AND

THE UNIVERSITY OF CALIFORNIA HAS NO OBLIGATIONS TO PROVIDE MAINTENANCE,

SUPPORT, UPDATES, ENHANCEMENTS, OR MODIFICATIONS.

Trademarks

AgensGraph® is a registered trademark of Bitnine Inc. Other products, titles or services may be registered

trademarks of their respective companies.

Open Source Software Notice

Some modules or les of this product are subject to the terms of the following licenses. : OpenSSL, RSA Data

Security, Inc., Apache Foundation, Jean-loup Gailly and Mark Adler, Paul Hsieh’s hash.

Information of technical documentation

Title : AgensGraph Developer Manual

Published date : July 16, 2018

S/W version : AgensGraph v1.3.1, based on PostgreSQL 9.6.2

Technical documentation version : v1.0

Contents

1 Introduction....................................... 8

1.1 AgensGraph Highlights . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

1.2 Graph Database Concepts . . . . . . . . . . . . . . . . . . . . . . . . . . . 9

2 GetStarted ....................................... 14

2.1 InstallAgensGraph............................... 14

2.2 Get started with Cypher . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16

3 DataType........................................ 18

3.1 NumericTypes................................. 19

3.2 CharacterTypes ................................ 22

3.3 Date/TimeTypes ............................... 24

3.4 BooleanType.................................. 33

3.5 GeometricTypes................................ 35

3.6 XMLType ................................... 37

3.7 JSONTypes .................................. 39

3.8 Arrays...................................... 47

3.9 RangeTypes .................................. 59

3.10 User-denedType ............................... 66

4 Functions ........................................ 69

4.1 Graph Database functions . . . . . . . . . . . . . . . . . . . . . . . . . . . 69

4.2 Relational Database functions . . . . . . . . . . . . . . . . . . . . . . . . . 81

4.3 User-denedfunction.............................. 232

5 CypherQueryLanguage................................ 234

5.1 Introduction .................................. 234

5.2 Syntax...................................... 238

5.3 HybridQuery.................................. 240

5.4 GeneralClauses................................. 242

5.5 ReadClauses .................................. 249

5.6 WriteClauses.................................. 252

6 Drivers.......................................... 259

6.1 Introduction .................................. 259

6.2 Usage of the Java Driver . . . . . . . . . . . . . . . . . . . . . . . . . . . . 259

7 Procedure........................................ 262

7.1 Procedurallanguage .............................. 262

7.2 PL/pgSQL - SQL Procedural Language . . . . . . . . . . . . . . . . . . . 264

7.3 PL/Python - Python Procedural Language . . . . . . . . . . . . . . . . . . 306

8 SupportedPlatform .................................. 325

8.1 Supported Platform Requirements . . . . . . . . . . . . . . . . . . . . . . 325

9 Agens FDW (Foreign Data Wrappers) . . . . . . . . . . . . . . . . . . . . . . . . 326

9.1 CDH&HiveFDW............................... 326

10 Appendix ........................................ 328

10.1 AgensGraph Error Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . 328

10.2 Terminology................................... 337

10.3 FAQ....................................... 339

About technical documentation

Objective of technical documentation

This technical article is described for application program developers who wish to develop

programs using various application libraries provided by AgensGraph® (hereinafter referred to

as ”AgensGraph”).

Prerequisites for technical documentation

In order to fully understand this article, you should be familiar with the following topics:

• Graph Database

• Relational Database

• Basic Programming

Limitations of technical documentation

This guide does not contain everything you need to apply or operate AgensGraph in practice.

Therefore, refer to each technical manual for operation and management such as installation

and environment setting.

About this document 5

Conventions of Technical Documentation

Mark Description

<AaBbCc123> The le name of the program source code, directory

[Button] Button or menu name in GUI

Bold Emphasis

“ ”(Quotes) Refer to other relevant guides or other chapters and sec-

tions within the guide

‘Input eld’ A description of the entries in the UI

Hyperlink Email account, Website

> Progress of menu

+— Has subdirectory or le

|— No subdirectories or les

Notes or cautions

Notes

[Figure 1.1] Figure name

[Table 1.1] Table name

Commands, output after execution, sample code

AaBbCc123

{} Required Argument Values

[] Optional Argument Value

| Selection Argument Value

6 AgensGraph Developer Manual

Locations

Korea

Bitnine Inc.

A1201 GangSeo Hangang Xi Tower 401,

Yangcheon-ro, Gangseo-gu, Seoul,

South Korea

Tel : +82-70-4800-3517

Fax : +82-70-8677-2552

Email : agens@bitnine.net

Web : bitnine.net

USA

Bitnine Global Inc.

3945 Freedom Cir., Suite 260,

Santa Clara, CA 95054

U.S.A

Tel : +1 (408) 352-5165

Email : agens@bitnine.net

Web : bitnine.net

About this document 7

1 Introduction

1.1 AgensGraph Highlights

AgensGraph, a database built on a graph data model that ensures ACID transactions, was implemented by utilizing

main features of PostgreSQL, an open source database.

Key features include:

• Multi-model database

–Supports Graph, Relational, and Document models

–Intuitive and lexible data modeling using Graph and JSON documents

• High-level query features

–Supports ANSI SQL and Cypher query

–Supports ACID transactions

–Possible to create a hybrid query statement that combines SQL and Cypher syntax when creating a query

–Able to create hierarchical graph labels

• High performance query statement processing

–Supports graph indexing for fast graph traversal

–Possible to generate vertex and edge index

–Supports a full-text search for JSON document processing

• Constraints

–Supports Unique, Mandatory, Check constraints

• High Availability

–Possible active-standby coniguration

• Visualization tools

–Visualizes the result data of graph queries

• Advanced security features

–Implements an authentication system using Kerberos and LDAP

–Encrypts via SSL/TLS protocols

8 AgensGraph Developer Manual

• Connectivity

–Provides JDBC and Hadoop drivers

1.2 Graph Database Concepts

This section introduces the graph data model.

1.2.1 AgensGraph Database

The graph database stores and manages objects of a real model in graph form.

A relation (edge) exists between objects (vertices), and a group of similar vertices can be expressed as a group (la-

bel).

As vertices and edges have data (properties), they can be called Property Graph Models as well.

Let's take a closer look at the components of this graph data model.

The following example shows components of a graph:

Vertices

Vertices are the most basic elements in the graph data model. They represent entities in the real world and have

properties.

A graph has vertices and edges as the base units. In AgensGraph, both vertices and edges may contain Properties.

While entities are usually represented by vertices, they may be indicated using edges in some cases. Unlike edges

and properties, vertices may have zero or multiple label values.

The simplest form of a graph consists of a single vertex. A vertex can have zero or more properties.

Introduction 9

The next step is to construct a graph with multiple vertices. Add two or more vertices to the graph of the previous

step and add one or more properties to the existing vertex.

Edges

Edges connect vertices. When two vertices are connected via en edge and each vertex plays as start vertex or end

vertex depending on the direction of the edge. Like vertices, edges have properties.

The edges between vertices play an important role in the graph database, especially when you search for linked data.

With edges, you may make vertices into a variety of data structures, such as lists, trees, maps, and composite entities.

By adding edges to the example we are building, we can represent more meaningful data.

In the example, ACTED_IN and DIRECTEDare used as edge types. The ACTED_IN property, Roles, stores the value of

array type.

The ACTED_IN edge has the Tom Hanks vertex as start vertex and the Forrest Gump vertex as end vertex. In other

words, we can say that the Tom Hanks vertex has an outgoing edge and the Forrest Gump vertex has an incoming

edge.

If there is an edge in a single direction, you do not have to duplicate the edge and add it in the opposite

direction; this is related to the graph traversal or performance.

Edges are always directional, but they may ignore directionality if it is not needed in your application. The diagram

below shows a vertex having an edge pointing to itself.

All edges are recommended to have an edge type to perform the graph traversal in a more eficient manner.

10 AgensGraph Developer Manual

Properties

Both vertices and edges may have properties. Properties are attribute values, and each attribute name should be

deined only as a string type.

The available data types for property values are:

• Numeric type

• String type

• Boolean type

• List type (a collection of various data types)

NULL values cannot be used as property values. If NULL is entered, the property itself is assumed to be

absent. NULL values, however, can be used in List.

Type Description Value range

boolean true/false

byte 8-bit integer -128 to 127, inclusive

short 16-bit integer -32768 to 32767, inclusive

int 32-bit integer -2147483648 to 2147483647, inclusive

long 64-bit integer -9223372036854775808 to 9223372036854775807,

inclusive

loat variable-precision, inexact 15 decimal digits precision

char 16-bit unsigned integers representing

Unicode characters

u0000 to uffff (0 to 65535)

String sequence of Unicod characters ininite

Labels

You may deine the roles or types of vertices or edges using labels. Vertices or edges with similar characteristics can

be grouped and the name of such a group can be deined, which is called a ``label.'' That is, all vertices or edges with

similar labels belong to the same group.

Database query statements can be performed only on the group (not the entire graph) using labels, which is helpful

for a more eficient querying.

Using labels for vertices is optional, and each vertex may have zero or only one label.

Labels can also be used to deine constraints on properties or to add indexes.

Introduction 11

You may also assign a label similar to a vertex to an edge. Unlike vertices, there is no edge without a label; all edges

should have at least one label.

Let us add Person and Movie labels to the existing example graph.

Label names

Label names can be expressed using letters and numbers, all converted to lowercase letters.

Labels stores a unique id of int type, which means that the database may contain up to 2ˆ16-1(65535) labels.

Traversal

Traversal is to traverse paths while exploring a graph to answer the requested query. Traversal is a process of search-

ing for the relevant vertices from start vertex to ind the answer to the requested query. In other words, it refers to

following the vertices that are traversing the graph and the derived edges according to a speciic rule.

In the examples illustrated so far, we try to ind a movie featured by Tom Hanks. Starting with the Tom Hanks vertex,

you can traverse all the processes that end at the vertex of Forrest Gump along the ACTED_IN edge associated with it.

By using the traversal of cypher query statements and additional techniques in the graph database, you may derive

better result data. For more information, see Cypher Query Language.

12 AgensGraph Developer Manual

Paths

Paths are the result data of a query statement or traversal, which shows one or more vertices and the edges con-

nected to them.

The path (traversal result data) from the previous example is as follows:

The length of the above path is 1. The shortest path length is 0, which is the case when a single vertex does not have

edges.

If the vertex has an edge pointing to itself, the length of path is 1.

Introduction 13

2 Get Started

2.1 Install AgensGraph

2.1.1 Pre-Installation on Linux

Get the pre-compiled binary

AgensGraph works on Linux/Windows and can be installed in two ways. One is to download its binary package, and

the other is to download its source code and compile the package. The binary package can be downloaded from the

Bitnines's website, and the source code can be downloaded from github. If you want to know your system environ-

ment, enter the following command in the command window.

uname -sm

Extract the package

Unzip the downloaded ile in the desired location. (e.g.: /usr/local/AgensGraph/)

tar xvf /path/to/your/use

2.1.2 Post-Installation Setup and Conguration

Setting environment variables (Optional)

Add the following three lines to your shell startup ile (e.g. .bash_proile).

export LD_LIBRARY_PATH=/usr/local/AgensGraph/lib:$LD_LIBRARY_PATH

export PATH=/usr/local/AgensGraph/bin:$PATH

export AGDATA=/path/to/make/db_cluster

Creating a database cluster

Create a database cluster using the following command: If you do not specify the -D option, use the AGDATA speciied

in .bash_profile.

initdb [-D /path/to/make/db_cluster]

14 AgensGraph Developer Manual

Starting the server

You can start AgensGraph with the following command.

ag_ctl start [-D /path/created/by/initdb]

Creating a database

createdb [dbname]

If you do not specify the database name and user name, the default value will apply. The default is the name of the

current user.

Running a terminal

agens [dbname]

When you run a terminal like above, you can see the following screen:

username=#

If it is a super user, ``= #'' will be displayed at the prompt; ``=>'' will be displayed for other users.

username=# CREATE

username-# (

username(# (

username(# )

username-#

In case you are in the middle of entering a query, -is displayed. If you are typing content to be included in parenthe-

ses, (is displayed. Even multi-parentheses can be expressed as needed. You may customize the input prompt, and

the detailed options can be found in the following link.

2.1.3 Setting server parameters

AgensGraph enables you to conigure the server to improve performance. Setting the server parameters in consid-

eration of the size of data and resources of the server (memory, CPU, disk size, and speed) is critical for improve-

ment of performance. The following server variables have a signiicant effect on AgensGraph's graph query perfor-

mance. You may change server parameters by changing $AGDATA/postgresql.conf. If you modify $AGDATA/post-

gresql.conf, you need to restart the server.

Get Started 15

•shared_buffers: Memory size for data object caching. This variable must conform to the product environ-

ment. It is optimal when this variable is as large as the size of data. shared_buffers should be set carefully con-

sidering the amount of memory allocated for concurrent sessions and each query. The recommended value is

half the physical memory.

•work_mem: Increases in size depending on the physical memory and the attributes of the query being executed.

• random_page_cost: A parameter for query optimization. This parameter should be lowered to 1 or 0.005 for

graph queries (when the graph data is completely cached in memory)

2.2 Get started with Cypher

This section introduces Cypher and the followings:

• Basic understanding on graphs and patterns

• Simple troubleshooting

• Cypher syntax writing

2.2.1 About Cypher

AgensGraph supports Cypher, a query language, for retrieving and processing graph data. Cypher is a declarative

language similar to SQL.

2.2.2 Pattern

AgensGraph's graph consists of vertices and edges. Vertices and edges can have many properties. The actual data

consists of simple graphs in patterns. AgensGraph searches and processes the patterns of graphs through cypher.

Creating Graphs

AgensGraph may store multiple graphs in a single database. Cypher cannot igure out multiple graphs. AgensGraph

supports variables for generating and managing graphs using DDL and Cypher. The following syntax generates a

graph called network and sets the current graph.

CREATE GRAPH network;

SET graph_path = network;

In this example, the graph_path variable is set to network. However, if graph_path is not set before creating a graph,

it will be set automatically after a graph is generated.

16 AgensGraph Developer Manual

Creating Users

CREATE ROLE user1 LOGIN IN ROLE graph_owner;

DROP ROLE user1;

You can create new users to manage ownership and other privileges on database objects. If a new user wants to gen-

erate a label in the graph, the new user must be in the same group as the user who created the graph. See this link

for more options regarding creating users.

Creating Labels

You should generate a label before generating graph data in principle. However, for your convenience, a label will be

automatically created if it is speciied when Cypher's CREATE is executed. In AgensGraph, you should have one label

for a vertex and an edge. The following example creates a vertex label named person and an edge label knows.

CREATE VLABEL person;

CREATE ELABEL knows;

CREATE (n:movie {title:'Matrix'});

Creating Vertices and Edges

This section uses CREATE of Cypher to create the person vertex and knows edge. The CREATE clause creates a pattern

consisting of vertices and edges. (variable:label {property: value, ...}) is a vertex type, and

-[variable:label {property: value, ...}]- is an edge type. The direction of the edge can be represented by <

or >. Variables may or may not exist in the forms of vertices and edges.

Note : AgensGraph does not support -- in edge patterns. -- means a comment at the end of a sentence.

In the following example, patterns such as ``Tom knows Summer,'' ``Pat knows Nikki'' and ``Olive knows Todd'' are

created.

CREATE (:person {name: 'Tom'})-[:knows]->(:person {name: 'Summer'});

CREATE (:person {name: 'Pat'})-[:knows]->(:person {name: 'Nikki'});

CREATE (:person {name: 'Olive'})-[:knows]->(:person {name: 'Todd'});

AgensGraph uses the jsonb type for vertex/edge attributes. Attributes are expressed using JSON objects.

Get Started 17

3 Data Type

AgensGraph provides diverse data types. You can add new types as well using CREATE TYPE command. Refer to the

User-deined Type clause for more information on CREATE TYPE.

The following table lists the generic data types provided by default, and some of the types that are used internally or

not used may not be included. (``Alias'' is an internally-used name.)

Name Alias Description

bigint int8 Signed 8-byte integer

bigserial Auto-incrementing 8-byte integer

bit [ (n) ] 16-bit integer Fixed length bit string

bit varying [ (n) ] varbit Variable-length bit string

boolean bool Logical Boolean (true / false)

box Square box on a plane

bytea Binary data (``byte array'')

character [ (n) ] char [ (n) ] Fixed-length character string

character varying [ (n) ] varchar [ (n) ] Variable-length character string

cidr IPv4 or IPv6 network address

circle Circle on a plane

date Calendar date (year, month, day)

double precision loat8 Double-precision loating-point number (8

bytes)

inet IPv4 or IPv6 host address

integer int, int4 Signed 4-byte integer

interval [ ields ] [ (p) ] Time range

json Text JSON data

jsonb Binary JSON data, disjointed

line Ininite straight line on a plane

lseg Segment on a plane

macaddr Media Access Control (MAC) address

money Trafic volume

numeric [ (p, s) ] decimal [ (p, s) ] The exact number of selectable digits

path Geometric path in the plane

pg_lsn AgensGraph log sequence number

point Geometric points on a plane

18 AgensGraph Developer Manual

Name Alias Description

polygon Geometrically-closed path in plane

real loat4 Single-precision loating-point number (4

bytes)

smallint int2 Signed two-byte integer

smallserial serial2 Auto-incrementing 2-byte integer

serial serial4 Auto-incrementing 4-byte integer

text Variable-length character string

time [ (p) ] [ without time zone ] Time (no time zone)

time [ (p) ] with time zone timetz Includes time and time zone

timestamp [ (p) ] [ without time zone ] Date and time (no timezone)

timestamp [ (p) ] with time zone timestamptz Date and time, including time zone

tsquery Text search query

tsvector Text Search Document

txid_snapshot User-level transaction ID snapshot

uuid Universal unique identiier

xml XML data

3.1 Numeric Types

A numeric type consists of a 2/4/8 byte integer, a 4/8 byte loating point numbers, and a selectable total number of

digits.

The following numeric types are available:

Name Storage Size Description Range

smallint 2 bytes A small range of integers -32768 to +32767

integer 4 bytes Common integer -2147483648 to +2147483647

bigint 8 bytes A large range of integers -9223372036854775808 to

+9223372036854775807

decimal variable Custom precision, correct Up to 131072 digits before the decimal

point,

up to 16383 digits after the decimal

point

Data Type 19

Name Storage Size Description Range

numeric variable Custom precision, correct Up to 131072 digits before the decimal

point,

up to 16383 digits after the decimal

point

real 4 bytes Variable precision, incorrect 6 digits precision

double

precision

8 bytes Variable precision, incorrect 15 digits precision

smallserial 2 bytes Automatic incremental constant

(small)

1 to 32767

serial 4 bytes Automatic incremental constant 1 to 2147483647

bigserial 8 bytes Automatic incremental integer

(large)

1 to 9223372036854775807

3.1.1 Integer Types

smallint, integer, and bigint types store a wide range of integers without decimal fractions. If you try to store a value

beyond the allowable range, an error will occur.

• integer type: This type is generally chosen as it provides the best balance point of range, storage size, and per-

formance.

• smallint type: Typically used only when there is insuficient disk space.

• bigint type: To be used when the integer type range is insuficient.

SQL speciies only integer (or int), smallint, and bigint. (available as int2, int4, and int8 as well).

3.1.2 Arbitrary Precision Numbers

Numeric types may store a very large number of digits and perform calculations correctly. It is especially recom-

mended when storing amounts and quantities that should be accurate. Arithmetic of numeric values, however, is

much slower than the integer types or loating-point types described in the next section.

Scale in the numeric type means the number of digits to the right of the decimal point. Precision means the total

number of signiicant digits of the total number. That is, the total number of digits on both sides of the decimal point.

Thus, precision and scale of the number 23.5141 is 6 and 4, respectively. The scale of whole number can be consid-

ered 0 (Scale 0).

You may conigure both the maximum precision and the maximum scale of a numeric column.

20 AgensGraph Developer Manual

To declare a column of numeric type, use the following syntax:

NUMERIC(precision, scale)

Precision must be positive and scale must be zero or positive.

NUMERIC(precision)

If you specify numeric without precision or scale as follows, it selects Scale 0.

NUMERIC

If you create a numeric column that can store precision and scale values without specifying them, you may store the

precision value. This kind of column can be used without restriction if it does not specify a speciic scale. However, a

numeric column with a scale value speciied will be limited by the scale value. (When transferring data, make sure to

specify the precision and scale always).

Note: The maximum allowable precision is 1000 if explicitly speciied in the type declaration. NUMERICs

that do not specify precision are limited to the ranges described in the table.

In the case where the scale in the value to be stored is greater than the scale declared in the column, the system

rounds off the value to the speciied scale; after rounding-off, if the number of digits to the right of the decimal point

exceeds ``the declared scale subtracted from the declared precision,'' an error will occur. Numeric values are physi-

cally stored without extra 0values. Therefore, the precision and scale of the declared column is the maximum (not a

ixed allocation). In this sense, numeric types are closer to varchar (n) than to char (n). The actual storage require-

ment is 2 bytes for each 4-digit group plus 3 for 8-byte overhead.

The decimal and numeric types are the same and both are SQL standards.

3.1.3 Floating-Point Types

The real and double precision data types are inaccurate variable-precision numeric types. Inaccuracy means that

some values cannot be accurately converted to their internal form and are stored as approximate values, making the

storage and retrieval of values somewhat inconsistent.

• If you need accurate storage and computation (e.g. amount), use the numeric type instead.

• If you need to perform complex calculations using this type for some unavoidable reason (e.g. when you need a

speciic behavior in boundary cases (ininity, underlow)), you should be careful with the implementation.

• Comparing two loating-point values to see if they are equal may not always work as expected.

On most platforms, the real type has a minimum range of 1E-37 to 1E+37, with precision of at least 6. The double

precision type generally has precision of at least 15, ranging from 1E-307 to 1E+308. Too large or too small values

will generate an error. If precision of the entered number is too large, it may be rounded up. An underlow error oc-

curs if the number is so close to zero that it cannot be marked as non-zero.

Data Type 21

3.1.4 Serial Types

The smallserial, serial, and bigserial data types are not actual types, but are international notations for creating

unique identiier columns (similar to the AUTO_INCREMENT attribute supported by some other databases).

The following two statements work in the same manner:

-- SERIAL

CREATE TABLE tablename (

colname SERIAL

);

-- SEQUENCE

CREATE SEQUENCE tablename_colname_seq;

CREATE TABLE tablename (

colname integer NOT NULL DEFAULT nextval ('tablename_colname_seq')

);

ALTER SEQUENCE tablename_colname_seq OWNED BY tablename.colname;

Create an integer column and sort it by default assigned in the sequence. A NOT NULL constraint is applied to pre-

vent null values from being inserted. (In most cases, it is possible to prevent duplicate values from being accidentally

entered with a UNIQUE or PRIMARY KEY constraint; such a constraint is not automatically generated.)

Finally, the sequence is marked as column ``owned by,'' so that it is not deleted unless the column or table is deleted.

The serial column default is speciied in order to insert the next value of sequence into the serial column. This can be

done by excluding columns from the column list of the INSERT statement or by using the DEFAULT keyword.

• The serial type is the same as serial4; both generate an integer column.

• The bigserial and serial8 types work the same except when creating a bigint column. bigserial should be used

if you expect to use more than 231 identiiers throughout the lifetime of the table.

• The smallserial and serial2 types operate the same except when creating a smallint column.

The sequence created for the serial column is automatically deleted when the owning column is deleted. You can

delete the sequence without deleting the column, but the column base expression is forcibly deleted.

3.2 Character Types

The following character types are available:

22 AgensGraph Developer Manual

Name Description

character varying(n), varchar(n) Variable with limit

character(n), char(n) Fixed length, ill in blank

text Unlimited variable length

The two basic character types deine character variation(n) and character(n). The value of n is a positive integer, and

both can store up to n length characters (not bytes).

If you store a string that is longer than n, an error occurs if the excess string is not empty; if it is blank, it is truncated

to the length of the speciied n value. If you store a string that is shorter than n characters, it is illed with blanks for

the character type. For character varying, the string except blanks is stored.

Trailing blanks in character types are treated as not syntactically signiicant and are ignored when comparing two

values of the character type. Trailing blanks are syntactically signiicant when using pattern-matching regular ex-

pressions such as character varying, text, and LIKE.

char(n) and varchar(n) are aliases of character variation(n) and character(n). A character without a speciier is

equivalent to a character(1), and a character varying without a speciier stores strings regardless of its size. It also

provides a text type for storing strings, regardless of length.

An example of using the character type is shown below:

create table test1 (a character (4));

CREATE TABLE

insert into test1 values ('ok');

INSERT 0 1

select a, char_length(a) from test1;

a | char_length

------+-------------

ok | 2

create table test2 (b varchar(5));

CREATE TABLE

insert into test2 values ('ok');

INSERT 0 1

insert into test2 values ('good ');

INSERT 0 1

insert into test2 values ('too long');

Error: character varying(5) tries to store data that is too long for the data type.

Data Type 23

insert into test2 values ('too long'::varchar(5));

INSERT 0 1

select b, char_length(b) from test2;

b | char_length

-------+-------------

ok | 2

good | 5

too l | 5

3.3 Date/Time Types

The following date/time types are available:

Name

Storage

Size Description

Low

Value

High

Value Resolution

timestamp [ (p) ] [

without time zone ]

8 bytes Both date and time (no

timezone)

4713 BC 294276

1 microsecond

/ 14 digits

timestamp [ (p) ] with

time zone

8 bytes Both date and time,

including timezone

4713 BC 294276

1 microsecond

/ 14 digits

date 4 bytes Date (no time) 4713 BC 5874897

1 day

time [ (p) ] [ without

time zone ]

8 bytes Time (no date) 00:00:00 24:00:00 1 microsecond

/ 14 digits

time [ (p) ] with time

zone

12 bytes Include time of day,

time zone

00:00:00

+1459

24:00:00-

1459

1 microsecond

/ 14 digits

interval [ ields ] [ (p) ] 16 bytes Time interval -

178000000

years

178000000

years

1 microsecond

/ 14 digits

Note: The SQL standard uses timestamp and timestamp without time zone equally, and timestamptz is

the abbreviation for timestamp with time zone.

time, timestamp, and interval may set the scale (the number of digits of the decimal fraction) on the second ield as

the p value, and there is no explicit restriction on precision (total number of digits) by default. The allowable range

of p is 0 to 6 for timestamp and interval type.

24 AgensGraph Developer Manual

For the time type, the allowable range of p is 0 to 6 when using 8-byte integer storage, and 0 to 10 when using loating-

point storage.

The interval type has an additional option of limiting the set of stored ields by writing one of the following state-

ments:

YEAR

MONTH

DAY

HOUR

MINUTE

SECOND

YEAR TO MONTH

DAY TO HOUR

DAY TO MINUTE

DAY TO SECOND

HOUR TO MINUTE

HOUR TO SECOND

MINUTE TO SECOND

3.3.1 Date/Time Input

The date and time input can be speciied in the order of day, month, and year as follows:

set datestyle to sql, mdy;

set datestyle to sql, dmy;

set datestyle to sql, ymd;

Set the DateStyle parameter to MDY to select the month-day-year interpretation, DMY to select the day-month-year

interpretation, or YMD to select the year-month-day interpretation.

The date or time input must be preceded and followed by single quotation marks, like a text string.

• Date

Available date types:

Example Description

1999-01-08 ISO 8601. January 8 in random mode (recommended format)

January 8, 1999 Ambiguous in datestyle input mode

Data Type 25

Example Description

1/8/1999 January 8 in MDY mode. August 1 in DMY mode

1/18/1999 January 18 in MDY mode. Rejected in other modes

01/02/03 January 2, 2003 in MDY mode

February 1, 2003 in DMY mode

February 3, 2001 in YMD mode

1999-Jan-08 January 8 in random mode

Jan-08-1999 January 8 in random mode

08-Jan-1999 January 8 in random mode

99-Jan-08 January 8 in YMD mode; others are errors

08-Jan-99 January 8, except error in YMD mode

Jan-08-99 January 8, except error in YMD mode

19990108 ISO 8601. January 8, 1999 in random mode

990108 ISO 8601. January 8, 1999 in random mode

1999.008 Year and day of the year

J2451187 Julian date

January 8, 99 BC Year 99 BC

• Time

The visual type is time [(p)] without time zone and time [(p)] with time zone. time alone is the same as time

without time zone. A valid entry of this type consists of time, followed by an optional time zone. (See the table

below.) If the time zone is speciied as an input for time without time zone, it is ignored. You can specify the

date, but ignore it except when using the time zone name associated with daylight saving time, such as Ameri-

ca/New_York. In this case, a date speciication is required to determine whether the standard time or daylight

saving time period is applied. The appropriate time zone offset is recorded in the time with time zone value.

Table - [Enter Time]

Example Description

04:05:06.789 ISO 8601

04:05:06 ISO 8601

04:05 ISO 8601

040506 ISO 8601

04:05 AM Same as 04:05. AM does not affect the value.

04:05 PM Same as 16:05. Input must be <= 12.

04:05:06.789-8 ISO 8601

26 AgensGraph Developer Manual

Example Description

04:05:06-08:00 ISO 8601

04:05-08:00 ISO 8601

040506-08 ISO 8601

04:05:06 PST Time zone speciied by abbreviation

2003-04-12 04:05:06 America/New_York Time zone speciied by full name

Table - [En ter Time Slot]

Example Description

PST Abbreviation (for Paciic Time)

America/New_York All time zone names

PST8PDT POSIX style time zone speciication

-8:00 ISO-8601 Offset for PST

-800 ISO-8601 Offset for PST

-8 ISO-8601 Offset for PST

zulu Military acronym for UTC

See the Time Zones section below for how to specify the time zone.

• Time stamp

The valid input of timestamp type consists of a connection of date and time followed by an optional time zone

and optional AD or BC. (AD/BC may appear before the time zone, which is not the preferred order.)

1999-01-08 04:05:06

1999-01-08 04:05:06 -8:00

These are valid values in accordance with the ISO 8601 standard. The following command format is also sup-

ported:

January 8 04:05:06 1999 PST

The SQL standard distinguishes timestamp without time zone and timestamp with time zone literals by the

presence of a ``+'' or ``-'' sign and the time zone offset after that time.

Data Type 27

-- timestamp without time zone

TIMESTAMP '2004-10-19 10:23:54'

-- timestamp with time zone

TIMESTAMP '2004-10-19 10:23:54+02'

If you do not specify timestamp with time zone, it is treated as timestamp without time zone; thus, if you use

timestamp with time zone, you must specify an explicit type.

TIMESTAMP WITH TIME ZONE '2004-10-19 10:23:54+02'

• Special values

Some special date/time input values are supported for convenience as shown in the table. The values ininity

and -ininity are simple abbreviations that are specially represented within the system, without modiication,

but otherwise converted to the default date/time value when read. (Note the now and related strings are con-

verted to speciic time values at the moment they are read.) If you use all of these values as constants in your

SQL command, you must use single quotation marks around the constants.

Input String Valid Type Description

epoch date, timestamp 1970-01-01 00: 00: 00 + 00 (Unix system time 0)

ininity date, timestamp After all other timestamps

-ininity date, timestamp Before all other timestamps

now date, time, timestamp Start time of current transaction

today date, timestamp Midnight today

tomorrow date, timestamp Tomorrow midnight

yesterday date, timestamp Yesterday midnight

allballs time 00:00:00.00 UTC

You can also use the following SQL-compatible functions to get the current time values of the data types such

as CURRENT_DATE, CURRENT_TIME, CURRENT_TIMESTAMP, LOCALTIME, and LOCALTIMESTAMP.

3.3.2 Date/Time Output

The output format of the date/time type can be set to one of four styles: ISO 8601, SQL (Ingres), typical POSTGRES

(Unix date format) or German. The default is ISO format.

The following table shows an example of each output style. The output of date/time type has usually only the date or

28 AgensGraph Developer Manual

time depending on the example given.

Style Speciication Description Example

ISO ISO 8601, SQL standard 1997-12-17 07:37:16-08

SQL Traditional style 12/17/1997 07:37:16.00 PST

Postgres Original style Wed Dec 17 07:37:16 1997 PST

German Local style | 17.12.1997 07:37:16.00 PST

If the DMY ield order is speciied, it is output in order of month, day. In other cases, it is output in order of day and

month. The following table is an example:

datestyle Setting Input Ordering Example Output

SQL, DMY day/month/year 17/12/1997 15:37:16.00 CET

SQL, MDY month/day/year 12/17/1997 07:37:16.00 PST

Postgres, DMY day/month/year Wed 17 Dec 07:37:16 1997 PST

The date/time style can be selected by the user using SET datestyle command, DateStyle parameter in the post-

gresql.conf coniguration ile, or PGDATESTYLE environment variables of the server or client. The formatting func-

tion to_char allows you to specify date/time output formats in a more lexible manner.

3.3.3 Time Zones

Time zones and notations are affected by political decisions in addition to topographical features. Worldwide time

zones have been standardized in the 1900s but are constantly changing due to daylight time regulations. Agens-

Graph uses the IANA (Olson) timezone database. For future times, it considers that the latest known rules for a spec-

iied time zone will be complied with indeinitely in the distant future. However, it has a peculiar mix of dates, time

types and features.

Two obvious problems are:

• Date type does not have an associated time zone (only time type has). In the real world, the time zone has no

meaning unless it is associated with date and time, since the offset can vary over the year containing the day-

light saving time boundary.

• The default time zone is speciied as a constant numeric offset from UTC. Thus, it may not be possible to adapt

to daylight saving time when performing date/time calculations across DST boundaries.

When using time zone to solve this problem, we recommend using date/time type that includes both date and time

(this is supported for compatibility but we do not recommend using the time with time zone type). Assume a local

Data Type 29

time zone for types that only contain dates or times. The date and time in the time zone are internally stored in UTC.

Then, they are converted to a local time in the Time Zone speciied by the TimeZone coniguration parameters before

being displayed to the client. Allow the time zone to be speciied in three different formats.

• The full names of time zone (e.g. America/New_York). The recognized time zone names are listed in the

pg_timezone_names view. (Identical time zone names are recognized by many other software applications as

well.)

• Time zone abbreviations (e.g. PST). In contrast to the full names of time zone that can imply the daylight time

conversion date rule set, the corresponding speciication simply deines a speciic offset from UTC. The recog-

nized abbreviations are listed in the pg_timezone_abbrevs view. You cannot set the coniguration parameters

TimeZone or log_timezone as a time zone abbreviation, but may use an abbreviation and AT TIME ZONE oper-

ator as date/time input values.

• In addition to time zone names and abbreviations, it accepts POSIX style time zone speciications of STDoffset

or STDoffsetDST; STD is a regional abbreviation, offsetUTC is the numerical offset for time of the west, and DST

is the optional summertime local abbreviation that is assumed to be one hour earlier than the speciied offset.

For instance, if EST5EDT is not yet a recognized local name, it is accepted and becomes functionally identical

to the US East Coast time. In this syntax, local abbreviations can be any string of characters or any string using

angle brackets (<>). If a daylight saving area abbreviation is present, it is assumed to be used in accordance

with the same daylight saving time conversion rules as those used in the posixrules entry in the IANA time

zone database. As posixrules are identical with US/Eastern in a standard AgensGraph installation, the POSIX

style time zone speciication complies with the USA Daylight Saving Time rules. You can adjust this behavior by

replacing the posixrules ile, if necessary.

Simply put, this is the difference between abbreviations and full names. Time zone abbreviations represent a speciic

offset from UTC, while many of time zone full names imply a local daylight time rule and have two possible UTC off-

sets. For instance, ``2014-06-04 12:00 America/New_York'' representing the noon in the New York area was Eastern

Daylight Time (UTC-4) for the day. Accordingly, ``2014-06-04 12:00 EDT'' speciies the same time instance. However,

``2014-06-04 12:00 EST'' speciies noon Eastern Standard Time (UTC-5) regardless of whether daylight saving time

is nominally in effect on that date.

To complicate the problem, some areas use the same time zone abbreviations meaning different UTC offsets at dif-

ferent times. For example, MSK in Moscow meant UTC+3 for several years, and UTC+4 for other cases. Even if these

abbreviations are interpreted according to their meanings for a given date (or most recent meaning), as in the above

EST example, this does not necessarily match the local time of the date. Note that the POSIX style time zone function

does not check for the correctness of local abbreviations; this means you may silently accept false input. For exam-

ple, SET TIMEZONE TO FOOBAR0 works by letting the system use speciic abbreviations for UTC. Another problem

to keep in mind is that positive offset is used for the Greenwich's position west in the POSIX time zone region name.

30 AgensGraph Developer Manual

Elsewhere, AgensGraph conforms to the ISO-8601 notation, where the positive time-zone offset is east of Greenwich.

Time zone names and abbreviations are case sensitive and are not embedded in the server; they are to be searched

from the coniguration iles stored in the installation directory (…/share/timezone/and …/share/timezonesets/).

TimeZone coniguration parameters can be set in other standard ways, which is different from postgresql.conf as

follows:

• The SQL command SET TIME ZONE sets a time zone for the session. You can use SET TIMEZONE TO, which is

more compatible with the SQL speciication.

• The PGTZ environment variable is used by the libpq client to send SET TIME ZONE command to the server

when connected.

3.3.4 Interval Input

The interval value can be written using the following detailed syntax:

[@] quantity unit [quantity unit...] [direction]

Where quantity is a number and unit can be microsecond, millisecond, second, minute, hour, day, week, month, year,

decade, century, millennium, or their abbreviations, singular or plural; direction can be ago or empty. The at (@)

sign is an optional noise. Quantities in different units are implicitly added using an appropriate sign accounting. This

syntax is also used for interval output when IntervalStyle is set to postgres_verbose.

Days, hours, minutes and seconds can be speciied without explicitly marking units. For example, ``1 12:59:10'' is

read the same as ``1 day 12 hours 59 min 10 sec.''

Combinations of years and months can be speciied using dashes as well. For example, ``200-10'' is read the same

as ``200 years 10 months.'' (This short format is actually the only one allowed by the SQL standards, and is used for

output if IntervalStyle is set to sql_standard).

The format using speciiers:

P quantity unit [ quantity unit ...] [ T [ quantity unit ...]]

The string must contain P, and may contain T to include the unit of time. Available unit abbreviations are listed in the

table below. Units can be omitted and can be speciied in any order, but units of less than one day must appear after

T. In particular, the meaning of M varies depending on whether it is before or after T.

Table [ISO 8601 Interval Unit Abbreviations]

Abbreviation Meaning

Y Year

M Month(in the date part)

Data Type 31

Abbreviation Meaning

W Week

D Day

H Hour

M Minute(in the time part)

S Second

Alternative format:

P [ years-months-days ] [ T hours:minutes:seconds ]

The string must start with P, and T separates the date and time of the interval. The value is speciied as a number

similar to the ISO 8601 date. If you create an interval constant with the ields speciication, or if you assign a string to

an interval column deined by the ields speciication, the interpretation of the unmarked quantity varies depending

on the ields. For example, INTERVAL `1' YEAR is interpreted as a year, whereas INTERVAL `1' means 1 second. The

lowest right ield value allowed by the ields speciication is ignored.

For example, INTERVAL `1 day 2:03:04' HOUR TO MINUTE will eventually delete the ``seconds'' ield (not the date

ield). As all ields of interval values must have the same signs according to the SQL standards, the leading negative

signs apply to all ields. For example, the negative sign in the interval literal `-1 2:03:04' applies to both date and

hour/minute/second parts. As the ields are allowed to have different signs and the signs of each ield are indepen-

dently processed in text representation, the hour/minute/second part is regarded as a positive value in this example.

If IntervalStyle is set to sql_standard, the leading sign is assumed to be applied to all ields (if there is no additional

sign).

If the ield is negative, it is better to explicitly append the (negative) sign to avoid ambiguity. Internally, interval val-

ues are stored as months, days, and seconds. This is because the number of days in the month is different, and a day

can be 23 or 25 hours depending on implementation of daylight saving time. The month and day ields are integers,

and the seconds ield can be stored as decimals. Since the interval is usually generated by constant strings or times-

tamp subtraction, this way of storage is not problematic in most cases. The functions justify_days and justify_hours

are useful when you want to control overlowing days and times in the normal range. Field values in some ields of

the detailed input format and the simpler input ield may have a decimal fraction (e.g. ``1.5 week'' or ``01:02:03.45'').

These inputs are converted to an appropriate number of months, days, and seconds for storage. This results in the

decimal(s) of months or days being added to a subield using the conversion factor Jan=30 days and 1 day=24 hours.

For example, ``1.5 month'' is one month and 15 days. Only the seconds are displayed with a decimal fraction. The ta-

ble below shows some examples of valid interval inputs.

Table [Interval Input]

32 AgensGraph Developer Manual

Example Description

1-2 SQL standard format: 1 year 2 months

3 4:05:06 SQL standard format: 3 days 4 hours 5 minutes 6 seconds

1 year 2 months 3 days 4 hours 5 minutes 6

seconds

Typical Postgres format: 1 year 2 months 3 days 4 hours 5

minutes 6 seconds

P1Y2M3DT4H5M6S ISO 8601 ``format with designators'': same as above

P0001-02-03T04:05:06 ISO 8601 ``alternative format'': same as above

3.3.5 Interval Output

The output format of interval type can be set to one of the four styles sql_standard, postgres, postgres_verbose, or

iso_8601 using the command SET intervalstyle. The default is postgres. The table below shows an example of each

output style. The sql_standard style generates an output that conforms to the SQL standard speciication for the in-

terval literal string if the interval value satisies the standard limit (year-month or day-minute only if positive and

negative values are not mixed). Otherwise, a sign is explicitly added to eliminate the confusion of the sign mixing in-

terval; the output appears as if the standard year-month literal string is followed by a day-time literal string.

Table [Example of interval output style]

Style Speciication Year-Month Interval Day-Time Interval Mixed Interval

sql_standard 1-2 3 4:05:06 -1-2 +3 -4:05:06

postgres 1 year 2 mons 3 days 04:05:06 -1 year -2 mons +3 days

-04:05:06

postgres_verbose @ 1 year 2 mons @ 3 days 4 hours 5

mins 6 secs

@ 1 year 2 mons -3 days 4 hours

5 mins 6 secs ago

iso_8601 P1Y2M P3DT4H5M6S P-1Y-2M3DT-4H-5M-6S

3.4 Boolean Type

It provides a standard SQL type boolean; on top of ``true'' and ``false'' states, it has ``unknown,'' a third state that is

expressed as SQL null.

Name Storage Size Description

boolean 1 byte True or False state

• Valid literal values in the true state are:

Data Type 33

TRUE

't'

'true'

'y'

'yes'

'on'

'1'

• In the false state, you may use the following values:

FALSE

'f'

'false'

'n'

'no'

'off'

'0'

Leading or trailing blanks are ignored and it is not case sensitive. Using the keywords TRUE and FALSE is preferred.

The following example shows that a boolean value is displayed (output) using the characters t and f.

An example of using the boolean type:

CREATE TABLE test1 (a boolean, b text);

INSERT INTO test1 VALUES (TRUE,'sic est');

INSERT INTO test1 VALUES (FALSE,'non est');

SELECT *FROM test1;

a | b

---+---------

t | sic est

f | non est

SELECT *FROM test1 WHERE a;

a | b

---+---------

t | sic est

34 AgensGraph Developer Manual

3.5 Geometric Types

Geometric types display two-dimensional spatial objects and the available geometric types are as follows:

Name Storage Size Description Representation

point 16bytes Point on the plane (x,y)

line 32bytes Ininite straight line {A,B,C}

lseg 32bytes Ininite segment ((x1,y1),(x2,y2))

box 32bytes Square box ((x1,y1),(x2,y2))

path 16+16n bytes Closed path (similar to a polygon) ((x1,y1),…)

path 16+16n bytes | Open path | [(x1,y1),…]

polygon 40+16n bytes Polygon (similar to a closed path) ((x1,y1),…)

circle 24bytes Circle <(x,y),r> (Center point and radius)

3.5.1 Points

Points are the basic two-dimensional building blocks for geometric types. The value of point type is speciied using

one of the following syntaxes:

(x,y)

x , y

Where x and y are the loating-point coordinates, respectively. Points are output using the irst syntax.

3.5.2 Lines

Lines are expressed by a linear equation Ax+By+C=0; where A and B are both nonzero. The value of line type is the

input and output in the following format.

{ A, B, C }

Alternatively, you can use one of the following formats for input:

[(x1,y1),(x2,y2)]

((x1,y1),(x2,y2))

(x1,y1),(x2,y2)

x1 , y1 , x2 , y2

Where (x1, y1) and (x2, y2) are two different points on a straight line.

Data Type 35

3.5.3 Line Segments

A line segment is represented by a pair of points that form the two end points of a line segment. The value of lseg

type is speciied using one of the following syntaxes.

[(x1,y1),(x2,y2)]

((x1,y1),(x2,y2))

(x1,y1),(x2,y2)

x1 , y1 , x2 , y2

Where (x1, y1) and (x2, y2) are the two end points of a line segment. The Line Segment is output using the irst syn-

tax.

3.5.4 Boxes

A box is represented by a pair of points that form the opposite corners of the box. The value of box type is speciied

using one of the following syntaxes:

((x1,y1),(x2,y2))

(x1,y1),(x2,y2)

x1 , y1 , x2 , y2

Where (x1, y1) and (x2, y2) are the two opposite corners of the box. A box is output using the second syntax. Two

opposite corners are provided as inputs, but values are reordered as needed to be saved as upper right corner and

lower left corner.

3.5.5 Paths

A path is represented by a list of connected points. If the irst and last points of the list are considered unconnected,

it is an open path. If the irst and last points are considered to be connected, it is a closed path. The value of path type

is speciied using one of the following syntaxes:

[(x1,y1),...,(xn,yn)]

((x1,y1),...,(xn,yn))

(x1,y1),...,(xn,yn)

( x1 , y1 , ... , xn , yn )

x1 , y1 , ... , xn , yn

Where the points are the two end points of a line segment constituting the path. Square brackets ([]) indicate open

paths and parentheses (()) indicate closed paths. If the outermost parentheses are omitted, as in the third through

ifth statements, they are considered closed paths. The path is output using the second syntax properly.

36 AgensGraph Developer Manual

3.5.6 Polygons

Polygons are represented by a list of points (polygon vertices). Polygons are very similar to closed paths, but are

stored differently and have their own set of routines supported. The value of polygon type is speciied using one of

the following syntaxes:

((x1,y1),...,(xn,yn))

(x1,y1),...,(xn,yn)

( x1 , y1 , ... , xn , yn )

x1 , y1 , ... , xn , yn

Where the points are the two end points of a line segment constituting the boundary of the polygon. The polygon is

output using the irst syntax.

3.5.7 Circles

A circle is represented by the center point and radius. The value of circle type is speciied using one of the following

syntaxes:

<(x,y),r>

((x,y),r)

(x,y),r

x,y ,r

Where (x, y) is the center of the circle and r is the radius. Circle is output using the irst syntax.

3.6 XML Type

The XML type is used to store XML data. It allows you to inspect input values in a well-formed format rather than

storing XML data in a text ield, and has functions that supports safe operations. This data type requires an installa-

tion built with conigure --with-libxml.

3.6.1 Creating XML Values

To generate an xml type value from character data, use the function below:

XMLPARSE ( { DOCUMENT | CONTENT } value)

Examples:

Data Type 37

XMLPARSE (DOCUMENT '<?xml version="1.0"?><book><title>Manual</title><chapter>...</chapter></book>')

XMLPARSE (CONTENT 'abc<foo>bar</foo><bar>foo</bar>')

The XML type does not check the input value for the document type deinition (DTD) even if the input value is speci-

ied as DTD. It does not support validation of other XML schema languages (e.g. XML schema).

The inverse operation to generate a string value in xml uses the following function:

XMLSERIALIZE ( { DOCUMENT | CONTENT } value AS type )

type can be character, character varying, text (or an alias in it). According to the SQL standard, this is the only way to

convert XML and character types, but may simply cast the values.

Choosing DOCUMENT and CONTENT is determined by the ``XML Option'' session coniguration parameters, which

can be set using standard commands when a string value is cast to xml type or passes through XMLSARIALIZE with-

out going through XMLPARSE or XMLSERIALIZE.

SET xmloption TO { DOCUMENT | CONTENT };

As the default is CONTENT, XML data in any format are allowed.

Note: You cannot directly convert a string containing DTD to an XML format, since the deinition of an

XML content fragment does not allow XML if the default XML option settings are used. You should use

XMLPARSE or change the XML option.

3.6.2 Encoding Handling

Care should be taken when you process multiple character encodings on the client and when XML data is passed

through it. If you use text mode to pass a query to the server and pass a query result to the client (normal mode),

convert all character data passed between the client and server, and convert the character encoding backwards

at each end. It contains a text representation of the XML value as shown in the example above. Usually this implies

that the encoding declaration contained in the XML data can be invalidated, as the embedded encoding declaration

does not change, if the character data is converted to another encoding during transmission between the client and

server. To cope with this behavior, the encoding declaration contained in the character string that exists for the XML

type input is ignored and CONTENT is regarded as the current server encoding. As a result, strings of XML data must

be transmitted from the client through the current client encoding for proper processing. It is the client's responsi-

bility to convert the document to the current client encoding or to properly adjust the client encoding before sending

it to the server. The value of type XML in the output does not have an encoding declaration, and the client considers

all data to be the current client encoding.

If you use binary mode to pass query parameters to the server and pass the query results back to the client, the char-

acter set conversion is not performed, making things more dificult. In this case, the encoding declaration of the XML

38 AgensGraph Developer Manual

data is complied with; if absent, the data is assumed to be UTF-8 (note that this is required by the XML standard and

does not support UTF-16). At the output, the data has an encoding declaration specifying the client encoding, if it is

not the client encoding is UTF-8; if it is, the encoding declaration will be omitted.

Processing XML data is less likely to cause errors, and is more eficient when the XML data encoding, client encod-

ing, and server encoding are all identical. Since XML data is internally processed as UTF-8, the calculation is most

eficient when the server is UTF-8 as well.

Note: Some XML-related functions may not work with non-ASCII data if the server encoding is not UTF-8.

This is especially known as a problem in xpath ().

3.6.3 Accessing XML Values

The XML data type is unique in that it does not provide a comparison operator. This is because there is no well-

deined and universally useful comparison algorithm for XML data. As a result, you cannot retrieve a row by com-

paring the xml column with the search value. You must use XML with a separate key ield (e.g. ID). An alternative

solution to compare XML values is to convert it to a string irst. Note, however, that string comparisons have nothing

to do with useful XML comparison methods.

Since there is no comparison operator for XML data types, it is not possible to create an index directly on this type

of column. If you need to retrieve XML data fast, the possible solutions include casting and indexing expressions to

character string types or indexing XPath expressions. Of course, the search should be adjusted by the indexed ex-

pressions in actual querying.

The text search feature can also be used to speed up retrieval of entire documents in XML data. However, the neces-

sary preprocessing support is not yet available.

3.7 JSON Types

The JSON data type is for storing JavaScript Object Notation (JSON) data as speciied in RFC 71591. The applicable

data can also be saved as text, but the JSON data type has an advantage of enforcing each stored value to be valid ac-

cording to the JSON rules. There are also a number of JSON-speciic functions and operators available for these types

of stored data.

There are two JSON data types: JSON and JSONB. These two types accept almost the same set of values as input. A

substantial difference between the two is eficiency. The JSON data type stores an exact copy of the input text, and

its processing function must be re-parsed for each execution. On the other hand, the JSONB data is stored in decom-

posed binary format, which is a bit slower on input due to the added conversion overhead, but much faster during

processing because no re-parsing is required. JSONB also supports indexing, which can be a signiicant advantage.

The JSON type stores exact copies of the input text, preserving syntactically-insigniicant whitespace between to-

Data Type 39

kens and key sequences in the JSON object. In addition, if a JSON object inside a value contains the same key more

than once, all key/value pairs are retained. (Processing functions assume the last value is valid.) Conversely, JSONB

does not retain whitespace, the order of the object keys, and duplicate object keys. If a duplicate key is speciied

in the input, only the last value is retained. In general, most applications should store JSON data as JSONB unless

there is a speciic reason such as existing assumptions about object key ordering. Only one character set encoding

per database is allowed. Therefore, when the database encoding is not UTF8, it is impossible for the JSON type to

strictly comply with the JSON speciication. Attempts to directly include characters that cannot be represented in the

database encoding would fail. In contrast, characters that cannot be represented by UTF8 but can be represented

by database encoding are allowed. RFC 7159 allows a JSON string to contain a Unicode escape sequence denoted by

\uXXXX. In a JSON type input function, Unicode escapes are allowed regardless of the database encoding, and are

checked for syntactical correctness (i.e. \u followed by four hexadecimal digits). However, the input function for

JSONB is stricter. When the database encoding is not UTF8, it does not allow Unicode escapes for non-ASCII charac-

ters (U+007F and above). In addition, in JSONB types, use of a Unicode surrogate pair that denies \u0000 (as it can-

not be represented as a text type) and speciies characters out of the Unicode Basic Multilingual Plane (BMT) must

be correct; it is converted to ASCII or UTF8 characters corresponding to a valid Unicode escape and stored (overlap-

ping surrogate pairs are included as a single character).

Note: Many JSON processing functions convert Unicode escapes to regular characters, and thus return an

error even if the input is of type json rather than jsonb. In general, it is best not to mix Unicode escapes

with UTF8 database encodings in JSON whenever possible.

If you convert the text JSON input to JSONB, the primitive types described in RFC 7159 are effectively mapped to the

native AgensGraph types, as shown in Table 8-23. Accordingly, several local constraints that do not apply to the JSON

type and JSON (even abstractly) and that constitute valid JSONB data are added; this corresponds to a restriction on

whether it can be represented as a primitive data type or not. In particular, JSONB rejects numbers that are outside

the range of the AgensGraph numeric data type and that do not leave JSON. Constraints on which the correspond-

ing implementations are deined are allowed by RFC 7159. In practice, however, this problem is much more com-

mon in other implementations, as JSON's number base type is commonly used to represent IEEE 754 double loat-

ing point (explicitly predicted and allowed in RFC 7159). If you use JSON in interchange format with the system, you

should consider the risk of loss of numeric precision when compared to the original data stored in AgensGraph. Con-

versely, there are some local constraints on the input type of the JSON base type that do not apply to the correspond-

ing AgensGraph type, as shown in the table.

[JSON basic type and corresponding AgensGraph type]

40 AgensGraph Developer Manual

JSON primitive

type Type Notes

string text If the database encoding is not UTF8, 0̆000 is not allowed as it is a

non-ASCII Unicode escape

number numeric NaN and ininity values are not allowed

boolean boolean Only lowercase spelling true and false are accepted

null (없음) SQL NULL is conceptually different

3.7.1 JSON Input and Output Syntax

The input/output syntaxes in the JSON data type can be speciied as in RFC 7159. The following example shows all

valid JSON (or JSONB) expressions.

-- Simple scalar/primitive value

-- Primitive values can be numbers, quoted strings, true, false, or null

SELECT '5'::json;

-- Array of zero or more elements (elements need not be of same type)

SELECT '[1, 2, "foo", null]'::json;

-- Object containing pairs of keys and values

-- Note that object keys must always be quoted strings

SELECT '{"bar": "baz", "balance": 7.77, "active": false}'::json;

-- Arrays and objects can be nested arbitrarily

SELECT '{"foo": [true, "bar"], "tags": {"a": 1, "b": null}}'::json;

As mentioned earlier, when a JSON value is input and printed without further processing, JSON outputs the same

text as the input, and JSONB does not retain syntactically-insigniicant content (e.g. spaces). Refer to the differences

presented below.

SELECT '{"bar": "baz", "balance": 7.77, "active":false}'::json;

json

-------------------------------------------------

{"bar":"baz","balance":7.77,"active":false}

(1row)

Data Type 41

SELECT '{"bar": "baz", "balance": 7.77, "active":false}'::jsonb;

jsonb

--------------------------------------------------

{"bar":"baz","active":false,"balance":7.77}

(1row)

One thing to note, though not syntactically-signiicant, is that it is in JSONB, where numbers are printed according to

the default numeric type. In practice, this means that the number marked as ``e'' will be omitted when printed. Here

is an example:

SELECT '{"reading": 1.230e-5}'::json,'{"reading": 1.230e-5}'::jsonb;

json | jsonb

-----------------------+-------------------------

{"reading":1.230e-5} | {"reading":0.00001230}

(1row)

As shown in this example, however, JSONB preserves the trailing zero (0) for checkup even if it is syntactically-insigniicant.

3.7.2 Designing JSON documents eectively

Expressing data in JSON can be much more lexible than that in traditional relational data models where require-

ments are enforced in a variable environment. Both methods may coexist and complement each other in the same

application. However, in applications that require maximum lexibility, it is recommended that the JSON documents

have a slightly ixed structure. Structures are not generally applicable (you can also apply some business rules declar-

atively), but using a predictable structure makes it easier to write queries that usefully summarize a ``document''

(datum) set of tables. JSON data is affected by the same concurrency control considerations as other data types when

stored in tables. It should be noted that, even though it is feasible to store a large document, updates obtain row-

level locking for the entire row. You should consider limiting the JSON document to a manageable size in order to

reduce lock contention between update transactions. In principle, JSON documents indicate that the atomic data

pointed by business rules cannot be segmented into smaller datums, each of which can be modiied independently.

3.7.3 JSONB Containment and Existence

Testing containment is an important feature of JSONB. There is no feature set similar to the JSON type. Containment

tests whether a single JSONB document is contained in another document. This example returns true except where

noted.

42 AgensGraph Developer Manual

-- Simple scalar/primitive values contain only the identical value:

SELECT '"foo"'::jsonb @> '"foo"'::jsonb;

-- The array on the right side is contained within the one on the left:

SELECT '[1, 2, 3]'::jsonb @> '[1, 3]'::jsonb;

-- Order of array elements is not significant, so this is also true:

SELECT '[1, 2, 3]'::jsonb @> '[3, 1]'::jsonb;

-- Duplicate array elements don't matter either:

SELECT '[1, 2, 3]'::jsonb @> '[1, 2, 2]'::jsonb;

-- The object with a single pair on the right side is contained

-- within the object on the left side:

SELECT '{"product": "PostgreSQL", "version": 9.4, "jsonb": true}'::jsonb

@> '{"version": 9.4}'::jsonb;

-- The array on the right side is not considered contained within the

-- array on the left, even though a similar array is nested within it:

SELECT '[1, 2, [1, 3]]'::jsonb @> '[1, 3]'::jsonb;-- yields false

-- But with a layer of nesting, it is contained:

SELECT '[1, 2, [1, 3]]'::jsonb @> '[[1, 3]]'::jsonb;

-- Similarly, containment is not reported here:

SELECT '{"foo": {"bar": "baz"}}'::jsonb @> '{"bar": "baz"}'::jsonb;-- yields false

-- A top-level key and an empty object is contained:

SELECT '{"foo": {"bar": "baz"}}'::jsonb @> '{"foo": {}}'::jsonb;

The general principle is that the structure and data content of the contained objects are consistent with the contain-

ing objects. It may be possible after discarding unmatched (inconsistent) array elements or object key/value pairs

from the containing object. However, when comparing containments, the order of array elements is not important,

and duplicate array elements are actually considered only once.

As a special exception to the general principle that structures must be matched, arrays may contain primitive values.

Data Type 43

-- This array contains the primitive string value:

SELECT '["foo", "bar"]'::jsonb @> '"bar"'::jsonb;

-- This exception is not reciprocal -- non-containment is reported here:

SELECT '"bar"'::jsonb @> '["bar"]'::jsonb;-- yields false

JSONB has the existence operator that is a variation of the containing theme, which tests whether a string (speciied

as a text value) appears at the top level of the JSONB value as an object key or array element. This example returns

true, except where noted.

-- String exists as array element:

SELECT '["foo", "bar", "baz"]'::jsonb ?'bar';

-- String exists as object key:

SELECT '{"foo": "bar"}'::jsonb ?'foo';

-- Object values are not considered:

SELECT '{"foo": "bar"}'::jsonb ?'bar';-- yields false

-- As with containment, existence must match at the top level:

SELECT '{"foo": {"bar": "baz"}}'::jsonb ?'bar';-- yields false

-- A string is considered to exist if it matches a primitive JSON string:

SELECT '"foo"'::jsonb ?'foo';

Unlike arrays, JSON objects are optimized internally for search and do not perform linear searches. This means that

they are more suitable than arrays that test for containment or existence when there are many related keys and ele-

ments.

3.7.4 JSONB Indexing

The GIN index can be used to eficiently search for a key or a key/value pair in a number of JSONB documents (da-

tums). Two GIN ``operator classes'' with different performance and lexibility tradeoffs are provided. The default GIN

operator classes for JSONB support queries using operators such as @>, ?, ?&, and ?|. Here is an example of creating

an index within this operator class:

44 AgensGraph Developer Manual

CREATE INDEX idxgin ON api USING GIN (jdoc);

jsonb_path_ops, which is not a default GIN operator class, supports indexing of @> operator only. Here is an example

of creating an index within this operator class:

CREATE INDEX idxginp ON api USING GIN (jdoc jsonb_path_ops);

Consider a table example that stores a JSON document retrieved from a third-party Web service using a documented

schema deinition. The general document will be as follows:

{

"guid":"9c36adc1-7fb5-4d5b-83b4-90356a46061a",

"name":"Angela Barton",

"is_active":true,

"company":"Magnafone",

"address":"178 Howard Place, Gulf, Washington, 702",

"registered":"2009-11-07T08:53:22 +08:00",

"latitude":19.793713,

"longitude":86.513373,

"tags": [

"enim",

"aliquip",

"qui"

]

}

Save this document as a JSONB column called jdoc in a table called api. When a GIN index is created in this column,

the following query uses the index.

-- Find documents in which the key "company" has value "Magnafone"

SELECT jdoc->'guid', jdoc->'name' FROM api WHERE jdoc @> '{"company": "Magnafone"}';

However, even though the operator ``?'' can be indexed, it is not directly applied to the indexed column jdoc. Thus,

the index cannot be used in the following query.

-- Find documents in which the key "tags" contains key or array element "qui"

SELECT jdoc->'guid', jdoc->'name' FROM api WHERE jdoc -> 'tags' ?'qui';

Still, the above query can use the index, if an expression index is properly used. If a query for a particular item is

common within the ``tags'' key, an index deinition like the one below is useful.

Data Type 45

CREATE INDEX idxgintags ON api USING GIN ((jdoc -> 'tags'));

WHERE clause jdoc->`tags' ? `qui' is an application of the indexable operator ``?'' and the indexed expression jdoc ->

`tags' is recognized.

Another way to query is to make the best use of containment. For example, a simple GIN index on a jdoc column can

support this query.

-- Find documents in which the key "tags" contains array element "qui"

SELECT jdoc->'guid', jdoc->'name' FROM api WHERE jdoc @> '{"tags": ["qui"]}';

However, while these indexes store copies of all the keys and values of the jdoc column, the expression index in the

previous example stores only the data found under the tags key. It is true that the simple index approach is much

more lexible (as it supports queries on arbitrary keys), but the targeted expression index is much smaller and more

fast-searched than the simple index.

The jsonb_path_ops operator class only supports queries using @> operator, but it has a good performance advan-

tage over the default operator class, jsonb_ops. With the same data, the size and search speciicity of the jsonb_path_ops

index are generally much smaller and better than those of the jsonb_ops index, respectively. This is especially true

for queries that contain keys that appear frequently in the data. Searches with this class are therefore generally much

better than using the default operator class.

A technical difference between jsonb_ops and jsonb_path_ops GIN indexes is that the former creates index entries

that are independent for each key and value of the data, while the latter only creates index entries for the data val-

ues. By default, each jsonb_path_ops index entry is a hash of values and keys leading to it. Let's take a look at an in-

dex {``foo'': {``bar'': ``baz''} as an example; a single index entry is generated by combining all three of foo, bar and

baz into a hash value. Thus, a constraint query that looks for such a structure results in an extremely speciic index

search. However, there is no way to know at all whether foo will appear as a key. Conversely, the jsonb_ops index

creates three index entries that represent foo, bar, and baz, respectively. Then, to execute a constraint query, it looks

up a row that contains all three of these items. A GIN index can perform its AND search very eficiently, but is less

speciic and slower than an equivalent jsonb_path_ops search, especially when there are a great deal of rows con-

taining one of the three index entries. A disadvantage of the jsonb_path_ops approach is that it does not create index

entries for JSON structures that do not contain values such as {``a'': {}}. A full index scan is required if a document

search containing the structure is requested, which is very slow. Thus, jsonb_path_ops is not suitable for applications

that perform frequent searches. JSONB also supports btree and hash indexes, which is useful only when checking the

equivalence of the entire JSON document. The B-tree ordering of JSONB data is not that important, but is necessary

for completeness.

Object > Array > Boolean > Number > String > Null

46 AgensGraph Developer Manual

Object with n pairs > object with n - 1 pairs

Array with n elements > array with n - 1 elements

Objects that have the same number of pairs are compared in the following order:

key-1, value-1, key-2 ...

Object keys are compared in the order of storage; in particular, storing short keys in front of long keys leads to non-

intuitive results:

{ "aa": 1, "c": 1} > {"b": 1, "d": 1}

Similarly, arrays with the same number of elements are compared in the following order:

element-1, element-2 ...

The native JSON values are compared using the comparison rules, which also apply to the primitive data types. Strings

are compared using the default database collation.

3.8 Arrays

Arrays allow the column type of the table to be deined as a variable length multidimensional array. You can create

arrays of built-in or user-deined base types, enum types, or composite types. Arrays of domains are not yet sup-

ported.

3.8.1 Declaration of Array Types

In order to explain use of array types, let's create a table as follows:

CREATE TABLE sal_emp (

name text,

pay_by_quarter integer[],

schedule text[][]

);

As indicated, the array data type is named by adding square brackets ([]) to the array element's data type name. The

above command creates a table called sal_emp with a column of type text (name), where a one-dimensional array

of type integer (pay_by_quarter) represents the employee's quarterly salary and a two-dimensional array of text

(schedule) indicates the employee's weekly schedule. CREATE TABLE allows the exact size of the array to be speci-

ied. For example:

Data Type 47

CREATE TABLE tictactoe (

squares integer[3][3]

);

However, the current implementation ignores the array size limits provided. That is, the operation is the same as

an array of unspeciied length. The current implementation does not enforce the declared number of dimensions.

Arrays of particular element types are considered to be of the same type, regardless of their size or the number of

dimensions. Therefore, declaring an array size or the number of dimensions in CREATE TABLE is simply documenta-

tion and does not affect runtime behavior.

You can utilize the keyword ARRAY to use an alternative syntax that conforms to the SQL standard for one-dimensional

arrays. pay_by_quarter may have been deined as:

pay_by_quarter integer ARRAY[4],

If the array size is not speciied:

pay_by_quarter integer ARRAY,

In any case, however, size constraints are not enforced.

3.8.2 Array Value Input

To create an array value as a literal constant, place the value in braces and separate it with a comma. You can use

double quotes around the value; you should do this if it contains a comma or brace. The general form of an array

constant:

'{ val1 delim val2 delim ... }'

In the example above, delim is the delimiter for the type recorded in the pg_type entry. All of the standard data types

provided by the AgensGraph distribution use commas, except for the type box that uses semicolons (;). Each val is a

constant of the array element type or subarray. For example:

'{{1,2,3},{4,5,6},{7,8,9}}'

This constant is a two-dimensional 3x3 array consisting of three integer sub-arrays. To set an element of an array

constant to NULL, create NULL for the element value. (NULL case can be changed.) If you want the actual string value

``NULL,'' use double quotation marks before and after NULL. (Constants are initially processed as strings and passed

to the array input conversion routine, which may require explicit type speciications.)

The following is an example of INSERT statement and its execution result:

48 AgensGraph Developer Manual

INSERT INTO sal_emp

VALUES ('Bill',

'{10000, 10000, 10000, 10000}',

'{{"meeting", "lunch"}, {"training", "presentation"}}');

INSERT INTO sal_emp

VALUES ('Carol',

'{20000, 25000, 25000, 25000}',

'{{"breakfast", "consulting"}, {"meeting", "lunch"}}');

SELECT *FROM sal_emp;

name | pay_by_quarter | schedule

-------+---------------------------+-------------------------------------------

Bill | {10000,10000,10000,10000} | {{meeting,lunch},{training,presentation}}

Carol | {20000,25000,25000,25000} | {{breakfast,consulting},{meeting,lunch}}

(2rows)

A multidimensional array must have a matching range for each dimension, and an inconsistency will cause the fol-

lowing error:

INSERT INTO sal_emp

VALUES ('Bill',

'{10000, 10000, 10000, 10000}',

'{{"meeting", "lunch"}, {"meeting"}}');

ERROR: multidimensional arrays must have array expressions with matching dimensions

The constructor syntax of ARRAY can also be used.

INSERT INTO sal_emp

VALUES ('Bill',

ARRAY[10000,10000,10000,10000],

ARRAY[['meeting','lunch'], ['training','presentation']]);

INSERT INTO sal_emp

VALUES ('Carol',

ARRAY[20000,25000,25000,25000],

ARRAY[['breakfast','consulting'], ['meeting','lunch']]);

Data Type 49

An array element is a regular SQL constant or expression. A string literal is enclosed in single quotes instead of dou-

ble quotes as in an array literal.

3.8.3 Accessing Arrays

You can run some queries from the table created above. Access a single element of an array. The following query re-

trieves the name of the employee whose salary has been changed in the second quarter.

SELECT name FROM sal_emp WHERE pay_by_quarter[1] <> pay_by_quarter[2];

name

-------

Carol

(1row)

Array numbers are enclosed in square brackets; use array notation starting from 1. That is, the array of n elements

starts at array[1] and ends at array[n].

This query retrieves the salary of all employees for the third quarter.

SELECT pay_by_quarter[3]FROM sal_emp;

pay_by_quarter

----------------

10000

25000

(2rows)

You can also access any rectangular slice in an array or subarray. An array slice is represented using lower-bound:upper-

bound for one or more array dimensions.

For example, the following query retrieves the irst item in Bill's schedule on the irst two days of the week.

SELECT schedule[1:2][1:1]FROM sal_emp WHERE name = 'Bill';

schedule

------------------------

{{meeting},{training}}

(1row)

50 AgensGraph Developer Manual

If you create a dimension as a slice (for example, including a colon), all dimensions are processed as slices. A dimen-

sion with only a single number (no colon) is processed as being from 1 to the speciied number. For example, [2] is

processed as [1: 2], as follows:

SELECT schedule[1:2][2]FROM sal_emp WHERE name = 'Bill';

schedule

-------------------------------------------

{{meeting,lunch},{training,presentation}}

(1row)

To avoid confusion in non-slice cases, it is best to use the slice syntax for all dimensions such as [1:2] [1:1] rather

than [2] [1:1].

You can omit the lower-bound or upper-bound of the slice speciier. The missing bounds are replaced by lower or

upper of the array, as shown in the following example:

SELECT schedule[:2][2:] FROM sal_emp WHERE name = 'Bill';

schedule

------------------------

{{lunch},{presentation}}

(1row)

SELECT schedule[:][1:1]FROM sal_emp WHERE name = 'Bill';

schedule

------------------------

{{meeting},{training}}

(1row)

The array subscript expression returns null if the array itself or subscript expression is null. In addition, null is re-

turned if the subscript is out of the bounds of the array (in which case no errors are produced). For example, sched-

ule [3][3], which is referenced when schedule is the current dimension [1:3] [1:2], prints NULL. Similarly, an array

referencing an incorrect number of subscripts prints null, not an error.

Array slice expressions similarly return null if the array itself or the subscript expression is null. However, in other

cases (e.g. selecting an array slice that is completely out of the bounds of the current array), the slice expression out-

puts an empty (zero-dimensional) array instead of null. If the requested slice partially overlaps the array bounds, it

Data Type 51

is reduced to an overlap region instead of returning null.

The current dimensions of array values can be retrieved using array_dims function.

SELECT array_dims(schedule) FROM sal_emp WHERE name = 'Carol';

array_dims

------------

[1:2][1:2]

(1row)

array_dims produces a text result, which is more readable to humans, but inconvenient to the program. Dimensions

can also be retrieved using array_upper and array_lower, which return the upper and lower bounds of the speciied

array dimension, respectively.

SELECT array_upper(schedule, 1)FROM sal_emp WHERE name = 'Carol';

array_upper

-------------

(1row)

array_length returns the length of the speciied array dimension.

SELECT array_length(schedule, 1)FROM sal_emp WHERE name = 'Carol';

array_length

--------------

(1row)

cardinality returns the total number of elements in an array of all dimensions; it is actually the number of rows gen-

erated by the call on unnest.

SELECT cardinality(schedule) FROM sal_emp WHERE name = 'Carol';

cardinality

-------------

(1row)

52 AgensGraph Developer Manual

3.8.4 Modifying Arrays

You may update the array value as a whole;

UPDATE sal_emp SET pay_by_quarter = '{25000,25000,27000,27000}'

WHERE name = 'Carol';

or use ARRAY expression syntax.

UPDATE sal_emp SET pay_by_quarter = ARRAY[25000,25000,27000,27000]

WHERE name = 'Carol';

An array can update a single element;

UPDATE sal_emp SET pay_by_quarter[4] = 15000

WHERE name = 'Bill';

or update it only partially.

UPDATE sal_emp SET pay_by_quarter[1:2] = '{27000,27000}'

WHERE name = 'Carol';

The omitted lower-bound or upper-bound slice syntax can be used only when updating NULL or nonzero array val-

ues.

The stored array value can be expanded by assigning it to an element that does not yet exist. The position between

the previously existing element and the newly allocated element is illed with null. For example, if the array myarray

currently has 4 elements, it will have 6 elements after an update of assigning to myarray [6]. myarray [5] contains

null. Currently, the expansion in this manner is only allowed for one-dimensional arrays (not multidimensional ar-

rays). Subscripted assignments allow you to create arrays whose subscripts do not start at one (1). For example, to

create an array with subscript values -2 through 7, you may assign it to myarray [-2:7].

The new array value can also be created using the concatenation operator ||.

SELECT ARRAY[1,2] || ARRAY[3,4]as array;

array

-----------

{1,2,3,4}

(1row)

SELECT ARRAY[5,6] || ARRAY[[1,2],[3,4]] as array;

Data Type 53

array

---------------------

{{5,6},{1,2},{3,4}}

(1row)

The concatenation operator lets you push a single element into the beginning or end of a one-dimensional array. It

accommodates two N-dimensional arrays (or N-dimensional and N+1-dimensional arrays).

If you push a single element to the beginning or end of a one-dimensional array as shown in the following example,

the result is an array illed with the same lower bound subscript as the array operand.

SELECT array_dims(1|| '[0:1]={2,3}'::int[]);

array_dims

------------

[0:2]

(1row)

SELECT array_dims(ARRAY[1,2] || 3);

array_dims

------------

[1:3]

(1row)

When concatenating two arrays having the same number of dimensions, the result holds the lower bound of the

outer dimension of the left operand. The result is that all the elements of the right operand are followed by an array

of all the elements of the left operand, as the following example shows:

SELECT array_dims(ARRAY[1,2] || ARRAY[3,4,5]);

array_dims

------------

[1:5]

(1row)

SELECT array_dims(ARRAY[[1,2],[3,4]] || ARRAY[[5,6],[7,8],[9,0]]);

array_dims

------------

[1:5][1:2]

(1row)

54 AgensGraph Developer Manual

If you push an N-dimensional array from the beginning to the end of an N+1-dimensional array, the result is similar

to the element array example above. Each N-dimensional subarray is essentially an element of the outer dimension

of the N+1-dimensional array, as shown in the following example:

SELECT array_dims(ARRAY[1,2] || ARRAY[[3,4],[5,6]]);

array_dims

------------

[1:3][1:2]

(1row)

You can also create functions using the functions array_prepend, array_append, or array_cat; while the irst two only

support one-dimensional arrays, array_cat supports multidimensional arrays. The concatenation operator discussed

above prefers the direct use of these functions. In effect, these functions exist primarily for use when implementing a

concatenation operator. However, they can also be useful when creating user-deined aggregates. For example:

SELECT array_prepend(1,ARRAY[2,3]);

array_prepend

---------------

{1,2,3}

(1row)

SELECT array_append(ARRAY[1,2], 3);

array_append

--------------

{1,2,3}

(1row)

SELECT array_cat(ARRAY[1,2], ARRAY[3,4]);

array_cat

-----------

{1,2,3,4}

(1row)

SELECT array_cat(ARRAY[[1,2],[3,4]], ARRAY[5,6]);

array_cat

---------------------

{{1,2},{3,4},{5,6}}

Data Type 55

(1row)

SELECT array_cat(ARRAY[5,6], ARRAY[[1,2],[3,4]]);

array_cat

---------------------

{{5,6},{1,2},{3,4}}

In a simple case, the concatenation operator described above is preferred over using these functions directly. How-

ever, using one of these functions may help avoid ambiguity, since the concatenation operator is overloaded to pro-

cess all three cases.

SELECT ARRAY[1,2] || '{3, 4}' as array;-- the untyped literal is taken as an array

array

-----------

{1,2,3,4}

SELECT ARRAY[1,2] || '7';-- so is this one

ERROR: malformed array literal: "7"

SELECT ARRAY[1,2] || NULL as array;-- so is an undecorated NULL

array

-------

{1,2}

(1row)

SELECT array_append(ARRAY[1,2], NULL); -- this might have been meant

array_append

--------------

{1,2,NULL}

In the above example, parser sees an integer array on one side of the concatenation operator, and a constant of inde-

terminate type on the other. An empirical method used to analyze the constant type is to assume it is the same type

as the other inputs of the operator (in this case, an integer array). So the concatenation operator is considered ar-

ray_cat, not array_append. If that is a wrong choice, you can modify the constant by casting it to an element type of

the array. Using array_append can be a desirable solution.

56 AgensGraph Developer Manual

3.8.5 Searching in Arrays

To retrieve the value of an array, you should check each value; you may do this if you know the size of the array. For

example:

SELECT *FROM sal_emp WHERE pay_by_quarter[1] = 10000 OR

pay_by_quarter[2] = 10000 OR

pay_by_quarter[3] = 10000 OR

pay_by_quarter[4] = 10000;

However, in large arrays, it quickly gets bored and is not helpful when the array size is unknown. The above query

can be replaced by:

SELECT *FROM sal_emp WHERE 10000 =ANY (pay_by_quarter);

You can also ind all the rows with an array value equal to 10000:

SELECT *FROM sal_emp WHERE 10000 =ALL (pay_by_quarter);

Alternatively, you can use the generate_subscripts function.

SELECT *FROM

(SELECT pay_by_quarter,

generate_subscripts(pay_by_quarter, 1)AS s

FROM sal_emp) AS foo

WHERE pay_by_quarter[s] = 10000;

It is also possible to search for an array using && operator, which checks whether the left operand overlaps with the

right operand.

SELECT *FROM sal_emp WHERE pay_by_quarter && ARRAY[10000];

You can also use array_position and array_positions functions to retrieve a speciic value in an array. The former re-

turns the subscript of the irst value in the array, and the latter returns an array with all the subscripts of the values

in the array.

SELECT array_position(ARRAY['sun','mon','tue','wed','thu','fri','sat'], 'mon');

array_positions

-----------------

Data Type 57

SELECT array_positions(ARRAY[1,4,3,1,3,4,2,1], 1);

array_positions

-----------------

{1,4,8}

3.8.6 Array Input and Output Syntax

The external text expression of an array value consists of the I/O conversion rules for the array element type and the

items that are interpreted according to the decoration denoting the array structure. Decoration consists of braces at

both ends of the value of an array and delimiters between adjacent items. The delimiters are usually commas (,), but

can be something else. It is determined by the typedelim setting for the element type of the array. All of the standard

data types use commas, except for type box that uses semicolons (;). In a multidimensional array, each dimension

(row, plane, cube, etc.) has its own level of braces and delimiters must be used between adjacent levels of brace enti-

ties.

Array output routines use double quotes around element values if they are empty strings, contain braces, delimiters,

double quotes, backslashes, or whitespace, or match the word NULL. Double quotes and backslashes embedded in

element values are escaped by a backslash. In the case of numeric data types, it is safe to assume that double quotes

never appear. However, for text data types you must cope with presence or absence of quotation marks. By default,

the lower bound index value is set to 1 in the dimension of the array. If you specify an array using another lower

bound, the range of array subscripts can be explicitly speciied before creating the array content. This type of dec-

oration consists of brackets ([]) before and after the lower/upper limits of each array dimension and colons (:) as

space separator s. An array dimension decoration is followed by an equal sign (=).

SELECT f1[1][-2][3]AS e1, f1[1][-1][5]AS e2

FROM (SELECT '[1:1][-2:-1][3:5]={{{1,2,3},{4,5,6}}}'::int[] AS f1) AS ss;

e1 | e2

----+----

1|6

(1row)

The array output routine includes an explicit dimension in the result only if there is more than one lower bound.

If the value created for an element is NULL (in the case of variation), the element is considered to be NULL. Presence

of quotes or backslashes disables this and allows input of the literal string value ``NULL.'' You may also set the ar-

ray_nulls coniguration parameter to off to prevent NULL from being recognized as NULL. As indicated earlier, you

can use double quotes around individual array elements when creating array values. This should be done in the case

58 AgensGraph Developer Manual

where the element values can be confused by the array value parser. For example, elements that contain braces,

commas (or data type separators), double quotes, backslashes, and leading or trailing whitespace must use double

quotes. Empty strings and strings that match NULL must also be quoted. To add double quotes or backslashes into

the quoted array element values, use an escape string syntax and precede it with a backslash. You may also use back-

slash escaping to avoid quotation marks and protect all data characters that might be processed incorrectly as array

syntax.

You may add a space before the left brace or after the right brace. You can also add a space before or after the indi-

vidual item string. Spaces are ignored in all these cases. Whitespaces within double-quote elements and spaces at

both ends of non-whitespace characters in elements are ignored.

3.9 Range Types

The range type is a data type that represents the range of values of some element type. For example, the range of

timestamps can be used to indicate the reserved time range of a meeting room. In this case, the data type is tsrange

(short for ``timestamp range'') and timestamp is subtype. The subtype must have a total order so that whether the

element value is within, before, or after the value range can be well-deined.

The range types are useful in that they can represent multiple element values as a single range value and clearly ex-

press concepts like the overlap range. One of the most obvious examples is to use time and date ranges for schedul-

ing. These types can also be useful for price ranges, measuring ranges of instruments, etc.

3.9.1 Built-in Range Types

The following built-in range types are provided:

int4range - Range of integer

int8range - Range of bigint

numrange - Range of numeric

tsrange - Range of timestamp without time zone

tstzrange - Range of timestamp with time zone

daterange - Range of date

You can also deine your own range types. See User-deined Type for more information.

3.9.2 Examples

CREATE TABLE reservation (room int, during tsrange);

INSERT INTO reservation VALUES

(1108,'[2010-01-01 14:30, 2010-01-01 15:30)');

Data Type 59

-- Containment

SELECT int4range(10,20) @> 3;

-- Overlaps

SELECT numrange(11.1,22.2) && numrange(20.0,30.0);

-- Extract the upper bound

SELECT upper(int8range(15,25));

-- Compute the intersection

SELECT int4range(10,20) * int4range(15,25);

-- Is the range empty?

SELECT isempty(numrange(1,5));

3.9.3 Inclusive and Exclusive Bounds

All non-empty ranges have two bounds, a lower bound and an upper bound. All points between these values are in-

cluded in the range. Inclusive bounds mean that the boundary points themselves are included in the range, and ex-

clusive bounds mean that the boundary points are not included in the range. In the text form of the range, the inclu-

sive lower bound is expressed as ``[" and the exclusive lower bound is expressed as ``(.'' Similarly, the inclusive upper

bound is expressed as ``]'', and the exclusive upper bound is expressed as ``).''

The functions lower_inc and upper_inc test the lower and upper bounds of the range value, respectively.

3.9.4 Innite (Unbounded) Ranges

The lower bound of the range can be omitted, meaning that all points below the upper bound are included in the

range. Likewise, if the upper bound of the range is omitted, all points above the lower bound are included in the

range. If both the lower and upper bounds are omitted, all values of the element type are considered to be included

in the range.

This corresponds to considering that the lower bound is ``negative ininity'' or the upper bound is ``positive inin-

ity.'' Note, however, that this ininite value is never a value of the element type of the range, and cannot be part of the

range. (Therefore, there is no such thing as inclusive ininite bounds; when you try to create one, it is automatically

converted to an exclusive bound).

It is true that some element types have an ``ininite'' notation, but it is another different value in relation to the range

60 AgensGraph Developer Manual

type mechanism. For example, in the timestamp range, [today,] is the same as [today,). However, [today, ininity] can

be sometimes different from [today, ininity). The latter excludes the special timestamp value ininity.

The functions lower_inf and upper_inf test ininite lower/upper bounds of each range, respectively.

3.9.5 Range Input/Output

The input for the range value should follow one of the following patterns:

(lower-bound,upper-bound)

(lower-bound,upper-bound]

[lower-bound,upper-bound)

[lower-bound,upper-bound]

empty

The parentheses or square brackets indicate whether the lower and upper bounds are excluded or included as de-

scribed above. The last pattern is empty, indicating an empty range (a range with no points). The lower-bound can

be a string that is a valid input to a subtype, or can be left empty if there is no lower bound. Similarly, upper-bound

can be a string that is a valid input to a subtype, or can be left empty if there is no upper bound. Each boundary value

can be quoted using ``(double quote) characters; this is a must since, if the boundary value contains parentheses,

square brackets, commas, double quotes, or backslashes, these characters may be mistaken for part of the range syn-

tax. To insert double quotes or backslashes to quoted boundary values, you must precede them with a backslash.

(Double quotation pairs within boundary values in double quotes are also processed as double quotation marks,

similar to the rules for single quotation marks in SQL literal strings.) To protect all data characters that might be pro-

cessed incorrectly with the range syntax, you may avoid quoting and use backslash escaping. In addition, when cre-

ating a boundary value that is an empty string, you should write''``; if you do not enter anything, it will mean ininite

boundary. Whitespaces are allowed before and after the range values, but spaces between parentheses or square

brackets are considered to be part of the lower or upper bound value. (It may or may not be important depending on

the element type).

Examples:

-- includes 3, does not include 7, and does include all points in between

SELECT '[3,7)'::int4range;

-- does not include either 3 or 7, but includes all points in between

SELECT '(3,7)'::int4range;

-- includes only the single point 4

Data Type 61

SELECT '[4,4]'::int4range;

-- includes no points (and will be normalized to 'empty')

SELECT '[4,4)'::int4range;

3.9.6 Constructing Ranges

Each range type has a constructor function with the same name as the range type. Using a constructor function is

more convenient than writing a range literal constant, because you do not need to quote additional boundary val-

ues. A constructor function accepts two or three arguments. While the three-argument form creates a range from

the third argument to the boundary of the form speciied, the two-argument form creates a range of standard forms

(lower bound, excluding upper bound). The third argument must be one of the followings strings: ``()'', ``(]'', ``[]'' or

``[]''

Examples:

-- The full form is: lower bound, upper bound, and text argument indicating

-- inclusivity/exclusivity of bounds.

SELECT numrange(1.0,14.0,'(]');

-- If the third argument is omitted, '[)' is assumed.

SELECT numrange(1.0,14.0);

-- Although '(]' is specified here, on display the value will be converted to

-- canonical form, since int8range is a discrete range type (see below).

SELECT int8range(1,14,'(]');

-- Using NULL for either bound causes the range to be unbounded on that side.

SELECT numrange(NULL,2.2);

3.9.7 Discrete Range Types

The discrete range is a well-deined ``step-by-step'' type of which element type is integer or date. In this type, if there

is no valid value between two elements, they can be said to be adjacent. In contrast to the continuous range, this is

always (or almost always) possible to recognize different element values between the two given values. For exam-

ple, a range beyond the numeric type, like the range beyond timestamp, is continuous. (timestamp can be processed

discretely in theory because of its precision limitations, but it is better to regard it as a sequence when the size of

62 AgensGraph Developer Manual

the step is not of interest.) Another way to think about the discrete range type is to have a clear idea of the ``next''

or ``previous'' value for each element value. It should be noted that, by selecting the next or previous element value

rather than the given original, it is possible to convert between expressions including the boundaries of the ranges

and expressions excluding the boundaries of the ranges. For example, integer range types [4,8] and (3,9) denote the

same set of values. This is not the case, however, for numerical ranges. The discrete range type must have a canon-

icalization function that recognizes the desired step size for the element type. The canonicalization function is es-

pecially responsible for converting the values equally to the range types that have an identity representation of a

consistent inclusion or exclusion range. Unless a canonicalization function is speciied, the ranges of different types

are always processed as non-equivalence, even though they actually denote the same set of values. The built-in range

types, int4range, int8range, and daterange, use the canonical form, which includes the lower bound and excludes the

upper bound (i.e.``[)''). User deined range types may use other notations.

3.9.8 Dening New Range Types

You may deine your own range type. The most common reason for doing this is to use a subtype range that is not

provided as a built-in range type. This is an example of deining a new range type for the subtype loat8.

CREATE TYPE floatrange AS RANGE (

subtype = float8,

subtype_diff = float8mi

);

SELECT '[1.234, 5.678]'::floatrange;

Since loat8 is not a meaningful ``step'', we do not deine a canonicalization function in this example. If the subtype

has a discrete value rather than a continuous value, CREATE TYPE command must specify a canonical function. The

canonicalization function must take an input range value and return an equivalent range value that may be different

from the boundary type. The canonical outputs of the two ranges representing the same set of values (e.g. integer

range [1, 7] and [1, 8)) should be the same. It does not matter which expression is chosen to be canonical, as long as

two equivalent values of the same type are always mapped to the same value of the same type. Besides controlling

inclusive/exclusive bounds format, the canonicalization function can process boundary values well if the desired

step size is greater than the subtype's storable size. For example, the step size of a range type beyond timestamp

can be deined as time. On this occasion, the canonicalization function may require rounding-off if it is not a multi-

ple of the time, or an error may occur instead. If you deine your own range type, you may specify different subtype

B-tree operator classes or collations to use in order to change the sort order that determines which values belong to

a given range. In addition, a range type to be used in a GiST or SP-GiST index should deine a subtype difference or

subtype_diff, a function(An index works without subtype_diff, but is much less eficient when a difference function is

Data Type 63

provided). The subtype difference function takes two input values of a subtype and returns the difference expressed

as a loat8 value (e.g. X minus Y). In the above example, you may use a function that underlies the normal loat8 sub-

traction operator, but for other subtypes, type conversions may be required. Several creative ideas about how to rep-

resent differences in numbers may be needed. To the greatest range possible, the subtype_diff function must agree

on the sort order implied by the selected operator class es and collations. That is, the result of this should always be

a positive value when the irst argument is greater than the second argument according to the sort order.

CREATE FUNCTION time_subtype_diff(x time, y time) RETURNS float8 AS

'SELECT EXTRACT(EPOCH FROM (x - y))' LANGUAGE sql STRICT IMMUTABLE;

CREATE TYPE timerange AS RANGE (

subtype = time,

subtype_diff = time_subtype_diff

);

SELECT '[11:10, 23:00]'::timerange;

For more information on creating range types, see User-deined Type.

3.9.9 Indexing

GiST and SP-GiST indexes are able to generate table columns of range type. The following is an example of creating a

GiST index.

CREATE INDEX reservation_idx ON reservation USING GIST (during);

GiST or SP-GiST indexes can speed up queries involving range operators such as =, &&, <@, @>, <<, >>, -|-, &<, and

&>.

B-tree and hash indexes may create table columns of range type as well. For these index types, the only useful range

operation is basically ``=''. Even if there is a B-tree sort order that uses ``<'' and ``>'' operators and is deined for

range values, the order itself is arbitrary and not very useful in the real world. The B-tree and hash support for range

types is intended primarily to allow sorting and hashing inside queries rather than creating actual indexes.

3.9.10 Constraints on Ranges

UNIQUE is a natural constraint on scalar values. However, it is appropriate not for range types but for exclusion con-

straints, mainly. An exclusion constraint allows the speciication of constraints such as ``nonoverlap'' in range types.

Examples:

64 AgensGraph Developer Manual

CREATE TABLE reservation (

during tsrange,

EXCLUDE USING GIST (during WITH &&)

);

The constraint prevents overlapping values from being simultaneously present in the table.

INSERT INTO reservation VALUES

('[2010-01-01 11:30, 2010-01-01 15:00)');

INSERT 0 1

INSERT INTO reservation VALUES

('[2010-01-01 14:45, 2010-01-01 15:45)');

ERROR: conflicting key value violates exclusion constraint "reservation_during_excl"

DETAIL: Key (during)=(["2010-01-01 14:45:00","2010-01-01 15:45:00")) conflicts

with existing key (during)=(["2010-01-01 11:30:00","2010-01-01 15:00:00")).

You can use the btree_gist extension to deine an exclusion constraint on a regular scalar data type, which makes it

possible to exclude/combine the range with the maximum lexibility. For instance, after btree_gist is installed, the

following constraint rejects overlapping ranges only when the number of meeting rooms is equal.

CREATE EXTENSION btree_gist;

CREATE TABLE room_reservation (

room text,

during tsrange,

EXCLUDE USING GIST (room WITH =, during WITH &&)

);

INSERT INTO room_reservation VALUES

('123A','[2010-01-01 14:00, 2010-01-01 15:00)');

INSERT 0 1

INSERT INTO room_reservation VALUES

('123A','[2010-01-01 14:30, 2010-01-01 15:30)');

ERROR: conflicting key value violates exclusion constraint "room_reservation_room_during_excl"

DETAIL: Key (room, during)=(123A, ["2010-01-01 14:30:00","2010-01-01 15:30:00")) conflicts

with existing key (room, during)=(123A, ["2010-01-01 14:00:00","2010-01-01 15:00:00")).

Data Type 65

INSERT INTO room_reservation VALUES

('123B','[2010-01-01 14:30, 2010-01-01 15:30)');

INSERT 0 1

3.10 User-dened Type

You can add a new type using CREATE TYPE command. There are ive types of CREATE TYPE: Composite Type, Enum

Type, Range Type, Base Type, and Shell Type.

Syntex :

CREATE TYPE name AS

( [ attribute_name data_type [ COLLATE collation ] [, ... ] ] )

CREATE TYPE name AS ENUM

( [ 'label' [, ... ] ] )

CREATE TYPE name AS RANGE (

SUBTYPE = subtype

[ , SUBTYPE_OPCLASS = subtype_operator_class ]

[ , COLLATION = collation ]

[ , CANONICAL = canonical_function ]

[ , SUBTYPE_DIFF = subtype_diff_function ]

)

CREATE TYPE name (

INPUT = input_function,

OUTPUT = output_function

[ , RECEIVE = receive_function ]

[ , SEND = send_function ]

[ , TYPMOD_IN = type_modifier_input_function ]

[ , TYPMOD_OUT = type_modifier_output_function ]

[ , ANALYZE = analyze_function ]

[ , INTERNALLENGTH = { internallength | VARIABLE } ]

[ , PASSEDBYVALUE ]

[ , ALIGNMENT = alignment ]

66 AgensGraph Developer Manual

[ , STORAGE =storage ]

[ , LIKE = like_type ]

[ , CATEGORY =category ]

[ , PREFERRED = preferred ]

[ , DEFAULT =default ]

[ , ELEMENT = element ]

[ , DELIMITER = delimiter ]

[ , COLLATABLE = collatable ]

)

CREATE TYPE name

Examples :

This is an example of creating a composite type and using it for function deinition.

CREATE TYPE compfoo AS (f1 int, f2 text);

CREATE FUNCTION getfoo() RETURNS SETOF compfoo AS $$

SELECT fooid, fooname FROM foo

$$ LANGUAGE SQL;

This is an example of creating an enum type and using it for table deinition.

CREATE TYPE bug_status AS ENUM ('new','open','closed');

CREATE TABLE bug (

id serial,

description text,

status bug_status

);

This is an example of creating a range type.

CREATE TYPE float8_range AS RANGE (subtype = float8, subtype_diff = float8mi);

This is an example of creating a base type and then using it for table deinition.

Data Type 67

CREATE TYPE box;

CREATE FUNCTION my_box_in_function(cstring) RETURNS box AS ... ;

CREATE FUNCTION my_box_out_function(box) RETURNS cstring AS ... ;

CREATE TYPE box (

INTERNALLENGTH = 16,

INPUT = my_box_in_function,

OUTPUT = my_box_out_function

);

CREATE TABLE myboxes (

id integer,

description box

);

68 AgensGraph Developer Manual

4 Functions

4.1 Graph Database functions

4.1.1 Aggregation functions

Create the data to be used in the example.

CREATE (:person {name: 'Elsa', age: 20});

CREATE (:person {name: 'Jason', age: 30});

CREATE (:person {name: 'James', age: 40});

CREATE (:person {name: 'Daniel', age: 50});

• avg()

Returns the average of numeric values.

MATCH (v:person)

RETURN avg(v.age);

• collect()

Returns a list containing the values returned by the expression; aggregates data by merging multiple records

or values into a single list.

MATCH (v:Person)

RETURN collect(v.age);

• count()

Prints the number of result rows; able to print the number or properties of vertices and edges and can be given

an alias.

MATCH (v:person)

RETURN count(v);

MATCH (v:person)-[k:knows]->(p)

RETURN count(*);

Functions 69

MATCH (v:person)

RETURN count(v.name) AS CNT;

• min()/max()

Takes the numeric attribute as input, returns the minimum/maximum values to the corresponding column.

MATCH (v:person)

RETURN max(v.age);

MATCH (v:person)

RETURN min(v.age);

• stDev()

Returns the standard deviation. The stDev function returns the standard deviation of the sample population

and must cast the property.

MATCH (v:person)

RETURN stDev(v.age);

• stDevP()

Returns the standard deviation. The stDevP function returns the standard deviation of the sample population

and must cast the property.

MATCH (v:person)

RETURN stDevP(v.age);

• sum()

Returns the sum of the numeric values. As it is the sum of the numeric values. The property must be cast.

MATCH (v:person)

RETURN sum(v.age);

4.1.2 Predicates functions

• all()

Returns true if all elements in the list satisfy the condition.

70 AgensGraph Developer Manual

RETURN ALL(x in [] WHERE x = 0);

• any()

Returns true if at least one element in the list satisies the condition.

RETURN ANY(x in [0]WHERE x = 0);

• none()

Returns true if no elements in the list satisfy the condition.

RETURN NONE(x in [] WHERE x = 0);

• single()

Returns true if a single function satisies only a condition in the list.

RETURN SINGLE(x in [] WHERE x = 0);

4.1.3 Scalar functions

• coalesce()

Returns the irst non-null value in the list.

CREATE (:person {name: 'Jack', job: 'Teacher'});

MATCH (a)

WHERE a.name = 'Jack'

RETURN coalesce(a.age, a.job);

• endNode()

Returns the last node in the relationship.

CREATE vlabel Developer;

CREATE vlabel language;

CREATE elabel be_good_at;

CREATE (:Developer {name: 'Jason'})-[:be_good_at]->(:language {name: 'C'});

CREATE (:Developer {name: 'David'})-[:be_good_at]->(:language {name: 'JAVA'});

MATCH (x:Developer)-[r]-()

RETURN endNode(r);

Functions 71

• head()

Returns the irst element in the list.

CREATE (:person {name: 'Richard', array: [ 1,2,3]});

MATCH(a)

where a.name = 'Richard'

RETURN a.array, head(a.array);

• id()

Returns the relationship or id of the node; returns the node id for all nodes speciied in the argument.

MATCH (a)

RETURN id(a);

• last()

Returns the last element in the list.

MATCH (a)

WHERE a.name = 'Richard'

RETURN a.array, last(a.array);

• length()

Returns the length of a string or path. If you specify the property of a string or a string type as an argument,

the number of characters in the string is returned.

RETURN length('string');

MATCH (a:person)

WHERE length(a.name) > 4

RETURN a.name;

• properties()

Converts the arguments to a list of key/value mappings. If the argument is already a key/value mapped list, it

is returned unchanged.

72 AgensGraph Developer Manual

CREATE (p:Person { name: 'Stefan', city: 'Berlin' })

RETURN properties(p);

• startNode()

Returns the starting node of the relationship.

MATCH (x:Developer)-[r]-()

RETURN startNode(r);

• toBoolean()

Converts a string to a boolean.

RETURN toBoolean('TRUE'), toBoolean('FALSE');

• type()

Returns the elabel of the edge passed as an argument. If the elabel of the edge also inherits another elabel, it

returns a parent elabel as well. You should be careful when passing arguments to the type function; when you

ind an edge that matches the pattern using MATCH clause, assign a variable, and then pass the variable as an

argument, the edge itself cannot be passed as an argument to the type function, but must always be passed as a

variable.

CREATE elabel loves;

CREATE (:person {name: 'Adam'})-[:loves]->(:person {name: 'Eve'});

MATCH ({name: 'Adam'})-[r]->({name: 'Eve'})

RETURN type(r);

4.1.4 List functions

• keys()

Returns a list containing strings for all attribute names of nodes, relationships, and maps.

MATCH (a)

WHERE a.name = 'Jack'

RETURN keys(a);

• labels()

Functions 73

Returns vlabel of the vertex passed as an argument. You should be careful when passing arguments to the la-

bel function; when you ind a vertex that matches the pattern using MATCH clause, assign a variable, and pass

that variable as an argument, the vertex itself cannot be passed as an argument to the label function, but must

always be passed as a variable.

MATCH (a)

WHERE a.name='Jack'

RETURN labels(a);

• nodes()

Returns a vertex that exists in the path passed as an argument. You should be careful when passing arguments

to the nodes function; when you ind a path that matches the pattern using MATCH clause, assign a variable,

and pass that variable as an argument, the path itself cannot be passed as an argument to the nodes function,

but must always be passed as variable. When used with the length function, the number of vertices in the path

can be found.

MATCH p = (a)-[r]->(b)

WHERE a.name = 'Adam' and b.name = 'Eve'

RETURN nodes(p);

MATCH p = (a)-[r]->(b)

WHERE a.name = 'Adam' and b.name = 'Eve'

RETURN length(nodes(p));

• relationships()

Returns the edges present in the path passed as an argument. You should be careful when passing arguments

to the relationships function; when you ind a path that matches the pattern using MATCH clause, assign a

variable, and pass that variable as an argument, the path itself cannot be passed as an argument to the rela-

tionships function, but must always be passed as variable. When used with the count function, the number of

edges in the path can be found.

MATCH p = (a)-[r]->(b)

WHERE a.name = 'Adam' and b.name = 'Eve'

RETURN relationships(p);

MATCH p = (a)-[r]->(b)

WHERE a.name = 'Adam' and b.name = 'Eve'

74 AgensGraph Developer Manual

RETURN count(relationships(p));

• tail()

Returns a list result that contains all elements except the irst element in the list.

MATCH (a)

WHERE a.name = 'Richard'

RETURN a.array, tail(a.array);

4.1.5 Mathematics functions

Number

• abs()

Returns a numeric value passed as an argument. It may be passed as a decimal number or as a subtraction.

The MATCH clause can be used to ind a speciic element and pass a subtraction of the properties, which are

numeric values, among the properties of the elements.

RETURN abs(-3.14);

RETURN abs(20-45);

MATCH (a {name:'Jack'}), (b {name:'Emily'})

RETURN abs(a.age-b.age);

• ceil(), loor(), round()

The ceil function rounds the numeric value passed as an argument to the irst decimal place. The loor function

returns the numeric value passed as an argument to the irst decimal place. The round function rounds the

numeric value passed as the argument to the irst decimal place.

RETURN ceil(3.1);

RETURN ceil(1);

RETURN ceil(-12.19);

RETURN floor(3.1);

RETURN floor(1);

RETURN floor(-12.19);

Functions 75

RETURN round(3.1);

RETURN round(3.6);

RETURN round(-12.19);

RETURN round(-12.79);

• rand()

Returns an arbitrary loating-point number between 0 and 1.

RETURN rand();

• sign()

Returns the sign of the numeric value passed as an argument; returns `1' if the argument passed is positive, `-1'

if negative, and `0' if zero.

RETURN sign(25);

RETURN sign(-19.93);

RETURN sign(0);

Logarithmic

• log()

Returns the log value of the input expression.

RETURN log(27);

• log10()

Returns a common logarithm of the entered expression (default 10).

RETURN log10(27);

• exp()

The exp function returns a power value to base (e) exponentiated by the numeric value passed as an argu-

ment. That is, exp(1) returns eˆ1≒2.71828182845905, exp(2) returns eˆ2≒7.38905609893065, and exp(-1),

eˆ-1≒0.367879441171442.

76 AgensGraph Developer Manual

RETURN exp(1);

RETURN exp(2);

RETURN exp(-1);

• sqrt()

Returns the square root of the numeric value passed as an argument. The sqrt function cannot pass a negative

number as an argument.

RETURN sqrt(25);

Trigonometric

• sin()/cos()/tan()

The sin, cos, and tan functions return the sine, cosine, and tangent values of the numeric values passed as ar-

guments, respectively. sin(), cos(), and tan() print the values in radians; sind(), cosd(), and tand() are used to

print the values in degrees.

RETURN sin(0.5);

RETURN sin(-1.5);

RETURN cos(0);

RETURN cos(-1);

RETURN tan(0);

RETURN tan(15.2);

• cot()/asin()/acos()/atan()/atan2()

The cot function returns a cotangent value (inverse of tangent) of the numeric value passed as an argument,

the asin function returns an arcsine value (inverse of sine) of the numeric value passed as the argument, and

the acos function returns an arccosine value of the numeric value (inverse of cosine). The atan, atan2 functions

return the arctangent value (inverse of tangent) of the numeric value passed as an argument. The argument

range of the acos function is a numeric value between -1 and 1. atan2 has two arguments in order to make

the atan function more granular. Conceptually, atan2(a, b) is equivalent to atan (a/b). However, it is not clear

whether atan(-5) is atan(-15/3) or atan (15/(-3)). The trigonometric function requires a deinite distinction

because the argument is a radian value. Therefore, using atan2 rather than atan makes it more accurate.

Functions 77

RETURN cot(1.5);

RETURN asin(1);

RETURN acos(0.5);

RETURN atan(-1);

RETURN atan2(-1.5,1.3);

• pi()/degrees()/radians()

The pi function returns pi as a number. The degrees function takes the arguments passed as radians and re-

turns them to degrees. The radians function converts the arguments passed to degrees into radians.

RETURN pi();

RETURN degrees(12.3);

RETURN degrees(pi());

RETURN radians(180);

4.1.6 String functions

• replace()

If the second argument is contained in the irst argument, replace the second argument with the third argu-

ment.

RETURN replace('ABCDEFG','C','Z');

RETURN replace('ABCDEFG','CD','Z');

RETURN replace('ABCDEFG','C','ZX');

RETURN replace('ABCDEFG','CD','ZXY');

• substring()

Prints the irst argument from the nth digit (n is the number indicated by the second argument). The third ar-

gument indicates how many characters to be printed. If there is no third argument and it is greater than the

number of characters in the irst argument, it prints to the end.

78 AgensGraph Developer Manual

RETURN substring('ABCDEFG',2);

RETURN substring('ABCDEFG',2,3);

RETURN substring('ABCDEFG',4,10);

• left()/right()

The left function prints the irst argument from the left, and the right function prints characters up to length of

n (n is the number indicated by in the second argument) from the right. If the value of the second argument is

larger than the number of characters remaining, the number of characters remaining will be printed out.

RETURN left('AAABBB',3);

RETURN right('AAABBB',3);

• lTrim()/rTrim()

The lTrim function removes all left whitespace from the passed argument, and the rTrim function removes all

right whitespace before printing.

RETURN lTrim(' ABCD ');

RETURN rTrim(' ABCD ');

• toLower()/toUpper()

The toLower function converts all passed arguments to lower case and the toUpper function converts them to

upper case.

RETURN toLower('AbCdeFG');

RETURN toUpper('AbCdeFG');

• reverse()

Prints the arguments in reverse order.

RETURN reverse('ABCDEFG');

• toString()

Converts an integer, loating, or boolean value to a string.

RETURN toString(11.5), toString('already a string'), toString(TRUE);

• trim()

Returns the original string with the leading and trailing spaces removed.

Functions 79

RETURN trim(' hello ');

80 AgensGraph Developer Manual

4.2 Relational Database functions

4.2.1 Comparison functions

• num_nonnulls(VARIADIC ``any'')

Returns the number of non-null arguments.

SELECT num_nonnulls(1,NULL,2);

Result:

num_nonnulls

------------

(1row)

• num_nulls(VARIADIC ``any'')

Returns the number of null arguments.

SELECT num_nulls(1,NULL,2);

Result:

num_nonnulls

------------

(1row)

4.2.2 Mathematics functions

It provides various functions related to numbers, and the argument speciied as dp indicates double precision.

• abs(x)

Returns absolute value of the argument.

SELECT abs(-17.4);

Result:

abs

Functions 81

-----

17.4

(1row)

• cbrt(dp)

Returns cube root of the argument.

SELECT cbrt(27.0);

Result:

cbrt

------

(1row)

• ceil(dp or numeric) or ceiling(dp or numeric)

Returns nearest integer greater than or equal to argument.

SELECT ceil(-42.8);

Result:

ceil

------

-42

(1row)

• degrees(dp)

Converts radians to degrees.

SELECT degrees(0.5);

Result:

degrees

------------------

28.6478897565412

(1row)

• div(y numeric, x numeric)

82 AgensGraph Developer Manual

Returns integer quotient of y/x.

SELECT div(9,4);

Result:

div

-----

(1row)

• exp(dp or numeric)

Returns exponential.

SELECT exp(1.0);

Result:

exp

--------------------

2.7182818284590452

(1row)

• loor(dp or numeric)

Returns nearest integer less than or equal to the argument.

SELECT floor(-42.8);

Result:

floor

-------

-43

(1row)

• ln(dp or numeric)

Returns natural logarithm of the argument.

SELECT ln(2.0);

Result:

Functions 83

-------------------

0.693147180559945

(1row)

• log(dp or numeric)

Returns base 10 logarithm.

SELECT log(100.0);

Result:

log

--------------------

2.0000000000000000

(1row)

• log(b numeric, x numeric)

Returns logarithm to base b.

SELECT log(2.0,64.0);

Result:

log

--------------------

6.0000000000000000

(1row)

• mod(y, x)

Returns remainder of y/x.

SELECT mod(9,4);

Result:

mod

-----

(1row)

84 AgensGraph Developer Manual

• pi()

Returns ``π'' constant.

SELECT pi();

Result:

------------------

3.14159265358979

(1row)

• power(a dp, b dp) or power(a numeric, b numeric)

Returns a raised to the power of b.

SELECT power(9.0,3.0);

Result:

power

--------------------

729.00000000000000

(1row)

• radians(dp)

Converts degrees to radians.

SELECT radians(45.0);

Result:

radians

-------------------

0.785398163397448

(1row)

• round(dp or numeric)

Rounds to nearest integer.

Functions 85

SELECT round(42.4);

Result:

round

-------

(1row)

• round(v numeric, s int)

Rounds to s decimal places of v argument.

SELECT round(42.4382,2);

Result:

round

-------

42.44

(1row)

• scale(numeric)

Returns scale of the argument.

SELECT scale(8.41);

Result:

scale

-------

(1row)

• sign(dp or numeric)

Returns the sign (-1, 0, 1) of the argument.

SELECT sign(-8.4);

Result:

sign

86 AgensGraph Developer Manual

------

-1

(1row)

• sqrt(dp or numeric)

Returns square root of the argument.

SELECT sqrt(2.0);

Result:

sqrt

-------------------

1.414213562373095

(1row)

• trunc(dp or numeric)

Truncates toward zero.

SELECT trunc(42.8);

Result:

trunc

-------

(1row)

• trunc(v numeric, s int)

Truncates to s decimal places.

SELECT trunc(42.4382,2);

Result:

trunc

-------

42.43

(1row)

• width_bucket(operand dp, b1 dp, b2 dp, count int) or width_bucket(operand numeric, b1 numeric, b2 numeric,

count int)

Functions 87

Returns the bucket number to which operand would be assigned in a histogram having count equal-width

buckets spanning the range b1 to b2; returns 0 or count+1 for an input outside the range.

SELECT width_bucket(5.35,0.024,10.06,5);

Result:

width_bucket

--------------

(1row)

• width_bucket(operand anyelement, thresholds anyarray)

Returns the bucket number to which operand would be assigned given an array listing the lower bounds of the

buckets; returns 0 for an input less than the irst lower bound; the thresholds array must be sorted, smallest

irst.

SELECT width_bucket(now(), array['yesterday','today','tomorrow']::timestamptz[]);

Result:

width_bucket

--------------

(1row)

4.2.3 String functions

• ascii(string)

ASCII code of the irst character of the argument. For UTF8 returns the Unicode code point of the character. For

other multibyte encodings, the argument must be an ASCII character.

SELECT ascii('x');

Result:

ascii

-------

120

(1row)

88 AgensGraph Developer Manual

• btrim(string text [, characters text])

Removes the longest string consisting only of characters in characters (a space by default) from the start and

end of string.

SELECT btrim('xyxtrimyyx','xyz');

Result:

btrim

-------

trim

(1row)

• chr(int)

Character with the given code. For UTF8 the argument is treated as a Unicode code point. For other multibyte

encodings the argument must designate an ASCII character. The NULL (0) character is not allowed because text

data types cannot store such bytes.

SELECT chr(65);

Result:

chr

-----

(1row)

• concat(str ``any'' [, str ``any'' [, …] ])

Concatenates the text representations of all the arguments. NULL arguments are ignored.

SELECT concat('abcde',2,NULL,22);

Result:

concat

----------

abcde222

(1row)

• concat_ws(sep text, str ``any'' [, str ``any'' [, …] ])

Functions 89

Concatenates all but the irst argument with separators. The irst argument is used as the separator string.

NULL arguments are ignored.

SELECT concat_ws(',','abcde',2,NULL,22);

Result:

concat_ws

------------

abcde,2,22

(1row)

• convert(string bytea, src_encoding name, dest_encoding name)

Converts string to dest_encoding. The original encoding is speciied by src_encoding. The string must be valid in

this encoding. Conversions can be deined by CREATE CONVERSION. Also there are some predeined conver-

sions. See this link for available conversions.

SELECT convert('text_in_utf8','UTF8','LATIN1');

Result:

convert

----------------------------

x746578745f696e5f75746638

(1row)

• convert_from(string bytea, src_encoding name)

Converts string to the database encoding. The original encoding is speciied by src_encoding. The string must

be valid in this encoding.

SELECT convert_from('text_in_utf8','UTF8');

Result:

convert_from

--------------

text_in_utf8 (A string represented by the current database encoding)

(1row)

• convert_to(string text, dest_encoding name)

Converts string to dest_encoding.

90 AgensGraph Developer Manual

SELECT convert_to('some text','UTF8');

Result:

convert_to

----------------------

x736f6d652074657874

(1row)

• decode(string text, format text)

Decodes binary data from textual representation in string. Options for format are same as in encode.

SELECT decode('MTIzAAE=','base64');

Result:

decode

--------------

x3132330001

(1row)

• encode(data bytea, format text)

Encode binary data into a textual representation. Supported formats are: base64,hex,escape.escape con-

verts zero bytes and high-bit-set bytes to octal sequences (\nnn) and doubles backslashes.

SELECT encode(E'123\\000\\001','base64');

Result:

encode

----------

MTIzAAE=

(1row)

• format(formatstr text [, formatarg ``any'' [, …] ])

Formats arguments according to a format string. This function is similar to the C function sprintf.

SELECT format('Hello %s, %1$s','World');

Result:

Functions 91

format

--------------------

Hello World, World

(1row)

• initcap(string)

Converts the irst letter of each word to upper case and the rest to lower case. Words are sequences of alphanu-

meric characters separated by non-alphanumeric characters.

SELECT initcap('hi THOMAS');

Result:

initcap

-----------

Hi Thomas

(1row)

• length(string)

Returns the number of characters in string.

SELECT length('jose');

Result:

length

--------

(1row)

• length(string bytea, encoding name)

Returns the number of characters in string in the given encoding. The string must be valid in this encoding.

SELECT length('jose','UTF8');

Result:

length

--------

(1row)

92 AgensGraph Developer Manual

• lpad(string text, length int [, ill text])

Fills up the string to length length by prepending the characters ill (a space by default). If the string is already

longer than length then it is truncated (on the right).

SELECT lpad('hi',5,'xy');

Result:

lpad

-------

xyxhi

(1row)

• ltrim(string text [, characters text])

Removes the longest string containing only characters from characters (a space by default) from the start of

string.

SELECT ltrim('zzzytest','xyz');

Result:

ltrim

-------

test

(1row)

• md5(string)

Calculates the MD5 hash of string, returning the result in hexadecimal.

SELECT md5('abc');

Result:

md5

----------------------------------

900150983cd24fb0d6963f7d28e17f72

(1row)

• parse_ident(qualiied_identiier text [, strictmode boolean DEFAULT true])

Splits qualiied_identiier into an array of identiiers, removing any quoting of individual identiiers. By default,

extra characters after the last identiier are considered an error; but if the second parameter is false, then

Functions 93

such extra characters are ignored. (This behavior is useful for parsing names for objects like functions). Note

that this function does not truncate over-length identiiers. If you want truncation you can cast the result to

name[].

SELECT parse_ident('"SomeSchema".someTable');

Result:

parse_ident

------------------------

{SomeSchema,sometable}

(1row)

• pg_client_encoding()

Returns current client encoding name.

SELECT pg_client_encoding();

Result:

pg_client_encoding

--------------------

SQL_ASCII

(1row)

• quote_ident(string text)

Returns the given string suitably quoted to be used as an identiier in an SQL statement string. Quotes are

added only if necessary (i.e., if the string contains non-identiier characters or would be case-folded). Embed-

ded quotes are properly doubled.

SELECT quote_ident('Foo bar');

Result:

quote_ident

-------------

"Foo bar"

(1row)

• quote_literal(string text)

94 AgensGraph Developer Manual

Returns the given string suitably quoted to be used as a string literal in an SQL statement string. Embedded

single-quotes and backslashes are properly doubled. Note that quote_literal returns null on null input; if

the argument might be null, quote_nullable is often more suitable.

SELECT quote_literal(E'O\'Reilly');

Result:

quote_literal

---------------

'O''Reilly'

(1 row)

• quote_literal(value anyelement)

Coerces the given value to text and then quote it as a literal. Embedded single-quotes and backslashes are

properly doubled.

SELECT quote_literal(42.5);

Result:

quote_literal

---------------

'42.5'

(1row)

• quote_nullable(string text)

Returns the given string suitably quoted to be used as a string literal in an statement string; or, if the argument

is null, return NULL. Embedded single-quotes and backslashes are properly doubled.

SELECT quote_nullable(NULL);

Result:

quote_nullable

---------------

NULL

(1row)

• quote_nullable(value anyelement)

Functions 95

Coerces the given value to text and then quote it as a literal; or, if the argument is null, return NULL. Embedded

single-quotes and backslashes are properly doubled.

SELECT quote_nullable(42.5);

Result:

quote_nullable

----------------

'42.5'

(1row)

• regexp_matches(string text, pattern text [, lags text])

Returns captured substring(s) resulting from the irst match of a POSIX regular expression to the string.

SELECT regexp_matches('foobarbequebaz','(bar)(beque)');

Result:

regexp_matches

----------------

{bar,beque}

(1row)

• regexp_replace(string text, pattern text, replacement text [, lags text])

Replaces substring(s) matching a POSIX regular expression.

SELECT regexp_replace('Thomas','.[mN]a.','M');

Result:

regexp_replace

----------------

ThM

(1row)

• regexp_split_to_array(string text, pattern text [, lags text])

Splits string using a POSIX regular expression as the delimiter.

96 AgensGraph Developer Manual

SELECT regexp_split_to_array('hello world', E'\\s+');

Result:

regexp_split_to_array

-----------------------

{hello,world}

(1row)

• regexp_split_to_table(string text, pattern text [, lags text])

Splits string using a POSIX regular expression as the delimiter.

SELECT regexp_split_to_table('hello world', E'\\s+');

Result:

regexp_split_to_array

-----------------------

hello

world

(2rows)

• repeat(string text, number int)

Repeats string the speciied number of times.

SELECT repeat('Pg',4);

Result:

repeat

----------

PgPgPgPg

(1row)

• replace(string text, from text, to text)

Replaces all occurrences in string of substring from with substring to.

SELECT replace('abcdefabcdef','cd','XX');

Result:

Functions 97

replace

--------------

abXXefabXXef

(1row)

• reverse(str)

Return reversed string.

SELECT reverse('abcde');

Result:

reverse

---------

edcba

(1row)

• right(str text, nint)

Returns last ncharacters in the string. When nis negative, it returns all but irst |n| characters.

SELECT right('abcde',2);

Result:

right

-------

(1row)

• rpad(string text, length int [, ill text])

Fills up the string to length length by appending the characters ill (a space by default). If the string is already

longer than length then it is truncated.

SELECT rpad('hi',5,'xy');

Result:

rpad

-------

hixyx

(1row)

98 AgensGraph Developer Manual

• rtrim(string text [, characters text])

Removes the longest string containing only characters from characters (a space by default) from the end of

string.

SELECT rtrim('testxxzx','xyz');

Result:

rtrim

-------

test

(1row)

• split_part(string text, delimiter text, ield int)

Splits string on delimiter and return the given ield (counting from one).

SELECT split_part('abc~@~def~@~ghi','~@~',2);

Result:

split_part

------------

def

(1row)

• strpos(string,substring)

Location of speciied substring (same as position (substring in string), but note the reversed argument order).

SELECT strpos('high','ig');

Result:

strpos

--------

(1row)

• substr(string,from [, count])

Extracts substring (same as substring(string from from for count)).

Functions 99

SELECT substr('alphabet',3,2);

Result:

substr

--------

(1row)

• to_ascii(string text [, encoding text])

Converts string to ASCII from another encoding (only supports conversion from LATIN1,LATIN2,LATIN9, and

WIN1250 encodings).

SELECT to_ascii('Karel');

Result:

to_ascii

----------

Karel

(1row)

• to_hex(number int or bigint)

Converts number to its equivalent hexadecimal representation.

SELECT to_hex(2147483647);

Result:

to_hex

----------

7fffffff

(1row)

• translate(string text, from text, to text)

Any character in string that matches a character in the from set is replaced by the corresponding character in

the to set. If from is longer than to, occurrences of the extra characters in from are removed.

100 AgensGraph Developer Manual

SELECT translate('12345','143','ax');

Result:

translate

-----------

a2x5

(1row)

4.2.4 Binary String functions

Deines some string functions that use key words, rather than commas, to separate arguments.

• octet_length(string)

Returns the number of bytes in binary string.

SELECT octet_length(E'jo\\000se'::bytea);

Result:

octet_length

--------------

(1row)

• overlay(string placing string from int [for int])

Replaces substring.

SELECT overlay(E'Th\\000omas'::bytea placing E'\\002\\003'::bytea FROM 2for 3);

Result:

overlay

----------------

x5402036d6173

(1row)

• position(substring in string)

Returns the location of speciied substring.

Functions 101

SELECT position(E'\\000om'::bytea in E'Th\\000omas'::bytea);

Result:

position

----------

(1row)

• substring(string [from int] [for int])

Extracts substring.

SELECT substring(E'Th\\000omas'::bytea FROM 2for 3);

Result:

substring

-----------

x68006f

(1row)

• trim([both] bytes from string)

Removes the longest string containing only bytes appearing in bytes from the start and end of string.

SELECT trim(E'\\000\\001'::bytea FROM E'\\000Tom\\001'::bytea);

Result:

trim

----------

x546f6d

(1row)

• btrim(string bytea, bytes bytea)

Removes the longest string containing only bytes appearing in bytes from the start and end of string.

SELECT btrim(E'\\000trim\\001'::bytea, E'\\000\\001'::bytea);

Result:

btrim

102 AgensGraph Developer Manual

------------

x7472696d

(1row)

• decode(string text, format text)

Decodes binary data from textual representation in string. Options for format are same as in encode.

SELECT decode(E'123\\000456','escape');

Result:

decode

------------------

x31323300343536

(1row)

• encode(data bytea, format text)

Encodes binary data into a textual representation. Supported formats are: base64,hex,escape.escape con-

verts zero bytes and high-bit-set bytes to octal sequences (\nnn) and doubles backslashes.

SELECT encode(E'123\\000456'::bytea,'escape');

Result:

encode

------------

123\000456

(1row)

• get_bit(string,offset)

Extracts bit from string.

SELECT get_bit(E'Th\\000omas'::bytea,45);

Result:

get_bit

---------

(1row)

Functions 103

• get_byte(string,offset)

Extract byte from string.

SELECT get_byte(E'Th\\000omas'::bytea,4);

Result:

get_byte

----------

109

(1row)

• length(string)

Returns the length of binary string.

SELECT length(E'jo\\000se'::bytea);

Result:

length

--------

(1row)

• md5(string)

Calculates the MD5 hash of string, returning the result in hexadecimal.

SELECT md5(E'Th\\000omas'::bytea);

Result:

md5

----------------------------------

8ab2d3c9689aaf18b4958c334c82d8b1

(1row)

• set_bit(string,offset,newvalue)

Returns the set bit in string.

104 AgensGraph Developer Manual

SELECT set_bit(E'Th\\000omas'::bytea,45,0);

Result:

set_bit

------------------

x5468006f6d4173

(1row)

• set_byte(string,offset,newvalue)

Returns the set byte in string.

SELECT set_byte(E'Th\\000omas'::bytea,4,64);

Result:

set_byte

------------------

x5468006f406173

(1row)

4.2.5 Date Type Formatting functions

The formatting functions provide a powerful set of tools for converting various data types (date/time, integer, loat-

ing point, and numeric to formatted strings and for converting from formatted strings to speciic data types. These

functions all follow a common calling convention: the irst argument is the value to be formatted and the second ar-

gument is a template that deines the output or input format.

• to_char(timestamp,text)

Converts timestamp to string.

SELECT to_char(current_timestamp, 'HH12:MI:SS');

Result:

to_char

----------

04:56:02

(1row)

• to_char(interval,text)

Functions 105

Converts interval to string.

SELECT to_char(interval '15h 2m 12s','HH24:MI:SS');

Result:

to_char

----------

15:02:12

(1row)

• to_char(int,text)

Converts integer to string.

SELECT to_char(125,'999');

Result:

to_char

----------

125

(1row)

• to_char(double precision, text)

Converts real/double precision to string.

SELECT to_char(125.8::real,'999D9');

Result:

to_char

----------

125.8

(1row)

• to_char(numeric,text)

Converts numeric to string.

SELECT to_char(-125.8,'999D99S');

Result:

106 AgensGraph Developer Manual

to_char

----------

125.80-

(1row)

• to_date(text,text)

Converts string to date.

SELECT to_date('05 Dec 2000','DD Mon YYYY');

Result:

to_date

------------

2000-12-05

(1row)

• to_number(text,text)

Converts string to numeric.

SELECT to_number('12,454.8-','99G999D9S');

Result:

to_number

-----------

-12454.8

(1row)

• to_timestamp(text,text)

Converts string to timestamp.

SELECT to_timestamp('05 Dec 2000','DD Mon YYYY');

Result:

to_timestamp

------------------------

2000-12-05 00:00:00+09

(1row)

Functions 107

4.2.6 Date/Time functions

All the functions and operators described below that take time or timestamp inputs actually come in two variants:

one that takes time with time zone or timestamp with time zone, and one that takes time without time zone or times-

tamp without time zone. For brevity, these variants are not shown separately. Also, + and * operators come in com-

mutative pairs (for example both date + integer and integer + date); we show only one of each such pair.

• age(timestamp,timestamp)

Subtracts arguments, producing a result that uses years and months, rather than just days.

SELECT age(timestamp '2001-04-10',timestamp '1957-06-13');

Result:

age

-------------------------

43 years 9mons 27 days

(1row)

• age(timestamp)

Subtracts from current_date (at midnight).

SELECT age(timestamp '1957-06-13');

Result:

age

-------------------------

60 years 3mons 15 days

(1row)

• clock_timestamp()

Returns current date and time (changes during statement execution).

SELECT clock_timestamp();

Result:

clock_timestamp

-------------------------------

108 AgensGraph Developer Manual

2017-09-28 17:47:31.208076+09

(1row)

• current_date

Returns current date.

SELECT current_date;

Result:

date

------------

2017-09-28

(1row)

• current_time

Returns current time of day.

SELECT current_time;

Result:

timetz

--------------------

17:53:23.972231+09

(1row)

• current_timestamp

Returns current date and time.

SELECT current_timestamp;

Result:

now

-------------------------------

2017-09-28 18:01:43.890977+09

(1row)

• date_part(text,timestamp)

Returns subield speciied in text.

Functions 109

SELECT date_part('hour',timestamp '2001-02-16 20:38:40');

Result:

date_part

-----------

(1row)

• date_part(text,interval)

Returns subield speciied in text.

SELECT date_part('month',interval '2 years 3 months');

Result:

date_part

-----------

(1row)

• date_trunc(text,timestamp)

Truncates to speciied precision.

SELECT date_trunc('hour',timestamp '2001-02-16 20:38:40');

Result:

date_trunc

---------------------

2001-02-16 20:00:00

(1row)

• date_trunc(text,interval)

Truncates to speciied precision.

SELECT date_trunc('hour',interval '2 days 3 hours 40 minutes');

Result:

date_trunc

110 AgensGraph Developer Manual

-----------------

2days 03:00:00

(1row)

• extract(ield from timestamp)

Extracts the speciied ield.

SELECT extract(hour FROM timestamp '2001-02-16 20:38:40');

Result:

date_part

-----------

(1row)

• extract(ield from interval)

Extracts the speciied ield.

SELECT extract(month FROM interval '2 years 3 months');

Result:

date_part

-----------

(1row)

• isinite(date)

Returns the result of testing whether the input argument is inite (no +/- ininite).

SELECT isfinite(date '2001-02-16');

Result:

isfinite

----------

(1row)

• isinite(timestamp)

Functions 111

Returns the result of testing whether the input argument is inite (no +/- ininite).

SELECT isfinite(timestamp '2001-02-16 21:28:30');

Result:

isfinite

----------

(1row)

• isinite(interval)

Returns the result of testing whether the input argument is inite.

SELECT isfinite(interval '4 hours');

Result:

isfinite

----------

(1row)

• justify_days(interval)

Adjusts interval so 30-day time periods are represented as months.

SELECT justify_days(interval '35 days');

Result:

justify_days

--------------

1mon 5days

(1row)

• justify_hours(interval)

Adjusts interval so 24-hour time periods are represented as days.

SELECT justify_hours(interval '27 hours');

Result:

112 AgensGraph Developer Manual

justify_hours

----------------

1day 03:00:00

(1row)

• justify_interval(interval)

Adjusts interval using justify_days and justify_hours, with additional sign adjustments.

SELECT justify_interval(interval '1 mon -1 hour');

Result:

justify_interval

------------------

29 days 23:00:00

(1row)

• localtime

Returns current time of day.

SELECT localtime;

Result:

time

----------------

11:27:04.72722

(1row)

• localtimestamp

Returns current date and time (start of current transaction).

SELECT localtimestamp;

Result:

timestamp

----------------------------

2017-09-29 11:29:52.230028

(1row)

Functions 113

• make_date(year int, month int, day int)

Creates date from year, month and day ields.

SELECT make_date(2013,7,15);

Result:

make_date

------------

2013-07-15

(1row)

• make_interval(years int DEFAULT 0, months int DEFAULT 0, weeks int DEFAULT 0, days int DEFAULT 0, hours int

DEFAULT 0, mins int DEFAULT 0, secs double precision DEFAULT 0.0)

Creates interval from years, months, weeks, days, hours, minutes and seconds ields.

SELECT make_interval(days => 10);

Result:

make_interval

---------------

10 days

(1row)

• make_time(hour int, min int, sec double precision)

Creates time from hour, minute and seconds ields.

SELECT make_time(8,15,23.5);

Result:

make_time

------------

08:15:23.5

(1row)

• make_timestamp(year int, month int, day int, hour int, min int, sec double precision)

Creates timestamp from year, month, day, hour, minute and seconds ields.

114 AgensGraph Developer Manual

SELECT make_timestamp(2013,7,15,8,15,23.5);

Result:

make_timestamp

-----------------------

2013-07-15 08:15:23.5

(1row)

• make_timestamptz(year int, month int, day int, hour int, min int, sec double precision, [ timezone text ])

Creates timestamp with time zone from year, month, day, hour, minute and seconds ields; if timezone is not

speciied, the current time zone is used.

SELECT make_timestamptz(2013,7,15,8,15,23.5);

Result:

make_timestampz

--------------------------

2013-07-15 08:15:23.5+01

(1row)

• now()

Returns current date and time (start of current transaction).

SELECT now();

Result:

now

-------------------------------

2017-10-11 16:09:51.154262+09

(1row)

• statement_timestamp()

Returns current date and time.

SELECT statement_timestamp();

Result:

Functions 115

statement_timestamp

-------------------------------

2017-10-11 16:08:59.641426+09

(1row)

• timeofday()

Returns current date and time.

SELECT timeofday();

Result:

timeofday

-------------------------------------

Wed Oct 11 16:09:26.934061 2017 KST

(1row)

• transaction_timestamp()

Returns current date and time.

SELECT transaction_timestamp();

Result:

transaction_timestamp

-------------------------------

2017-10-11 16:10:21.530521+09

(1row)

• to_timestamp(double precision)

Converts Unix epoch (seconds since 1970-01-01 00:00:00+00) to timestamp.

SELECT to_timestamp(1284352323);

Result:

to_timestamp

------------------------

2010-09-13 13:32:03+09

(1row)

116 AgensGraph Developer Manual

4.2.7 Enum Support functions

For enum types, there are several functions that allow cleaner programming without hard-coding particular values

of an enum type.

To execute the example in the function description, create an enum type as shown below irst.

CREATE TYPE rainbow AS ENUM ('red','orange','yellow','green','blue','purple');

• enum_irst(anyenum)

Returns the irst value of the input enum type.

SELECT enum_first(null::rainbow);

Result:

enum_first

------------

red

(1row)

• enum_last(anyenum)

Returns the last value of the input enum type.

SELECT enum_last(null::rainbow);

Result:

enum_last

-----------

purple

(1row)

• enum_range(anyenum)

Returns all values of the input enum type in an ordered array.

SELECT enum_range(null::rainbow);

Result:

enum_range

---------------------------------------

Functions 117

{red,orange,yellow,green,blue,purple}

(1row)

• enum_range(anyenum,anyenum)

Returns the range between the two given enum values, as an ordered array. The values must be from the same

enum type. If the irst parameter is null, the result will start with the irst value of the enum type. If the second

parameter is null, the result will end with the last value of the enum type.

SELECT enum_range('orange'::rainbow,'green'::rainbow);

Result:

enum_range

-----------------------

{orange,yellow,green}

(1row)

SELECT enum_range(NULL,'green'::rainbow);

Result:

enum_range

---------------------------

{red,orange,yellow,green}

(1row)

SELECT enum_range('orange'::rainbow,NULL);

Result:

enum_range

-----------------------------------

{orange,yellow,green,blue,purple}

(1row)

4.2.8 Geometric Functions

• area(object)

Returns area.

118 AgensGraph Developer Manual

SELECT area(box '((0,0),(1,1))');

Result:

area

------

(1row)

• center(object)

Returns the center coordinates of object.

SELECT center(box '((0,0),(1,2))');

Result:

center

--------

(0.5,1)

(1row)

• diameter(circle)

Returns the diameter of circle.

SELECT diameter(circle '((0,0),2.0)');

Result:

diameter

----------

(1row)

• height(box)

Returns the vertical size of box.

SELECT height(box '((0,0),(1,1))');

Result:

height

Functions 119

--------

(1row)

• isclosed(path)

Returns a logical value indicating whether the input path is a closed path.

SELECT isclosed(path '((0,0),(1,1),(2,0))');

Result:

isclosed

----------

(1row)

• isopen(path)

Returns a logical value indicating whether the input path is an open path.

SELECT isopen(path '[(0,0),(1,1),(2,0)]');

Result:

isopen

--------

(1row)

• length(object)

Returns length of the path.

SELECT length(path '((-1,0),(1,0))');

Result:

length

--------

(1row)

• npoints(path)

120 AgensGraph Developer Manual

Returns the number of points of the input path.

SELECT npoints(path '[(0,0),(1,1),(2,0)]');

Result:

npoints

---------

(1row)

• npoints(polygon)

Returns the number of polygon points.

SELECT npoints(polygon '((1,1),(0,0))');

Result:

npoints

---------

(1row)

• pclose(path)

Converts the input path to closed.

SELECT pclose(path '[(0,0),(1,1),(2,0)]');

Result:

pclose

---------------------

((0,0),(1,1),(2,0))

(1row)

• popen(path)

Converts the input path to open.

SELECT popen(path '((0,0),(1,1),(2,0))');

Result:

Functions 121

popen

---------------------

[(0,0),(1,1),(2,0)]

(1row)

• radius(circle)

Returns the radius of circle.

SELECT radius(circle '((0,0),2.0)');

Result:

radius

--------

(1row)

• width(box)

Returns the radius of circle.

SELECT width(box '((0,0),(1,1))');

Result:

width

-------

(1row)

• box(circle)

Returns a box circumscribed about circle.

SELECT box(circle '((0,0),2.0)');

Result:

box

---------------------------------------------------------------------------

(1.41421356237309,1.41421356237309),(-1.41421356237309,-1.41421356237309)

(1row)

122 AgensGraph Developer Manual

• box(point)

Returns a box whose width is zero (empty box) centered on the input point.

SELECT box(point '(0,0)');

Result:

box

-------------

(0,0),(0,0)

(1row)

• box(point,point)

Returns a box whose vertices are the two input points.

SELECT box(point '(0,0)', point '(1,1)');

Result:

box

-------------

(1,1),(0,0)

(1row)

• box(polygon)

Returns a box circumscribed about polygon.

SELECT box(polygon '((0,0),(1,1),(2,0))');

Result:

box

-------------

(2,1),(0,0)

(1row)

• bound_box(box,box)

Returns the smallest box that contains the two boxes entered.

Functions 123

SELECT bound_box(box '((0,0),(1,1))', box '((3,3),(4,4))');

Result:

bound_box

-------------

(4,4),(0,0)

(1row)

• circle(box)

Returns a circle circumscribed about box.

SELECT circle(box '((0,0),(1,1))');

Result:

circle

-------------------------------

<(0.5,0.5),0.707106781186548>

(1row)

• circle(point,double precision)

Returns the circle created using the center coordinates and radius of the input circle.

SELECT circle(point '(0,0)',2.0);

Result:

circle

-----------

<(0,0),2>

(1row)

• circle(polygon)

Returns a circle with the average of the input coordinate pairs as the center of the circle and the average dis-

tance from the point to the input coordinate pair as the radius.

SELECT circle(polygon '((0,0),(1,1),(2,0))');

Result:

124 AgensGraph Developer Manual

circle

-------------------------------------------

<(1,0.333333333333333),0.924950591148529>

(1row)

• line(point,point)

Returns the value of the line.

SELECT line(point '(-1,0)', point '(1,0)');

Result:

line

----------

{0,-1,0}

(1row)

• lseg(box)

Returns box diagonal to line segment.

SELECT lseg(box '((-1,0),(1,0))');

Result:

lseg

----------------

[(1,0),(-1,0)]

(1row)

• lseg(point,point)

Returns a line segment with two input points taken as start and end points.

SELECT lseg(point '(-1,0)', point '(1,0)');

Result:

lseg

----------------

[(1,0),(-1,0)]

(1row)

Functions 125

• path(polygon)

Returns a polygon path.

SELECT path(polygon '((0,0),(1,1),(2,0))');

Result:

path

---------------------

((0,0),(1,1),(2,0))

(1row)

• point(double precision,double precision)

Returns the value that construct the point.

SELECT point(23.4, -44.5);

Result:

point

--------------

(23.4,-44.5)

(1row)

• point(box)

Returns center of box.

SELECT point(box '((-1,0),(1,0))');

Result:

point

-------

(0,0)

(1row)

• point(circle)

Returns center of circle.

126 AgensGraph Developer Manual

SELECT point(circle '((0,0),2.0)');

Result:

point

-------

(0,0)

(1row)

• point(lseg)

Returns center of line segment.

SELECT point(lseg '((-1,0),(1,0))');

Result:

point

-------

(0,0)

(1row)

• point(polygon)

Returns center of polygon.

SELECT point(polygon '((0,0),(1,1),(2,0))');

Result:

point

-----------------------

(1,0.333333333333333)

(1row)

• polygon(box)

Returns box to 4-point polygon.

SELECT polygon(box '((0,0),(1,1))');

Result:

polygon

Functions 127

---------------------------

((0,0),(0,1),(1,1),(1,0))

(1row)

• polygon(circle)

Returns circle to 12-point polygon.

SELECT polygon(circle '((0,0),2.0)');

Result:

polygon

-----------------------------------------------------------------------------------------

((-2,0),(-1.73205080756888,1),(-1,1.73205080756888),(-1.22464679914735e-16,2),

(1,1.73205080756888),(1.73205080756888,1),(2,2.44929359829471e-16),

(1.73205080756888,-0.999999999999999),(1,-1.73205080756888),(3.67394039744206e-16,-2),

(-0.999999999999999,-1.73205080756888),(-1.73205080756888,-1))

(1row)

• polygon(npts,circle)

Returns circle to npts-point polygon.

SELECT polygon(12, circle '((0,0),2.0)');

polygon

-----------------------------------------------------------------------------------------

((-2,0),(-1.73205080756888,1),(-1,1.73205080756888),(-1.22464679914735e-16,2),

(1,1.73205080756888),(1.73205080756888,1),(2,2.44929359829471e-16),

(1.73205080756888,-0.999999999999999),(1,-1.73205080756888),(3.67394039744206e-16,-2),

(-0.999999999999999,-1.73205080756888),(-1.73205080756888,-1))

(1row)

• polygon(path)

Converts path into a polygon.

SELECT polygon(path '((0,0),(1,1),(2,0))');

Result:

128 AgensGraph Developer Manual

polygon

---------------------

((0,0),(1,1),(2,0))

(1row)

4.2.9 Network Address Functions

• abbrev(inet)

Returns abbreviated display format as text.

SELECT abbrev(inet '10.1.0.0/16');

Result:

abbrev

-------------

10.1.0.0/16

(1row)

• abbrev(cidr)

Returns abbreviated display format as text.

SELECT abbrev(cidr '10.1.0.0/16');

Result:

abbrev

---------

10.1/16

(1row)

• broadcast(inet)

Returns broadcast address for network.

SELECT broadcast('192.168.1.5/24');

Result:

broadcast

------------------

Functions 129

192.168.1.255/24

(1row)

• family(inet)

Extracts family of address; 4 for IPv4, 6 for IPv6.

SELECT family('::1');

Result:

family

--------

(1row)

• host(inet)

Extracts IP address as text.

SELECT host('192.168.1.5/24');

Result:

host

-------------

192.168.1.5

(1row)

• hostmask(inet)

Constructs host mask for network.

SELECT hostmask('192.168.23.20/30');

Result:

hostmask

----------

0.0.0.3

(1row)

• masklen(inet)

Extracts netmask length.

130 AgensGraph Developer Manual

SELECT masklen('192.168.1.5/24');

Result:

masklen

---------

(1row)

• netmask(inet)

Constructs netmask for network.

SELECT netmask('192.168.1.5/24');

Result:

netmask

---------------

255.255.255.0

(1row)

• network(inet)

Extracts network part of address.

SELECT network('192.168.1.5/24');

Result:

network

----------------

192.168.1.0/24

(1row)

• set_masklen(inet,int)

Returns the set netmask length for inet value.

SELECT set_masklen('192.168.1.5/24',16);

Result:

set_masklen

Functions 131

----------------

192.168.1.5/16

(1row)

• set_masklen(cidr,int)

Returns the set netmask length for cidr value.

SELECT set_masklen('192.168.1.0/24'::cidr,16);

Result:

set_masklen

----------------

192.168.0.0/16

(1row)

• text(inet)

Returns IP address and netmask length as text.

SELECT text(inet '192.168.1.5');

Result:

text

----------------

192.168.1.5/32

(1row)

• inet_same_family(inet,inet)

Returns a logical value indicating whether the address is a family value.

SELECT inet_same_family('192.168.1.5/24','::1');

Result:

inet_same_family

------------------

(1row)

• inet_merge(inet,inet)

132 AgensGraph Developer Manual

Returns the smallest network which includes all of the entered networks.

SELECT inet_merge('192.168.1.5/24','192.168.2.5/24');

Result:

inet_merge

----------------

192.168.0.0/22

(1row)

• trunc(macaddr)

Sets the last 3 bytes to zero.

SELECT trunc(macaddr '12:34:56:78:90:ab');

Result:

trunc

-------------------

12:34:56:00:00:00

(1row)

4.2.10 Text Search Functions

• array_to_tsvector(text[])

Converts array of lexemes to tsvector.

SELECT array_to_tsvector('{fat,cat,rat}'::text[]);

Result:

array_to_tsvector

-------------------

'cat' 'fat' 'rat'

(1row)

• get_current_ts_conig()

Returns the default text search coniguration.

Functions 133

SELECT get_current_ts_config();

Result:

get_current_ts_config

-----------------------

english

(1row)

• length(tsvector)

Returns number of lexemes in tsvector.

SELECT length('fat:2,4 cat:3 rat:5A'::tsvector);

Result:

length

--------

(1row)

• numnode(tsquery)

Returns the number of lexemes plus operators in tsquery.

SELECT numnode('(fat & rat) | cat'::tsquery);

Result:

numnode

---------

(1row)

• plainto_tsquery([ conig regconig , ] query text)

Produces tsquery ignoring punctuation.

SELECT plainto_tsquery('english','The Fat Rats');

Result:

plainto_tsquery

134 AgensGraph Developer Manual

-----------------

'fat' &'rat'

(1row)

• phraseto_tsquery([ conig regconig , ] query text)

Produces tsquery that searches for a phrase, ignoring punctuation.

SELECT phraseto_tsquery('english','The Fat Rats');

Result:

phraseto_tsquery

------------------

'fat' <-> 'rat'

(1row)

• querytree(query tsquery)

Returns indexable part of a tsquery.

SELECT querytree('foo & ! bar'::tsquery);

Result:

querytree

-----------

'foo'

(1row)

• setweight(vector tsvector, weight ``char'')

Assigns weight to each element of vector.

SELECT setweight('fat:2,4 cat:3 rat:5B'::tsvector,'A');

Result:

setweight

-------------------------------

'cat':3A 'fat':2A,4A 'rat':5A

(1row)

• setweight(vector tsvector, weight ``char'', lexemes text[])

Functions 135

Assigns weight to elements of vector that are listed in lexemes.

SELECT setweight('fat:2,4 cat:3 rat:5B'::tsvector,'A','{cat,rat}');

Result:

setweight

-----------------------------

'cat':3A 'fat':2,4'rat':5A

(1row)

• strip(tsvector)

Removes positions and weights from tsvector.

SELECT strip('fat:2,4 cat:3 rat:5A'::tsvector);

Result:

strip

-------------------

'cat' 'fat' 'rat'

(1row)

• to_tsquery([ conig regconig , ] query text)

Normalizes words and converts to tsquery.

SELECT to_tsquery('english','The & Fat & Rats');

Result:

to_tsquery

---------------

'fat' &'rat'

(1row)

• to_tsvector([ conig regconig , ] document text)

Reduce document text to tsvector.

SELECT to_tsvector('english','The Fat Rats');

Result:

136 AgensGraph Developer Manual

to_tsvector

-----------------

'fat':2 'rat':3

(1row)

• ts_delete(vector tsvector, lexeme text)

Removes given lexeme from vector.

SELECT ts_delete('fat:2,4 cat:3 rat:5A'::tsvector,'fat');

Result:

ts_delete

------------------

'cat':3 'rat':5A

(1row)

• ts_delete(vector tsvector, lexemes text[])

Removes any occurrence of lexemes in lexemes from vector.

SELECT ts_delete('fat:2,4 cat:3 rat:5A'::tsvector,ARRAY['fat','rat']);

Result:

ts_delete

-----------

'cat':3

(1row)

• ts_ilter(vector tsvector, weights ``char''[])

Selects only elements with given weights from vector.

SELECT ts_filter('fat:2,4 cat:3b rat:5A'::tsvector,'{a,b}');

Result:

ts_filter

-------------------

'cat':3B 'rat':5A

(1row)

Functions 137

• ts_headline([ conig regconig, ] document text, query tsquery [, options text ])

Displays a query match.

SELECT ts_headline('x y z','z'::tsquery);

Result:

ts_headline

--------------

x y <b>z</b>

(1row)

• ts_rank([ weights loat4[], ] vector tsvector, query tsquery [, normalization integer ])

Ranks documents for query.

SELECT ts_rank(to_tsvector('This is an example of document'), to_tsquery('example'));

Result:

ts_rank

-----------

0.0607927

(1row)

• ts_rank_cd([ weights loat4[], ] vector tsvector, query tsquery [, normalization integer ])

Ranks documents for query using cover density.

SELECT ts_rank_cd(to_tsvector('This is an example of document'), to_tsquery('example'));

Result:

ts_rank_cd

------------

0.1

(1row)

• ts_rewrite(query tsquery, target tsquery, substitute tsquery)

Replaces target with substitute within query.

138 AgensGraph Developer Manual

SELECT ts_rewrite('a & b'::tsquery,'a'::tsquery,'foo|bar'::tsquery);

Result:

ts_rewrite

-------------------------

'b' & ( 'foo' |'bar' )

(1row)

• ts_rewrite(query tsquery, select text)

Replaces the irst column value of the SELECT result with the second column value of the SELECT result.

create table aliases (t tsquery primary key, s tsquery);

insert into aliases values ('a','foo|bar');

SELECT ts_rewrite('a & b'::tsquery,'SELECT t,s FROM aliases');

Result:

ts_rewrite

-------------------------

'b' & ( 'bar' |'foo' )

(1row)

• tsquery_phrase(query1 tsquery, query2 tsquery)

Makes query that searches for query1 followed by query2 (same as <-> operator).

SELECT tsquery_phrase(to_tsquery('fat'), to_tsquery('cat'));

Result:

tsquery_phrase

-----------------

'fat' <-> 'cat'

(1row)

• tsquery_phrase(query1 tsquery, query2 tsquery, distance integer)

Makes query that searches for query1 followed by query2 at distance distance.

Functions 139

SELECT tsquery_phrase(to_tsquery('fat'), to_tsquery('cat'), 10);

Result:

tsquery_phrase

------------------

'fat' <10>'cat'

(1row)

• tsvector_to_array(tsvector)

Converts tsvector to array of lexemes.

SELECT tsvector_to_array('fat:2,4 cat:3 rat:5A'::tsvector);

Result:

tsvector_to_array

-------------------

{cat,fat,rat}

(1row)

• tsvector_update_trigger()

Triggers the function for automatic tsvector column update.

CREATE TRIGGER tsvectorupdate BEFORE INSERT OR UPDATE ON messages

FOR EACH ROW EXECUTE PROCEDURE

tsvector_update_trigger(tsv, 'pg_catalog.english', title, body);

INSERT INTO messages VALUES ('title here','the body text is here');

SELECT *FROM messages;

Result:

title | body | tsv

------------+-----------------------+----------------------------

title here | the body text is here | 'bodi':4 'text':5 'titl':1

(1row)

• tsvector_update_trigger_column()

140 AgensGraph Developer Manual

Triggers the function for automatic tsvector column update.

CREATE TRIGGER ... tsvector_update_trigger_column(tsv, configcol, title, body);

• unnest(tsvector, OUT lexeme text, OUT positions smallint[], OUT weights text)

Expands a tsvector to a set of rows.

SELECT unnest('fat:2,4 cat:3 rat:5A'::tsvector);

Result:

unnest

-----------------------

(cat,{3},{D})

(fat,"{2,4}","{D,D}")

(rat,{5},{A})

(3row)

• ts_debug([ conig regconig, ] document text, OUT alias text, OUT description text, OUT token text, OUT dictio-

naries regdictionary[], OUT dictionary regdictionary, OUT lexemes text[])

Tests a coniguration.

SELECT ts_debug('english','The Brightest supernovaes');

Result:

ts_debug

-----------------------------------------------------------------------------------

(asciiword,"Word, all ASCII",The,{english_stem},english_stem,{})

(blank,"Space symbols"," ",{},,)

(asciiword,"Word, all ASCII",Brightest,{english_stem},english_stem,{brightest})

(blank,"Space symbols"," ",{},,)

(asciiword,"Word, all ASCII",supernovaes,{english_stem},english_stem,{supernova})

(5row)

• ts_lexize(dict regdictionary, token text)

Tests a dictionary.

Functions 141

SELECT ts_lexize('english_stem','stars');

Result:

ts_lexize

-----------

{star}

(1row)

• ts_parse(parser_name text, document text, OUT tokid integer, OUT token text)

Tests a parser.

SELECT ts_parse('default','foo - bar');

Result:

ts_parse

-----------

(1,foo)

(12," ")

(12,"- ")

(1,bar)

(4row)

• ts_parse(parser_oid oid, document text, OUT tokid integer, OUT token text)

Tests a parser with oid.

SELECT ts_parse(3722,'foo - bar');

Result:

ts_parse

-----------

(1,foo)

(12," ")

(12,"- ")

(1,bar)

(4row)

• ts_token_type(parser_name text, OUT tokid integer, OUT alias text, OUT description text)

142 AgensGraph Developer Manual

Gets token types deined by parser.

SELECT ts_token_type('default');

Result:

ts_token_type

---------------------------------

(1,asciiword,"Word, all ASCII")

...

(23 row)

• ts_token_type(parser_oid oid, OUT tokid integer, OUT alias text, OUT description text)

Gets token types deined by parser.

SELECT ts_token_type(3722);

Result:

ts_token_type

---------------------------------

(1,asciiword,"Word, all ASCII")

...

(23 row)

• ts_stat(sqlquery text, [ weights text, ] OUT word text, OUT ndoc integer, OUT nentry integer)

Returns statistics of a tsvector column.

SELECT ts_stat('SELECT vector FROM apod');

Result:

ts_stat

------------

(foo,10,15)

...

(4row)

4.2.11 JSON Functions

• to_json(anyelement), to_jsonb(anyelement)

Functions 143

Returns the value as json or jsonb. Arrays and composites are converted to arrays and objects; otherwise, if

there is a cast from the type to json, the cast function will be used to perform the conversion or a scalar value

is produced. For any scalar type other than a number, a Boolean, or a null value, the text representation will be

used, in such a fashion that it is a valid json or jsonb value.

SELECT to_json('Fred said "Hi."'::text);

Result:

to_json

---------------------

"Fred said "Hi.\""

(1row)

• array_to_json(anyarray [, pretty_bool])

Returns the array as a JSON array. Line feeds will be added between dimension-1 elements if returns pretty_bool

is true.

SELECT array_to_json('{{1,5},{99,100}}'::int[]);

Result:

array_to_json

------------------

[[1,5],[99,100]]

(1row)

SELECT array_to_json('{{1,5},{99,100}}'::int[], true);

Result:

array_to_json

---------------

[[1,5], +

[99,100]]

(1row)

• row_to_json(record [, pretty_bool])

Returns the row as a JSON object. Line feeds will be added between level-1 elements if pretty_bool is true.

144 AgensGraph Developer Manual

SELECT row_to_json(row(1,'foo'));

Result:

row_to_json

---------------------

{"f1":1,"f2":"foo"}

(1row)

• json_build_array(VARIADIC ``any''), jsonb_build_array(VARIADIC ``any'')

Builds a possibly-heterogeneously-typed JSON array out of a variable argument list.

SELECT json_build_array(1,2,'3',4,5);

Result:

json_build_array

-------------------

[1,2,"3",4,5]

(1row)

• json_build_object(VARIADIC ``any''), jsonb_build_object(VARIADIC ``any'')

Builds a JSON object out of a variable argument list. By convention, the argument list consists of alternating

keys and values.

SELECT json_build_object('foo',1,'bar',2);

Result:

json_build_object

------------------------

{"foo" :1,"bar" :2}

(1row)

• json_object(text[]), jsonb_object(text[])

Builds a JSON object out of a text array. The array must have either exactly one dimension with an even num-

ber of members, in which case they are taken as alternating key/value pairs, or two dimensions such that each

inner array has exactly two elements, which are taken as a key/value pair.

Functions 145

SELECT json_object('{a, 1, b, "def", c, 3.5}');

SELECT json_object('{{a, 1},{b, "def"},{c, 3.5}}');

Result:

json_object

---------------------------------------

{"a" :"1","b" :"def","c" :"3.5"}

(1row)

• json_object(keys text[], values text[]), jsonb_object(keys text[], values text[])

This form of json_object takes keys and values pairwise from two separate arrays. In all other respects it is

identical to the one-argument form.

SELECT json_object('{a, b}','{1,2}');

Result:

json_object

------------------------

{"a" :"1","b" :"2"}

(1row)

• json_array_length(json), jsonb_array_length(jsonb)

Returns the number of elements in the outermost JSON array.

SELECT json_array_length('[1,2,3,{"f1":1,"f2":[5,6]},4]');

Result:

json_array_length

-------------------

(1row)

• json_each(json), jsonb_each(jsonb)

Expands the outermost JSON object into a set of key/value pairs.

146 AgensGraph Developer Manual

SELECT *FROM json_each('{"a":"foo", "b":"bar"}');

Result:

key | value

-----+-------

a | "foo"

b | "bar"

(2row)

• json_each_text(json), jsonb_each_text(jsonb)

Expands the outermost JSON object into a set of key/value pairs. The returned values will be of type text.

SELECT *FROM json_each_text('{"a":"foo", "b":"bar"}');

Result:

key | value

-----+-------

a | foo

b | bar

(2row)

• json_extract_path(from_json json, VARIADIC path_elems text[]), jsonb_extract_path(from_json jsonb, VARIADIC

path_elems text[])

Returns JSON value pointed to by path_elems (equivalent to #> operator).

SELECT *FROM json_extract_path('{"f2":{"f3":1},"f4":{"f5":99,"f6":"foo"}}','f4');

Result:

json_extract_path

----------------------

{"f5":99,"f6":"foo"}

(1row)

• json_extract_path_text(from_json json, VARIADIC path_elems text[]), jsonb_extract_path_text(from_json jsonb,

VARIADIC path_elems text[])

Returns JSON value pointed to by path_elems as text (equivalent to #>> operator).

Functions 147

SELECT json_extract_path_text('{"f2":{"f3":1},"f4":{"f5":99,"f6":"foo"}}','f4','f6');

Result:

json_extract_path_text

------------------------

foo

(1row)

• json_object_keys(json), jsonb_object_keys(jsonb)

Returns set of keys in the outermost JSON object.

SELECT json_object_keys('{"f1":"abc","f2":{"f3":"a", "f4":"b"}}');

Result:

json_object_keys

------------------

(2row)

• json_populate_record(base anyelement, from_json json), jsonb_populate_record(base anyelement, from_json

jsonb)

Expands the object in from_json to a row whose columns match the record type deined by base.

CREATE TABLE myrowtype (a int, b int);

SELECT *FROM json_populate_record(null::myrowtype,'{"a":1,"b":2}');

Result:

a | b

---+---

1|2

(1row)

• json_populate_recordset(base anyelement, from_json json), jsonb_populate_recordset(base anyelement, from_json

jsonb)

148 AgensGraph Developer Manual

Expands the outermost array of objects in from_json to a set of rows whose columns match the record type

deined by base.

SELECT *FROM json_populate_recordset(null::myrowtype,'[{"a":1,"b":2},{"a":3,"b":4}]');

Result:

a | b

---+---

1|2

3|4

(2row)

• json_array_elements(json), jsonb_array_elements(jsonb)

Expands a JSON array to a set of JSON values.

SELECT *FROM json_array_elements('[1,true, [2,false]]');

Result:

value

-----------

true

[2,false]]

(3row)

• json_array_elements_text(json), jsonb_array_elements_text(jsonb)

Expands a JSON array to a set of text values.

SELECT *FROM json_array_elements_text('["foo", "bar"]');

Result:

value

-------

foo

bar

(2row)

• json_typeof(json), jsonb_typeof(jsonb)

Functions 149

Returns the type of the outermost JSON value as a text string. Possible types are object,array,string,number,

boolean, and null.

SELECT json_typeof('-123.4');

Result:

json_typeof

-------------

number

(1row)

• json_to_record(json), jsonb_to_record(jsonb)

Builds an arbitrary record from a JSON object. As with all functions returning record, the caller must explicitly

deine the structure of the record with an AS clause.

SELECT *FROM json_to_record('{"a":1,"b":[1,2,3],"c":"bar"}')as x(a int, b text, d text);

Result:

a | b | d

---+---------+---

1| [1,2,3] |

(1row)

• json_to_recordset(json), jsonb_to_recordset(jsonb)

Builds an arbitrary set of records from a JSON array of objects. As with all functions returning record, the

caller must explicitly deine the structure of the record with an AS clause.

SELECT *FROM json_to_recordset('[{"a":1,"b":"foo"},{"a":"2","c":"bar"}]')

as x(a int, b text);

Result:

a | b

---+-----

1| foo

(2row)

• json_strip_nulls(from_json json), jsonb_strip_nulls(from_json jsonb)

150 AgensGraph Developer Manual

Returns from_json with all object ields that have null values omitted. Other null values are unchanged.

SELECT json_strip_nulls('[{"f1":1,"f2":null},2,null,3]');

Result:

json_strip_nulls

---------------------

[{"f1":1},2,null,3]

(1row)

• jsonb_set(target jsonb, path text[], new_value jsonb[, create_missing boolean])

Returns target with the section designated by path replaced by new_value, or with new_value added

if create_missing is true (default is true) and the item designated by path does not exist. As with the path

orientated operators, negative integers that appear in path count from the end of JSON arrays.

SELECT jsonb_set('[{"f1":1,"f2":null},2,null,3]','{0,f1}','[2,3,4]',false);

Result:

jsonb_set

---------------------------------------------

[{"f1": [2,3,4], "f2":null}, 2,null,3]

(1row)

SELECT jsonb_set('[{"f1":1,"f2":null},2]','{0,f3}','[2,3,4]');

Result:

jsonb_set

---------------------------------------------

[{"f1":1,"f2":null,"f3": [2,3,4]}, 2]

(1row)

• jsonb_insert(target jsonb, path text[], new_value jsonb, [insert_after boolean])

Returns target with new_value inserted. If target section designated by path is in a JSONB array, new_value

will be inserted before target or after if insert_after is true (default is false). If target section designated

by path is in JSONB object, new_value will be inserted only if target does not exist. As with the path orien-

tated operators, negative integers that appear in path count from the end of JSON arrays.

Functions 151

SELECT jsonb_insert('{"a": [0,1,2]}','{a, 1}','"new_value"');

Result:

jsonb_insert

-------------------------------

{"a": [0,"new_value",1,2]}

(1row)

SELECT jsonb_insert('{"a": [0,1,2]}','{a, 1}','"new_value"',true);

Result:

jsonb_insert

-------------------------------

{"a": [0,1,"new_value",2]}

(1row)

• jsonb_pretty(from_json jsonb)

Returns from_json as indented JSON text.

SELECT jsonb_pretty('[{"f1":1,"f2":null},2,null,3]');

Result:

jsonb_pretty

--------------------

[ +

{ +

"f1":1, +

"f2": null+

}, +

2, +

null, +

]

(1row)

152 AgensGraph Developer Manual

4.2.12 Sequence Manipulation Functions

This section describes functions for operating on sequences. Sequences can be created with CREATE SEQUENCE.

CREATE SEQUENCE serial increment by 1start 101;

• nextval(regclass)

Advances sequence and returns new value.

SELECT nextval('serial');

Result:

nextval

---------

101

(1row)

• currval(regclass)

Returns value most recently obtained with nextval for speciied sequence.

SELECT currval('serial');

Result:

currval

---------

101

(1row)

• lastval()

Returns value most recently obtained with nextval for any sequence.

SELECT lastval();

Result:

lastval

---------

101

(1row)

Functions 153

• setval(regclass, bigint)

Sets sequence's current value.

SELECT setval('serial',101);

Result:

setval

--------

101

(1row)

• setval(regclass, bigint, boolean)

Sets sequence's current value and is_called lag.

-- true

SELECT setval('serial',101,true);

Result:

setval

--------

101

(1row)

SELECT nextval('serial');

Result:

nextval

---------

102

(1row)

-- false

SELECT setval('serial',101,false);

Result:

setval

154 AgensGraph Developer Manual

--------

101

(1row)

SELECT nextval('serial');

Result:

nextval

---------

101

(1row)

4.2.13 Array Functions

• array_append(anyarray, anyelement)

Appends anyelement to the end of an array.

SELECT array_append(ARRAY[1,2], 3);

Result:

array_append

--------------

{1,2,3}

(1row)

• array_cat(anyarray, anyarray)

Concatenates two arrays.

SELECT array_cat(ARRAY[1,2,3], ARRAY[4,5]);

Result:

array_cat

-------------

{1,2,3,4,5}

(1row)

• array_ndims(anyarray)

Functions 155

Returns the number of dimensions of the array.

SELECT array_ndims(ARRAY[[1,2,3], [4,5,6]]);

Result:

array_ndims

-------------

(1row)

• array_dims(anyarray)

Returns a text representation of array's dimensions.

SELECT array_dims(ARRAY[[1,2,3], [4,5,6]]);

Result:

array_dims

------------

[1:2][1:3]

(1row)

• array_ill(anyelement, int[], [, int[]])

Returns an array initialized with optionally-supplied value and dimensions.

SELECT array_fill(7,ARRAY[3], ARRAY[2]);

Result:

array_fill

---------------

[2:4]={7,7,7}

(1row)

• array_length(anyarray, int)

Returns the length of the requested array dimension.

SELECT array_length(array[1,2,3], 1);

Result:

156 AgensGraph Developer Manual

array_length

--------------

(1row)

• array_lower(anyarray, int)

Returns the lower bound of the requested array dimension.

SELECT array_lower('[0:2]={1,2,3}'::int[], 1);

Result:

array_lower

-------------

(1row)

• array_position(anyarray, anyelement [, int])

Returns the index of the irst occurrence of the second argument in the array, starting at the element indicated

by the third argument or at the irst element (array must be one-dimensional).

SELECT array_position(ARRAY['sun','mon','tue','wed','thu','fri','sat'], 'mon');

Result:

array_position

----------------

(1row)

• array_positions(anyarray, anyelement)

Returns an array of indexes of all occurrences of the second argument in the array given as irst argument (ar-

ray must be one-dimensional).

SELECT array_positions(ARRAY['A','A','B','A'], 'A');

Result:

array_positions

-----------------

Functions 157

{1,2,4}

(1row)

• array_prepend(anyelement, anyarray)

Appends anyelement to the beginning of an array.

SELECT array_prepend(1,ARRAY[2,3]);

Result:

array_prepend

---------------

{1,2,3}

(1row)

• array_remove(anyarray, anyelement)

Removes all elements equal to the given value from the array.

SELECT array_remove(ARRAY[1,2,3,2], 2);

Result:

array_remove

--------------

{1,3}

(1row)

• array_replace(anyarray, anyelement, anyelement)

Replaces each array element equal to the given value with a new value.

SELECT array_replace(ARRAY[1,2,5,4], 5,3);

Result:

array_replace

---------------

{1,2,3,4}

(1row)

• array_to_string(anyarray, text [, text])

Concatenates array elements using speciied delimiter and optional null string.

158 AgensGraph Developer Manual

SELECT array_to_string(ARRAY[1,2,3,NULL,5], ',','*');

Result:

array_to_string

-----------------

1,2,3,*,5

(1row)

• array_upper(anyarray, int)

Returns upper bound of the requested array dimension.

SELECT array_upper(ARRAY[1,8,3,7], 1);

Result:

array_upper

-------------

(1row)

• cardinality(anyarray)

Returns the total number of elements in the array, or 0 if the array is empty.

SELECT cardinality(ARRAY[[1,2],[3,4]]);

Result:

cardinality

-------------

(1row)

• string_to_array(text, text [, text])

Splits string into array elements using supplied delimiter and optional null string.

SELECT string_to_array('xx~^~yy~^~zz','~^~','yy');

Result:

string_to_array

Functions 159

-----------------

{xx,NULL,zz}

(1row)

• unnest(anyarray)

Expands an array to a set of rows.

SELECT unnest(ARRAY[1,2]);

Result:

unnest

--------

(2row)

• unnest(anyarray, anyarray [, …])

Expands multiple arrays (possibly of different types) to a set of rows. This is only allowed in the FROM clause.

SELECT *FROM unnest(ARRAY[1,2],ARRAY['foo','bar','baz']);

Result:

unnest | unnest

--------+--------

1| foo

2| bar

| baz

(1row)

4.2.14 Range Functions and Operators

• lower(anyrange)

Returns the lower bound of the input numeric range.

160 AgensGraph Developer Manual

SELECT *FROM lower(numrange(1.1,2.2));

Result:

lower

-------

1.1

(1row)

• upper(anyrange)

Returns upper of the input numeric range.

SELECT *FROM upper(numrange(1.1,2.2));

Result:

upper

-------

2.2

(1row)

• isempty(anyrange)

Returns a boolean value indicating whether the entered number range is empty.

SELECT *FROM isempty(numrange(1.1,2.2));

Result:

isempty

---------

(1row)

• lower_inc(anyrange)

Returns whether the lower bound of the entered number range exists.

Functions 161

SELECT *FROM lower_inc(numrange(1.1,2.2));

Result:

lower_inc

-----------

(1row)

• upper_inc(anyrange)

Returns a logical value indicating whether the upper bound of the input numeral range exists.

SELECT *FROM upper_inc(numrange(1.1,2.2));

Result:

lower_inc

-----------

(1row)

• lower_inf(anyrange)

Returns a logical value indicating whether the lower bound of the entered number range is ininite.

SELECT *FROM lower_inf('(,)'::daterange);

Result:

lower_inf

-----------

(1row)

• upper_inf(anyrange)

Returns a logical value indicating whether the upper bound of the entered number range is ininite.

162 AgensGraph Developer Manual

SELECT *FROM upper_inf('(,)'::daterange);

Result:

upper_inf

-----------

(1row)

• range_merge(anyrange, anyrange)

Returns the smallest range which includes both of the given ranges.

SELECT *FROM range_merge('[1,2)'::int4range,'[3,4)'::int4range);

Result:

range_merge

-------------

[1,4)

(1row)

4.2.15 Aggregate Functions

• array_agg(expression)

Returns input values, including nulls, concatenated into an array.

Argument Type(s): any non-array type

Return Type: array of the argument type

• array_agg(expression)

Returns input arrays concatenated into array of one higher dimension (note: inputs must all have same dimen-

sionality, and cannot be empty or NULL).

Argument Type(s): any array type

Return Type: same as argument data type

• avg(expression)

Returns the average (arithmetic mean) of all input values.

Argument Type(s): smallint, int, bigint, real, double precision, numeric, or interval

Functions 163

Return Type: numeric for any integer-type argument, double precision for a floating-point

argument, otherwise the same as the argument data type

• bit_and(expression)

Returns the bitwise AND of all non-null input values, or null if none.

Argument Type(s): smallint, int, bigint, or bit

Return Type: same as argument data type

• bit_or(expression)

Returns the bitwise OR of all non-null input values, or null if none.

Argument Type(s): smallint, int, bigint, or bit

Return Type: same as argument data type

• bool_and(expression)

Returns true if all input values are true, otherwise false.

Argument Type(s): bool

Return Type: bool

• bool_or(expression)

Returns true if at least one input value is true, otherwise false.

Argument Type(s): bool

Return Type: bool

• count(anything)

Returns the number of input rows.

Argument Type(s): any

Return Type: bigint

• count(expression)

Returns the number of input rows for which the value of expression is not null.

Argument Type(s): any

Return Type: bigint

• every(expression)

Equivalent to bool_and.

Argument Type(s): bool

Return Type: bool

164 AgensGraph Developer Manual

• json_agg(expression)

Aggregates values as a JSON array.

Argument Type(s): any

Return Type: json

• jsonb_agg(expression)

Aggregates values as a JSON array.

Argument Type(s): any

Return Type: jsonb

• json_object_agg(name, value)

Aggregates name/value pairs as a JSON object.

Argument Type(s): (any, any)

Return Type: json

• jsonb_object_agg(name, value)

Aggregates name/value pairs as a JSON object.

Argument Type(s): (any, any)

Return Type: jsonb

• max(expression)

Returns the maximum value of expression across all input values.

Argument Type(s): any numeric, string, date/time, network, or enum type, or arrays of these

types

Return Type: same as argument type

• min(expression)

Returns the minimum value of expression across all input values.

Argument Type(s): any numeric, string, date/time, network, or enum type, or arrays of these

types

Return Type: same as argument type

• string_agg(expression, delimiter)

Returns the input values concatenated into a string, separated by delimiter.

Argument Type(s): (text, text) or (bytea, bytea)

Return Type: same as argument types

• sum(expression)

Functions 165

Returns the sum of expression across all input values.

Argument Type(s): smallint, int, bigint, real, double precision, numeric, interval, or money

Return Type: bigint for smallint or int arguments, numeric for bigint arguments, otherwise

the same as the argument data type

• xmlagg(expression)

Returns the concatenation of XML values.

Argument Type(s): xml

Return Type: xml

• corr(Y, X)

Returns correlation coeficient of the two entered numbers.