XQuery and XPath Data Model 4.0

2 Terminology and Concepts

This section outlines a number of general concepts that apply throughout this specification.

In this document, examples and material labeled as “Note” are provided for explanatory purposes and are not normative.

2.9 Type System

Every value manipulated by XPath, XQuery, or XSLT is a sequence comprising zero or more items.

[Definition: A sequence type constrains the set of permitted sequences, by defining the permitted item types and the permitted number of items in the sequence (exactly zero, exactly one, zero-or-more, one-or-more, zero-or-one).]
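As an informal illustration of the definition above, the following Python sketch models a sequence type as an item test plus an occurrence constraint. The occurrence labels, the function name `matches_sequence_type`, and the use of Python lists for sequences are assumptions of this sketch and are not part of the data model.

```python
# Occurrence constraints named in the definition, as (min, max) item counts.
# None means "no upper bound". The labels are this sketch's own, not spec syntax.
OCCURRENCE = {
    "exactly-zero": (0, 0),
    "exactly-one": (1, 1),
    "zero-or-one": (0, 1),
    "zero-or-more": (0, None),
    "one-or-more": (1, None),
}


def matches_sequence_type(sequence, item_test, occurrence):
    """Return True if the length is permitted and every item passes item_test."""
    low, high = OCCURRENCE[occurrence]
    count = len(sequence)
    if count < low or (high is not None and count > high):
        return False
    return all(item_test(item) for item in sequence)


def is_int(item):
    """Hypothetical item test standing in for an item type such as xs:integer."""
    return isinstance(item, int) and not isinstance(item, bool)


print(matches_sequence_type([], is_int, "zero-or-more"))      # True
print(matches_sequence_type([1, 2], is_int, "zero-or-one"))   # False
print(matches_sequence_type([1, "a"], is_int, "one-or-more")) # False
```

The sketch keeps the two constraints separate on purpose: the cardinality check does not depend on the item type, and the item check does not depend on the cardinality.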

Every item is an instance of one or more item types:

All items are instances of the type item().
Every node is an instance of the type node(), and more specifically it is an instance of one of seven node kinds: document-node(), element(*), attribute(*), text(), comment(), processing-instruction(), or namespace-node(). Nodes may also be instances of more specific types characterized by the node name and type annotation.
Every atomic item is an instance of a specific atomic type determined by its type annotation; it is also an instance of every type from which that type is derived by restriction (directly or indirectly), and of every union type that includes that type as a member type.
Every function item is an instance of the generic type function(*), and also of a specific function type defining the types of the function's parameters and the type of the result.
A map item, as well as being a function, is also an instance of the generic map type map(*), of more specific map types map(K, V) defining the types of the keys and values, and perhaps of one or more record types that associate a type with specific key values.
An array item, as well as being a function, is also an instance of the generic array type array(*), and also of more specific array types array(M) defining the type of the array's members.

This section describes how item types relate to each other.

The diagrams below show how nodes, functions, primitive simple types, and user-defined types fit together into a type system. In the diagrams, connecting lines represent relationships between derived types and the types from which they are derived; the latter are always higher and to the left of the former.

The types xs:IDREFS, xs:NMTOKENS, xs:ENTITIES, and xs:numeric, together with user-defined list types and user-defined union types, are special in that they are lists or unions rather than types derived by extension or restriction.

The first diagram illustrates the relationship of various item types. Item types in the data model form a directed graph, rather than a hierarchy or lattice: in the relationship defined by the derived-from(A, B) function, some types are derived from more than one other type. Examples include functions (function(xs:string) as xs:int is substitutable for function(xs:NCName) as xs:int and also for function(xs:string) as xs:decimal), and union types (A is substitutable for the union type (A | B) and also for the union type (A | C)). In XDM, item types include node types, function types, and built-in atomic types. The list, which shows only hierarchic relationships, is therefore a simplification of the full model.
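The function-type example in the paragraph above can be checked mechanically. The following Python sketch is a simplified model only: it covers a small fragment of the atomic hierarchy, ignores occurrence indicators, and the names `BASE`, `is_subtype`, `FunctionType`, and `substitutable` are hypothetical. It assumes the usual rule that parameter types are compared contravariantly and result types covariantly.

```python
from dataclasses import dataclass

# A fragment of the built-in atomic hierarchy: type -> the type it is derived from.
BASE = {
    "xs:NCName": "xs:Name",
    "xs:Name": "xs:token",
    "xs:token": "xs:normalizedString",
    "xs:normalizedString": "xs:string",
    "xs:string": "xs:anyAtomicType",
    "xs:int": "xs:long",
    "xs:long": "xs:integer",
    "xs:integer": "xs:decimal",
    "xs:decimal": "xs:anyAtomicType",
}


def is_subtype(a, b):
    """True if atomic type a is b, or is derived from b (walks the BASE chain)."""
    while a is not None:
        if a == b:
            return True
        a = BASE.get(a)
    return False


@dataclass(frozen=True)
class FunctionType:
    params: tuple  # parameter type names
    result: str    # result type name


def substitutable(f, g):
    """True if a function of type f can be used where type g is required."""
    return (
        len(f.params) == len(g.params)
        # Parameters: whatever g accepts must be acceptable to f (contravariant).
        and all(is_subtype(gp, fp) for fp, gp in zip(f.params, g.params))
        # Result: whatever f returns must be acceptable as g's result (covariant).
        and is_subtype(f.result, g.result)
    )


f = FunctionType(("xs:string",), "xs:int")
print(substitutable(f, FunctionType(("xs:NCName",), "xs:int")))      # True
print(substitutable(f, FunctionType(("xs:string",), "xs:decimal")))  # True
print(substitutable(FunctionType(("xs:NCName",), "xs:int"), f))      # False
```

Because the same function type is substitutable for two unrelated types, no single parent can be assigned to it, which is why the relationship forms a graph rather than a tree.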

item (abstract)
- anyAtomicType (built-in atomic)
- node (node)
  - attribute (node)
    - user-defined attribute types (user-defined)
  - document (node)
    - user-defined document types (user-defined)
  - element (node)
    - user-defined element types (user-defined)
  - text (node)
  - comment (node)
  - processing-instruction (node)
  - namespace (node)
- function(*) (function item)
  - user-defined function item types (user-defined)
  - array(*) (function item)
    - user-defined array types (user-defined)
  - map(*) (function item)
    - user-defined map types (user-defined)
    - user-defined record types (user-defined)

Legend:

Supertype
- subtype

Abstract types (abstract)
Built-in atomic types (built-in atomic)
Node types (node)
Function item types (function item)
User-defined types (user-defined)

The XPath Data Model is the abstraction over which XPath expressions are evaluated. Historically, all of the items in the data model could be derived directly (nodes) or indirectly (typed values, sequences) from an XML document. However, as the XPath expression language has matured, new features have been added which require additional types of items to appear in the data model. These items have no direct XML serialization, but they are nevertheless part of the data model.

The next diagram shows all of the atomic types, including the primitive simple types and the built-in types derived from the primitive simple types. This includes all the built-in datatypes defined in [Schema Part 2]. Atomic types act both as item types (meaning they can be used to declare the types of variables and function arguments), and as schema types (meaning they can be used as type annotations on nodes).

anyAtomicType
- anyURI
- base64Binary
- boolean
- date
- dateTime
  - dateTimeStamp
- decimal
  - integer
    - long
      - int
        - short
          - byte
    - nonNegativeInteger
      - positiveInteger
      - unsignedLong
        - unsignedInt
          - unsignedShort
            - unsignedByte
    - nonPositiveInteger
      - negativeInteger
- double
- duration
  - dayTimeDuration
  - yearMonthDuration
- float
- gDay
- gMonth
- gMonthDay
- gYear
- gYearMonth
- hexBinary
- NOTATION
- QName
- string
  - normalizedString
    - token
      - NMTOKEN
      - Name
        - NCName
          - ENTITY
          - ID
          - IDREF
      - language
- time
- untypedAtomic

Legend:

Supertype
- subtype

Built-in atomic types
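An atomic item is an instance of its own type and of every type from which that type is derived, so the outline above can be read as a parent lookup. The following Python sketch parses a small fragment of such an outline and lists the supertypes of a given type. The fragment follows the derivations defined in XML Schema; the helper names `parse_outline` and `supertypes`, and the two-space indentation convention, are assumptions of this sketch.

```python
# A fragment of the atomic type outline, two spaces of indentation per level.
OUTLINE = """\
anyAtomicType
- decimal
  - integer
    - long
      - int
        - short
          - byte
- string
  - normalizedString
    - token
      - Name
        - NCName
          - ID
"""


def parse_outline(text):
    """Map each type name to its parent, using indentation depth as nesting level."""
    parent = {}
    stack = []  # stack[d] holds the most recent type seen at depth d
    for line in text.splitlines():
        name = line.lstrip(" -")
        if line.startswith((" ", "-")):
            depth = (len(line) - len(line.lstrip(" "))) // 2 + 1
        else:
            depth = 0  # the root line carries no bullet
        del stack[depth:]  # discard entries at this depth or deeper
        parent[name] = stack[-1] if stack else None
        stack.append(name)
    return parent


def supertypes(name, parent):
    """List the types from which `name` is derived, nearest first."""
    chain = []
    while parent[name] is not None:
        name = parent[name]
        chain.append(name)
    return chain


PARENT = parse_outline(OUTLINE)
print(supertypes("byte", PARENT))
# ['short', 'int', 'long', 'integer', 'decimal', 'anyAtomicType']
print(supertypes("ID", PARENT))
# ['NCName', 'Name', 'token', 'normalizedString', 'string', 'anyAtomicType']
```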

2.9.5 Map Items

Changes in 4.0

Constructors are added, and the single accessor function is now an iterator over the key/value pairs in the map. [Issue 1335 20 July 2024]
Ordered maps are introduced. [Issue 1651 PR 1703 14 January 2025]

[Definition: A map item (also called simply a map) is an item that represents an ordered sequence of key/value pairs, in which the keys are unique.] In other languages this is sometimes called a hash, dictionary, or associative array. The keys are atomic items, and each key in the map is unique (there is no other key to which it is equal). Each key is associated with a value that may be any sequence of zero or more items. There is no uniqueness constraint on values, only on keys. The semantics of equality when comparing keys are described in Section 13.2.1 fn:atomic-equal of the Functions and Operators specification (FO).

[Definition: The key/value pairs in a map are referred to as entries.]

[Definition: A map containing exactly one entry is referred to as a single-entry map.]

[Definition: A map containing no entries is referred to as an empty map.]

Note:

Maps have no intrinsic identity separate from their content. A map can be given a transient identity, represented by an id property in its label, by applying the fn:pin function. This property is expected to be used in defining operations for deep update of maps.

[Definition: The order of entries in a map is referred to as entry order.] The entry order affects the result of functions such as map:keys and map:for-each, and also determines the order of entries when a map is serialized using the JSON output method.
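The definitions above can be pictured with a toy model. The following Python sketch is illustrative only: the class `MapItem` and its methods are hypothetical, Python's own `==` and hashing stand in for fn:atomic-equal (whose rules differ in detail), and entry order is simply taken to be the order in which entries are supplied to the constructor.

```python
class MapItem:
    """Toy model of a map item: ordered entries with unique keys."""

    def __init__(self, entries=()):
        self._entries = {}  # Python dicts preserve insertion order
        for key, value in entries:
            if key in self._entries:
                raise ValueError(f"duplicate key: {key!r}")
            # Each value is a sequence of zero or more items.
            self._entries[key] = tuple(value)

    def keys(self):
        """Keys in entry order."""
        return list(self._entries)

    def get(self, key):
        """Value for key, or the empty sequence if the key is absent (sketch's choice)."""
        return self._entries.get(key, ())

    def size(self):
        """Number of entries."""
        return len(self._entries)


m = MapItem([("b", [1, 2]), ("a", []), ("c", ["x"])])
print(m.keys())          # ['b', 'a', 'c']
print(m.get("b"))        # (1, 2)
print(m.get("a"))        # ()
print(MapItem().size())  # 0  (an empty map)
```

Note that the entry for "a" exists even though its value is the empty sequence: the uniqueness constraint and the entry order apply to keys, not to values.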

Constructor and accessor functions for maps are defined in the following sections.

2.9.5.1 `empty-map` Constructor

dm:empty-map() as map(*)

The dm:empty-map constructor returns an empty map, that is, a map containing no key/value pairs.

Note:

In XPath an empty map may be constructed using the expression {} or map {}.

2.9.6 Array Items

Changes in 4.0

Constructors are added, and the single accessor function is now an iterator over the members of the array. [Issue 1335 20 July 2024]

[Definition: An array item (also called simply an array) is a value that represents an array.] [Definition: An array is an ordered list of values; these values are called the members of the array.] Unlike an item in a sequence, a member of an array can be any value (including a sequence or an array). The number of members in an array is called its size, and members are referenced by their position, in the range 1 to the size of the array.

[Definition: An array containing exactly one member is referred to as a single-member array.]

[Definition: An array containing no members is referred to as an empty array.]
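The difference between members and items can be sketched with a toy model. The following Python code is illustrative only: the class `ArrayItem` and its methods are hypothetical, each member is stored as a tuple standing in for a sequence, and raising `IndexError` for an out-of-range position is this sketch's choice rather than behavior defined here.

```python
class ArrayItem:
    """Toy model of an array item: an ordered list of members, each a sequence."""

    def __init__(self, members=()):
        # Each member is a value, that is, a sequence of zero or more items.
        self._members = tuple(tuple(member) for member in members)

    def size(self):
        """Number of members."""
        return len(self._members)

    def get(self, position):
        """Member at a 1-based position."""
        if not 1 <= position <= self.size():
            raise IndexError(f"position {position} out of range 1..{self.size()}")
        return self._members[position - 1]


inner = ArrayItem([[10], [20]])
a = ArrayItem([[1, 2], [], [inner]])
print(a.size())               # 3
print(a.get(1))               # (1, 2)
print(a.get(2))               # ()
print(a.get(3)[0].size())     # 2  (a member holding a nested array)
print(ArrayItem().size())     # 0  (an empty array)
print(ArrayItem([[7]]).size())  # 1  (a single-member array)
```

The second member is an empty sequence yet still counts toward the size, which is the sense in which a member can be any value.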

Note:

Arrays have no intrinsic identity separate from their content. An array can be given a transient identity, represented by an id property in its label, by applying the fn:pin function. This property is expected to be used in defining operations for deep update of arrays.

Constructor and accessor functions for arrays are defined in the following sections.

2.9.6.1 `empty-array` Constructor

dm:empty-array() as array(*)

The dm:empty-array constructor returns an empty array, that is, an array item containing no members.

The function is exposed in XPath as an empty array constructor, written [] or array {}.

C Glossary (Non-Normative)

absent

When a property has no value, we say that it is absent.

array item

An array item (also called simply an array) is a value that represents an array.

atomic item

An atomic item is a pair (T, D) where T (the type annotation) is an atomic type, and D (the datum) is a point in the value space of T.

atomic type

An atomic type is either a primitive simple type with variety atomic, or a type derived by restriction from another atomic type.

character

A character is any Unicode character.

codepoint

A codepoint is a non-negative integer assigned to a character by the Unicode consortium, or reserved for future assignment to a character.

compatible (of schemas)

Two schemas X and Y are compatible if the union of X and Y is a valid schema.

datum

The datum of an atomic item is a point in the value space of its type, which is also a point in the value space of the primitive type from which that type is derived.

document

A tree whose root node is a document node is referred to as a document.

document order

A document order is defined among all the nodes accessible during a given query or transformation. Document order is a total ordering, although the relative order of some nodes is implementation-dependent. Informally, document order is the order in which nodes appear in the XML serialization of a document.

empty array

An array containing no members is referred to as an empty array.

empty map

A map containing no entries is referred to as an empty map.

entry

The key/value pairs in a map are referred to as entries.

entry order

The order of entries in a map is referred to as entry order.

expanded QName

An expanded QName is a triple consisting of a possibly absent prefix, a possibly absent namespace URI, and a local name.

fragment

A tree whose root node is not a document node is referred to as a fragment.

function arity

The arity of a function item is the number of its parameters.

function item

A function item is an item that can be called.

function signature

A function signature represents the type of a function.

implementation defined

Implementation-defined indicates an aspect that may differ between implementations, but must be specified by the implementer for each particular implementation.

implementation dependent

Implementation-dependent indicates an aspect that may differ between implementations, is not specified by this or any W3C specification, and is not required to be specified by the implementer for any particular implementation.

incompletely validated

An incompletely validated document is an XML document that has a corresponding schema but whose schema-validity assessment has resulted in one or more element or attribute information items being assigned values other than ‘valid’ for the [validity] property in the PSVI.

instance of the data model

Every instance of the data model is a sequence.

item

An item is either a node, a function, or an atomic item.

item type

An item type represents a class of items.

labeled item

A labeled item is a pair (S, L) where S (called the subject) is any item, and L (called the label) is a map containing supplementary information about the item.

map item

A map item (also called simply a map) is an item that represents an ordered sequence of key/value pairs, in which the keys are unique.

member

An array is an ordered list of values; these values are called the members of the array.

Namespace URI

This specification uses the term Namespace URI to refer to a namespace name, whether or not it is a valid URI or IRI.

node

There are seven kinds of nodes in the data model: document, element, attribute, text, namespace, processing instruction, and comment.

primitive simple type

The primitive simple types are the types defined in 2.2.1 Types adopted from XML Schema.

root node

The root node is the topmost node of a tree, the node with no parent.

schema

Following the terminology of [Schema Part 1], a schema is defined as a set of schema components. Schema components include, for example, element declarations and type definitions.

schema type

A schema type corresponds to a type definition component as defined in XSD.

sequence

A sequence is an ordered collection of zero or more items.

sequence type

A sequence type constrains the set of permitted sequences, by defining the permitted item types and the permitted number of items in the sequence (exactly zero, exactly one, zero-or-more, one-or-more, zero-or-one).

single-entry map

A map containing exactly one entry is referred to as a single-entry map.

single-member array

An array containing exactly one member is referred to as a single-member array.

stable

Document order is stable, which means that the relative order of two nodes will not change during the processing of a given query or transformation, even if this order is implementation-dependent.

string

A string is a sequence of zero or more characters.

type annotation

The term type annotation has two slightly different meanings. For an atomic item, the type annotation of the value is the most specific atomic type that it is an instance of (it is also an instance of every type from which that type is derived). For an element or attribute node, the type annotation is the schema type (a simple or complex type) against which the node has been validated, defaulting to xs:untypedAtomic for unvalidated attribute nodes, and xs:untyped for unvalidated element nodes.

value

Because every value is a sequence, the term value is used synonymously with sequence.

XQuery and XPath Data Model 4.0

W3C Editor's Draft 23 February 2026

Abstract

Status of this Document

Dedication

2 Terminology and Concepts

2.9 Type System

2.9.5 Map Items

2.9.5.1 `empty-map` Constructor

2.9.6 Array Items

2.9.6.1 `empty-array` Constructor

C Glossary (Non-Normative)
