XPath and XQuery Functions and Operators 4.0

1 Introduction

Changes in 4.0 ⬇

Use the arrows to browse significant changes since the 3.1 version of this specification.
Sections with significant changes are marked Δ in the table of contents. New functions introduced in this version are marked ➕ in the table of contents.

The purpose of this document is to define functions and operators for inclusion in XPath 4.0, XQuery 4.0, and XSLT 4.0. The exact syntax used to call these functions and operators is specified in [XML Path Language (XPath) 4.0], [XQuery 4.0: An XML Query Language] and [XSL Transformations (XSLT) Version 4.0].

This document defines three classes of functions:

General purpose functions, available for direct use in user-written queries, stylesheets, and XPath expressions, whose arguments and results are values defined by the [XQuery and XPath Data Model (XDM) 3.1].
Constructor functions, used for creating instances of a datatype from values of (in general) a different datatype. These functions are also available for general use; they are named after the datatype that they return, and they always take a single argument.
Functions that specify the semantics of operators defined in [XML Path Language (XPath) 4.0] and [XQuery 4.0: An XML Query Language]. These exist for specification purposes only, and are not intended for direct calling from user-written code.

[XML Schema Part 2: Datatypes Second Edition] defines a number of primitive and derived datatypes, collectively known as built-in datatypes. This document defines functions and operations on these datatypes as well as the other types (for example, nodes and sequences of nodes) defined in Section 2.7 Schema Information ^DM31 of the [XQuery and XPath Data Model (XDM) 3.1]. These functions and operations are available for use in [XML Path Language (XPath) 4.0], [XQuery 4.0: An XML Query Language] and any other host language that chooses to reference them. In particular, they may be referenced in future versions of XSLT and related XML standards.

[XSD 1.1 Part 2] adds to the datatypes defined in [XML Schema Part 2: Datatypes Second Edition]. It introduces a new derived type xs:dateTimeStamp, and it incorporates as built-in types the two types xs:yearMonthDuration and xs:dayTimeDuration which were previously XDM additions to the type system. In addition, XSD 1.1 clarifies and updates many aspects of the definitions of the existing datatypes: for example, it extends the value space of xs:double to allow both positive and negative zero, and extends the lexical space to allow +INF; it modifies the value space of xs:Name to permit additional Unicode characters; it allows year zero and disallows leap seconds in xs:dateTime values; and it allows any character string to appear as the value of an xs:anyURI item. Implementations of this specification may support either XSD 1.0 or XSD 1.1 or both.

In some cases, this specification references XSD for the semantics of operations such as the effect of matching using regular expressions, or conversion of atomic items to strings. In most such cases there is no intended technical difference between the XSD 1.0 and XSD 1.1 specifications, but the 1.1 version often provides clearer explanations and sometimes also corrects technical errors. In such cases this specification often chooses to reference the XSD 1.1 specification. This should not be taken as implying that it is necessary to invoke an XSD 1.1 processor.

References to specific sections of some of the above documents are indicated by cross-document links in this document. Each such link consists of a pointer to a specific section followed a superscript specifying the linked document. The superscripts have the following meanings: XQ [XQuery 4.0: An XML Query Language], XT [XSL Transformations (XSLT) Version 4.0], XP [XML Path Language (XPath) 4.0], and DM [XQuery and XPath Data Model (XDM) 4.0].

1.3 Namespaces and prefixes

The functions and operators defined in this document are contained in one of several namespaces (see [Namespaces in XML]) and referenced using an xs:QName.

This document uses conventional prefixes to refer to these namespaces. User-written applications can choose a different prefix to refer to the namespace, so long as it is bound to the correct URI. The host language may also define a default namespace for function calls, in which case function names in that namespace need not be prefixed at all. In many cases the default namespace will be http://www.w3.org/2005/xpath-functions, allowing a call on the fn:name function (for example) to be written as name() rather than fn:name(); in this document, however, all example function calls are explicitly prefixed.

The URIs of the namespaces and the conventional prefixes associated with them are:

http://www.w3.org/2001/XMLSchema for constructors — associated with xs.
The section 21 Constructor functions22 Constructor functions2122 Constructor functions defines constructor functions for the built-in datatypes defined in [XML Schema Part 2: Datatypes Second Edition] and in Section 2.7 Schema Information ^DM31 of [XQuery and XPath Data Model (XDM) 3.1]. These datatypes and the corresponding constructor functions are in the XML Schema namespace, http://www.w3.org/2001/XMLSchema, and are named in this document using the xs prefix.
http://www.w3.org/2005/xpath-functions for functions — associated with fn.
The namespace prefix used in this document for most functions that are available to users is fn.
http://www.w3.org/2005/xpath-functions/math for functions — associated with math.
This namespace is used for some mathematical functions. The namespace prefix used in this document for these functions is math. These functions are available to users in exactly the same way as those in the fn namespace.
http://www.w3.org/2005/xpath-functions/map for functions — associated with map.
This namespace is used for some functions that manipulate maps (see 18.4 Functions that Operate on Maps). The namespace prefix used in this document for these functions is map. These functions are available to users in exactly the same way as those in the fn namespace.
http://www.w3.org/2005/xpath-functions/array for functions — associated with array.
This namespace is used for some functions that manipulate maps (see 19.2 Functions that Operate on Arrays). The namespace prefix used in this document for these functions is array. These functions are available to users in exactly the same way as those in the fn namespace.
http://www.w3.org/2005/xqt-errors — associated with err.
There are no functions in this namespace; it is used for error codes.
This document uses the prefix err to represent the namespace URI http://www.w3.org/2005/xqt-errors, which is the namespace for all XPath and XQuery error codes and messages. This namespace prefix is not predeclared and its use in this document is not normative.
http://www.w3.org/2010/xslt-xquery-serialization — associated with output.
There are no functions in this namespace: it is used for serialization parameters, as described in [XSLT and XQuery Serialization 3.1]
Functions defined with the op prefix are described here to underpin the definitions of the operators in [XML Path Language (XPath) 4.0], [XQuery 4.0: An XML Query Language] and [XSL Transformations (XSLT) Version 4.0]. These functions are not available directly to users, and there is no requirement that implementations should actually provide these functions. For this reason, no namespace is associated with the op prefix. For example, multiplication is generally associated with the * operator, but it is described as a function in this document:
op:numeric-multiply(
$arg1 as xs:numeric,
$arg2 as xs:numeric
) as xs:numeric
Sometimes there is a need to use an operator as a function. To meet this requirement, the function fn:op takes any simple binary operator as its argument, and returns a corresponding function. So for example fn:for-each-pair($seq1, $seq2, op("+")) performs a pairwise addition of the values in two input sequences.

Note:

The above namespace URIs are not expected to change from one version of this document to another. The contents of these namespaces may be extended to allow additional functions (and errors, and serialization parameters) to be defined.

1.8 Type System

The diagrams in this section show how nodes, functions, primitive simple types, and user defined types fit together into a type system. This type system comprises two distinct subsystems that both include the primitive atomic types. In the diagrams, connecting lines represent relationships between derived types and the types from which they are derived; the former are always below and to the right of the latter.

The xs:IDREFS, xs:NMTOKENS, xs:ENTITIES types, and xs:numeric and both the user-defined list types and user-defined union types are special types in that these types are lists or unions rather than types derived by extension or restriction.

1.8.1 Item Types

The first diagram illustrates the relationship of various item types.

Item types are used to characterize the various types of item that can appear in a sequence (nodes, atomic items, and functions), and they are therefore used in declaring the types of variables or the argument types and result types of functions.

In XDM, item types include node types, function types, and built-in atomic types. Item types form a directed graph, rather than a hierarchy or lattice: in the relationship defined by the derived-from(A, B) function, some types are derived from more than one other type. Examples include functions (function(xs:string) as xs:int is substitutable for function(xs:NCName) as xs:int and also for function(xs:string) as xs:decimal), and choice types (A is substitutable for the choice type (A | B) and also for (A | C). Record types provide an alternative way of categorizing maps: the instances of record(longitude, latitude) overlap with the instances of map(xs:string, xs:double). The diagram, which shows only hierarchic relationships, is therefore a simplification of the full model.

item (abstract)
- anyAtomicType (built-in atomic)
- node (node)GNode (node)nodeGNode (node)
  - attribute (node)XNode (node)attributeXNode (node)
    - user-defined attribute types (user-defined)
      attribute (node)
      - user-defined attribute types (user-defined)
    - document (node)
      - user-defined document types (user-defined)
    - document (node)
      - user-defined document types (user-defined)
    - element (node)
      - user-defined element types (user-defined)
    - element (node)
      - user-defined element types (user-defined)
    - text (node)
    - text (node)
    - comment (node)
    - comment (node)
    - processing-instruction (node)
    - processing-instruction (node)
    - namespace (node)
    - namespace (node)
  - document (node)
    - user-defined document types (user-defined)
  - document (node)
    - user-defined document types (user-defined)
  - element (node)
    - user-defined element types (user-defined)
  - element (node)
    - user-defined element types (user-defined)
  - text (node)
  - text (node)
  - comment (node)
  - comment (node)
  - processing-instruction (node)
  - processing-instruction (node)
  - namespace (node)JNode (node)namespaceJNode (node)
- function(*) (function item)
  - user-defined function item types (user-defined)
  - array(*) (function item)
    - user-defined array types (user-defined)
  - map(*) (function item)
    - user-defined map types (user-defined)
    - user-defined record types (user-defined)

Legend:

Supertype
- subtype

Abstract types (abstract)
Built-in atomic types (built-in atomic)
Node types (node)
Function item types (function item)
User-defined types (user-defined)

1.9 Terminology

Changes in 4.0 ⬇ ⬆

The term atomic value has been replaced by atomic item. [Issue 1337 PR 1361 2 August 2024]

The terminology used to describe the functions and operators on types defined in [XML Schema Part 2: Datatypes Second Edition] is defined in the body of this specification. The terms defined in this section are used in building those definitions.

Note:

Following in the tradition of [XML Schema Part 2: Datatypes Second Edition], the terms type and datatype are used interchangeably.

1.9.5 Properties of functions

This section is concerned with the question of whether two calls on a function, with the same arguments, may produce different results.

In this section the term function, unless otherwise specified, applies equally to function definitions^XP (which can be the target of a static function call) and function items^DM (which can be the target of a dynamic function call).

[Definition] An execution scope is a sequence of calls to the function library during which certain aspects of the state are required to remain invariant. For example, two calls to fn:current-dateTime within the same execution scope will return the same result. The execution scope is defined by the host language that invokes the function library. In XSLT, for example, any two function calls executed during the same transformation are in the same execution scope (except that static expressions, such as those used in use-when attributes, are in a separate execution scope).

The following definition explains more precisely what it means for two function calls to return the same result:

[Definition] Two values $V1 and $V2 are defined to be identical if they contain the same number of items and the items are pairwise identical. Two items are identical if and only if one of the following conditions applies:

Both items are atomic items, of precisely the same type, and the values are equal as defined using the eq operator, using the Unicode codepoint collation when comparing strings.
Both items are nodes, and represent the same node.
Both items are maps, both maps have the same number of entries, and for every entry E₁ in the first map there is an entry E₂ in the second map such that the keys of E₁ and E₂ are the same key, and the corresponding values V₁ and V₂ are identical.
Both items are arrays, both arrays have the same number of members, and the members are pairwise identical.
Both items are function items, neither item is a map or array, and the two function items have the same function identity. The concept of function identity is explained in Section 7.18.1 Function Items^DM.

Some functions produce results that depend not only on their explicit arguments, but also on the static and dynamic context.

[Definition] A function definition^XP may have the property of being context-dependent: the result of such a function depends on the values of properties in the static and dynamic evaluation context of the caller as well as on the actual supplied arguments (if any). A function definition may be context-dependent for some arities in its arity range, and context-independent for others: for example fn:name#0 is context-dependent while fn:name#1 is context-independent.

[Definition] A function definition^XP that is not context-dependent is called context-independent.

The main categories of context-dependent functions are:

Functions that explicitly deliver the value of a component of the static or dynamic context, for example fn:static-base-uri, fn:default-collation, fn:position, or fn:last.
Functions with an optional parameter whose default value is taken from the static or dynamic context of the caller, usually either the context value (for example, fn:node-name) or the default collation (for example, fn:index-of).
Functions that use the static context of the caller to expand or disambiguate the values of supplied arguments: for example fn:doc expands its first argument using the static base URI of the caller, and xs:QName expands its first argument using the in-scope namespaces of the caller.

[Definition] A function is focus-dependent if its result depends on the focus^XP31 (that is, the context item, position, or size) of the caller.

[Definition] A function that is not focus-dependent is called focus-independent.

Note:

Some functions depend on aspects of the dynamic context that remain invariant within an execution scope, such as the implicit timezone. Formally this is treated in the same way as any other context dependency, but internally, the implementation may be able to take advantage of the fact that the value is invariant.

Note:

User-defined functions in XQuery and XSLT may depend on the static context of the function definition (for example, the in-scope namespaces) and also in a limited way on the dynamic context (for example, the values of global variables). However, the only way they can depend on the static or dynamic context of the caller — which is what concerns us here — is by defining optional parameters whose default values are context-dependent.

Note:

Because the focus is a specific part of the dynamic context, all focus-dependent functions are also context-dependent. A context-dependent function, however, may be either focus-dependent or focus-independent.

A function definition that is context-dependent can be used as the target of a named function reference, can be partially applied, and can be found using fn:function-lookup. The principle in such cases is that the static context used for the function evaluation is taken from the static context of the named function reference, partial function application, or the call on fn:function-lookup; and the dynamic context for the function evaluation is taken from the dynamic context of the evaluation of the named function reference, partial function application, or the call of fn:function-lookup. These constructs all deliver a function item^DM having a captured context based on the static and dynamic context of the construct that created the function item. This captured context forms part of the closure of the function item.

The result of a dynamic call to a function item never depends on the static or dynamic context of the dynamic function call, only (where relevant) on the captured context held within the function item itself.

The fn:function-lookup function is a special case because it is potentially dependent on everything in the static and dynamic context. This is because the static and dynamic context of the call to fn:function-lookupform the captured context of the function item that fn:function-lookup returns.

[Definition] A function that is guaranteed to produce identical results from repeated calls within a single execution scope if the explicit and implicit arguments are identical is referred to as deterministic.

[Definition] A function that is not deterministic is referred to as nondeterministic.

All functions defined in this specification are deterministic unless otherwise stated. Exceptions include the following:

[Definition] Some functions (such as fn:distinct-values, fn:unordered, map:keys, and map:for-each) produce results in an implementation-defined or implementation-dependent order. In such cases two calls with the same arguments are not guaranteed to produce the results in the same order. These functions are said to be nondeterministic with respect to ordering.
Some functions (such as fn:analyze-string, fn:parse-xml, fn:parse-xml-fragment, fn:parse-html, and fn:json-to-xml) construct a tree of nodes to represent their results. There is no guarantee that repeated calls with the same arguments will return the same identical node (in the sense of the is operator). However, if non-identical nodes are returned, their content will be the same in the sense of the fn:deep-equal function. Such a function is said to be nondeterministic with respect to node identity.
Some functions (such as fn:doc and fn:collection) create new nodes by reading external documents. Such functions are guaranteed to be deterministic with the exception that an implementation is allowed to make them nondeterministic as a user option.

Where the results of a function are described as being (to a greater or lesser extent) implementation-defined or implementation-dependent, this does not by itself remove the requirement that the results should be deterministic: that is, that repeated calls with the same explicit and implicit arguments must return identical results.

[Definition] The function fn:concat is defined to be variadic: it accepts any number of arguments. No other function has this property.

2 Processing nodes

2.1 Accessors

Accessors and their semantics are described in [XQuery and XPath Data Model (XDM) 3.1]. Some of these accessors are exposed to the user through the functions described below.

Each of these functions has an arity-zero signature which is equivalent to the arity-one form, with the context value supplied as the implicit first argument. In addition, each of the arity-one functions accepts an empty sequence as the argument, in which case it generally delivers an empty sequence as the result: the exception is fn:string, which delivers a zero-length string.

Function	Accessor	Accepts	Returns
`fn:node-name`	`node-name`	node (optional)	`xs:QName` (optional)
`fn:nilled`	`nilled`	node (optional)	`xs:boolean` (optional)
`fn:string`	`string-value`	item (optional)	`xs:string`
`fn:data`	`typed-value`	zero or more items	a sequence of atomic items
`fn:base-uri`	`base-uri`	node (optional)	`xs:anyURI` (optional)
`fn:document-uri`	`document-uri`	node (optional)	`xs:anyURI` (optional)

Function	Meaning
`fn:node-name`	Returns the name of a node, as an `xs:QName`.
`fn:nilled`	Returns `true` for an element that is nilled.
`fn:string`	Returns the value of `$value` represented as an `xs:string`.
`fn:data`	Returns the result of atomizing a sequence. This process flattens arrays, and replaces nodes by their typed values.
`fn:base-uri`	Returns the base URI of a node.
`fn:document-uri`	Returns the URI of a resource where a document can be found, if available.

2.1.1 fn:node-name

Summary

Returns the name of a node, as an xs:QName.

Signature

`fn:node-name`(
`$node`	`as` `node()?`	`:=` `.`
) `as` `xs:QName?`

Properties

The zero-argument form of this function is deterministic, context-dependent, and focus-dependent.

The one-argument form of this function is deterministic, context-independent, and focus-independent.

Rules

If the argument is omitted, it defaults to the context value (.).

If $node is the empty sequence, the empty sequence is returned.

Otherwise, the function returns the result of the dm:node-name accessor as defined in [XQuery and XPath Data Model (XDM) 3.1] (see Section 6.7.107.5.10 node-name Accessor^DM).

Error Conditions

The following errors may be raised when $node is omitted:

If the context value is absent^DM, type error [err:XPDY0002]^XP.
If the context value is not an instance of the sequence type node()?, type error [err:XPTY0004]^XP.

Notes

For element and attribute nodes, the name of the node is returned as an xs:QName, retaining the prefix, namespace URI, and local part.

For processing instructions, the name of the node is returned as an xs:QName in which the prefix and namespace URI are absent^DM.

For a namespace node, the function returns an empty sequence if the node represents the default namespace; otherwise it returns an xs:QName in which prefix and namespace URI are absent^DM and the local part is the namespace prefix being bound.

For all other kinds of node, the function returns the empty sequence.

Examples

Variables
let $e := <doc> <p id="alpha" xml:id="beta">One</p> <p id="gamma" xmlns="http://example.com/ns">Two</p> <ex:p id="delta" xmlns:ex="http://example.com/ns">Three</ex:p> <?pi 3.14159?> </doc>

Expression	Result
`node-name($e//*[@id = 'alpha'])`	QName("", "p")
`node-name($e//*[@id = 'gamma'])`	QName("http://example.com/ns", "p")
`node-name($e//*[@id = 'delta'])`	QName("http://example.com/ns", "ex:p")
`node-name($e//processing-instruction())`	QName("", "pi")
`node-name($e//*[@id = 'alpha']/text())`	()
`node-name($e//*[@id = 'alpha']/@id)`	QName("", "id")
`node-name($e//*[@id = 'alpha']/@xml:id)`	#xml:id

2.1.2 fn:nilled

Summary

Returns true for an element that is nilled.

Signature

`fn:nilled`(
`$node`	`as` `node()?`	`:=` `.`
) `as` `xs:boolean?`

Properties

The zero-argument form of this function is deterministic, context-dependent, and focus-dependent.

The one-argument form of this function is deterministic, context-independent, and focus-independent.

Rules

If the argument is omitted, it defaults to the context value (.).

If $node is the empty sequence, the function returns the empty sequence.

Otherwise the function returns the result of the dm:nilled accessor as defined in [XQuery and XPath Data Model (XDM) 3.1] (see Section 6.7.87.5.8 nilled Accessor^DM).

Error Conditions

The following errors may be raised when $node is omitted:

If the context value is absent^DM, type error [err:XPDY0002]^XP
If the context value is not an instance of the sequence type node()?, type error [err:XPTY0004]^XP.

Notes

If $node is not an element node, the function returns the empty sequence.

If $node is an untyped element node, the function returns false.

In practice, the function returns true only for an element node that has the attribute xsi:nil="true" and that is successfully validated against a schema that defines the element to be nillable; the detailed rules, however, are defined in [XQuery and XPath Data Model (XDM) 3.1].

2.1.3 fn:string

Summary

Returns the value of $value represented as an xs:string.

Signature

`fn:string`(
`$value`	`as` `item()?`	`:=` `.`
) `as` `xs:string`

Properties

The zero-argument form of this function is deterministic, context-dependent, and focus-dependent.

The one-argument form of this function is deterministic, context-independent, and focus-independent.

Rules

In the zero-argument version of the function, $value defaults to the context value. That is, calling fn:string() is equivalent to calling fn:string(.).

If $value is the empty sequence, the function returns the zero-length string.

If $value is aan XNode^nodeDM, the function returns the string value of the node, as obtained using the dm:string-value accessor defined in [XQuery and XPath Data Model (XDM) 3.1] (see Section 6.7.127.5.12 string-value Accessor^DM).

If $value is a JNode^DM, the function returns the result of string(JNode-value($value)). This will fail in the case where JNode-value($value) is a map or an array.

If $value is an atomic item, the function returns the result of the expression $value cast as xs:string (see 22 Casting23 Casting2223 Casting).

In all other cases, a dynamic error occurs (see below).

Error Conditions

The following errors may be raised when $value is omitted:

If the context value is absent^DM, type error [err:XPDY0002]^XP.
If the context value is not an instance of the sequence type item()?, type error [err:XPTY0004]^XP.

A type error is raised [err:FOTY0014] if $value is a function item (this includes maps and arrays).

Notes

Every node has a string value, even an element with element-only content (which has no typed value). Moreover, casting an atomic item to a string always succeeds. Functions, maps, and arrays have no string value, so these are the only arguments that satisfy the typesatisfy the type signature but cause failure. Applying the string signature but cause failurefunction to a JNode succeeds if the JNode wraps a simple value such as a string, number, or boolean, or if it wraps an XNode, but it fails in the case where the JNode wraps a map or an array.

Examples

Variables
let $para := <para>There lived a <term author="Tolkien">hobbit</term>.</para>

Expression	Result
`string(23)`	"23"
`string(false())`	"false"
`string("Paris")`	"Paris"
`string((1, 2, 3))`	Raises error XPTY0004.
`string([ [ 1, 2 ], [ 3, 4 ] ])`	Raises error FOTY0014.
`string(abs#1)`	Raises error FOTY0014.
`string(JNode({"x": [10, 20, 30]}) ? x ? 3)`	"30"
`string(JNode({"x": [10, 20, 30]}) ? x ? 3)`	"30"
`string($para)`	"There lived a hobbit."

2.1.4 fn:data

Summary

Returns the result of atomizing a sequence. This process flattens arrays, and replaces nodes by their typed values.

Signature

`fn:data`(
`$input`	`as` `item()*`	`:=` `.`
) `as` `xs:anyAtomicType*`

Properties

The zero-argument form of this function is deterministic, context-dependent, and focus-dependent.

The one-argument form of this function is deterministic, context-independent, and focus-independent.

Rules

If the argument is omitted, it defaults to the context value (.).

The result of fn:data is the sequence of atomic items produced by applying the following rules to each item in $input:

If the item is an atomic item, it is appended to the result sequence.
If the item is aan XNode^nodeDM, the typed value of the node is appended to the result sequence. The typed value is a sequence of zero or more atomic items: specifically, the result of the dm:typed-value accessor as defined in [XQuery and XPath Data Model (XDM) 3.1] (See Section 6.7.147.5.14 typed-value Accessor^DM).
If the item is a JNode^DM, the atomized value of its ¶value property is appended to the result sequence.
If the item is a JNode^DM, the atomized value of its ¶value property is appended to the result sequence.
If the item is an array, the result of applying fn:data to each member of the array, in order, is appended to the result sequence.

Error Conditions

A type error is raised [err:FOTY0012] if an item in the sequence $input is a node that does not have a typed value.

A type error is raised [err:FOTY0013] if an item in the sequence $input is a function item other than an array.

A type error is raised [err:XPDY0002]^XP if $input is omitted and the context value is absent^DM.

Notes

The process of applying the fn:data function to a sequence is referred to as atomization. In many cases an explicit call on fn:data is not required, because atomization is invoked implicitly when a node or sequence of nodes is supplied in a context where an atomic item or sequence of atomic items is required.

The result of atomizing an empty sequence is an empty sequence.

The result of atomizing an empty array is an empty sequence.

Examples

Variables
let $para := <para>There lived a <term author="Tolkien">hobbit</term>.</para>

Expression	Result
`data(123)`	123
`data((123, 456))`	123, 456
`data([ [ 1, 2 ], [ 3, 4 ] ])`	1, 2, 3, 4
`data($para)`	xs:untypedAtomic("There lived a hobbit.")
`data($para/term/@author)`	xs:untypedAtomic("Tolkien")
`data(abs#1)`	Raises error FOTY0013.

2.1.5 fn:base-uri

Summary

Returns the base URI of a node.

Signature

`fn:base-uri`(
`$node`	`as` `node()?`	`:=` `.`
) `as` `xs:anyURI?`

Properties

The zero-argument form of this function is deterministic, context-dependent, and focus-dependent.

The one-argument form of this function is deterministic, context-independent, and focus-independent.

Rules

The zero-argument version of the function returns the base URI of the context node: it is equivalent to calling fn:base-uri(.).

The single-argument version of the function behaves as follows:

If $node is the empty sequence, the function returns the empty sequence.
Otherwise, the function returns the value of the dm:base-uri accessor applied to the node $node. This accessor is defined, for each kind of node, in the XDM specification (See Section 6.7.27.5.2 base-uri Accessor^DM).

Note:

As explained in XDM, document, element and processing-instruction nodes have a base-uri property which may be empty. The base-uri property for all other node kinds is the empty sequence. The dm:base-uri accessor returns the base-uri property of a node if it exists and is non-empty; otherwise it returns the result of applying the dm:base-uri accessor to its parent, recursively. If the node does not have a parent, or if the recursive ascent up the ancestor chain encounters a parentless node whose base-uri property is empty, the empty sequence is returned. In the case of namespace nodes, however, the result is always an empty sequence — it does not depend on the base URI of the parent element.

2.1.6 fn:document-uri

Changes in 4.0 ⬇ ⬆

The constraints on the result of the function have been relaxed. [Issues 898 1161 PR 1265 2 July 2024]

Summary

Returns the URI of a resource where a document can be found, if available.

Signature

`fn:document-uri`(
`$node`	`as` `node()?`	`:=` `.`
) `as` `xs:anyURI?`

Properties

The zero-argument form of this function is deterministic, context-dependent, and focus-dependent.

The one-argument form of this function is deterministic, context-independent, and focus-independent.

Rules

If the argument is omitted, it defaults to the context value (.).

If $node is the empty sequence, the function returns the empty sequence.

If $node is not a document node, the function returns the empty sequence.

Otherwise, the function returns the value of the document-uri accessor applied to $node, as defined in [XQuery and XPath Data Model (XDM) 3.1] (See Section 6.6.1.27.4.1.2 Accessors^DM).

Error Conditions

The following errors may be raised when $node is omitted:

If the context value is absent^DM, type error [err:XPDY0002]^XP
If the context value is not an instance of the sequence type node()?, type error [err:XPTY0004]^XP.

Notes

In the 3.1 version of this specification, it was mandated that two distinct documents could not have the same document-uri property: more specifically, it was guaranteed that for any document node $D, either document-uri($D) would be absent, or doc(document-uri($D)) would return $D.

For various reasons, this constraint has proved impractical. Different parts of an application may read the same external resource in different ways, for example with or without validation or whitespace stripping, leading to different document nodes derived from the same external resource having the same document-uri property. In addition, the specification explicitly allows implementations, at user request, to relax the requirements for determinism of resource access functions, which makes it possible for multiple calls of functions such as fn:doc, fn:json-doc, or fn:collection to return different results for the same supplied URI.

Although the uniqueness of the document-uri property is no longer an absolute constraint, it is still desirable that implementations should where possible respect the principle that URIs are usable as identifiers for resources.

In the case of a document node $D returned by the fn:doc function, it will generally be the case that fn:document-uri($D) returns a URI $U such that a call on fn:doc($U) in the same dynamic context will return the same document node $D. The URI $U will not necessarily be the same URI that was originally passed to the fn:doc function, since several URIs may identify the same resource.

It is recommended that implementations of fn:collection should ensure that any documents included in the returned collection, if they have a non-empty fn:document-uri property, should be such that a call on fn:doc supplying this URI returns the same document node.

2.2 Other functions on nodes

This section specifies further functions on nodes. Nodes are formally defined in Section 6 Nodes ^DM31.

Function	Meaning
`fn:name`	Returns the name of a node, as an `xs:string` that is either the zero-length string, or has the lexical form of an `xs:QName`.
`fn:local-name`	Returns the local part of the name of `$node` as an `xs:string` that is either the zero-length string, or has the lexical form of an `xs:NCName`.
`fn:namespace-uri`	Returns the namespace URI part of the name of `$node`, as an `xs:anyURI` value.
`fn:lang`	This function tests whether the language of `$node`, or the context value if the second argument is omitted, as specified by `xml:lang` attributes is the same as, or is a sublanguage of, the language specified by `$language`.
`fn:root`	Returns the root of the tree to which `$node` belongs. This will usually, but not necessarily, be a document nodeThe function can be applied both to XNodes^DM and to JNodes^DM.
`fn:path`	Returns a path expression that can be used to select the supplied node relative to the root of its containing document.
`fn:has-children`	Returns `true` if the supplied node has one or more child nodes (of any kind).
`fn:siblings`	Returns the supplied node together with its siblings, in document order.

2.2.2 fn:local-name

Summary

Returns the local part of the name of $node as an xs:string that is either the zero-length string, or has the lexical form of an xs:NCName.

Signature

`fn:local-name`(
`$node`	`as` `node()?`	`:=` `.`
) `as` `xs:string`

Properties

The zero-argument form of this function is deterministic, context-dependent, and focus-dependent.

The one-argument form of this function is deterministic, context-independent, and focus-independent.

Rules

If the argument is omitted, it defaults to the context value (.).

If the argument is supplied and is the empty sequence, the function returns the zero-length string.

If the node identified by $node has no name (that is, if it is a document node, a comment, a text node, or a namespace node having no name), the function returns the zero-length string.

Otherwise, the function returns the local part of the expanded-QName of the node identified by $node, as determined by the dm:node-name accessor defined in Section 6.7.107.5.10 node-name Accessor^DM. This will be an xs:string whose lexical form is an xs:NCName.

Error Conditions

The following errors may be raised when $node is omitted:

If the context value is absent^DM, type error [err:XPDY0002]^XP
If the context value is not a single node, type error [err:XPTY0004]^XP.

Examples

Variables
let $e := <doc> <p id="alpha" xml:id="beta">One</p> <p id="gamma" xmlns="http://example.com/ns">Two</p> <ex:p id="delta" xmlns:ex="http://example.com/ns">Three</ex:p> <?pi 3.14159?> </doc>

Expression	Result
`local-name($e//*[@id = 'alpha'])`	"p"
`local-name($e//*[@id = 'gamma'])`	"p"
`local-name($e//*[@id = 'delta'])`	"p"
`local-name($e//processing-instruction())`	"pi"
`local-name($e//*[@id = 'alpha']/text())`	""
`local-name($e//*[@id = 'alpha']/@id)`	"id"
`local-name($e//*[@id = 'alpha']/@xml:id)`	"id"

2.2.3 fn:namespace-uri

Summary

Returns the namespace URI part of the name of $node, as an xs:anyURI value.

Signature

`fn:namespace-uri`(
`$node`	`as` `node()?`	`:=` `.`
) `as` `xs:anyURI`

Properties

The zero-argument form of this function is deterministic, context-dependent, and focus-dependent.

The one-argument form of this function is deterministic, context-independent, and focus-independent.

Rules

If the argument is omitted, it defaults to the context node (.).

If the node identified by $node is neither an element nor an attribute node, or if it is an element or attribute node whose expanded-QName (as determined by the dm:node-name accessor in the Section 6.7.107.5.10 node-name Accessor^DM) is in no namespace, then the function returns the zero-length xs:anyURI value.

Otherwise, the result will be the namespace URI part of the expanded-QName of the node identified by $node, as determined by the dm:node-name accessor defined in Section 6.7.107.5.10 node-name Accessor^DM), returned as an xs:anyURI value.

Error Conditions

The following errors may be raised when $node is omitted:

If the context value is absent^DM, type error [err:XPDY0002]^XP
If the context value is not an instance of the sequence type node()?, type error [err:XPTY0004]^XP.

Examples

Variables
let $e := <doc> <p id="alpha" xml:id="beta">One</p> <p id="gamma" xmlns="http://example.com/ns">Two</p> <ex:p id="delta" xmlns:ex="http://example.com/ns">Three</ex:p> <?pi 3.14159?> </doc>

Expression	Result
`namespace-uri($e//*[@id = 'alpha'])`	""
`namespace-uri($e//*[@id = 'gamma'])`	"http://example.com/ns"
`namespace-uri($e//*[@id = 'delta'])`	"http://example.com/ns"
`namespace-uri($e//processing-instruction())`	""
`namespace-uri($e//*[@id = 'alpha']/text())`	""
`namespace-uri($e//*[@id = 'alpha']/@id)`	""
`namespace-uri($e//*[@id = 'alpha']/@xml:id)`	"http://www.w3.org/XML/1998/namespace"

2.2.5 fn:root

Summary

Returns the root of the tree to which $node belongs. This will usually, but not necessarily, be a document nodeThe function can be applied both to XNodes^DM and to JNodes^DM.

Signature

`fn:root`(
`$node`	`as` `node()?as` `GNode()?as` `nodeGNode()?`	`:=` `.`
) `as` `node()?`) `as` `GNode()?`) `as` `nodeGNode()?`

Properties

The zero-argument form of this function is deterministic, context-dependent, and focus-dependent.

The one-argument form of this function is deterministic, context-independent, and focus-independent.

Rules

If the function is called without an argument, the context value (.) is used as the default argument.

TheIf the (explicit or implicit) argument is a XNode^DM, the function returns the value of the expression ($arg/ancestor-or-self::node())[1[last()].

If the (explicit or implicit) argument is a JNode^DM, the function returns the value of the expression $arg?ancestor-or-self::*[last()].

Error Conditions

The following errors may be raised when $node is omitted:

If the context value is absent^DM, type error [err:XPDY0002]^XP
If the context value is not an instance of the sequence type nodeGNode()?, type error [err:XPTY0004]^XP.

Examples

These examples use some variables which could be defined in [XQuery 4.0: An XML Query Language] as:
let $i := <tool>wrench</tool> let $o := <order>{ $i }<quantity>5</quantity></order> let $odoc := document { $o } let $newi := $o/tool
Or they could be defined in [XSL Transformations (XSLT) Version 4.0] as:
<xsl:variable name="i" as="element()"> <tool>wrench</tool> </xsl:variable> <xsl:variable name="o" as="element()"> <order> <xsl:copy-of select="$i"/> <quantity>5</quantity> </order> </xsl:variable> <xsl:variable name="odoc"> <xsl:copy-of select="$o"/> </xsl:variable> <xsl:variable name="newi" select="$o/tool"/>
`root($i)` returns the element node `$i`
`root($o/quantity)` returns the element node `$o`
`root($odoc//quantity)` returns the document node `$odoc`
`root($newi)` returns the element node `$o`
The final three examples could be made type-safe by wrapping their operands with `exactly-one()`.

2.3 Functions on sequences of nodes

This section specifies functions on sequences of nodes.

Function	Meaning
`fn:distinct-ordered-nodes`	Removes duplicate nodesGNodes and sorts the input into document order.
`fn:innermost`	Returns every node within the input sequence that is not an ancestor of another member of the input sequence; the nodes are returned in document order with duplicates eliminated.
`fn:outermost`	Returns every node within the input sequence that has no ancestor that is itself a member of the input sequence; the nodes are returned in document order with duplicates eliminated.

2.3.1 fn:distinct-ordered-nodes

Changes in 4.0 ⬇ ⬆

New in 4.0 [Issue 1189 PR 1191 14 May 2024]

Summary

Removes duplicate nodesGNodes and sorts the input into document order.

Signature

`fn:distinct-ordered-nodes`(
`$nodes`	`as` `node()as` `GNode()as` `nodeGNode()*`
) `as` `node()`) `as` `GNode()`) `as` `nodeGNode()*`

Properties

This function is deterministic, context-independent, and focus-independent.

Rules

Any duplicate nodesGNodes (that is, XNodes or JNodes) in the input (based on node identity) are discarded. The remaining nodesGNodes are returned in document order^XP.

Notes

Document order is implementation-dependent (but stable) for nodesGNodes in different documentstrees. If some node in documentGNode in tree A precedes some node in documentGNode in tree B, then every nodeGNode in A precedes every nodeGNode in B.

Examples

Expression:	let $x := parse-xml('<doc><a/><b/><c/><d/><c/><e/></doc>') return distinct-ordered-nodes(($x//c, $x//b, $x//a, $x//b)) ! name()
Result:	"a", "b", "c", "c" (The two `$x//b` expressions select the same node; one of these is eliminated as a duplicate. The `$x//c` expression selects two nodes that have distinct identity, so both are retained.)
Expression:	let $x := {"a":{"a":{"a":1}}} return distinct-ordered-nodes( $x ? descendant::a ? descendant::a) => count()
Expression:	let $x := {"a":{"a":{"a":1}}} return distinct-ordered-nodes( $x ? descendant::a ? descendant::a) => count()
Result:	3 (The innermost map entry `"a":1` is selected by two different routes; the lookup operator does not eliminate duplicate JNodes.)
Result:	3 (The innermost map entry `"a":1` is selected by two different routes; the lookup operator does not eliminate duplicate JNodes.)

4 Processing numerics

This section specifies arithmetic operators on the numeric datatypes defined in [XML Schema Part 2: Datatypes Second Edition].

4.1 Numeric types

The operators described in this section are defined on the following atomic types.

- decimal
  - integer
- double
- float

Legend:

Supertype
- subtype

Built-in atomic types

They also apply to types derived by restriction from the above types.

The type xs:numeric is defined as a union type whose member types are (in order) xs:double, xs:float, and xs:decimal. This type is implicitly imported into the static context, so it can also be used in defining the signature of user-written functions. Apart from the fact that it is implicitly imported, it behaves exactly like a user-defined type with the same definition. This means, for example:

If the expected type of a function parameter is given as xs:numeric, the actual value supplied can be an instance of any of these three types, or any type derived from these three by restriction (this includes the built-in type xs:integer, which is derived from xs:decimal).
If the expected type of a function parameter is given as xs:numeric, and the actual value supplied is xs:untypedAtomic (or a node whose atomized value is xs:untypedAtomic), then it will be cast to the union type xs:numeric using the rules in 22.3.7 Casting to union types23.3.7 Casting to union types22.3.723.3.7 Casting to union types. Because the lexical space of xs:double subsumes the lexical space of the other member types, and xs:double is listed first, the effect is that if the untyped atomic item is in the lexical space of xs:double, it will be converted to an xs:double, and if not, a dynamic error occurs.
When the return type of a function is given as xs:numeric, the actual value returned will be an instance of one of the three member types (and perhaps also of types derived from these by restriction). The rules for the particular function will specify how the type of the result depends on the values supplied as arguments. In many cases, for the functions in this specification, the result is defined to be the same type as the first argument.

Note:

This specification uses [IEEE 754-2019] arithmetic for xs:float and xs:double values. One consequence of this is that some operations result in the value NaN (not a number), which has the unusual property that it is not equal to itself. Another consequence is that some operations return the value negative zero. This differs from [XML Schema Part 2: Datatypes Second Edition], which defines NaN as being equal to itself and defines only a single zero in the value space. The text accompanying several functions defines behavior for both positive and negative zero inputs and outputs in the interest of alignment with [IEEE 754-2019]. A conformant implementation must respect these semantics. In consequence, the expression -0.0e0 (which is actually a unary minus operator applied to an xs:double value) will always return negative zero: see 4.2.8 op:numeric-unary-minus. As a concession to implementations that rely on implementations of XSD 1.0, however, when casting from string to double the lexical form -0may be converted to positive zero, though negative zero is recommended.

XML Schema 1.1 introduces support for positive and negative zero as distinct values, and also uses the [IEEE 754-2019] semantics for comparisons involving NaN.

4.5 Parsing numbers

It is possible to convert strings to values of type xs:integer, xs:float, xs:decimal, or xs:double using the constructor functions described in 21 Constructor functions22 Constructor functions2122 Constructor functions or using cast expressions as described in 22 Casting23 Casting2223 Casting.

In addition the fn:number function is available to convert strings to values of type xs:double. It differs from the xs:double constructor function in that any value outside the lexical space of the xs:double datatype is converted to the xs:double value NaN.

Function	Meaning
`fn:number`	Returns the value indicated by `$value` or, if `$value` is not specified, the context value after atomization, converted to an `xs:double`.
`fn:parse-integer`	Converts a string to an integer, recognizing any radix in the range 2 to 36.

4.5.1 fn:number

Summary

Returns the value indicated by $value or, if $value is not specified, the context value after atomization, converted to an xs:double.

Signature

`fn:number`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:double`

Properties

The zero-argument form of this function is deterministic, context-dependent, and focus-dependent.

The one-argument form of this function is deterministic, context-independent, and focus-independent.

Rules

Calling the zero-argument version of the function is defined to give the same result as calling the single-argument version with the context value (.). That is, fn:number() is equivalent to fn:number(.), as defined by the rules that follow.

If $value is the empty sequence or if $value cannot be converted to an xs:double, the xs:double value NaN is returned.

Otherwise, $value is converted to an xs:double following the rules of 22.1.3.2 Casting to xs:double23.1.3.2 Casting to xs:double22.1.3.223.1.3.2 Casting to xs:double. If the conversion to xs:double fails, the xs:double value NaN is returned.

Error Conditions

A type error is raised [err:XPDY0002]^XP if $value is omitted and the context value is absent^DM.

As a consequence of the rules given above, a type error is raised [err:XPTY0004]^XP if the context value cannot be atomized, or if the result of atomizing the context value is a sequence containing more than one atomic item.

Notes

XSD 1.1 allows the string +INF as a representation of positive infinity; XSD 1.0 does not. It is implementation-defined whether XSD 1.1 is supported.

Generally fn:number returns NaN rather than raising a dynamic error if the argument cannot be converted to xs:double. However, a type error is raised in the usual way if the supplied argument cannot be atomized or if the result of atomization does not match the required argument type.

Examples

Variables
let $e := <e price="12.1" discount="NONE"/>

Expression	Result
`number(12)`	1.2e1
`number('12')`	1.2e1
`number('INF')`	xs:double('INF')
`number('NaN')`	xs:double('NaN')
`number('non-numeric')`	xs:double('NaN')
`number($e/@price)`	1.21e1
`number($e/@discount)`	xs:double('NaN')
`number($e/@misspelt)`	xs:double('NaN')
`("10", "11", "12") ! number()`	1.0e1, 1.1e1, 1.2e1

8 Processing booleans

This section defines functions and operators on the xs:boolean datatype.

8.3 Functions on Boolean values

The following functions are defined on boolean values:

Function	Meaning
`fn:boolean`	Computes the effective boolean value of the sequence `$input`.
`fn:not`	Returns `true` if the effective boolean value of `$input` is `false`, or `false` if it is `true`.

8.3.1 fn:boolean

Summary

Computes the effective boolean value of the sequence $input.

Signature

`fn:boolean`(
`$input`	`as` `item()*`
) `as` `xs:boolean`

Rules

The function computes the effective boolean value of a sequence, defined according to the following rules. See also Section 2.5.4 Effective Boolean Value^XP.

If $input is the empty sequence, fn:boolean returns false.
If $input is a sequence whose first item is a GNode^DMnode (a generalized node), fn:boolean returns true.
If $input is a singleton value of type xs:boolean or of a type derived from xs:boolean, fn:boolean returns $input.
If $input is a singleton value of type xs:untypedAtomic, xs:string, xs:anyURI, or a type derived from xs:string or xs:anyURI, fn:boolean returns false if the operand value has zero length; otherwise it returns true.
If $input is a singleton value of any numeric type or a type derived from a numeric type, fn:boolean returns false if the operand value is NaN or is numerically equal to zero; otherwise it returns true.

Error Conditions

In all cases other than those listed above, fn:boolean raises a type error [err:FORG0006].

Notes

The result of this function is not necessarily the same as $input cast as xs:boolean. For example, fn:boolean("false") returns the value true whereas "false" cast as xs:boolean (which can also be written xs:boolean("false")) returns false.

Examples

Variables
let $abc := ("a", "b", "")

Expression	Result
`boolean($abc[1])`	true()
`boolean($abc[0])`	false()
`boolean($abc[3])`	false()
`fn:boolean($abc)` raises a type error [err:FORG0006].
`fn:boolean([])` raises a type error [err:FORG0006].

14 Processing sequences

A sequence is an ordered collection of zero or more items. An item is a node, an atomic item, or a function, such as a map or an array. The terms sequence and item are defined formally in [XQuery 4.0: An XML Query Language] and [XML Path Language (XPath) 4.0].

14.2 Comparison functions

The functions in this section perform comparisons between the items in one or more sequences.

Function	Meaning
`fn:atomic-equal`	Determines whether two atomic items are equal, under the rules used for comparing keys in a map.
`fn:deep-equal`	This function assesses whether two sequences are deep-equal to each other. To be deep-equal, they must contain items that are pairwise deep-equal; and for two items to be deep-equal, they must either be atomic items that compare equal, or nodes of the same kind, with the same name, whose children are deep-equal, or maps with matching entries, or arrays with matching members.
`fn:compare`	Returns `-1`, `0`, or `1`, depending on whether the first value is less than, equal to, or greater than the second value.
`fn:distinct-values`	Returns the values that appear in a sequence, with duplicates eliminated.
`fn:duplicate-values`	Returns the values that appear in a sequence more than once.
`fn:index-of`	Returns a sequence of positive integers giving the positions within the sequence `$input` of items that are equal to `$target`.
`fn:starts-with-subsequence`	Determines whether one sequence starts with another, using a supplied callback function to compare items.
`fn:ends-with-subsequence`	Determines whether one sequence ends with another, using a supplied callback function to compare items.
`fn:contains-subsequence`	Determines whether one sequence contains another as a contiguous subsequence, using a supplied callback function to compare items.

14.2.2 fn:deep-equal

Changes in 4.0 ⬇ ⬆

When comments and processing instructions are ignored, any text nodes either side of the comment or processing instruction are now merged prior to comparison. [Issue 930 PR 933 16 January 2024]
The $options parameter has been added, absorbing the $collation parameter. [Issues 934 1167 PR 1191 21 May 2024]
A callback function can be supplied for comparing individual items. [Issues 99 1142 PRs 1120 1150 9 April 2024]

Summary

This function assesses whether two sequences are deep-equal to each other. To be deep-equal, they must contain items that are pairwise deep-equal; and for two items to be deep-equal, they must either be atomic items that compare equal, or nodes of the same kind, with the same name, whose children are deep-equal, or maps with matching entries, or arrays with matching members.

Signature

`fn:deep-equal`(
`$input1`	`as` `item()*`,
`$input2`	`as` `item()*`,
`$options`	`as` `(xs:string \| map(*))?`	`:=` `{}`
) `as` `xs:boolean`

Properties

The two-argument form of this function is deterministic, context-dependent, and focus-independent. It depends on collations, and implicit timezone.

The three-argument form of this function is deterministic, context-dependent, and focus-independent. It depends on collations, and static base URI, and implicit timezone.

Rules

The $options argument, if present, defines additional parameters controlling how the comparison is done. If it is supplied as a map, then the option parameter conventions apply.

For backwards compatibility reasons, the $options argument can also be set to a string containing a collation name. Supplying a string $S for this argument is equivalent to supplying the map { 'collation': $S }. Omitting the argument, or supplying the empty sequence, is equivalent to supplying an empty map.

If the two sequences ($input1 and $input2) are both empty, the function returns true.

If the two sequences are of different lengths, the function returns false.

If the two sequences are of the same length, the comparison is controlled by the ordered option:

By default, the option is true: The function returns true if and only if every item in the sequence $input1 is deep-equal to the item at the same position in the sequence $input2.
If the option is set to false, the function returns false if and only if every item in the sequence $input1 is deep-equal to an item at some position in the sequence $input2, and vice versa.

The rules for deciding whether two items are deep-equal appear below.

The entries that may appear in the $options map are as follows. The detailed rules for the interpretation of each option appear later.

`record(`
`base-uri?`	`as` `xs:boolean`,
`collation?`	`as` `xs:string`,
`comments?`	`as` `xs:boolean`,
`debug?`	`as` `xs:boolean`,
`id-property?`	`as` `xs:boolean`,
`idrefs-property?`	`as` `xs:boolean`,
`in-scope-namespaces?`	`as` `xs:boolean`,
`items-equal?`	`as` `fn(item(), item()) as xs:boolean?`,
`map-order?`	`as` `xs:boolean`,
`namespace-prefixes?`	`as` `xs:boolean`,
`nilled-property?`	`as` `xs:boolean`,
`normalization-form?`	`as` `xs:string?`,
`ordered?`	`as` `xs:boolean`,
`processing-instructions?`	`as` `xs:boolean`,
`timezones?`	`as` `xs:boolean`,
`type-annotations?`	`as` `xs:boolean`,
`type-variety?`	`as` `xs:boolean`,
`typed-values?`	`as` `xs:boolean`,
`unordered-elements?`	`as` `xs:QName*`,
`whitespace?`	`as` `enum("preserve", "strip", "normalize")`
`)`

Key	Meaning
`base-uri?`	Determines whether the `base-uri` of a node is significant. Type: `xs:boolean` Default: `false()`
`collation?`	Identifies a collation which is used at all levels of recursion when strings are compared (but not when names are compared), according to the rules in 5.3.7 Choosing a collation. If the argument is not supplied, or if it is empty, then the default collation from the dynamic context of the caller is used. Type: `xs:string` Default: `fn:default-collation()`
`comments?`	Determines whether comments are significant. Type: `xs:boolean` Default: `false()`
`debug?`	Requests diagnostics in the case where the function returns `false`. When this option is set and the two inputs are found to be not equal, the implementation should output messages (in an implementation-dependent format and to an implementation-dependent destination) indicating the nature of the differences that were found. Type: `xs:boolean` Default: `false()`
`id-property?`	Determines whether the `id` property of elements and attributes is significant. Type: `xs:boolean` Default: `false()`
`idrefs-property?`	Determines whether the `idrefs` property of elements and attributes is significant. Type: `xs:boolean` Default: `false()`
`in-scope-namespaces?`	Determines whether the in-scope namespaces of elements are significant. Type: `xs:boolean` Default: `false()`
`items-equal?`	A user-supplied function to test whether two items are considered equal. The function can return `true` or `false` to indicate that two items are or are not equal, overriding the normal rules that would apply to those items; or it can return an empty sequence, to indicate that the normal rules should be followed. Note that returning `()` is not equivalent to returning `false`. Type: `fn(item(), item()) as xs:boolean?` Default: `fn:void#0`
`map-order?`	Determines whether the order of entries in maps is significant. Type: `xs:boolean` Default: `false()`
`namespace-prefixes?`	Determines whether namespace prefixes in `xs:QName` values (particularly the names of elements and attributes) are significant. Type: `xs:boolean` Default: `false()`
`nilled-property?`	Determines whether the `nilled` property of elements and attributes is significant. Type: `xs:boolean` Default: `false()`
`normalization-form?`	If present, indicates that text and attributes are converted to the specified Unicode normalization form prior to comparison. The value is as for the corresponding argument of `fn:normalize-unicode`. Type: `xs:string?` Default: `()`
`ordered?`	Controls whether the top-level order of the items of the input sequences is considered. Type: `xs:boolean` Default: `true()`
`processing-instructions?`	Determines whether processing instructions are significant. Type: `xs:boolean` Default: `false()`
`timezones?`	Determines whether timezones in date/time values are significant. Type: `xs:boolean` Default: `false()`
`type-annotations?`	Determines whether type annotations are significant. Type: `xs:boolean` Default: `false()`
`type-variety?`	Determines whether the variety of the type annotation of an element (whether it has complex content or simple content) is significant. Type: `xs:boolean` Default: `true()`
`typed-values?`	Determines whether nodes are compared using their typed values rather than their string values. Type: `xs:boolean` Default: `true()`
`unordered-elements?`	A list of QNames of elements considered to be unordered: that is, their child elements may appear in any order. Type: `xs:QName` Default:* `()`
`whitespace?`	Determines the extent to which whitespace is treated as significant. The value `preserve` retains all whitespace. The value `strip` ignores text nodes consisting entirely of whitespace. The value `normalize` ignores whitespace text nodes in the same way as the `strip` option, and additionally compares text and attribute nodes after normalizing whitespace in accordance with the rules of the `fn:normalize-space` function. The detailed rules, given below, also take into account type annotations and `xml:space` attributes. Type: `enum("preserve", "strip", "normalize")` Default: `preserve`

Note:

As a general rule for boolean options (but not invariably), the value true indicates that the comparison is more strict.

In the following rules, where a recursive call on fn:deep-equal is made, this is assumed to use the same values of $options as the original call.

The rules reference a function equal-strings which compares two xs:string or xs:anyURI values as follows:

If the whitespace option is set to normalize, then each string is processed by calling the fn:normalize-space function.
If the normalization-form option is present, each string is then normalized by calling the fn:normalize-unicode function, supplying the specified normalization form.
The two strings are then compared for equality under the requested collation.

More formally, the equal-strings function is equivalent to the following implementation in XQuery:

declare function equal-strings(
  $string1  as xs:string,
  $string2  as xs:string, 
  $options  as map(*)
) as xs:boolean {
  let $n1 := if ($options?normalization-form)
             then normalize-unicode(?, $options?normalization-form) 
             else identity#1
  let $n2 := if ($options?whitespace = "normalize")
             then normalize-space#1 
             else identity#1               
  return compare($n1($n2($string1)), $n1($n2($string2)), $options?collation) eq 0    
}

The rules for deciding whether two items $i1 and $i2 are deep-equal are as follows.

Labels (see Section 3.3 Labeled Items^DM) are ignored. Specifically, if $i1 or $i2 is a labeled item then it is replaced by its subject.

The two items are nextfirst compared using the function supplied in the items-equal option. If this returns true then the items are deep-equal. If it returns false then the items are not deep-equal. If it returns an empty sequence (which is always the case if the option is not explicitly specified) then the two items are deep-equal if one or more of the following conditions are true:

All of the following conditions are true:
1. $i1 is an atomic item.
2. $i2 is an atomic item.
3. Either the type-annotations option is false, or both atomic items have the same type annotation.
4. One of the following conditions is true:
  1. If both $i1 and $i2 are instances of xs:string, xs:untypedAtomic, or xs:anyURI, equal-strings($i1, $i2, $collation, $options) returns true.
  2. If both $i1 and $i2 are instances of xs:date, xs:time or xs:dateTime, $i1 eq $i2 returns true.
  3. If both $i1 and $i2 are instances of xs:hexBinary or xs:base64Binary, $i1 eq $i2 returns true.
  4. Otherwise, fn:atomic-equal($i1, $i2) returns true.
  Note:
  If $i1 and $i2 are not comparable, that is, if the expression ($i1 eq $i2) would raise an error, then the function returns false; it does not report an error.
5. One of the following conditions is true:
  1. Option namespace-prefixes is false.
  2. Neither $i1 nor $i2 is of type xs:QName or xs:NOTATION.
  3. $i1 and $i2 are qualified names with the same namespace prefix.
6. One of the following conditions is true:
  1. Option timezones is false.
  2. Neither $i1 nor $i2 is of type xs:date, xs:time, xs:dateTime, xs:gYear, xs:gYearMonth, xs:gMonth, xs:gMonthDay, or xs:gDay.
  3. Neither $i1 nor $i2 has a timezone component.
  4. Both $i1 and $i2 have a timezone component and the timezone components are equal.
All of the following conditions are true:
1. $i1 is a map.
2. $i2 is a map.
3. Both maps have the same number of entries.
4. For every entry in the first map, there is an entry in the second map that:
  1. has the same key (note that the collation is not used when comparing keys), and
  2. has the same associated value (compared using the fn:deep-equal function, recursively).
5. Either map-order is false, or the entries in both maps appear in the same order, that is, the Nth key in the first map is the same key as the Nth key in the second map, for all N.
All the following conditions are true:
1. $i1 is an array.
2. $i2 is an array.
3. Both arrays have the same number of members (array:size($i1) eq array:size($i2)).
4. Members in the same position of both arrays are deep-equal to each other: that is, every $p in 1 to array:size($i1) satisfies deep-equal($i1($p), $i2($p), $collation, $options).
All the following conditions are true:
1. $i1 is a function item and is not a map or array.
2. $i2 is a function item and is not a map or array.
3. $i1 and $i2 have the same function identity. The concept of function identity is explained in Section 7.18.1 Function Items^DM.
All the following conditions are true:
1. $i1 is a node.$i1 is a node (specifically, an XNode).$i1 is a node (specifically, an XNode).
2. $i2 is a node.$i2 is a node (specifically, an XNode).$i2 is a node (specifically, an XNode).
3. Both nodes have the same node kind.
4. Either the base-uri option is false, or both nodes have the same value for their base URI property, or both nodes have an absent base URI.
5. Let significant-children($parent) be the sequence of nodes obtained by applying the following steps to the children of $parent, in turn:
  1. Comment nodes are discarded if the option comments is false.
  2. Processing instruction nodes are discarded if the option processing-instructions is false.
  3. Adjacent text nodes are merged.
  4. Whitespace-only text nodes are discarded if both the following conditions are true:
    1. The option whitespace is set to strip or normalize; and
    2. The text node is not within the scope of an element that has the attribute xml:space="preserve".
    Note:
    Whitespace text nodes will already have been discarded if $parent is a schema-validated element node whose type annotation is a complex type with an element-only or empty content model.
6. One of the following conditions is true.
  1. Both nodes are document nodes, and the sequence significant-children($i1) is deep-equal to the sequence significant-children($i2).
  2. Both nodes are element nodes, and all the following conditions are true:
    1. The two nodes have the same name, that is (node-name($i1) eq node-name($i2)).
    2. Either the option namespace-prefixes is false, or both element names have the same prefix.
    3. Either the option in-scope-namespaces is false, or both element nodes have the same in-scope namespace bindings.
    4. Either the option type-annotations is false, or both element nodes have the same type annotation.
    5. Either the option id-property is false, or both element nodes have the same value for their is-id property.
    6. Either the option idrefs-property is false, or both element nodes have the same value for their is-idrefs property.
    7. Either the option nilled-property is false, or both element nodes have the same value for their nilled property.
    8. One of the following conditions is true:
      1. The option type-variety is false.
      2. Both nodes are annotated as having simple content. For this purpose simple content means either a simple type or a complex type with simple content.
      3. Both nodes are annotated as having complex content. For this purpose complex content means a complex type whose variety is mixed, element-only, or empty.
      Note:
      It is a consequence of this rule that, by default, validating a document D against a schema will usually (but not necessarily) result in a document that is not deep-equal to D. The exception is when the schema allows all elements to have mixed content.
    9. The two nodes have the same number of attributes, and for every attribute $a1 in $i1/@* there exists an attribute $a2 in $i2/@* such that node-name($a1) eq node-name($a2) and $a1 and $a2 are deep-equal.
      Note:
      Attributes, like other items, may be compared using the supplied items-equal function. However, this function will not be called to compare two attribute nodes unless they have the same name.
    10. One of the following conditions holds:
      1. Both element nodes are annotated as having simple content (as defined above), the typed-values option is true, and the typed value of $i1 is deep-equal to the typed value of $i2.
        Note:
        The typed value of an element node is used only when the element has simple content, which means that no error can occur as a result of atomizing a node with no typed value.
      2. Both element nodes are annotated as having simple content (as defined above), the typed-values option is false, and the equal-strings function returns true when applied to the string value of $i1 and the string value of $i2.
      3. Both element nodes have a type annotation that is a complex type with element-only, mixed, or empty content, the (common) element name is not present in the unordered-elements option, and the sequence significant-children($i1) is deep-equal to the sequence significant-children($i2).
      4. Both element nodes have a type annotation that is a complex type with element-only, mixed, or empty content, the (common) element name is present in the unordered-elements option, and the sequence significant-children($i1) is deep-equal to some permutation of the sequence significant-children($i2).
        Note:
        Elements annotated as xs:untyped fall into this category.
        Including an element name in the unordered-elements list is unlikely to be useful except when the relevant elements have element-only content, but this is not a requirement: the rules apply equally to elements with mixed content, or even (trivially) to elements with empty content.
  3. Both nodes are attribute nodes, and all the following conditions are true:
    1. The two attribute nodes have the same name, that is (node-name($i1) eq node-name($i2)).
    2. Either the option namespace-prefixes is false, or both attribute names have the same prefix.
    3. Either the option type-annotations is false, or both attribute nodes have the same type annotation.
    4. Either the option id-property is false, or both attribute nodes have the same value for their is-id property.
    5. Either the option idrefs-property is false, or both attribute nodes have the same value for their is-idrefs property.
    6. Let T be true if the option typed-value is true and both attributes $i1 and $i2 have a type annotation other than xs:untypedAtomic.
      Then either T is true and the typed value of $i1 is deep-equal to the typed value of $i2, or T is false and the equal-strings function returns true when applied to the string value of $i1 and the string value of $i2.
  4. Both nodes are processing instruction nodes, and all the following conditions are true:
    1. The two nodes have the same name, that is (node-name($i1) eq node-name($i2)).
    2. The equal-strings function returns true when applied to the string value of $i1 and the string value of $i2.
  5. Both nodes are namespace nodes, and all the following conditions are true:
    1. The two nodes either have the same name or are both nameless, that is fn:deep-equal(node-name($i1), node-name($i2)).
    2. The string value of $i1 is equal to the string value of $i2 when compared using the Unicode codepoint collation.
    Note:
    Namespace nodes are not considered directly unless they appear in the top-level sequences passed explicitly to the fn:deep-equal function.
  6. Both nodes are comment nodes, and the equal-strings function returns true when applied to their string values.
  7. Both nodes are text nodes, and the equal-strings function returns true when applied to their string values.
All the following conditions are true:
1. $i1 is a JNode.
2. $i2 is a JNode.
3. The ¶value property of $i1 is deep-equal to the ¶value property of $i2.
  Note:
  The other properties of the two JNodes, such as ¶parent and ¶selector, are ignored. As with XNodes, deep equality considers only the subtree rooted at the node, and not its position within a containing tree.
All the following conditions are true:
1. $i1 is a JNode.
2. $i2 is a JNode.
3. The ¶value property of $i1 is deep-equal to the ¶value property of $i2.
  Note:
  The other properties of the two JNodes, such as ¶parent and ¶selector, are ignored. As with XNodes, deep equality considers only the subtree rooted at the node, and not its position within a containing tree.

In all other cases the result is false.

Error Conditions

A type error is raised [err:XPTY0004]^XP if the value of $options includes an entry whose key is defined in this specification, and whose value is not of the permitted type for that key.

A dynamic error is raised [err:FOJS0005] if the value of $options includes an entry whose key is defined in this specification, and whose value is not a permitted value for that key.

Notes

By default, whitespace in text nodes and attributes is considered significant. There are various ways whitespace differences can be ignored:

If nodes have been schema-validated, setting the typed-values option to true causes the typed values rather than the string values to be compared. This will typically cause whitespace to be ignored except where the type of the value is xs:string.
Setting the whitespace option to normalize causes all text and attribute nodes to have leading and trailing whitespace removed, and intermediate whitespace reduced to a single character.

By default, two nodes are not required to have the same type annotation, and they are not required to have the same in-scope namespaces. They may also differ in their parent, their base URI, and the values returned by the is-id and is-idrefs accessors (see Section 6.7.57.5.5 is-id Accessor^DM and Section 6.7.67.5.6 is-idrefs Accessor^DM). The order of children is significant, but the order of attributes is insignificant.

By default, the contents of comments and processing instructions are significant only if these nodes appear directly as items in the two sequences being compared. The content of a comment or processing instruction that appears as a descendant of an item in one of the sequences being compared does not affect the result. In previous versions of this specification, the presence of a comment or processing instruction, if it caused text to be split across two text nodes, might affect the result; this has been changed in 4.0 so that adjacent text nodes are merged after comments and processing instructions have been stripped.

Comparing items of different kind (for example, comparing an atomic item to a node, or a map to an array, or an integer to an xs:date) returns false, it does not return an error. So the result of fn:deep-equal(1, current-dateTime()) is false.

The items-equal callback function may be used to override the default rules for comparing individual items. For example, it might return true unconditionally when comparing two @timestamp attributes, if there is no expectation that the two trees will have identical timestamps. Given two nodes $n1 and $n2, it might compare them using the is operator, so that instead of comparing the descendants of the two nodes, the function simply checks whether they are the same node. Given two function items $f1 and $f2 it might return true unconditionally, knowing that there is no effective way to test if the functions are equivalent. Given two numeric values, it might return true if they are equal to six decimal places.

It is good practice for the items-equal callback function to be reflexive, symmetric, and transitive; if it is not, then the fn:deep-equal function itself will lack these qualities. Reflexive means that every item (including NaN) should be equal to itself; symmetric means that items-equal(A, B) should return the same result as items-equal(B, A), and transitive means that items-equal(A, B) and items-equal(B, C) should imply items-equal(A, C).

Setting the ordered option to false or supplying the unordered-elements option may result in poor performance when comparing long sequences, especially if the items-equal callback function is supplied.

Examples

Variables
let $at := <attendees> <name last="Parker" first="Peter"/> <name last="Barker" first="Bob"/> <name last="Parker" first="Peter"/> </attendees>

Expression:	`deep-equal($at, $at/*)`
Result:	false()
Expression:	`deep-equal($at/name[1], $at/name[2])`
Result:	false()
Expression:	`deep-equal($at/name[1], $at/name[3])`
Result:	true()
Expression:	`deep-equal($at/name[1], 'Peter Parker')`
Result:	false()
Expression:	deep-equal( $at//name[@first="Bob"], $at//name[@last="Barker"], options := { 'items-equal': op('is') } )
Result:	true() (Tests whether the two input sequences contain exactly the same nodes.)
Expression:	`deep-equal([ 1, 2, 3], [ 1, 2, 3 ])`
Result:	true()
Expression:	`deep-equal((1, 2, 3), [ 1, 2, 3 ])`
Result:	false()
Expression:	deep-equal( { 1: 'a', 2: 'b' }, { 2: 'b', 1: 'a' } )
Result:	true()
Expression:	deep-equal( (1, 2, 3, 4), (1, 4, 3, 2), options := { 'ordered': false() } )
Result:	true()
Expression:	deep-equal( (1, 1, 2, 3), (1, 2, 3, 3), options := { 'ordered': false() } )
Result:	false()
Expression:	deep-equal( parse-xml("<a xmlns='AA'/>"), parse-xml("<p:a xmlns:p='AA'/>") )
Result:	true() (By default, namespace prefixes are ignored).
Expression:	deep-equal( parse-xml("<a xmlns='AA'/>"), parse-xml("<p:a xmlns:p='AA'/>"), options := { 'namespace-prefixes': true() } )
Result:	false() (False because the namespace prefixes differ).
Expression:	deep-equal( parse-xml("<a xmlns='AA'/>"), parse-xml("<p:a xmlns:p='AA'/>"), options := { 'in-scope-namespaces': true() } )
Result:	false() (False because the in-scope namespace bindings differ).
Expression:	deep-equal( parse-xml("<a><b/><c/></a>"), parse-xml("<a><c/><b/></a>") )
Result:	false() (By default, order of elements is significant).
Expression:	deep-equal( parse-xml("<a><b/><c/></a>"), parse-xml("<a><c/><b/></a>"), options := { 'unordered-elements': #a) } )
Result:	true() (The `unordered-elements` option means that the ordering of the children of `a` is ignored.)
Expression:	deep-equal( parse-xml("<para style='bold'><span>x</span></para>"), parse-xml("<para style=' bold'> <span>x</span></para>") )
Result:	false() (By default, both the leading whitespace in the `style` attribute and the whitespace text node preceding the `span` element are significant.)
Expression:	deep-equal( parse-xml("<para style='bold'><span>x</span></para>"), parse-xml("<para style=' bold'> <span>x</span></para>"), options := { 'whitespace': 'normalize' } )
Result:	true() (The `whitespace` option causes both the leading space in the attribute value and the whitespace preceding the `span` element to be ignored.)
Expression:	deep-equal( (1, 2, 3), (1.0007, 1.9998, 3.0005), options := { 'items-equal': fn($x, $y) { if (($x, $y) instance of xs:numeric+) { abs($x - $y) lt 0.001 } } } )
Result:	true() (For numeric values, the callback function tests whether they are approximately equal. For any other items, it returns an empty sequence, so the normal comparison rules apply.)
Expression:	deep-equal( (1, 2, 3, 4, 5), (1, 2, 3, 8, 5), options := { 'items-equal': fn($x, $y) { trace((), `comparing { $x } and { $y }`) } } )
Result:	false() (The callback function traces which items are being compared, without changing the result of the comparison.)

14.5 Functions on node identifiers

This section defines a number of functions used to find elements by ID or IDREF value, or to generate identifiers.

Function	Meaning
`fn:id`	Returns the sequence of element nodes that have an `ID` value matching the value of one or more of the `IDREF` values supplied in `$values`.
`fn:element-with-id`	Returns the sequence of element nodes that have an `ID` value matching the value of one or more of the `IDREF` values supplied in `$values`.
`fn:idref`	Returns the sequence of element or attribute nodes with an `IDREF` value matching the value of one or more of the `ID` values supplied in `$values`.
`fn:generate-id`	This function returns a string that uniquely identifies a given nodeGNode.

14.5.1 fn:id

Summary

Returns the sequence of element nodes that have an ID value matching the value of one or more of the IDREF values supplied in $values.

Signature

`fn:id`(
`$values`	`as` `xs:string*`,
`$node`	`as` `node()`	`:=` `.`
) `as` `element()*`

Properties

The one-argument form of this function is deterministic, context-dependent, and focus-dependent.

The two-argument form of this function is deterministic, context-independent, and focus-independent.

Rules

The function returns a sequence, in document order with duplicates eliminated, containing every element node E that satisfies all the following conditions:

E is in the target document. The target document is the document containing $node, or the document containing the context value (.) if the second argument is omitted. The behavior of the function if $node is omitted is exactly the same as if the context value had been passed as $node.
E has an ID value equal to one of the candidate IDREF values, where:
- An element has an ID value equal to V if either or both of the following conditions are true:
  - The is-id property (See Section 6.7.57.5.5 is-id Accessor^DM.) of the element node is true, and the typed value of the element node is equal to V under the rules of the eq operator using the Unicode codepoint collation (http://www.w3.org/2005/xpath-functions/collation/codepoint).
  - The element has an attribute node whose is-id property (See Section 6.7.57.5.5 is-id Accessor^DM.) is true and whose typed value is equal to V under the rules of the eq operator using the Unicode code point collation (http://www.w3.org/2005/xpath-functions/collation/codepoint).
- Each xs:string in $values is parsed as if it were of type IDREFS, that is, each xs:string in $values is treated as a whitespace-separated sequence of tokens, each acting as an IDREF. These tokens are then included in the list of candidate IDREFs. If any of the tokens is not a lexically valid IDREF (that is, if it is not lexically an xs:NCName), it is ignored. Formally, the candidate IDREF values are the strings in the sequence given by the expression:
```
for $s in $values
return tokenize(normalize-space($s), ' ')[. castable as xs:IDREF]
```
If several elements have the same ID value, then E is the one that is first in document order.

Error Conditions

A dynamic error is raised [err:FODC0001] if $node, or the context value if the second argument is absent, is a node in a tree whose root is not a document node.

The following errors may be raised when $node is omitted:

If the context value is absent^DM, type error [err:XPDY0002]^XP
If the context value is not a single node, type error [err:XPTY0004]^XP.

Notes

The effect of this function is anomalous in respect of element nodes with the is-id property. For legacy reasons, this function returns the element that has the is-id property, whereas it would be more appropriate to return its parent, that being the element that is uniquely identified by the ID. A new function fn:element-with-id has been introduced with the desired behavior.

If the data model is constructed from an Infoset, an attribute will have the is-id property if the corresponding attribute in the Infoset had an attribute type of ID: typically this means the attribute was declared as an ID in a DTD.

If the data model is constructed from a PSVI, an element or attribute will have the is-id property if its typed value is a single atomic item of type xs:ID or a type derived by restriction from xs:ID.

No error is raised in respect of a candidate IDREF value that does not match the ID of any element in the document. If no candidate IDREF value matches the ID value of any element, the function returns the empty sequence.

It is not necessary that the supplied argument should have type xs:IDREF or xs:IDREFS, or that it should be derived from a node with the is-idrefs property.

An element may have more than one ID value. This can occur with synthetic data models or with data models constructed from a PSVI where the element and one of its attributes are both typed as xs:ID.

If the source document is well-formed but not valid, it is possible for two or more elements to have the same ID value. In this situation, the function will select the first such element.

It is also possible in a well-formed but invalid document to have an element or attribute that has the is-id property but whose value does not conform to the lexical rules for the xs:ID type. Such a node will never be selected by this function.

Examples

Variables
let $emp := validate lax { document { <employee xml:id="ID21256" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xs="http://www.w3.org/2001/XMLSchema"> <empnr xsi:type="xs:ID">E21256</empnr> <first>John</first> <last>Brown</last> </employee> } }

Variables

let $emp := validate lax {
  document {
    <employee xml:id="ID21256"
              xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"  
              xmlns:xs="http://www.w3.org/2001/XMLSchema">
      <empnr xsi:type="xs:ID">E21256</empnr>
      <first>John</first>
      <last>Brown</last>
    </employee>
  }
}

Expression	Result
$emp/id('ID21256')/name()	"employee" (The `xml:id` attribute has the `is-id` property, so the employee element is selected.)
$emp/id('E21256')/name()	"empnr" (Assuming the `empnr` element is given the type `xs:ID` as a result of schema validation, the element will have the `is-id` property and is therefore selected. Note the difference from the behavior of `fn:element-with-id`.)

14.5.2 fn:element-with-id

Summary

Returns the sequence of element nodes that have an ID value matching the value of one or more of the IDREF values supplied in $values.

Signature

`fn:element-with-id`(
`$values`	`as` `xs:string*`,
`$node`	`as` `node()`	`:=` `.`
) `as` `element()*`

Properties

The one-argument form of this function is deterministic, context-dependent, and focus-dependent.

The two-argument form of this function is deterministic, context-independent, and focus-independent.

Rules

Note:

The effect of this function is identical to fn:id in respect of elements that have an attribute with the is-id property. However, it behaves differently in respect of element nodes with the is-id property. Whereas the fn:id function, for legacy reasons, returns the element that has the is-id property, this function returns the element identified by the ID, which is the parent of the element having the is-id property.

The function returns a sequence, in document order with duplicates eliminated, containing every element node E that satisfies all the following conditions:

E is in the target document. The target document is the document containing $node, or the document containing the context value (.) if the second argument is omitted. The behavior of the function if $node is omitted is exactly the same as if the context value had been passed as $node.
E has an ID value equal to one of the candidate IDREF values, where:
- An element has an ID value equal to V if either or both of the following conditions are true:
  - The element has an child element node whose is-id property (See Section 6.7.57.5.5 is-id Accessor^DM.) is true and whose typed value is equal to V under the rules of the eq operator using the Unicode code point collation (http://www.w3.org/2005/xpath-functions/collation/codepoint).
  - The element has an attribute node whose is-id property (See Section 6.7.57.5.5 is-id Accessor^DM.) is true and whose typed value is equal to V under the rules of the eq operator using the Unicode code point collation (http://www.w3.org/2005/xpath-functions/collation/codepoint).
- Each xs:string in $values is parsed as if it were of type IDREFS, that is, each xs:string in $values is treated as a whitespace-separated sequence of tokens, each acting as an IDREF. These tokens are then included in the list of candidate IDREFs. If any of the tokens is not a lexically valid IDREF (that is, if it is not lexically an xs:NCName), it is ignored. Formally, the candidate IDREF values are the strings in the sequence given by the expression:
```
for $s in $arg
return tokenize(normalize-space($s), ' ')[. castable as xs:IDREF]
```
If several elements have the same ID value, then E is the one that is first in document order.

Error Conditions

A dynamic error is raised [err:FODC0001] if $node, or the context value if the second argument is omitted, is a node in a tree whose root is not a document node.

The following errors may be raised when $node is omitted:

If the context value is absent^DM, type error [err:XPDY0002]^XP
If the context value is not a single node, type error [err:XPTY0004]^XP.

Notes

This function is equivalent to the fn:id function except when dealing with ID-valued element nodes. Whereas the fn:id function selects the element containing the identifier, this function selects its parent.

It is not necessary that the supplied argument should have type xs:IDREF or xs:IDREFS, or that it should be derived from a node with the is-idrefs property.

If the source document is well-formed but not valid, it is possible for two or more elements to have the same ID value. In this situation, the function will select the first such element.

Examples

Variables
let $emp := validate lax { document { <employee xml:id="ID21256" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xs="http://www.w3.org/2001/XMLSchema"> <empnr xsi:type="xs:ID">E21256</empnr> <first>John</first> <last>Brown</last> </employee> } }

Variables

let $emp := validate lax {    
  document {
    <employee xml:id="ID21256"
              xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"  
              xmlns:xs="http://www.w3.org/2001/XMLSchema">
      <empnr xsi:type="xs:ID">E21256</empnr>
      <first>John</first>
      <last>Brown</last>
    </employee>
  }
}

Expression:	$emp/element-with-id('ID21256')/name()
Result:	"employee" (The `xml:id` attribute has the `is-id` property, so the employee element is selected.)
Expression:	`$emp/element-with-id('E21256')/name()`
Result:	"employee" (Assuming the `empnr` element is given the type `xs:ID` as a result of schema validation, the element will have the `is-id` property and is therefore its parent is selected. Note the difference from the behavior of `fn:id`.)

14.5.3 fn:idref

Summary

Returns the sequence of element or attribute nodes with an IDREF value matching the value of one or more of the ID values supplied in $values.

Signature

`fn:idref`(
`$values`	`as` `xs:string*`,
`$node`	`as` `node()`	`:=` `.`
) `as` `node()*`

Properties

The one-argument form of this function is deterministic, context-dependent, and focus-dependent.

The two-argument form of this function is deterministic, context-independent, and focus-independent.

Rules

The function returns a sequence, in document order with duplicates eliminated, containing every element or attribute node $N that satisfies all the following conditions:

$N is in the target document. The target document is the document containing $node, or the document containing the context value (.) if the second argument is omitted. The behavior of the function if $node is omitted is exactly the same as if the context value had been passed as $node.
$N has an IDREF value equal to one of the candidate ID values, where:
- A node $N has an IDREF value equal to V if both of the following conditions are true:
  - The is-idrefs property (see Section 6.7.67.5.6 is-idrefs Accessor^DM) of $N is true.
  - The sequence
```
tokenize(normalize-space(string($N)), ' ')
```
    contains a string that is equal to V under the rules of the eq operator using the Unicode code point collation (http://www.w3.org/2005/xpath-functions/collation/codepoint).
- Each xs:string in $values is parsed as if it were of lexically of type xs:ID. These xs:strings are then included in the list of candidate xs:IDs. If any of the strings in $values is not a lexically valid xs:ID (that is, if it is not lexically an xs:NCName), it is ignored. More formally, the candidate ID values are the strings in the sequence:
```
$values[. castable as xs:NCName]
```

Error Conditions

A dynamic error is raised [err:FODC0001] if $node, or the context value if the second argument is omitted, is a node in a tree whose root is not a document node.

The following errors may be raised when $node is omitted:

If the context value is absent^DM, type error [err:XPDY0002]^XP
If the context value is not a single node, type error [err:XPTY0004]^XP.

Notes

An element or attribute typically acquires the is-idrefs property by being validated against the schema type xs:IDREF or xs:IDREFS, or (for attributes only) by being described as of type IDREF or IDREFS in a DTD.

Because the function is sensitive to the way in which the data model is constructed, calls on this function are not always interoperable.

No error is raised in respect of a candidate ID value that does not match the IDREF value of any element or attribute in the document. If no candidate ID value matches the IDREF value of any element or attribute, the function returns the empty sequence.

It is possible for two or more nodes to have an IDREF value that matches a given candidate ID value. In this situation, the function will return all such nodes. However, each matching node will be returned at most once, regardless how many candidate ID values it matches.

It is possible in a well-formed but invalid document to have a node whose is-idrefs property is true but that does not conform to the lexical rules for the xs:IDREF type. The effect of the above rules is that ill-formed candidate ID values and ill-formed IDREF values are ignored.

If the data model is constructed from a PSVI, the typed value of a node that has the is-idrefs property will contain at least one atomic item of type xs:IDREF (or a type derived by restriction from xs:IDREF). It may also contain atomic items of other types. These atomic items are treated as candidate ID values if two conditions are met: their lexical form must be valid as an xs:NCName, and there must be at least one instance of xs:IDREF in the typed value of the node. If these conditions are not satisfied, such values are ignored.

Examples

Variables
let $emp := validate lax { document { <employees xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xs="http://www.w3.org/2001/XMLSchema"> <employee xml:id="ID21256"> <empnr xsi:type="xs:ID">E21256</empnr> <first>Anil</first> <last>Singh</last> <deputy xsi:type="xs:IDREF">E30561</deputy> </employee> <employee xml:id="ID30561"> <empnr xsi:type="xs:ID">E30561</empnr> <first>John</first> <last>Brown</last> <manager xsi:type="xs:IDREF">ID21256</manager> </employee> </employees> } }

Variables

let $emp := validate lax {  
  document {    
    <employees xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"  
               xmlns:xs="http://www.w3.org/2001/XMLSchema">  
      <employee xml:id="ID21256">
        <empnr xsi:type="xs:ID">E21256</empnr>
        <first>Anil</first>
        <last>Singh</last>
        <deputy xsi:type="xs:IDREF">E30561</deputy>
      </employee>
      <employee xml:id="ID30561">
        <empnr xsi:type="xs:ID">E30561</empnr>
        <first>John</first>
        <last>Brown</last>
        <manager xsi:type="xs:IDREF">ID21256</manager>
      </employee>
    </employees>
  }
}

Expression:	$emp/( element-with-id('ID21256')/@xml:id => idref() )/ancestor::employee/last => string()
Result:	"Brown" (Assuming that `manager` has the is-idref property, the call on `fn:idref` selects the `manager` element. If, instead, the `manager` had a `ref` attribute with the is-idref property, the call on `fn:idref` would select the attribute node.)
Expression:	$emp/( element-with-id('E30561')/empnr => idref() )/ancestor::employee/last => string()
Result:	"Singh" (Assuming that `employee/deputy` has the is-idref property, the call on `fn:idref` selects the `deputy` element.)

14.5.4 fn:generate-id

Summary

This function returns a string that uniquely identifies a given nodeGNode.

Signature

`fn:generate-id`(
`$node`	`as` `node()?as` `GNode()?as` `nodeGNode()?`	`:=` `.`
) `as` `xs:string`

Properties

The zero-argument form of this function is deterministic, context-dependent, and focus-dependent.

The one-argument form of this function is deterministic, context-independent, and focus-independent.

Rules

If the argument is omitted, it defaults to the context value (.).

If the argument is the empty sequence, the result is the zero-length string.

In other cases, the function returns a string that uniquely identifies a given node. More formally, it is guaranteed that within a single execution scope, fn:codepoint-equal(fn:generate-id($N), fn:generate-id($M)) returns true if and only if ($M is $N) returns true.

The returned identifier must consist of ASCII alphanumeric characters and must start with an alphabetic character. Thus, the string is syntactically an XML name.

Error Conditions

The following errors may be raised when $node is omitted:

If the context value is absent^DM, type error [err:XPDY0002]^XP
If the context value is not an instance of the sequence type nodeGNode()?, type error [err:XPTY0004]^XP.

Notes

An implementation is free to generate an identifier in any convenient way provided that it always generates the same identifier for the same nodeGNode and that different identifiers are always generated from different nodesGNodes. An implementation is under no obligation to generate the same identifiers each time a document is transformed or queried.

There is no guarantee that a generated unique identifier will be distinct from any unique IDs specified in the source document.

There is no inverse to this function; it is not directly possible to find the nodeGNode with a given generated ID. Of course, it is possible to search a given sequence of nodesGNodes using an expression such as $nodes[generate-id()=$id].

It is advisable, but not required, for implementations to generate IDs that are distinct even when compared using a case-blind collation.

Examples

The primary use case for this function is to generate hyperlinks. For example, when generating HTML, an anchor for a given section `$sect` can be generated by writing (in either XSLT or XQuery):
`<a name="{ generate-id($sect) }"/>`
and a link to that section can then be produced with code such as:
`see <a href="#{ generate-id($sect) }">here</a>`
Note that anchors generated in this way will not necessarily be the same each time a document is republished.
Since the keys in a map must be atomic items, it is possible to use generated IDs as surrogates for nodes when constructing a map. For example, in some implementations, testing whether a node `$N` is a member of a large node-set `$S` using the expression `exists($N intersect $S)` may be expensive; there may then be performance benefits in creating a map:
`let $SMap := map:merge($S ! { generate-id(.) : . })`
and then testing for membership of the node-set using:
`map:contains($SMap, generate-id($N))`

15 Parsing and serializing

These functions convert between the lexical representation and XPath and XQuery data model representation of various file formats.

15.1 Functions on XML Data

These functions convert between the lexical representation of XML and the tree representation.

(The fn:serialize function also handles HTML and JSON output, but is included in this section for editorial convenience.)

Function	Meaning
`fn:parse-xml`	This function takes as input an XML document, and returns the document node at the root of an XDM tree representing the parsed document.
`fn:parse-xml-fragment`	This function takes as input an XML external entity represented as a string, and returns the document node at the root of an XDM tree representing the parsed document fragment.
`fn:serialize`	This function serializes the supplied input sequence `$input` as described in [XSLT and XQuery Serialization 3.1], returning the serialized representation of the sequence as a string.
`fn:xsd-validator`	Given an XSD schema, delivers a function item that can be invoked to validate a document or element node against this schema.

15.1.4 XSD validation

Changes in 4.0 ⬇ ⬆

This description of the XSD validation process was previously found (with some duplication) in the XQuery and XSLT specifications; those specifications now reference this description. As a side-effects, the descriptions of the process in XQuery and XSLT are better aligned. [Issue 2029 PR 2030 28 May 2025]

This section describes a process called XSD validation, which validates a supplied node against a supplied XSD schema. The validation process refers to the process defined in [XML Schema Part 1: Structures Second Edition] or [XSD 1.1 Part 1].

The validation process takes the following inputs:

A schema to be used for validation, called the effective schema.
A boolean indicating whether any xsi:schemaLocation or xsi:noNamespaceSchemaLocation attributes are to be taken into consideration.
A document, element, or attribute node to be validated; this is called the operand node.
A validation mode, which is one of strictlax, or by-type.
Note:
XSLT also allows the value strip, but this does not invoke validation (instead, it invokes stripping of existing type annotations, and re-annotation of nodes as xs:untyped.)
If the validation mode is by-type, then a schema type to be used for validating the operand node. This may be any simple or complex type present in the effective schema: it must not be xs:untyped or xs:untypedAtomic.
Note:
An XQuery ValidateExpr allows the type to be specified as xs:untyped or xs:untypedAtomic, but this does not invoke validation (instead, it invokes stripping of existing type annotations and re-annotation of nodes as untyped.)

The output of the validation process comprises one or more of the following:

A boolean indicating whether the operand node was found to be valid.
If the operand node was found to be valid, a deep copy of the operand node augmented with type annotations corresponding to the types against which they were validated, the copies may also include expanded values for element and attribute defaults defined in the schema.
This creates a new node with its own identity and with no parent.
The base URI property of every node in the resulting XDM tree is the same as the base URI property of the corresponding node in the input tree.
If the operand node was not found to be valid, then optionally, a set of error diagnostics in implementation-defined format.

The operand node must be one of:

An element node
An attribute node
A well-formed document node, that is, a document node having among its children exactly one element node and zero or more comment and processing instruction nodes.

The term validation root is used to refer to the operand node if it is an element or attribute node, or to the single element child of the operand node when the operand node is a document node.

Note that a schema is defined as a collection of schema components (for example, element and attribute declarations, complex and simple type definitions). In some cases the schema that is used is the set of schema components found in the in-scope schema definitions^XP, but this is not the only possibility.

The result of the validation process is defined by the following rules.

The invoking application determines whether the validity assessment process takes account of any xsi:schemaLocation or xsi:noNamespaceSchemaLocation attributes in the tree being validated. If it does so, then it should adhere to the following rules:
1. Any schema loaded using these attributes must be compatible^DM with the existing effective schema.
2. Any schema loaded using these attributes must not override or redefine any schema components in the effective schema.
3. Any schema components loaded using this mechanism must be used for this validity assessment only, and must not affect the outcome of any subsequent validity assessments of other documents.
  Note:
  A processor may choose to cache such schema components but the existence of such a cache should only affect performance, not the validation outcome.
A consequence of validating a document using schema components that are not in the static context is that nodes may be annotated with types that are not in the static context. But the rules for schema compatibility^DM mean that this is not a problem.
If the instance being validated contains any xml:id attributes, such attributes are validated against the type xs:ID, making the containing element eligible as a target for the id function. Uniqueness checking of elements and attributes typed as xs:ID, however, is carried out only if the operand node is a document node.
If the operand node is a document node:
1. The children of the document node must consist of exactly one element node and zero or more comment and processing instruction nodes, in any order.
2. The element node child is validated, as described below.
3. The validation rule “Validation Root Valid (ID/IDREF)” is applied to the single element node child of the document node. This means that validation will fail if there are non-unique ID values or dangling IDREF values in the document tree.
  Note:
  This rule is not applied when the operand node is an element or attribute node.
4. There is no check that the tree contains unparsed entities whose names match the values of nodes of type xs:ENTITY or xs:ENTITIES. This is because it is not possible (either in XSLT or XQuery) to construct a tree containing unparsed entities. It is possible to add unparsed entity declarations to the result document by referencing a suitable DOCTYPE during serialization.
5. All other children of the document node (comments and processing instructions) are copied unchanged, and the results become the children of a new document node, which is returned as the validation result.
If the operand node is an element node, then:
1. For specification purposes, because the XSD specifications require the input document to be expressed as an XML Information Set ([XML Infoset]), the operand node is first converted to an Infoset according to the “Infoset Mapping” rules defined in [XQuery and XPath Data Model (XDM) 4.0]. Note that this process discards any existing type annotations.
  Validity assessment is carried out on the root element information item of the resulting Infoset, using the supplied schema. The process of validation applies recursively to contained elements and attributes to the extent required by the supplied schema.
  Note:
  A practical implementation is unlikely to perform any physical conversion, but the process is defined this way in order to align with the XSD specification.
2. If the validation mode is by-type, then Schema-validity assessment is carried out according to the rules defined in [XML Schema Part 1: Structures Second Edition] or [XSD 1.1 Part 1] Part 1, section 3.3.4 "Element Declaration Validation Rules", “Validation Rule: Schema-Validity Assessment (Element)”, clauses 1.2 and 2, using this type definition as the “processor-stipulated type definition” for validation.
3. If validation mode is strict, then strict validation is carried out as described in [XML Schema Part 1: Structures Second Edition] Part 1, section 5.2, “Assessing Schema-Validity”, item 2, or its counterpart in XSD 1.1. This means that the root element information item in the Infoset must either:
  1. have a name that matches a top-level element declaration in the effective schema, or
  2. have an xsi:type attribute whose value matches the name of a top-level type definition in the effective schema
  If there is no such element declaration or type definition, the element is assessed as invalid.
4. If validation mode is lax, then schema-validity assessment is carried out in accordance with [XML Schema Part 1: Structures Second Edition] Part 1, section 5.2, “Assessing Schema-Validity”, item 3, or its counterpart in XSD 1.1.
  If validation mode is lax and the root element information item has neither a top-level element declaration nor an xsi:type attribute, XSD 1.0 and XSD 1.1 define the recursive checking of children and attributes as optional. This specification prescribes that this recursive checking is required.
  Note:
  This means, for example, that when an instance document is structured as having an envelope in one namespace wrapping a payload in a different namespaces, and when schema definitions are available for the payload but not for the envelope, lax validation of the envelope may trigger validation of the payload.
5. If the operand node is an element node, the validation rules named “Validation Root Valid (ID/IDREF)” are not applied. This means that document-level constraints relating to uniqueness and referential integrity are not enforced.
6. There is no check that the document contains unparsed entities whose names match the values of nodes of type xs:ENTITY or xs:ENTITIES.
If the operand node is an attribute node, in particular when it is a parentless attribute node, then validation cannot be defined directly in terms of the XSD-defined validation process. Instead, conceptually, a copy of the attribute is first added to an element node that is created for the purpose, and namespace fixup is performed on this element node to ensure that it has an in-scope namespace binding for the prefix and namespace of the attribute name. The name of this element is of no consequence, but it must be the same as the name of a synthesized element declaration of the form:
```
<xs:element name="E">
  <xs:complexType>
    <xs:sequence/>
    <xs:attribute ref="A"/>
  </xs:complexType>
</xs:element>
```
where A is the name of the attribute being validated.
This synthetic element is then validated using the procedure given above for validating elements, and if it is found to be valid, a copy of the validated attribute is made, retaining its type annotation, but detaching it from the containing element (and thus, from any in-scope namespace bindings).
The XDM data model does not permit an attribute node with no parent to have a typed value that includes a namespace-qualified name, that is, a value whose type is derived from xs:QName or xs:NOTATION. This restriction is imposed because these types rely on the in-scope namespaces of a containing element to resolve namespace prefixes. Therefore, a parentless attribute is considered to be invalid against such a type.
The outcome of the validation expression depends on the validity property of the root element information item in the PSVI that results from the XSD validation process.
1. If the validity property of the root element information item is valid, or if validation mode is lax and the validity property of the root element information item is notKnown, the PSVI is converted back into a data model instance as described in [XQuery and XPath Data Model (XDM) 4.0] Section 3.3, “Construction from a PSVI”. The resulting node (a new node of the same kind as the operand node) is returned as the result of the validate expression.
  Otherwise, the operand node is deemed invalid.

Note:

During conversion of the PSVI into an XDM instance after validation, any element information items whose validity property is notKnown are converted into element nodes with type annotationxs:anyType, and any attribute information items whose validity property is notKnown are converted into attribute nodes with type annotationxs:untypedAtomic, as described in Section 6.5.3.1.17.3.3.1.1 Element and Attribute Node Types^DM.

15.2 Functions on HTML Data

Changes in 4.0 ⬇ ⬆

A new function is available for processing input data in HTML format. [Issues 74 850 1799 1889 1891 PRs 259 956 10 January 2023]

This function converts between the lexical representation of HTML and the XDM tree representation.

Function	Meaning
`fn:parse-html`	This function takes as input an HTML document, and returns the document node at the root of an XDM tree representing the parsed document.
`fn:html-doc`	Reads an external resource containing HTML, and returns the result of parsing the resource as HTML.

15.2.1 XDM Mapping from HTML DOM Nodes

The fn:parse-html function conceptually works in two phases:

The lexical HTML (supplied as a string) is parsed into an HTML DOM as defined by the HTML5 specification: see [HTML: Living Standard] and [DOM: Living Standard].
The resulting DOM is converted to an XDM tree as described in this section. This is described by defining the actions of the accessor functions defined in Section 6.77.5 Accessors^DM.

Note:

Because the [DOM: Living Standard] and [HTML: Living Standard] are not fixed, it is implementation-defined which versions are used.

Note:

An implementation must match the semantics of the mapping described in this section, but the specific way it achieves that is implementation-dependent.

Some possible implementation strategies are:

Parse the HTML to an HTML DOM and then convert the HTML DOM to an XDM node tree.
Parse the HTML to an HTML DOM and then implement a wrapper or facade that presents an XDM interface to the HTML DOM.
Parse the lexical HTML directly to an XDM node tree, bypassing the HTML DOM.

The [DOM: Living Standard] defines parsing algorithms for two different formats, which it refers to as the HTML and XML serializations (or concrete syntaxes). The XML serialization is an XML document which typically uses the namespace http://www.w3.org/1999/xhtml and the content type application/xhtml+xml, and is popularly referred to as XHTML. The HTML parsing algorithm constructs an HTML DOM HTMLDocument document object for the HTML document. The XHTML parsing algorithm constructs an HTML DOM XMLDocument object for the HTML document, following XML parsing rules. This mapping supports both of these document types.

The [DOM: Living Standard] specification defines HTML DOM nodes that are mapped to XDM nodes as follows:

The HTML DOM Document interface maps to Section 6.6.17.4.1 Document nodes^DM.
The HTML DOM Element interface maps to Section 6.6.27.4.2 Element nodes^DM.
The HTML DOM Attr interface maps to Section 6.6.37.4.3 Attribute nodes^DM.
Note:
Any HTML DOM Attr instances in an HTML DOM HTMLDocument that represent namespace declarations will have been filtered out: see 15.2.1.1 attributes Accessor.
The HTML DOM ProcessingInstruction interface maps to Section 6.6.57.4.5 Processing instruction nodes^DM.
Note:
The HTML parsing algorithm does not generate processing instruction nodes. If encountered they are parsed as comment nodes. The HTML DOM ProcessingInstruction interface is relevant only when the XHTML parsing algorithm is used.
The HTML DOM Comment interface maps to Section 6.6.67.4.6 Comment nodes^DM.
The HTML DOM Text interface maps to Section 6.6.77.4.7 Text nodes^DM. Adjacent HTML DOM Text nodes are combined into a single Section 6.6.77.4.7 Text nodes^DM.
Note:
The HTML DOM CDATASection interface is an instance of HTML DOM Text, so CDATA sections also map to Section 6.6.77.4.7 Text nodes^DM.
The use of CDATA sections can result in the HTML DOM containing adjacent text nodes, which the mapping to XDM will merge into a single node.

Note:

The HTML DOM DocumentFragment interface is not supported as an XML node. There are two places in the HTML DOM where this is used:

The HTML DOM ShadowRoot interface is not present in the main HTML DOM tree. It is only accessible via JavaScript.
The template element’s content property contains the child nodes of the template element. The behaviour of this is defined by the include-template-content key in the $options map.

If an implementation allows these nodes to be passed in via an API or similar mechanism, their behaviour is implementation-defined.

15.2.1.1 attributes Accessor

The result of the Section 6.7.17.5.1 attributes Accessor^DMdm:attributes($node) for an HTML DOM Node is as follows:

If the node is an instance of HTML DOM Element then the result is the value of the Element.attributes property mapped to a sequence as described below;
Otherwise, the result is an empty sequence.

An HTML DOM NamedNodeMap is mapped to a sequence as follows:

NamedNodeMap.length is the length of the sequence, where a length of 0 results in an empty sequence;
NamedNodeMap.item(n) is the n^th element of the sequence.

That sequence is then filtered as follows:

If the Attr.namespaceURI property is "http://www.w3.org/2000/xmlns/", the attribute is not included in this sequence;
If the Attr.localName property is "xmlns", the attribute is not included in this sequence;
If the Attr.localName property starts with "xmlns:", the attribute is not included in this sequence;
Otherwise, the attribute is included in this sequence using the XDM mapping rules described in this section.

Note:

The HTML DOM Element.attributes property includes namespace and non-namespace attributes in the list when the HTML or XML parser is used. As such, the namespace attributes have to be filtered from the resulting XDM attribute sequence.

Note:

When the resulting document is an HTML DOM HTMLDocument, the Attr.localName and Attr.name properties of HTML DOM Attr nodes are both set to the qualified name. This includes namespace declarations which are filtered out by the logic in this section.

Note:

The Attr.localName property will be ASCII lowercase. The [HTML: Living Standard] section 13.2.5.33, Attribute name state specifies that ASCII upper alpha characters are appended to the attribute’s name in lowercase.

15.2.1.2 base-uri Accessor

The result of the Section 6.7.27.5.2 base-uri Accessor^DMdm:base-uri($node) for an HTML DOM Node is the value of the Node.baseURI property mapped as follows:

If the value is null or an empty string, then the result is an empty sequence;
Otherwise, the string value is cast to an xs:anyURI.

15.2.1.3 children Accessor

The result of the Section 6.7.37.5.3 children Accessor^DMdm:children($node) for an HTML DOM Node is as follows:

If the node is an instance of HTML DOM Document then the result is the value of the Node.childNodes property mapped to a sequence;
If the node is an instance of HTML DOM HTMLTemplateElement then the result is determined as follows:
1. If the include-template-content key of the parse-html-options map is false(), the result is an empty sequence;
2. Select the HTML DOM DocumentFragment from the HTMLTemplateElement.content property;
3. The HTML DOM DocumentFragment’s Node.childNodes property is mapped to a sequence;
If the node is an instance of HTML DOM Element then the result the value of the Node.childNodes property mapped to a sequence;
Otherwise, the result is an empty sequence.

An HTML DOM NodeList is mapped to a sequence as follows:

NodeList.length is the length of the sequence, where a length of 0 results in an empty sequence;
NodeList.item(n) is the n^th element of the sequence.

That sequence is then filtered as follows:

If the child is an instance of HTML DOM DocumentType, that child is not included in this sequence;
A sequence of consecutive HTML DOM Text nodes is combined into a single XDM text node;
Otherwise, the HTML DOM Node nodes are mapped to XDM according to the rules in this section.

15.2.1.4 document-uri Accessor

The result of the Section 6.7.47.5.4 document-uri Accessor^DMdm:document-uri($node) for an HTML DOM Node is as follows:

If the node is an instance of HTML DOM Document then the value of the Document.documentURI property mapped as follows:
1. If the value is null or an empty string, then the result is an empty sequence;
2. Otherwise, the string value is cast to an xs:anyURI.
Otherwise, the result is an empty sequence.

15.2.1.5 is-id Accessor

The result of the Section 6.7.57.5.5 is-id Accessor^DMdm:is-id($node) for an HTML DOM Node is as follows:

If the node is an instance of HTML DOM Attr then:
1. If the Attr.name property (its qualified name) is "id", then:
  1. If the Attr.value is castable to an xs:NCName, the result is true;
  2. Otherwise, the result is false;
2. Otherwise, the result is false;
Otherwise, the result is false.

Note:

In [HTML: Living Standard] section 3.2.5, Global attributes, the id attribute is defined as being unique in the element’s tree, containing at least one character, and not having any ASCII whitespace characters. This means that an HTML id attribute may not conform to an xs:NCName.

If an HTML id is not a valid xs:NCName then that attribute is not an XML ID.

15.2.1.6 is-idrefs Accessor

The result of the Section 6.7.67.5.6 is-idrefs Accessor^DMdm:is-idrefs($node) for an HTML DOM Node is an empty sequence.

15.2.1.7 namespace-nodes Accessor

The result of the Section 6.7.77.5.7 namespace-nodes Accessor^DMdm:namespace-nodes($node) for an HTML DOM Node is as follows:

If the node is an instance of HTML DOM Element then an implementation-dependent sequence of namespace nodes that is sufficient to define the namespace context of the node.
Otherwise, the result is the empty sequence.

For the XHTML parsing algorithm, this will be equivalent to constructing the namespace nodes from an XML infoset, PSVI, or similar mapping.

For the HTML parsing algorithm, the [HTML: Living Standard] specification defines the namespace context in various places:

Section 2.1.3 XML compatibility defines the default element namespace to be http://www.w3.org/1999/xhtml.
Section 4.8.15 MathML defines rules for embedded MathML content in HTML documents. Section 13.1.2 Elements defines these elements as foreign elements, placing them in the MathML namespace (http://www.w3.org/1998/Math/MathML). The default element namespace for these elements is the MathML namespace.
Section 4.8.16 SVG defines rules for embedded SVG content in HTML documents. Section 13.1.2 Elements defines these elements as foreign elements, placing them in the SVG namespace (http://www.w3.org/2000/svg). The default element namespace for these elements is the SVG namespace.
Section 13.1.2.3 Attributes defines several namespaced attributes available on foreign elements. If any of these namespaced attributes are present, a namespace node for that namespace must be present on the element.
The supported namespace prefixes are:
1. xlink in the http://www.w3.org/1999/xlink namespace;
2. xml in the http://www.w3.org/XML/1998/namespace namespace; and
3. xmlns in the http://www.w3.org/2000/xmlns/ namespace.

No other namespaces are supported by the HTML parser.

Note:

Section number references to [HTML: Living Standard] may change over time.

15.2.1.8 nilled Accessor

The result of the Section 6.7.87.5.8 nilled Accessor^DMdm:nilled($node) for an HTML DOM Node is false().

15.2.1.9 node-kind Accessor

The result of the Section 6.7.97.5.9 node-kind Accessor^DMdm:node-kind($node) for an HTML DOM Node is as follows:

If the node is an instance of HTML DOM Document then the result is "document".
If the node is an instance of HTML DOM Element then the result is "element".
If the node is an instance of HTML DOM Attr then the result is "attribute".
If the node is an instance of HTML DOM ProcessingInstruction then the result is "processing-instruction".
If the node is an instance of HTML DOM Comment then the result is "comment".
If the node is an instance of HTML DOM Text then the result is "text".

15.2.1.10 node-name Accessor

The result of the Section 6.7.107.5.10 node-name Accessor^DMdm:node-name($node) for an HTML DOM Node is as follows:

If the node is an instance of HTML DOM Element then the result is determined as follows:
1. The local name is the value of the Element.localName property. This is derived as follows:
  1. The local name is initially set to the ASCII lowercase tag name. The [HTML: Living Standard] section 13.2.5.8, Tag name state specifies that ASCII upper alpha characters are appended to the element’s name in lowercase.
  2. If the local name is an SVG element name, the case-sensitive name is used. [HTML: Living Standard] section 13.2.6.5, The rules for parsing tokens in foreign content has a table mapping the lowercase element names to their SVG names.
  3. If the local name contains a character that is not a valid XML NameStartChar or NameChar, then an implementation-defined replacement string is used. The result must be a valid NCName.
    Note:
    [HTML: Living Standard] section 13.2.9 Coercing an HTML DOM into an infoset uses a Unnnnnn escape sequence. That would map : to U00003A.
    This local name escaping applies only to the HTML parsing algorithm. If the XHTML parsing algorithm is used, the localName and prefix will be correctly set for QName-based node names.
2. The namespace prefix is the value of the Element.prefix property, or empty if the value is null;
3. The namespace URI is the value of the Element.namespaceURI property, or empty if the value is null.
  1. If the element is an HTML element, the namespace URI is "http://www.w3.org/1999/xhtml".
  2. If the element is an SVG element, the namespace URI is "http://www.w3.org/2000/svg".
  3. If the element is a MathML element, the namespace URI is "http://www.w3.org/1998/Math/MathML".
If the node is an instance of HTML DOM Attr then the result is determined as follows:
1. The attribute name is the tokenized attribute name. The [HTML: Living Standard] section 13.2.5.33, Attribute name state specifies that ASCII upper alpha characters are appended to the attribute’s name in lowercase.
2. The local name is the value of the Attr.localName property. This is derived as follows:
  1. The local name is initially set to the attribute name.
  2. If the local name is an SVG or MathML attribute name, the case-sensitive name is used. [HTML: Living Standard] section 13.2.6.1, Creating and inserting nodes has a table mapping the lowercase attribute names to their SVG/MathML names.
  3. If the local name is an allowed xlink, xml, or xmlns attribute name the local name is the value of the local name column of the attribute name mapping table in [HTML: Living Standard] section 13.2.6.1, Creating and inserting nodes.
  4. If the local name contains a character that is not a valid XML NameStartChar or NameChar, then an implementation-defined replacement string is used. The result must be a valid NCName.
    Note:
    [DOM: Living Standard] section 13.2.9 Coercing an HTML DOM into an infoset uses a Unnnnnn escape sequence. That would map : to U00003A.
    This local name escaping applies only to the HTML parsing algorithm. If the XHTML parsing algorithm is used, the localName and prefix will be correctly set for QName-based node names.
3. The namespace prefix is the value of the Attr.prefix property, or empty if the value is null.
  1. If the attribute name is an allowed xlink, xml, or xmlns attribute name the namespace prefix is the value of the prefix column of the attribute name mapping table in [HTML: Living Standard] section 13.2.6.1, Creating and inserting nodes.
4. The namespace URI is the value of the Attr.namespaceURI property, or empty if the value is null;
  1. If the attribute name is an allowed xlink, xml, or xmlns attribute name the namespace URI is the value of the namespace column of the attribute name mapping table in [HTML: Living Standard] section 13.2.6.1, Creating and inserting nodes.
If the node is an instance of HTML DOM ProcessingInstruction then the result is an xs:QName constructed as follows:
1. The local name is the value of the ProcessingInstruction.target property;
2. The namespace prefix is empty;
3. The namespace URI is empty;
Otherwise, the result is an empty sequence.

Note:

When the resulting document is an HTML DOM HTMLDocument, the Element.localName and Element.name properties of HTML DOM Element nodes are both set to the qualified name.

Note:

When the resulting document is an HTML DOM HTMLDocument, the Attr.localName and Attr.name properties of HTML DOM Attr nodes are both set to the qualified name.

15.2.1.11 parent Accessor

The result of the Section 6.7.117.5.11 parent Accessor^DMdm:parent($node) for an HTML DOM Node is as follows:

Let $parent be the Node.parentNode property of the node;
If $parent is an instance of HTML DOM DocumentFragment, then for each HTML DOM HTMLTemplateElement$template in the parsed DOM tree:
1. Let $content be the value of the HTMLTemplateElement.content property of $template;
2. If $content is the same node as $parent, then the result is $template using the XDM mapping rules described in this section;
3. If there are no more $template nodes, then the result is an empty sequence;
If $parent is null, then the result is an empty sequence;
Otherwise, the result is $parent using the XDM mapping rules described in this section.

Note:

The current node can have a HTML DOM DocumentFragment parent node only if the include-template-content key of the html-parser-options is true().

Note:

The HTML DOM DocumentFragment’s Node.parentNode property is null, and a DocumentFragment attached to HTMLTemplateElement.content property does not have a host property connecting the fragment back to the template element.

If a future version of [DOM: Living Standard] adds a DocumentFragment.host property that references the node’s template element, or the implementation has access to that internal property, the implementation may choose to use that instead of traversing the parsed HTML tree.

15.2.1.12 string-value Accessor

The result of the Section 6.7.127.5.12 string-value Accessor^DMdm:string-value($node) for an HTML DOM Node is as follows:

If the node is an instance of HTML DOM Document, then use the algorithm described in 15.2.1.12.1 Tree string construction;
If the node is an instance of HTML DOM Element, then use the algorithm described in 15.2.1.12.1 Tree string construction;
If the node is an instance of HTML DOM Text, then use the algorithm described in 15.2.1.12.2 Text node string construction;
Otherwise, the result is the value of the Node.nodeValue property.

15.2.1.13 type-name Accessor

The result of the Section 6.7.137.5.13 type-name Accessor^DMdm:type-name($node) for an HTML DOM Node is as follows:

If the node is an instance of HTML DOM Element then the result is xs:untyped.
If the node is an instance of HTML DOM Attr then the result is xs:untypedAtomic.
If the node is an instance of HTML DOM Text then the result is xs:untypedAtomic.
Otherwise, the result is an empty sequence.

15.2.1.14 typed-value Accessor

The result of the Section 6.7.147.5.14 typed-value Accessor^DMdm:typed-value($node) for an HTML DOM Node is as follows:

Let $string-value be the 15.2.1.12 string-value Accessor for the node;
If the node is an instance of HTML DOM Document then the result is $string-value as an xs:untypedAtomic;
If the node is an instance of HTML DOM Element then the result is $string-value as an xs:untypedAtomic;
If the node is an instance of HTML DOM Attr then the result is $string-value as an xs:untypedAtomic;
If the node is an instance of HTML DOM Text then the result is $string-value as an xs:untypedAtomic;
Otherwise, the result is $string-value.

15.2.1.15 unparsed-entity-public-id Accessor

The result of the Section 6.7.157.5.15 unparsed-entity-public-id Accessor^DMdm:unparsed-entity-public-id($node) for an HTML DOM Node is an empty sequence.

15.2.1.16 unparsed-entity-system-id Accessor

The result of the Section 6.7.167.5.16 unparsed-entity-system-id Accessor^DMdm:unparsed-entity-system-id($node) for an HTML DOM Node is an empty sequence.

15.3 Functions on JSON Data

The functions listed in this section parse or serialize JSON data.

JSON is a popular format for exchange of structured data on the web: it is specified in [RFC 7159]. This section describes facilities allowing JSON data to be converted to and from XDM values.

This specification describes two ways of representing JSON data losslessly using XDM constructs. The first method uses XDM maps to represent JSON objects, and XDM arrays to represent JSON arrays. The second method represents all JSON constructs using XDM element and attribute nodes.

Function	Meaning
`fn:parse-json`	Parses input supplied in the form of a JSON text, returning the results typically in the form of a map or array.
`fn:json-doc`	Reads an external resource containing JSON, and returns the result of parsing the resource as JSON.
`fn:json-to-xml`	Parses a string supplied in the form of a JSON text, returning the results in the form of an XML document node.
`fn:xml-to-json`	Converts an XML tree, whose format corresponds to the XML representation of JSON defined in this specification, into a string conforming to the JSON grammar.
`fn:pin`	Adapts a map or array so that retrieval operations retain additional information.
`fn:pin`	Adapts a map or array so that retrieval operations retain additional information.
`fn:label`	Returns the label associated with a labeled item, as a map.
`fn:label`	Returns the label associated with a labeled item, as a map.

Note also:

The function fn:serialize has an option to generate JSON output from a structure of maps and arrays.
The function fn:element-to-map enables arbitrary XML node trees to be converted to trees of maps and arrays suitable for serializing as JSON.

15.3.8 fn:pin

Changes in 4.0 ⬇ ⬆

New in 4.0 [Issue 960 PR 988 27 February 2024]

Summary

Adapts a map or array so that retrieval operations retain additional information.

Signature

`fn:pin`(
`$input`	`as` `(map()\|array())`
) `as` `(map()\|array())`

Properties

This function is nondeterministic, context-independent, and focus-independent.

Rules

The function creates a deep copy of the supplied map or array, adapted so that navigation within the deep copy returns items that are labeled with additional information about their position within the containing tree structure.

Note:

The formal specification of the function describes it as constructing a deep copy of the entire tree, but a practical implementation is likely to use a lazy evaluation strategy, so the only costs incurred are for items actually selected within the tree.

The function makes use of the concept of labeled items, an extension to the data model described in Section 3.3 Labeled Items^DM.

The supplied value of $input must be either a map or an array.

The result is as follows:

If $input is a map M, the result is a map M′ derived from M as follows:
1. Any existing label on M is discarded.
2. M′ acquires a label having the property pinned set to the value true, and the property id set to an arbitrary xs:string value that is unique within the execution scope.
3. For every key-value pair (K, V) in M, M′ will have a key-value pair (K, V′) in which the key K is unchanged, and the value V′ is derived from V by applying the function derived-value(M', K, V), defined below.
4. The entry order^DM of M is retained in M′.
If $input is an array A, the result is an array A′ derived from A as follows:
1. Any existing label on A is discarded.
2. A′ acquires a label having the property pinned set to the value true, and the property id set to an arbitrary xs:string value that is unique within the execution scope.
3. For every member V in A, whose 1-based index position in A is X, A′ will have a member V′ derived from V by applying the function derived-value(A', X, V), defined below.
The id property described in the previous paragraphs is allocated only to the top-level map or array (the one supplied as an explicit argument to the fn:pin function). The function is notdeterministic: that is, if the function is called twice with the same arguments, it is implementation-dependent whether the same id property is allocated on both occasions.
If $input is anything other than a map or an array, a type error is raised.
The function derived-value(P, K, V) has the following logic. For every item J in V, V′ will contain an item J′ that is derived from J as follows:
1. Let TEMP be:
  1. If J is a map or array, then fn:pin(J).
    Note:
    Note however that the id property of TEMP is not used, so there is no need to generate it.
  2. Otherwise, J.
2. J′ is then a labeled item having the same subject as TEMP, together with a label having the following properties:
  pinned
  true
  key
  K
  position
  The 1-based position of J within V.
  parent
  P
  ancestors
  A zero-arity function item delivering the value of (?parent, ?parent ! label(.)?ancestors()).
  path
  A zero-arity function item delivering the value of (?parent ! label(.)?path(), ?key).

Notes

The effect of calling pin on a map or array is that subsequent retrieval operations within the pinned map or array return labeled results, whose labels contain useful information about where the results were found. For example, an expression such as json-doc($source)??name will return the values of all entries in the JSON tree having the key "name"; but very little can be done with this information because the result is simply a sequence of (typically) strings with no context. By contrast, the result of pin(json-doc($source))??name is the same set of strings, labeled with information about where they were found. For example, if $result is the result of the expression pin(json-doc($source))??name, then:

$result => label()?parent?ssn locates the map that contained each name, and returns the value of the ssn entry in that map.
$result => label()?ancestors()?course returns the values of any course entries in containing maps.
$result => label()?path() returns a sequence of map keys and array index values representing the location of the found entries within the JSON structure.

Editorial note
The `id` property on the root of a pinned map or array is intended to support deep update operations, which have not yet been defined.

Examples

Expression:	`pin([ "a", "b", "c" ])?1 ! label(.)?parent ! array:foot(.)`
Result:	"c"
Expression:	`pin([ "a", "b", "c", "d" ]) ! array:remove(., 2)?* ! label(.)?key`
Result:	1, 3, 4
Expression:	let $data := { "fr": { "capital": "Paris", "languages": [ "French" ] }, "de": { "capital": "Berlin", "languages": [ "German" ] } } return pin($data)??languages[. = 'German'] ! label(.)?path()[1]
Result:	"de"

15.3.9 fn:label

Changes in 4.0 ⬇ ⬆

New in 4.0 [Issue 960 PR 988 27 February 2024]

Summary

Returns the label associated with a labeled item, as a map.

Signature

`fn:label`(
`$input`	`as` `item()?`
) `as` `map(xs:string, item()*)?`

Properties

This function is deterministic, context-independent, and focus-independent.

Rules

If $input is an empty sequence, the function returns an empty sequence.

If $input is an item that has no label, the function returns an empty map.

If $input is a labeled item, the function returns the label, as a map.

Notes

The function makes use of the concept of labeled items, an extension to the data model described in Section 3.3 Labeled Items^DM.

The data model allows any item to be labeled, and allows the label to be any map with string-valued keys. Currently the only operation that creates labeled values is the fn:pin function. For examples illustrating the use of fn:label, see fn:pin.

17 Higher-order functions

17.1 Processing function items

The functions included in this section operate on function items, that is, values referring to a function.

[Definition] Functions that accept functions among their arguments, or that return functions in their result, are described in this specification as higher-order functions.

Note:

Some functions such as fn:parse-json allow the option of supplying a callback function for example to define exception behavior. Where this is not essential to the use of the function, the function has not been classified as higher-order for this purpose; in applications where function items cannot be created, these particular options will not be available.

Function	Meaning
`fn:function-lookup`	Returns a function item having a given name and arity, if there is one.
`fn:function-name`	Returns the name of the function identified by a function item.
`fn:function-arity`	Returns the arity of the function identified by a function item.
`fn:function-identity`	Returns a string representing the identity of a function item.
`fn:function-annotations`	Returns the annotations of the function item.

17.1.4 fn:function-identity

Summary

Returns a string representing the identity of a function item.

Signature

`fn:function-identity`(
`$function`	`as` `fn(*)`
) `as` `xs:string`

Properties

This function is deterministic, context-independent, and focus-independent.

Rules

The fn:function-identity function returns a string that represents the identity of $function.

The returned string has the property that fn:function-identity($f1) and fn:function-identity($f2) are codepoint-equal if and only if $f1 and $f2 have the same function identity. Apart from this property, the result is implementation-dependent.

Any label attached to a function item is ignored (see Section 3.3 Labeled Items^DM). Specifically, if L is a labeled item then fn:function-identity(L) returns the function identity of the subject of L.

In the case of maps and arrays, the result follows the following rule: If $X and $Y are both maps or arrays then fn:function-identity($X)must not be codepoint-equal to fn:function-identity($Y) unless $X and $Y are indistinguishable, that is unless every operator or function applied to $X returns the same result as for $Y. Even in this case, however, the result of the comparison fn:function-identity($X) eq fn:function-identity($Y) is implementation-dependent.

Notes

This function enables applications to test whether two expressions or variables reference the same function item. This may be useful, for example, to allow caching of function results to avoid repeated evaluation. The results of previous function invocations might be held in a map whose key is the function identity.

The function identity, by definition, is generated upon the creation of a function item. Specific expressions that create function items have their own rules for the identity of the returned functions: for example, it is guaranteed that evaluation of a function reference to a system function with no captured context (such as fn:abs#1) will always return the same function item.

It is not meaningful to store or compare the result of calling fn:function-identity across different execution scopes, because the string used to represent the function identity will generally vary from one execution scope to another.

The result of an expression such as function-identity(abs#1) eq function-identity(abs(?)) may be either true or false, because it is implementation-dependent whether abs#1 and abs(?) return the same function item.

Similarly, function-identity({ 1:() }) eq function-identity(map:entry(1, ())) may be either true or false.

Labels on function items are ignored because they typically represent information about how the function item was retrieved, rather than about the item itself. For example, a function item held in a map might be retrieved using a variety of lookup expressions, which may return the same function item but with different labels.

Examples

Expression	Result
`function-identity(abs#1) eq function-identity(abs#1)`	true()
`function-identity(abs#1) eq function-identity(round#1)`	false()
`function-identity({ 1: 0 }) eq function-identity({ 1: 1 })`	false()
`function-identity([ 0 ]) eq function-identity([ 1 ])`	false()

17.2 Basic higher-order functions

The following functions take function items as an argument.

Function	Meaning
`fn:apply`	Makes a dynamic call on a function with an argument list supplied in the form of an array.
`fn:do-until`	Processes a supplied value repeatedly, continuing when some condition is false, and returning the value that satisfies the condition.
`fn:every`	Returns `true` if every item in the input sequence matches a supplied predicate.
`fn:filter`	Returns those items from the sequence `$input` for which the supplied function `$predicate` returns `true`.
`fn:fold-left`	Processes the supplied sequence from left to right, applying the supplied function repeatedly to each item in turn, together with an accumulated result value.
`fn:fold-right`	Processes the supplied sequence from right to left, applying the supplied function repeatedly to each item in turn, together with an accumulated result value.
`fn:for-each`	Applies the function item `$action` to every item from the sequence `$input` in turn, returning the concatenation of the resulting sequences in order.
`fn:for-each-pair`	Applies the function item `$action` to successive pairs of items taken one from `$input1` and one from `$input2`, returning the concatenation of the resulting sequences in order.
`fn:highest`	Returns those items from a supplied sequence that have the highest value of a sort key, where the sort key can be computed using a caller-supplied function.
`fn:index-where`	Returns the positions in an input sequence of items that match a supplied predicate.
`fn:lowest`	Returns those items from a supplied sequence that have the lowest value of a sort key, where the sort key can be computed using a caller-supplied function.
`fn:partial-apply`	Performs partial application of a function item by binding values to selected arguments.
`fn:partition`	Partitions a sequence of items into a sequence of non-empty arrays containing the same items, starting a new partition when a supplied condition is true.
`fn:scan-left`	Produces the sequence of successive partial results from the evaluation of `fn:fold-left` with the same arguments.
`fn:scan-right`	Produces the sequence of successive partial results from the evaluation of `fn:fold-right` with the same arguments.
`fn:some`	Returns `true` if at least one item in the input sequence matches a supplied predicate.
`fn:sort`	Sorts a supplied sequence, based on the value of a sort key supplied as a function.
`fn:sort-by`	Sorts a supplied sequence, based on the value of a number of sort keys supplied as functions.
`fn:sort-with`	Sorts a supplied sequence, according to the order induced by the supplied comparator functions.
`fn:subsequence-where`	Returns a contiguous sequence of items from `$input`, with the start and end points located by applying predicates.
`fn:take-while`	Returns items from the input sequence prior to the first one that fails to match a supplied predicate.
`fn:transitive-closure`	Returns all the nodes reachable from a given start node by applying a supplied function repeatedly.
`fn:while-do`	Processes a supplied value repeatedly, continuing while some condition remains true, and returning the first value that does not satisfy the condition.

With all these functions, if the caller-supplied function fails with a dynamic error, this error is propagated as an error from the higher-order function itself.

17.2.12 fn:partial-apply

Changes in 4.0 ⬇ ⬆

New in 4.0 [Issue 1816 PR 1825 25 February 2025]

Summary

Performs partial application of a function item by binding values to selected arguments.

Signature

`fn:partial-apply`(
`$function`	`as` `fn(*)`,
`$arguments`	`as` `map(xs:positiveInteger, item()*)`
) `as` `fn(*)`

Properties

This function is deterministic, context-independent, and focus-independent.

Rules

The result is a function obtained by binding values to selected arguments of the function item $function. The arguments to be bound are represented by entries in the $arguments map: an entry with key $i and value $v causes the argument at position $i (1-based) to be bound to $v.

Any entries in $arguments whose keys are greater than the arity of $function are ignored.

If $arguments is an empty map then the function returns $function unchanged.

For example, the effect of calling fn:partial-apply($f, { 2: $x }) is the same as the effect of the partial appplication $f(?, $x, ?, ?, ....). The coercion rules are applied to the supplied arguments in the usual way.

Unlike a partial application using place-holder arguments:

The arity of $function need not be statically known.
It is possible to bind all the arguments of $function: the effect is to return a zero-arity function.

The result is a partially applied function^XP having the following properties (which are defined in Section 7.18.1 Function Items^DM):

name: absent.
identity: A new function identity distinct from the identity of any other function item.
Note:
See also Section 4.5.7 Function Identity^XP.
arity: The arity of $function minus the number of parameters in $function that map to supplied arguments in $arguments.
parameter names: The names of the parameters of $function that do not map to supplied arguments in $arguments.
signature: The parameters in the returned function are the parameters of $function that do not map to supplied arguments in $arguments, retaining order. The result type of the returned function is the same as the result type of $function.
An implementation that can determine a more specific signature (for example, through use of type analysis) is permitted to do so.
body: The body of $function.
captured context: The static and dynamic context of $function, augmented, for each supplied argument, with a binding of the converted argument value to the corresponding parameter name.

Error Conditions

A type error is raised if any of the supplied arguments, after applying the coercion rules, does not match the required type of the corresponding function parameter.

In addition, a dynamic error may be raised if any of the supplied arguments does not match other constraints on the value of that argument (for example, if the value supplied for a parameter expecting a regular expression is not a valid regular expression); or if the processor is able to establish that evaluation of the resulting function will fail for any other reason (for example, if an error is raised while evaluating a subexpression in the function body that depends only on explicitly supplied and defaulted parameters).

Notes

The function is useful where the arity of a function item is not known statically, or where all arguments in a function are to be bound, returning a zero-arity function.

Examples

Expression:	let $f := partial-apply(dateTime#2, {2: xs:time('00:00:00') }) return $f(xs:date('2025-03-01'))
Result:	xs:dateTime('2025-03-01T00:00:00')

18 Processing maps

Maps were introduced as a new datatype in XDM 3.1. This section describes functions that operate on maps.

A map is a kind of item.

[Definition] A map consists of a sequence of entries, also known as key-value pairs. Each entry comprises a key which is an arbitrary atomic item, and an arbitrary sequence called the associated value.

[Definition] Within a map, no two entries have the same key. Two atomic items K1 and K2 are the same key for this purpose if the function call fn:atomic-equal($K1, $K2) returns true.

It is not necessary that all the keys in a map should be of the same type (for example, they can include a mixture of integers and strings).

Maps are immutable, and have no identity separate from their content. For example, the map:remove function returns a map that differs from the supplied map by the omission (typically) of one entry, but the supplied map is not changed by the operation. Two calls on map:remove with the same arguments return maps that are indistinguishable from each other; there is no way of asking whether these are “the same map”.

A map can also be viewed as a function from keys to associated values. To achieve this, a map is also a function item. The function corresponding to the map has the signature function($key as xs:anyAtomicValue) as item()*. Calling the function has the same effect as calling the map:get function: the expression $map($key) returns the same result as get($map, $key). For example, if $books-by-isbn is a map whose keys are ISBNs and whose assocated values are book elements, then the expression $books-by-isbn("0470192747") returns the book element with the given ISBN. The fact that a map is a function item allows it to be passed as an argument to higher-order functions that expect a function item as one of their arguments.

18.4 Functions that Operate on Maps

The functions defined in this section use a conventional namespace prefix map, which is assumed to be bound to the namespace URI http://www.w3.org/2005/xpath-functions/map.

The function call map:get($map, $key) can be used to retrieve the value associated with a given key.

There is no operation to atomize a map or convert it to a string. The function fn:serialize can in some cases be used to produce a JSON representation of a map.

Function	Meaning
`map:build`	Returns a map that typically contains one entry for each item in a supplied input sequence.
`map:contains`	Tests whether a supplied map contains an entry for a given key.
`map:empty`	Returns `true` if the supplied map contains no entries.
`map:entries`	Returns a sequence containing all the key-value pairs present in a map, each represented as a single-entry map.
`map:entry`	Returns a single-entry map that represents a single key-value pair.
`map:filter`	Selects entries from a map, returning a new map.
`map:find`	Searches the supplied input sequence and any contained maps and arrays for a map entry with the supplied key, and returns the corresponding values.
`map:for-each`	Applies a supplied function to every entry in a map, returning the sequence concatenation^XP of the results.
`map:get`	Returns the value associated with a supplied key in a given map.
`map:items`	Returns a sequence containing all the values present in a map, in order.
`map:keys`	Returns a sequence containing all the keys present in a map.
`map:keys-where`	Returns a sequence containing selected keys present in a map.
`map:merge`	Returns a map that combines the entries from a number of existing maps.
`map:of-pairs`	Returns a map that combines data from a sequence of key-value pair maps.
`map:pair`	Returns a key-value pair map that represents a single key-value pair.
`map:pairs`	Returns a sequence containing all the key-value pairs present in a map, each represented as a key-value pair map.
`map:put`	Returns a map containing all the contents of the supplied map, but with an additional entry, which replaces any existing entry for the same key.
`map:remove`	Returns a map containing all the entries from a supplied map, except those having a specified key.
`map:size`	Returns the number of entries in the supplied map.

18.4.1 map:build

Changes in 4.0 ⬇ ⬆

New in 4.0 [Issue 151 PR 203 18 October 2022]

Summary

Returns a map that typically contains one entry for each item in a supplied input sequence.

Signature

`map:build`(
`$input`	`as` `item()*`,
`$key`	`as` `(fn($item as item(), $position as xs:integer) as xs:anyAtomicType*)?`	`:=` `fn:identity#1`,
`$value`	`as` `(fn($item as item(), $position as xs:integer) as item()*)?`	`:=` `fn:identity#1`,
`$options`	`as` `map(*)?`	`:=` `{}`
) `as` `map(*)`

Properties

This function is deterministic, context-independent, and focus-independent.

Rules

Informally, the function processes each item in $input in order. It calls the $key function on that item to obtain a sequence of key values, and the $value function to obtain an associated value. Then, for each key value:

If the key is not already present in the target map, the processor adds a new key-value pair to the map, with that key and that value.
If the key is already present, the processor combines the new value for the key with the existing value; the way they are combined is determined by the duplicates option.
By default, when two duplicate entries occur:
- A single combined entry will be present in the result.
- This entry will contain the sequence concatenation^XP of the supplied values.
- The position of the combined entry in the entry order^DM of the result map will correspond to the position of the first of the duplicates.
- The key of the combined entry will correspond to the key of one of the duplicates: it is implementation-dependent which one is chosen. (It is possible for two keys to be considered duplicates even if they differ: for example, they may have different type annotations, or they may be xs:dateTime values in different timezones.)
The $options argument can be used to control the way in which duplicate keys are handled. The allowed options, and their meanings, are the same as for the map:of-pairs function. The option parameter conventions apply.

Formal Equivalent

The effect of the function is equivalent to the result of the following XPath expression.

for-each(
  $input, 
  fn($item, $pos) {
    for-each($keys($item, $pos), fn($k) {
      map:pair($k, $value($item, $pos))
    }
  )}
)
=> map:of-pairs($options)

Error Conditions

An error is raised [err:FOJS0003] if the value of $options indicates that duplicates are to be rejected, and a duplicate key is encountered.

An error is raised [err:FOJS0005] if the value of $options includes an entry whose key is defined in this specification, and whose value is not a permitted value for that key.

Notes

The default function for both $keys and $value is the identity function. Although it is permitted to default both, this serves little purpose: usually at least one of these arguments will be supplied.

Examples

Expression:	`map:build((), string#1)`
Result:	{}
Expression:	`map:build(1 to 10, fn { . mod 3 })`
Result:	{ 0: (3, 6, 9), 1: (1, 4, 7, 10), 2: (2, 5, 8) } (Returns a map with one entry for each distinct value of `. mod 3`. The function to compute the value is the identity function, and duplicates are combined by sequence concatenation.)
Expression:	map:build( 1 to 5, value := format-integer(?, "w") )
Result:	{ 1: "one", 2: "two", 3: "three", 4: "four", 5: "five" } (Returns a map with five entries. The function to compute the key is an identity function, the function to compute the value invokes `fn:format-integer`.)
Expression:	map:build( ("January", "February", "March", "April", "May", "June", "July", "August", "September", "October", "November", "December"), substring(?, 1, 1) )
Result:	{ "A": ("April", "August"), "D": ("December"), "F": ("February"), "J": ("January", "June", "July"), "M": ("March", "May"), "N": ("November"), "O": ("October"), "S": ("September") }
Expression:	map:build( ("apple", "apricot", "banana", "blueberry", "cherry"), substring(?, 1, 1), string-length#1, { "duplicates": op("+") } )
Result:	{ "a": 12, "b": 15, "c": 6 } (Constructs a map where the key is the first character of an input item, and where the corresponding value is the total string-length of the items starting with that character.)
Expression:	map:build( ('Wang', 'Liu', 'Zhao'), key := fn($name, $pos) { $name }, value := fn($name, $pos) { $pos } )
Result:	{ "Wang": 1, "Liu": 2, "Zhao": 3 } (Returns an inverted index for the input sequence with the string stored as key and the position stored as value.)
Expression:	let $titles := <titles> <title>A Beginner’s Guide to <ix>Java</ix></title> <title>Learning <ix>XML</ix></title> <title>Using <ix>XML</ix> with <ix>Java</ix></title> </titles> return map:build($titles/title, fn($title) { $title/ix })
Result:	{ "Java": ( <title>A Beginner’s Guide to <ix>Java</ix></title>, <title>Using <ix>XML</ix> with <ix>Java</ix></title> ), "XML": ( <title>Learning <ix>XML</ix></title>, <title>Using <ix>XML</ix> with <ix>Java</ix></title> ) }
The following expression creates a map whose keys are employee `@ssn` values, and whose corresponding values are the employee nodes:
map:build(//employee, fn { @ssn })
The following expression creates a map whose keys are employee `@location` values, and whose corresponding values represent the number of employees at each distinct location. Any employees that lack an `@location` attribute will be excluded from the result.
map:build(//employee, fn { @location }, fn { 1 }, { "duplicates": op("+") })
The following expression creates a map whose keys are employee `@location` values, and whose corresponding values contain the employee node for the highest-paid employee at each distinct location:
map:build( //employee, key := fn { @location }, combine := fn($a, $b) { highest(($a, $b), fn { xs:decimal(@salary) }) } )
The following expression creates a map allowing efficient access to every element in a document by means of its `fn:generate-id` value:
map:build(//*, generate-id#1)
The following expression creates a map allowing efficient access to values in a recursive JSON structure using hierarchic paths:
The following expression creates a map allowing efficient access to values in a recursive JSON structure using hierarchic paths:
let $tree := parse-json('{ "type": "package", "name": "org", "content": [ { "type": "package", "name": "xml, "content: [ { "type": "package", "name": "sax", "content": [ { "type": "class", "name": "Attributes"}, { "type": "class", "name": "ContentHandler"}, { "type": "class", "name": "XMLReader"} ] }] }] }') return map:build($tree ? descendant::~[record(type, name, *)], fn{?ancestor-or-self::name => reverse() => string-join(,)}, fn{`{?type} {?name}`})
let $tree := parse-json('{ "type": "package", "name": "org", "content": [ { "type": "package", "name": "xml, "content: [ { "type": "package", "name": "sax", "content": [ { "type": "class", "name": "Attributes"}, { "type": "class", "name": "ContentHandler"}, { "type": "class", "name": "XMLReader"} ] }] }] }') return map:build($tree ? descendant::~[record(type, name, *)], fn{?ancestor-or-self::name => reverse() => string-join(,)}, fn{`{?type} {?name}`})
The result is the map:
The result is the map:
{ "org.xml.sax.Attributes": "class Attributes", "org.xml.sax.ContentHandler": "class ContentHandler", "org.xml.sax.XMLReader": "class XMLReader" }
{ "org.xml.sax.Attributes": "class Attributes", "org.xml.sax.ContentHandler": "class ContentHandler", "org.xml.sax.XMLReader": "class XMLReader" }

20 Processing JNodes

Changes in 4.0 ⬇ ⬆

Introduced the concept of JNodes. [Issue 2025 PR 2031 11 June 2025]

A JNode^DM is a wrapper around a map or array, or around a value that appears within the content of a map or array. JNodes are described at Section 8.4 JNodes^DM. Wrapping a map or array in a JNode enables the use of XPath lookup expressions such as $jnode?descendant::title, as described at Section 4.13.3 Lookup Expressions^XP.

In addition to the functions defined in this section, functions that operate on JNodes include:

fn:root
fn:generate-id
fn:distinct-ordered-nodes

20.1 Functions on JNodes

20.1.1 fn:JNode

Changes in 4.0 ⬇ ⬆

New in 4.0 [Issue 2025 PR 2031 12 June 2025]

Summary

Delivers a root JNode^DM wrapping a map or array, enabling the use of lookup expression to navigate a JTree^DM rooted at that map or array.

Signature

`fn:JNode`(
`$input`	`as` `(map()\|array())`
) `as` `JNode(map()\|array())`

Properties

This function is nondeterministic, context-independent, and focus-independent.

Rules

The function creates a JNode^DM that wraps the supplied map or array. Specifically, it creates a root JNode whose ¶value property is $input, and whose ¶parent, ¶position, and ¶selector properties are absent.

This has the effect that lookup expressions starting from this JNode retain information for subsequent navigation.

A JNode has unique identity. If two maps or arrays M₁ and M₂ have the same function identity, as determined by the function-identity function, then JNode(M₁) is JNode(M₂) must return true: that is, the same JNode must be delivered for both.

Notes

It is to some extent implementation-defined whether two maps or arrays have the same function identity. Processors should ensure as a minimum that when a variable $m is bound to a map or array, calling JNode($m) more than once (with the same variable reference) will deliver the same JNode each time.

The effect of the coercion rules is technically that if an existing JNode is supplied as $input, the wrapped value will be extracted, and then rewrapped as a JNode: in practice, this can be short-circuited by returning the supplied JNode unchanged.

Although fn:JNode is available as a function for user applications to call explicitly, it is also invoked implicitly by some expressions, notably when a lookup expression is written in a form such as $map?child::*. Specifically, if the left-hand operand of the lookup operator is a map or array, and the right-hand side uses an explicit axis such as child::, then the supplied map or array is implicitly wrapped in a JNode. The same is true when the deep lookup operator ?? is used.

The effect of applying fn:JNode to a map or array is that subsequent retrieval operations within the wrapped map or array return results that retain useful information about where the results were found. For example, consider an expression such as json-doc($source)??name. In this case the call on fn:JNode is implicit. This expression returns a set of JNodes representing all entries in the JTree having the key "name"; each of these JNodes contains not only the value of the relevant "name" entry, but also the key (which in this simple example is always "name" and the containing map. This means, for example, if $result is the result of the expression json-doc($source) ?? name, then:

$result ? .. ? ssn locates the map that contained each name, and returns the value of the ssn entry in that map.
$result ? ancestor::course returns the values of any course entries in containing maps.
$result ? ancestor::* => selector() returns a sequence of map keys and array index values representing the location of the found entries within the JSON structure.

An alternative way of wrapping a map or array, rather than calling JNode($X), is to use the lookup expression $X?..

Examples

Expression:	`JNode([ "a", "b", "c" ]) ? child::1 ? parent::* ! array:foot(.)`
Result:	"c"
Expression:	`JNode([ "a", "b", "c", "d" ]) ? child::* => selector()`
Result:	1, 2, 3, 4
Expression:	let $data := { "fr": { "capital": "Paris", "languages": [ "French" ] }, "de": { "capital": "Berlin", "languages": [ "German" ] } } return JNode($data) ?? languages[. = 'German'] ? .. ? capital) => string()
Result:	"Berlin"

20.1.2 fn:JNode-value

Summary

Returns the ¶value property of a JNode.

Signature

`fn:JNode-value`(
`$input`	`as` `JNode()`
) `as` `item()*`

Properties

This function is deterministic, context-independent, and focus-independent.

Rules

If $input is an empty sequence, the function returns an empty sequence.

Otherwise, the function returns the ¶value property of $input.

Notes

In many cases it is unnecessary to make an explicit call on JNode-value, because the coercion rules will take care of this automatically. For example, in an expression such as $X ? descendant::name [matches(., '^J')], the call on matches is supplied with a JNode as its first argument; atomization ensures that the actual value being passed to the first argument of matches is the atomized value of the ¶value property.

One case where the function call may be needed is when computing the effective boolean value. As with XNodes, writing if (?child::*[1]) ... tests for the existence of a child, it does not test its value. To test its value, write if (JNode-value(?child::*[1])) ..., or equivalently if (xs:boolean(?child::*[1])) ....

Examples

Expression:	let $array := [1, 3, 4.5, 7, "eight", 10] return $array ? child::~xs:integer =!> JNode-value()
Result:	1, 3, 7, 10
Expression:	let $map := {'Mo': 'Monday', 'Tu': 'Tuesday', 'We': 'Wednesday'} return $map ? child::("Mo", "We", "Fr", "Su") =!> JNode-value()
Result:	"Monday", "Wednesday"
Expression:	let $array := [[4, 18], [30, 4, 22]] return $array ? descendant::[. gt 25][1] ? ancestor-or-self:: =!> JNode-value() => reverse()
Result:	[[4, 18], [30, 4, 22]], [30, 4, 22], 30

20.1.3 fn:JNode-selector

Changes in 4.0 ⬇ ⬆

New in 4.0 [Issue 2025 PR 2031 12 June 2025]

Summary

Returns the ¶selector property of a JNode.

Signature

`fn:JNode-selector`(
`$input`	`as` `JNode()`
) `as` `xs:anyAtomicType?`

Properties

This function is deterministic, context-independent, and focus-independent.

Rules

If $input is an empty sequence, the function returns an empty sequence.

If $input is a root JNode (one in which the ¶selector property is absent), the function returns an empty sequence.

Otherwise, the function returns the ¶selector property of $input. In the case where the parent JNode wraps a map, this will be the key of the relevant entry within that map; in the case where the parent JNode wraps an array, it will be the 1-based index of the relevant member of the array.

Examples

Expression:	let $array := [1, 3, 4.5, 7, "eight", 10] return $array ? child::~xs:integer =!> JNode-selector()
Result:	1, 2, 4, 6
Expression:	let $map := {'Mo': 'Monday', 'Tu': 'Tuesday', 'We': 'Wednesday'} return $map ? child::("Mo", "We", "Fr", "Su") =!> JNode-selector()
Result:	"Mo", "We"
Expression:	let $array := [[4, 18], [30, 4, 22]] return $array ? descendant::[. gt 25][1] ? ancestor:: =!> JNode-selector() => reverse()
Result:	2, 1

20.1.4 fn:JNode-position

Changes in 4.0 ⬇ ⬆

New in 4.0 [Issue 2025 PR 2031 12 June 2025]

Summary

Returns the ¶position property of a JNode.

Signature

`fn:JNode-position`(
`$input`	`as` `JNode()`
) `as` `xs:anyAtomicType?`

Properties

This function is deterministic, context-independent, and focus-independent.

Rules

If $input is an empty sequence, the function returns an empty sequence.

If $input is a root JNode (one in which the ¶position property is absent), the function returns an empty sequence.

Otherwise, the function returns the ¶position property of $input. The value of this property will be 1 (one) except in cases where the value of an entry in a map, or a member in an array, is a sequence that contains multiple items including maps and/or arrays; in such cases the position will be the 1-based position of the relevant map or array.

Notes

This function is relevant only when there are maps whose entries are multi-item sequences that include maps and arrays, or arrays whose members include such multi-item sequences. Such structures are uncommon, and never arise from parsing of JSON source text. It is generally best to avoid such structures by using arrays rather than sequences within array and map content; apart from other considerations, this allows the data to be serialized in JSON format.

If an entry within a map, or a member of an array, contains a sequence of items that mixes arrays and maps with other content (for example the array [1, 2, ([1,2], [3,4], 5)), then a lookup using the child axis will only construct JNodes in respect of those items that are non-empty maps or arrays. This may leave gaps in the position numbering sequence, as illustrated in the examples below.

Examples

Expression:	let $input := { "a": [10, 20, 30], "b": ([40, 50, 60], [], 0, [70, 80, (90, 100)]) } return $input ? child::b ? * ! { "position": JNode-position(.), "index": JNode-selector(.) "value": JNode-value(.) }
Result:	{ "position": 1, "index": 1, "value": 40 }, { "position": 1, "index": 2, "value": 50 }, { "position": 1, "index": 3, "value": 60 }, { "position": 4, "index": 1, "value": 70 }, { "position": 4, "index": 2, "value": 80 }, { "position": 4, "index": 3, "value": (90, 100) }
Expression:	let $input := { "a": {"x": 10, "y": 20, "z": 30}, "b": ( {"x": 40, "y": 50, "z": 60}, {}, {"x": 70, "y": 80, "z": (90, 100)}) } return $input ? child::b ? * ! { "position": JNode-position(.), "key": JNode-selector(.) "value": JNode-value(.) }
Result:	{ "position": 1, "key": "x", "value": 40 }, { "position": 1, "key": "y", "value": 50 }, { "position": 1, "key": "z", "value": 60 }, { "position": 3, "key": "x", "value": 70 }, { "position": 3, "key": "y", "value": 80 }, { "position": 3, "key": "z", "value": (90, 100) }

20.2 Deep Update

A function is provided to make a modified copy of a tree rooted at either an XNode or JNode.

20.2.1 fn:update

Summary

Updates the contents of a tree of XNodes or JNodes, returning a modified copy.

Signature

`fn:update`(
`$root`	`as` `GNode()?`,
`$select`	`as` `GNode()*`,
`$action`	`as` `fn(GNode()) as item()*`
) `as` `GNode()?`

Properties

This function is deterministic, context-independent, and focus-independent.

Rules

If $input is an empty sequence, the function returns an empty sequence.

Informally, the function returns a modified copy of $root, in which any GNodes appearing in the value of $select are modified by applying the function $action.

The result of the $action function must be compatible with its input. Specifically,

If the input is an attribute node, the result must be a sequence of zero or more attribute nodes. These replace any existing attribute nodes having the same name; but if two or more replacement attribute nodes on the same element have the same name, then an error is raised.
If the input is an element, text, comment, or processing instruction node, then the result must be a sequence of element, text, comment, or processing instruction nodes.
If the input is a JNode representing an entry in a map (that is, the JNode J has JNode-parent(J) instance of JNode(map(*))), the result must be a map. The entries in this map replace any existing map entries with the same key, but if two or more replacement map entries have the same key, then an error is raised.
If the input is a JNode representing an entry in an array (that is, the JNode J has JNode-parent(J) instance of JNode(array(*))), the result must be an array. The members of this array replace the selected array member.

Note:

The GNode supplied to the $action function will always be in one of the above categories.

The effect of the function is equivalent to the following XSLT 4.0 implementation, except for error handling:

<xsl:function name="fn:update" as="GNode()?">
  <xsl:param name="root" as="GNode()?"/> 
  <xsl:param name="select" as="GNode()*"/>
  <xsl:param name="action" as="fn(GNode()) as GNode()*"/>
  
  <!-- Function to process an individual GNode. If it is a selected GNode,
       call the supplied $action function. Otherwise, call fn:update
       to process it recursively using the same options -->
  <xsl:variable 
      name="process" as="fn(GNode()) as GNode()*"
      select="fn{ if (. intersect $select) 
                  then $action(.) 
                  else update(., $select, $action) }"/>
  
  <xsl:choose>
  
    <!-- Processing for XNodes -->
    <xsl:when test="$root instance of node()">
      <xsl:copy select="$root">
        <xsl:sequence select="$root ! (@*, node()) ! process(.)"/>
      </xsl:copy>
    </xsl:when>
    
    <!-- Processing for JNodes that wrap maps -->
    <xsl:when test="$root instance of JNode(map(*))
       select="map:merge( $root ? child::* ! $process(.) ) => JNode()"/>
              
    <!-- Processing for JNodes that wrap arrays -->          
    <xsl:when test="$root instance of JNode(array(*))
       select="array:join( $root ? child::* ! $process(.) ) => JNode()"/>
    
    <!-- Processing anything else -->
    <xsl:otherwise select="()"/>
    
  </xsl:choose>
</xsl:function>

Notes

The $select argument identifies those GNodes within the tree rooted at $root that are to be replaced.

A GNode selected by the $select argument is effectively ignored if:

it does not not have $root as an ancestor; or
it has an ancestor GNode that is itself selected in $select

A dynamic error occurs if the replacement for a selected GNode is unsuitable. For example, an error will arise if an attribute is replaced by an element, or an element by an attribute (unless it happens to be the last attribute or the first element).

When updating an XTree, each node in the new tree has new node identity. Although optimizations are always possible, the complexities of handling node identity, type annotations, in-scope namespaces, and parent pointers mean that in practice, it is likely that the function will make a physical copy of the entire tree.

By contrast, when updating a JTree, none of these complexities arise, and by using persistent (also known as immutable or functional) data structures, an implementation may well be able to reuse those parts of the JTree that are not affected by changes, meaning that the cost in time and space will be proportional to the extent of the change, not to the size of the input tree.

Examples

Expression:	let $tree := parse-xml('<a><b v="1"/><b v="2"/></a>') return update($tree, $tree//@v, fn{.+1})
Result:	parse-xml('<a><b v="2"/><b v="3"/></a> (Modifies the value of selected attributes.)
Expression:	let $tree := parse-xml('<a><b v="1"/><b v="2"/></a>') return update($tree, $tree//@v, fn{()})
Result:	parse-xml('<a><b/><b/></a> (Deletes selected attributes.)
Expression:	let $tree := <a><b v="1"/><b v="2"/></a> return update($tree, $tree//@v[. gt 1], fn{., attribute #new {'true'})
Result:	<a><b v="1"/><b v="2" new="true"/></a> (Inserts new attributes under specified conditions.)
Expression:	let $tree := <a><b/><b>12</b></a> return update($tree, $tree//b[empty(child::node)], fn{., text {'default'})
Result:	<a><b>default</b><b>12</b></a> (Expands empty elements with default values.)
Expression:	let $tree := {1: ["a", "b", "c"], 2: ["x", "y", "z"]} return update($tree, $tree/?*?2, fn{[upper-case(.)]})
Result:	{1: ["a", "B", "c"], 2: ["x", "Y", "z"]} (Updates selected members of selected arrays.)
Expression:	let $tree := {1: ["a", "b", "c"], 2: ["x", "y", "z"]} return update($tree, $tree/?*?2, fn{[]})
Result:	{1: ["a", "c"], 2: ["x", "z"]} (Deletes selected members of selected arrays.)

2021 Processing types

Changes in 4.0 ⬇ ⬆

New functions are provided to obtain information about built-in types and types defined in an imported schema. [Issue 148 PR 1523 5 November 2024]

The functions in this section deliver information about schema types (including simple types and complex types). These may represent built-in types (such as xs:dateTime), user-defined types found in the static context (typically because they appear in an imported schema), or types used as type annotations on schema-validated nodes.

For more information on schema types, see 1.8.2 Schema Type Hierarchy. The properties of a schema type are described in terms of the properties of a Simple Type Definition or Complex Type Definition component as described in Section 3.16.1 The Simple Type Definition Schema Component ^XS11-1 and Section 3.4.1 The Complex Type Definition Schema Component ^XS11-1 respectively. Not all properties are exposed.

The structured representation of a schema type is described in 20.1.1 Record fn:schema-type-record21.1.1 Record fn:schema-type-record20.1.121.1.1 Record fn:schema-type-record.

Note:

Simple properties of a schema type that can be expressed as strings or booleans are represented in this record structure directly as atomic field values, while complex properties whose values are themselves types (for example, base-type and primitive-type) are represented as functions. This is done partly to make it easier for implementations to compute complex properties on demand rather than in advance, and partly to ensure that the overall structure is always acyclic. For example, the primitive type of xs:decimal is itself xs:decimal, and if this were represented as a field value without a guarding function, serialization of the map using the JSON output method would not terminate.

20.121.1 Functions returning type information

Function	Meaning
`fn:schema-type`	Returns a record containing information about a named schema type in the static context.
`fn:type-of`	Returns information about the type of a value, as a string.
`fn:atomic-type-annotation`	Returns a record containing information about the type annotation of an atomic value.
`fn:node-type-annotation`	Returns a record containing information about the type annotation of an element or attribute node.

20.1.121.1.1 Record fn:schema-type-record

This record type represents the properties of a simple or complex type in a schema.

Name	Meaning
`name`	The name of the type. Empty in the case of an anonymous type. Corresponds to {name}^XS11-1 and {target namespace}^XS11-1 in the XSD component model for simple and complex type components. Type: `xs:QName?`
`is-simple`	True for a simple type, false for a complex type. Type: `xs:boolean`
`base-type`	Function item returning the base type (the type from which this type is derived by restriction or extension). The function is always present, and returns an empty sequence in the case of the type `xs:anyType`. Corresponds to the {base type definition}^XS11-1 property in the XSD component model. Type: `fn() as schema-type-record?`
`primitive-type?`	For an atomic type, a function item returning the primitive type from which this type is ultimately derived. Corresponds to the {primitive type definition}^XS11-1 in the XSD component model for simple types. Absent if the type is non atomic, or if it is the simple type `xs:anyAtomicType`. If this is a primitive type, the function item is idempotent. Type: `fn() as schema-type-record`
`variety?`	For a simple type, one of `"atomic"`, `"list"`, or `"union"`, corresponding to the {variety}^XS11-1 of the simple type in the XSD component model. For a complex type, one of `"empty"`, `"simple"`, `"element-only"`, or `"mixed"`, corresponding to the {content type}^XS11-1.{variety}^XS11-1 of the complex type in the XSD component model. The value is absent in cases where the {variety}^XS11-1 in the XSD component model is absent, for example for the type `xs:anySimpleType`. Type: `enum("atomic", "list", "union", "empty", "simple", "element-only", "mixed")`
`members?`	For a simple type with variety `"union"`, a function that returns a sequence of records representing the member types of the union, in order, corresponding to the {member type definitions}^XS11-1 property in the XSD component model. For a simple type with variety `"list"`, a function that returns a record representing the item type of the list type, corresponding to the {item type definition}^XS11-1 property in the XSD component model. In all other cases, absent. Type: `fn() as schema-type-record*`
`simple-content-type?`	For a complex type with variety `"simple"` (that is, a complex type with simple content), a function that returns a record representing the relevant simple type, corresponding to the {content type}^XS11-1.{simple type definition}^XS11-1 property in the XSD complex type component. In all other cases, absent. Type: `fn() as schema-type-record`
`matches?`	For a generalized atomic type^XP, a function item that can be called to establish whether the supplied atomic item is an instance of this type. In all other cases, absent. Type: `fn(xs:anyAtomicType) as xs:boolean`
`constructor?`	For a simple type, a function item that can be used to construct instances of this type. In the case of a named type that is present in the dynamic context, the result is the same function as returned by `fn:function-lookup` applied to the type name (with arity one). For details see 21.122.1 Constructor functions for XML Schema built-in atomic types and 21.522.5 Constructor functions for user-defined atomic and union types. Constructor function items are also available for anonymous types, and for types that might not be present in the dynamic context. The field is absent for complex types and for the abstract types `xs:anyAtomicType`, `xs:anySimpleType`, and `xs:NOTATION`. It is also absent for all namespace-sensitive^XP types. Type: `fn(xs:anyAtomicType?) as xs:anyAtomicType?`
`*`	The record type is extensible (it may contain additional fields beyond those listed).

20.1.221.1.2 fn:schema-type

Changes in 4.0 ⬇ ⬆

New in 4.0 [Issue 148 PR 1523 22 October 2024]

Summary

Returns a record containing information about a named schema type in the static context.

Signature

`fn:schema-type`(
`$name`	`as` `xs:QName`
) `as` `schema-type-record?`

Properties

This function is deterministic, context-dependent, and focus-independent.

Rules

If the static context (specifically, the in-scope schema types^XP) includes a schema type whose name matches $name, the function returns a schema-type-record containing information about that schema type. If not, it returns an empty sequence.

Examples

Expression:	`schema-type( #xs:integer ) ? name`
Result:	#xs:integer
Expression:	`schema-type( #xs:long ) ? primitive-type() ? name`
Result:	#xs:decimal
Expression:	`schema-type( #xs:positiveInteger ) ? base-type() ? name`
Result:	#xs:nonNegativeInteger
Expression:	`schema-type( #xs:integer ) ? matches(23)`
Result:	true()
Expression:	`schema-type( #xs:numeric ) ? variety`
Result:	"union"
Expression:	`schema-type( #xs:numeric ) ? members() ? name`
Result:	#xs:double, #xs:float, #xs:decimal

20.1.321.1.3 fn:type-of

Changes in 4.0 ⬇ ⬆

New in 4.0 [Issue 1550 PR 1570 12 November 2024]

Summary

Returns information about the type of a value, as a string.

Signature

`fn:type-of`(
`$value`	`as` `item()*`
) `as` `xs:string`

Properties

This function is deterministic, context-independent, and focus-independent.

Rules

The function returns a string, whose lexical form will always match the grammar of SequenceType^XP, representing a sequence type that matches $value.

If $value is the empty sequence, the function returns the string "empty-sequence()".

Otherwise, the returned string is the concatenation of:

A string representing the distinct item types that are present in $value, formed as follows:
1. For each item in $value, construct a string representing its item type as described below.
2. Eliminate duplicate strings from this list by applying the fn:distinct-values function, forming a sequence of strings $ss.
3. If $ss contains only one string, use that string.
4. Otherwise, return the result of the expression `({ fn:string-join($ss, "|") })`.
An occurrence indicator: absent if $value contains exactly one item, or "+" if it contains more than one item.

The string representing the type of an individual item J is constructed as follows:

If J is aan XNode^nodeDM, the result is one of the following strings, determined by the node kind of the node (see Section 6.7.97.5.9 node-kind Accessor^DM):
"document-node()"
"element()"
"attribute()"
"text()"
"processing-instruction()"
"comment()"
"namespace-node()"
If J is a JNode^DM, the result is in the form JNode(T), where T is the result of applying the type-of function to the ¶value property of J.
If J is a JNode^DM, the result is in the form JNode(T), where T is the result of applying the type-of function to the ¶value property of J.
If J is an atomic item, the result is a string chosen as follows:
1. Let T be the type denoted by the type annotation of J.
2. If T is an anonymous type, set T to the base type of T, and repeat until a type is reached that is not anonymous.
3. If the name of T is in the namespace http://www.w3.org/2001/XMLSchema, return the string "xs:local" where local is the local part of the name of T.
4. Otherwise, return the name of T in the form of a URIQualifiedName^XP (that is, "Q{uri}local", or "Q{}local" if the name is in no namespace).
If J is a function item:
1. If J is an array, return "array(*)".
2. If J is a map, return "map(*)".
3. Otherwise, return "function(*)".

Error Conditions

If the $value argument is omitted and the context value is absent^DM, the function raises type error [err:XPDY0002]^XP.

Notes

In general, an item matches more than one type, and there are cases where there is no single matching type that is more specific than all the others. This is especially true with functions, maps, and arrays. This function therefore selects one of the types that matches the item, which is not necessarily the most specific type.

This function should not be used as a substitute for an instance of test. The precise type annotation of the result of an expression is not always predictable, because processors are free to deliver a more specific type than is mandated by the specification. For example, if $n is of type xs:positiveInteger, then the result of abs($n) is guaranteed to be an instance of xs:integer, but an implementation might reasonably return the supplied value unchanged: that is, a value whose actual type annotation is xs:positiveInteger. Similarly the type annotation of the value returned by position() might be xs:long rather than xs:integer.

Implementations should, however, refrain from exposing types that are purely internal. For example, an implementation might have an optimized internal representation for strings consisting entirely of ASCII characters, or for single-character strings; if this is the case then the type annotation returned by this function should be a user-visible supertype such as xs:string.

Examples

Variables
let $e := <doc> <p id="alpha" xml:id="beta">One</p> <p id="gamma" xmlns="http://example.com/ns">Two</p> <ex:p id="delta" xmlns:ex="http://example.com/ns">Three</ex:p> <?pi 3.14159?> </doc>

Expression	Result
`type-of($e//*[@id = 'alpha'])`	"element()"
`type-of($e//*)`	"element()+"
`type-of($e//@id[. = 'gamma'])`	"attribute()"
`type-of($e//node()[. = '3.14159'])`	"processing-instruction()"
`type-of($e//no-such-node)`	"empty-sequence()"
`type-of($e/child::node())`	"(element()\|processing-instruction())+"
`type-of(1)`	"xs:integer"
`type-of(1 to 5)`	"xs:integer+"
`type-of((1, 1.2, 2))`	"(xs:integer\|xs:decimal)+"
`type-of([ 1, 2, 3 ])`	"array(*)"
`type-of({ 'a': 1 })`	"map(*)"
`type-of(type-of#1)`	"function(*)"
`type-of(JNode([]))`	"JNode(array(*))"
`type-of(JNode([]))`	"JNode(array(*))"

20.1.421.1.4 fn:atomic-type-annotation

Changes in 4.0 ⬇ ⬆

New in 4.0 [Issue 148 PR 1523 22 October 2024]

Summary

Returns a record containing information about the type annotation of an atomic value.

Signature

`fn:atomic-type-annotation`(
`$value`	`as` `xs:anyAtomicType`
) `as` `schema-type-record`

Properties

This function is deterministic, context-independent, and focus-independent.

Rules

Given an atomic value, the function returns a schema-type-record containing information about the atomic type represented by its type annotation^DM.

Notes

The result will always have ?is-simple = true() and ?variety = "atomic". In a non-schema-aware environment the type will always be a built-in atomic type in the xs namespace: see 1.8.3 Atomic Type Hierarchy. Where a schema is in use, however, the result may be an atomic type defined in the schema, which may be an anonymous type.

Note that under the function coercion rules, it is possible to supply a node as the argument, which will then be atomized. In simple cases the type annotation on the atomized value will be the same as the type annotation on the node. But this is not always true: for example the type annotation on the node might be a complex type with simple content, while the type annotation on its atomized value is the corresponding simple content type. To get the type annotation on the node, use the function fn:node-type-annotation.

Examples

Expression:	atomic-type-annotation(23) ? name
Result:	#xs:integer
Expression:	let $x := 23, $y := 93.7 return atomic-type-annotation($x) ? matches($y)
Result:	false()
Expression:	atomic-type-annotation(xs:numeric('23.2')) ? name
Result:	#xs:double

20.1.521.1.5 fn:node-type-annotation

Changes in 4.0 ⬇ ⬆

New in 4.0 [Issue 148 PR 1523 22 October 2024]

Summary

Returns a record containing information about the type annotation of an element or attribute node.

Signature

`fn:node-type-annotation`(
`$node`	`as` `(element() \| attribute())`
) `as` `schema-type-record`

Properties

This function is deterministic, context-independent, and focus-independent.

Rules

Given an element or attribute node, the function returns a schema-type-record containing information about the schema type represented by its type annotation^DM.

Notes

For an element that has not been schema-validated, the type annotation is always xs:untyped.

For an attribute that has not been schema-validated, the type annotation is always xs:untypedAtomic.

The type annotation of an attribute node is always a simple type; the type annotation of an element node may be simple or complex.

Examples

Expression:	let $e := parse-xml("<e/>")/* return node-type-annotation($e) ? name
Result:	#xs:untyped
Expression:	let $a := parse-xml("<e a='3'/>")//@a return node-type-annotation($a) ? name
Result:	#xs:untypedAtomic
Expression:	let $x := json-to-xml('[23, 24]', { 'validate': true() }) return node-type-annotation($x/*) ? name
Result:	#fn:arrayType
Expression:	let $x := json-to-xml('[23, 24]', { 'validate': true() }) let $n23 := $x//fn:number[. = 23] let $type := node-type-annotation($n23) return ($type ? name, $type ? base-type() ? name, $type ? base-type() ? base-type() ? name)
Result:	#fn:numberType, #fn:finiteNumberType, #xs:double

2122 Constructor functions

Changes in 4.0 ⬇ ⬆

Constructor functions now have a zero-arity form; the first argument defaults to the context item. [Issue 658 PR 662 29 August 2023]

Constructor functions are used to convert a supplied value to a given type, and the name of the function is the same as the name of the target type. This section describes constructor functions corresponding to the following types:

Simple types (atomic types, union types, and list types as defined in [XML Schema Part 2: Datatypes Second Edition]), which are present in the static context either because they appear in the in-scope schema types^XP or because they appear as named item types^XP.
These constructor functions always take a single argument.
Record types defined as named item types^XP.
These take one argument for each named field of the record type. Constructor functions for record types are defined in 21.622.6 Constructor functions for named record types.

Constructor functions are defined for all user-defined named simple types, and for most built-in atomic, list, and union types. The only named simple types that have no constructor function are those that have no instances other than instances of their derived types: specifically, xs:anySimpleType, xs:anyAtomicType, and xs:NOTATION.

21.122.1 Constructor functions for XML Schema built-in atomic types

Every built-in atomic type that is defined in [XML Schema Part 2: Datatypes Second Edition], except xs:anyAtomicType and xs:NOTATION, has an associated constructor function. The type xs:untypedAtomic, defined in Section 2.7 Schema Information ^DM31 and the two derived types xs:yearMonthDuration and xs:dayTimeDuration defined in Section 2.7 Schema Information ^DM31 also have associated constructor functions. Implementations may additionally provide a constructor functions for the new datatype xs:dateTimeStamp introduced in [XSD 1.1 Part 2].

A constructor function is not defined for xs:anyAtomicType as there are no atomic items with type annotation xs:anyAtomicType at runtime, although this can be a statically inferred type. A constructor function is not defined for xs:NOTATION since it is defined as an abstract type in [XML Schema Part 2: Datatypes Second Edition]. If the static context (See Section 2.1.1 Static Context ^XP31) contains a type derived from xs:NOTATION then a constructor function is defined for it. See 21.522.5 Constructor functions for user-defined atomic and union types.

The form of the constructor function for an atomic type eg:TYPE is:

`eg:TYPE`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `eg:TYPE?`

If $arg is the empty sequence, the empty sequence is returned. For example, the signature of the constructor function corresponding to the xs:unsignedInt type defined in [XML Schema Part 2: Datatypes Second Edition] is:

`xs:unsignedInt`(
`$arg`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:unsignedInt?`

Calling the constructor function xs:unsignedInt(12) returns the xs:unsignedInt value 12. Another call of that constructor function that returns the same xs:unsignedInt value is xs:unsignedInt("12"). The same result would also be returned if the constructor function were to be called with a node that had a typed value equal to the xs:unsignedInt 12. The standard features described in Section 2.4.2 Atomization ^XP31 would atomize the node to extract its typed value and then call the constructor with that value. If the value passed to a constructor is not in the lexical space of the datatype to be constructed, and cannot be converted to a value in the value space of the datatype under the rules in this specification, then an dynamic error is raised [err:FORG0001].

The semantics of the constructor function xs:TYPE(arg) are identical to the semantics of arg cast as xs:TYPE? . See 22 Casting23 Casting2223 Casting.

If the argument to a constructor function is a literal, the result of the function may be evaluated statically; if an error is found during such evaluation, it may be reported as a static error.

Special rules apply to constructor functions for xs:QName and types derived from xs:QName and xs:NOTATION. See 21.222.2 Constructor functions for xs:QName and xs:NOTATION.

The argument is optional, and defaults to the context value (which will be atomized if necessary).

The following constructor functions for the built-in atomic types are supported:

xs:string(
$value as xs:anyAtomicType? := .
) as xs:string?
xs:boolean(
$value as xs:anyAtomicType? := .
) as xs:boolean?
xs:decimal(
$value as xs:anyAtomicType? := .
) as xs:decimal?
xs:float(
$value as xs:anyAtomicType? := .
) as xs:float?
Implementations should return negative zero for xs:float("-0.0E0"). But because [XML Schema Part 2: Datatypes Second Edition] does not distinguish between the values positive zero and negative zero, implementations may return positive zero in this case.
xs:double(
$value as xs:anyAtomicType? := .
) as xs:double?
Implementations should return negative zero for xs:double("-0.0E0"). But because [XML Schema Part 2: Datatypes Second Edition] does not distinguish between the values positive zero and negative zero, implementations may return positive zero in this case.
xs:duration(
$value as xs:anyAtomicType? := .
) as xs:duration?
xs:dateTime(
$value as xs:anyAtomicType? := .
) as xs:dateTime?
xs:time(
$value as xs:anyAtomicType? := .
) as xs:time?
xs:date(
$value as xs:anyAtomicType? := .
) as xs:date?
xs:gYearMonth(
$value as xs:anyAtomicType? := .
) as xs:gYearMonth?
xs:gYear(
$value as xs:anyAtomicType? := .
) as xs:gYear?
xs:gMonthDay(
$value as xs:anyAtomicType? := .
) as xs:gMonthDay?
xs:gDay(
$value as xs:anyAtomicType? := .
) as xs:gDay?
xs:gMonth(
$value as xs:anyAtomicType? := .
) as xs:gMonth?
xs:hexBinary(
$value as xs:anyAtomicType? := .
) as xs:hexBinary?
xs:base64Binary(
$value as xs:anyAtomicType? := .
) as xs:base64Binary?
xs:anyURI(
$value as xs:anyAtomicType? := .
) as xs:anyURI?
xs:QName(
$value as xs:anyAtomicType? := .
) as xs:QName?
See 21.222.2 Constructor functions for xs:QName and xs:NOTATION for special rules.

xs:normalizedString(
$value as xs:anyAtomicType? := .
) as xs:normalizedString?
xs:token(
$value as xs:anyAtomicType? := .
) as xs:token?
xs:language(
$value as xs:anyAtomicType? := .
) as xs:language?
xs:NMTOKEN(
$value as xs:anyAtomicType? := .
) as xs:NMTOKEN?
xs:Name(
$value as xs:anyAtomicType? := .
) as xs:Name?
xs:NCName(
$value as xs:anyAtomicType? := .
) as xs:NCName?
xs:ID(
$value as xs:anyAtomicType? := .
) as xs:ID?
xs:IDREF(
$value as xs:anyAtomicType? := .
) as xs:IDREF?
xs:ENTITY(
$value as xs:anyAtomicType? := .
) as xs:ENTITY?
See 22.1.10 Casting to xs:ENTITY23.1.10 Casting to xs:ENTITY22.1.1023.1.10 Casting to xs:ENTITY for rules related to constructing values of type xs:ENTITY and types derived from it.
xs:integer(
$value as xs:anyAtomicType? := .
) as xs:integer?
xs:nonPositiveInteger(
$value as xs:anyAtomicType? := .
) as xs:nonPositiveInteger?
xs:negativeInteger(
$value as xs:anyAtomicType? := .
) as xs:negativeInteger?
xs:long(
$value as xs:anyAtomicType? := .
) as xs:long?
xs:int(
$value as xs:anyAtomicType? := .
) as xs:int?
xs:short(
$value as xs:anyAtomicType? := .
) as xs:short?
xs:byte(
$value as xs:anyAtomicType? := .
) as xs:byte?
xs:nonNegativeInteger(
$value as xs:anyAtomicType? := .
) as xs:nonNegativeInteger?
xs:unsignedLong(
$value as xs:anyAtomicType? := .
) as xs:unsignedLong?
xs:unsignedInt(
$value as xs:anyAtomicType? := .
) as xs:unsignedInt?
xs:unsignedShort(
$value as xs:anyAtomicType? := .
) as xs:unsignedShort?
xs:unsignedByte(
$value as xs:anyAtomicType? := .
) as xs:unsignedByte?
xs:positiveInteger(
$value as xs:anyAtomicType? := .
) as xs:positiveInteger?

xs:yearMonthDuration(
$value as xs:anyAtomicType? := .
) as xs:yearMonthDuration?
xs:dayTimeDuration(
$value as xs:anyAtomicType? := .
) as xs:dayTimeDuration?
xs:untypedAtomic(
$value as xs:anyAtomicType? := .
) as xs:untypedAtomic??

xs:dateTimeStamp(
$value as xs:anyAtomicType? := .
) as xs:dateTimeStamp?
Available only if the implementation supports XSD 1.1.

21.222.2 Constructor functions for xs:QName and xs:NOTATION

Special rules apply to constructor functions for the types xs:QName and xs:NOTATION, for two reasons:

Values cannot belong directly to the type xs:NOTATION, only to its subtypes.
The lexical representation of these types uses namespace prefixes, whose meaning is context-dependent.

These constraints result in the following rules:

There is no constructor function for xs:NOTATION. Constructors are defined, however, for xs:QName, for types derived or constructed from xs:QName, and for types derived or constructed from xs:NOTATION.
When converting from an xs:string, the prefix within the lexical xs:QName supplied as the argument is resolved to a namespace URI using the statically known namespaces from the static context. If the lexical xs:QName has no prefix, the namespace URI of the resulting expanded-QName is the default namespace for elements and types, taken from the static context. Components of the static context are defined in Section 2.1.1 Static Context ^XP31. A dynamic error is raised [err:FONS0004] if the prefix is not bound in the static context. As described in Section 2.1 Terminology ^DM31, the supplied prefix is retained as part of the expanded-QName value.

When a constructor function for a namespace-sensitive type is used as a literal function item or in a partial function application (for example, xs:QName#1 or xs:QName(?)) the namespace bindings that are relevant are those from the static context of the literal function item or partial function application. When a constructor function for a namespace-sensitive type is obtained by means of the fn:function-lookup function, the relevant namespace bindings are those from the static context of the call on fn:function-lookup.

Note:

When the supplied argument to the xs:QName constructor function is a node, the node is atomized in the usual way, and if the result is xs:untypedAtomic it is then converted as if a string had been supplied. The effect might not be what is desired. For example, given the attribute xsi:type="my:type", the expression xs:QName(@xsi:type) might fail on the grounds that the prefix my is undeclared. This is because the namespace bindings are taken from the static context (that is, from the query or stylesheet), and not from the source document containing the @xsi:type attribute. The solution to this problem is to use the function call resolve-QName(@xsi:type, .) instead.

21.322.3 Constructor functions for XML Schema built-in list types

Each of the three built-in list types defined in [XML Schema Part 2: Datatypes Second Edition], namely xs:NMTOKENS, xs:ENTITIES, and xs:IDREFS, has an associated constructor function.

The function signatures are as follows:

xs:NMTOKENS(
$value as xs:string? := .
) as xs:NMTOKEN*
xs:ENTITIES(
$value as xs:string? := .
) as xs:ENTITY*
xs:IDREFS(
$value as xs:string? := .
) as xs:IDREF*

The semantics are equivalent to casting to the corresponding types from xs:string.

All three of these types have the facet minLength = 1 meaning that there must always be at least one item in the list. The return type, however, allows for the fact that when the argument to the function is an empty sequence, the result is an empty sequence.

Note:

In the case of atomic types, it is possible to use an expression such as xs:date(@date-of-birth) to convert an attribute value to an instance of xs:date, knowing that this will work both in the case where the attribute is already annotated as xs:date, and also in the case where it is xs:untypedAtomic. This approach does not work with list types, because it is not permitted to use a value of type xs:NMTOKEN* as input to the constructor function xs:NMTOKENS. Instead, it is necessary to use conditional logic that performs the conversion only in the case where the input is untyped: if (@x instance of attribute(*, xs:untypedAtomic)) then xs:NMTOKENS(@x) else data(@x)

21.422.4 Constructor functions for XML Schema built-in union types

There is a constructor function for the union type xs:numeric defined in [XQuery and XPath Data Model (XDM) 3.1]. The function signature is:

xs:numeric(
$value as xs:anyAtomicType? := .
) as xs:numeric?

The semantics are determined by the rules in 22.3.7 Casting to union types23.3.7 Casting to union types22.3.723.3.7 Casting to union types. These rules have the effect that:

If the argument is an instance of xs:double, xs:float, or xs:decimal, then the result is an instance of the same primitive type, with the same value;
If the argument is an instance of xs:boolean, the result is the xs:double value 0.0e0 or 1.0e0;
If the argument is an instance of xs:string or xs:untypedAtomic, then:
1. If the value is in the lexical space of xs:double, the result will be the corresponding xs:double value;
2. Otherwise, a dynamic error [err:FORG0001] occurs;
Note:
The result will never be an instance of xs:float, xs:decimal, or xs:integer. This is because xs:double appears first in the list of member types of xs:numeric, and its lexical space subsumes the lexical space of the other numeric types. Thus, unlike XPath numeric literals, the result does not depend on the lexical form of the supplied value. The reason for this design choice is to retain compatibility with the function conversion rules: functions such as fn:abs and fn:round are declared to expect an instance of xs:numeric as their first or only argument, and compatibility with the function conversion rules defined in earlier versions of these specifications demands that when an untyped atomic item (or untyped node) is supplied as the argument, it is converted to an xs:double value even if its lexical form is that (say) of an integer.
In all other cases, a dynamic error [err:FORG0001] occurs.

In the case of an implementation that supports XSD 1.1, there is a constructor function associated with the built-in union type xs:error.

The function signature is as follows:

xs:error(
$value as xs:anyAtomicType? := .
) as xs:error?

The semantics are equivalent to casting to the corresponding union type (see 22.3.7 Casting to union types23.3.7 Casting to union types22.3.723.3.7 Casting to union types).

Note:

Because xs:error has no member types, and therefore has an empty value space, casting will always fail with a dynamic error except in the case where the supplied argument is an empty sequence, in which case the result is also an empty sequence.

21.522.5 Constructor functions for user-defined atomic and union types

For every named user-defined simple type in the static context (See Section 2.1.1 Static Context ^XP31), there is a constructor function whose name is the same as the name of the type.

For named atomic types, the rules are the same as the rules for constructing built-in derived atomic types defined in 21.122.1 Constructor functions for XML Schema built-in atomic types. For a named atomic type T, the signature of the function takes the form T($value as xs:anyAtomicType? := .) as T?, and the semantics are the same as casting to derived types: see 22.3.1 Casting to derived types23.3.1 Casting to derived types22.3.123.3.1 Casting to derived types..

For named union types, the rules follow the same principles as the rules for constructing built-in union types defined in 21.422.4 Constructor functions for XML Schema built-in union types. For a named union type U, the signature of the function takes the form U($value as xs:anyAtomicType? := .) as U?, and the semantics are the same as casting to union types: see 22.3.7 Casting to union types23.3.7 Casting to union types22.3.723.3.7 Casting to union types.

For named list types, the rules follow the same principles as the rules for constructing built-in list types defined in 21.322.3 Constructor functions for XML Schema built-in list types. For a named list type L, where the item type of L is I, the signature of the function takes the form L($value as xs:string? := .) as I*, and the semantics are the same as casting to list types: see 22.3.8 Casting to list types23.3.8 Casting to list types22.3.823.3.8 Casting to list types.

Constructor functions are available both for named types defined in an imported schema (that is, named simple types in the in-scope schema types^XP), and for types defined by means of named item types^XP. Specifically, named enumeration types follow the same rules as schema types derived by restricting xs:string, and named local union types follow the same rules as union types defined in a schema.

Special rules apply to constructor functions for namespace-sensitive types, that is, atomic types derived from xs:QName and xs:NOTATION, list types that have a namespace-sensitive item type, and union types that have a namespace-sensitive member type. See 21.222.2 Constructor functions for xs:QName and xs:NOTATION.

Example: Using a Constructor Function for a User-Defined Atomic Type

Consider a situation where the static context contains an atomic type called hatSize defined in a schema whose target namespace is bound to the prefix eg. In such a case the following constructor function is available to users:

`eg:hatSize`(
`$value`	`as` `xs:anyAtomicType`
) `as` `my:hatSize?`

The resulting function may be used in an expression such as eg:hatSize("10½").

Note:

To construct an instance of a user-defined type that is not in a namespace, it is possible to use an EQName (for example Q{}hatsize(17)). Alternatives are to use a cast expression (17 cast as hatsize) or (if the host language allows it) to undeclare the default function namespace.

21.622.6 Constructor functions for named record types

Changes in 4.0 ⬇ ⬆

Constructor functions for named record types have been introduced. [Issue 617 PR 953 20 February 2024]

Both XQuery 4.0 and XSLT 4.0 provide syntax to declare named record types; such a declaration implicitly adds a constructor function for values of that type to the (See Section 2.1.1 Static Context ^XP31).

For example, if there is a named item type with the XQuery definition:

declare record my:location (
  latitude  as xs:double,
  longitude as xs:double
)

then there will be a function definition equivalent to:

declare function my:location (
  $latitude  as xs:double,
  $longitude as xs:double
) as my:location {
  { 'latitude': $latitude, 'longitude': $longitude }
}

Equivalently using XSLT syntax, if there is a named item type with the XSLT definition:

<xsl:record name="my:location"
  as="record(latitude as xs:double, longitude as xs:double)"/>

then there will be a function definition equivalent to:

<xsl:function name="my:location" as="my:location">
  <xsl:param name="latitude" as="xs:double"/>
  <xsl:param name="longitude" as="xs:double"/>
  <xsl:map>
    <xsl:map-entry key="'latitude'" select="$latitude"/>
    <xsl:map-entry key="'longitude'" select="$longitude"/>
  </xsl:map>
</xsl:function>

The rules defining the relationship of the function definition to the record type are given for XQuery 4.0 in Section 5.20.2 Constructor Functions for Named Record Types^XQ.

Editorial note
TODO: Add cross-reference to XSLT here. Anticipates resolution of issue #1485.

2223 Casting

Constructor functions and cast expressions accept an expression and return a value of a given type. They both convert a source value SV, of a source type, ST to a target value TV, of the given target type TT.

Constructor functions and cast expressions have identical semantics but different syntax. The name of the constructor function is the same as the name of the built-in [XML Schema Part 2: Datatypes Second Edition] datatype or the datatype defined in Section 2.7 Schema Information ^DM31 of [XQuery and XPath Data Model (XDM) 3.1] (see 21.122.1 Constructor functions for XML Schema built-in atomic types) or the user-derived datatype (see 21.522.5 Constructor functions for user-defined atomic and union types) that is the target for the conversion, and the semantics are exactly the same as for a cast expression; for example, xs:date("2003-01-01") means exactly the same as "2003-01-01" cast as xs:date?.

The cast expression takes a type name to indicate the target type of the conversion. See Section 3.14.2 Cast ^XP31. If the type name allows the empty sequence and the expression to be cast is the empty sequence, the empty sequence is returned. If the type name does not allow the empty sequence and the expression to be cast is the empty sequence, a type error is raised [err:XPTY0004]^XP.

Where the argument to a cast is a literal, the result of the function may be evaluated statically; if an error is encountered during such evaluation, it may be reported as a static error.

The general rules for casting from primitive types to primitive types are defined in 22.123.1 Casting from primitive types to primitive types, and subsections describe the rules for specific target types. The general rules for casting from xs:string (and xs:untypedAtomic) follow in 22.223.2 Casting from xs:string and xs:untypedAtomic. Casting to non-primitive types, including atomic types derived by restriction, union types, and list types, is described in 22.323.3 Casting involving non-primitive types. Casting from derived types is defined in 22.3.423.3.4 Casting from derived types to parent types, 22.3.523.3.5 Casting within a branch of the type hierarchy and 22.3.623.3.6 Casting across the type hierarchy.

Casting is not supported to or from xs:anySimpleType. Casting to xs:anySimpleType is not permitted and raises a static error: [err:XPST0080]^XP.

Similarly, casting is not supported to or from xs:anyAtomicType and will raise a static error: [err:XPST0080]^XP. There are no atomic items with the type annotation xs:anyAtomicType, although this can be a statically inferred type.

22.123.1 Casting from primitive types to primitive types

Changes in 4.0 ⬇ ⬆

This section now uses the term primitive type strictly to refer to the 20 atomic types that are not derived by restriction from another atomic type: that is, the 19 primitive atomic types defined in XSD, plus xs:untypedAtomic. The three types xs:integer, xs:dayTimeDuration, and xs:yearMonthDuration, which have custom casting rules but are not strictly-speaking primitive, are now handled in other subsections. [Issue 1401 PR 1409]

This section defines casting between primitive types (specifically, the 19 primitive types defined in [XML Schema Part 2: Datatypes Second Edition] plus xs:untypedAtomic. The type conversions that are supported between primitive atomic types are indicated in the table below; casts between other (non-primitive) types are defined in terms of these primitives.

Where the target type TT is a primitive type, the result TV will always be an instance of TT. The result may also be an instance of a type derived from TT: for example casting an xs:NCNameSV to xs:stringmay return SV unchanged, with its original type annotation.

In this table, there is a row for each primitive type acting as the source of the conversion and there is a column for each primitive type acting as the target of the conversion. The intersections of rows and columns contain one of three characters:

Y indicates that a conversion from values of the type to which the row applies to the type to which the column applies is supported;
N indicates that there are no supported conversions from values of the type to which the row applies to the type to which the column applies;
M indicates that a conversion from values of the type to which the row applies to the type to which the column applies may succeed for some values in the value space and fail for others.

There is no row or column for xs:untypedAtomic because the casting rules are exactly the same as for xs:string. When casting from xs:string or xs:untypedAtomic the semantics in 22.223.2 Casting from xs:string and xs:untypedAtomic apply, regardless of target type.

[XML Schema Part 2: Datatypes Second Edition] defines xs:NOTATION as an abstract type. Thus, casting to xs:NOTATION from any other type including xs:NOTATION is not permitted and raises a static error [err:XPST0080]^XP. However, casting from one subtype of xs:NOTATION to another subtype of xs:NOTATION is permitted.

Casting is not supported to or from xs:anySimpleType. Thus, there is no row or column for this type in the table below. For any node that has not been validated or has been validated as xs:anySimpleType, the typed value of the node is an atomic item of type xs:untypedAtomic. There are no atomic items with the type annotation xs:anySimpleType at runtime. Casting to xs:anySimpleType is not permitted and raises a static error: [err:XPST0080]^XP.

If casting is attempted from an ST to a TT for which casting is not supported, as defined in the table below, a type error is raised [err:XPTY0004]^XP.

In the following table, the columns and rows are identified by short codes that identify simple types as follows:

aURI = xs:anyURI
b64 = xs:base64Binary
bool = xs:boolean
dat = xs:date
gDay = xs:gDay
dbl = xs:double
dec = xs:decimal
dT = xs:dateTime
dur = xs:duration
flt = xs:float
hxB = xs:hexBinary
gMD = xs:gMonthDay
gMon = xs:gMonth
NOT = xs:NOTATION
QN = xs:QName
str = xs:string
tim = xs:time
gYM = xs:gYearMonth
gYr = xs:gYear

In the following table, the notation “S\T” indicates that the source (“S”) of the conversion is indicated in the column below the notation and that the target (“T”) is indicated in the row to the right of the notation.

S\T	str	flt	dbl	dec	dur	dT	tim	dat	gYM	gYr	gMD	gDay	gMon	bool	b64	hxB	aURI	QN	NOT
str	Y	M	M	M	M	M	M	M	M	M	M	M	M	M	M	M	M	M	M
flt	Y	Y	Y	M	N	N	N	N	N	N	N	N	N	Y	N	N	N	N	N
dbl	Y	Y	Y	M	N	N	N	N	N	N	N	N	N	Y	N	N	N	N	N
dec	Y	Y	Y	Y	N	N	N	N	N	N	N	N	N	Y	N	N	N	N	N
dur	Y	N	N	N	Y	N	N	N	N	N	N	N	N	N	N	N	N	N	N
dT	Y	N	N	N	N	Y	Y	Y	Y	Y	Y	Y	Y	N	N	N	N	N	N
tim	Y	N	N	N	N	N	Y	N	N	N	N	N	N	N	N	N	N	N	N
dat	Y	N	N	N	N	Y	N	Y	Y	Y	Y	Y	Y	N	N	N	N	N	N
gYM	Y	N	N	N	N	N	N	N	Y	N	N	N	N	N	N	N	N	N	N
gYr	Y	N	N	N	N	N	N	N	N	Y	N	N	N	N	N	N	N	N	N
gMD	Y	N	N	N	N	N	N	N	N	N	Y	N	N	N	N	N	N	N	N
gDay	Y	N	N	N	N	N	N	N	N	N	N	Y	N	N	N	N	N	N	N
gMon	Y	N	N	N	N	N	N	N	N	N	N	N	Y	N	N	N	N	N	N
bool	Y	Y	Y	Y	N	N	N	N	N	N	N	N	N	Y	N	N	N	N	N
b64	Y	N	N	N	N	N	N	N	N	N	N	N	N	N	Y	Y	N	N	N
hxB	Y	N	N	N	N	N	N	N	N	N	N	N	N	N	Y	Y	N	N	N
aURI	Y	N	N	N	N	N	N	N	N	N	N	N	N	N	N	N	Y	N	N
QN	Y	N	N	N	N	N	N	N	N	N	N	N	N	N	N	N	N	Y	M
NOT	Y	N	N	N	N	N	N	N	N	N	N	N	N	N	N	N	N	Y	M

22.1.123.1.1 Casting to `xs:untypedAtomic`

Any atomic item SV can be cast to xs:untypedAtomic.

The effect is the same as casting to xs:string (see 22.1.2 Casting to xs:string23.1.2 Casting to xs:string22.1.223.1.2 Casting to xs:string) and then returning the xs:untypedAtomic value comprising the same sequence of characters.

22.1.223.1.2 Casting to `xs:string`

Any atomic item SV can be cast to xs:string.

The resulting xs:string value TV depends on the source type ST as follows.

If SV is an instance of xs:string, TV is an instance of xs:string comprising the same sequence of characters as SV.
Note:
The implementation is free to return SV unchanged, including its original type annotation.
If SV is an instance of xs:anyURI, the result TV is an instance of xs:string comprising the same sequence of characters as SV, but with a type annotation of xs:anyURI. No escaping of special characters takes place.
If SV is an instance of xs:QName or xs:NOTATION:
- if the qualified name has a prefix, then TV is the concatenation of the prefix of SV, a single colon (:), and the local name of SV.
- otherwise TV is the local name of SV.
If SV is an instance of xs:numeric, the rules in 22.1.2.123.1.2.1 Casting numeric values to xs:string apply.
If SV is an instance of xs:dateTime, xs:date or xs:time, the rules in 22.1.2.223.1.2.2 Casting date/time values to xs:string apply.
If ST is xs:duration, or any subtype thereof including xs:yearMonthDuration and xs:dayTimeDuration, then the rules in 22.1.2.323.1.2.3 Casting xs:duration values to xs:string apply.
In all other cases, TV is the [XML Schema Part 2: Datatypes Second Edition] canonical representation of SV. For datatypes that do not have a canonical representation defined an implementation-dependent canonical representation may be used.

To cast as xs:untypedAtomic the value is cast as xs:string, as described above, and the type annotation changed to xs:untypedAtomic.

22.1.2.123.1.2.1 Casting numeric values to `xs:string`

The following rules apply when the source type ST is xs:decimal, xs:double, or xs:float, or any subtype of these including xs:integer.

If SV is an instance of xs:decimal, then the canonical representation of SV is returned, as defined in [XSD 1.1 Part 2]. Specifically, see decimalCanonicalMap.
Note:
Unlike previous versions of this specification, no special rule is given for the case where SV is an instance of xs:integer. This is because the general rule for xs:decimal gives the same result. The result in this case will be a sequence of decimal digits in the range U+0030 (DIGIT ZERO, 0) to U+0039 (DIGIT NINE, 9) , optionally preceded by a minus sign, with no leading zeroes. For example: 42, -1, 0, or 1000000000.
Note:
An xs:decimal that is equal to an integer is converted to a string as if it were first cast to an xs:integer. Specifically, there will be no decimal point and no fractional part.
If the value is not equal to an integer, then there will be a decimal point and a fractional part, which will be a sequence of decimal digits with no trailing zeroes. For example: 42.3, -1.5, or 0.00001.
If SV is an instance of xs:float or xs:double, then:
1. TV will be an xs:string in the lexical space of xs:double or xs:float that when converted to an xs:double or xs:float under the rules of 22.223.2 Casting from xs:string and xs:untypedAtomic produces a value that is equal to SV, or is NaN if SV is NaN. In addition, TV must satisfy the constraints in the following sub-bullets.
  1. If SV has an absolute value that is greater than or equal to 0.000001 (one millionth) and less than 1000000 (one million), then the value is converted to an xs:decimal and the resulting xs:decimal is converted to an xs:string according to the rules above, as though using an implementation of xs:decimal that imposes no limits on the totalDigits or fractionDigits facets.
  2. If SV has the value positive or negative zero, TV is "0" or "-0" respectively.
  3. If SV is positive or negative infinity, TV is the string "INF" or "-INF" respectively.
  4. In other cases, the result consists of a mantissa, which has the lexical form of an xs:decimal, followed by the letter "E", followed by an exponent which has the lexical form of an xs:integer. Leading zeroes and "+" signs are prohibited in the exponent. For the mantissa, there must be a decimal point, and there must be exactly one digit before the decimal point, which must be non-zero. The "+" sign is prohibited. There must be at least one digit after the decimal point. Apart from this mandatory digit, trailing zero digits are prohibited.
Note:
The above rules allow more than one representation of the same value. For example, the xs:float value whose exact decimal representation is 1.26743223E15 might be represented by any of the strings "1.26743223E15", "1.26743222E15" or "1.26743224E15" (inter alia). It is implementation-dependent which of these representations is chosen.

Note:

The string representations of numeric values are backwards compatible with XPath 1.0 except for the special values positive and negative infinity, negative zero and values outside the range 1.0e-6 to 1.0e+6.

22.1.2.223.1.2.2 Casting date/time values to `xs:string`

Changes in 4.0 ⬇ ⬆

The rules for conversion of dates and times to strings are now defined entirely in terms of XSD 1.1 canonical mappings, since these deliver exactly the same result as the XPath 3.1 rules. [Issue 1401 PR 1409]

If SV is an instance of xs:dateTime, xs:date, xs:time, xs:gYear, xs:gYearMonth, xs:gMonth, xs:gMonthDay, or xs:gDay, then TV is the canonical representation of SV as defined in [XSD 1.1 Part 2].

Note:

The result TV includes the original timezone if a timezone is present.

All these data types contain different combinations of the components year, month, day, hour, minute, second, and timezone; all the components relevant to the data type (with the exception of the timezone) are output, and the results are concatenated together with suitable punctuation. Specifically:

The year component is represented as a xs:string of four digits, or more if needed. A leading minus sign is present for BCE years.
The month, day, hour and minute components are represented as two digits (with a leading zero if needed). For example, February is represented as 02.
The hours component will never be "24": midnight is always represented as "00:00:00".
The second component is output using as a two-digit integer if it is a whole number (for example, 30, 05, or 00), or if it is fractional, as two digits followed by a decimal point followed by as many digits as are necessary, with no trailing zeroes (for example 30.5 or 00.001).
The timezone component, if present, is cast to xs:string by applying the function eg:convertTZtoString given in 22.1.5 Casting to date and time types23.1.5 Casting to date and time types22.1.523.1.5 Casting to date and time types. Examples are Z, +01:00, -05:00, or +05:30.

22.1.2.323.1.2.3 Casting `xs:duration` values to `xs:string`

Changes in 4.0 ⬇ ⬆

The rules for conversion of durations to strings are now defined entirely in terms of XSD 1.1 canonical mappings, since the XSD 1.1 rules deliver exactly the same result as the XPath 3.1 rules. [Issue 1401 PR 1409]

If SV is an instance of xs:duration (including its subtypes xs:yearMonthDuration and xs:dayTimeDuration), then TV is the canonical representation of SV as defined in [XSD 1.1 Part 2]. Specifically, see durationCanonicalMap.

Note:

The rules have the effect of normalizing the value so that the number of months is always less than 12, the number of hours less than 24, and the number of minutes and seconds less than 60. Zero-valued components are omitted. Fractional seconds follow the same rules as xs:decimal. For example, the duration P15MT30H is represented as P1Y3M1DT6H. A zero-length duration is output as PT0S.

Note:

At the time of writing, the published XSD 1.1 recommendation contains cut-and-paste errors in the definition of the dayTimeDuration canonical mapping. The binding of variable s should be to dt's ·seconds· (not ·months·) component, and the return expression given as sgn & 'P' & ·duYearMonthCanonicalFragmentMap·(|s|) should read sgn & 'P' & ·duDayTimeCanonicalFragmentMap·(|s|)

In reading these XSD formulations, be aware that a & b represents string concatenation, while |s| computes the absolute value of a number.

22.1.323.1.3 Casting to numeric types

This section defines the rules for casting to the primitive numeric types xs:float, xs:double, and xs:decimal. Rules for casting to the derived type xs:integer are given in 22.3.2 Casting to xs:integer23.3.2 Casting to xs:integer22.3.223.3.2 Casting to xs:integer.

22.1.3.123.1.3.1 Casting to xs:float

When a value of any simple type is cast as xs:float, the xs:floatTV is derived from the ST and the SV as follows:

If ST is xs:float, then TV is SV and the conversion is complete.
If ST is xs:double, then TV is obtained as follows:
- if SV is the xs:double value INF, -INF, NaN, positive zero, or negative zero, then TV is the xs:float value INF, -INF, NaN, positive zero, or negative zero respectively.
- otherwise, SV can be expressed in the form m × 2^e where the mantissa m and exponent e are signed xs:integers whose value range is defined in [XML Schema Part 2: Datatypes Second Edition], and the following rules apply:
  - if m (the mantissa of SV) is outside the permitted range for the mantissa of an xs:float value (-2^24-1 to +2^24-1), then it is divided by 2^N where N is the lowest positive xs:integer that brings the result of the division within the permitted range, and the exponent e is increased by N. This is integer division (in effect, the binary value of the mantissa is truncated on the right). Let M be the mantissa and E the exponent after this adjustment.
  - if E exceeds 104 (the maximum exponent value in the value space of xs:float) then TV is the xs:float value INF or -INF depending on the sign of M.
  - if E is less than -149 (the minimum exponent value in the value space of xs:float) then TV is the xs:float value positive or negative zero depending on the sign of M
  - otherwise, TV is the xs:float value M × 2^E.
If ST is xs:decimal, or xs:integer, then TV is xs:float(SV cast as xs:string) and the conversion is complete.
If ST is xs:boolean, SV is converted to 1.0E0 if SV is true and to 0.0E0 if SV is false and the conversion is complete.
If ST is xs:untypedAtomic or xs:string, see 22.223.2 Casting from xs:string and xs:untypedAtomic.
Note:
XSD 1.1 adds the value +INF to the lexical space, as an alternative to INF. XSD 1.1 also adds negative zero to the value space.

Note:

Implementations should return negative zero for xs:float("-0.0E0"). But because [XML Schema Part 2: Datatypes Second Edition] does not distinguish between the values positive zero and negative zero. Implementations may return positive zero in this case.

22.1.3.223.1.3.2 Casting to xs:double

When a value of any simple type is cast as xs:double, the xs:double value TV is derived from the ST and the SV as follows:

If ST is xs:double, then TV is SV and the conversion is complete.
If ST is xs:float or a type derived from xs:float, then TV is obtained as follows:
- if SV is the xs:float value INF, -INF, NaN, positive zero, or negative zero, then TV is the xs:double value INF, -INF, NaN, positive zero, or negative zero respectively.
- otherwise, SV can be expressed in the form m × 2^e where the mantissa m and exponent e are signed xs:integer values whose value range is defined in [XML Schema Part 2: Datatypes Second Edition], and TV is the xs:double value m × 2^e.
If ST is xs:decimal or xs:integer, then TV is xs:double(SV cast as xs:string) and the conversion is complete.
If ST is xs:boolean, SV is converted to 1.0E0 if SV is true and to 0.0E0 if SV is false and the conversion is complete.
If ST is xs:untypedAtomic or xs:string, see 22.223.2 Casting from xs:string and xs:untypedAtomic.
Note:
XSD 1.1 adds the value +INF to the lexical space, as an alternative to INF. XSD 1.1 also adds negative zero to the value space.

Note:

Implementations should return negative zero for xs:double("-0.0E0"). But because [XML Schema Part 2: Datatypes Second Edition] does not distinguish between the values positive zero and negative zero. Implementations may return positive zero in this case.

22.1.3.323.1.3.3 Casting to xs:decimal

This section defines the rules for casting to the primitive type xs:decimal. The rules are also invoked implicitly as part of the process of converting to types derived from xs:decimal. There are special rules, however, if the target type TT is xs:integer, or a type derived from xs:integer: those rules are given in 22.3.2 Casting to xs:integer23.3.2 Casting to xs:integer22.3.223.3.2 Casting to xs:integer.

When the target type TT is xs:decimal, the resulting xs:decimal value TV is derived from ST and SV as follows:

If ST is xs:decimal or a subtype thereof (including xs:integer), then the result TV has the same datum as SV. The type annotation may be xs:decimal or any subtype of xs:decimal for which this is a valid instance, including the original type ST.
If ST is xs:float or xs:double, then TV is the xs:decimal value, within the set of xs:decimal values that the implementation is capable of representing, that is numerically closest to SV. If two values are equally close, then the one that is closest to zero is chosen. If SV is too large to be accommodated as an xs:decimal, (see [XML Schema Part 2: Datatypes Second Edition] for implementation-defined limits on numeric values) a dynamic error is raised [err:FOCA0001]. If SV is one of the special xs:float or xs:double values NaN, INF, or -INF, a dynamic error is raised [err:FOCA0002].
If ST is xs:boolean, the result TV is 1.0 if SV is 1 or true and to 0.0 if SV is 0 or false. The type annotation of the result may be any subtype of xs:decimal whose value space includes the integer values 0 and 1.
If ST is xs:untypedAtomic or xs:string, see 22.223.2 Casting from xs:string and xs:untypedAtomic.

22.1.423.1.4 Casting to duration types

This section defines the rules for casting to the primitive duration type xs:duration. Rules for casting to the derived types xs:yearMonthDuration and xs:dayTimeDuration are given in 22.3.323.3.3 Casting to xs:yearMonthDuration and xs:dayTimeDuration.

If the source value SV is an instance of xs:duration (including instances of subtypes such as xs:yearMonthDuration and xs:dayTimeDuration, then the datum of the result TV is the same as the datum of SV, and the type annotation is xs:duration or any subtype thereof that includes this datum in its value space (in particular, it may be the same as the type annotation of SV).
If ST is xs:untypedAtomic or xs:string, see 22.223.2 Casting from xs:string and xs:untypedAtomic.

22.1.523.1.5 Casting to date and time types

In several situations, casting to date and time types requires the extraction of a component from SV or from the result of fn:current-dateTime and converting it to an xs:string. These conversions must follow certain rules. For example, converting an xs:integer year value requires converting to an xs:string with four or more characters, preceded by a minus sign if the value is negative.

This document defines four functions to perform these conversions. These functions are for illustrative purposes only and make no recommendations as to style or efficiency. References to these functions from the following text are not normative.

The arguments to these functions come from functions defined in this document. Thus, the functions below assume that they are correct and do no range checking on them.

declare function eg:convertYearToString($year as xs:integer) as xs:string {
  let $plusMinus := if ($year >= 0) then "" else "-"
  let $yearString := abs($year) cast as xs:string
  let $length := string-length($yearString)
  return if ($length = 1) then concat($plusMinus, "000", $yearString)
         else if ($length = 2) then concat($plusMinus, "00", $yearString)
         else if ($length = 3) then concat($plusMinus, "0", $yearString)
         else concat($plusMinus, $yearString)
};

declare function eg:convertTo2CharString($value as xs:integer) as xs:string {
  let $string := $value cast as xs:string
  return if (string-length($string) = 1) then concat("0", $string)
         else $string
};

declare function eg:convertSecondsToString($seconds as xs:decimal) as xs:string {
  let $string := $seconds cast as xs:string
  let $intLength := string-length(($seconds cast as xs:integer) cast as xs:string)
  return if ($intLength = 1) then concat("0", $string)
         else $string
};

declare function eg:convertTZtoString($tz as xs:dayTimeDuration?) as xs:string {
  if (empty($tz)) then ""
  else if ($tz eq xs:dayTimeDuration('PT0S')) then "Z"
  else let $tzh := hours-from-duration($tz)
       let $tzm := minutes-from-duration($tz)
       let $plusMinus := if ($tzh >= 0) then "+" else "-"
       let $tzhString := eg:convertTo2CharString(abs($tzh))
       let $tzmString := eg:convertTo2CharString(abs($tzm))
       return concat($plusMinus, $tzhString, ":", $tzmString)
};

Conversion from primitive types to date and time types follows the rules below.

When a value of any primitive type is cast as xs:dateTime, the xs:dateTime value TV is derived from ST and SV as follows:
- If ST is xs:dateTime, then TV is SV.
- If ST is xs:date, then let SYR be eg:convertYearToString( year-from-date(SV)), let SMO be eg:convertTo2CharString( month-from-date(SV)), let SDA be eg:convertTo2CharString( day-from-date(SV)) and let STZ be eg:convertTZtoString( timezone-from-date(SV)); TV is xs:dateTime( concat(SYR, '-', SMO, '-', SDA, 'T00:00:00 ', STZ) ).
- If ST is xs:untypedAtomic or xs:string, see 22.223.2 Casting from xs:string and xs:untypedAtomic.
When a value of any primitive type is cast as xs:time, the xs:time value TV is derived from ST and SV as follows:
- If ST is xs:time, then TV is SV.
- If ST is xs:dateTime, then TV is xs:time( concat( eg:convertTo2CharString( hours-from-dateTime(SV)), ':', eg:convertTo2CharString( minutes-from-dateTime(SV)), ':', eg:convertSecondsToString( seconds-from-dateTime(SV)), eg:convertTZtoString( timezone-from-dateTime(SV)) )).
- If ST is xs:untypedAtomic or xs:string, see 22.223.2 Casting from xs:string and xs:untypedAtomic.
When a value of any primitive type is cast as xs:date, the xs:date value TV is derived from ST and SV as follows:
- If ST is xs:date, then TV is SV.
- If ST is xs:dateTime, then let SYR be eg:convertYearToString( year-from-dateTime(SV)), let SMO be eg:convertTo2CharString( month-from-dateTime(SV)), let SDA be eg:convertTo2CharString( day-from-dateTime(SV)) and let STZ be eg:convertTZtoString(timezone-from-dateTime(SV)); TV is xs:date( concat(SYR, '-', SMO, '-', SDA, STZ) ).
- If ST is xs:untypedAtomic or xs:string, see 22.223.2 Casting from xs:string and xs:untypedAtomic.
When a value of any primitive type is cast as xs:gYearMonth, the xs:gYearMonth value TV is derived from ST and SV as follows:
- If ST is xs:gYearMonth, then TV is SV.
- If ST is xs:dateTime, then let SYR be eg:convertYearToString( year-from-dateTime(SV)), let SMO be eg:convertTo2CharString( month-from-dateTime(SV)) and let STZ be eg:convertTZtoString( timezone-from-dateTime(SV)); TV is xs:gYearMonth( concat(SYR, '-', SMO, STZ) ).
- If ST is xs:date, then let SYR be eg:convertYearToString( year-from-date(SV)), let SMO be eg:convertTo2CharString( month-from-date(SV)) and let STZ be eg:convertTZtoString( timezone-from-date(SV)); TV is xs:gYearMonth( concat(SYR, '-', SMO, STZ) ).
- If ST is xs:untypedAtomic or xs:string, see 22.223.2 Casting from xs:string and xs:untypedAtomic.
When a value of any primitive type is cast as xs:gYear, the xs:gYear value TV is derived from ST and SV as follows:
- If ST is xs:gYear, then TV is SV.
- If ST is xs:dateTime, let SYR be eg:convertYearToString( year-from-dateTime(SV)) and let STZ be eg:convertTZtoString( timezone-from-dateTime(SV)); TV is xs:gYear(concat(SYR, STZ)).
- If ST is xs:date, let SYR be eg:convertYearToString( year-from-date(SV)); and let STZ be eg:convertTZtoString( timezone-from-date(SV)); TV is xs:gYear(concat(SYR, STZ)).
- If ST is xs:untypedAtomic or xs:string, see 22.223.2 Casting from xs:string and xs:untypedAtomic.
When a value of any primitive type is cast as xs:gMonthDay, the xs:gMonthDay value TV is derived from ST and SV as follows:
- If ST is xs:gMonthDay, then TV is SV.
- If ST is xs:dateTime, then let SMO be eg:convertTo2CharString( month-from-dateTime(SV)), let SDA be eg:convertTo2CharString( day-from-dateTime(SV)) and let STZ be eg:convertTZtoString( timezone-from-dateTime(SV)); TV is xs:gYearMonth( concat( '--', SMO '-', SDA, STZ) ).
- If ST is xs:date, then let SMO be eg:convertTo2CharString( month-from-date(SV)), let SDA be eg:convertTo2CharString( day-from-date(SV)) and let STZ be eg:convertTZtoString( timezone-from-date(SV)); TV is xs:gYearMonth( concat( '--', SMO, '-', SDA, STZ) ).
- If ST is xs:untypedAtomic or xs:string, see 22.223.2 Casting from xs:string and xs:untypedAtomic.
When a value of any primitive type is cast as xs:gDay, the xs:gDay value TV is derived from ST and SV as follows:
- If ST is xs:gDay, then TV is SV.
- If ST is xs:dateTime, then let SDA be eg:convertTo2CharString( day-from-dateTime(SV)) and let STZ be eg:convertTZtoString( timezone-from-dateTime(SV)); TV is xs:gDay( concat( '---', SDA, STZ)).
- If ST is xs:date, then let SDA be eg:convertTo2CharString( day-from-date(SV)) and let STZ be eg:convertTZtoString( timezone-from-date(SV)); TV is xs:gDay( concat( '---', SDA, STZ)).
- If ST is xs:untypedAtomic or xs:string, see 22.223.2 Casting from xs:string and xs:untypedAtomic.
When a value of any primitive type is cast as xs:gMonth, the xs:gMonth value TV is derived from ST and SV as follows:
- If ST is xs:gMonth, then TV is SV.
- If ST is xs:dateTime, then let SMO be eg:convertTo2CharString( month-from-dateTime(SV)) and let STZ be eg:convertTZtoString( timezone-from-dateTime(SV)); TV is xs:gMonth( concat( '--' , SMO, STZ)).
- If ST is xs:date, then let SMO be eg:convertTo2CharString( month-from-date(SV)) and let STZ be eg:convertTZtoString( timezone-from-date(SV)); TV is xs:gMonth( concat( '--', SMO, STZ)).
- If ST is xs:untypedAtomic or xs:string, see 22.223.2 Casting from xs:string and xs:untypedAtomic.

22.1.623.1.6 Casting to `xs:boolean`

When the target type TT is xs:boolean, the resulting xs:boolean value TV is derived from the source value SV as follows:

If SV is an instance of xs:boolean, then TV is SV.
If SV is an instance of xs:numeric and SV is 0, +0, -0, 0.0, 0.0E0 or NaN, then TV is false.
If ST is is an instance of xs:numeric and SV is not one of the above values, then TV is true.
If ST is xs:untypedAtomic or xs:string, see 22.223.2 Casting from xs:string and xs:untypedAtomic.

22.1.723.1.7 Casting to `xs:base64Binary` and `xs:hexBinary`

Values of type xs:base64Binary can be cast as xs:hexBinary and vice versa, since the two types have the same value space. Casting to xs:base64Binary and xs:hexBinary is also supported from the same type and from xs:untypedAtomic, xs:string and subtypes of xs:string using [XML Schema Part 2: Datatypes Second Edition] semantics.

22.1.823.1.8 Casting to xs:anyURI

Casting to xs:anyURI is supported only from the same type, xs:untypedAtomic or xs:string.

When a value of any primitive type is cast as xs:anyURI, the xs:anyURI value TV is derived from the ST and SV as follows:

If ST is xs:untypedAtomic or xs:string see 22.223.2 Casting from xs:string and xs:untypedAtomic.

22.1.923.1.9 Casting to xs:QName and xs:NOTATION

Casting from xs:string or xs:untypedAtomic to xs:QName or xs:NOTATION is described in 22.223.2 Casting from xs:string and xs:untypedAtomic.

It is also possible to cast from xs:NOTATION to xs:QName, or from xs:QName to any type derived by restriction from xs:NOTATION. (Casting to xs:NOTATION itself is not allowed, because xs:NOTATION is an abstract type.) The resulting xs:QName or xs:NOTATION has the same prefix, local name, and namespace URI parts as the supplied value.

Note:

See 21.222.2 Constructor functions for xs:QName and xs:NOTATION for a discussion of how the combination of atomization and casting might not produce the desired effect.

22.1.1023.1.10 Casting to xs:ENTITY

[XML Schema Part 2: Datatypes Second Edition] says that “The value space of ENTITY is the set of all strings that match the NCName production ... and have been declared as an unparsed entity in a document type definition.” However, [XSL Transformations (XSLT) Version 4.0] and [XQuery 4.0: An XML Query Language] do not check that constructed values of type xs:ENTITY match declared unparsed entities. Thus, this rule is relaxed in this specification and, in casting to xs:ENTITY and types derived from it, no check is made that the values correspond to declared unparsed entities.

22.223.2 Casting from xs:string and xs:untypedAtomic

Changes in 4.0 ⬇ ⬆

When casting from a string to a duration or time or dateTime, it is now specified that when there are more digits in the fractional seconds than the implementation is able to retain, excess digits are truncated. Rounding upwards (which could affect the number of minutes or hours in the value) is not permitted. [Issue 1089 PR 1090 19 March 2024]

This section applies when the supplied value SV is an instance of xs:string or xs:untypedAtomic, including types derived from these by restriction. If the value is xs:untypedAtomic, it is treated in exactly the same way as a string containing the same sequence of characters.

The supplied string is mapped to a typed value of the target type as defined in [XML Schema Part 2: Datatypes Second Edition]. Whitespace normalization is applied as indicated by the whiteSpace facet for the datatype. The resulting whitespace-normalized string must be a valid lexical form for the datatype. The semantics of casting follow the rules of XML Schema validation. For example, "13" cast as xs:unsignedInt returns the xs:unsignedInt typed value 13. This could also be written xs:unsignedInt("13").

The target type can be any simple type other than an abstract type. Specifically, it can be a type whose variety is atomic, union, or list. In each case the effect of casting to the target type is the same as constructing an element with the supplied value as its content, validating the element using the target type as the governing type, and atomizing the element to obtain its typed value.

When the target type is a derived type that is restricted by a pattern facet, the lexical form is first checked against the pattern before further casting is attempted (See 22.3.1 Casting to derived types23.3.1 Casting to derived types22.3.123.3.1 Casting to derived types). If the lexical form does not conform to the pattern, a dynamic error [err:FORG0001] is raised.

For example, consider a user-defined type my:boolean which is derived by restriction from xs:boolean and specifies the pattern facet value="0|1". The expression "true" cast as my:boolean would fail with a dynamic error [err:FORG0001].

Facets other than pattern are checked after the conversion. For example if there is a user-defined datatype called my:height defined as a restriction of xs:integer with the facet <maxInclusive value="84"/>, then the expression "100" cast as my:height would fail with a dynamic error [err:FORG0001].

Casting to the types xs:NOTATION, xs:anySimpleType, or xs:anyAtomicType is not permitted because these types are abstract (they have no immediate instances).

Special rules apply when casting to namespace-sensitive types. The types xs:QName and xs:NOTATION are namespace-sensitive. Any type derived by restriction from a namespace-sensitive type is itself namespace-sensitive, as is any union type having a namespace-sensitive type among its members, and any list type having a namespace-sensitive type as its item type. For details, see 21.222.2 Constructor functions for xs:QName and xs:NOTATION.

Note:

Since version 3.0 of this specification, casting has been allowed between xs:QName and xs:NOTATION in either direction; this was not permitted in previous Recommendations. Version 3.0 also removed the rule that only a string literal (rather than a dynamic string) may be cast to an xs:QName

When casting to a numeric type:

If the value is too large or too small to be accurately represented by the implementation, it is handled as an overflow or underflow as defined in 4.2 Arithmetic operators on numeric values.
If the target type is xs:float or xs:double, the string -0 (and equivalents such as -0.0 or -000) should be converted to the value negative zero. However, if the implementation is reliant on an implementation of XML Schema 1.0 in which negative zero is not part of the value space for these types, these lexical forms may be converted to positive zero.

In casting to xs:decimal or to a type derived from xs:decimal, if the value is not too large or too small but nevertheless cannot be represented accurately with the number of decimal digits available to the implementation, the implementation may round to the nearest representable value or may raise a dynamic error [err:FOCA0006]. The choice of rounding algorithm and the choice between rounding and error behavior is implementation-defined.

When casting to xs:duration, xs:dateTime, or xs:time, if the seconds component has more fractional digits than are supported by the implementation, excess digits must be truncated. This rule ensures that components other than the seconds component are unaffected: for example xs:dateTime('2023-12-31T23:59:59.999999999') is guaranteed to deliver an xs:dateTime value whose year component is 2023 rather than 2024.

Note:

Implementations are required to support millisecond precision or greater.

In casting to xs:date, xs:dateTime, xs:gYear, or xs:gYearMonth (or types derived from these), if the value is too large or too small to be represented by the implementation, a dynamic error [err:FODT0001] is raised.

In casting to a duration value, if the value is too large or too small to be represented by the implementation, a dynamic error [err:FODT0002] is raised.

For xs:anyURI, the extent to which an implementation validates the lexical form of xs:anyURI is implementation-dependent.

If the cast fails for any other reason, a dynamic error [err:FORG0001] is raised.

22.323.3 Casting involving non-primitive types

Casting from xs:string and xs:untypedAtomic to any other type (primitive or non-primitive) has been described in 22.223.2 Casting from xs:string and xs:untypedAtomic. This section defines how other casts to non-primitive types operate, including casting to types derived by restriction, to union types, and to list types.

22.3.123.3.1 Casting to derived types

Casting a value to a derived type can be separated into a number of cases. In these rules:

The types xs:integer, xs:yearMonthDuration, and xs:dayTimeDuration are treated as quasi-primitive types (alongside the 20 truly primitive types).
For any atomic type T, let P(T) denote the most specific primitive or quasi-primitive type such that itemType-subtype(T, P(T)) is true.

The rules are then:

When the source type ST is the same type as the target type TT: this case always succeeds, returning the source value SV unchanged.
When itemType-subtype(ST, TT) is true: see 22.3.423.3.4 Casting from derived types to parent types.
When TT is the quasi-primitive type xs:integer and SV is an instance of xs:numeric: see 22.3.2 Casting to xs:integer23.3.2 Casting to xs:integer22.3.223.3.2 Casting to xs:integer.
When TT is the quasi-primitive type xs:yearMonthDuration or xs:dayTimeDuration and SV is an instance of xs:duration: see 22.3.323.3.3 Casting to xs:yearMonthDuration and xs:dayTimeDuration.
When P(ST) is the same type as P(TT): see 22.3.523.3.5 Casting within a branch of the type hierarchy.
Otherwise (P(ST) is not the same type as P(TT)): see 22.3.623.3.6 Casting across the type hierarchy.

22.3.223.3.2 Casting to xs:integer

When an atomic item SV is cast as xs:integer, the resulting xs:integer value TV is obtained as follows:

If ST is xs:decimal, xs:float or xs:double, then TV is SV with the fractional part discarded and the value converted to xs:integer. Thus, casting 3.1456 returns 3 while -17.89 returns -17. Casting 3.124E1 returns 31. If SV is too large to be accommodated as an integer, (see [XML Schema Part 2: Datatypes Second Edition] for implementation-defined limits on numeric values) a dynamic error is raised [err:FOCA0003]. If SV is one of the special xs:float or xs:double values NaN, INF, or -INF, a dynamic error is raised [err:FOCA0002].
In all other cases, the general rules of 22.3.1 Casting to derived types23.3.1 Casting to derived types22.3.123.3.1 Casting to derived types apply.

Note:

When casting to a subtype of xs:integer (for example, xs:long), the rules in 22.3.1 Casting to derived types23.3.1 Casting to derived types22.3.123.3.1 Casting to derived types apply. Note, however, that these rules treat xs:integer as a quasi-primitive type.

22.3.323.3.3 Casting to `xs:yearMonthDuration` and `xs:dayTimeDuration`

When the source value SV is an instance of xs:duration (including any subtype of xs:duration), then:

If the target type TT is xs:yearMonthDuration, the result is an instance of xs:yearMonthDuration whose months component is equal to the months component of SV. The seconds component of SV is ignored.
If the target type TT is xs:dayTimeDuration, the result is an instance of xs:dayTimeDuration whose seconds component is equal to the seconds component of SV. The months component of SV is ignored.

In all other cases, the general rules of 22.3.1 Casting to derived types23.3.1 Casting to derived types22.3.123.3.1 Casting to derived types apply.

Note:

In general, casting to xs:yearMonthDuration or xs:dayTimeDuration loses information.

Note:

When casting to a subtype of xs:dayTimeDuration or xs:yearMonthDuration, the rules in 22.3.1 Casting to derived types23.3.1 Casting to derived types22.3.123.3.1 Casting to derived types apply. Note, however, that these rules treat xs:dayTimeDuration and xs:yearMonthDuration as quasi-primitive types.

22.3.423.3.4 Casting from derived types to parent types

It is always possible to cast an atomic item A to a type T if the relation A instance of T is true, provided that T is not an abstract type.

For example, it is possible to cast an xs:unsignedShort to an xs:unsignedInt, to an xs:integer, to an xs:decimal, or to a union type whose member types are xs:integer and xs:double.

Since the value space of the original type is a subset of the value space of the target type, such a cast is always successful.

For the expression A instance of T to be true, T must be either an atomic type, or a union type that has no constraining facets. It cannot be a list type, nor a union type derived by restriction from another union type, nor a union type that has a list type among its member types.

The result will have the same value as the original, but will have a new type annotation:

If T is an atomic type, then the type annotation of the result is T.
If T is a union type, then the type of the result is an atomic type M such that M is one of the atomic types in the transitive membership of the union type T and A instance of M is true; if there is more than one type M that satisfies these conditions (which could happen, for example, if T is the union of two overlapping types such as xs:int and xs:positiveInteger) then the first one is used, taking the member types in the order in which they appear within the definition of the union type.

22.3.523.3.5 Casting within a branch of the type hierarchy

It is possible to cast an SV to a TT if the type of the SV and the TT type are both derived by restriction (directly or indirectly) from the same primitive type, provided that the supplied value conforms to the constraints implied by the facets of the target type. This includes the case where the target type is derived from the type of the supplied value, as well as the case where the type of the supplied value is derived from the target type. For example, an instance of xs:byte can be cast as xs:unsignedShort, provided the value is not negative.

If the value does not conform to the facets defined for the target type, then a dynamic error is raised [err:FORG0001]. See [XML Schema Part 2: Datatypes Second Edition]. In the case of the pattern facet (which applies to the lexical space rather than the value space), the pattern is tested against the canonical representation of the value, as defined for the source type (or the result of casting the value to an xs:string, in the case of types that have no canonical representation defined for them).

Note that this will cause casts to fail if the pattern excludes the canonical lexical representation of the source type. For example, if the type my:distance is defined as a restriction of xs:decimal with a pattern that requires two digits after the decimal point, casting of an xs:integer to my:distance will always fail, because the canonical representation of an xs:integer does not conform to this pattern.

In some cases, casting from a parent type to a derived type requires special rules. See 22.1.4 Casting to duration types23.1.4 Casting to duration types22.1.423.1.4 Casting to duration types for rules regarding casting to xs:yearMonthDuration and xs:dayTimeDuration. See 22.1.10 Casting to xs:ENTITY23.1.10 Casting to xs:ENTITY22.1.1023.1.10 Casting to xs:ENTITY, below, for casting to xs:ENTITY and types derived from it.

22.3.623.3.6 Casting across the type hierarchy

When the ST and the TT are derived, directly or indirectly, from different primitive types, this is called casting across the type hierarchy. Casting across the type hierarchy is logically equivalent to three separate steps performed in order. Errors can occur in either of the latter two steps.

Cast the SV, up the hierarchy, to the primitive type of the source, as described in 22.3.423.3.4 Casting from derived types to parent types.
1. If SV is an instance of xs:string or xs:untypedAtomic, check its value against the pattern facet of TT, and raise a dynamic error [err:FORG0001] if the check fails.
Let P(TT) be the most specific primitive or quasi-primitive type of which TT is a subtype, as described in 22.3.1 Casting to derived types23.3.1 Casting to derived types22.3.123.3.1 Casting to derived types.
Cast the value to P(TT), as described in 22.123.1 Casting from primitive types to primitive types if P(TT) is primitive, or as described in 22.3.1 Casting to derived types23.3.1 Casting to derived types22.3.123.3.1 Casting to derived types if P(TT) is quasi-primitive.
If TT is derived from xs:NOTATION, assume for the purposes of this rule that casting to xs:NOTATION succeeds.
Cast the value down to the target type TT, as described in 22.3.523.3.5 Casting within a branch of the type hierarchy

22.3.723.3.7 Casting to union types

If the target type of a cast expression (or a constructor function) is a type with variety union, the supplied value must be one of the following:

A value of type xs:string or xs:untypedAtomic. This case follows the general rules for casting from strings, and has already been described in 22.223.2 Casting from xs:string and xs:untypedAtomic.
If the union type has a pattern facet, the pattern is tested against the supplied value after whitespace normalization, using the whiteSpace normalization rules of the member datatype against which validation succeeds.
A value that is an instance of one of the atomic types in the transitive membership of the union type, and of the union type itself. This case has already been described in 22.3.423.3.4 Casting from derived types to parent types
This situation only applies when the value is an instance of the union type, which means it will never apply when the union is derived by facet-based restriction from another union type.
A value that is castable to one or more of the atomic types in the transitive membership of the union type (in the sense that the castable as operator returns true).
In this case the supplied value is cast to each atomic type in the transitive membership of the union type in turn (in the order in which the member types appear in the declaration) until one of these casts is successful; if none of them is successful, a dynamic error occurs [err:FORG0001]. If the union type has constraining facets then the resulting value must satisfy these facets, otherwise a dynamic error occurs [err:FORG0001].
If the union type has a pattern facet, the pattern is tested against the canonical representation of the result value.
Only the atomic types in the transitive membership of the union type are considered. The union type may have list types in its transitive membership, but (unless the supplied value is of type xs:string or xs:untypedAtomic, in which case the rules in 22.223.2 Casting from xs:string and xs:untypedAtomic apply), any list types in the membership are effectively ignored.

If more than one of these conditions applies, then the casting is done according to the rules for the first condition that applies.

If none of these conditions applies, the cast fails with a dynamic error [err:FORG0001].

Example: consider a type U whose member types are xs:integer and xs:date.

The expression "123" cast as U returns the xs:integer value 123.
The expression current-date() cast as U returns the current date as an instance of xs:date.
The expression 23.1 cast as U returns the xs:integer value 23.

Example: consider a type V whose member types are xs:short and xs:negativeInteger.

The expression "-123" cast as V returns the xs:short value -123.
The expression "-100000" cast as V returns the xs:negativeInteger value -100000.
The expression 93.7 cast as V returns the xs:short value 93.
The expression "93.7" cast as V raises a dynamic error [err:FORG0001] on the grounds that the string "93.7" is not in the lexical space of the union type.

Example: consider a type W that is derived from the above type V by restriction, with a pattern facet of -?\d\d.

The expression "12" cast as V returns the xs:short value 12.
The expression "123" cast as V raises an dynamic error [err:FORG0001] on the grounds that the string "123" does not match the pattern facet.

22.3.823.3.8 Casting to list types

If the target type of a cast expression (or a constructor function) is a type with variety list, the supplied value must be of type xs:string or xs:untypedAtomic. The rules follow the general principle for all casts from xs:string outlined in 22.223.2 Casting from xs:string and xs:untypedAtomic.

If the supplied value is not of type xs:string or xs:untypedAtomic, a type error is raised [err:XPTY0004]^XP.

The semantics of the operation are consistent with validation: that is, the effect of casting a string S to a list type L is the same as constructing an element or attribute node whose string value is S, validating it using L as the governing type, and atomizing the resulting node. The result will always be either failure, or a sequence of zero or more atomic items each of which is an instance of the item type of L (or if the item type of L is a union type, an instance of one of the atomic types in its transitive membership).

If the item type of the list type is namespace-sensitive, then the namespace bindings in the static context will be used to resolve any namespace prefix, in the same way as when the target type is xs:QName.

If the list type has a pattern facet, the pattern must match the supplied value after collapsing whitespace (an operation equivalent to the use of the fn:normalize-space function).

For example, the expression cast "A B C D" as xs:NMTOKENS produces a sequence of four xs:NMTOKEN values, ("A", "B", "C", "D").

For example, given a user-defined type my:coordinates defined as a list of xs:integer with the facet <xs:length value="2"/>, the expression my:coordinates("2 -1") will return a sequence of two xs:integer values (2, -1), while the expression my:coordinates("1 2 3") will result in a dynamic error because the length of the list does not conform to the length facet. The expression my:coordinates("1.0 3.0") will also fail because the strings 1.0 and 3.0 are not in the lexical space of xs:integer.

`op:numeric-multiply`(
`$arg1`	`as` `xs:numeric`,
`$arg2`	`as` `xs:numeric`
) `as` `xs:numeric`

`xs:string`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:string?`

`xs:boolean`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:boolean?`

`xs:decimal`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:decimal?`

`xs:float`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:float?`

`xs:double`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:double?`

`xs:duration`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:duration?`

`xs:dateTime`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:dateTime?`

`xs:time`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:time?`

`xs:date`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:date?`

`xs:gYearMonth`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:gYearMonth?`

`xs:gYear`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:gYear?`

`xs:gMonthDay`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:gMonthDay?`

`xs:gDay`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:gDay?`

`xs:gMonth`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:gMonth?`

`xs:hexBinary`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:hexBinary?`

`xs:base64Binary`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:base64Binary?`

`xs:anyURI`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:anyURI?`

`xs:QName`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:QName?`

`xs:normalizedString`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:normalizedString?`

XPath and XQuery Functions and Operators 4.0

W3C Editor's Draft 218 February 2026

Abstract

Status of this Document

Dedication

1 Introduction

1.3 Namespaces and prefixes

1.8 Type System

1.8.1 Item Types

1.9 Terminology

1.9.5 Properties of functions

2 Processing nodes

2.1 Accessors

2.1.1 fn:node-name

2.1.2 fn:nilled

2.1.3 fn:string

2.1.4 fn:data

2.1.5 fn:base-uri

2.1.6 fn:document-uri

2.2 Other functions on nodes

2.2.2 fn:local-name

2.2.3 fn:namespace-uri

2.2.5 fn:root

2.3 Functions on sequences of nodes

2.3.1 fn:distinct-ordered-nodes

4 Processing numerics

4.1 Numeric types

4.5 Parsing numbers

4.5.1 fn:number

8 Processing booleans

8.3 Functions on Boolean values

8.3.1 fn:boolean

14 Processing sequences

14.2 Comparison functions

14.2.2 fn:deep-equal

14.5 Functions on node identifiers

14.5.1 fn:id

14.5.2 fn:element-with-id

14.5.3 fn:idref

14.5.4 fn:generate-id

15 Parsing and serializing

15.1 Functions on XML Data

15.1.4 XSD validation

15.2 Functions on HTML Data

15.2.1 XDM Mapping from HTML DOM Nodes

15.2.1.1 attributes Accessor

15.2.1.2 base-uri Accessor

15.2.1.3 children Accessor

15.2.1.4 document-uri Accessor

15.2.1.5 is-id Accessor

15.2.1.6 is-idrefs Accessor

15.2.1.7 namespace-nodes Accessor

15.2.1.8 nilled Accessor

15.2.1.9 node-kind Accessor

15.2.1.10 node-name Accessor

15.2.1.11 parent Accessor

15.2.1.12 string-value Accessor

15.2.1.13 type-name Accessor

15.2.1.14 typed-value Accessor

15.2.1.15 unparsed-entity-public-id Accessor

15.2.1.16 unparsed-entity-system-id Accessor

15.3 Functions on JSON Data

15.3.8 fn:pin

15.3.9 fn:label

17 Higher-order functions

17.1 Processing function items

17.1.4 fn:function-identity

17.2 Basic higher-order functions

17.2.12 fn:partial-apply

18 Processing maps

18.4 Functions that Operate on Maps

18.4.1 map:build

20 Processing JNodes

20.1 Functions on JNodes

20.1.1 fn:JNode

20.1.2 fn:JNode-value

20.1.3 fn:JNode-selector

20.1.4 fn:JNode-position

20.2 Deep Update

20.2.1 fn:update

22.1.123.1.1 Casting to `xs:untypedAtomic`

22.1.223.1.2 Casting to `xs:string`

22.1.2.123.1.2.1 Casting numeric values to `xs:string`

22.1.2.223.1.2.2 Casting date/time values to `xs:string`

22.1.2.323.1.2.3 Casting `xs:duration` values to `xs:string`

22.1.623.1.6 Casting to `xs:boolean`

22.1.723.1.7 Casting to `xs:base64Binary` and `xs:hexBinary`

22.3.323.3.3 Casting to `xs:yearMonthDuration` and `xs:dayTimeDuration`

`xs:token`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:token?`

`xs:language`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:language?`

`xs:NMTOKEN`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:NMTOKEN?`

`xs:Name`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:Name?`

`xs:NCName`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:NCName?`

`xs:ID`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:ID?`

`xs:IDREF`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:IDREF?`

`xs:ENTITY`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:ENTITY?`

`xs:integer`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:integer?`

`xs:nonPositiveInteger`(
`$value`	`as` `xs:anyAtomicType?`	`:=` `.`
) `as` `xs:nonPositiveInteger?`