XML Tree
   HOME

TheInfoList



OR:

XML Extensible Markup Language (XML) is a markup language and file format for storing, transmitting, and reconstructing arbitrary data. It defines a set of rules for encoding documents in a format that is both human-readable and machine-readable ...
documents have a hierarchical structure and can conceptually be interpreted as a tree structure, called an XML tree. XML documents must contain a root element (one that is the parent of all other elements). All elements in an XML document can contain sub elements, text and attributes. The tree represented by an XML document starts at the root element and branches to the lowest level of elements. Although there is no consensus on the terminology used on XML Trees, at least two standard terminologies have been released by the
W3C The World Wide Web Consortium (W3C) is the main international standards organization for the World Wide Web. Founded in 1994 and led by Tim Berners-Lee, the consortium is made up of member organizations that maintain full-time staff working to ...
: * The terminology used in the
XPath XPath (XML Path Language) is an expression language designed to support the query or transformation of XML documents. It was defined by the World Wide Web Consortium (W3C) and can be used to compute values (e.g., strings, numbers, or Boolean v ...
Data Model * The terminology used in the
XML Information Set XML Information Set (XML Infoset) is a W3C specification describing an abstract data model of an XML document in terms of a set of ''information items''. The definitions in the XML Information Set specification are meant to be used in ''other'' sp ...
. XPath defines a syntax named ''XPath expressions'' that identifies one or more internal components (elements, attributes, etc.) of an XML document. XPath is widely used to accesses XML-encoded data. The XML Information Set, or XML infoset, describes an abstract data model for XML documents in terms of information items. It is often used in the specifications of XML languages, for its convenience in describing constraints on constructs those languages allow.


Representation as trees

In mathematics, a tree is an undirected graph in which any two vertices are connected by exactly one simple path. Any connected graph without simple cycles is a tree. A tree data structure simulates a hierarchical tree structure with a set of linked nodes. A hierarchy consists of an order defined on a set. The term hierarchy is used to stress a hierarchical relation among the elements. The XML specification defines an XML document as a well-formed text if it satisfies a list of syntax rules defined in the specification. This specification is long, however 2 key points relating to the tree structure of an XML document are: * The begin, end, and empty-element tags that delimit the elements are correctly nested, with none missing and none overlapping * A single "root" element contains all the other elements These features resemble those of trees, in that there is a single root node, and an order to the elements. XML has appeared as a first-class data type in other languages. The JavaScript (E4X) extension explicitly defines two specific objects (XML and XMLList), which support XML document nodes and XML node lists as distinct objects and use a dot-notation specifying parent-child relationships. These data structures represent XML documents as a tree structure. An XML Tree represented graphically can be as simple as an ASCII chart or a more graphically complex hierarchy. For instance, the XML document and the ASCII tree have the same structure. XML Trees do not show the content in an Instance document, only the structure of the document. In this example Product is the Root Element of the tree and the two child nodes of Product are Name and Details. Details contains two child nodes, Description and Price. The tree command in Windows and *nix also produce a similar tree structure and path. Product ├───Name └───Details └───Description └───Price Widget
This Widget is the highest quality widget. 5.50


XPath Data Model

XPath XPath (XML Path Language) is an expression language designed to support the query or transformation of XML documents. It was defined by the World Wide Web Consortium (W3C) and can be used to compute values (e.g., strings, numbers, or Boolean v ...
, the XML Path Language, is a query language for selecting nodes from an XML document. XPath defines a syntax named ''XPath expressions'' that can query an XML document for one or more internal components (elements, attributes, etc.). XPath is widely used in other core-XML specifications and in programming libraries for accessing XML-encoded data.


XPath Data Model terminology

The XPath Data Model is a long specification, and goes into many features unrelated to XML trees. Listed below are key terms from that specification and the XML specification. ;Instance: The data model represented as a sequence. ;Instance document: A document using and conforming to the same sequence/XML tree. ;Sequence: An order collection of zero or more items. A sequence cannot be a member of a sequence. A single item appearing individually is modeled as a sequence containing one item. ;Element: A node within the sequence that may contain ;Node: Any item represented in the XML tree/sequence. ;Root Node: The topmost element of the tree. All other elements and nodes must be contained within the root node. ;Item: A node or an atomic value. ;Value space: The part of an item that contains data rather than additional elements. ;Atomic type: A primitive simple type or a type derived by restriction from another atomic type. ;Atomic value: A value contained in the value space that is an atomic type. ;QName: The qualified name of an element. It must conform to naming rules of XML objects. (i.e. must start with a letter or underscore, case-sensitive, cannot start with the letters xml(in any case), can contain letters, digits, hyphens, underscores, and periods, cannot contain spaces.) ;Expanded-QName: The fully qualified name of an element. It may include a prefix and namespace. It must include the local name of the element. Within a given tree, document order satisfies the following constraints: * The root node is the first node. * Every node occurs before all of its children and descendants. * Namespace Nodes immediately follow the Element Node with which they are associated. The relative order of Namespace Nodes is stable but implementation-dependent. * Attribute Nodes immediately follow the Namespace Nodes of the element with which they are associated. If there are no Namespace Nodes associated with a given element, then the Attribute Nodes associated with that element immediately follow the element. The relative order of Attribute Nodes is stable but implementation-dependent. * The relative order of siblings is the order in which they occur in the children property of their parent node. * Children and descendants occur before following siblings.


XML Information Set

XML Information Set XML Information Set (XML Infoset) is a W3C specification describing an abstract data model of an XML document in terms of a set of ''information items''. The definitions in the XML Information Set specification are meant to be used in ''other'' sp ...
(XML Infoset) describes an abstract data model of an XML document in terms of a set of information items. The definitions in the XML Information Set specification are meant to be used in other specifications that need to refer to the information in a well-formed XML document. The infoset makes it convenient to describe constraints on the XML constructs other XML languages allow. An XML document has an information set if it is well-formed and satisfies the namespace constraints. An information set can contain up to eleven different types of information items: * The Document Information Item (always present) * Element Information Items * Attribute Information Items * Processing Instruction Information Items * Unexpanded Entity Reference Information Items * Character Information Items * Comment Information Items * The Document Type Declaration Information Item * Unparsed Entity Information Items * Notation Information Items * Namespace Information Items


XML Information Set terminology

The XML Information Set is a long specification, and goes into many features unrelated to XML trees. Listed below are the most important terms relating to XML tree terminology: "There is exactly one document information item in the information set, and all other information items are accessible from the properties of the document information item, either directly or indirectly through the properties of other information items. The document information item has the following properties: * hildren * ocument element * otations * nparsed entities * ase URI * haracter encoding scheme * tandalone * ersion * ll declarations processed There is an element information item for each element appearing in the XML document. One of the element information items is the value of the ocument elementproperty of the document information item, corresponding to the root of the element tree, and all other element information items are accessible by recursively following its hildrenproperty. An element information item has the following properties: * amespace name* ocal name* refix* hildren* ttributes* amespace attributes* n-scope namespaces* ase URI*
arent Arent can refer to *Arent (given name) *Arent (surname) Arend and Arent are primarily Low German patronymic surnames from the given name Arend. The Dutch surnames "Van den Arend" and "Den Arend" means "(from) the eagle" and have a toponymic origin ...
There is an attribute information item for each attribute (specified or defaulted) of each element in the document, including namespace declarations. The latter however appear as members of an element's amespace attributesproperty rather than its ttributesproperty. Attributes declared in the DTD with no default value and not specified in the element's start tag are not represented by attribute information items. An attribute information item has the following properties: * amespace name * ocal name * refix * ormalized value * pecified * ttribute type * eferences *
wner element WNER (1410 AM) is a sports radio station in Watertown, New York, United States. The station is owned by Stephens Media Group. It broadcasts the national programming of Fox Sports Radio. History WOTT On June 11, 1958, Thousand Islands Broadcasti ...
ref>XML Information Set (Second Edition), February 4 2004, http://www.w3.org/TR/xml-infoset/


Notes

{{DEFAULTSORT:Xml Tree XML