There is a bug in the previous author's function. If a tag without children contains both attributes and a value, the function skips the attributes.
-- Editor's note: The previsou author was godseth at o2 dot pl. His note was removed. --
Here's an improved version that keeps both:
<?php
$xml = new XMLReader();
function parseXml($xml) {
$assoc = null;
$dc = -1;
while($xml->read()){
switch ($xml->nodeType) {
case XMLReader::END_ELEMENT: return $assoc;
case XMLReader::ELEMENT:
if(!isset($assoc[$xml->name])) {
if($xml->hasAttributes) {
$assoc[$xml->name][] = $xml->isEmptyElement ? '' : parseXml($xml);
} else {
if($xml->isEmptyElement) {
$assoc[$xml->name] = '';
} else {
$assoc[$xml->name] = parseXml($xml);
}
}
} elseif (is_array($assoc[$xml->name])) {
if($xml->hasAttributes) {
$assoc[$xml->name][] = $xml->isEmptyElement ? '' : parseXml($xml);
} else {
if($xml->isEmptyElement) {
$assoc[$xml->name][] = '';
} else {
$assoc[$xml->name][] = parseXml($xml);
}
}
} else {
$mOldVar = $assoc[$xml->name];
$assoc[$xml->name] = array($mOldVar);
if($xml->hasAttributes) {
$assoc[$xml->name][] = $xml->isEmptyElement ? '' : parseXml($xml);
} else {
if($xml->isEmptyElement) {
$assoc[$xml->name][] = '';
} else {
$assoc[$xml->name][] = parseXml($xml);
}
}
}
if($xml->hasAttributes){
$el =& $assoc[$xml->name][count($assoc[$xml->name]) - 1];
while($xml->moveToNextAttribute()) {
$el[$xml->name] = $xml->value;
}
}
break;
case XMLReader::TEXT:
case XMLReader::CDATA:
//$assoc .= $xml->value;
$assoc[++$dc] = $xml->value;
}
}
return $assoc;
}
?>
The XMLReader class
Introduction
The XMLReader extension is an XML Pull parser. The reader acts as a cursor going forward on the document stream and stopping at each node on the way.
Class synopsis
Properties
- attributeCount
-
The number of attributes on the node
- baseURI
-
The base URI of the node
- depth
-
Depth of the node in the tree, starting at 0
- hasAttributes
-
Indicates if node has attributes
- hasValue
-
Indicates if node has a text value
- isDefault
-
Indicates if attribute is defaulted from DTD
- isEmptyElement
-
Indicates if node is an empty element tag
- localName
-
The local name of the node
- name
-
The qualified name of the node
- namespaceURI
-
The URI of the namespace associated with the node
- nodeType
-
The node type for the node
- prefix
-
The prefix of the namespace associated with thenode
- value
-
The text value of the node
- xmlLang
-
The xml:lang scope which the node resides
Predefined Constants
XMLReader Node Types
- XMLReader::NONE
-
No node type
- XMLReader::ELEMENT
-
Start element
- XMLReader::ATTRIBUTE
-
Attribute node
- XMLReader::TEXT
-
Text node
- XMLReader::CDATA
-
CDATA node
- XMLReader::ENTITY_REF
-
Entity Reference node
- XMLReader::ENTITY
-
Entity Declaration node
- XMLReader::PI
-
Processing Instruction node
- XMLReader::COMMENT
-
Comment node
- XMLReader::DOC
-
Document node
- XMLReader::DOC_TYPE
-
Document Type node
- XMLReader::DOC_FRAGMENT
-
Document Fragment node
- XMLReader::NOTATION
-
Notation node
- XMLReader::WHITESPACE
-
Whitespace node
- XMLReader::SIGNIFICANT_WHITESPACE
-
Significant Whitespace node
- XMLReader::END_ELEMENT
-
End Element
- XMLReader::END_ENTITY
-
End Entity
- XMLReader::XML_DECLARATION
-
XML Declaration node
XMLReader Parser Options
- XMLReader::LOADDTD
-
Load DTD but do not validate
- XMLReader::DEFAULTATTRS
-
Load DTD and default attributes but do not validate
- XMLReader::VALIDATE
-
Load DTD and validate while parsing
- XMLReader::SUBST_ENTITIES
-
Substitute entities and expand references
Table of Contents
- XMLReader::close — Close the XMLReader input
- XMLReader::expand — Returns a copy of the current node as a DOM object
- XMLReader::getAttribute — Get the value of a named attribute
- XMLReader::getAttributeNo — Get the value of an attribute by index
- XMLReader::getAttributeNs — Get the value of an attribute by localname and URI
- XMLReader::getParserProperty — Indicates if specified property has been set
- XMLReader::isValid — Indicates if the parsed document is valid
- XMLReader::lookupNamespace — Lookup namespace for a prefix
- XMLReader::moveToAttribute — Move cursor to a named attribute
- XMLReader::moveToAttributeNo — Move cursor to an attribute by index
- XMLReader::moveToAttributeNs — Move cursor to a named attribute
- XMLReader::moveToElement — Position cursor on the parent Element of current Attribute
- XMLReader::moveToFirstAttribute — Position cursor on the first Attribute
- XMLReader::moveToNextAttribute — Position cursor on the next Attribute
- XMLReader::next — Move cursor to next node skipping all subtrees
- XMLReader::open — Set the URI containing the XML to parse
- XMLReader::read — Move to next node in document
- XMLReader::readInnerXML — Retrieve XML from current node
- XMLReader::readOuterXML — Retrieve XML from current node, including it self
- XMLReader::readString — The methodName purpose
- XMLReader::setParserProperty — Set or Unset parser options
- XMLReader::setRelaxNGSchema — Set the filename or URI for a RelaxNG Schema
- XMLReader::setRelaxNGSchemaSource — Set the data containing a RelaxNG Schema
- XMLReader::setSchema — Validate document against XSD
- XMLReader::XML — Set the data containing the XML to parse
XMLReader
12-Nov-2008 04:32
19-Jun-2008 06:51
Next version xml2assoc with some improve fixes:
- no doubled data
- no buffer arrays
<?php
/*
Read XML structure to associative array
--
Using:
$xml = new XMLReader();
$xml->open([XML file]);
$assoc = xml2assoc($xml);
$xml->close();
*/
function xml2assoc($xml) {
$assoc = null;
while($xml->read()){
switch ($xml->nodeType) {
case XMLReader::END_ELEMENT: return $assoc;
case XMLReader::ELEMENT:
$assoc[$xml->name][] = array('value' => $xml->isEmptyElement ? '' : xml2assoc($xml));
if($xml->hasAttributes){
$el =& $assoc[$xml->name][count($assoc[$xml->name]) - 1];
while($xml->moveToNextAttribute()) $el['attributes'][$xml->name] = $xml->value;
}
break;
case XMLReader::TEXT:
case XMLReader::CDATA: $assoc .= $xml->value;
}
}
return $assoc;
}
?>
16-Mar-2008 02:03
make some modify from Sergey Aikinkulov's note
<?php
function xml2assoc(&$xml){
$assoc = NULL;
$n = 0;
while($xml->read()){
if($xml->nodeType == XMLReader::END_ELEMENT) break;
if($xml->nodeType == XMLReader::ELEMENT and !$xml->isEmptyElement){
$assoc[$n]['name'] = $xml->name;
if($xml->hasAttributes) while($xml->moveToNextAttribute()) $assoc[$n]['atr'][$xml->name] = $xml->value;
$assoc[$n]['val'] = xml2assoc($xml);
$n++;
}
else if($xml->isEmptyElement){
$assoc[$n]['name'] = $xml->name;
if($xml->hasAttributes) while($xml->moveToNextAttribute()) $assoc[$n]['atr'][$xml->name] = $xml->value;
$assoc[$n]['val'] = "";
$n++;
}
else if($xml->nodeType == XMLReader::TEXT) $assoc = $xml->value;
}
return $assoc;
}
?>
add else if($xml->isEmptyElement)
may be some xml has emptyelement
15-Feb-2008 11:30
<?php
function parseXML($node,$seq,$path) {
global $oldpath;
if (!$node->read())
return;
if ($node->nodeType != 15) {
print '<br/>'.$node->depth;
print '-'.$seq++;
print ' '.$path.'/'.($node->nodeType==3?'text() = ':$node->name);
print $node->value;
if ($node->hasAttributes) {
print ' [hasAttributes: ';
while ($node->moveToNextAttribute()) print '@'.$node->name.' = '.$node->value.' ';
print ']';
}
if ($node->nodeType == 1) {
$oldpath=$path;
$path.='/'.$node->name;
}
parseXML($node,$seq,$path);
}
else parseXML($node,$seq,$oldpath);
}
$source = "<tag1>this<tag2 id='4' name='foo'>is</tag2>a<tag2 id='5'>common</tag2>record</tag1>";
$xml = new XMLReader();
$xml->XML($source);
print htmlspecialchars($source).'<br/>';
parseXML($xml,0,'');
?>
Output:
<tag1>this<tag2 id='4' name='foo'>is</tag2>a<tag2 id='5'>common</tag2>record</tag1>
0-0 /tag1
1-1 /tag1/text() = this
1-2 /tag1/tag2 [hasAttributes: @id = 4 @name = foo ]
2-3 /tag1/text() = is
1-4 /text() = a
1-5 /tag2 [hasAttributes: @id = 5 ]
2-6 /text() = common
1-7 /text() = record
15-Feb-2006 07:50
Some more documentation (i.e. examples) would be nice :-)
This is how I read some mysql parameters in an xml file:
<?php
$xml = new XMLReader();
$xml->open("config.xml");
$xml->setParserProperty(2,true); // This seems a little unclear to me - but it worked :)
while ($xml->read()) {
switch ($xml->name) {
case "mysql_host":
$xml->read();
$conf["mysql_host"] = $xml->value;
$xml->read();
break;
case "mysql_username":
$xml->read();
$conf["mysql_user"] = $xml->value;
$xml->read();
break;
case "mysql_password":
$xml->read();
$conf["mysql_pass"] = $xml->value;
$xml->read();
break;
case "mysql_database":
$xml->read();
$conf["mysql_db"] = $xml->value;
$xml->read();
break;
}
}
$xml->close();
?>
The XML file used:
<?xml version='1.0'?>
<MySQL_INIT>
<mysql_host>localhost</mysql_host>
<mysql_database>db_database</mysql_database>
<mysql_username>root</mysql_username>
<mysql_password>password</mysql_password>
</MySQL_INIT>
