Xml2json
Translate any XML document to corresponding JSON or vice versa
Install / Use
/learn @henrikingo/Xml2jsonREADME
xml2json
Translate any XML document to corresponding JSON or vice versa
HISTORY
Most of this code was written by Stefan Gössner and published at: http://goessner.net/download/prj/jsonxml/
You can read a description of what the library originally did at: http://www.xml.com/pub/a/2006/05/31/converting-between-xml-and-json.html
I did 2 major changes to the original code:
- XML attributes are translated as siblings, rather than children, to their corresponding "parent" element in JSON. This produces more deterministic translation.
- Use JSON list structure more frequently. This ensures lossless translation in more cases than the original and also is capable of handling some "mixed" cases that the original would just pass along as untranslated XML.
Details of these changes are explained below. For anything else, read the xml.com article linked above.
USAGE
The xml2json library works both in a browser and node.js. (In node.js, it depends on libxml plugin.)
For example usage in a browser, see xmljson_demo.html
For example usage in node.js, see http://github.com/henrikingo/node-xml2json
DETAILS
An explanation of changes wrt the original library by Stefan Grössner follows:
Consider the following 2 translations in the original library:
<e>text</e>
{
"e":"text"
}
vs
<e name="value">text</e>
{
"e":{
"@name":"value",
"#text":"text"
}
}
=> the path to "text" changes depending on whether the element has attributes present or not. Very bad if there could be optional attributes, or attributes added over time.
For more examples, see http://goessner.net/download/prj/jsonxml/xmljson_test.html
Solution is to lift out attributes to be on the same level as the element itself:
<e name="value">text</e>
{
e@name:"value",
e:"text"
}
Note that for empty elements, we still explicitly print null for clarity:
<e name="value" />
{
e@name:"value",
e:null
}
The other fix is to use lists to more closely resemble original xml structure in case of multiple children with same node name.
Original:
<e> <a>text</a> <a>text</a> </e>
{
"e":{"a":[
"text",
"text"
]}
}
Me:
<e> <a>text</a> <a>text</a> </e>
{
e:[
{
a:"text"
},
{
a:"text"
}
]
}
This approach allows to handle "mixed content" elements better, where original gives up:
<a>x<c/>y</a>
{
"a":"x<c/>y"
}
Me:
<a>x<c/>y</a>
{
a:[
"x",
{
c:null
},
"y"
]
}
(Same fix also supports CDATA sections, should there be any.)
TODO
All types are strings. Preferably types would be defined in xml schema, and the xml parser would return node values in correct javascript type. If this is not the case, we can emulate something by converting strings to other types in a separate function to be done later (ie xml schema hard coded into a js function) For database searches, having the correct types will help.
Not yet supported in json->xml:
Multi-dimensional arrays. What is the correct xml presentation for this?
{ e : [["a", "b"], ["c", "d"]] }
Related Skills
node-connect
341.0kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
84.4kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
341.0kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
commit-push-pr
84.4kCommit, push, and open a PR
