World: r3wp

Join the discussions in the REBOL3 world...

[XML] xml related conversations

older newer	first last
Chris 3-Dec-2008 [592x11]	Also, a tag with no attributes containing only text will only contain text:
	>> load-xml "<try>This</try>" == [ <try> "This" ]
	response: context [ status: name: value: none ] example: {<rsp> <status>Good</status> <payload> <value name="one">two</value> </payload> </rsp>} probe make response [ parse load-xml example [ <rsp> into [ <status> set status ["Good" \| "Bad"] <payload> into [ <value> into [ /name set name string! # set value string! ] ] ] ] ]
	All the 'into values are a bit of a pain, but work can be broken up...
	Note, this parser is destructive - ie. flattening will only provide an approximation of the original xml string.
	So ymmv depending on need.
	; Next, Quick DOM: do http://www.ross-gill.com/r/qdom.r
	Only one method at the moment - get-by-tagname Note, this is not an attempt to implement W3 DOM. Just a quick approximation for fast manipulation (hence the name). It's object happy, not sure of the weight considerations as such.
	do http://www.ross-gill.com/r/qdom.r doc: load-dom {<some><xml>to try</xml></some>} values: doc/get-by-tagname <xml> values/1/value = "to try"
	; You can still parse the tree too: parse doc/tree [<some> into [<xml> "to try"]]
	This is not an exercise in bloat, I plan to implement only a few key methods. Though if anyone has any requests?
Chris 4-Dec-2008 [603]	Ok, another revision. This has a few more methods, I may strip them down to read-only, as I don't need to manipulate the object though I left them in for completeness. >> do http://www.ross-gill.com/r/qdom.r connecting to: www.ross-gill.com Script: "QuickDOM" (none) >> doc: load-dom {<some><xml id="foo">to try</xml></some>} >> foo: doc/get-by-id "foo" >> foo/name == <xml> >> foo/value == [ /id "foo" # "to try" ] >> kids: foo/children == [make object! [ name: # value: "to try" tree: [ # "to try" ] position: [ ... >> kids/1/value == "to try" >> doc/tree/<some>/<xml>/(#) == "to try"
Geomol 2-Mar-2009 [604]	RebXML spec: http://www.fys.ku.dk/~niclasen/rebxml/rebxml-spec.html Scripts are in the Library: http://www.rebol.org
Graham 22-Jun-2009 [605x4]	Has anyone written anything to format/index XML documents?
	/indent .. not index
	No matter .. it was easy enough.
	Now has anyone written a recursive routine to turn a rebol object into XML? I couldn't find anything like this on rebol.org yet it doesn't sound hard to do ...
Gregg 22-Jun-2009 [609]	There must be something, but I don't have anything here that turned up, and I don't remember doing one myself. If it helps, you could use the JSON converter in %json.r as a starting point.
Graham 22-Jun-2009 [610x5]	This seems to work for me ... obj2xml: func [ obj [object!] out [string!] /local o ][ foreach element next first obj [ repend out [ to-tag element newline ] either object? o: get in obj element [ obj2xml o out ][ repend out [ o newline ] ] repend out [ to-tag join "/" element newline ] ] ]
	using this obj2xml: func [ obj [object!] out [string!] /local o ][ foreach element next first obj [ repend out [ to-tag element newline ] either object? o: get in obj element [ obj2xml o out ][ repend out [ o newline ] ] repend out [ to-tag join "/" element newline ] ] ]
	crap ... clipboard bug
	>> probe obj make object! [ a: "testing" b: "again" c: make object! [ d: "testing2" e: "again2" f: make object! [ g: "testing3" h: "again3" ] ] i: "finished" ]
	gives this <a> testing </a> <b> again </b> <c> <d> testing2 </d> <e> again2 </e> <f> <g> testing3 </g> <h> again3 </h> </f> </c> <i> finished </i>
Steeve 22-Jun-2009 [615]	Hmm.. Really, have you the tabulations ?
Graham 22-Jun-2009 [616x2]	Yes, separate script does the tabulations
Graham 22-Jun-2009 [616x2]	probably should change line repend out [ o newline ] to repend out [ any [ o copy "" ] newline ]
Steeve 22-Jun-2009 [618]	then, for NONE! values, it will add an empty line
Graham 22-Jun-2009 [619x3]	format-xml: func [ xml /local out space prev ][ out: copy "" spacer: copy "" prev: copy </tag> foreach tag load/markup xml [ either tag = find tag "/" [ ; we have a close tag ; reduce the spacer by a tab unless the previous was an open tag either not tag? prev [ ; not a tag remove/part spacer 4 ][ ; is a tag if prev = find prev "/" [ ; last was a closing tag remove/part spacer 4 ] ] ][ either tag? tag [ ; current is tag ; indent only if the prev is not a closing tag if not prev = find prev "/" [ insert/dup spacer " " 4 ] ][ ; is data insert/dup spacer " " 4 ] ] repend out rejoin [ spacer tag newline ] prev: copy tag ] view layout compose [ area (out) 400x400 ] ] obj2xml: func [ obj [object!] out [string!] /local o ][ foreach element next first obj [ repend out [ to-tag element ] either object? o: get in obj element [ obj2xml o out ][ repend out any [ o copy "" ] ] repend out [ to-tag join "/" element ] ] ]
	remove the newlines to solve that issue :)
	I was using rebelxml to construct xml ... but I came across some bugs. So this way of doing it looks easier ....
Graham 23-Jun-2009 [622]	What are people using to construct large XML documents ... of 100s of lines?
Maxim 23-Jun-2009 [623x4]	a modified version of John's rebXML tools. changed the output structure to allow rebol's path notation to be used to traverse the loaded xml.
	I also replaced the use of url for the tag words because they fail when using namespaced xml elements.
	building output objects instead would be simple, but the RAM/Speed/symbol table implications of all the binding involved makes this un-optimal.
	(...objects instead of blocks ....)
Graham 23-Jun-2009 [627]	So, you used blocks instead of objects?
Maxim 23-Jun-2009 [628x2]	yes all the time. accessing is exactly the same as for objects. its actually much more flexible.
Maxim 23-Jun-2009 [628x2]	cause you have have the same element severall times, which is valid xml, but invalid in contexts.
Graham 23-Jun-2009 [630]	True
Maxim 23-Jun-2009 [631]	and you can easily separate attributes from elements, just by affecting them to different types.
Graham 23-Jun-2009 [632x3]	Although the XML I'm dealing with doesn't have duplicate elements.
	Or name spaces
	Have you posted your modifications anywhere??
Maxim 23-Jun-2009 [635x3]	the most stable engine I built which accepted all xml possibilities ended loading xml like so: [ <element> [<subelement> [#attribute "attr-value" . "subelement content"]]]
	the . is assigned the value of the elements. the above would result from the following XML: <element> <subelement attribute="attr-value"> subelement content </subelement> </element>
	And you can access it this way: document/<element>/<subelement>/#attribute document/<element>/<subelement>/.
Graham 23-Jun-2009 [638x2]	I find working with objects much easier though ...
Graham 23-Jun-2009 [638x2]	I guess the duplicate elements could be solved by using blocks for them
Maxim 23-Jun-2009 [640]	I've never posted that specific version cause it was closed source for a client. but I have my own new engine, which does the same, but attacking the parse rules directly... its probably faster. I've not released it.
BrianH 23-Jun-2009 [641]	Really? I went positional: ["element" "namespace" ["attribute" "value"] ["subelement" ...] "text" ...] with missing namespace or attribute block being #[none], so defaults can be done with ANY.
older newer	first last