World: r3wp

[Parse] Discussion of PARSE dialect

Pekr 15-Sep-2010 [5185]	ah, there was USE 1 (BrianH) and USE 2 proposal (Peta). USE 1 was assigned for implementation, but deferred along with LIMIT, OF, REVERSE and some other proposals ...
Ladislav 15-Sep-2010 [5186]	Pekr, there are no "parse variables", as far as I know. My above code was inspired by BrianH and USE2
Pekr 15-Sep-2010 [5187]	I know, but I reacted upon Carl's recent blog, and if Carl dares to use the incorrect terminology, then I can too, no? :-) This function is useful because PARSE rules often store data in variables which, depending on how you handle them, can be overwritten by rule recursion. (Which is done often in parsing.)
Ladislav 15-Sep-2010 [5188]	parse rules often store data in variables is fine, but that does not mean, that there are "parse variables", those are just variables
Pekr 15-Sep-2010 [5189]	ok, as for terminology, how do I refer to the following:
start: copy user to "</user>" :end (temp: find blk user)
While the code is not real, I can see three different "variable types":
- start: :end - markers
- user - parse level variable?
- temp: - rebol level word?
Thanks :-)
Ladislav 15-Sep-2010 [5190x5]	Sorry if you find it nitpicking, but I wanted to point out, that an assumption that "parse variables are safe for recursion rules" should be wrong exactly because there are only variables in REBOL, which the interpreter does not have any reason to make "recursion safe" unless you specifically do something about it.
	Regarding the above "three different variable types" - as far as the interpreter is concerned, there is no difference between the 'start 'end and 'copy variables. You used them in a specific way, but, later, you can very well use another expression, where you write: user: copy start to "whatever" etc. So, you can easily see, that neither the 'start nor the 'user variable has any "variable type" you could infer based upon their usage in one specific expression.
	err, I meant the 'start 'end and 'user variables
	if you call 'temp a "rebol level variable" in the above example expression, how would you call it in the expression below? (temp: either 1 > 0 [none] [[end skip]]) temp
	So, generally speaking, it may be useful for you to use some variables only as "markers", but that "variable type" is something you impose just to make things easy for you to handle. The interpreter does not (and should not) make such distinctions.
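A minimal sketch of the recursion problem described above, assuming REBOL 2 (hence parse/all). The names letter, value, seen, group, stack, kept and safe-group are made up for illustration and come from neither the thread nor the USE proposals. The first rule shows a word being overwritten by a recursive call; the second shows one manual workaround with an explicit stack.

```rebol
rebol []

letter: charset [#"a" - #"z"]
value: none

; Naive rule: VALUE is an ordinary word shared by every level of the recursion.
seen: copy []
group: [
    "(" copy value some letter
    opt group                 ; a nested group overwrites VALUE
    (append seen value)       ; so the outer level no longer sees its own text
    any letter ")"
]
parse/all "(a(b)c)" group
probe seen

; One manual workaround: keep each level's VALUE on an explicit stack.
stack: copy []
kept: copy []
safe-group: [
    "(" copy value some letter
    (insert stack value)      ; save this level's text before recursing
    opt safe-group
    (value: first stack  remove stack  append kept value)
    any letter ")"
]
parse/all "(a(b)c)" safe-group
probe kept
```

With this input the first probe should show ["b" "b"] (the outer "a" is lost) and the second ["b" "a"]. The stack version is only safe here because a rule never fails after it has pushed; a general solution needs more care.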
Pekr 15-Sep-2010 [5195]	temp is "rebol level variable", whose value is bound to parse dialect :-)
Ladislav 15-Sep-2010 [5196x2]	Hmm, what does the "bound" word mean?
Ladislav 15-Sep-2010 [5196x2]	As I said, you can use some variables for specific purposes only, but that is your decision, and is not supported by any interpreter property, and certainly is not useful for all purposes. I may want to use some variables in more "roles" than you want to.
Pekr 15-Sep-2010 [5198x2]	hmm, then maybe not. Temp following the paren simply refers to the rule, resulting from previous paren expression evaluation ....
Pekr 15-Sep-2010 [5198x2]	and hence is probably a regular rebol level word, just used inside the parse dialect?
Ladislav 15-Sep-2010 [5200]	In my opinion, all of the above 'start 'end 'user and 'temp words are "regular" REBOL words, while e.g. the 'copy word above is actually a parse dialect keyword
Pekr 15-Sep-2010 [5201x2]	correct ...
Pekr 15-Sep-2010 [5201x2]	and - it was not nitpicking on your part - using the correct terminology where possible is important, as it helps to correctly interpret the way in which the language works. It is just lack of knowledge on my part, that I confuse the terms ... exactly because I often don't understand the internals ...
Izkata 15-Sep-2010 [5203]	start: :end and temp: would just be called set-words and get-words, either within the Parse dialect or the Do dialect (Do dialect being "rebol level")
Anton 15-Sep-2010 [5204]	To be even more precise; The 'copy word above is also a "regular" rebol word (it just happens, additionally, to be interpreted by PARSE as a keyword in the PARSE dialect).
Ladislav 15-Sep-2010 [5205]	Re start: and :start e.g. - they are values of different datatypes, but can be seen as just one variable.
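A small sketch of the set-word / get-word usage discussed above, assuming REBOL 2. The words text, start, finish and user are ordinary words chosen for illustration; finish is used instead of end only to avoid confusion with the PARSE keyword of the same spelling.

```rebol
rebol []

text: "<user>pekr</user>"

parse/all text [
    thru "<user>"
    start:                 ; set-word: START now refers to TEXT at this position
    to "</user>"
    finish:                ; FINISH refers to the position just before "</user>"
    :start                 ; get-word: move the parse position back to START
    copy user to "</user>" ; USER receives a copy of the text between the markers
    to end
]

probe copy/part start finish   ; the same text, taken via the two markers
probe user
```

Both probes should show "pekr". After PARSE returns, start, finish and user are still plain words that can be reused for anything else.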
Ladislav 16-Sep-2010 [5206]	http://www.rebol.org/view-script.r?script=use-rule.r updated. Changes: now it implements both USE-RULE variants as described in http://www.rebol.net/wiki/Parse_Project .
Ladislav 18-Sep-2010 [5207x6]	Hi, I rewrote http://www.fm.tul.cz/~ladislav/rebol/evaluate.r to contain slightly more comments and to use the USE-RULE function, so that it now became a more complex example of its usage.
	I hope everyone struggling with local variables in PARSE will find this a much more comfortable way.
	On the other hand, I expect the %evaluate.r to have a value of its own, since it shows how to handle expressions using PARSE, while being able to respect different priority/associativity rule sets.
	The funny thing about it is, that just one local variable per parse rule was quite sufficient in this case, although the example isn't trivial in my opinion.
	http://en.wikibooks.org/wiki/REBOL_Programming/Language_Features/Parse#Local_variables_in_parse_rules , a new subsection in the REBOL Programming wikibook.
Janko 18-Sep-2010 [5213]	wowo, most awesome Ladislav!
Ladislav 18-Sep-2010 [5214x2]	Pekr, knowing that you were interested in the subject as well, I sincerely hope, that you read the wikibook subsection, run the examples, and let me know what you are missing in the doc and in the functionality.
Ladislav 18-Sep-2010 [5214x2]	thanks, Janko
Claude 20-Sep-2010 [5216x4]	hi, I would like to parse a string like this {number: 123456 name: abcd message: i like rebol}
	i would like to use parse to do it
	something like this: parse mystring ["number:" number: "name:" name: "message:" msg:]
	and get number, name and msg. Is that possible? And how?
Maxim 20-Sep-2010 [5220]	is the data guaranteed to be rebol compatible? (you can use 'LOAD on it and it will never return an error)
Claude 20-Sep-2010 [5221x3]	but mystring is like this
{Number : 10017
Name : Disable Message Partner
Application: MXS
Severity : Info
Type : Alarm Event
Date-Time : 20/09/10 12:39:43
GMT : Mon Sep 20 10:39:43 2010}
	i start to use => parse mystring [thru "Number :" number: to end]
	i would like to do => parse mystring [thru "Number :" number: to "Name :"] but this doesn't work !!!!
Maxim 20-Sep-2010 [5224x4]	using 'TO or THRU rules is guaranteed to make your life a nightmare for arbitrary string parsing, unless it's very linear in content.
	isn't there a rebol string parser on rebol.org which has the basic parsing rules to filter out rebol data?
	with a few searches I didn't zero in on it, but I seem to recall seeing that before.
	otherwise, you can just do a line by line parsing, filtering out "keywords" at the start and then loading the data after the ":"
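A rough sketch of the line-by-line approach just mentioned, assuming REBOL 2 and a shortened copy of the sample text. The words data, line, pos, key and val are illustrative, and the value is kept as a string rather than loaded.

```rebol
rebol []

data: {Number : 10017
Name : Disable Message Partner
Date-Time : 20/09/10 12:39:43}

; parse/all with a string of delimiters splits DATA at each newline
foreach line parse/all data "^/" [
    ; the first ":" separates the keyword from its value
    if pos: find line ":" [
        key: trim copy/part line pos
        val: trim copy next pos
        print [mold key "=>" mold val]
    ]
]
```

Splitting on the first ":" only keeps values such as the Date-Time line, which contain further colons, in one piece.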
Claude 20-Sep-2010 [5228x2]	if i can't use parse i will do this
Claude 20-Sep-2010 [5228x2]	but it is very strange. i heard that parse is very powerful. and now i can't use it because it is too difficult !!!!
Maxim 20-Sep-2010 [5230]	parse is very powerful and easy to use but mastering it isn't.
Janko 20-Sep-2010 [5231]	[thru "Number :" copy number [ to newline | to end ] ] maybe?
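A short sketch along the lines of this suggestion, assuming REBOL 2 and a two-line excerpt of the sample text. It uses COPY rather than a set-word: inside a rule, number: only marks the current position in the string and does not extract any text. The ws charset and the word names are illustrative.

```rebol
rebol []

mystring: {Number : 10017
Name : Disable Message Partner}

ws: charset " ^-"   ; space and tab, skipped before each value

parse/all mystring [
    thru "Number :" any ws copy number [to newline | to end]
    thru "Name :"   any ws copy name   [to newline | to end]
    to end
]

probe number
probe name
```

The [to newline | to end] alternative lets the same pattern work for the last line, which has no trailing newline.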
Maxim 20-Sep-2010 [5232x2]	although it is much more approachable than regexp in certain ways.
Maxim 20-Sep-2010 [5232x2]	here is an overblown example, with lots of extra prints, so you can see it running... it parses your above example text.
--------------------------------
rebol []

data: {Number : 10017
Name : Disable Message Partner
Application: MXS
Severity : Info
Type : Alarm Event
Date-Time : 20/09/10 12:39:43
GMT : Mon Sep 20 10:39:43 2010}

; rules
token: ["Number" | "Name" | "Severity" | "Type" | "Date-Time" | "GMT" | "Application"]
space: charset " ^-"
spaces: [any space]
content: complement charset "^/" ; all but newlines.
contents: [some content]

; used as variables in the rules
v-token: none
v-content: none

parse/all data [
    some [
        ( print "^/0>" )
        copy v-token token
        ( print "1>" probe v-token )
        spaces ; ignore
        ( print "2>" )
        ":" ; end of token
        ( print "3>" )
        spaces ; ignore
        ( print "4>" )
        copy v-content contents ; grab all but newline
        ( print "5>" probe v-content )
        [
            ; this is important, it ensures that you've got a whole line.
            [some "^/"] | end
        ]
        ( print "6>" )
        (
            ; do something with your data here
            ?? v-token
            ?? v-content
        )
    ]
]

probe "."
ask "done"
Claude 20-Sep-2010 [5234]	thank you i will test it and try to understand too ;-)