r3wp [groups: 83 posts: 189283]
  • Home
  • Script library
  • AltME Archive
  • Mailing list
  • Articles Index
  • Site search
 

World: r3wp

[Parse] Discussion of PARSE dialect

Maxim
20-Sep-2010
[5225x3]
isn't there a rebol string parser on rebol.org which has the basic 
parsing rules to filter out rebol data?
with a few search I didn't zero-in on it, but I seem to recall seeing 
that before.
otherwise, you can just do a line by line parsing, filtering out 
"keywords" at the start and then loading the data after the ":"
Claude
20-Sep-2010
[5228x2]
if i can't use parse i will do this
but it is very strange. i heard that parse is very powerfull. and 
now i can't use it because if to difficult !!!!
Maxim
20-Sep-2010
[5230]
parse is very powerfull and easy to *use*  but *mastering* it isn't.
Janko
20-Sep-2010
[5231]
[thru "Number  :"  copy number [ to newline | to end ] ] maybe?
Maxim
20-Sep-2010
[5232x2]
although it is much more approachable than regexp in certain ways.
here is an overblown example, with lots of extra prints, so you can 
see it running... it parses your above example text.
--------------------------------
rebol []

data: {Number : 10017
Name       : Disable Message Partner
Application: MXS
Severity   : Info
Type       : Alarm Event
Date-Time  : 20/09/10 12:39:43
GMT        : Mon Sep 20 10:39:43 2010}


; rules

token: ["Number" | "Name" | "Severity" | "Type" | "Date-Time" | "GMT" 
| "Application"]
space: charset " ^-"
spaces: [any space]

content: complement charset "^/" ; all but newlines.
contents: [some content]


; used as variables in the rules
v-token: none
v-content: none
parse/all data [
	some [
		(
			print "^/0>"
		)
		copy v-token token 
		(
			print "1>"
			probe v-token
		)
		spaces ; ignore
		(
			print "2>"
		)
		":" ; end of token
		(
			print "3>"
		)
		spaces ; ignore
		(
			print "4>"
		)
		copy v-content contents ;grab all but newline
		(
			print "5>"
			probe v-content
		)
		[
			; this is important, it ensures that you've got a whole line.
			[some "^/"] | end
		]
		(
			print "6>"
		)
		(
			; do something with your data here
			?? v-token
			?? v-content
		)
	]

]


probe "."

ask "done"
Claude
20-Sep-2010
[5234]
thank you i will test it and try to understand too ;-)
Maxim
20-Sep-2010
[5235]
I made it easy for you to figure out (at least as easy as can be 
 ;-)
Claude
20-Sep-2010
[5236x2]
Maxim here the result
0>
1>
Number
2>
.
done
Maxim
20-Sep-2010
[5238]
using which rebol?  I did run it through and it worked ok on 2.7.7
Claude
20-Sep-2010
[5239x2]
it stop at ":" !!!
2.7.7.4.2
Steeve
20-Sep-2010
[5241]
Much easier with R3.
assuming, 
>> src: {Number : 10017
Name       : Disable Message Partner
Application: MXS
Severity   : Info
Type       : Alarm Event
Date-Time  : 20/09/10 12:39:43
GMT        : Mon Sep 20 10:39:43 2010}

then, 


>> construct parse copy src [return while[and[some #" " #":"] remove[some 
#" "]| skip]]
== make object! [
    Number: "10017"
    Name: "Disable Message Partner"
    Application: "MXS"
    Severity: "Info"
    Type: "Alarm Event"
    Date-Time: "20/09/10 12:39:43"
    GMT: "Mon Sep 20 10:39:43 2010"
]
Claude
20-Sep-2010
[5242x2]
thank you steeve but i need it for a production server  to parse 
log file for nagios
i would prefer to use rebol 2
Maxim
20-Sep-2010
[5244x4]
claude it might be because of copy/pasting... you are on linux.. 
did you just copy the code and paste it into the console?
the best is to copy it to a file and run it.
I just (re)tried it and it does work on my system...
(windows xp box)
Claude
20-Sep-2010
[5248]
well i on linux and on linux altme do work well with ctrl-c !!!!!
Maxim
20-Sep-2010
[5249x2]
but here we are parsing and any character code has to be loaded properly.. 
in theory it should work.. but it seems not...

add this just before the ":"

		here:
		(probe copy/part here 20)


it will print what is exactly at that point ... may provide clues 
as to why the ":" isn't matching...
note: I just tried to copy/paste the above in my R2 console .. it 
worked ok..
Claude
20-Sep-2010
[5251]
0>
1>
Number
2>
 : 10017^/Name       
.
done
Maxim
20-Sep-2010
[5252]
strange... it didn't match the spaces!
Steeve
20-Sep-2010
[5253]
parse/all ?
Maxim
20-Sep-2010
[5254x2]
already is using parse/all
claude... so, did you try to run it as a script?


one thing I would do... since this is a strange error is to retype 
this:

 " ^-"


in your editor... to make sure its not using the wrong ascii character 
on your os... it should not be a problem... but there is something 
weird going on here.
Claude
20-Sep-2010
[5256]
i do a copy/paste http://host4.altme.com/altweb/rebol3/chat210.html
into a file.r  and i execut it . i have got the same error
Maxim
20-Sep-2010
[5257x3]
steeve does it work for you?
oh.. he's gone... 

anyone else care to test it?  this is a strange bug...
this is what I get in my console....

0>
1>
Number
2>
3>
4>
5>
10017
6>
v-token: "Number"
v-content: "10017"

0>
1>
Name
2>
3>
4>
5>
Disable Message Partner
6>
v-token: "Name"
v-content: "Disable Message Partner"

0>
1>
Application
2>
3>
4>
5>
MXS
6>
v-token: "Application"
v-content: "MXS"

0>
1>
Severity
2>
3>
4>
5>
Info
6>
v-token: "Severity"
v-content: "Info"

0>
1>
Type
2>
3>
4>
5>
Alarm Event
6>
v-token: "Type"
v-content: "Alarm Event"

0>
1>
Date-Time
2>
3>
4>
5>
20/09/10 12:39:43
6>
v-token: "Date-Time"
v-content: "20/09/10 12:39:43"

0>
1>
GMT
2>
3>
4>
5>
Mon Sep 20 10:39:43 2010
6>
v-token: "GMT"
v-content: "Mon Sep 20 10:39:43 2010"
Claude
20-Sep-2010
[5260]
i will try it on the server tomorrow morning (it is a windows one 
;-) ) i will keep you informed
Maxim
20-Sep-2010
[5261]
k
Claude
20-Sep-2010
[5262]
thanks a lot for your tecahting
Maxim
20-Sep-2010
[5263]
no eprobml  ;-)
Claude
20-Sep-2010
[5264x2]
teaching
;-)
Anton
20-Sep-2010
[5266]
Maxim, your first big code works for me on Linux R2.
Maxim
20-Sep-2010
[5267]
ok, so its definitely somethings screwy with his c&p on that system
Anton
20-Sep-2010
[5268x3]
That is a distinct possibility.
I tested on 
View 2.7.7.4.2  (latest)
View 2.7.6.4.2  (long time favourite prior version)
View 1.3.2.4.2  (much older version)
Ways to identify the problem:

- In AltME, copy Max's entire post which contains the code. You can 
do this with a right click when no text is selected in the message.

  Switch to the rebol console and type:   checksum read clipboard:// 
     and tell us what you get.

- Then you can also type:  print mold read clipboard://    and examine 
what is printed carefully.

- Try to reproduce the problem with shorter code and shorter input. 
Two lines of input should do.
Anton
21-Sep-2010
[5271x3]
(I don't always see consistent behaviour with AltME and rebol on 
linux, by the way.)
(With regards to the clipboard, anyway.)
I copied Max's big message containing the big code.
>> checksum read clipboard://
== 2842420
Claude
21-Sep-2010
[5274]
maxim thank a lot it's works perfectly ;-) . yesterday i was tried 
i think