World: r3wp

Join the discussions in the REBOL3 world...

[Parse] Discussion of PARSE dialect

older newer	first last
BrianH 5-Nov-2008 [2756]	Didn't you get Carl's simplicity blog? It was posted on Reddit and PLNews :)
Graham 5-Nov-2008 [2757x2]	can you use unicode for variable names? I guess so ...
Graham 5-Nov-2008 [2757x2]	allows for me one letter variables available
BrianH 5-Nov-2008 [2759]	Yes, you will be able to use Unicode for variable names. I wouldn't suggest it though: You have to use Character Map to enter half of Perl 6's operators because they did exactly what you suggested :(
Graham 5-Nov-2008 [2760]	Must be Carl's driving force to implement unicode
PeterWood 5-Nov-2008 [2761]	I think Anton is correct about Return...the Return should preceed the bracket: eg end return ("the return value") is mch better than end (return "the return value").
Anton 5-Nov-2008 [2762]	I just followed Brian on that.
BrianH 5-Nov-2008 [2763]	I was thinking that you not even have the parens at all, and just have RETURN rule.
Anton 5-Nov-2008 [2764]	ie. return [ rule ... ]
BrianH 5-Nov-2008 [2765]	That way you could do this to return the first HTML tag from data: [to "<" return thru ">"]
Chris 5-Nov-2008 [2766]	So that would replace 'true?
BrianH 5-Nov-2008 [2767x2]	Yup.
BrianH 5-Nov-2008 [2767x2]	In particular, it would return a copy, like the COPY directive, not the SET directive.
Chris 5-Nov-2008 [2769]	Like! Would that work for values? -- [to "<" copy a thru ">" "<" return a] ; - returns a if there is a < next?
Anton 5-Nov-2008 [2770]	What would you do when you need to process the data a bit first ? eg. You return tags from different places in a rule, and to distinguish them you need to also return something extra, by prepending a code to the beginning, for example.
BrianH 5-Nov-2008 [2771x2]	That's what code in parens is for. The USE directive will make that safer.
BrianH 5-Nov-2008 [2771x2]	I don't think that returning values would really work because named rules are values.
Anton 5-Nov-2008 [2773x2]	Yes, you're right. Can be achieved with parens, of course.
Anton 5-Nov-2008 [2773x2]	Looking forward to using USE.
BrianH 5-Nov-2008 [2775]	Carl was kinda weirded out by the modifying operations, but I pointed out that people do this anyway and get it wrong a lot.
Anton 5-Nov-2008 [2776]	Really ? They look so useful !
BrianH 5-Nov-2008 [2777]	They do, but he was weirded out by the idea of PARSE modifying its data in place. Apparently he doesn't do that :)
Anton 5-Nov-2008 [2778x2]	(We can live without them, though.)
Anton 5-Nov-2008 [2778x2]	What's wrong with that ? Isn't that what RNA does ?
BrianH 5-Nov-2008 [2780]	They're going in. If we were just doing what Carl wants, he wouldn't need to ask for suggestions and ask me to compile them.
Anton 5-Nov-2008 [2781x2]	(I mean enzymes that process RNA..)
Anton 5-Nov-2008 [2781x2]	You mean, Carl has already accepted them ?
BrianH 5-Nov-2008 [2783]	Everything in that Parse Proposals page has already been discussed with Carl and could go in, barring insurmountable problems with implementation. I stopped putting stuff in when he stopped working for the day. There will likely be a couple going in tonight, but Carl is actively involved in this process.
Anton 5-Nov-2008 [2784]	Well, that sounds all good.
BrianH 5-Nov-2008 [2785x3]	We've already made some changes based on these discussions, which is why there are differences from the REPs.
	It was freeing when Carl said that we could use modifier keywords like ONLY for CHANGE and INSERT. I look forward to seeing how that would affect the INTO-STRING proposal :)
	The main thing that Carl is concerned about now is that some of the proposals make use of the value calculated in a paren on occasion. I don't know why this would be a problem, but I'm sure it will be worked out or around.
Chris 5-Nov-2008 [2788x2]	Using 'remove -- a) removing a bracket only at the end of a string (as per Graham's example): parse "[this]" [remove "[" to "]" remove ["]" end]] b) where you go down a false path: parse "abcdef123" [remove "abc" "123" \| remove "abcd" "ef123"]
Chris 5-Nov-2008 [2788x2]	Would a) work? Would b) reset the string as the first rule didn't match?
BrianH 5-Nov-2008 [2790]	a) would work. b) would not likely reset the string, just like code blocks don't undo.
BrianH 6-Nov-2008 [2791x3]	You might be able to do b) like this: parse "abcdef123" [use [a] [remove ["abc" a: "123" :a] \| remove ["abcd" a: "ef123" :a] to end]] or like this: parse "abcdef123" [use [a] [remove ["abc" a: "123" :a \| "abcd" a: "ef123" :a] to end]]
	The standard method of putting the longest match first will still be the best.
	Ooo, I just figured out another possibility for b): parse "abcdef123" [to "123" reverse remove ["abc" end] \| to "ef123" reverse remove ["abcd" end]]
Chris 6-Nov-2008 [2794]	parse "[this]" [remove "[" to end reverse remove ["]"]]
BrianH 6-Nov-2008 [2795x2]	In reverse, the end is the beginning :)
BrianH 6-Nov-2008 [2795x2]	And the beginning is the end :)
Chris 6-Nov-2008 [2797x2]	Shorter with tail, perhaps? parse "[this]" [remove "[" tail reverse remove "]"]
Chris 6-Nov-2008 [2797x2]	How about this? parse "abc" ["a" to end reverse "bc"]
Graham 6-Nov-2008 [2799]	Is there anything that can be done to assist block parsing?
BrianH 6-Nov-2008 [2800]	Yup, that would work.
Graham 6-Nov-2008 [2801x2]	If you turn a string into a block and there's something present which is an illegal rebol value ... you get an error.
Graham 6-Nov-2008 [2801x2]	Is there a way such values can be turned into a legal type ....and not get an error?
BrianH 6-Nov-2008 [2803]	That sounds like unbound words. In R3 unbound words are not an illegal type.
Graham 6-Nov-2008 [2804x2]	great.
Graham 6-Nov-2008 [2804x2]	so if we have words like asdfs@ etc, we don't get illegal email types etc?
older newer	first last