World: r3wp
[Parse] Discussion of PARSE dialect
older newer | first last |
PeterWood 16-Apr-2007 [1740] | Does that imply there won't be a Unicode Charset with which to parse unicode strings? |
btiffin 16-Apr-2007 [1741x2] | There is going to be a unicode! datatype |
sorry. Didn't finish...hit wrong key... but I'm not sure what that means for charsets | |
Henrik 17-Apr-2007 [1743] | Perhaps vector! will play a part in solving the unicode problem |
Pekr 17-Apr-2007 [1744] | having Unicode datytype does not mean we have unicode datatype ;-) |
Gabriele 17-Apr-2007 [1745] | you can make a bitset with 65000 bits in r2... so why not in r3? |
Pekr 17-Apr-2007 [1746x2] | Gabriele - how do you create unicode (mostly 16bit, or varrying) charset in R2? :-) |
btw - pity R3 does not integrate any parser helpers, which could make life easier | |
Ladislav 17-Apr-2007 [1748] | what do you mean by "does not integrate" and by "parser helpers"? |
Pekr 17-Apr-2007 [1749x3] | helpers = suggestions ... historically there were 2 or 3 lists with various suggestions, to some of them even Carl said he could consider to add them. First one was Robert's site, next one was yours, and several times ono various altme channels? |
I don't know, as for me, I just wanted to|thru [a | b | c] :-) | |
I would let others more skilled parse gurus to raise their wishes ... | |
Gabriele 17-Apr-2007 [1752x2] | petr, i didn't say r3 won't improve on parse, i said that the first release may not have the new features. |
we won't stop at 3.0... there will be a 3.1 and so on... at least we hope so :) | |
Gabriele 18-Apr-2007 [1754x3] | (lost my connection yesterday night and the message i was typing...) |
parsing utf-8 in r2 is easy. utf-16 and utf-32 are less confortable but should be easy too. | |
of course with r3 it's much easier because - i guess - you can just parse the unicode! string. | |
Rebolek 24-May-2007 [1757] | Is it possible to combine two bitset!s? |
Maxim 24-May-2007 [1758] | using 'union |
Rebolek 24-May-2007 [1759x2] | great, thanks! |
Is there some way to make this work: parse "aaa" [some "a" "a"] or PARSE just don't work this way? | |
Geomol 24-May-2007 [1761] | What do you mean? >> parse "aaa" [some "a"] == true Why the second "a"? |
Rebolek 24-May-2007 [1762] | It may seem strange I know, but this is automaticaly created rule |
Geomol 24-May-2007 [1763] | Parsing for [some "a" "a"] will return false, because you've already parsed past the "a"s. |
Rebolek 24-May-2007 [1764] | OK I need to find some other way :) Is it possible to go back in parse? -1 skip doesn't seem to work. |
Geomol 24-May-2007 [1765] | I was thinking the same. I seem to remember, that at some time (some version of REBOL), -1 skip did work!? Hmm... |
Rebolek 24-May-2007 [1766] | Wasn't it just proposed for R3? |
Geomol 24-May-2007 [1767] | A clumsy way of doing it: >> parse "aaa" [some "a" p: (p: skip p -1) :p "a"] == true |
Rebolek 24-May-2007 [1768] | OK thanks, that may help |
Anton 24-May-2007 [1769x2] | That's not so clumsy. You want to backtrack and that's what you're doing. |
obviously you could use (p: back p) | |
Rebolek 24-May-2007 [1771] | Even better. Thanks Anton. Seems that "-1 skip" should not be that hard to implement |
BrianH 24-May-2007 [1772] | parse "aaa" [some [p: "a"] :p "a"] |
Rebolek 24-May-2007 [1773] | I think this needs (p: back p) before :p. |
BrianH 24-May-2007 [1774x2] | Not in my version. The p is set before the position advances past the "a", so it is already back. |
The p is reset before "a" is consumed - that is why I put [p: "a"] in []. | |
Rebolek 24-May-2007 [1776] | So why it does return 'false here? p is empty on :p. |
BrianH 24-May-2007 [1777x3] | Interesting. It seems to be setting the last p before it fails on the last iteration of "a". |
Clearly I need a temporary. | |
parse "aaa" [some [p1: "a" (p2: :p1)] :p2 "a"] | |
Anton 24-May-2007 [1780] | I have done this sort of thing before. |
Rebolek 24-May-2007 [1781] | temporary, or step back |
BrianH 24-May-2007 [1782] | A temporary will work better with parts of unknown size, and be faster too. |
Rebolek 24-May-2007 [1783] | OK |
BrianH 24-May-2007 [1784x2] | Still, you might want to apply rewrite rules to your generated parse rules - that code seems a little sloppy. |
Peephole fixing? | |
Rebolek 24-May-2007 [1786] | rewrite rules? |
Oldes 24-May-2007 [1787] | that you will not have [some "a" "a"] but just [some "a"] |
Rebolek 24-May-2007 [1788] | Well, I'm not exactly sure if that's possible, I have to do some tests |
BrianH 24-May-2007 [1789] | By rewrite rules, I mean something like what Gabriele came up with for the rebcode assembler a while ago. Since I helped refine his work, I may still have a copy somewhere. I'll take a look. |
older newer | first last |