r3wp [groups: 83 posts: 189283]
  • Home
  • Script library
  • AltME Archive
  • Mailing list
  • Articles Index
  • Site search
 

World: r3wp

[!REBOL3-OLD1]

Geomol
31-Jul-2009
[16445]
Some languages only allow 7-bit ascii in the source except for strings.
Sunanda
31-Jul-2009
[16446]
re: 666 -- I'll dig that script out and run it tomorrow :)
Geomol
31-Jul-2009
[16447]
I could use 8-bit danish letters in my REBOL source, if I would:

>> ęble: "apple"
== "apple"


But I don't do it. I'm not sure, if it's a good idea to allow this. 
I guess, 8-bit values are allowed, because it makes the lexer faster.
BrianH
31-Jul-2009
[16448x5]
All standard functions and syntax in REBOL fit within 7-bit ASCII, 
which is why R3 source is UTF-8.
UTF-8 encoded binary!
Adjusted #537 to reflect the post-alpha-39 behavior of TRANSCODE 
(can't believe I forgot to do so before now).
Geomol, please take another look...
Sunanda, #1167 created for that to-word "a," error.
Ladislav
31-Jul-2009
[16453x4]
{Is there a problem with getting operators?

>> get to word! "="
** Script error: = word is not bound to a context}


This is not operator-specific, no variable can be handled like that:

a: 1
get to word! "a"
** Script error: a word is not bound to a context
{>> get to-word '=
== op!}

- in the above code the usage of TO-WORD is just a no-op
{Brian knows the internals.} - see the http://www.rebol.net/wiki/Comparisons
article containing the general principles
{>> get to-word quote <>}

- again, TO-WORD is just no-op above
BrianH
31-Jul-2009
[16457]
Knowing the internals isn't hard. I haven't seen the native source 
- I've just seen the mezzanine source, followed the conversations, 
and read sites like the onee Ladislav linked above. The rest is deduction.
Ladislav
31-Jul-2009
[16458x3]
{so making string to words and making lit-words to words isn't quite 
the same, it seems!?
>> (to word! '=) = (to word! "=")}

- again, the first TO WORD! is just a no-op
{What's the need for QUOTE, when we have the get-word syntax?} - 
it serves a totally different purpose, e.g.:

quote (1 + 1) ; == (1 + 1)

for comparison:

first [(1 + 1)] ; = (1 + 1)

(1 + 1) ; == 2
another QUOTE:

quote  'a ; == 'a

(another way how to obtain that would be e.g.):

first ['a] ; == 'a
BrianH
31-Jul-2009
[16461]
I *really* like the new :a parameter behavior in R3 that makes QUOTE 
possible :)
Ladislav
31-Jul-2009
[16462x5]
yes, that makes the thing really useful :-)
Or, since the get-word syntax was mentioned:

quote :x ; == :x
, so, the most useful property is consistency; it always does the 
same (actually nothing)
{while other operators work ok as lit-words:
>> '=
== =}


- just a note; in the above example there is no operator, just a 
lit-word, that is handled as "valid" by Rebol loader in this case 
(as expected)
the power/simplicity of QUOTE is, that even in case the lit-word 
syntax isn't handled correctly, QUOTE works as expected:

quote <> ; == <>
Geomol
1-Aug-2009
[16467x2]
I think, the lit-word syntax should be fixed anyway.
I wasn't aware, that this doesn't work in R3:

get to word! "a"


So it's not a problem with operators, but a general change. The above 
code is valid R2 code. I can't judge, if it's a needed change or 
not, but it will trigger thoughts about R2 compability. I guess, 
the change will break a lot of R2 code!?
Henrik
1-Aug-2009
[16469]
that would be because words aren't bound by default in R3.
Geomol
1-Aug-2009
[16470]
I see. Does it make sense to not bind words by default? I feel, Carl 
might have a blog about this. :-)
Henrik
1-Aug-2009
[16471]
I think this was quite a big topic a while ago. It has something 
to do with modules.
Geomol
1-Aug-2009
[16472x2]
I came across another funny thing. Are << and >> planned as operators?

>> ? >>
No information on >>

So >> is a valid word. But >>> is not:

>> ? >>>
** Syntax error: Invalid "word" -- ">>>"
I think, the lit-word syntax should be fixed anyway.
Same can be said about get-word syntax.
BrianH
1-Aug-2009
[16474x3]
Yes, it makes sense to not bind words by default. Carl has written 
many blogs about this, going back more than a year.
There is no one shared global context anymore, so which context to 
bind to is a policy decision. Default needs to be unbound to do this
>> and << are likely allocated for user-defined operators. Please 
don't allocate >>> and <<< - it would be too hard to discourage their 
use if they are allowed syntax. We don't want REBOL to become a write-only 
language like Perl.
Sunanda
1-Aug-2009
[16477]
Any reason why NONE acts as an honorary empty block in each of these 
under R3:
    a: none forskip a 1 [print a]
    a: none forall a [print a]
    foreach a none [print a]
R2 would not be happy!
Oldes
1-Aug-2009
[16478]
Why not.. it seems to be logical to me.
Sunanda
1-Aug-2009
[16479]
Logical, maybe. Harmless, perhaps. But odd ..... and inconsistent 
with REPEAT:
    repeat a none [print a]

    ** Script error: repeat does not allow none! for its value argument
So I am wondering if it is a deep feature, or an oversight.
BrianH
1-Aug-2009
[16480x2]
The REPEAT behavior sounds like an oversight - but the none argument 
was left out of MAP on purpose, so who knows?
As for the other functions, it is an intentional change from R2, 
which reduces special-case code wrapped around FIND and SELECT.
Sunanda
2-Aug-2009
[16482]
Thanks -- I'll add a curecode so the possible oversight is on the 
checklist.
Henrik
4-Aug-2009
[16483x2]
I'm wondering now if there will be easy ways to check whether a char! 
is: lowercase, uppercase, a number or an international char.
that might be useful during PARSE
PeterWood
4-Aug-2009
[16485x4]
I think that distinguishing between upper and lower case chars is 
very difficult with Unicode.
Carl seems to have done a great job with Latin characters:

>> uppercase to string! #{C3A0}
== "Ą"

>> lowercase uppercase to string! #{C3A0}
== "ą"

>> uppercase to string! #{C48D}           
== "\u010c"
>> lowercase uppercase to string! #{C48D} 
== "\u010d"
Though don't know what the above will look like in AltME under Windows 
or Mac
Pekr
4-Aug-2009
[16489]
looks good - R with comma upon it ...
PeterWood
4-Aug-2009
[16490x2]
There seems to be some problems with other alphabets though:

>> uppercase to string! #{E382A1}         
== "\u30a1"


\u30A1 is a small katakana letter A. The unicode for a caplital katakana 
A is \u30A2
Pekr - it is actually an a with a grave accent over it in UTF-8
Gabriele
5-Aug-2009
[16492x2]
hmm, should uppercase and lowercase really work with katakana and 
hiragana? the "small" versions have a completely different meaning 
and usage than our "lowercase" has.
it seems to me, that uppercase and lowercase should not modify kana... 
but I haven't read what the unicode standard mandates here.
PeterWood
5-Aug-2009
[16494]
No doubt you are right. I haven't read the unicode standard and know 
nothing about "non-Latin" alphabets.