r3wp [groups: 83 posts: 189283]
  • Home
  • Script library
  • AltME Archive
  • Mailing list
  • Articles Index
  • Site search
 

World: r3wp

[Core] Discuss core issues

Ladislav
16-Oct-2010
[20x12]
Note to the {...} syntax. It actually is a heredoc-type syntax. The 
only trouble with it is, that it does not have the variable part.
...so it has to use escaping for #"{" and #"}"
, which is a problem, since the use of escaping defies the whole 
purpose of it
What do we call it?
 - I propose "the heredoc syntax"
Is its main purpose to make dealing with curly-brace languages easier

 - not, I would use it to transform the core test suite to REBOL (which 
 it actually isn't now)
(that is exactly what you proposed in the Testing and Tools group, 
every test could be)

#[[test
; this is my test
test]]
Moreover, you can easily embed just about any text in REBOL, not 
just the source of curly-braced languages, but even REBOL code examples, 
or any general text that comes to mind without worrying about escaping
REBOL is not just a language allowing computers to communicate (if 
it were, escaping would be enough for every string), but also a language 
allowing humans and computers to communicate. And, I do not find 
escaped strings to be easily writable/readable for me, that is why 
I prefer to have the heredoc syntax available.
So, in general, the usage of the heredoc syntax shall make REBOL 
more readable/writable/handleable for humans, which is a good thing 
(tm), if it weren't there would be no reason to not use the machine 
language still.
#[[Graham

Why can't we redefine chars eg. in some sql dialects you can define 
the terminating character.  So, why can't we redefine the another 
character temporarily so that has the same functionality and then 
{} become ordinary characters?

Graham]] - I do not understand how would such a proposal work. Can 
you be more specific, showing an example?
Generally spoken, I do know what the advantages of the heredoc syntax 
are, and I hope I succeeded to communicate that to you. The syntax 
is used in other programming languages, making them more comfortable 
for humans. Not having the syntax in REBOL is not good, especially 
taking into account, that REBOL, as opposed to the above mentioned 
languages, is not just a programming language, meaning, that in REBOL 
you can write more than just programs.
#[[Carl

Although it can be used for programming, writing functions, and performing 
processes, its greatest strength is the ability to easily create 
domain-specific languages or dialects.
Carl]]
GrahamC
16-Oct-2010
[32x2]
@Ladislav ... the { } only have special significance because the 
interpreter is so written ...  so why not make it user definable 
?
Maybe it's not possible without an extensive modification to the 
parser .. I don't know
Ladislav
16-Oct-2010
[34x2]
OK, Graham, show me, how would you handle this string:

#[[example
'^{<+-*/%([#&$a0:;=\|,.}>)]
example]]
not to mention, that I could have put in all 127 ASCII characters
GrahamC
16-Oct-2010
[36]
You would define another character(s) instead of { } to be used .. 
normally most languages have them, so if you switch to that language, 
you would use that character
Ladislav
16-Oct-2010
[37]
I would do nothing, I am interested in knowing what would you do.
GrahamC
16-Oct-2010
[38x2]
I presume the idea of this is to easily generate javascript and the 
like
You as in the generic person and not you as in Ladislav
Ladislav
16-Oct-2010
[40]
Do not forget, that the string contains more characters that need 
to be escaped, than just the #"{" or #"}"
GrahamC
16-Oct-2010
[41]
actually, you as Carl as the rebol language implementor
Ladislav
16-Oct-2010
[42]
#[[Graham

I presume the idea of this is to easily generate javascript and the 
likeĻ
Graham]] - I presume, that I wrote above, that this is not true
BrianH
16-Oct-2010
[43x3]
I was on the fence about the heredoc proposal, but now that Ladislav 
has come up with a syntax that makes sense (this was missing from 
the previous proposal) I am now all for it. One caveat though: It 
would be best if, like other string syntaxes, the syntax details 
are thrown away after loading. By this I mean that
a
{a}
#[[blah
a
blah]]

should all generate the same string. Once it is loaded, there should 
be no way to determine that it was specified as a heredoc. Heredoc 
should be syntax only, not in any way affect semantics.
I can see many uses for heredoc syntax, not just generating Javascript. 
But it would be good for that too.
Oh, and not just ASCII; full Unicode.
Gregg
16-Oct-2010
[46]
I don't think we can throw away the meta info about it being a heredoc 
string, unless we want to use Oldes's idea of adding a refinement 
to MOLD. And then you have to choose when to use it. While we can 
say that it's easy for MOLD to add escapes to curly braces, that 
doesn't mean it will be easier for humans to read generated strings 
that contain them.


I can't say I've ever needed it personally. The { } syntax works 
well, and is as much support as TCL has for heredoc strings it seems 
(we even have TRIM/AUTO to help with indenting issues).  


I want to like the idea a lot, but I only like it a little so far. 
Mainly I wonder if it's worth adding for the sake of just the curly-brace 
chars. I understand the usefulness for the curly-language and test 
dialect scenarios, I'm just not sure of the cost/benefit ratio.
BrianH
16-Oct-2010
[47x3]
No refinement to MOLD needed. MOLD should know nothing about heredocs. 
Use a separate formatter function, like this:
mold-heredoc: func [value tag [string!]] [ 

 ajoin ["#[[" tag "^/" either string? :value [value] [mold :value] 
 "^/" tag "]]"] 
]
And once it is loaded, it is a string. There is no escaping in memory.
We *really* don't want to add another string type, since that would 
lead to conversion overhead. Another syntax for the existing, unchanged 
string! type is fine though.
Ladislav
16-Oct-2010
[50x3]
Yes, my point is exactly the same, even without any new datatype, 
etc., having just a new syntax to specify strings will be of advantage.
I hope, that we all agree, that, as opposed to other string syntaxes, 
there will be no escaping in heredoc, meaning, that the two strings 
below will be equal:

{^^}

#[[
^
]]
The proof is in the pudding - even now we do have two syntaxes for 
strings, and no string contains any information specifying which 
syntax was used (it is even possible, that none, since a string read 
from a file was not defined using any of the two.
BrianH
16-Oct-2010
[53]
Interesting, I was assuming that we were going to make the tag required. 
Mention that it is optional (or possibly an empty string, if you 
prefer) in the ticket.
Ladislav
16-Oct-2010
[54x2]
Whether the tag is optional or not - well, I am not firmly at either 
side. Nevertheless, the empty tag can be considered a tag as well, 
just a special one.
Will mention that in the ticket
BrianH
16-Oct-2010
[56]
As long as the constraints I mentioned in the first comment to that 
ticket are kept, I'm all for it. Plus, no MOLD refinements.
Ladislav
16-Oct-2010
[57]
That is OK with me
BrianH
16-Oct-2010
[58x2]
Let's also stick to text. No binary data in our strings.
Some languages have raw binary heredocs, but that doesn't work well 
with Unicode strings, nor does it post in text mode well.
Ladislav
16-Oct-2010
[60]
Agreed, I do not see any need to have binary heredocs
james_nak
16-Oct-2010
[61]
maxim - regarding namespaces: Are you kidding? I'm not even going 
to touch those. My excuse is if xml-object.r can't handle it, neither 
will I. :^(
Oldes
16-Oct-2010
[62x2]
Because it's complicated? Heredoc is enough for me and I'm lucky 
I'm not alone who found it missing.
oh.. my message is reply to Gramam's:

@Ladislav ... the { } only have special significance because the 
interpreter is so written ...  so why not make it user definable 
?
I've not noticed I'm not at the end of the chat :/
BrianH
16-Oct-2010
[64]
Three reasons:

- The { and } are not chosen at random, they are the consequence 
of using those characters to delimit strings themselves.

- There aren't many characters or character sequences that can be 
optional without conflicting with other stuff in the grammar.

- Syntax processors with user-defined stuff in them are much slower 
than ones without them.
james_nak
16-Oct-2010
[65]
Hi, I have a client whose ftp username has an ampersand @ in it. 
I think that's causing a problem accessing it. Is there a way around 
that or should that work?
Sunanda
16-Oct-2010
[66]
Try read (write etc) like this:

     read [ scheme: 'ftp host: domain.com user: user-domain.com pass: 
     "password" ]

Or see what other people have tried in the past:
    http://www.rebol.org/ml-topic-index.r?i=ftp
james_nak
16-Oct-2010
[67]
Thanks Sunanda. that was it.
Ladislav
17-Oct-2010
[68x2]
Regarding the "why the heredoc syntax was proposed" question, here 
is another reason, that is important for me, proving, that even users 
not planning to use other languages than REBOL can take advantage 
of it:


In my REBOL script files, I usually write code examples, or code 
tests, that are meant to demonstrate the newly defined functions, 
or to test whether they work as expected. That code is in no way 
meant to be run every time the script is run. Therefore, I use the 
COMMENT function to ensure the example/test code does not run every 
time the script is run.


Since it is a comment, I prefer to use a string to be able to write 
it "free-form" not being bound by the requirement of loadability 
of the text. However, when the code examples in the comment contain 
special "escaped characters", this would look ugly, therefore I rather 
give up the free-formness of the comment gaining the advantage of 
not being forced to escape the special characters, but being forced 
to keep the comment REBOL-loadable. The proposed heredoc syntax can 
solve this, and similar problems nicely and naturally.
(this property just "mimics" the property of the ; single-line comments, 
which does not need any character escaping as well)