r3wp [groups: 83 posts: 189283]
  • Home
  • Script library
  • AltME Archive
  • Mailing list
  • Articles Index
  • Site search
 

World: r3wp

[!REBOL3-OLD1]

BrianH
12-Feb-2009
[11035]
This /into option proposal is based on profile-driven *language design*. 
It's a new approach to REBOL.
[unknown: 5]
12-Feb-2009
[11036x2]
I like the idea Brian.  In fact, I use that manner often in the things 
I do.
But why not make the /into default behavior?  And then use /copy 
for the alternative?
BrianH
12-Feb-2009
[11038x2]
Because the series creatiion builder model is easier for less advanced 
REBOL programmers to use. Buffers need management (as #1789 explains). 
This new model is for more advanced programming, such as mezzanine 
and library code.
Well, right now i need discussion and consensus. I have CureCode 
wishes, mezzanine modifications, new functions, and massive optimizations 
all waiting on this. I'm really blocked until this is decided on.
[unknown: 5]
12-Feb-2009
[11040]
I guess, I don't follow.  Maybe, I'm jumping to conclusions about 
what /into actually implies.  To me, /into implies that the original 
series passed to the function is what is modified and not a copy 
of it.
BrianH
12-Feb-2009
[11041]
No, that's the /no-copy option - it's a completely different thing. 
See #1789.
[unknown: 5]
12-Feb-2009
[11042]
Well, then I'm in favor of the no-copy option as the default behavior.
BrianH
12-Feb-2009
[11043]
Please read #1789 before you make that decision. The difference between 
/no-copy and /into is important.
Oldes
12-Feb-2009
[11044]
So you mean something like:
rejoin/into [a b c] out
instead of:
append out rejoin [a b c]
? I think I like it.
BrianH
12-Feb-2009
[11045]
Yes. But not for (RE)JOIN because JOIN or REJOIN /into is exactly 
the same as INSERT and INSERT REDUCE. What we *do* need to add /into 
to is (RE)MOLD, (RE)FORM, REDUCE, COMPOSE, READ, COLLECT (needs changes), 
MAP and EXTRACT.
[unknown: 5]
12-Feb-2009
[11046x2]
Brian, I read it and the option I believe should be default behavior 
is for the original series to be modified.
Which is what I thought the /into option was implying.
Dockimbel
12-Feb-2009
[11048]
BrianH: I'm 100% for /into and, if possible, it would be one of my 
top features to backport to R2.
Henrik
12-Feb-2009
[11049]
I thought that too.
BrianH
12-Feb-2009
[11050x2]
No, those are modifier functions. We already have those, and the 
algorithm for implementing a modifier is completely different than 
a builder. Some functions (like DELINE) could be changed into modifiers 
with /no-copy and that wouuld make sense. It doesn't make sense when 
the builder is creating something based on a spec or template, which 
often has a completely different datatype than the result. For these 
functions an output buffer makes more sense.
(That was a reply to Paul)
[unknown: 5]
12-Feb-2009
[11052]
Ok, then I'm for the modifier as the default operation.
BrianH
12-Feb-2009
[11053x2]
Modifiers and builders are completely different classes of functions, 
with some overlap (see DELINE or REPLACE vs. REWORD). When the function 
is making a copy of the spec with some tweaks, a /no-copy option 
makes sense, or a completely separate function with complex mezzanines. 
When the builder function creates something really different than 
the spec (like ARRAY or READ), there is no point to a /no-copy refinement 
since there is no eqivalent modifier even in theory.
That is the difference between /no-copy (which turns a builder into 
a modifier), /copy (which turns a modifier into a builder), and /into 
(which makes a builder insert into a provided buffer instead of creating 
its own series).
Henrik
12-Feb-2009
[11055]
Brian, now you say it inserts. Does it clear the series and inserts, 
just inserts or overwrites the existing content?
[unknown: 5]
12-Feb-2009
[11056]
So read-io in R2 would be an example of /into?
BrianH
12-Feb-2009
[11057x2]
Henrik: Inserts and *doesn't* overwrite the existing content. Paul: 
Yes.
The functions for which /into would give us the absolutely biggest 
bang-for-the-buck are REDUCE and COMPOSE. Have you ever done chained 
calls to INSERT or APPEND of intermediate blocks created by REDUCE 
or COMPOSE? All that code goes away.
[unknown: 5]
12-Feb-2009
[11059]
Ok, now I follow what your saying.
Henrik
12-Feb-2009
[11060]
I don't follow. I'm an example guy.
[unknown: 5]
12-Feb-2009
[11061x2]
henrik, the key is your supplying a buffer.
for /into.
BrianH
12-Feb-2009
[11063]
READ/into would replace READ-IO, which doesn't exist in R3.
Henrik
12-Feb-2009
[11064]
Paul, I think I follow that far. What happens to the buffer is what 
I don't get. When BrianH says it inserts, that implies a lot of overhead 
on manipulating that buffer to me.
[unknown: 5]
12-Feb-2009
[11065]
I think the same way Henrik.
BrianH
12-Feb-2009
[11066]
Henrik, the trick is that it would work *exactly* like INSERT. If 
you want to replace the contents, CLEAR or REMOVE/part it yourself. 
If you want it on the end, use TAIL. If you want to chain, go for 
it. The calling code can manage their buffers however they want, 
and we can provide buffer management helper functions if we need 
to. I did extensive testing and INSERT is the most general function 
to model after.
Henrik
12-Feb-2009
[11067]
BrianH, now I'm confused. Is INSERT without tail much faster on blocks 
in R3 than in R2? Or do you say: "reduce/into [blurp] tail out" ?
[unknown: 5]
12-Feb-2009
[11068]
BrianH, but are we not doubling in size the memory space for that 
function?  If were passing it a huge series to begin with - would 
we not end up with a buffer that is potentially as big or bigger 
than the original series if were inserting data?
BrianH
12-Feb-2009
[11069x5]
If you *only* model after INSERT it minimizes the changes to the 
builder code. We aren't saving memory on a single call, we allowing 
careful reuse of memory on multiple calls. I've done the tests, the 
overall memory overhead goes down drastically.
Most of the time a function using build/into functions will preallocate 
a series, build some data into it, then return the series. Builder 
functions can use lower-level builder functions. You get a reduction 
in intermediate series creation overall.
Give me a moment and I'll show you an example, using variants of 
the COLLECT function.
; Here's a version of COLLECT without /into, a typical example of 
a builder.
collect: func [

 "Evaluates a block, storing values via KEEP function, and returns 
 block of collected values."
	body [block!] "Block to evaluate"
	/local output
][
	output: make block! 16
	do func [keep] body func [value /only] [
		apply :append [output :value none none only]
		:value
	]
	output
]

; Here's COLLECT with the /into option.
collect: func [

 "Evaluates a block, storing values via KEEP function, and returns 
 block of collected values."
	body [block!] "Block to evaluate"
	/into "Collect into a given series, rather than a new block"
	output [series!] "The series to output to"
][
	unless output [output: make block! 16]
	do func [keep] body func [value /only] [
		output: apply :insert [output :value none none only]
		:value
	]
	either into [output] [head output]
]


Note that the version with /into also lets you use other series types 
than just block!. This option added to REDUCE and COMPOSE would let 
you create parens and paths as well, even though REDUCE and COMPOSE 
can only take block! specs.
This is where we get our "bang for the buck": minimal changes, maximum 
benefit.
Henrik
12-Feb-2009
[11074]
Is INTO not meant to be handled natively?
BrianH
12-Feb-2009
[11075x3]
The /into option will be handled natively when added to natives, 
but handled by mezzanine code when added to mezzanines. It's a minimal 
change either way. Backports to R2 of native /into options might 
need mezzanine wrappers though.
The overhead of /into is two comparisons per function, and one assignment 
per insert in the function. Really low.
There aren't many applicable mezzanine builders right now - just 
COLLECT, EXTRACT and REWORD (proposed) - so the REBOL /into overhead 
will be small. And when more REBOL builder functions get made, we'll 
be able to make them more efficiently.
[unknown: 5]
12-Feb-2009
[11078]
BrianH, if you feel those mezzanine changes are necessary then I'm 
all in favor of it.
BrianH
12-Feb-2009
[11079x5]
It's already done for REWORD, which is blocked because of critical 
map! bugs. An easy addition to EXTRACT and tweak to COLLECT.
EXTRACT and COLLECT enhancements submitted.
New discovery: You can assign an op! to another word and it still 
works as an op!, infix style. Want to rename your ops? Now you can 
:)
>> &: :and
== op!
>> false & true
== false
New discovery: APPLY treats 'a and :a parameters like a, ignoring 
the special treatment that specifying parameters that way would otherwise 
give you. This is a great thing: Now you can write functions that 
function values without the code injection risk that those special 
parameters have.
Janko, to reanswer your question: "will R3 have a way to define custom 
infix words?"

Yes, at least custom words to refer to existing infix functions for 
now. In theory this new behavior could be used to make new op! functions, 
but there will likely not be a facility to do so until the plugin 
model is finalized, and the redirect-to-an-action functionality may 
remain even then.
[unknown: 5]
12-Feb-2009
[11084]
We can already rename our ops in 2.7.6