World: r3wp

Join the discussions in the REBOL3 world...

[!REBOL3]

older newer	first last
BrianH 3-May-2011 [8349]	Btw, this comment in #1882: "and since you've requested that lit-word! and lit-path! be returned to their R2-style inconsistency" may not be an accurate representation of your proposal (here earlier in conversation). You might be proposing that R3 do a better job at being inconsistent than R2 is doing (as demonstrated in #1434). If so, cool.
Gregg 3-May-2011 [8350]	I prefer convenience, but understand the concerns about security. Less aggressive evaluation by DO doesn't solve the security problem though, does it? If we say "Never DO untrusted data", DO can provide more convenience. Of course, that means it may be less convenient if you have to evaluate untrusted data, but at least the line is clear.
Geomol 3-May-2011 [8351]	>> o: make object! [f: does [42]] >> do in o 'f ; This is a problem, as nothing seems to be happening! >> o/f == 42 I'm not sure, I understand the security concern.
Ladislav 4-May-2011 [8352x3]	Frankly, there is no security problem which can be influenced by this.
	Stating otherwise is just pretending
	If somebody wants to use DO, he is responsible for knowing what he is doing
BrianH 4-May-2011 [8355x2]	Pretending that security doesn't matter is a worse policy. Here is what would resolve the security issue: - Putting warnings in the docs for DO, in the same section where they talk about the special treatment of functions and blocks. - Make parameters not work, and don't do blocks and parens through word values, same as R2's DO of path values. - Make sute that we don't try to make set-words and set-paths do assignment when you DO them. Treat them like get-words and get-paths. Together, those restrictions would make DO of word and path values no more insecure than DO of block and paren values. For functions, we have APPLY.
BrianH 4-May-2011 [8355x2]	sute -> sure
Maxim 4-May-2011 [8357]	btw, I've been using apply in R2.7.8 and it works really well :-)
BrianH 4-May-2011 [8358]	DO of block and paren values is something that we can say is secure enough already, assuming that variables and such are protected and secured, so that is a good set of restrictions to follow for words and paths. Calling functions through inline words is secure enough if you can control the binding and writeablility of those words. DO of function values has the argument problem, but it's known and has built-in workarounds (APPLY, putting function calls in parens), and we already have simple ways to screen for them.
Gregg 4-May-2011 [8359x2]	DO is seductive, because sometimes I want to create (easily) a "dialect environment" and just use DO to evaluate my dialect., safely and securely. Is there a security page in the docs (I don't see one in the R3 docs right now)? If not, that would be good to have. If we have a list of functions and operations you shouldn't use on untrusted data, and what the risks are, that's a good start.
Gregg 4-May-2011 [8359x2]	And, as Brian mentions, having workarounds or being able to screen for exploitable features.
Sunanda 4-May-2011 [8361]	I'd still like to see the sort of safe evaluation as dreamt of in this older ML thread: http://www.rebol.org/ml-display-thread.r?m=rmlNVBC
BrianH 4-May-2011 [8362x2]	There isn't much of a security page right now, though it would be a good idea to make one if only to document the stuff that doesn't currently work (like SECURE in the last 4 versions). I don't know if anyone else has made a concerted effort to attack REBOL and then fix the security problems found.
BrianH 4-May-2011 [8362x2]	I would love it if we as a community were to really think through the (UN)PROTECT model, because the current model is incomplete (even for the stuff that works) and the proposed model is starting to look a bit awkward to use. Keep in mind that PROTECT may also be used to make series sharable among tasks, but that this isn't implemented and there is likely a better way to do this. I would love it if there was a good security model that can integrate well with REBOL semantics.
Kaj 4-May-2011 [8364]	Capabilities
BrianH 4-May-2011 [8365]	Won't work within a process, only on a process boundary.
Kaj 4-May-2011 [8366]	Depends on if you make it work
BrianH 4-May-2011 [8367]	It's inherent in the semantics of REBOL, a side effect of the code-vs-data thing.
Kaj 4-May-2011 [8368]	Do you know the Genode architecture?
BrianH 4-May-2011 [8369]	That might work for SECURE but not for (UN)PROTECT.
Kaj 4-May-2011 [8370]	Why not?
BrianH 4-May-2011 [8371]	(I am trying to write a long starting message here and have to put it in the clipboard to answer these questions, sorry.)
Kaj 4-May-2011 [8372]	That's OK, I'm interested in your opinion. I haven't formulated an answer for myself yet
BrianH 4-May-2011 [8373]	Some factors to consider about the REBOL semantic limitations: - There is no such thing as trusted-vs-untrusted code in a REBOL process, nor can there be, really. Levels of trust need to be on a process boundary. You can't (even hypothetically) do LOAD/secure level or DO/secure level, but you can do LAUNCH/secure level. - If you want to make something readable or writeable to only certain code within a process, binding visibility tricks are the only way to do it. The only way to ensure that your code has access to something and other code doesn't is to make sure that other code can't even see yours. This is why BODY-OF function returns an unbound copy of the body in R3, not the original. - We need a way to make protection stick so you can't unprotect things that are protected, or protect things that need to stay unprotected, but still allow changes to the protection status of other stuff. The currently proposed model does this through a chain of PROTECT and UNPROTECT calls, then a PROTECT/lock, not allowing unlocking if there is a SECURE 'protect. However, the proposed model seems too difficult to use, and as the pre-110 module system demonstrated, people won't use something that is too complex to use, or will use it badly. We need a better way of specifying this stuff.
Kaj 4-May-2011 [8374x3]	OK, that's the current REBOL model, but you asked about alternative models. Capabilities are not about trust levels, but about capability tokens. They're meant to take trust out of the equation
	Trying to hammer every hole shut with SECURE and PROTECT is the classic method of sticking all your fingers in the dike. When you run out of fingers for all the holes, the water comes gushing in. Capabilities are about making it impossible to get through the next dike. It's a different way of compartmentalising
	An E language fan once visited Carl to explain to him that true capabilities can be implemented in REBOL very well. Carl apparently rejected it based on complexity, but if the problem with the current new R3 method is rising complexity, maybe this decision is worth reviewing
BrianH 4-May-2011 [8377x4]	Now, for your questions, Kaj. Mezzanines execute arbitrary code with DO. You can't even know if something is code or not until you pass it to a dialect interpreter like DO or PARSE - code is data. Blocks don't have bindings, only their any-word contents do, so the code blocks of functions are not bound to functions, only their contents are. The same goes for functions in modules or objects - they aren't bound to their objects or modules, only referenced by them. (making this up on the fly) It could be possible to make the binding visibility of words be defined as a set of capability tokens as part of the object spec (in the SPEC-OF sense), and have the function spec dialect be extended to contain such tokens. This would have to be checked with every word access, and we would have to be careful to make the model in such a way to avoid unauthorized privilege escalation. Then changes in capabilities would happen on the function call stack, which is task-specific. The problem with this is making sure code can't make functions with more capabilities than the code making them currently possesses. Though R3 doesn't really have a user model, it does have a task model and we could make the capability level task-specific. Code could constrain capabilities for code it calls, but we don't want privilege escalation at function creation time. It would be possible to have privilege escalation at function call time if the function called was created by something with the necessary capabilities. Drawbacks: - If we do this for binding visibility, this means a capabilities check would go into every word access. Word access would be SLOW. - This doesn't add anything to the PROTECT/hide model, which handles binding visibility without the slowdown. Capabilities would be like the SECURE model, but more flexible, so that's something to consider there. What SECURE protects is heavy enough that a capabilities check wouldn't add much to the overhead.
	Remember, R3 currently has three separate security models: SECURE, (UN)PROTECT, and PROTECT/hide.
	Of the 3, SECURE seems like the most likely to be enhanceable with capabilities. Functions could be enhanced by capabilities specs, where the function code could only create other functions of equal to or lesser capabilities than are currently available in the call stack. Once a function is created, it could run code with the capabilities that it was created with (with the exception of that function creation limitation earlier). There could be a function like DO that reduces capabilities and then does a block of code, and maybe MAKE module! could be made to use that function based on capabilities in the module spec.
	Since MAKE object! isn't a hybrid function like MAKE module! (which calls sys/make-module*), we probably don't want to reduce capabilities on a per-object basis.
Kaj 4-May-2011 [8381x3]	It seems to me that you are still talking in terms of plugging all the holes in the myriad of capability that would supposedly be around. This is not how true capabilities work. They implement POLA: there is no capability unless it is needed, and in that case, it needs to be handed down as a token by the assigner of the work. If the boss doesn't have the token, the employee will by definition not be able to do the work
	REBOL is a virtual machine with strong typing (as long as extensions are protected well enough). You have complete control over the world the code executes in, so the potential is there to make the process/thread separation irrelevant for security
	I don't see why capabilities would need to be checked on every word access. The critical point is the binding, and REBOL uses this well to optimise word access. Capabilities would need to be determined at binding time, so that binding will fail if the required capability token isn't available
BrianH 4-May-2011 [8384]	Three security models: - SECURE protects access to external resources. - (UN)PROTECT protects changeability of internal structures. - PROTECT/hide manages binding visibility. We don't jsut need to protect files, we need to protect things like passwords in memory, access to capability tokens, etc.
Kaj 4-May-2011 [8385x2]	Which can all be done in a capabilities model
Kaj 4-May-2011 [8385x2]	Have you studied the E language, and Genode for that matter?
BrianH 4-May-2011 [8387]	If you use capability tokens to protect binding visibility, then every word access would need to check against a capability token.
Kaj 4-May-2011 [8388]	I still don't see that. Binding doesn't change on every access; that's the point of this optimisation
BrianH 4-May-2011 [8389]	Binding visibility, not binding change.
Kaj 4-May-2011 [8390]	First you have visibility, than binding, than access. Why go through all those stages on each access?
BrianH 4-May-2011 [8391]	OK, let's work this through for only PROTECT/hide to see how the concept would affect things. PROTECT/hide works by making it so you can't make new bindings to a word - that way words that are already bound can be accessed without extra overhead. Adding capabilities to this means that you could create new bindings to the word if you had the token, but not if you didn't. However, with PROTECT/hide (currently) the already bound words don't get unbound when they are hidden, just new bindings to that word, and if you have access to such a prebound word value then you can make new words with that binding using TO, which effectively makes prebound words into their own capability tokens. So PROTECT/hide as it is now could be the basis of a capability system.
Kaj 4-May-2011 [8392]	Cool :-)
BrianH 4-May-2011 [8393]	The problem that a capability system has of making sure capability tokens don't leak is pretty comparable to the problem with leaking bindings that we already have to take into account with the PROTECT/hide model, so switching to a capability system for that model gains us nothing that we don't have already. And we've already solved many leaking binding problems by doing things like having BODY-OF function returning an unbound copy of its code block rather than the original. The PROTECT/hide model works pretty well for that, so it's just a matter of closing any remaining holes and making sure things are stable.
Kaj 4-May-2011 [8394]	The fundamental gain is that you switch to a POLA model from the current model where all code in a REBOL process has all capabilities unless you manage to stop some of them
BrianH 4-May-2011 [8395]	For PROTECT/hide we already have that. So let's move on to the other security models.
Kaj 4-May-2011 [8396]	Does all code get created PROTECT/hidden?
BrianH 4-May-2011 [8397]	No, but all code created after the word is hidden doesn't get access, and only code created before the hiding has access to a token (bound word) that will let it create new code with access. You get the same sharp separation between code with access and code without.
Kaj 4-May-2011 [8398]	A POLA model is where you start out with no access. If you have to PROTECT/HIDE afterwards, that's the reverse of POLA
older newer	first last