r3wp [groups: 83 posts: 189283]
  • Home
  • Script library
  • AltME Archive
  • Mailing list
  • Articles Index
  • Site search
 

World: r3wp

[!Cheyenne] Discussions about the Cheyenne Web Server

Dockimbel
25-May-2009
[4953x2]
If you can think of a solid method for allowing concurrency in the 
same user session, while preserving session data from corruption, 
I'll be very interested.
I should add : without loosing performances.
Maxim
25-May-2009
[4955]
a part from having handler tcp interaction, I can't think of any 
a part from identifying a way to detect which session data is being 
used by both scripts.
Dockimbel
25-May-2009
[4956]
It's always possible to write session data to disk or to a DB and 
rely on the DB locking system to ensure that data is safe, but there's 
a performance overhead in that case that I'm afraid, may not result 
in any speed gain for the user. It also requires to have a DB engine 
installed on all servers running Cheyenne using RSP sessions, which 
might be overkill in some situations.
Maxim
25-May-2009
[4957x4]
this being said... mod-remark will be using persistent liquid nodes 
for session and post/get parameter side of things, and its possible 
that I might store session data outside of the server itself, using 
a liquid-tcp node. 


so that access to actuall session data could be brokered outside 
the server's jurisdiction, extremely efficiently, locked one value 
at a time. and any access/change could be perpetuated to other handlers 
when they attempt to lock to one session value.
the idea being that interacting via xml request or full page refresh, 
with only part of a page parameters, creates the same output  :-)
sorry, stores the same session information.  allowing the same output 
depending on the required refresh.
you could actually have one page sending xml requests to the server 
in ajax, and another page refreshes with the results of those requests...


the second page could also send a page request with the some parameters 
sent, and the server will reflect all changes to the current session/page 
so far.
Dockimbel
25-May-2009
[4961x2]
I thought also about doing a TCP based session server (with session 
variable granularity), but the overhead in synchronization would 
hit performances (and scalability) too much IMO. If each time you're 
setting a new value to a session variable in a script, you have to 
query a TCP server (even a local one), that would have a huge impact 
on a script execution time.
That impact may not be significant (nor noticeable) if your server 
is handling 1req/s, but it would be if you have to handle 50req/s.
Maxim
25-May-2009
[4963x4]
might still be much faster than a db call.  but the real game-changing 
aspect of mod-remark is that a document in ram, will only render 
parts of itself which are changed, or which cannot be cached, so, 
unless you've actually changed a parameter, it knows that it hasn't 
changed and will not need to ask for the broker to give it back the 
value.  in fact, it wont even process that part of the page, only 
using the data in its local node cache directly.
header/ footer / current login state box/ static page content... 
this can be shared accross sessions, across the entire site, or as 
granular as a single page request.
a news reader could have all the news items cached already and all 
its doing is assembling those which are matched via the request... 
really only doing a fast rejoin of many items, and then a rejoin 
of header news and footer and then spit it out.
all the code which was originally called to create the news items 
is skipped... until some site parameter changes how the news items 
are built, which will automatically cause the news item to reprocess... 
 but on the page and server side, nothing special needs to be done. 
 liquid's internal caching will fork the processing by itself, and 
then start serving the new cached data the very next time any single 
news item is asked for again.
Dockimbel
25-May-2009
[4967]
That's quite close to how RSP are working. Each RSP script is compiled 
to optimized REBOL code and cached in memory. A RSP request will 
then result in evaluating the REBOL code which mostly just append 
static and dynamic parts in the output buffer.
Maxim
25-May-2009
[4968x4]
nice.
the difference is that with liquid, its explicit, and I can explicitely 
share some data accross many scripts, but just some of them.    the 
headers can be half cached, where each page uses half the common 
header, inserts its local menu into it, and that header is now permanently 
cached for all further requests of that page, but it used part of 
data which was already cached by another page.
the current issue is trying to share the cached data between different 
handlers... which pits me against a liquid value broker, but its 
the creation of the data on the broker which might become complex... 
so not yet shure how I will proceed..
with threads, this issue would be much simpler to solve.
Dockimbel
25-May-2009
[4972]
Max, applying dataflows principles to a web framework looks really 
appealing, when will be able to see some demos?
Maxim
25-May-2009
[4973x4]
I am aiming for sometime next week, just a few simple prototypes 
to explore how to handle the difference in  nature of web requests 
opposed to persistent  data.
I am pretty sure I wont tackle handlers for a while...  because of 
the code sharing complexity added.
if the lazy processing equates to as much saved processing as I have 
seen with graphics work so far, I might not even need external handlers 
to handle many requests/sec.  It would be nice if you could pit your 
stress testing setup against mod-remark once it starts actually doing 
its thing.  :-)
at this point, this is still all research, but the more I apply liquid 
to specific problems, the more confidence I am generating,  as it 
almost always lives up to its potential.  When it doesnt', its usually 
that I just didn't find the proper way to use it.
Robert
25-May-2009
[4977]
query-string: Ah, the problem comes from a wrong rewrite rule for 
GET requests. I need to figure out how to handle this case.


That's what I like about web-stuff: There are so many possibilities 
and places that something can fail...
Dockimbel
25-May-2009
[4978]
Graham: Is this a cheyenne issue?
Robert: Yes, I just checked the LOG files. [...]

So, no, it's not an issue with Cheyenne.
Graham
25-May-2009
[4979]
I didn't think so
Robert
25-May-2009
[4980]
No, it's not. The LOG files are correct but I didn't saw the double 
" ?" ... Sorry, as said: To many places where something can go wrong.


Perhaps, it€s possible to throw an error if a male-formed query string 
is found.
Maxim
25-May-2009
[4981]
I'd rather throw parties on *female* formed queries   ;-D
Dockimbel
25-May-2009
[4982]
It's hard to define "mal-formed" precisely. If you strictly apply 
the RFC rules for URL, you'll find out that a lot of web sites are 
using "mal-formed" URL.
Maxim
25-May-2009
[4983]
its actually a global property of php that you can change the url 
parameter separators for your site in order to fight back against 
bots and make hacking a bit more complicated.
Dockimbel
25-May-2009
[4984]
Wouldn't that break most of JS/AJAX frameworks?
Maxim
25-May-2009
[4985x4]
not if you edit the code so the urls match your site's separators. 
 I have come across some sites with strange separators.
but as written in the php docs... obfuscation isn't a replacement 
for real security.
so most people put their time on tackling the real problems like 
sql injection and https use properly
and use https properly.
Maxim
29-May-2009
[4989x2]
doc, I have noticed a usage in your mods which you might want to 
change for speed reasons.


you use the word return a lot... and in my tests, it causes a BIG 
performance hit on function calling... I really mean noticeable when 
you do loop profiling ... a minimum of 20% slow down IIRC.

so instead of:

    phase: func [svc req conf][
        if declined? [return none]
        ...
        if let-others [return false]
        ...
        true
    ]

you really should be doing 

phase: func [svc req conf][
        if accept? req [
              ...
             true =  let-others? req
       ]
] 

just my two cents.
oops... reversed the emaning of that last return value...  this is 
better

phase: func [svc req conf][
        if accept? req [
              ...
             end-phase? req
       ]
]
Graham
29-May-2009
[4991]
so just drop off the end?
Janko
29-May-2009
[4992]
Doc.. just a tiny bug report.. when I am working with latest version 
in debug mode firebug reports me javascript warning: I haven't been 
digging into if this is supposed to be a bug or intentional...
>> test for equality (==) mistyped as assignment (=)?

>> [Break on this error] if (node = document.getElementById('menu' 
+ id)) {\n
Maxim
29-May-2009
[4993]
yes graham, all functions return a value, a rare few return unset!.

'IF, for example,  returns none when the condition isn't true.
Graham
29-May-2009
[4994]
well, I wasn't aware of a hit using 'return.
Maxim
29-May-2009
[4995]
returns enables the catch/throw as does break IIRC, so there is overhead 
in the function's calling.
Graham
29-May-2009
[4996]
so 20% of what?
Maxim
29-May-2009
[4997x3]
let me do an example
>> s: now/precise a: func [][return none] loop 1000000 [a]  difference 
now s
== 0:00:01.032

>> s: now/precise a: func [][none] loop 1000000 [a]  difference now 
s
== -0:00:00.296
in this case the difference is 70%
Graham
29-May-2009
[5000]
now if I have a 1000 line function that either teminates in a return 
or just the value .. how much does it affect it?
Maxim
29-May-2009
[5001x2]
the larger the func, the less hit, you can see that each function 
call is taking more time to initialise because of the return.
its just a question of maximising performance, and I know dockimbel 
is very serious about improving cheyenne's .