r3wp [groups: 83 posts: 189283]
  • Home
  • Script library
  • AltME Archive
  • Mailing list
  • Articles Index
  • Site search
 

World: r3wp

[!REBOL3-OLD1]

BrianH
31-Dec-2008
[8965]
The current server is implemented as CGI scripts, though FastCGI 
is planned for later.
PeterWood
1-Jan-2009
[8966]
Not trusting non-ascii characters infers  that the current desing 
of  RebDev is "ignorant"of character encoding. If that is the case, 
it is a shame as RebDev could have been a great example of an "up-to-date" 
application built with R3.
Henrik
1-Jan-2009
[8967]
Doesn't this all depend on the limitations of the front-end used?
Kaj
1-Jan-2009
[8968]
It has to be this way as long as the server is running on R2. Says 
nothing about R3 or the design of RebDev
PeterWood
1-Jan-2009
[8969x2]
Even if the server is running on R2, all the strings could be stored 
with a consistent encoding method, such as ISO-8859-1. Of course, 
there'd be a lot of work detecting the client encoding method and 
converting all input strings to the chosen consistent method. Most 
of this work would be needed even if the server supported Unicode 
strings.
Personally, I think ignoring character encoding does say something 
about the design of RebDev.
Gabriele
2-Jan-2009
[8971x2]
considering that R3 will probably just send everything as UTF-8, 
I don't think the server has any reason at all to worry about encodings.
you have to worry about encodings when you do conversions. i don't 
see where the R2 server is doing any of that. Also, with UTF-8 there 
is no need to worry about encodings on searches and things like that. 
The only issue could be sorting, but that is also region specific 
so it's a completely different issue that R3 cannot solve globally 
either.
PeterWood
2-Jan-2009
[8973]
As you say, if all the input is UTF-8 everything will be fine. I 
mistakenly thought that the inputs from different environments would 
have been differenlty encoded as they are with R2.
BrianH
2-Jan-2009
[8974x2]
That would have to be the case with R2 clients, as the client is 
the part that handles character encoding. However, there are no R2 
clients yet. The messages appear to be UTF-8 encoded end-to-end, 
stored in binary on the server, which is encoding agnostic. Once 
we have R2 clients, they will have to handle the codepage-to-UTF-8 
encoding, or just stick to ASCII.
And yes, it does say something about the design of RebDev, that character 
encoding issues of R2 won't affect it, by design.
Reichart
2-Jan-2009
[8976]
This is one of those things where a picture is worth a thousand words. 
 We need a diagram of the hardware and software set up, and show 
WHERE encoding becomes a problem.

For example, if you paste some text from a Word doc into a webbrowser, 
 this then gets moved to the server.  Then it gets rendered out again...you 
wil run into problems with encoding.

Word use some SPECIAL encodoing for things like " : - and '
Kaj
3-Jan-2009
[8977]
That sounds like an issue between Word and the browser edit field, 
or between Word and the clipboard
Henrik
3-Jan-2009
[8978]
RebDev now has a webfeed suitable for iPhone.
Gabriele
3-Jan-2009
[8979]
Reichart, I still think that is not the fault of Word (except for 
changing your character on you) or the browser.
Henrik
3-Jan-2009
[8980]
Some shots of rebdev on iPhone:

http://rebol.hmkdesign.dk/files/r3/gui/176.png
http://rebol.hmkdesign.dk/files/r3/gui/177.png
Geomol
3-Jan-2009
[8981]
:-)
Steeve
3-Jan-2009
[8982]
wahouuuuuuu !!!!
Henrik
3-Jan-2009
[8983]
I think this will be open, so someone can get to prettify it later. 
The mobile version is simpler to do so it's done first.
NickA
3-Jan-2009
[8984]
That's great!
amacleod
3-Jan-2009
[8985]
Henrik, Is that in iphone's browser?
Henrik
3-Jan-2009
[8986]
yes
amacleod
3-Jan-2009
[8987]
10 seconds to reply...your getting slow
Henrik
3-Jan-2009
[8988]
:-) it runs in the iPhone simulator, but it works the same as on 
a real iPhone. If you have XCode and OSX Leopard you can test it 
out yourself.
amacleod
3-Jan-2009
[8989]
So its html...why is mobile easier?
Henrik
3-Jan-2009
[8990x2]
because it's just a simple HTML page. you can't post yet and you 
can't login to your own account yet like the console version. it's 
meant to be used with lesser browsers for smaller phones than iPhone 
as well.
and it was convenient for Carl since he owns an iPhone.
Reichart
3-Jan-2009
[8992]
Gab, not an issue of "fault", I'm simply modeling examples of problems 
I see on dozens of websites, due to encoding "issues".  Don't care 
where the fault is, just that we need better black box tools fro 
dealing with it.
PeterWood
3-Jan-2009
[8993x2]
Reichart: From my point if view, the root of the problem is not so 
much that Word replaces key certain key sequences with other characters 
but on eof character encoding. The text will look okay on your machine 
but unless it is correctly converted may display incorrectly on other 
machines.


As I understand, Rebol/View uses the users default "codepage" on 
Windows and  MacRoman encoding on Mac. AltME doesn't take into account 
the different the different text encodings so when I type £ (a British 
pound sign) you will probably see some thing different.
 .
eof => the
Chris
3-Jan-2009
[8995x2]
Pound is similar over many code pages: more diverse -- ½ -- Mac: 
omega; Win/west: 1/2; Win/east: right-smart quote.
On rebol.org, you get a ? -- http://www.rebol.org/aga-display-posts.r?post=r3wp453x8946

On rebol.net, we shall see -- http://www.rebol.net/altweb/rebol3/chat453.html
Sunanda
3-Jan-2009
[8997x2]
REBOL.org shows a ? because if blindly emits all Alte pages as charset=utf-8.

If (this works in Firefox....You change your default for the page 
-- view/character encoding / western iso-8859-1) then:
-- Peter's post shows a GBP [for his char 163]
-- Chris' post shows a 1/2 [for his char 189]
_Part_ of the problem is that altme server --> client channel (ie 
what is sent to each of us when someone posts a message) does not 
seem to contain charset information......I may be wrong about that, 
but I do not see it anywhere.

I have not idea if client-->server channel does send that information.
Chris
3-Jan-2009
[8999]
I don't imagine it would -- I'm not sure there's a way within Rebol 
to detect the charset in use...
Reichart
3-Jan-2009
[9000]
Peter....I'm confused....

Word, nor REBOL have anything to do with the problem....


Encoding problems happen on hundreds of websites (big, popular website), 
that do not use REBOL, and where Word is not the source.


I'll state again... we need strong clear black box logic that unifies 
all character maps (yeah, all).

WE need a single unified character system.
btiffin
3-Jan-2009
[9001]
If I was a betting man, by 2020 UTF-8 will reign and compsci grads 
will need a history book to learn about ASCII.
PeterWood
4-Jan-2009
[9002]
Reichart ...you are right thep problem is one of encoding. My point 
is that because Rebol/View uses different encoding systems on different 
platforms it  is left to the application to either ignore the encoding 
differences or handle them.


This may be quite difficult if, as Chris indicated, it is not possible 
to determine which Windows Codepage is in use from Rebol/View. 


Tthere is a single unified character system (Unicode ) but there 
are at least five different ways of representing it (UTF-8, UTF-16LE, 
UTF-16BE, UTF-32LE & UTF-32BE). Standardisation is a long way off.
Maarten
4-Jan-2009
[9003x2]
Never copy what you can do better your self.
English got in the way, I think that should read: "Never copy what 
you can improve easily"/
Gabriele
4-Jan-2009
[9005]
Reichart, what I mean is that you don't even need tools, as long 
as the server software properly emits only utf-8 and reports that 
it accepts only utf-8... after doing that, if there are still browsers 
that do not comply, then we can start talking about tools (which 
are trivial, most of the time, by the way).
Sunanda
4-Jan-2009
[9006]
Another part of the problem, at least from the webpage viewpoint, 
is that each of us could be posting AltME messages in different charsets.


All the HTML emitters for AltME worlds that I know of (AltME's own, 
REBOL.org, REBOL.net) emit a single webpage file, so it can only 
have one charset.


To do it right, each post should be emitted as a separate document/frame 
item. Then they'll each have their own charset.....That's a lot of 
extra work. Let's hope Gabriele's solution (a utf-8 universe) happens 
before that becomes essential.
Reichart
4-Jan-2009
[9007]
Gab, yes...I think we are in agreement.  However, this is not just 
about websites for me.  


I guess, what I want for us to reach a level where THIS is not something 
programmers even talk about.  The core routines take care of themselves.
Chris
4-Jan-2009
[9008x2]
Brian -- ASCII is a subset of UTF-8...
With QM, I try to assume (and enforce) UTF-8 (declaring on forms, 
html escaping everything ASCII+), but it's definitely a chore.
Reichart
4-Jan-2009
[9010]
Exactly, it is the chore part that concerns me.
Maarten
4-Jan-2009
[9011]
Yes. It's 2009 and we (as in industry) don't even get the computers 
to do *this* for us.
Reichart
4-Jan-2009
[9012]
THAT is my POINT!
Maarten
4-Jan-2009
[9013x2]
It's like when traveling you know you can have dinner. It's just 
that you have to learn to explain that the food you're being served 
is not poisoned for every language, country.
You just want food!!!!