World: r4wp
[#Red] Red language group
Pekr 26-Sep-2012 [2182x2] | I don't get any chars for Chinese, and in the Greek string third char is incorrect too ... |
font I use is Lucida Console ... | |
Henrik 26-Sep-2012 [2184] | You can use any Unicode character in your Red words. - ok, that was what I was looking for, as R3 can do this. Thanks. |
Jerry 26-Sep-2012 [2185] | Graham is right. In Chinese, "World Hello" is better than "Hello World". You can change it if you want, Doc. :-) But I saw other languages, such as Falcon, use "Hello World" in Chinese to demo their Unicode support. |
DocKimbel 26-Sep-2012 [2186x2] | Pekr: switch to Consolas, which has the best range of Unicode glyph support. |
(you won't be able to display CJK anyway though) | |
Pekr 26-Sep-2012 [2188x3] | I can select only two fonts there - raster, or Consolas .... |
btw - Czech version of hello world does not display any character correctly :-) | |
dunno if bug, or I should somehow adapt font, anything else, etc. | |
PeterWood 26-Sep-2012 [2191] | Is the source file of your Czech version UTF-8 encoded? |
DocKimbel 26-Sep-2012 [2192x3] | Third Greek character incorrect: be sure you're using the latest revision. |
Pekr: try typing your Czech characters in Notepad (it has excellent Unicode support). | |
(just select UTF-8 when saving) | |
Pekr 26-Sep-2012 [2195x2] | hello.red is already UTF-8, I just added one line and saved ... |
going to lunch, willing to debug, if I can get any instructions .... | |
DocKimbel 26-Sep-2012 [2197x2] | Be sure you've saved it in UTF-8. |
Anyway, we need more people testing Unicode support for Windows console, just in case we missed something. | |
Pekr 26-Sep-2012 [2199] | well, anyway - how is R2 able to read UTF-8? |
DocKimbel 26-Sep-2012 [2200] | It reads it as a stream of bytes. As UTF-8 doesn't use null bytes in its encoding (except for codepoint 0), it can be fully loaded as string! or binary! in R2 (but you'll see garbage for non-ASCII characters). |
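The byte-stream behaviour described above can be illustrated outside Red and R2. This is a minimal Python sketch, not R2's actual loader: it encodes the Czech phrase as UTF-8, checks that no null byte appears, and then views each byte as one character (Latin-1 is used here only as an assumed stand-in for a byte-per-character string) to show the kind of garbage a non-Unicode runtime would display.

```python
# Sketch only: Latin-1 stands in for a "one byte = one char" string model.
text = "Dobrý den světe"
raw = text.encode("utf-8")

# UTF-8 emits a zero byte only for codepoint 0, so none appears here.
has_null = 0 in raw

# Each non-ASCII character takes two bytes in this phrase (ý and ě).
extra_bytes = len(raw) - len(text)

# Viewing the bytes one per character yields garbage for non-ASCII letters.
garbled = raw.decode("latin-1")

print(has_null, extra_bytes, repr(garbled))
```

The ASCII letters survive unchanged, which is why such a script still loads and mostly reads correctly in a byte-oriented interpreter.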
PeterWood 26-Sep-2012 [2201] | If anybody can provide the UTF-8 chars (hex values) for Hello World in Czech. I'll run a test. |
DocKimbel 26-Sep-2012 [2202x3] | Peter: should be "Dobr^(C3)^(BD) den sv^(C4)^(9B)t" |
I've just tested it on Windows console (using Consolas font), it works fine. | |
The above string doesn't work as-is in Red though, you should pass the codepoints escaped instead of the UTF-8 encoding. | |
PeterWood 26-Sep-2012 [2205] | I noticed :-) |
DocKimbel 26-Sep-2012 [2206] | I haven't implemented full char! support yet, so I can't write a Red script to print me the right codepoint values...(char! will be implemented later today though). |
Jerry 26-Sep-2012 [2207] | Hello in many languages http://www.omniglot.com/language/phrases/hello.htm |
DocKimbel 26-Sep-2012 [2208x2] | Good source, even Klingon is there! :-) |
Hello in Klingon: nuqneH (What do you want?) - used when confronted by another | |
PeterWood 26-Sep-2012 [2210] | Code points are 00FD & 011B |
DocKimbel 26-Sep-2012 [2211x2] | I really need to learn more, my Klingon is currently limited to " Qapla' " only (means Goodbye). |
Thanks Peter. It works for me with "Dobr^(FD) den sv^(011B)t" | |
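The difference between the UTF-8 byte values posted first (C3 BD, C4 9B) and the codepoints that worked (00FD, 011B) can be checked mechanically. The Python sketch below is illustrative only; both helper names are hypothetical, and the four-digit zero padding in the caret escape is an assumption (the log itself uses both `^(FD)` and `^(011B)`).

```python
def utf8_hex_to_codepoints(hex_bytes: str) -> list[str]:
    """Decode UTF-8 bytes given as hex and return codepoints as hex strings."""
    decoded = bytes.fromhex(hex_bytes).decode("utf-8")
    return [f"{ord(ch):04X}" for ch in decoded]


def caret_escape(text: str) -> str:
    """Hypothetical helper: write non-ASCII chars as ^(XXXX) codepoint escapes.

    A literal "^" in the input is left as-is, so this is not a full escaper.
    """
    return "".join(
        ch if ord(ch) < 128 else f"^({ord(ch):04X})" for ch in text
    )


print(utf8_hex_to_codepoints("C3BD C49B"))
print(caret_escape("Dobrý den svět"))
```

Passing the UTF-8 bytes as two separate escapes, as in the first string, yields two unrelated characters per letter, which matches the stray output reported later in the log.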
Pekr 26-Sep-2012 [2213] | Above works ... but when I write it directly in Notepad (and the file claims it is UTF-8), it does not work ... strange then ... |
Henrik 26-Sep-2012 [2214] | Not sure if Notepad is the best for UTF-8 work... |
DocKimbel 26-Sep-2012 [2215x3] | Pekr: it might be a BOM issue with Red loader, I don't remember testing it... |
Pekr: try to set the "encoding" field to UTF-8 in the saving panel (Save as...). | |
Here, I have no issue using Notepad to write Red Unicode scripts. | |
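Whether a BOM was actually the cause is not confirmed in this log. As a rough illustration of what a loader would need to handle, the Python sketch below detects and drops the three-byte UTF-8 byte order mark (EF BB BF) that some editors prepend when saving as UTF-8. The function name is hypothetical and this is not the Red loader's code.

```python
import codecs


def strip_utf8_bom(data: bytes) -> bytes:
    """Return data without a leading UTF-8 BOM (EF BB BF), if one is present."""
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):]
    return data


with_bom = codecs.BOM_UTF8 + b"Red []"
print(strip_utf8_bom(with_bom))
```

To check a saved file by hand, reading its first three bytes in binary mode and comparing them with `codecs.BOM_UTF8` is enough.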
Pekr 26-Sep-2012 [2218] | it is set to UTF-8 already .... |
DocKimbel 26-Sep-2012 [2219] | You can still download Notepad++ (or any other text editor with decent Unicode support). I have to drop my good old TextPad as it doesn't have good Unicode support. |
Pekr 26-Sep-2012 [2220x5] | I will do some more testing later ... |
There are definitely some bugs somewhere ... | |
where can I find the codepage for our chars, so that I can give you escaped values to try? some chars of the Czech extended alphabet are OK, some are not ... | |
the above-mentioned C4 9B is causing the following output at the end of the phrase - strange .... http://www.xidys.com/pekr/red/red-unicode-bug.jpg | |
I used no punctuation char, just "e", then changed it to "ě" (with hook above it), and it added those strange chars to the end of the string ... | |
DocKimbel 26-Sep-2012 [2225x2] | Pekr: you should look at our more recent posts, the first string I posted had the wrong codes. The right string is: Dobr^(FD) den sv^(011B)t |
You can find the codepoints you need here: http://en.wikipedia.org/wiki/List_of_Unicode_characters | |
Ladislav 26-Sep-2012 [2227] | Doc, a minor nitpicking: it is a vocative and thus the correct spelling should be Dobr^(FD) den sv^(011B)te |
Pekr 26-Sep-2012 [2228x3] | OK, but why do I need those codes in the first place? |
In R3, if script is in the UTF-8 format, I can imo directly type it in Notepad ... | |
... but - it has been a long time since I tried it, so not sure ... | |
DocKimbel 26-Sep-2012 [2231] | Pekr: I'm giving you the codes because it seems something is going wrong with your editor. In my Notepad or any other decent text editor here, I can type (or copy/paste them from the web) without any issue. |