World: r3wp
[!REBOL3 GUI]
older newer | first last |
Andreas 11-Aug-2010 [2444] | Very nice! |
Gregg 11-Aug-2010 [2445] | Woohoo! Congratulations to the team. |
Graham 11-Aug-2010 [2446] | That Tibetan script looks hard to read ... |
Gregg 11-Aug-2010 [2447] | The typographer had poor handwriting. |
Graham 11-Aug-2010 [2448x3] | If you look at the sample here http://www.omniglot.com/writing/tibetan.htm the script seems to start from just above the top line and drops to the bottom with some accents above. |
This is a screen shot from the Moscow tibetan OCR project http://www.buddism.ru///ocrlib/OCRLib21_07_2010.png Again the text looks very different to that in Henrik's image | |
I guess we need to see what it is supposed to look like :) | |
Gregg 11-Aug-2010 [2451] | And I thought you were being funny. Different baseline logic? I can't say. |
Graham 11-Aug-2010 [2452x2] | It just looked different to what I've seen before posted to the OCR mailing list I subscribe to. |
Doesn't mean it's wrong though! | |
Andreas 11-Aug-2010 [2454] | Hmm, now that you say it. The vertical alignment of the Arabic also looks a bit off. |
Graham 11-Aug-2010 [2455x4] | I don't' read Chinese either but there doesn't seem to be any spacing between the words |
Jerry's console script looks a lot better too but it's traditional Chinese http://rebollovesjerry.blogspot.com/2008/03/rebol-30unicode.html | |
Chinese is a heiroglyph based system so there needs to be spacing between each word ... | |
BTW, the "Chinese" script looks more like Korean or Japanese ...but I'm no expert | |
Graham 12-Aug-2010 [2459] | we are coming to the opinion that what is said to be Chinese is actually Japanese... |
WuJian 12-Aug-2010 [2460x2] | You're right |
That is actually Hiragana - a Japanese syllabary | |
Graham 12-Aug-2010 [2462] | Good job we didn't let this error propagate ... otherwise we might get 1billion complaints! :) |
Steeve 12-Aug-2010 [2463] | Actually, when I looked at the katakana showed here, i felt something weird (yeah, back in the time, I learned them) I couldn't remember the 5th one. then, I checked (http://fr.wikipedia.org/wiki/Katakana) Eh ! Right in the spot. (Wi) is not used anymore. |
Pekr 12-Aug-2010 [2464] | Cool - that is kind of milestone for View apps - finally being able to display more than one charset :-) As for spacing needed, that is not problem of View unicode support, or is it? |
Graham 12-Aug-2010 [2465x2] | Kerning .. who's responsibility? |
and tracking | |
Cyphre 12-Aug-2010 [2467] | Hey guys, don't be so picky ;) about the test text strings. I only cut'n'pasted random chars from some of these pages http://www.alanwood.net/unicode/#links that's all. I personally don't plan to spend time on all the 'unicode normalization' and other tricky stuff and details. It seems there is enough linguistic experts even for tibetan script! So I'll kindly left that part as an exercise to the comunity :-P |
Pekr 12-Aug-2010 [2468] | OK, and what can be done to get more quality to the font display? |
Cyphre 12-Aug-2010 [2469] | BTW, the "Chinese" example is taken from this http://www.alanwood.net/unicode/cjk_compatibility.html which should be chinese/japanese/korean. I'm really not expert in this(nor I would like to be) so I just hope I haven't created any offending word from the random chars in some of the language :) |
Graham 12-Aug-2010 [2470] | so, it's all a mixture :) |
Cyphre 12-Aug-2010 [2471] | what can be done to get more quality to the font display? I personally would make research about the technique McSeem(AGG author) published in his well known font rendering related article and tried to use the core idea of his demo application for making 'production quality' version which can be then used in the HostKit code base. |
Graham 12-Aug-2010 [2472x2] | How does the baseline get set? |
I'm wondering if the Tibetan baseline should be at the top and not the bottom | |
Maxim 12-Aug-2010 [2474] | cyphre... seems like the best plan, for the best quality font rendering on any platform :-) |
Cyphre 12-Aug-2010 [2475] | Graham, sorry, I really don't know the rules how code points in Unicode should be positioned. That needs to be studied and corrrections being made in the text rendering/layouting loop. Just a note, even now the text rendering/layouting code ismore than 50Kb of C++ code so either someone needs to study all the Unicode rules and make new replacement or publish patches to the current version. |
Graham 12-Aug-2010 [2476] | oh well, at least it means omeone from tibet can study the C++ source :) |
Cyphre 12-Aug-2010 [2477] | Maxim, yes I agree. But currently I have plenty of other tasks with higher priority so unles something changes this needs to wait or someone else needs to contribute. |
Graham 12-Aug-2010 [2478] | So, does this mean these chars render inside a text field and we can backspace, del etc? |
Cyphre 12-Aug-2010 [2479x3] | Graham, teeling the truth I'm glad I can write Czech chars properly (yes I'm selfish in that case :)) and I don't care about the details of oter scripts unless I need them ;) |
But seriously, even if you don't know C++ it should be theoreticaly even write own positioning of the chars in the RICHTEXT dialect at Rebol level. | |
yes, all these chars are treated as any other chars in the RICHTEXT dialect so you can format it, select, edit. | |
Graham 12-Aug-2010 [2482] | so where can we download this? :) |
Cyphre 12-Aug-2010 [2483] | I guess all these gfx changes will be part of the next official HostKit release. |
Pekr 12-Aug-2010 [2484] | font quality code - that will have to be done one day, as the View app font quality is really not nice. So let's hope that one day it will move up in the priority list :-) |
Cyphre 12-Aug-2010 [2485] | (I still need to do a few changes before handing it over to Carl) |
Pekr 12-Aug-2010 [2486] | Cyphre - what's next on your part? Effect pipeline? |
Cyphre 12-Aug-2010 [2487] | Yes, effects are next on the list. Hope I'll get to them during today. |
Graham 12-Aug-2010 [2488] | What's the mechanism of selecting a different charset ? Just windows ?? |
Cyphre 12-Aug-2010 [2489] | There is no charset selection. You just provide valid UTF-8 codes for the appropriate unicode chars, thats all. Also ofcourse you need to have font that contains that chars. I was using 'Arial Unicode MS' font in the test screens which is a huge font containing big chunk from all the unicode pages. |
Graham 12-Aug-2010 [2490] | so you can't just type Czech at the keyboard? |
Pekr 12-Aug-2010 [2491] | maybe you can type Czech, but you have to somehow "convert" from local codepage to the Unicode on the go (as you type?) :-) Sorry for my very basic understanding of how things might work :-) |
Cyphre 12-Aug-2010 [2492x2] | I guess you can as the key events will give you that nuicode chars you are typing (if your OS supports unicode) but this was not tested yet. I'll try it today. |
...because R3 already supports char! values bigger than 255 so this should work transparent IMO | |
older newer | first last |