Pirates@Home logo

Pirates@Home

Berkeley Open Infrastructure
BOINC!
for Network Computing
Home Help Status Forums Glossary Account

Character encoding problems.

log in

Advanced search

Message boards : Help! : Character encoding problems.

Author Message
Profile Mchl
Volunteer tester
Avatar
Send message
Joined: 23 Sep 04
Poland
BOINC@Poland
Credit: 1,549.7
RAC: 0.00
Joined: Sep 23, 2004
Verified: May 1, 2009
Punishment: Mess Duty
Message 2254 - Posted: 16 Jan 2006 | 15:18:33 UTC
Last modified: 16 Jan 2006 | 15:20:32 UTC

I noticed, taht forum has some problems with displaying Polish chcaracters

---> Zażółć gęślą jaźń.

(This is probably the sentence with most possible number of non-standard characters ;) )

There are no such problems on teams' pages for example:
http://pirates.spy-hill.net/team_display.php?teamid=121
____________
Me Pirrrate name be Mad William Flint an' don't ye dare t' call me with any other name me lad!
Arrr!

Profile Wormholio
Captain
Avatar
Send message
Joined: 6 Jun 04
United States
Away
Credit: 4,065.6
RAC: 0.00
Joined: Jun 6, 2004
Verified: Mar 13, 2008
Dubloons: 3
Pieces of Eight: 10
Punishment: Aztec curse
Message 2301 - Posted: 17 Jan 2006 | 12:49:39 UTC - in response to Message 2254.

I noticed, taht forum has some problems with displaying Polish chcaracters

---> Zażółć gęślą jaźń.

(This is probably the sentence with most possible number of non-standard characters ;) )

There are no such problems on teams' pages for example:
http://pirates.spy-hill.net/team_display.php?teamid=121


Noted for the log. I do know that the message boards go through an extra layer of sanitization, which may affect the display (only). As I'm writing this I can see in the quoted text what looks like the correct sentence.

I don't have an immediate fix but we will watch this thread as we upgrade to BOINC 5.3, and beyond...
____________
-- Eric Myers

"Education is not the filling of a pail, but the lighting of a fire." -- William Butler Yeats

Profile Mchl
Volunteer tester
Avatar
Send message
Joined: 23 Sep 04
Poland
BOINC@Poland
Credit: 1,549.7
RAC: 0.00
Joined: Sep 23, 2004
Verified: May 1, 2009
Punishment: Mess Duty
Message 2318 - Posted: 17 Jan 2006 | 17:41:47 UTC - in response to Message 2301.


I don't have an immediate fix but we will watch this thread as we upgrade to BOINC 5.3, and beyond...


Aye!
Indeed it looks all right when quotin'

____________
Me Pirrrate name be Mad William Flint an' don't ye dare t' call me with any other name me lad!
Arrr!

Profile KWSN - A Shrubbery, arrr
Avatar
Send message
Joined: 18 Jan 06
Nepal
The Knights Who Say Ni!
Credit: 7,488.4
RAC: 0.00
Joined: Jan 18, 2006
Verified: Mar 4, 2010
Dubloons: 3
Pieces of Eight: 8
Punishment: Aztec curse
Message 2449 - Posted: 21 Jan 2006 | 2:08:36 UTC

Not necessarily a board problem. I'm pretty certain it won't display characters that aren't in your Windows font.
____________

Profile Mchl
Volunteer tester
Avatar
Send message
Joined: 23 Sep 04
Poland
BOINC@Poland
Credit: 1,549.7
RAC: 0.00
Joined: Sep 23, 2004
Verified: May 1, 2009
Punishment: Mess Duty
Message 2466 - Posted: 21 Jan 2006 | 9:42:44 UTC - in response to Message 2449.

Not necessarily a board problem. I'm pretty certain it won't display characters that aren't in your Windows font.


The problem is, it splits one character in two. In the sentence above, word 'zażółć' has 6 letters, 4 of them special. If you count characters being displayed, you'll notice there are 10 characters, 8 of them speical. It seems like encoding problem to me. It may be conected with 16bit unicode standard, but this is my guessing.
On the other hand, I know how Polish letters look, when displayed with non CE (Central Europe) font. They sure don't look like these.
____________
Me Pirrrate name be Mad William Flint an' don't ye dare t' call me with any other name me lad!
Arrr!

Profile Ananas
Send message
Joined: 23 Mar 05
Germany
Nordlichter
Credit: 378.4
RAC: 0.00
Joined: Mar 23, 2005
Verified: Mar 28, 2009
Pieces of Eight: 5
Punishment: Mess Duty
Message 2559 - Posted: 23 Jan 2006 | 23:01:13 UTC
Last modified: 23 Jan 2006 | 23:02:20 UTC

Profile Ananas
Send message
Joined: 23 Mar 05
Germany
Nordlichter
Credit: 378.4
RAC: 0.00
Joined: Mar 23, 2005
Verified: Mar 28, 2009
Pieces of Eight: 5
Punishment: Mess Duty
Message 2560 - Posted: 23 Jan 2006 | 23:04:31 UTC
Last modified: 23 Jan 2006 | 23:21:03 UTC

A trick made them appear : I switched the input page to ISO-8859-1 before I typed them. I wonder if that would be a quick bugfix ?

Without switching to 8859 it looks a little not so nice either : äöüÄÖÜß

For SZTAKI this is a very massive problem btw., it would be really good if there was a way to solve it. I will test something in the next post ...


edit : I have not the slightest idea what I wrote down there but it looks correct :-)

Profile Ananas
Send message
Joined: 23 Mar 05
Germany
Nordlichter
Credit: 378.4
RAC: 0.00
Joined: Mar 23, 2005
Verified: Mar 28, 2009
Pieces of Eight: 5
Punishment: Mess Duty
Message 2561 - Posted: 23 Jan 2006 | 23:11:29 UTC

Profile Pepo
Chief Petty Officer
Volunteer tester
Avatar
Send message
Joined: 13 Sep 04
Slovakia
TeamVision42
Credit: 928.1
RAC: 0.00
Joined: Sep 13, 2004
Verified: Aug 4, 2009
Dubloons: 3
Pieces of Eight: 5
Punishment: Cat o' Nine Tails
Message 2562 - Posted: 24 Jan 2006 | 1:15:36 UTC
Last modified: 24 Jan 2006 | 1:36:33 UTC

Profile Ageless
Chief Petty Officer
Volunteer tester
Avatar
Send message
Joined: 20 Jul 04
Netherlands
Machinae Supremacy
Credit: 1,524.1
RAC: 0.00
Joined: Jul 20, 2004
Verified: Jul 9, 2011
Dubloons: 3
Pieces of Eight: 7
Punishment: Walk Plank
Message 2563 - Posted: 24 Jan 2006 | 3:47:40 UTC - in response to Message 2301.

I don't have an immediate fix but we will watch this thread as we upgrade to BOINC 5.3, and beyond...

Try editing http://pirates.spy-hill.net/language_select.php ... It's what it's there for. Mchl may want to test the link on his Polish setup.
____________
Jord.

Used to be a single voice that vanished in a crowd. Vague just like a distant sun when hidden by the clouds.
Found a way to surface and to speak my truth aloud. Be powerful. Stand fast and proud

Profile Ananas
Send message
Joined: 23 Mar 05
Germany
Nordlichter
Credit: 378.4
RAC: 0.00
Joined: Mar 23, 2005
Verified: Mar 28, 2009
Pieces of Eight: 5
Punishment: Mess Duty
Message 2564 - Posted: 24 Jan 2006 | 7:18:15 UTC
Last modified: 24 Jan 2006 | 7:34:22 UTC

Profile Ageless
Chief Petty Officer
Volunteer tester
Avatar
Send message
Joined: 20 Jul 04
Netherlands
Machinae Supremacy
Credit: 1,524.1
RAC: 0.00
Joined: Jul 20, 2004
Verified: Jul 9, 2011
Dubloons: 3
Pieces of Eight: 7
Punishment: Walk Plank
Message 2565 - Posted: 24 Jan 2006 | 8:09:40 UTC

Ananas, your text (all of it) shows up correctly in UTF-8 code (Unicode).
Mozilla Seamonkey 1.5 Alpha here.
____________
Jord.

Used to be a single voice that vanished in a crowd. Vague just like a distant sun when hidden by the clouds.
Found a way to surface and to speak my truth aloud. Be powerful. Stand fast and proud

Profile Ananas
Send message
Joined: 23 Mar 05
Germany
Nordlichter
Credit: 378.4
RAC: 0.00
Joined: Mar 23, 2005
Verified: Mar 28, 2009
Pieces of Eight: 5
Punishment: Mess Duty
Message 2566 - Posted: 24 Jan 2006 | 8:10:28 UTC
Last modified: 24 Jan 2006 | 8:32:38 UTC

Profile Pepo
Chief Petty Officer
Volunteer tester
Avatar
Send message
Joined: 13 Sep 04
Slovakia
TeamVision42
Credit: 928.1
RAC: 0.00
Joined: Sep 13, 2004
Verified: Aug 4, 2009
Dubloons: 3
Pieces of Eight: 5
Punishment: Cat o' Nine Tails
Message 2567 - Posted: 24 Jan 2006 | 11:30:31 UTC

Ananas, could you please make another attempt, with texts from http://www.boinc.sk/ page?

Thanks.
____________
Peter .-)

Profile Ananas
Send message
Joined: 23 Mar 05
Germany
Nordlichter
Credit: 378.4
RAC: 0.00
Joined: Mar 23, 2005
Verified: Mar 28, 2009
Pieces of Eight: 5
Punishment: Mess Duty
Message 2569 - Posted: 24 Jan 2006 | 13:25:54 UTC

Profile Ananas
Send message
Joined: 23 Mar 05
Germany
Nordlichter
Credit: 378.4
RAC: 0.00
Joined: Mar 23, 2005
Verified: Mar 28, 2009
Pieces of Eight: 5
Punishment: Mess Duty
Message 2570 - Posted: 24 Jan 2006 | 13:30:36 UTC

Новостное агентство LENTA.RU сообщает, что NASA говит проект, отдаленно напоминающий проект РВ.

" Специалисты космического агентства NASA, ожидающие прибытия аппарата Stardust с образцами звездной пыли 15 января, объявили о поиске добровольцев для нового проекта. Желающие поучаствовать в проекте Stardust@home смогут помочь в поиске частиц звездной пыли, собранных "пылесборником" аппарата, передает DPA. Для этого, пояснили организаторы проекта на собрании Американского астрономического общества во вторник, им понадобится подключение к Интернету и специальная бесплатная программа.

Поиск звездной пыли будет более интерактивным процессом, так как добровольцам предстоит тщательно рассматривать снимки поверхности уловителя на предмет пробоин, оставленных частицами внеземного вещества. Специальный микроскоп автоматически отсканирует всю площадь поверхности пористого вещества в уловителе и сделает около 1,5 миллиона снимков, каждый из которых будет отправлен на изучение четырем участникам программы.

Всем желающем принять участие в проекте будет направлено тестовое задание. В качестве награды за выполненную работу NASA предоставляет добровольцам право давать имя найденным им частицам звездной пыли. Если приземление капсулы с образцами, собранными аппаратом Stardust, пройдет успешно, то снимки "виртуального микроскопа" будут доступны пользователям Интернета в середине марта. ...

Profile Ananas
Send message
Joined: 23 Mar 05
Germany
Nordlichter
Credit: 378.4
RAC: 0.00
Joined: Mar 23, 2005
Verified: Mar 28, 2009
Pieces of Eight: 5
Punishment: Mess Duty
Message 2571 - Posted: 24 Jan 2006 | 13:31:33 UTC
Last modified: 24 Jan 2006 | 13:39:49 UTC

Profile Ananas
Send message
Joined: 23 Mar 05
Germany
Nordlichter
Credit: 378.4
RAC: 0.00
Joined: Mar 23, 2005
Verified: Mar 28, 2009
Pieces of Eight: 5
Punishment: Mess Duty
Message 2572 - Posted: 24 Jan 2006 | 17:20:56 UTC
Last modified: 24 Jan 2006 | 17:24:08 UTC

hm ... the Japanese one looked good with Firefox@work but shows only ???'s with Mozilla@home. I guess Firefox comes with some sets installed that the Mozilla suite doesn't have yet

Profile Pepo
Chief Petty Officer
Volunteer tester
Avatar
Send message
Joined: 13 Sep 04
Slovakia
TeamVision42
Credit: 928.1
RAC: 0.00
Joined: Sep 13, 2004
Verified: Aug 4, 2009
Dubloons: 3
Pieces of Eight: 5
Punishment: Cat o' Nine Tails
Message 2575 - Posted: 24 Jan 2006 | 20:27:59 UTC
Last modified: 24 Jan 2006 | 20:28:31 UTC

The Russian cyrillic text seems to be correct, no wrong characters observed here, but the Slovak one is not. I tried different viewing encodings, but it did't help.

There are few wrong characters, you can check yourself with the source page, it is evident.

The first one I noticed on all places is "z" with reversed "^" over it, in Windows (Unicode) called "Latin Letter Z With Caron", I don't know how in X-Windows. It was replaced by something like "_,".

Another one is "Latin Letter S With Caron", this character seems to be replaced with something (double dot horizontally over the character position) which is called "Diaeresis" or also "Umlaut" in German language.
____________
Peter .-)

Profile Nikolay A. Saharov
Send message
Joined: 14 Oct 04
Russia
Russia
Credit: 527.2
RAC: 0.00
Joined: Oct 14, 2004
Verified: Jan 17, 2010
Pieces of Eight: 4
Punishment: Mess Duty
Message 2576 - Posted: 24 Jan 2006 | 20:32:55 UTC

Yes, the russian text is correct.

Profile Pepo
Chief Petty Officer
Volunteer tester
Avatar
Send message
Joined: 13 Sep 04
Slovakia
TeamVision42
Credit: 928.1
RAC: 0.00
Joined: Sep 13, 2004
Verified: Aug 4, 2009
Dubloons: 3
Pieces of Eight: 5
Punishment: Cat o' Nine Tails
Message 2577 - Posted: 24 Jan 2006 | 20:45:45 UTC - in response to Message 2563.
Last modified: 24 Jan 2006 | 21:06:03 UTC

Try editing http://pirates.spy-hill.net/language_select.php ... It's what it's there for. Mchl may want to test the link on his Polish setup.

Comments to this page, from my point of view:
There should be these two (or three) choices there (it is actually valid for all Boinc projects, usually there is "cs | Czech" as choice):

cz | Czech (Cestina)
sk | Slovak (Slovencina) (I'm not able to write local names correctly)
cs | ( ) - hard to say, there was no such common language, the shortcut was valid until some 15 years ago when Czechoslovakia was split into separate Czech and Slovak republiks.

But both czech and slovak languages share very similar latin alphabets from the same code page (Central European, ISO-8859-1 or Windows-1250).
____________
Peter .-)

Profile Ananas
Send message
Joined: 23 Mar 05
Germany
Nordlichter
Credit: 378.4
RAC: 0.00
Joined: Mar 23, 2005
Verified: Mar 28, 2009
Pieces of Eight: 5
Punishment: Mess Duty
Message 2580 - Posted: 24 Jan 2006 | 21:53:50 UTC
Last modified: 24 Jan 2006 | 22:08:03 UTC

I cannot make those with caron (think it's called Haschek here), tried several character sets, even ISO-8859-2 which should contain them didn't do the job - sorry, I cannot help with those, I guess something has to be done in the PHP and/or database scripts to fix that.

Best would be to go to UTF with all the processing and pages I guess.

Profile Pepo
Chief Petty Officer
Volunteer tester
Avatar
Send message
Joined: 13 Sep 04
Slovakia
TeamVision42
Credit: 928.1
RAC: 0.00
Joined: Sep 13, 2004
Verified: Aug 4, 2009
Dubloons: 3
Pieces of Eight: 5
Punishment: Cat o' Nine Tails
Message 2583 - Posted: 24 Jan 2006 | 23:17:40 UTC - in response to Message 2577.

[quote]But both czech and slovak languages share very similar latin alphabets from the same code page (Central European, ISO-8859-1 or Windows-1250).

Excuse me, please, I'm blind and wrong: ISO-8859-2. (Too late to edit my post.)
____________
Peter .-)

Profile Pepo
Chief Petty Officer
Volunteer tester
Avatar
Send message
Joined: 13 Sep 04
Slovakia
TeamVision42
Credit: 928.1
RAC: 0.00
Joined: Sep 13, 2004
Verified: Aug 4, 2009
Dubloons: 3
Pieces of Eight: 5
Punishment: Cat o' Nine Tails
Message 2600 - Posted: 25 Jan 2006 | 15:53:52 UTC
Last modified: 25 Jan 2006 | 15:55:58 UTC

Profile Pepo
Chief Petty Officer
Volunteer tester
Avatar
Send message
Joined: 13 Sep 04
Slovakia
TeamVision42
Credit: 928.1
RAC: 0.00
Joined: Sep 13, 2004
Verified: Aug 4, 2009
Dubloons: 3
Pieces of Eight: 5
Punishment: Cat o' Nine Tails
Message 2601 - Posted: 25 Jan 2006 | 16:10:08 UTC - in response to Message 2600.

The only way is to write under Western ISO-8859-1 mode and then also view the message under some non-UTF8 mode. But still the S and Z characters are wrong.
While editing in the Western ISO mode all characters are correct.
____________
Peter .-)

Profile Mchl
Volunteer tester
Avatar
Send message
Joined: 23 Sep 04
Poland
BOINC@Poland
Credit: 1,549.7
RAC: 0.00
Joined: Sep 23, 2004
Verified: May 1, 2009
Punishment: Mess Duty
Message 2637 - Posted: 27 Jan 2006 | 20:30:29 UTC - in response to Message 2601.
Last modified: 27 Jan 2006 | 20:39:46 UTC

Profile Mchl
Volunteer tester
Avatar
Send message
Joined: 23 Sep 04
Poland
BOINC@Poland
Credit: 1,549.7
RAC: 0.00
Joined: Sep 23, 2004
Verified: May 1, 2009
Punishment: Mess Duty
Message 2639 - Posted: 27 Jan 2006 | 20:43:44 UTC - in response to Message 2637.

Back to UTF-8...

With my last try I generated some kind of error I suppose.
When I tried to edit my post, I got some HTML into edit window, and bottom of the page (together with 'Submit' button) did not display...

This is what I got:

?Ɗ??ӏ? ?檳?󟿼/textarea>





<P>



[url=/]Home[/url]
[url=/forum_help_desk.php]Help Desk[/url]
[url=/forum_index.php]Message Boards[/url]
[url=/server_status.php]Server Status[/url]
[url=/home.php]Your Account[/url]



[url=http://pirates.spy-hill.net/]Return to Pirates@Home main
page[/url]



Copyright © 2006
Capt. Jack Sparrow


____________
Me Pirrrate name be Mad William Flint an' don't ye dare t' call me with any other name me lad!
Arrr!

Profile Pepo
Chief Petty Officer
Volunteer tester
Avatar
Send message
Joined: 13 Sep 04
Slovakia
TeamVision42
Credit: 928.1
RAC: 0.00
Joined: Sep 13, 2004
Verified: Aug 4, 2009
Dubloons: 3
Pieces of Eight: 5
Punishment: Cat o' Nine Tails
Message 3304 - Posted: 24 May 2006 | 11:45:48 UTC
Last modified: 24 May 2006 | 11:46:18 UTC

Something with text encoding seems to be fixed now (maybe with the 5.x series?) so I'll try to post SK alphabet once more, to see...

aáäbcčdďeéfghiíjklľĺmnňoóôpqrŕsštťuúvwxyýzž
AÁÄBCČDĎEÉFGHIÍJKLĽĹMNŇOÓÔPQRŔSŠTŤUÚVWXYÝZŽ

Yes, is fine now!
____________
Peter .-)

Profile Fuzzy Hollynoodles
Volunteer tester
Avatar
Send message
Joined: 18 Jan 06
International
BOINC Synergy
Credit: 90.6
RAC: 0.00
Joined: Jan 18, 2006
Verified: NEVER
Dubloons: 2
Punishment: Misfit
Message 3306 - Posted: 24 May 2006 | 14:30:18 UTC
Last modified: 24 May 2006 | 14:30:55 UTC

abcdefghijklmnopqrstuvwxyzæøå

ABCDEFGHIJKLMNOPQRSTUVWXYZÆØÅ

Danish is fine also. :-)




____________

[color=navy][size=12][b]Those who can, do. Those who can't, bully.[/b][/size][/color]
From here

Post to thread

Message boards : Help! : Character encoding problems.

Home Help Status Forums Glossary Account


Return to Pirates@Home main page


Copyright © 2017 Capt. Jack Sparrow