Problem understanding encoding in a browser context

Hi,

I have a problem in understanding what happens when we select different encoding options in the browser.

I loaded a web page encoded in Shift-JIS (www.honda.jp.co), then selected the UTF-8 encoding in the browser (IE or Firefox), and the text became garbled.

As I understand it, the browser should have converted the text from Shift-JIS to UTF-8 and then displayed the page. That is,

any format --> UTF-8 should be lossless.

The glyphs (fonts) that get displayed should correspond to the encoding format.

But what I see in both browsers is question marks and invalid characters.

Note: I have the required fonts installed on my machine.

Please let me know if my understanding is correct.

Thanks

>> Manjunath

[820 byte] By [manjunath_suna] at [2007-10-1 19:06:19]
# 1

Your understanding is not correct. Browsers do not convert between encodings when you change that setting; the setting only tells the browser how to interpret the bytes it has already received. So if your web page is encoded in Shift-JIS and you choose to view it as UTF-8, you will see exactly what you describe: question marks and invalid characters.
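To see the effect outside a browser, here is a minimal Java sketch (not from the original page; the class name WrongDecodingDemo and the method view are made up for illustration). It assumes the JDK in use ships the Shift_JIS charset, which standard desktop JDKs normally do. The same bytes are decoded twice: once with the charset they were written in, and once as UTF-8, the way the browser does when you force that menu option.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class WrongDecodingDemo {

    // Interpret raw page bytes using whichever charset the viewer selected.
    // No conversion happens here: the bytes are simply read differently.
    static String view(byte[] pageBytes, Charset selected) {
        return new String(pageBytes, selected);
    }

    public static void main(String[] args) {
        // Assumption: Shift_JIS is available in this JDK.
        Charset shiftJis = Charset.forName("Shift_JIS");

        // Three Japanese characters, written as Unicode escapes so the
        // encoding of this source file does not matter.
        String original = "\u65E5\u672C\u8A9E";
        byte[] pageBytes = original.getBytes(shiftJis);

        // Decoded with the right charset, the text comes back intact.
        System.out.println(view(pageBytes, shiftJis).equals(original));

        // Decoded as UTF-8, the byte sequences are malformed, so Java
        // substitutes U+FFFD, which typically renders as a question mark.
        String wrong = view(pageBytes, StandardCharsets.UTF_8);
        System.out.println(wrong.contains("\uFFFD"));
    }
}
```

Both lines print true: the bytes are unchanged, only their interpretation differs.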

Your assumption that conversion from any native code page to UTF-8 should be lossless IS correct, but in the case of browsers you will have to perform the conversion yourself; the browser does not do it for you.
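One way to do that conversion yourself, sketched in Java under the same assumption that the Shift_JIS charset is available (the class name ShiftJisToUtf8 and the method toUtf8 are hypothetical names for this example): decode the bytes with the charset they were really written in, then encode the resulting text as UTF-8. If you serve the converted page, its declared charset (HTTP header or meta tag) would also need to say UTF-8.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class ShiftJisToUtf8 {

    // Decode with the source charset, then re-encode as UTF-8.
    static byte[] toUtf8(byte[] shiftJisBytes) {
        // Assumption: Shift_JIS is available in this JDK.
        String text = new String(shiftJisBytes, Charset.forName("Shift_JIS"));
        return text.getBytes(StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // Sample text as Unicode escapes, independent of source file encoding.
        String original = "\u65E5\u672C\u8A9E";
        byte[] sjisBytes = original.getBytes(Charset.forName("Shift_JIS"));

        byte[] utf8Bytes = toUtf8(sjisBytes);

        // Reading the converted bytes as UTF-8 gives back the same text,
        // so nothing was lost in the conversion.
        String roundTrip = new String(utf8Bytes, StandardCharsets.UTF_8);
        System.out.println(roundTrip.equals(original));

        // The byte counts differ: 2 bytes per character in Shift-JIS,
        // 3 bytes per character in UTF-8 for these characters.
        System.out.println(sjisBytes.length + " -> " + utf8Bytes.length);
    }
}
```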

one_danea at 2007-7-11 14:27:41