To avoid this ambiguity, the prevailing character set (for example, JIS X 0208-1983) should be switched back to ASCII before any HTML markup signal, and only those characters which would be interpreted as special characters in plain text should be interpreted as markup signals in HTML.
Therefore octets 60, 62 and 38 (decimal; the ASCII codes of '<', '>' and '&') in any character set other than ASCII should not be interpreted as HTML markup signals. The following is a patch for WWWLibrary to do this.
I made an unofficial (quick and dirty :-) patch for WWWLibrary_2.11 from the WWW project at CERN. It also applies to 2.09a with a few line offsets.
Apply this and compile with -DISO_2022_JP; you can then handle ISO-2022-JP encoded HTML documents correctly.
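The rule above (treat octets 60, 62 and 38 as markup only while ASCII is the current character set) can be illustrated with a small sketch in C. This is not the patch itself and not WWWLibrary code: the names charset_state and count_markup_octets are hypothetical, made up for this illustration. It assumes the usual three-octet ISO-2022-JP escape sequences (ESC ( B for ASCII, ESC ( J for JIS X 0201 Roman, ESC $ @ and ESC $ B for JIS X 0208) and treats every character set except ASCII as "not ASCII".

```c
#include <stddef.h>

/* Hypothetical illustration only; not taken from WWWLibrary or the patch. */
typedef enum { CS_ASCII, CS_OTHER } charset_state;

/*
 * Count the octets in s[0..n-1] that should be treated as HTML markup
 * signals ('<', '>', '&'), i.e. only those seen while the current
 * character set is ASCII.
 */
size_t count_markup_octets(const unsigned char *s, size_t n)
{
    charset_state cs = CS_ASCII;   /* a document starts in ASCII */
    size_t count = 0;
    size_t i = 0;

    while (i < n) {
        /* Three-octet designation: ESC ( F  or  ESC $ F */
        if (s[i] == 0x1B && i + 2 < n &&
            (s[i + 1] == '(' || s[i + 1] == '$')) {
            /* Only ESC ( B switches back to ASCII. */
            cs = (s[i + 1] == '(' && s[i + 2] == 'B') ? CS_ASCII : CS_OTHER;
            i += 3;
            continue;
        }
        /* Octets 60, 62 and 38 are markup only in ASCII. */
        if (cs == CS_ASCII &&
            (s[i] == '<' || s[i] == '>' || s[i] == '&')) {
            count++;
        }
        i++;
    }
    return count;
}
```

For example, given the octets of "<b>", then ESC $ B, then the two octets 60 62 (one JIS X 0208 character), then ESC ( B, then "&amp;", the function counts 3 markup octets: the '<' and '>' of the tag and the '&' of the entity. The 60 and 62 inside the JIS X 0208 run are skipped.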
________________________________________________________________________
TAKADA Toshihiro