Unicode conversion
I am using an open source HTML parser, and I have discovered that it cannot parse certain extended (high) Unicode characters, including:
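[code]
final char TRADE = '\u2122';
final char EMDASH = '\u2014';
final char ENDASH = '\u2013';
final char RSQUO = '\u2019';
final char LSAQUO = '\u2039';
final char LDQUO = '\u201c';
final char RDQUO = '\u201d';
final char BULL = '\u2022';
final char OELIG = '\u0152';
final char HELLIP = '\u2026';
[/code]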
To deal with the problem, I am currently checking each token to see whether it is one of these characters before passing it to the parser, but I was wondering if anyone could point me toward a more efficient way of doing this. I currently use a switch-case statement that breaks once it finds that the token equals one of these final chars. I have been told something about the new enum support in Java 1.5, but I don't know where to begin. Thanks in advance.
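To make the question more concrete, here is a rough sketch of what I imagine an enum-based version of the check might look like. The names SpecialChar, fromChar, and isSpecial are made up for illustration and are not part of the parser, and I am assuming each token can be examined as a single char. It replaces the switch-case with one map lookup per token; I have not measured whether that is actually faster.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical enum (not from the parser): one constant per problem character.
enum SpecialChar {
    TRADE('\u2122'),
    EMDASH('\u2014'),
    ENDASH('\u2013'),
    RSQUO('\u2019'),
    LSAQUO('\u2039'),
    LDQUO('\u201c'),
    RDQUO('\u201d'),
    BULL('\u2022'),
    OELIG('\u0152'),
    HELLIP('\u2026');

    private final char value;

    // Lookup table built once, after all constants have been created.
    private static final Map<Character, SpecialChar> BY_CHAR =
            new HashMap<Character, SpecialChar>();

    static {
        for (SpecialChar sc : values()) {
            BY_CHAR.put(sc.value, sc);
        }
    }

    SpecialChar(char value) {
        this.value = value;
    }

    char value() {
        return value;
    }

    // Returns the matching constant, or null if c is not in the list.
    static SpecialChar fromChar(char c) {
        return BY_CHAR.get(c);
    }

    // True if c is one of the characters the parser chokes on.
    static boolean isSpecial(char c) {
        return BY_CHAR.containsKey(c);
    }
}
```

The idea would be to call SpecialChar.isSpecial(token) before handing the token to the parser, and to use SpecialChar.fromChar(token) when I need to know which character it was (for example, to substitute something the parser accepts).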

