unicode conversion

I am using an open source HTML parser and i have discovered that it cannot parse certain extended, or high unicode characters, including:

[code]

TRADE = '\u2122';

EMDASH = '\u2014';

ENDASH = '\u2013';

RSQUO = '\u2019';

LSAQUO = '\u2039';

LDQUO = '\u201c';

RDQUO = '\u201d';

BULL = '\u2022';

OELIG = '\u0152';

HELLIP = '\u2026';

to deal with the problem i am currently checkng each token to see if it is one of these before passing it to the parser, but i was just wondering if anyone could point me in an efficient driection of doing this. i currently use a switch-case statement, that breaks once it finds the token is equal to one of these final CHARs. i have been told something about the new enumerator package with java1.5m but dont know where to begin. Thanks in advance

[875 byte] By [Mr_Blacka] at [2007-10-2 15:27:21]
# 1
forgot to add that after i check if it is one of these tokens all i have to do is change it to a question mark for the parser and to be happy.
Mr_Blacka at 2007-7-13 14:45:26 > top of Java-index,Other Topics,Algorithms...
# 2
cross-posted http://forum.java.sun.com/thread.jspa?threadID=719651
dubwaia at 2007-7-13 14:45:26 > top of Java-index,Other Topics,Algorithms...