How to find base character from an accented character

Hi, given an accented character (? ? ? etc...) is there a way to retrieve its base character? In the case above, a, o and c respectivelly?

I searched in this forum and google and didn't find a definitive answer.

The reason i need this is because in my database, some records have accented content, and now i need to generate a textfile to transfer daily to a bank, but the bank doesn't accept accented characters.

Thanks.

[448 byte] By [fabianobonina] at [2007-10-2 17:53:43]
# 1
I think this Unicode report is what you're asking about. Probably the section on canonical decomposition: http://www.unicode.org/reports/tr15/
DrClapa at 2007-7-13 19:12:11 > top of Java-index,Desktop,I18N...
# 2
Paul, you're a marvel.~
yawmarka at 2007-7-13 19:12:11 > top of Java-index,Desktop,I18N...
# 3

It was exactly what i was looking for. Thanks a lot.

I'm looking at the java examples provided in the page, and they use some classes from ibm ICU package (com.ibm.icu.text.UTF16, com.ibm.text.utility.Utility). Do you know if there is some equivalent native classes in Java?

Regards.

fabianobonina at 2007-7-13 19:12:11 > top of Java-index,Desktop,I18N...
# 4

Decompose the Unicode string -- i.e., perform an NFC transformation -- and then strip off the diacritical marks, as done in [url=http://vietpad.sf.net]VietPad[/url] editor.

There is a native class, java.text.Normalizer, but it is not made public until Mustang release.

http://java.sun.com/javase/6/jcp/beta/

nguyenq87a at 2007-7-13 19:12:11 > top of Java-index,Desktop,I18N...
# 5
Sorry, I should say an NFD transformation, not NFC.
nguyenq87a at 2007-7-13 19:12:11 > top of Java-index,Desktop,I18N...