How to find base character from an accented character
Hi, given an accented character (? ? ? etc...) is there a way to retrieve its base character? In the case above, a, o and c respectivelly?
I searched in this forum and google and didn't find a definitive answer.
The reason i need this is because in my database, some records have accented content, and now i need to generate a textfile to transfer daily to a bank, but the bank doesn't accept accented characters.
Thanks.
It was exactly what i was looking for. Thanks a lot.
I'm looking at the java examples provided in the page, and they use some classes from ibm ICU package (com.ibm.icu.text.UTF16, com.ibm.text.utility.Utility). Do you know if there is some equivalent native classes in Java?
Regards.
Decompose the Unicode string -- i.e., perform an NFC transformation -- and then strip off the diacritical marks, as done in [url=http://vietpad.sf.net]VietPad[/url] editor.
There is a native class, java.text.Normalizer, but it is not made public until Mustang release.
http://java.sun.com/javase/6/jcp/beta/