Problem in search

hi all,

i write the code for the search module in for my website project. the code is working fine and fetch the data for the corresponding keywords. the problem is, i my code i replace all the html tags by empty strings. using the follwing line,

replaceAll("<a>"," ");

my problem is i have one line in my html page,

<a href="javascript:#" class="content" onclick="MM_openBrWindow('saudi_arabia_distribut.html','','scrollbars=yes,width=620,height=199')">

Saudi Arabia

</a>

if i search for a Saudi Arabis it returns the value of the <A> tag.

any one can guid eme to solve this problem.

thanx for advance.

Bala

[706 byte] By [art84a] at [2007-11-27 3:51:08]
# 1
Which is Saudi Arabia in this case? What is it supposed to return?
kdajania at 2007-7-12 8:55:06 > top of Java-index,Enterprise & Remote Computing,Web Tier APIs...
# 2
hi ,if i search for Saudi, it ll return the Saudi Arabia.but it will return the Values in the <A> tags.Bala.
art84a at 2007-7-12 8:55:06 > top of Java-index,Enterprise & Remote Computing,Web Tier APIs...
# 3
You've to use more advanced regexp techniques to strip out all HTML tags so that only pure text will be left behind. But what you better have to do is to use a DOM or HTML parser for Java, for example DOM4J or HTML4J, to split your HTML structure in an easy-to-crawl object oriented tree.
BalusCa at 2007-7-12 8:55:06 > top of Java-index,Enterprise & Remote Computing,Web Tier APIs...
# 4
hi Bala,thanx for u'r comments.i am not familiar on that concept.can u light me the some samples to how to use it.Bala
art84a at 2007-7-12 8:55:06 > top of Java-index,Enterprise & Remote Computing,Web Tier APIs...