URLScraper ?How to use at localhost?

Scraping URLs is quite an easy job: just create a new channel of type URLScraperProvider and fill afterwards the attribute url in the way http://www.abcd.com.

But it gets complicated when trying to use content which is local. So I want to define the url of the URLScraper something like

/docroot/afolder/index.html

I found out that (among others) the URL is written in the display profile:

<String name="url" value="file:///var/opt/sun/portal/portals/portal1/desktop/default/SampleURLScraper/ipinfo.html"/>

So it抯 not a relative path to a predefined docroot but it抯 an absolute one.

Now I don抰 know, from where URLScraper will get images defined in the html-file.

Assume the docroot would be

/var/opt/sun/portal/portals/portal1/default/SampleURLScraper

Then an image, referenced in the html-file like

<img src="/images/show.gif" />

could have been stored in

?default/SampleURLScraper/images/show.gif

Any idea how URLScraper can access local stored content which has image-links?

Thanx in advance

Jochen

[1187 byte] By [DschayTia] at [2007-11-27 2:11:08]
# 1
When serving content from local disk, I don't think URLScraper will do that for you.If you want to embed an image in the local file content, give a complete valid url of the image available from your local webserver/appserver and see. Browser will take care of fetching it.
unni_blra at 2007-7-12 2:04:07 > top of Java-index,Web & Directory Servers,Portal Servers...