Crawl urls, Retrieving crawl urls, Updating crawl urls – Google Search Appliance Administrative API Developers Guide: Protocol User Manual

Google Search Appliance: Administrative API Developer’s Guide: Protocol

Crawl URLs

Retrieve and update crawl URLs for a search appliance using the crawlURLs entry of the config feed.

Retrieving Crawl URLs

To retrieve the crawl URL settings for a search appliance, send an authenticated GET request to the
config feed URL:

http://Search_Appliance:8000/feeds/config/crawlURLs

The following example shows the entry returned when the current crawl URL values are requested from a search appliance:


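<entry xmlns='http://www.w3.org/2005/Atom'
       xmlns:gsa='http://schemas.google.com/gsa/2007'>
  <id>http://gsa:8000/feeds/config/crawlURLs</id>
  <updated>2008-12-12T07:49:32.957Z</updated>
  <link rel='self' type='application/atom+xml'
        href='http://gsa:8000/feeds/config/crawlURLs'/>
  <link rel='edit' type='application/atom+xml'
        href='http://gsa:8000/feeds/config/crawlURLs'/>
  <gsa:content name='entryID'>crawlURLs</gsa:content>
  <gsa:content name='startURLs'>http://www.example.com/</gsa:content>
  <gsa:content name='doNotCrawlURLs'>.xls$</gsa:content>
  <gsa:content name='followURLs'>http://www.example.com/</gsa:content>
</entry>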
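
As a rough illustration, the retrieval step could be scripted along the following lines. This is a minimal sketch, not the guide's own client code. It assumes that the appliance accepts a token in an `Authorization: GoogleLogin auth=...` header and that the returned entry carries each property in a `gsa:content` element with a `name` attribute. The function names `build_get_request` and `parse_crawl_urls` are hypothetical.

```python
import urllib.request
import xml.etree.ElementTree as ET

# Namespace taken from the example entry in this guide.
GSA_NS = "http://schemas.google.com/gsa/2007"


def build_get_request(host, auth_token):
    """Build (but do not send) a GET request for the crawlURLs entry.

    The Authorization header format is an assumption, not confirmed
    by this section of the guide.
    """
    url = f"http://{host}:8000/feeds/config/crawlURLs"
    req = urllib.request.Request(url, method="GET")
    req.add_header("Authorization", f"GoogleLogin auth={auth_token}")
    return req


def parse_crawl_urls(entry_xml):
    """Return a dict mapping each gsa:content name to its text value.

    Assumes properties are stored as <gsa:content name='...'> elements.
    """
    root = ET.fromstring(entry_xml)
    props = {}
    for node in root.iter(f"{{{GSA_NS}}}content"):
        props[node.get("name")] = (node.text or "").strip()
    return props
```

To send the request, pass the result of `build_get_request` to `urllib.request.urlopen` and hand the decoded response body to `parse_crawl_urls`.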

Updating Crawl URLs

To update the crawl URL settings for a search appliance, send an authenticated PUT request to the
config feed URL:

http://Search_Appliance:8000/feeds/config/crawlURLs

The following example request body overwrites the appliance's crawl URLs with the values specified in the entry:


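<entry xmlns='http://www.w3.org/2005/Atom'
       xmlns:gsa='http://schemas.google.com/gsa/2007'>
  <id>http://gsa:8000/feeds/config/crawlURLs</id>
  <gsa:content name='entryID'>crawlURLs</gsa:content>
  <gsa:content name='startURLs'>http://www.example.com/</gsa:content>
  <gsa:content name='doNotCrawlURLs'>.xls$</gsa:content>
  <gsa:content name='followURLs'>http://www.example.com/</gsa:content>
</entry>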
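
The update step might be sketched in the same way. The code below is illustrative only. It assumes the request body is an Atom entry whose properties are `gsa:content` elements with a `name` attribute, that the body is sent as `application/atom+xml`, and that authentication uses an `Authorization: GoogleLogin auth=...` header. None of these details is confirmed by this section, and the helper names `build_update_entry` and `build_put_request` are hypothetical.

```python
import urllib.request
from xml.sax.saxutils import escape

ATOM_NS = "http://www.w3.org/2005/Atom"
GSA_NS = "http://schemas.google.com/gsa/2007"
PROPERTY_NAMES = ("startURLs", "followURLs", "doNotCrawlURLs")


def build_update_entry(host, properties):
    """Return the entry XML (as a str) for the given property values.

    `properties` maps a property name to its string value; names
    outside PROPERTY_NAMES are ignored.
    """
    feed_url = f"http://{host}:8000/feeds/config/crawlURLs"
    lines = [
        f"<entry xmlns='{ATOM_NS}' xmlns:gsa='{GSA_NS}'>",
        f"  <id>{escape(feed_url)}</id>",
        "  <gsa:content name='entryID'>crawlURLs</gsa:content>",
    ]
    for name in PROPERTY_NAMES:
        if name in properties:
            # escape() protects &, < and > inside URL patterns.
            value = escape(properties[name])
            lines.append(f"  <gsa:content name='{name}'>{value}</gsa:content>")
    lines.append("</entry>")
    return "\n".join(lines)


def build_put_request(host, auth_token, properties):
    """Build (but do not send) the PUT request that updates crawl URLs."""
    url = f"http://{host}:8000/feeds/config/crawlURLs"
    body = build_update_entry(host, properties).encode("utf-8")
    req = urllib.request.Request(url, data=body, method="PUT")
    # Content type and header scheme are assumptions, see the note above.
    req.add_header("Content-Type", "application/atom+xml")
    req.add_header("Authorization", f"GoogleLogin auth={auth_token}")
    return req
```

Because the PUT overwrites the stored values, a cautious client would first retrieve the current entry, modify only the properties it intends to change, and send the complete set back.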

Property          Description

doNotCrawlURLs    Do not crawl URLs with the following URL patterns.
followURLs        Follow and crawl only URLs with the following URL patterns.
startURLs         Start crawling from the following URL patterns.