Crawl urls, Retrieving crawl urls, Updating crawl urls – Google Search Appliance Administrative API Developers Guide: Protocol User Manual
Page 12

Google Search Appliance: Administrative API Developer’s Guide: Protocol
12
Crawl URLs
Retrieve and update crawl URLs for a search appliance using the crawlURLs entry of the config feed.
Retrieving Crawl URLs
To get the crawl URLs information for a search appliance, send an authenticated GET request to the
config feed URL:
http://Search_Appliance:8000/feeds/config/crawlURLs
The following example requests the current crawl URLs values from a search appliance:
xmlns:gsa=’http://schemas.google.com/gsa/2007’> href=’http://gsa:8000/feeds/config/crawlURLs’/> href=’http://gsa:8000/feeds/config/crawlURLs’/>
Updating Crawl URLs
To update Crawl URLs information for a search appliance, send an authenticated PUT request to the
config feed URL:
http://Search_Appliance:8000/feeds/config/crawlURLs
The following example overwrites the crawl URLs specified in the entry to update:
xmlns:gsa=’http://schemas.google.com/gsa/2007’>
Property
Description
doNotCrawlURLs
Do not crawl URLs with the following URL patterns.
followURLs
Follow and crawl only URLs with the following URL patterns.
startURLs
Start crawling from the following URL patterns.