Document status, Pausing or resuming crawl, Retrieving document status – Google Search Appliance Administrative API Developers Guide: Protocol User Manual
Page 30

Google Search Appliance: Administrative API Developer’s Guide: Protocol
30
The response result is as follows:
xmlns:gsa=’http://schemas.google.com/gsa/2007’> href=’http://gsa:8000/feeds/command/pauseCrawl’/> href=’http://gsa:8000/feeds/command/pauseCrawl’/>
Pausing or Resuming Crawl
To pause or resume crawl, send an authenticated PUT request to the following URL:
http://Search_Appliance:8000/feeds/command/pauseCrawl
The following is an example of a request to resume crawl:
xmlns:gsa=’http://schemas.google.com/gsa/2007’>
Document Status
Retrieve the status of the documents that have been crawled and served using the documentStatus
entry of the status feed. The properties for the document status are:
Retrieving Document Status
To retrieve document status, send an authenticated GET request to the following URL:
http://Search_Appliance:8000/feeds/status/documentStatus
Property
Description
crawledURLsToday
The number of documents crawled since midnight. (Midnight pertains to
the time that is set on the search appliance.)
crawlPagePerSecond
Current crawling rate measured in pages per second.
errorURLsToday
Document errors that occurred since midnight on the search appliance.
filteredBytes
Document bytes that have been filtered by domain, language, file type, or
metadata.
foundURLs
The number of URLs found that match crawl patterns.
servedURLs
The number of total documents that have been served.