beautypg.com

Document status, Pausing or resuming crawl, Retrieving document status – Google Search Appliance Administrative API Developers Guide: Protocol User Manual

Page 30

background image

Google Search Appliance: Administrative API Developer’s Guide: Protocol

30

The response result is as follows:


xmlns:gsa=’http://schemas.google.com/gsa/2007’>
http://gsa:8000/feeds/command/pauseCrawl
2008-12-11T08:55:57.824Z

href=’http://gsa:8000/feeds/command/pauseCrawl’/>

href=’http://gsa:8000/feeds/command/pauseCrawl’/>

pauseCrawl
0

Pausing or Resuming Crawl

To pause or resume crawl, send an authenticated PUT request to the following URL:

http://Search_Appliance:8000/feeds/command/pauseCrawl

The following is an example of a request to resume crawl:


xmlns:gsa=’http://schemas.google.com/gsa/2007’>
pauseCrawl
0

Document Status

Retrieve the status of the documents that have been crawled and served using the documentStatus
entry of the status feed. The properties for the document status are:

Retrieving Document Status

To retrieve document status, send an authenticated GET request to the following URL:

http://Search_Appliance:8000/feeds/status/documentStatus

Property

Description

crawledURLsToday

The number of documents crawled since midnight. (Midnight pertains to
the time that is set on the search appliance.)

crawlPagePerSecond

Current crawling rate measured in pages per second.

errorURLsToday

Document errors that occurred since midnight on the search appliance.

filteredBytes

Document bytes that have been filtered by domain, language, file type, or
metadata.

foundURLs

The number of URLs found that match crawl patterns.

servedURLs

The number of total documents that have been served.