beautypg.com

Host load schedule, Updating a crawler access rule, Deleting a crawler access rule – Google Search Appliance Administrative API Developers Guide: Protocol User Manual

Page 21

background image

Google Search Appliance: Administrative API Developer’s Guide: Protocol

21

Updating a Crawler Access Rule

To update a crawl access rule, send an authenticated PUT request to the following URL:

http://Search_Appliance:8000/feeds/crawlAccessNTLM/urlPattern

The following example request body shows the result:


xmlns:gsa=’http://schemas.google.com/gsa/2007’>
#new URL pattern
newdomain
0
2
newuser
newpass

Deleting a Crawler Access Rule

To delete a crawl access rule, send an authenticated DELETE request to the following URL:

http://Search_Appliance:8000/feeds/crawlAccessNTLM/urlPattern

Host Load Schedule

Retrieve and update the host load schedule for a search appliance using the hostLoad entry of the
config feed.

Property

Description

defaultHostLoad

The default web server host load, a float value.

exceptionHostLoad

Exceptions to the default web server host load. This property consists of
one or more lines of text in the following format:

hostName startTime endTime loadFactor

Where:

hostName is a URL or asterisk (*) to represent all hosts. If a hostName
contains multiple load data values, separate the host name into
multiple lines with each line containing one load value. The values
cannot overlap.

startTime and endTime are integer values between 0 and 23 and
represent when to start and end crawling.

loadFactor is a float value from 0 to 4 that represents the processing
load on a search appliance, where 0 is unloaded and 4 is overloaded.

maxURLs

Maximum number of URLs to crawl, an integer value.