beautypg.com

Feed files awaiting processing, Changing the display url in search results, License limits – Google Search Appliance Feeds Protocol Developers Guide User Manual

Page 32

background image

Google Search Appliance: Feeds Protocol Developer’s Guide

32

Feed Files Awaiting Processing

To view a count of how many feed files remain for the search appliance to process into its index, add /
getbacklogcount to a search appliance URL at port 19900. The count that this feature provides can be
used to regulate the feed submission rate. The count also includes connector feed files.

The syntax for /getbacklogcount is as follows:

http://SearchApplianceHostname:19900/getbacklogcount

Changing the Display URL in Search Results

You can change the display URL on search results by pushing a feed with the displayurl attribute set.

Use this feature when you want to use one URL in the index and another for display to the user. For
example, you might change the display URL if URL content is not in a web enabled server (and you need
to specify a proxy server that uses doc IDs in a back-end content management system) or if you split a
large file into segments and each segment is indexed with a separate URL and the display URL for each
result points to the original file.

The following example shows use of the displayurl attribute.



replace
incremental

beautypg.com


displayurl="http://newurl.example.com/myscript.cgi?docID=123"
action="add" mimetype="text/html" lock="true">
Hello World - document data goes here!

License Limits

If your index already contains the maximum number of URLs, or your license limit has been exceeded,
then the index is full.

When the index is full, the system reduces the number of indexed documents as follows:

Documents are removed to bring the total number of documents to the license limit.

Documents with the lock attribute set to true are deleted last.

Increasing the Maximum Number of URLs to Crawl

To increase the maximum number of URLs in your index, log into the Admin Console and choose
Content Sources > Web Crawl > Host Load Schedule. Check the Maximum Number of URLs to
Crawl. This number must be smaller than the license limit for your search appliance. To increase the
license limit, contact Sales.