beautypg.com

Examples of rules, Appendix d: compressed results – Google Search Appliance Protocol Reference User Manual

Page 111

background image

Google Search Appliance: Search Protocol Reference

Appendices

111

Examples of Rules

Because the document http://www.foo.com/example/foo.html matches the URL pattern in rule 1,
the search appliance first checks for the date in the title of the document. The URL doesn’t match rule 2,
so the search appliance checks against rule 3. If the search appliance is unable to find a valid date in the
title or the URL, the search appliance looks for the date in the meta tag named publication_date
according to rule 3. If the search appliance is unable to find a valid date in the meta tag, the search
appliance defaults to the last modified date of the HTTP server, according to rule 5.

The search appliance extracts the date from the http://www.foo2.com/archives/20040605/
abc.html URL.

Because the document http://www.foo.com/foo.html does not match the URL pattern in rule 1, the
search appliance looks for the date in the meta tag, according to rule 3 and defaults to rule 5 if the
search appliance cannot find a valid date in rule 3.

For the document http://www.foo2.com/foo.html, the search appliance looks for the date in the
body and defaults to the last-modified date.

For the document http://www.foo3.com/foo.html, the search appliance looks for the date only on
the last-modified header as it only matches the URL pattern of rule 5.

Appendix D: Compressed Results

The Google Search Appliance supports serving compressed results.

The search appliance serves compressed results to browsers that support compression. The browser
must send the following HTTP header to the search appliance:

Accept-Encoding: gzip

The search appliance will then serve compressed results. The browser uncompresses the results.

This applies to both XML and XSLT-transformed results. If the Accept-Encoding: gzip header is not
present, the results are not compressed.

Rule #

Host or URL Pattern

Date Located In

Meta Tag Name

1

www.foo.com/example/

Title

2

www.foo2.com/archives/

URL

3

www.foo.com/

Meta Tag

publication_date

4

www.foo2.com/

Body

5

/

Last Modified