beautypg.com

Internationalization, Xml output – Google Search Appliance Protocol Reference User Manual

Page 53

background image

Google Search Appliance: Search Protocol Reference

Results Format

53

Notes:

XSL stylesheets used by the XSLT server are cached for 15 minutes. To force the XSLT server to use
the latest version of an XSL stylesheet, set the proxyreload input parameter to a value of 1 in your
search request.

XSL stylesheets that include other files may not be used with the Google search engine. An XSL
stylesheet that contains the following tags generates an error result:

xmlns:

document()

When you request cached results in custom HTML output, the BLOB XML tag and associated value
are automatically converted to the original text before the XSL stylesheet rules are applied. When
using an XSL stylesheet that customizes cache results, simply use the values of the
CACHE_LEGEND_TEXT, CACHE_LEGEND_NOTFOUND and CACHE_LEGEND_HTML XML tags directly
instead of applying a rule on the BLOB subtag.

If you use input or output encodings other than latin1, see “Internationalization” on page 53 for
more details.

More information about XSL and XSLT can be found on the W3C (

http://www.w3.org/Style/XSL/

)

web site.

Internationalization

The Google Search Appliance handles over 20 character encoding schemes. This section discusses
special considerations for the custom HTML output format with encoding schemes other than latin1.

To support all the encoding schemes supported by Google, the XSLT server follows a process to ensure
that the results are returned in the correct encoding scheme. When requesting search results through
the XSLT server, the server translates the results to the UTF8 encoding scheme before applying the
selected XSL stylesheet. After the XSL stylesheet rules are applied to generate the results, the results are
converted to the encoding scheme that is specified by the output encoding parameter, oe. The one
exception to this rule is cached result pages, which get converted to the encoding scheme of the cached
document after XSLT processing.

Each front end for your search appliance is associated with an underlying stylesheet. All XSL stylesheets
must be in latin1 or UTF8 formats.

XML Output

The description of the XML results format contains the following sections:

“XML Output Overview” on page 54

“Character Encoding Conventions” on page 54

“Google XML Results DTD” on page 54

“Google XML Tag Definitions” on page 55