beautypg.com

Using blacklists – Google Search Appliance Creating the Search Experience User Manual

Page 61

background image

Google Search Appliance: Creating the Search Experience

Best Practices

61

Examples of valid entries:

{run, runs, running, ran}
{widgets, parts, items}

Example of an invalid entry:

{safe}

For information about uploading a synonyms file, click Help Center > Serving > Query Settings in the
Admin Console.

Using Blacklists

You can control query expansion by creating a blacklist. A blacklist is a set of words that are excluded
from query expansion. A blacklist can be useful for eliminating unwanted search results that result from
synonym matching and clarifying special words used in your environment.

For example, suppose mediacompany.com starts a new project with the code name “glue.” You could add
“glue” to the blacklist to ensure that user queries do not expand to include “adhesive.”

A blacklist file is a text file of three megabytes or less, containing a simple case-insensitive list of single
words, with one word on each line.

Follow these steps to create a blacklist file:

1.

Create a text (.txt) file.

2.

If the file will contain accented characters and you have not already checked your editor's ability to
save a file with UTF-8 encoding, do so now. As an example, if you are using Notepad, do this:

From the File menu, choose Save As.

Check that the Save options include Encoding, as well as Name and File Type.

Pull down the Encoding menu and choose UTF-8.

3.

Edit the file as follows:

Put one word on each line.

Blacklists should use only alphanumeric characters. Spaces are not allowed.

Use the pound sign (#) to start a comment line.

4.

Save the file. If the file has accented characters, save it with UTF-8 encoding.

This is an example of a blacklist file:

#Blacklist file created July 2006
#Author: lana
glue
component

This file prevents queries for “glue” and “component” from being enhanced with synonyms.

For information about uploading a blacklist file, click Help Center > Serving > Query Settings in the
Admin Console.