Configuring crawl for cookie-based access, Configuring crawl for http basic or ntlm http – Google Search Appliance Managing Search for Controlled-Access Content User Manual

You can specify a different set of access credentials for each URL pattern in the Admin Console. The
means by which you provide these credentials is different for each kind of authentication, but the
general process remains the same.
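The idea of one credential set per URL pattern can be sketched as a simple lookup. This is an illustration only, not the appliance's implementation: the table, the host names, the account names, and the `credentials_for` helper are all hypothetical, and plain prefix matching stands in for the Admin Console's own URL pattern syntax.

```python
from typing import Optional, Tuple

# Hypothetical table: URL prefix -> (username, password).
# More specific prefixes are listed first so they win over general ones.
CREDENTIALS_BY_PATTERN = [
    ("http://intranet.example.com/hr/", ("crawler_hr", "hr-secret")),
    ("http://intranet.example.com/", ("crawler", "secret")),
]


def credentials_for(url: str) -> Optional[Tuple[str, str]]:
    """Return the credentials of the first prefix that matches url, if any."""
    for prefix, creds in CREDENTIALS_BY_PATTERN:
        if url.startswith(prefix):
            return creds
    # No pattern matched: the URL would be fetched without credentials.
    return None
```

In this sketch, a URL under `/hr/` picks up the HR crawl account, any other intranet URL picks up the general account, and a URL on another host gets no credentials at all.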
Configuring Crawl for Cookie-Based Access
The search appliance supports cookie-based access (single sign-on, forms). For sites that require a
cookie for authentication during crawl and index, you can define a forms authentication rule that
covers the content. When you set up the search appliance to crawl cookie-based content, consider the
following points:
• Define a rule under Crawl and Index > Forms Authentication for controlled-access content
sources that require the search appliance to obtain a session cookie from a login form. Content
accessed through a forms authentication site can be secure or public during serve. For more
information, click Help Center > Crawl and Index > Forms Authentication in the Admin Console.
• If the URL pattern that matches the forms authentication rule includes a logout page, the search
appliance attempts to crawl the logout page, which effectively expires the session cookie. If the
SSO system includes a logout page, exclude it by adding it to Do Not Crawl
URLs with the Following Patterns on the Crawl and Index > Crawl URLs page. For more
information, click Help Center > Crawl and Index > Crawl URLs in the Admin Console.
• A forms authentication rule must generate at least one action for the search appliance to consider
it valid. If a rule doesn’t generate any action for a URL, the search appliance logs an error and
doesn’t crawl the URL again.
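The logout-page point above can be illustrated with a toy simulation. This is a sketch under stated assumptions, not the appliance's crawler: the `simulate_crawl` function, the example URLs, and the plain substring matching are hypothetical and do not reflect the actual Do Not Crawl pattern syntax.

```python
def simulate_crawl(urls, do_not_crawl=()):
    """Toy model: return the URLs fetched while the session cookie is valid."""
    cookie_valid = True
    fetched_ok = []
    for url in urls:
        # Skip anything matching an exclusion fragment (plain substring match).
        if any(fragment in url for fragment in do_not_crawl):
            continue
        if cookie_valid:
            fetched_ok.append(url)
        # Visiting the logout page ends the session for every later request.
        if url.endswith("/logout"):
            cookie_valid = False
    return fetched_ok


# Hypothetical crawl order with a logout page in the middle.
URLS = [
    "http://sso.example.com/docs/a",
    "http://sso.example.com/logout",
    "http://sso.example.com/docs/b",
]
```

Without an exclusion, the simulated crawl loses its session at the logout page and never fetches `docs/b`; passing `do_not_crawl=("/logout",)` keeps the session alive for both documents.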
Google has certified the following Single Sign-On systems for use with software release 6.2 and later:
• Computer Associates SiteMinder 6.0, Policy Server and Web Agent
• Oracle Access Manager 7.0.4 (formerly Oblix)
• Cams by Cafesoft, version 3.0
Configuring Crawl for HTTP Basic or NTLM HTTP
When you set up the search appliance to crawl controlled-access content with HTTP Basic or NTLM
HTTP, consider the following points: