Google Search Appliance Connectors: Deploying the Connector for SharePoint User Manual

Overview of the GSA Connector for SharePoint

The Connector for SharePoint 4.0 enables the Google Search Appliance to crawl and index
content from Microsoft SharePoint. Each connector instance can support only one
SharePoint Web Application. If you have more than one Web Application, you need to
deploy one connector instance for each one.

The following diagram provides an overview of how the search appliance gets content from
SharePoint through the connector. For explanations of the numbers in the process, see the
steps following the diagram.

1. The Connector for SharePoint starts communicating with the repository by presenting authentication credentials.
2. The repository sends a limited number of Doc IDs of documents in the repository to the connector.
3. The connector constructs URLs from the Doc IDs and pushes them to the search appliance in a metadata-and-URL feed. Note that this feed does not include the document contents.
4. The search appliance gets the URLs to crawl from the feed.
5. The search appliance crawls the repository according to its own crawl schedule, as specified in the GSA Admin Console. It crawls the content by sending GET requests for content to the connector.
6. The connector requests the content from the repository.
7. The repository sends the content to the connector.
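
The steps above can be sketched as a small in-memory simulation. This is an illustration only, not the connector's actual code: the repository dictionary, the connector address, and all function names are hypothetical, and the XML layout is a simplified approximation of a metadata-and-URL feed. The point it shows is that the feed carries only URLs, while document content is fetched later through the connector.

```python
import xml.etree.ElementTree as ET

# Hypothetical stand-in for a SharePoint repository: Doc ID -> content.
REPOSITORY = {
    "doc-1": "Quarterly report",
    "doc-2": "Team roster",
    "doc-3": "Design notes",
}

# Assumed connector address; a real deployment uses its own host and port.
CONNECTOR_BASE = "http://connector.example.com:5678/doc/"


def list_doc_ids(batch_size):
    """Step 2: the repository returns a limited number of Doc IDs."""
    return sorted(REPOSITORY)[:batch_size]


def build_feed(doc_ids, datasource="sharepoint"):
    """Step 3: build URLs from Doc IDs and wrap them in a feed without content."""
    root = ET.Element("gsafeed")
    header = ET.SubElement(root, "header")
    ET.SubElement(header, "datasource").text = datasource
    ET.SubElement(header, "feedtype").text = "metadata-and-url"
    group = ET.SubElement(root, "group")
    for doc_id in doc_ids:
        ET.SubElement(group, "record", url=CONNECTOR_BASE + doc_id)
    return ET.tostring(root, encoding="unicode")


def urls_from_feed(feed_xml):
    """Step 4: the search appliance reads the URLs to crawl from the feed."""
    return [record.get("url") for record in ET.fromstring(feed_xml).iter("record")]


def handle_get(url):
    """Steps 5-7: a GET reaches the connector, which asks the repository for content."""
    doc_id = url[len(CONNECTOR_BASE):]
    return REPOSITORY[doc_id]


if __name__ == "__main__":
    feed = build_feed(list_doc_ids(batch_size=2))
    for url in urls_from_feed(feed):
        print(url, "->", handle_get(url))
```

Because the content lookup happens only in `handle_get`, the sketch mirrors the separation described above: the feed tells the search appliance what to crawl, and the crawl itself retrieves the documents.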