Overview of the gsa connector for sharepoint – Google Search Appliance Connectors Deploying the Connector for SharePoint User Manual
Page 4
![background image](/manuals/552824/4/background.png)
Overview of the GSA Connector for SharePoint
The Connector for SharePoint 4.0 enables the Google Search Appliance to crawl and index
content from Microsoft SharePoint. Each connector instance can support only one
SharePoint Web Application. If you have more than one Web Application, you need to
deploy one connector instance for each one.
The following diagram provides an overview of how the search appliance gets content from
SharePoint through the connector. For explanations of the numbers in the process, see the
steps following the diagram.
1. The Connector for SharePoint starts communicating with the repository by
presenting authentication credentials.
2. The repository sends a limited number of Doc IDs of documents in the repository to
the connector.
3. The connector constructs URLs from the Doc IDs and pushes it to the search
appliance in a metadata-and-URL feed. Take note that this feed does not include the
document contents.
4. The search appliance gets the URLs to crawl from the feed.
5. The search appliance crawls the repository according to its own crawl schedule, as
specified in the GSA Admin Console. It crawls the content by sending GET requests
for content to the connector.
6. The connector requests the content from the repository.
7. The repository sends the content to the connector.