beautypg.com

Google Search Appliance Administrative API Developers Guide: Java User Manual

Page 24

background image

Google Search Appliance: Administrative API Developer’s Guide: Java

24

Crawl Errors:

Crawl Exclusions:

Errors

Retrieval Error

7

Redirect without a location header

11

Document not found (404)

12

Other HTTP 400 errors

14

HTTP 0 error

15

Permanent DNS failure

16

Empty document

17

Image conversion failed

22

Authentication failed

25

Conversion error

32

HTTP 500 error

33

The robots.txt file is unreachable

35

Temporary DNS failure

36

Connection failed

37

Connection timeout

38

Connection closed

40

Connection refused

41

Connection reset

43

No route to host

50

Other error

Excluded

Description

3

Not in the URLs to crawl

4

In the URLs to not crawl

5

Off domain redirect

6

Long redirect chain

8

Infinite URL space

9

Unhandled protocol

10

URL is too long

13

The robots.txt file indicates to not index

18

Rejected by rewrite rules

19

Unknown extension

20

Disallowed by a meta tag

24

Disallowed by the robots.txt file