APC SGI 15000 RAID User Manual
APC Tools
Table of contents
Document Outline
- SGI® InfiniteStorage 15000 RAID User’s Guide
- Preface
- International Standards
- Potential for Radio Frequency Interference
- Note This equipment has been tested and found to comply with the limits for a class A digital device, pursuant to Part 15 of the...
- Important SGI InfiniteStorage 15000 drive enclosures must be always installed in SGI InfiniteStorage 15000 racks. SGI does not a...
- Warning The SGI InfiniteStorage 15000 MUST be grounded before applying power.
- Warning To ensure protection against electric shock caused by HIGH LEAKAGE CURRENT (TOUCH CURRENT), the SGI InfiniteStorage 15000 must be connected to at least two separate and independent sources. This is to ensure a reliable earth connection.
- Warning To ensure your system has warning of a power failure please disconnect the power from the power supply, by either the switch (where present) or by physically removing the power source, prior to removing the PCM from the enclosure/shelf.
- Warning Do not remove covers from the PCM. Danger of electric shock inside. Return the PCM to your supplier for repair.
- Warning Operation of the Enclosure with ANY modules missing will disrupt the airflow and the drives will not receive sufficient cooling. It is ESSENTIAL that all apertures are filled before operating the unit.
- Important Observe all applicable safety precautions, e.g. weight restrictions, handling batteries and lasers etc., detailed in the preceding paragraphs when dismantling and disposing of this equipment
- Important SGI InfiniteStorage 15000 drive enclosures should only be installed in SGI InfiniteStorage 15000 racks. Mounting and installing these drive enclosures in any other rack is not authorized or supported by SGI.
- Warning It is recommended that you do not slide more than one enclosure out of the rack at a time, to avoid danger of the rack tipping over.
- 1.1 Controller Features
- 1.2 The Controller Hardware
- Figure 1-1 SGI InfiniteStorage 15000 IB - Front and Rear Views
- 1.2.1 Power Supply and Fan Modules
- 1.2.2 I/O Connectors and Status LED Indicators
- 1.2.3 Uninterruptible Power Supply (UPS)
- Chapter 2
- Controller Installation
- 1. Unpack the controller system.
- 2. If it is necessary to install the controller in the 19-inch cabinet(s), contact your service provider.
- 3. Set up and connect the drive enclosures to the controller.
- 4. Connect the controller to your Infiniband (IB) or Fibre Channel (FC) switch and host computer(s).
- 5. Connect your RS-232 terminal to the controller.
- 6. Power up the system.
- 7. Configure the storage array (create and format LUNs - Logical Units) via RS-232 interface, Telnet, or GUI.
- 8. Define and provide access rights for the clients in your SAN environment. Shared LUNs need to be managed by SAN management software. Individual dedicated LUNs appear to the client as local storage and do not require management software.
- 9. Initialize the system LUNs for use with your server/client systems. Partition disk space and create file systems as needed.
- 2.1 Setting Up the Controller
- 2.2 Unpacking the System
- 2.2.1 Rack-Mounting the Controller Chassis
- 2.2.2 Connecting the Controller in Dual Mode
- 2.2.3 Connecting the Controller
- 2.2.4 Selecting SAS- ID for Your Drives
- 2.2.5 Laying Out your Storage Drives
- 2.2.6 Connecting the RS-232 Terminal
- 2.2.7 Powering On the Controller
- 1. Verify that the power switches on the two (2) power supply module at the back of each controller are off.
- 2. Connect the two AC connectors, using the power cords provided at the back to the AC power source for each controller unit. For maximum redundancy, connect the two power connectors to two different AC power circuits for each unit.
- 3. Check that all your drive enclosures are powered up.
- 4. Check that the drives are spun up and ready.
- 5. Turn on the power supplies on the controller unit(s). The controller will undergo a series of system diagnostics and the bootup sequence is displayed on your terminal.
- 6. Wait until the bootup sequence is complete and the controller system prompt is displayed.
- 2.3 Configuring the Controller
- 2.3.1 Planning Your Setup and Configuration
- 2.3.2 Configuration Interface
- 2.3.3 Login as Administrator
- 2.3.4 Setting System Time & Date
- 2.3.5 Setting Tier Mapping Mode
- 2.3.6 Checking Tier Status and Configuration
- Figure 2-5 Tier Status Screen
- 1. If a drive is not displayed at all (that is, it is “missing”), check to ensure that the drive is properly seated and in good condition. To search for the drive, enter:
- 2. If the same channel is missing on all tiers, check the cable connections for that channel.
- 3. If “automatic disk rebuilding” is not enabled, enable it by entering:
- 4. To display the detailed disk configuration information for all of the tiers (Figure 2-6) enter:
- Figure 2-6 Current Tier Configuration
- 2.3.6.1 Heading Definitions
- Figure 2-5 Tier Status Screen
- 2.3.7 Cache Coherency and Labeling in Dual Mode
- 2.3.8 Configuring the Storage Arrays
- 1. To display the current cache settings, type: cache
- 2. Select a cache segment size for your array. For example, to set the segment size to 128 KBytes, type:
- 3. Type: lun
. - 4. To create a new LUN, type: lun add=x
- 5. You will be prompted to enter the parameter values for the LUN. In this example:
- 6. When you are asked to format the LUN, type: y
- Figure 2-9 Logical Unit Status - Formatting
- Figure 2-10 LUN Status - Ready
- 1. To display the current cache settings, type: cache
- 2.3.9 Setting Security Levels
- 2.3.9.1 User Authentication (Recommended for SAN Environment)
- 1. Enter: zoning
- 2. Check to ensure that the LUN Zoning chart is empty (Figure 2-11).
- Figure 2-11 LUN Zoning Screen
- To add a user:
- 1. Type: user audit=on
The controller reports which users are connected. - 2. Type: user add
. - 3. Specify a new Host User’s world wide name, enter s.
- 4. Specify a 64-bit world wide name or GUID, taken from the list of available anonymous users.
- 5. Enter an alias name for the user. The name may contain up to 12 characters. Type in a name and press
. - 6. Host users can have their port access zoned. Enter y to specify host port zoning.
- 7. For Unit 1, enter each active port on a new line and then exit. For this example, type: 1
4 e - 8. For Unit 2, enter each active port on a new line and then exit. For this example, type: 1
4 e - 9. Enter y to specify the unique LUN mapping (Table 2-2).
- 10. Enter a new unique LUN mapping for this user. Options are shown in Table 2-2 on page 26.
- 1. Type: user audit=on
- To add a user:
- Table 2-2 LUN Mapping Options.
- Figure 2-12 Security Settings Screen
- 1. Enter: zoning
- 2.3.9.2 Host Port Zoning (Anonymous Access)
- 1. To edit the default zoning on a host port, type: zoning edit
. The current settings are displayed. - 2. Select a host port (1..4): 1
- 3. Specify the internal LUN (0..1023) to be mapped to the external LUN. The new settings will display .
- 4. Repeat steps 1-3 to configure other host ports.
- Chapter 3
- Controller Management
- 1. To edit the default zoning on a host port, type: zoning edit
- 2.3.9.1 User Authentication (Recommended for SAN Environment)
- 3.1 Managing the Controller
- Locally - Serial Interface
- Remotely - Telnet
- Host WWN
- Disk and Channel Information
- Figure 3-9 Disk Channel Screen
- Figure 3-10 Disk Status Screen
- Figure 3-11 Disk Defect List Screen
- Table 3-2 PHY Link Error Status Block Information
- Figure 3-12 Disk PLS Tier 1 Status Screen
- 3.2.2.1 Tier View
- 3.2.2.2 Tier Configuration
- 3.2.2.3 LUN View
- 3.2.2.4 LUN Configuration
- 3.2.2.5 LUN Reservations
- 3.2.2.6 Adding/Removing Storage Assets
- 3.2.2.7 Status of Drive Enclosures
- 3.2.2.8 Display SES Devices Information
- 3.2.2.9 Visual Indication of Drive
- 3.2.2.10 Visual Indication of Tier
- 3.2.2.11 Visual Indication of Channel
- 3.2.3 Tier Mapping for Enclosures
- 3.2.4 System Network Configuration
- 3.2.5 Restarting the Controller
- 3.2.6 Setting the System’s Date and Time
- 3.2.7 Saving the Controller’s Configuration
- 3.2.8 Restoring the System’s Default Configuration
- 3.2.9 LUN Management
- SCSI Reservations
- Non-Cache Coherent
- Display Current Cache Settings
- Disk Configuration Settings
- Figure 3-30 Disk Configuration Setting Screen
- 3.3.2 Audio/Visual Settings of the System
- 3.3.3 Locking LUN in Cache
- 3.3.3.1 Locking / Unlocking a LUN
- 3.3.3.2 System Performance Statistics
- Figure 3-33 System Performance Statistics Screen
- Figure 3-34 Command Delay Statistics Screen
- Figure 3-35 Host Delay Statistics Screen
- Figure 3-36 Tier Delay Statistics Screen
- Figure 3-37 Host Command Offsets Screen
- Figure 3-38 Dual Message Statistics Screen
- Figure 3-39 Command Length Statistics Screen
- Figure 3-40 Host Command Offsets Screen
- 3.3.4 Resources Allocation
- Background Format/Rebuild Operations
- Figure 3-41 Displaying the Current Rebuild Parameters
- 3.3.4.1 Background LUN Verify Operations
- 3.3.4.2 Background TIER Verify Operations
- 1. At the prompt, type TIER VERIFY=ON
. - 2. The system will ask which tier you wish to verify. Enter the tier number or type a for “All.”
- 3. The system will ask if you want run the Tier Verify operation continuously or not. Type y to run continuously or N to run just once. The default is N.
- Figure 3-42 Tier Verify ON Screen
- Figure 3-43 Tier Verify OFF Screen
- 1. At the prompt, type TIER VERIFY=ON
- 3.3.4.3 Rebuild Journaling
- 3.3.4.4 SES Device Monitoring Rate
- 3.3.4.5 Host Command Timeout
- 3.4 Security Administration
- 3.5 Firmware Update Management
- 3.5.1 Displaying Current Firmware Version
- 3.5.2 Firmware Update Procedure
- 1. Collect and save the output of the following commands before you update the firmware: VERSION AV CACHE DISK DISK LIST DUAL HOST HOST STATUS LOG LUN LUN CONFIG NETWORK STATS STATS DELAY STATS TIER DELAY TIER TIER CONFIG
- 2. Copy the new firmware file to your TFTP server.
- 3. Connect to the controller via Telnet or serial (CLI port).
- 4. Enter TFTP
- 5. You will be asked to confirm action (Figure 3-53). Enter y to continue.
- Figure 3-53 Downloading Controller Firmware
- 6. Enter the TFTP server’s IP address: TFTP
- 7. Enter the firmware path and filename: TFTP
- 8. For the couplet controller configuration, connect and log into the other controller. Repeat Steps 4- 7 above to update the firmware.
- 9. Enter RESTART to restart the unit(s).
- 10. (For dual mode only): After both controllers are back on-line, use the DUAL command to verify that both controller units are healthy. If either controller shows failed, login to the healthy controller and issue the DUAL HEAL command.
- 6. Enter the TFTP server’s IP address: TFTP
- 3.6 Remote Login Management
- 3.7 System Logs
- 3.7.1 Message Log
- 3.7.2 System and Drive Enclosure Faults
- Figure 3-59 Current System Faults
- To display the current SDRAM memory faults (ECC- error controller counters), use FAULTS MEMORY command. To clear the values in the memory faults (ECC) statistics, use FAULTS MEMCLEAR command.
- To display the current status of the host and disk SFPs, use the FAULTS SFP command.
- To display the number of LUN array parity errors detected by the system, use the FAULTS ARRAYPARITY command. The system saves th...
- FAULTS BUSPARITY displays the number of bus parity and data path errors detected by the system.
- FAULTS BUSPARITYCLEAR clears the count of errors.
- You may set a parameter (ECCSHUTDOWN) that allows the system to automatically shutdown if it encounters an unrecoverable error. ...
- The EXCEPTIONSHUTDOWN command parameter allows the system to automatically shutdown if it encounters a task exception.
- FAULTS EXCEPTIONSHUTDOWN=ON enables automatic shutdown for task exceptions. This is the default setting.
- The FAULTS EXCEPTIONSHUTDOWN=OFF disables automatic shutdown and allows the system to continue to run in spite of task exceptions.
- Figure 3-59 Current System Faults
- 3.7.3 Displaying System Uptime
- 3.7.4 Saving a Comment to the Log
- 3.8 Other Utilities
- 4.1 Remote Management of the Controller
- 4.1.1 Network Connection
- 4.1.2 Network Interface Set Up
- 1. Use the NETWORK command to display the current settings (Figure 4-2).
- 2. Change the controller’s IP address for your network environment: network IP=
. - 3. Change the netmask of the controller (if needed): network netmask=
. - 4. Enable the Telnet capability (if needed): network telnet=ON.
- 5. Decide whether the SNMP functionality should be enabled.
- Figure 4-2 Current Network Configuration Screen
- 6. If the SNMP function is enabled, enter the IP address of the computer to be used to monitor the SNMP traps: network trapip=
- 7. Decide whether the Syslog capability should be enabled. To enable (ON) or disable (OFF) the Syslog, enter: network syslog=on|off
- 8. If the SYSLOG function is enabled, enter the destination IP address for the Syslog packets: network SYSLOGIP=
- 9. The default destination port number for Syslog packets is 514. To change it, enter: network SYSLOGPORT=
- 10. Set up the routing table. This table describes how the controller communicates with the hosts on other networks. Use the ROUTE command to display the current settings (Figure 4-3).
- 6. If the SNMP function is enabled, enter the IP address of the computer to be used to monitor the SNMP traps: network trapip=
- Figure 4-3 Current Controller Routing Table
- 4.1.3 Login Names and Passwords
- 4.1.4 SNMP Set Up on Host Computer
- 1. Ensure the host computer, which will receive the traps, has an SNMP browser (such as HP OpenView) properly installed and configured. Refer to your browser’s documentation for instructions on how to load the MIB files.
- 2. Set up the host computer to listen to Ports 161 and 162.
- 3. Load the following controller MIB. Note these MIB files:
- 4. Start a query. You should be able to see various controller information such as tempLevel, powerNumber, fanNumber, powerTable, tierNumber, and tierTable.
- 5. To verify that the SNMP function has been set up correctly:
- a) Unplug one of the power supplies.
- b) Check that a power failure message appears in your SNMP browser.
- 4.1.4.1 Controller Implementation of SNMP
- InfiniteStorage 15000 MIB (The InfiniteStorage 15000 MIB is read only.)
- 4.1.4.2 Traps
- 4.2 Troubleshooting the Controller
- 4.2.1 Component Failure Recovery
- 4.2.1.1 Power Supply Failure
- Figure 4-4 Power Modules - Front View
- 1. Remove the cover panel (if the panel is installed).
- 2. Locate the failed power supply module (indicated by an off Status LED).
- 3. Turn off module’s power switch (found on back of unit).
- 4. Remove the module’s thumbscrews, then slide the module out of the bay.
- 5. Slide the module into the bay making sure it is fully inserted. Install the two thumbscrews to secure it.
- 6. Turn on the power switch. Check that the Status LED is green, indicating that the module is operating normally.
- 7. Replace the cover panel (if necessary).
- Figure 4-4 Power Modules - Front View
- 4.2.1.2 Fan Failure
- 4.2.1.1 Power Supply Failure
- 4.2.2 Recovering from Drive Failures
- 4.2.2.1 Single Drive Failures
- 4.2.2.2 Returning the System to a Fault-Tolerant State
- 4.2.2.3 Manually Replacing a Failed Disk with Spare Disk
- 4.2.2.4 Changing the Rate of Rebuild
- 4.2.2.5 Interrupting the Rebuild Operation
- 4.2.2.6 Multiple Drive and Channel Failures
- 1. Use the TIER command to check the current disk status and see if the drives failed are all on the same channel.
- 2. Use the DISK command to check the status of the channel.
- 1. Contact SGI technical support to determine the cause of error and what steps may need to be taken to recover form the error.
- 2. Isolate the cause of channel failure (for example, loose cable connections).
- 3. Rebuild all the drives on that channel individually by issuing the command:
- 4.2.3 Component Failure on Enclosures
- 4.2.1 Component Failure Recovery
- 5.1 The SGI InfiniteStorage 15000 Drive Enclosure
- 5.2 Enclosure Core Product
- 5.3 The Plug-in Modules
- 5.4 Indicators
- 5.5 Visible and Audible Alarms
- 5.6 Drive Enclosure Technical Specification
- 5.7 Environment
- 6.1 Introduction
- 6.2 Planning Your Installation
- Before you begin installation, you should become familiar with the configuration requirements of your drive enclosure system and the correct positions of each of the optional plug-in modules (Table 6-1).
- Table 6-1 Drive Enclosure Configuration
- Figure 6-1 DEM Pair Locations
- 6.2.1 Enclosure Bay Numbering Convention
- 6.3 Enclosure Installation Procedures
- Important SGI InfiniteStorage 15000 drive enclosures should only be installed in SGI InfiniteStorage 15000 racks. Mounting and installing these drive enclosures in any other rack is not authorized or supported by SGI.
- Warning The drive enclosure with all its component parts installed is too heavy for a single person to easily install into a Rack cabinet.
- Caution Ensure that you have fitted and checked a suitable anti-static wrist or ankle strap and observe all conventional ESD pre...
- Note Drive enclosures are supplied and delivered populated with Backplane, Baseplane, Front Panel, DEMs, and with PCMs installed. The Drive Carrier Modules are supplied as a separate package.
- 6.4 I/O Module Configurations
- 6.5 SAS DEM
- 6.6 SATA Interposer Features
- 6.7 Drive Enclosure Device Addressing
- 6.8 Grounding Checks
- 7.1 Before You Begin
- 7.2 Power On / Power Down
- Caution Do not operate the subsystem until the ambient temperature is within the specified operating range. If the drives have been recently installed, ensure they have had time to acclimatize before operating them.
- Important If AC power is lost for any reason, on restoration of power the enclosure will re-start automatically.
- 7.2.1 PCM LEDs
- 7.2.2 I/O Panel LEDs
- 8.1 Overview
- 8.2 Initial Start-up Problems
- 8.2.1 Faulty Cords
- 8.2.2 Alarm Sounds On Power Up
- 8.2.3 Green “Signal Good” LED on I/O Module Not Lit
- 8.2.4 Computer Doesn’t Recognize the Drive Enclosure Subsystem
- 1 Check that the SAS interface cables from the drive enclosure to the host computer, or I/O controller, are fitted correctly.
- 2 Check that all drive carrier modules have been correctly installed.
- 3 Check that there is a valid SAS signal present at the I/O connector. If there is no signal present, ensure the cable has been properly inserted.
- 4 Check the I/O module setup as follows:
- 8.3 LEDs
- 8.4 Audible Alarm
- 8.5 Troubleshooting
- 8.5.1 Thermal Control
- 1 Check the installation for any airflow restrictions at either the front or rear of the enclosure. A minimum gap of 25mm at the front and 50mm at the rear is recommended.
- 2 Check for restrictions due to dust build-up; clean as appropriate.
- 3 Check for excessive re-circulation of heated air from rear to the front. Use in a fully enclosed rack installation is not recommended.
- 4 Check that all Blank modules are in place.
- 5 Reduce the ambient temperature.
- 8.5.2 Thermal Alarm
- 1 Check local ambient environment temperature is below the upper 40˚C specification.
- 2 Check the installation for any airflow restrictions at either the front or rear of the enclosure. A minimum gap of 25mm at the front and 50mm at the rear is recommended.
- 3 Check for restrictions due to dust build-up. Clean as appropriate.
- 4 Check for excessive re-circulation of heated air from rear to the front. Use in a fully enclosed rack installation is not recommended.
- 5 If possible, shutdown the enclosure and investigate the problem before continuing.
- 8.5.3 Thermal Shutdown
- 8.5.1 Thermal Control
- 8.6 Dealing with Hardware Faults
- 8.7 Continuous Operation During Replacement
- 8.8 Replacing a Module
- Warning Whenever replacing a module NEVER leave an EMPTY bay in the rear of the enclosure, obtain a replacement before removing the problem part.
- Warning Observe all conventional ESD precautions when handling drive enclosure modules and components. Avoid contact with Backplane components and module connectors, etc.
- 8.8.1 Power Cooling Modules
- Warning Do not remove covers from the Power Cooling module (PCM). There is a danger of electric shock. Return the PCM to your supplier for repair.
- Warning To ensure your system has warning of a power failure please disconnect the power from the power supply, by either the sw...
- Warning Do not remove the faulty PCM/Cooling module unless you have a replacement unit of the correct type ready for insertion.
- 1 Make sure you identify the faulty PCM correctly from the two modules installed.
- 2 Lift latch which secures the power supply cord.
- 3 Switch off and disconnect the power supply cord.
- 4 Lift the PCM handle to the open position to cam the PCM out of the enclosure.
- 5 Grip the handle and withdraw the PCM.
- 1 Make sure you identify the faulty PCM correctly from the two modules installed.
- 2 Lift latch which secures the power supply cord.
- 3 Switch off and disconnect the power supply cord.
- 4 Remove the two thumbscrews on the right and left sides of the unit.
- 5 Firmly grip the handle on the bottom of the unit and withdraw the PCM.
- 6 Check for damage, especially to the rear connector on the PCM.
- 7 With the PCM handle in the open position, slide the module into the enclosure.
- 8 Cam the module home by manually closing the PCM handle. A click should be heard as the handle latches engage.
- 9 Connect the power supply cord to the power source, secure the latch over the cord, and switch the power supply ON.
- 1 Check for damage, especially to the rear connector on the PCM.
- 2 Slide the module into the enclosure and push unit you hear a click as the latch is engaged.
- 3 Secure the two thumbscrews on the left and right sides of the unit.
- 4 Connect the power supply cord to the power source; secure latch, and switch the power supply ON.
- 8.8.2 I/O Module
- Warning Do not remove this module unless a replacement can be immediately added. The system must not be run without all units in place.
- 1 Release the two latches on the bottom of the unit by simply pulling each latch out and away from unit.
- 2 Pull the latches forward to cam the module out of the enclosure.
- 3 Grip the unit securely and withdraw the module.
- 4 With the latches in the open position, slide the EBOD module into the enclosure until the latch engages automatically.
- 5 Cam the module home by manually closing the latches. A click should be heard as the latch engages.
- 8.8.3 Replacing the Drive Carrier Module
- Warning Observe all conventional ESD precautions when handling drive enclosure modules and components. Avoid contact with backplane components and module connectors, etc.
- 1 Release the carrier handle by sliding the latch backwards.
- 2 Insert the carrier into the enclosure.
- 3 Slide the drive carrier, gently, all the way into the enclosure.
- 4 Cam the drive carrier home. The camming foot on the base of the carrier will engage into a slot in the enclosure.
- 5 When the carrier is fully home, close the handle. You should hear a click as the latch engages and holds the handle closed.
- 6 Close the enclosure.
- 8.9 Replacing the DEM
- A
- B
- C