HP StorageWorks Scalable File Share User Manual

Run this command on each server node for all the mpaths which that node normally mounts.

4. Run chkconfig heartbeat off on all server nodes and reboot them.

5. Restart the file system as described in section 5.3 in this order: MGS, MDS, OSTs.

5.5.1.2 On the Client

Use the following command on a client to check whether the client can communicate properly
with the MDS node:

# lfs check mds
testfs-MDT0000-mdc-ffff81012833ec00 active
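
The same check can be scripted, for example from a job prologue. The following is a minimal sketch, assuming that each healthy target line of the lfs check output ends with the word active (with or without a trailing period), as in the example above, and that the output is written to standard output. The function name check_lustre_targets is hypothetical.

```shell
#!/bin/sh
# Read `lfs check` output on stdin and print every target whose last
# field is not "active" (or "active."). Returns non-zero if any such
# target is found.
# Assumption: healthy lines look like "<target-name> active".
check_lustre_targets() {
    awk 'NF > 0 && $NF !~ /^active\.?$/ { print; bad = 1 } END { exit bad }'
}

# Usage on a client (requires the lfs utility):
#   lfs check mds | check_lustre_targets || echo "MDS connection problem"
```

The function prints nothing and returns zero when every target is active, so it can be used directly in a shell conditional.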

Use the following command to check the OSTs, or the servers for both the MDS and the OSTs. It
shows the Lustre view of the file system. You should see an MDT connection and all expected
OSTs, and the total should match the expected space. For example:

# lfs df -h /hpcsfsc
UUID                    bytes     Used Available Use%  Mounted on
hpcsfsc-MDT0000_UUID     1.1T   475.5M   1013.7G   0%  /hpcsfsc[MDT:0]
hpcsfsc-OST0000_UUID     1.2T    68.4G      1.1T   5%  /hpcsfsc[OST:0]
hpcsfsc-OST0001_UUID     1.2T    68.1G      1.1T   5%  /hpcsfsc[OST:1]
hpcsfsc-OST0002_UUID     1.2T    67.9G      1.1T   5%  /hpcsfsc[OST:2]
hpcsfsc-OST0003_UUID     1.2T    69.1G      1.1T   5%  /hpcsfsc[OST:3]
hpcsfsc-OST0004_UUID     1.2T    71.2G      1.1T   5%  /hpcsfsc[OST:4]
hpcsfsc-OST0005_UUID     1.2T    71.7G      1.1T   5%  /hpcsfsc[OST:5]
hpcsfsc-OST0006_UUID     1.2T    68.1G      1.1T   5%  /hpcsfsc[OST:6]
hpcsfsc-OST0007_UUID     1.2T    68.4G      1.1T   5%  /hpcsfsc[OST:7]
hpcsfsc-OST0008_UUID     1.2T    68.6G      1.1T   5%  /hpcsfsc[OST:8]
hpcsfsc-OST0009_UUID     1.2T    73.1G      1.1T   6%  /hpcsfsc[OST:9]
hpcsfsc-OST000a_UUID     1.2T    72.9G      1.1T   6%  /hpcsfsc[OST:10]
hpcsfsc-OST000b_UUID     1.2T    68.8G      1.1T   5%  /hpcsfsc[OST:11]
hpcsfsc-OST000c_UUID     1.2T    68.6G      1.1T   5%  /hpcsfsc[OST:12]
hpcsfsc-OST000d_UUID     1.2T    68.3G      1.1T   5%  /hpcsfsc[OST:13]
hpcsfsc-OST000e_UUID     1.2T    82.5G      1.0T   6%  /hpcsfsc[OST:14]
hpcsfsc-OST000f_UUID     1.2T    71.0G      1.1T   5%  /hpcsfsc[OST:15]

filesystem summary:     18.9T     1.1T     16.8T   5%  /hpcsfsc
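
Checking that all expected OSTs are present can also be automated. The following is a rough sketch, assuming the lfs df column layout shown above (Use% in the fifth field and the mount point with its [OST:n] tag in the last field). The function name summarize_lfs_df and the default limit of 90 percent are illustrative choices, not values from this manual.

```shell
#!/bin/sh
# Read `lfs df` output on stdin, count the OST lines, and print any OST
# whose Use% is at or above a limit (first argument, default 90).
summarize_lfs_df() {
    awk -v limit="${1:-90}" '
        index($NF, "[OST:") {
            n++
            use = $5
            sub(/%/, "", use)      # "5%" -> "5"
            if (use + 0 >= limit) print "high usage:", $1, $5
        }
        END { print "OSTs seen:", n + 0 }
    '
}

# Usage on a client (requires the lfs utility):
#   lfs df -h /hpcsfsc | summarize_lfs_df 80
```

Compare the reported OST count with the number of OSTs you expect (16 in the example above).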

The following commands will show the file system component connections and the network
interfaces that serve them.

# ls /proc/fs/lustre/*/*/*conn_uuid
/proc/fs/lustre/mdc/testfs-MDT0000-mdc-ffff81012833ec00/mds_conn_uuid
/proc/fs/lustre/mgc/MGC172.31.97.1@o2ib/mgs_conn_uuid
/proc/fs/lustre/osc/testfs-OST0000-osc-ffff81012833ec00/ost_conn_uuid

# cat /proc/fs/lustre/*/*/*conn_uuid
172.31.97.1@o2ib
172.31.97.1@o2ib
172.31.97.2@o2ib
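
Because cat prints only the file contents, the two listings above have to be matched up by position. A small loop can print each connection file next to the NID it contains. This is a minimal sketch; the function name show_conn_uuids is hypothetical, and the optional directory argument exists only so the function can be tried against a copy of the tree.

```shell
#!/bin/sh
# Print each *conn_uuid file next to the NID it contains.
# The base directory defaults to /proc/fs/lustre, as in the commands above.
show_conn_uuids() {
    base="${1:-/proc/fs/lustre}"
    for f in "$base"/*/*/*conn_uuid; do
        [ -r "$f" ] || continue      # skip if the glob matched nothing
        # Strip the base directory so only type/target/file is shown.
        printf '%s %s\n' "${f#"$base"/}" "$(cat "$f")"
    done
}

# Usage on a client:
#   show_conn_uuids
```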

5.6 Lustre Performance Monitoring

You can monitor the performance of Lustre clients, Object Storage Servers, and the MetaData
Server with the open source tool collectl. Not only can collectl report a variety of the more
common system performance data such as CPU, disk, and network traffic, it also supports
reporting of both Lustre and InfiniBand statistics. Read/write performance counters can be
reported in terms of both bytes-per-second and operations-per-second.
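
As an illustration, the sketch below builds a collectl command line that samples CPU, disk, network, Lustre, and interconnect (InfiniBand) summaries. It assumes a collectl release whose -s option accepts the subsystem letters c, d, n, l, and x and whose -i option sets the sampling interval in seconds; check collectl --help on your system, because the available subsystems depend on the installed version. The function names and the 10-second default are hypothetical.

```shell
#!/bin/sh
# Build a collectl argument string: CPU (c), disk (d), network (n),
# Lustre (l) and interconnect (x) summaries at a given interval.
# Assumption: the installed collectl accepts these -s subsystem letters.
collectl_args() {
    interval="${1:-10}"
    printf '%s\n' "-scdnlx -i $interval"
}

# Run collectl with those arguments, but only if it is installed.
run_collectl() {
    command -v collectl >/dev/null 2>&1 || {
        echo "collectl not found" >&2
        return 1
    }
    # Word splitting of the argument string is intentional here.
    collectl $(collectl_args "$@")
}

# Usage: run_collectl 5
```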

For more information about the collectl utility, see http://collectl.sourceforge.net/Documentation.html.
Choose the Getting Started section for information specific to Lustre.

Additional information about using collectl is included in section 7.7 of the HP XC System
Software Administration Guide Version 3.2.1, available on the HP website at:

http://docs.hp.com/en/A-XCADM-321/A-XCADM-321.pdf
