This chapter is a detailed tutorial on how to use the LSF commands that report information about the LSF cluster. Cluster information includes:
Cluster information is available through commands and a graphical user interface.
The lsid
command tells you the version of LSF, the name of the load-sharing cluster, and the current master host.
% lsidLSF 3.1, Dec 1, 1997
Copyright 1992-1997 Platform Computing Corporation
My cluster name is sample_cluster
My master name is hostA
The lsinfo
command displays the resources, host types, and host models in the LSF cluster. See `Displaying Available Resources' on page 14 for an example of the lsinfo
command.
The -l
option to the lsinfo
command displays all information available about load indices. You can also specify load indices on the command line to display information about the selected indices:
% lsinfo -l swp cs irix
RESOURCE_NAME: swp
DESCRIPTION: Available swap space (Mbytes) (alias: swap)
TYPE ORDER INTERVAL BUILTIN DYNAMIC
Numeric Dec 60 Yes Yes
RESOURCE_NAME: cpuf
DESCRIPTION: CPU factor
TYPE ORDER INTERVAL BUILTIN DYNAMIC
Numeric Dec 0 Yes No
RESOURCE_NAME: irix
DESCRIPTION: IRIX UNIX
TYPE ORDER INTERVAL BUILTIN DYNAMIC
Boolean N/A 0 No No
TYPE
Indicates whether the resource is numeric, string, or Boolean.
ORDER
Inc
if the numeric value of the load index increases as the load
it measures increases, such as ut
(CPU utilization). Dec
if the numeric value decreases as the load increases. N/A
if the
resource is not numeric.
INTERVAL
The load index is updated every
INTERVAL
seconds. a value
of 0 means the value never changes.
BUILTIN
If
BUILTIN
is Yes
, the resource name is defined
internally by the LIM. If BUILTIN
is No
, the resource
name is site-specific defined by the LSF administrator.
DYNAMIC
If
DYNAMIC
is Yes
the resource value changes
over time. If DYNAMIC
is No
the resource represents
information that does not change such as the total swap space in the host.
A load-sharing cluster may consist of hosts of different architecture and speeds. The lshosts
command displays static information about hosts.
%
lshosts
HOST_NAME type model cpuf ncpus maxmem maxswp server RESOURCES
hostD SUNSOL SunSparc 6.0 1 64M 112M Yes (solaris cserver)
hostB ALPHA DEC3000 10.0 1 94M 128M Yes (alpha cserver)
hostM RS6K IBM350 7.0 1 64M 124M Yes (cserver aix)
hostC SGI6 ORIGIN2K 14.0 16 1024M 1896M Yes (irix cserver)
hostA HPPA HP715 6.0 1 98M 200M Yes (hpux fserver)
type
- the host CPU architecture. Hosts that can run the same binary programs should have the same type
.
model
and cpuf
- the host model and relative CPU performance factor. The higher the number, the faster the CPU.
ncpus
- the number of processors on this host.
maxmem -
t
he maximum amount of physical memory available for user processes.
maxswp
- the total swap space available.
server
- Yes
if the host is an LSF server, No
if the host is a client.
RESOURCES
- the boolean resources defined for this host.
All these parameters are defined by the LSF administrator in the LSF configuration files, or determined directly from the system.
The -l
option to the lshosts
command displays more detailed information about hosts, including the load thresholds. You can also use the -R
resreq option to display hosts with specific resources, or specify hosts on the command line.
There is also the -w
option which displays information in table format without truncating the fields. This is especially useful for scripts that parse the output of lshosts
.
% lshosts -l hostC
HOST_NAME: hostC
type model cpuf ncpus ndisks maxmem maxswp maxtmp rexpri server
SGI6 ORIGIN2K 14 16 4 1024M 1896M 120M 0 Yes
RESOURCES: irix cserver
RUN_WINDOWS: (always open)
LOAD_THRESHOLDS:
r15s r1m r15m ut pg io ls it tmp swp mem
- 3.5 - - - - - - - 974M 451M
The extra fields displayed by the -l
option are:
ndisks
- the number of disk drives directly attached to the host
maxtmp
- the size of the disk partition that contains the /tmp directory
![]()
rexpri
- the execution priority of remote jobs run by the RES.rexpri
is a number between -20 and 20, with -20 representing the highest priority and 20 the lowest. The defaultrexpri
is 0, which corresponds to the default scheduling priority of 0 on BSD-based UNIX systems and 20 on System V-based systems.
RUN_WINDOWS
The time windows during which LIM considers the host as available to execute remote jobs.
These run windows have the same function for LSF hosts as dispatch windows have for LSF Batch hosts.
LOAD_THRESHOLDS
The host is considered busy if any load index is above its threshold (or below, for decreasing load indices). LSF does not send interactive jobs to busy hosts. If the threshold is displayed as a dash `-', the value of that load index does not affect the host's status.
The lsload
command prints out current load information.
% lsload
HOST_NAME status r15s r1m r15m ut pg ls it tmp swp mem
hostD ok 0.1 0.0 0.1 2% 0.0 5 3 81M 82M 45M
hostC ok 0.7 1.2 0.5 50% 1.1 11 0 322M 337M 252M
hostM ok 0.8 2.2 1.4 60% 15.4 0 136 62M 37M 44M
hostA busy *5.2 3.6 2.6 99% *34.4 4 0 70M 34M 18M
hostB lockU 1.0 1.0 1.5 99% 0.8 5 33 12M 24M 23M
The first line lists all the load index names. The load indices are described in `Load Indices' on page 37.
The status
column gives the load status of the host. A host is busy if any load index is beyond its configured threshold. When a load index is beyond its threshold, it is printed with an asterisk `*
'. In the above example, hostA
is busy because load indices r15s
and pg
are too high.
By default hosts are sorted based on CPU and memory load, with the best host listed first. You can specify an order string using the -R
resreq option to sort the hosts in other ways.
The -l
option displays the values of all load indices, including external load indices. You can also specify host names on the command line to display the load of specific hosts. In this example nio
is an external load index defined by the LSF administrator.
% lsload -l hostM
HOST_NAME status r15s r1m r15m ut pg io ls it tmp swp mem nio
hostM lockW 0.7 0.5 0.1 21% 0.0 228 7 0 31M 52M 25M 3.5
The -l
option displays the full host name, rather than truncating the name to fit in a limited screen width. The output from lsload -l
is better suited for automatic processing, since the host name is always complete. It also shows all the load indices rather than just some of them.
The lsmon
command provides an updating display of load information. An example display from lsmon
is shown below. You can specify the resource requirements, refresh interval, and other parameters interactively or on the command line. See the lsmon(1)
manual page for more information.
% lsmon hostA hostB
Hostname: hostA
Refresh rate: 10 secs
HOST_NAME status r15s r1m r15m ut pg ls it swp mem tmp
hostB ok 0.0 0.0 0.0 6% 0.0 7 45 82M 51M 206M
hostA ok 0.2 0.2 0.1 5% 8.9 2 0 140M 70M 119M
xlsmon
is a graphical user interface (GUI) application that displays host status, load levels, load history, and LSF cluster configuration information. The xlsmon
main window shows an icon for each host in the cluster, with each host labelled with its status. Hosts change colour as their status changes. The xlsmon
main window is shown below.
Each xlsmon
window has a "Help" menu item that calls up online help. For more information about using xlsmon
, see the online help.
You can choose other displays from the "View" menu.
The `Detailed Host Load' window displays load levels as bar graphs. You can select which load indices and which hosts are displayed by choosing options from the "View" menu in the `Detailed Host Load' window.
Figure 6. xlsmon
Detailed Host Load Window
The `History of Host Load' window displays the load levels as stripcharts, so you can see the load history starting from when the `History of Host Load' window is first displayed. As with the `Detailed Host Load' window, you can select hosts and load indices by choosing options from the "View" menu.
Figure 7. xlsmon
History of Host Load Window
The `Cluster Configuration' window below, shows the same host information as the lshosts
command displays.
Figure 8. xlsmon
Cluster Configuration Window