[Contents] [Index] [Top] [Bottom] [Prev] [Next]


Contents

Preface

Audience
LSF Suite 3.2
LSF Enterprise Edition
LSF Standard Edition
Related Documents
Online Documentation
Technical Assistance

1 - LSF Batch Concepts

LSF Base
LSF Batch
LSF MultiCluster
Definitions
Jobs, Tasks, and Commands
Hosts, Machines, and Computers
Clusters
Local and Remote Hosts
Submission, Master, and Execution Hosts
Fault Tolerance
Shared Directories and Files
Shared User Directories
Executables and the PATH Environment Variable
Time Windows
Resource and Resource Requirements
Shared Resources
Remote Execution Control
User Authentication Methods
How LSF Chooses Authentication Methods
Host Authentication Methods
User Account Mapping
Job Starters
Command-Level Job Starters
Queue-Level Job Starters
Load Sharing with LSF Base
How LSF Batch Schedules Jobs
Job States
Eligible Hosts
Dispatch Windows
Run Windows
Resource Requirements
Host Lists
Host Load Levels
Order of Job Dispatching
Job Slot Limits
User Job Slot Limits
Host Job Slot Limits
Queue Job Slot Limits
Resource Limits and Resource Usage
Scheduling Policies
Suspending Jobs
Resuming Suspended Jobs
User Suspended Jobs
Interactive Batch Job Support
Pre- and Post-execution Commands
Checkpointing and Migration
Job Migration
Job Control Actions
Resource Reservation
Processor Reservation
Remote File Access
Job Requeue
External Submission and Execution Executables
External Load Indices and ELIM
External Group Membership Definition

2 - Managing LSF Base

Managing Error Logs
LSF Daemon Error Log
FLEXlm Log
Controlling LIM and RES Daemons
Checking Host Status
Restarting LIM and RES
Remote Startup of LIM and RES
Shutting down LIM and RES
Locking and Unlocking Hosts
Managing LSF Configuration
Overview of LSF Configuration Files
Configuration File Formats
Example Configuration Files
Changing LIM Configuration
Reconfiguring an LSF Cluster
External Resource Collection
Restrictions
Writing an External LIM
Overriding Built-In Load Indices
LIM Policies
Tuning CPU Factors
Tuning LIM Load Thresholds
Cluster Monitoring with LSF
LSF License Management
How FLEXlm Works
Updating an LSF License
Changing the FLEXlm Server TCP Port
Modifying LSF Products and Licensing

3 - Managing LSF Batch

Managing LSF Batch Logs
LSF Batch Accounting Log
LSF Batch Event Log
Duplicate Event Logging
Configuring Duplicate Event Logging
How Duplicate Event Logging Works
Controlling LSF Batch Servers
LSF Batch System Status
Remote Start-up of sbatchd
Restarting sbatchd
Shutting Down LSF Batch Daemons
Opening and Closing of Batch Server Hosts
Controlling LSF Batch Queues
bqueues -- Queue Status
Opening and Closing Queues
Activating and Inactivating Queues
Managing LSF Batch Configuration
Adding a Batch Server Host
Removing a Batch Server Host
Adding a Batch Queue
Removing a Batch Queue
Validating Job Submissions
Controlling LSF Batch Jobs
Moving Jobs -- bswitch, btop, and bbot
Signalling Jobs -- bstop, bresume, and bkill
Forcing Job Execution -- brun -f
Managing an LSF Cluster Using xlsadmin
xlsadmin Management Mode
xlsadmin Configuration Mode

4 - Tuning LSF Batch

Tuning LSF Batch
Controlling Interference via Load Conditions
Understanding Suspended Jobs
Controlling Fairshare
Hierarchical Fairshare
Understanding How Fairshare Works
Job Dispatching According to Fairshare
Limits and Windows
Dispatch and Run Windows
Controlling Job Slot Limits
Resource Limits
Reservation Based Scheduling
Resource Reservation
Processor Reservation and Backfilling
Controlling Job Execution
Understanding Job Execution Environment
Environment Variable Handling
NICE Value
Pre-execution and Post-execution commands
Queue-Level Job Starters
Using Licensed Software with LSF Batch
Host Locked Licenses
Host Locked Counted Licenses
Floating Licenses
Example LSF Batch Configuration Files
Example Queues
Example lsb.hosts file

5 - Managing LSF MultiCluster

What is LSF MultiCluster?
Enabling MultiCluster Functionalities
The lsf.shared File
The lsf.cluster.cluster File
Root Access
LSF Batch Configuration
Remote-Only MultiCluster Queues
Inter-cluster Load and Host Information Sharing
Running Interactive Jobs on Remote Clusters
Distributing Batch Jobs Across Clusters
Account Mapping Between Clusters
User Level Account Mapping
System Level Account Mapping

6 - LSF Base Configuration Reference

The lsf.conf File
The lsf.shared File
Clusters
Host Types
Host Models
Resources
The lsf.cluster.cluster File
Parameters
LSF Administrators
Hosts
Resource Map
The lsf.task and lsf.task.cluster Files
The hosts File
The lsf.sudoers File

7 - LSF Batch Configuration Reference

The lsb.params File
Parameters
Handling Cray NQS Incompatibilities
The lsb.users File
UNIX/NT User Groups
LSF Batch User Groups
Share Tree Defined in User Groups
External User Groups
User and Group Job Slot Limits
The lsb.hosts File
Host Section
Host Groups
External Host Groups
Host Partitions
The lsb.queues File
General Parameters
Processor Reservation for Parallel Jobs
Backfill Scheduling
Deadline Constraint Scheduling
Flexible Expressions for Queue Scheduling
Load Thresholds
Resource Limits
Eligible Hosts and Users
Scheduling Policy
Migration
Queue-Level Pre-/Post-Execution Commands
Job Starter
Configurable Job Control Actions
Automatic Job Requeue
Exclusive Job Requeue
Default Host Specification for CPU Speed Scaling
NQS Forward Queues
Queue Level Checkpoint and Rerun
The lsb.nqsmaps File
Hosts
Users

A - Troubleshooting and Error Messages

Error Log Messages
Finding the Error Logs
Shared File Access
Shared Files Across UNIX and NT
Common LSF Base Problems
LIM Dies Quietly
LIM Unavailable
RES Does Not Start
User Permission Denied
Non-uniform File Name Space
Common LSF Batch Problems
Batch Daemons Die Quietly
sbatchd Starts But mbatchd Does Not
Host Not Used By LSF Batch
Error Messages
General Errors
Configuration Errors
LIM Messages
RES Messages
LSF Batch Messages

B - LSF Directories

C - Sample System Support

IRIX 6 Processor Sets
Time-Based Processor Allocation
User-Based Processor Allocation
Other Situations
Support for Solaris Processor Sets
Time-Based Processor Allocation
User-Based Processor Allocation
Other Situations
IBM SP-2 Support
Support for HP Exemplar Technical Servers
Adding Load Indices Definitions
Adding Queue Definitions
Configuring NQS Interoperation
Registering LSF with NQS
lsb.nqsmaps
Configuring Queues for NQS jobs
Handling Cray NQS Incompatibilities
Support for Atria ClearCase
Using LSF Without Shared File Systems

D - LSF on Windows NT

Requirements
Recommended
Differences Between LSF for UNIX and NT
File Permissions
Mail
The cmd.exe Program
Heterogeneous NT/UNIX Environments
User Accounts
Configuration Files
Environment Variables
Cross-Platform Daemon Startup
Signal Conversion
Starting Services and Daemons
Using LSF
Miscellaneous

E - The LSF SNMP Agent

About the Agent
Requirements
Distribution
Starting the Agent
Structure of the LSF MIB
The lsfHosts MIB Group
The lsfResources MIB Group
The lsfBatch MIB Group
Optional Configuration of the Agent

Index



[Contents] [Index] [Top] [Bottom] [Prev] [Next]


doc@platform.com

Copyright © 1994-1998 Platform Computing Corporation.
All rights reserved.