[Contents] [Index]
[Top] [Bottom] [Prev] [Next]
Contents
Preface
-
Audience
-
LSF Suite 3.2
-
LSF Enterprise Edition
- LSF Standard Edition
-
Related Documents
- Online Documentation
- Technical Assistance
1 - LSF Batch Concepts
-
LSF Base
-
LSF Batch
-
LSF MultiCluster
-
Definitions
-
Jobs, Tasks, and Commands
-
Hosts, Machines, and Computers
-
Clusters
-
Local and Remote Hosts
- Submission, Master, and Execution Hosts
-
Fault Tolerance
-
Shared Directories and Files
-
Shared User Directories
-
Executables and the PATH Environment Variable
- Time Windows
-
Resource and Resource Requirements
- Shared Resources
-
Remote Execution Control
-
User Authentication Methods
-
How LSF Chooses Authentication Methods
-
Host Authentication Methods
- User Account Mapping
-
Job Starters
-
Command-Level Job Starters
- Queue-Level Job Starters
-
Load Sharing with LSF Base
-
How LSF Batch Schedules Jobs
-
Job States
-
Eligible Hosts
-
Dispatch Windows
-
Run Windows
-
Resource Requirements
-
Host Lists
-
Host Load Levels
-
Order of Job Dispatching
-
Job Slot Limits
-
User Job Slot Limits
-
Host Job Slot Limits
-
Queue Job Slot Limits
-
Resource Limits and Resource Usage
-
Scheduling Policies
-
Suspending Jobs
-
Resuming Suspended Jobs
-
User Suspended Jobs
- Interactive Batch Job Support
-
Pre- and Post-execution Commands
-
Checkpointing and Migration
- Job Migration
-
Job Control Actions
-
Resource Reservation
-
Processor Reservation
-
Remote File Access
-
Job Requeue
-
External Submission and Execution Executables
-
External Load Indices and ELIM
- External Group Membership Definition
2 - Managing LSF Base
-
Managing Error Logs
-
LSF Daemon Error Log
- FLEXlm Log
-
Controlling LIM and RES Daemons
-
Checking Host Status
-
Restarting LIM and RES
-
Remote Startup of LIM and RES
-
Shutting down LIM and RES
- Locking and Unlocking Hosts
-
Managing LSF Configuration
-
Overview of LSF Configuration Files
-
Configuration File Formats
-
Example Configuration Files
- Changing LIM Configuration
-
Reconfiguring an LSF Cluster
-
External Resource Collection
-
Restrictions
-
Writing an External LIM
- Overriding Built-In Load Indices
-
LIM Policies
-
Tuning CPU Factors
-
Tuning LIM Load Thresholds
-
Cluster Monitoring with LSF
-
LSF License Management
-
How FLEXlm Works
-
Updating an LSF License
-
Changing the FLEXlm Server TCP Port
- Modifying LSF Products and Licensing
3 - Managing LSF Batch
-
Managing LSF Batch Logs
-
LSF Batch Accounting Log
- LSF Batch Event Log
-
Duplicate Event Logging
-
Configuring Duplicate Event Logging
- How Duplicate Event Logging Works
-
Controlling LSF Batch Servers
-
LSF Batch System Status
-
Remote Start-up of sbatchd
-
Restarting sbatchd
-
Shutting Down LSF Batch Daemons
- Opening and Closing of Batch Server Hosts
-
Controlling LSF Batch Queues
-
bqueues -- Queue Status
-
Opening and Closing Queues
- Activating and Inactivating Queues
-
Managing LSF Batch Configuration
-
Adding a Batch Server Host
-
Removing a Batch Server Host
-
Adding a Batch Queue
- Removing a Batch Queue
-
Validating Job Submissions
-
Controlling LSF Batch Jobs
-
Moving Jobs -- bswitch, btop, and bbot
- Signalling Jobs -- bstop, bresume, and bkill
-
Forcing Job Execution -- brun -f
-
Managing an LSF Cluster Using xlsadmin
-
xlsadmin Management Mode
- xlsadmin Configuration Mode
4 - Tuning LSF Batch
-
Tuning LSF Batch
-
Controlling Interference via Load Conditions
-
Understanding Suspended Jobs
-
Controlling Fairshare
-
Hierarchical Fairshare
-
Understanding How Fairshare Works
- Job Dispatching According to Fairshare
-
Limits and Windows
-
Dispatch and Run Windows
-
Controlling Job Slot Limits
- Resource Limits
-
Reservation Based Scheduling
-
Resource Reservation
- Processor Reservation and Backfilling
-
Controlling Job Execution
- Understanding Job Execution Environment
-
Environment Variable Handling
-
NICE Value
-
Pre-execution and Post-execution commands
- Queue-Level Job Starters
-
Using Licensed Software with LSF Batch
-
Host Locked Licenses
-
Host Locked Counted Licenses
- Floating Licenses
-
Example LSF Batch Configuration Files
-
Example Queues
- Example lsb.hosts file
5 - Managing LSF MultiCluster
-
What is LSF MultiCluster?
-
Enabling MultiCluster Functionalities
-
The lsf.shared File
- The lsf.cluster.cluster File
- Root Access
-
LSF Batch Configuration
- Remote-Only MultiCluster Queues
-
Inter-cluster Load and Host Information Sharing
-
Running Interactive Jobs on Remote Clusters
-
Distributing Batch Jobs Across Clusters
-
Account Mapping Between Clusters
-
User Level Account Mapping
- System Level Account Mapping
6 - LSF Base Configuration Reference
-
The lsf.conf File
-
The lsf.shared File
-
Clusters
-
Host Types
-
Host Models
- Resources
- The lsf.cluster.cluster File
-
Parameters
-
LSF Administrators
- Hosts
-
Resource Map
- The lsf.task and lsf.task.cluster Files
-
The hosts File
- The lsf.sudoers File
7 - LSF Batch Configuration Reference
-
The lsb.params File
-
Parameters
- Handling Cray NQS Incompatibilities
-
The lsb.users File
-
UNIX/NT User Groups
-
LSF Batch User Groups
-
Share Tree Defined in User Groups
-
External User Groups
- User and Group Job Slot Limits
-
The lsb.hosts File
-
Host Section
-
Host Groups
-
External Host Groups
- Host Partitions
-
The lsb.queues File
-
General Parameters
-
Processor Reservation for Parallel Jobs
-
Backfill Scheduling
-
Deadline Constraint Scheduling
-
Flexible Expressions for Queue Scheduling
-
Load Thresholds
-
Resource Limits
-
Eligible Hosts and Users
-
Scheduling Policy
-
Migration
-
Queue-Level Pre-/Post-Execution Commands
-
Job Starter
-
Configurable Job Control Actions
-
Automatic Job Requeue
-
Exclusive Job Requeue
-
Default Host Specification for CPU Speed Scaling
- NQS Forward Queues
-
Queue Level Checkpoint and Rerun
-
The lsb.nqsmaps File
-
Hosts
- Users
A - Troubleshooting and Error Messages
-
Error Log Messages
- Finding the Error Logs
-
Shared File Access
- Shared Files Across UNIX and NT
-
Common LSF Base Problems
-
LIM Dies Quietly
-
LIM Unavailable
-
RES Does Not Start
-
User Permission Denied
- Non-uniform File Name Space
-
Common LSF Batch Problems
-
Batch Daemons Die Quietly
-
sbatchd Starts But mbatchd Does Not
- Host Not Used By LSF Batch
-
Error Messages
-
General Errors
-
Configuration Errors
-
LIM Messages
-
RES Messages
- LSF Batch Messages
B - LSF Directories
C - Sample System Support
-
IRIX 6 Processor Sets
-
Time-Based Processor Allocation
-
User-Based Processor Allocation
- Other Situations
-
Support for Solaris Processor Sets
-
Time-Based Processor Allocation
-
User-Based Processor Allocation
- Other Situations
-
IBM SP-2 Support
-
Support for HP Exemplar Technical Servers
-
Adding Load Indices Definitions
- Adding Queue Definitions
-
Configuring NQS Interoperation
-
Registering LSF with NQS
-
lsb.nqsmaps
-
Configuring Queues for NQS jobs
- Handling Cray NQS Incompatibilities
-
Support for Atria ClearCase
- Using LSF Without Shared File Systems
D - LSF on Windows NT
-
Requirements
- Recommended
-
Differences Between LSF for UNIX and NT
-
File Permissions
-
Mail
-
The cmd.exe Program
-
Heterogeneous NT/UNIX Environments
-
User Accounts
-
Configuration Files
-
Environment Variables
-
Cross-Platform Daemon Startup
- Signal Conversion
-
Starting Services and Daemons
-
Using LSF
- Miscellaneous
E - The LSF SNMP Agent
-
About the Agent
-
Requirements
- Distribution
-
Starting the Agent
-
Structure of the LSF MIB
-
The lsfHosts MIB Group
-
The lsfResources MIB Group
- The lsfBatch MIB Group
- Optional Configuration of the Agent
Index
[Contents] [Index]
[Top] [Bottom] [Prev] [Next]
doc@platform.com
Copyright © 1994-1998 Platform Computing Corporation.
All rights reserved.