Grid Account
In
order to obtain a grid account, please send an email to Steve Spencer
containing your full name, Mac ID, Student Number and supervisor name. Your
password will initially be your student number (with no leading zeros) but you can
change it by logging in and typing
passwd
Background
NOTE:
Files stored on grid are NOT backed up so it is in your best interests to
ensure that you have copies of your important files stored elsewhere as well as
on grid.
The Blade Center consists of 140 compute nodes capable of running Matlab 7.1. They are all named gridxxx
where xxx = 001 through 140. Their hardware configuration is as follows:
·
grid001
- grid004 - 4 nodes with dual 3.33 GHz Dual Core Xeon
(X5260) CPUs and 6 GB of memory
·
grid005
- grid014 - 10 nodes with dual 2.4 GHz
Pentium 4 Xeon CPUs and 8 GB of memory
·
grid015
- grid064 - 50 nodes with dual 2.4 GHz
Pentium 4 Xeon CPUs and 1 GB of memory
·
grid065
- grid069 - 5 nodes with dual 2.6 GHz Pentium 4 Xeon CPUs and 1 GB of
memory
·
grid070 - 1 node
with dual 2.6 GHz Pentium 4 Xeon CPUs
and 4 GB of memory
·
grid071
- grid084 - 14 nodes with dual 2.0 GHz dual core Opteron CPUs and 2 GB of
memory
·
grid085
- grid094 - 10 nodes with dual 2.2 GHz dual core Opteron CPUs and 4 GB of
memory
·
grid095
- grid098 - 4 nodes with dual 2.2 GHz dual core Opteron CPUs and 8 GB
of memory
·
grid099
- grid140 - 42 nodes with dual 3.06 GHz Pentium 4 Xeon CPUs and 1 GB of memory
Sun Grid Engine is installed on an external management system to distribute
users across the compute nodes in an efficient manner. Users can only connect
directly to the management system which then schedules their task on one of the
compute nodes. Users' files reside on an external file server and are
accessible on the management system and on every compute node.
To Access the
Blade Center
ssh grid.ece.mcmaster.ca
If
you don't have ssh, we can install a copy on your PC.
Once connected you will be using bash by default and if you want to view
graphics then you need to identify your Xserver, eg.
export DISPLAY=your-computer:0
To
submit batch jobs
qsub your-batch-script
For
a sample batch script see /usr/local/sge/examples/jobs/simple.sh on grid
To Run an Interactive Session
If
you are using a Windows PC, make sure to start X-Win32 or Cygwin on your PC
first. If you don't have X-Win32 or Cygwin, we can install it on your PC.
If
you are using a Unix system or Cygwin, you may need to allow other systems to
connect to your Xserver by typing
xhost grid
Once you have an X-windows environment
up and running, open a shell and type
qsh
which
will start an xterm on your display, connected to one
of the grid nodes.
NOTE:
DO NOT start more than 1 job in any interactive shell because this circumvents
the load manager to the eventual detriment of all.
It
is assumed that if you use qsh then your intention is
to run a task where you interact with an application. If you do not intend to
interact with your application then use qsub.
Matlab can be run in
either mode depending on your requirements. DO NOT run anything locally on
grid, use qsh, qsub or qmon to schedule a task on one of the compute nodes.
Monitoring your
Job(s)
To monitor or schedule jobs of any type you can use the graphical interface
qmon&
Other Documentation
For
more information about these commands,
man qsub
man qsh
man qmon
For
more information about the load manager, grid engine, and the various commands
see
/usr/local/sge/doc/SGE53AdminUserDoc.pdf
on
grid.ece.mcmaster.ca. Since there is intentionally no Acrobat reader on grid
you will have to use sftp to copy this file to your
desktop system for viewing.