OverviewTeaching: 10 min
Exercises: 5 minQuestions
How do I open a terminal?
How do I connect to a remote computer?Objectives
Succesfully connect to a remote HPC system.
Connecting to an HPC system is most often done through a tool known as “SSH” (Secure SHell) and usually SSH is run through a terminal. So, to begin using an HPC system we need to begin by opening a terminal. Different operating systems have different terminals, none of which are exactly the same in terms of their features and abilities while working on the operating system. When connected to the remote system the experience between terminals will be identical as each will faithfully present the same experience of using that system.
Here is the process for opening a terminal in each operating system.
There are many different versions (aka “flavours”) of Linux and how to open a terminal window can change between flavours. Fortunately most Linux users already know how to open a terminal window since it is a common part of the workflow for Linux users. If this is something that you do not know how to do then a quick search on the Internet for “how to open a terminal window in” with your particular Linux flavour appended to the end should quickly give you the directions you need.
A very popular version of Linux is Ubuntu. There are many ways to open a terminal window in Ubuntu but a very fast way is to use the terminal shortcut key sequence: Ctrl+Alt+T.
Macs have had a terminal built in since the first version of OSX since it is built on a Linux flavour known as BSD (Berkeley Systems Designs). The terminal can be quickly opened through the use of the Spotlight tool. Hold down the command key and press the spacebar. In the search bar that shows up type “terminal”, choose the terminal app from the list of results (it will look like a tiny, black computer screen) and you will be presented with a terminal window. Alternatively, you can find Terminal under “Utilities” in the Applications menu.
While Windows does have a command-line interface known as the “Command Prompt” that has its roots in MS-DOS (Microsoft Disk Operating System) it does not have an SSH tool built into it and so one needs to be installed. There are a variety of programs that can be used for this, two common ones we describe here, as follows:
MobaXterm is a terminal window emulator for Windows and the home edition can be downloaded for free from mobatek.net. If you follow the link you will note that there are two editions of the home version available: Portable and Installer. The portable edition puts all MobaXterm content in a folder on the desktop (or anywhere else you would like it) so that it is easy add plug-ins or remove the software. The installer edition adds MobaXterm to your Windows installation as any other program you might install. If you are not sure that you will continue to use MobaXterm in the future you are likely best to choose the portable edition.
Download the version that you would like to use and install it as you would any other software on your Windows installation. Once the software is installed you can run it by either opening the folder installed with the portable edition and double-clicking on the file named MobaXterm_Personal_10.2 or, if the installer edition was used, finding the executable through either the start menu or the Windows search option.
Once the MobaXterm window is open you should see a large button in the middle of that window with the text “Start Local Terminal”. Click this button and you will have a terminal window at your disposal.
It is strictly speaking not necessary to have a terminal running on your local computer in order to access and use a remote system, only a window into the remote system once connected. PuTTy is likely the oldest, most well-known, and widely used software solution to take this approach.
PuTTY is available for free download from www.putty.org. Download the version that is correct for your operating system and install it as you would other software on you Windows system. Once installed it will be available through the start menu or similar.
Running PuTTY will not initially produce a terminal but intsead a window full of connection options. Putting the address of the remote system in the “Host Name (or IP Address)” box and either pressing enter or clicking the “Open” button should begin the connection process.
If this works you will see a terminal window open that prompts you for a username through the “login as:” prompt and then for a password. If both of these are passed correctly then you will be given access to the system and will see a message saying so within the terminal. If you need to escape the authentication process you can hold the control/ctrl key and press the c key to exit and start again.
Note that you may want to paste in your password rather than typing it. Use control/ctrl plus a right-click of the mouse to paste content from the clipboard to the PuTTY terminal.
For those logging in with PuTTY it would likely be best to cover the terminal basics already mentioned above before moving on to navigating the remote system.
With all of this in mind, let’s connect to a cluster.
For this lesson, we will use Cirrus - an HPC system located at EPCC./ Although it’s unlikely that every system will be exactly like Cirrus, it’s a good example of what you can expect from an HPC system. To connect to our example computer, we will use SSH (if you are using PuTTY, see above for instructions).
SSH allows us to connect to Linux computers remotely, and use them as if they were our own.
The general syntax of the connection command follows the format
Let’s attempt to connect to the HPC system now:
Your Instructor will give you the correct username to use in place of “yourUsername”.
The authenticity of host 'login.cirrus.ac.uk (22.214.171.124)' can't be established. ECDSA key fingerprint is SHA256:JRj286Pkqh6aeO5zx1QUkS8un5fpcapmezusceSGhok. ECDSA key fingerprint is MD5:99:59:db:b1:3f:18:d0:2c:49:4e:c2:74:86:ac:f7:c6. Are you sure you want to continue connecting (yes/no)? # type "yes"! Warning: Permanently added the ECDSA host key for IP address '126.96.36.199' to the list of known hosts. yourUsername@login.cirrus.ac.uk password: # no text appears as you enter your password Last login: Mon Jun 18 16:21:52 2018 from cpc102380-sgyl38-2-0-cust601.18-2.cable.virginm.net ================================================================================ Cirrus HPC Service -------------------------------------------------------------------------------- This is a private computing facility. Access to this system is limited to those who have been granted access by the operating service provider on behalf of the issuing authority and use is restricted to the purposes for which access was granted. All access and usage are governed by the terms and conditions of access agreed to by all registered users and are thus subject to the provisions of the Computer Misuse Act, 1990 under which unauthorised use is a criminal offence. -------------------------------------------------------------------------------- For help please contact the Cirrus helpdesk at: email@example.com ================================================================================
If you’ve connected successfully, you should see a prompt like the one below.
This prompt is informative, and lets you grasp certain information at a glance:
in this case
(If you don’t understand what these things are, don’t worry!
We will cover things in depth as we explore the system further.)
You may have noticed that the prompt changed when you logged into the remote
system using the terminal (if you logged in using PuTTY this will not apply
because it does not offer a local terminal). This change is important because
it makes it clear on which system the commands you type will be run when you
pass them into the terminal. This change is also a small complication that we
will need to navigate throughout the workshop. Exactly what is reported
$ in the terminal when it is connected to the local system and
the remote system will typically be different for every user. We still need
to indicate which system we are entering commands on though so we will adopt
the following convention:
[local]$ when the command is to be entered on a terminal connected to your local computer
[remote]$ when the command is to be entered on a terminal connected to the remote system
$ when it really doesn’t matter which system the terminal is connected to.
Being Certain Which System your Terminal is connected to
If you ever need to be certain which system a terminal you are using is connected to then use the follwing command:
Keep Two Terminal Windows Open
It is strongly recommended that you have two terminals open, one connected to the local system and one connected to the remote system, that you can switch back and forth between. If you only use one terminal window then you will need to reconnect to the remote system using one of the methods above when you see a change from
[remote]$and disconnect when you see the reverse.
Now we can log into the Cirrus HPC system we will look at the nodes. You should remember that there are at least two types of node on the system: login nodes and compute nodes.
We can use the
/proc file system to examine what the hardware looks like on the login node. Use
cat command to print information on the processors on the login nodes to the terminal:
[remote]$ cat /proc/cpuinfo
processor : 71 vendor_id : GenuineIntel cpu family : 6 model : 79 model name : Intel(R) Xeon(R) CPU E5-2695 v4 @ 2.10GHz stepping : 1 microcode : 0xb000010 cpu MHz : 1199.707 cache size : 46080 KB physical id : 1 siblings : 36 core id : 27 cpu cores : 18 apicid : 119 initial apicid : 119 fpu : yes fpu_exception : yes cpuid level : 20 wp : yes flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc aperfmperf eagerfpu pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer xsave avx f16c rdrand lahf_lm abm 3dnowprefetch ida arat epb pln pts dtherm tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm rdseed adx smap xsaveopt cqm_llc cqm_occup_llc bogomips : 4205.47 clflush size : 64 cache_alignment : 64 address sizes : 46 bits physical, 48 bits virtual power management:
The text above just shows a fragment of the output giving information on a single core of the login node. We can get some useful information from this output:
cpu cores : 18
physical id : 1(numbering starts at zero and this is the last entry so there are two processors at most)
We can extract information about the amount of memory available in a similar way. In this case we only want the first line as it will give
us the total amount of memory available so we use the
head -1 command:
[remote]$ head -1 /proc/meminfo
MemTotal: 263772152 kB
This tells us that there are 252 GB of memory available (this is out of 256 GB, ~4GB are reserved for various parts of computing hardware.
A computer’s memory and disk are measured in units called bytes. The magnitude of a file or memory use is measured using the same prefixes of the metric system: kilo, mega, giga, tera. So 1000 bytes is a kilobyte (kB), 1000 kilobytes is a megabyte (MB), and so on.
You may also see units that use multiples of 1024 rather than 1000. So, 1024 bytes is a kibibyte (KiB), 1024 KiB is a mebibyte (MiB), 1024 MiB is a gibibyte (GiB) and so on. For low amounts of storage the differences between these two units are negligible but as the size increases the differences can be significant.
We can now repeat this for the compute nodes. All interaction with the compute nodes is handled by a specialized piece of software called a scheduler (the scheduler used in this lesson is called PBS Pro). We will learn more about how to use the scheduler to submit jobs later, but for now, will use it to tell us more information about the compute nodes and what is available.
We are going to repeat the commands above on a compute node. To do this, we will run an interactive job which will give
us access to a bash command line on a compute node. The
qsub command is used to submit a job to the scheduler:
[remote]$ qsub -I -l select=1 -A z04
qsub: waiting for job 316267.indy2-login0 to start qsub: job 316267.indy2-login0 ready [compute ~]$
You should see your prompt change to indicate that your shell is now on a compute node rather than a login node.
Differences on the compute node?
Use the commands you saw above to determine if there are any differences between the processors and memory available on the compute node compared to the login node.
Is the answer what you expected?
Explore your local computer
Try to find out the processor and memory details on your own computer. How do these differ from the hardware available on the HPC system?
Now we know how to connect to the HPC system we will next learn how to transfer data on and off the system.
To connect to a remote HPC system using SSH: