Using environment modules: Difference between revisions

From HPCwiki
Jump to navigation Jump to search
No edit summary
(Rewrite!)
Line 1: Line 1:
<-- this is only preliminary text; this article needs more work -->
<-- this is only preliminary text; this article needs more work -->
[http://modules.sourceforge.net/|Environment modules] are a simple way to allow multiple potentially clashing programs to coexist on a large shared machine such as an HPC. It allows a user to specify exactly which programs are loaded, and even which version of each program, whilst simultaneously allowing the administrator the ability to automatically configure the appropriate environment variables for the system itself.
Upon logging in to the HPC, you should find that when you do:
  module list
You will see something like this:
<source lang='bash'>
-bash-4.1$ module list
Currently Loaded Modulefiles:
  1) shared        2) slurm/2.5.7
</source>
This is a list of all loaded modules in your shell session. To get a list of all available modules, simply
  module available
And this will show you the (very exhaustive) list of modules on the HPC:
<source  lang='bash'>
-bash-4.1$ module avail
---------------------------- /cm/shared/modulefiles ----------------------------
acml/gcc/64/5.3.1                    netcdf/gcc/64/4.1.3
acml/gcc/fma4/5.3.1                  netcdf/gcc/64/4.3.0
acml/gcc/mp/64/5.3.1                  netcdf/gcc/64/4.3.2
acml/gcc/mp/fma4/5.3.1                netcdf/gcc/64/4.3.3
acml/gcc-int64/64/5.3.1              netcdf/gcc/64/4.3.3.1
acml/gcc-int64/fma4/5.3.1            netcdf/intel/64/4.1.3
...
</source>
Let's look at each of these module names. Each module is named for the application it provides, plus a subfolder of which compiler it was compiled with (if compiled), the number of address bits or options (if compiled), and the version.
If you want to see a list for a specific module, you can
  module avail netcdf
And the complete list of versions will be shown.
To load a module, simply
  module load foo
And the most recent version of module foo will automatically be loaded. If foo is compiled, it will automatically select the gcc version. If you want to specify a certain version, then
  module load foo/gcc/64/1.0.0
Will load foo version 1, compiled with gcc. Be advised that this may not always work, as some modules are not compatible with each other, but a message will be shown if this is the case. Additionally, some modules will automatically load other modules with them for them to operate.
If you want to remove a module that you've loaded, then
  module unload foo
Will remove all module foo's loaded.


=== Example ===
=== Example ===
Line 12: Line 64:


=== Loading modules ===
=== Loading modules ===
In order for this script to run, the first thing that is needed is that Python3, which is not the default Python version on the cluster, is load into your environment. Availability of (different versions of) software can be checked by the following command:
This script will not run at all in the default 2.4 version of Python on the cluster. In order for this script to run you must use Python3. To do this, first list all versions of Python:
  module avail
<source>
-bash-4.1$ module avail python
 
---------------------------- /cm/shared/modulefiles ----------------------------
python/2.7.6 python/3.3.3 python/3.4.2
</source>


In the list you should note that python3 is indeed available to be loaded, which then can be loaded with the following command:
Then you can load the specific version you need:
   module load python/3.3.3
   module load python/3.3.3
Now you have access to the executable python3.


== See also ==
== See also ==

Revision as of 14:28, 4 September 2015

<-- this is only preliminary text; this article needs more work --> modules are a simple way to allow multiple potentially clashing programs to coexist on a large shared machine such as an HPC. It allows a user to specify exactly which programs are loaded, and even which version of each program, whilst simultaneously allowing the administrator the ability to automatically configure the appropriate environment variables for the system itself.

Upon logging in to the HPC, you should find that when you do:

 module list

You will see something like this: <source lang='bash'> -bash-4.1$ module list Currently Loaded Modulefiles:

 1) shared        2) slurm/2.5.7

</source>

This is a list of all loaded modules in your shell session. To get a list of all available modules, simply

  module available

And this will show you the (very exhaustive) list of modules on the HPC:

<source lang='bash'> -bash-4.1$ module avail


/cm/shared/modulefiles ----------------------------

acml/gcc/64/5.3.1 netcdf/gcc/64/4.1.3 acml/gcc/fma4/5.3.1 netcdf/gcc/64/4.3.0 acml/gcc/mp/64/5.3.1 netcdf/gcc/64/4.3.2 acml/gcc/mp/fma4/5.3.1 netcdf/gcc/64/4.3.3 acml/gcc-int64/64/5.3.1 netcdf/gcc/64/4.3.3.1 acml/gcc-int64/fma4/5.3.1 netcdf/intel/64/4.1.3 ... </source>

Let's look at each of these module names. Each module is named for the application it provides, plus a subfolder of which compiler it was compiled with (if compiled), the number of address bits or options (if compiled), and the version.

If you want to see a list for a specific module, you can

 module avail netcdf

And the complete list of versions will be shown.


To load a module, simply

 module load foo

And the most recent version of module foo will automatically be loaded. If foo is compiled, it will automatically select the gcc version. If you want to specify a certain version, then

 module load foo/gcc/64/1.0.0

Will load foo version 1, compiled with gcc. Be advised that this may not always work, as some modules are not compatible with each other, but a message will be shown if this is the case. Additionally, some modules will automatically load other modules with them for them to operate.


If you want to remove a module that you've loaded, then

 module unload foo

Will remove all module foo's loaded.


Example

Consider this simple python3 script that should calculate Pi to 1 million digits: <source lang='python'> from decimal import * D=Decimal getcontext().prec=10000000 p=sum(D(1)/16**k*(D(4)/(8*k+1)-D(2)/(8*k+4)-D(1)/(8*k+5)-D(1)/(8*k+6))for k in range(411)) print(str(p)[:10000002]) </source>

Loading modules

This script will not run at all in the default 2.4 version of Python on the cluster. In order for this script to run you must use Python3. To do this, first list all versions of Python: <source> -bash-4.1$ module avail python


/cm/shared/modulefiles ----------------------------

python/2.7.6 python/3.3.3 python/3.4.2 </source>

Then you can load the specific version you need:

 module load python/3.3.3

Now you have access to the executable python3.

See also

External links