Some specificities about CNES clusters¶
Contents:
Using S1Tiling Lmod module on HAL/TREX¶
S1Tiling is already installed on HAL and TREX (since June 2023). It’s available through Lmod; see also HAL/TREX user guides.
# Use the lastest version
ml s1tiling
# Check versions available
ml av version
# Activate a specific version
ml s1tiling/1.0.0rc2-otb7.4.2
Note
For the moment, only S1Tiling 1.0 RC2 is installed with a dependency to OTB 7.4.2, Python 3.8.4 and G++ 8.2.
Installation on HAL/TREX¶
You may prefer to install S1Tiling yourself. In that case, there are mainly two X two ways to install S1Tiling on CNES clusters.
If one wants to install S1Tiling from sources instead of pipy, it could be done
from the following context. Then, in later steps, use "${S1TILING_SRC_DIR}"
instead of s1tiling
as pip
parameter.
# Proposed directories where it could be installed
TST_DIR=/work/scratch/${USER}/S1Tiling/install
S1TILING_ROOT_DIR=/work/scratch/${USER}/S1Tiling/
S1TILING_SOURCES=sources
S1TILING_SRC_DIR=${S1TILING_ROOT_DIR}/${S1TILING_SOURCES}
cd "${S1TILING_ROOT_DIR}"
git clone git@gitlab.orfeo-toolbox.org:s1-tiling/s1tiling.git ${S1TILING_SOURCES}
…from available OTB module (and w/ pip)¶
ml otb/7.4.2-python3.8.4-gcc8.2
# Create a pip virtual environment
python -m venv install_with_otb_module
# Configure the environment with:
source install_with_otb_module/bin/activate
# - an up-to-date pip
python -m pip install --upgrade pip
# - an up-to-date setuptools==57.5.0
python -m pip install --upgrade setuptools==57.5.0
# Finally, install S1Tiling from sources
mkdir /work/scratch/${USER}/tmp
TMPDIR=/work/scratch/${USER}/tmp/ python -m pip install s1tiling
deactivate
ml purge
To use it
ml purge
ml otb/7.4.2-python3.8.4-gcc8.2
source install_with_otb_module/bin/activate
S1Processor requestfile.cfg
deactivate
ml purge
Note
This is the approach that has been chosen by the installation script we use
internally. See: install-CNES.sh
Prefer the later approach based on conda if you wish to use a different version of Python.
…from available OTB module (and w/ conda)¶
Note
This approach permits to select a different version of Python, but it will be a bit more complex to correctly adjust the desired version of gdal python bindings to be exactly the same as the one used to generate OTB module. This isn’t demonstrated here.
ml otb/7.4.2-python3.8.4-gcc8.2
# Create a conda environment
ml conda
conda create --prefix ./conda_install_with_otb_distrib python==3.8.4
# Configure the environment with:
conda activate "${TST_DIR}/conda_install_with_otb_distrib"
# - an up-to-date pip
python -m pip install --upgrade pip
# - an up-to-date setuptools==57.5.0
python -m pip install --upgrade setuptools==57.5.0
# Finally, install S1Tiling from sources
mkdir /work/scratch/${USER}/tmp
TMPDIR=/work/scratch/${USER}/tmp/ python -m pip install s1tiling
conda deactivate
ml purge
To use it
ml purge
ml conda
ml otb/7.4.2-python3.8.4-gcc8.2
conda activate "${TST_DIR}/conda_install_with_otb_distrib"
S1Processor requestfile.cfg
conda deactivate
ml purge
…from released OTB binaries…¶
Given otbenv.profile
cannot be unloaded, prefer the above methods based
on OTB module.
First let’s start by installing OTB binaries somewhere in your personnal (or project) environment.
# Start from a clean environment
ml purge
cd "${TST_DIR}"
# Install OTB binaries
wget https://www.orfeo-toolbox.org/packages/OTB-7.4.2-Linux64.run
bash OTB-7.4.2-Linux64.run
# Patches gdal-config
cp "${S1TILING_SRC_DIR}/s1tiling/resources/gdal-config" OTB-7.4.2-Linux64/bin/
# Patches LD_LIBRARY_PATH
echo "export LD_LIBRARY_PATH=\"$(readlink -f OTB-7.4.2-Linux64/lib)\${LD_LIBRARY_PATH:+:\$LD_LIBRARY_PATH}\"" >> OTB-7.4.2-Linux64/otbenv.profile
Note
gdal-config
is either available from the sources
(${S1TILING_SRC_DIR}/s1tiling/resources/gdal-config
) or to download
from here: gdal-config
.
…and with conda¶
Given the OTB binaries installed, we still need to update the Python bindings for the chosen version of Python.
# Create a conda environment
ml conda
conda create --prefix ./conda_install_with_otb_distrib python==3.8.4
# Configure the environment with:
conda activate "${TST_DIR}/conda_install_with_otb_distrib"
# - an up-to-date pip
python -m pip install --upgrade pip
# - an up-to-date setuptools==57.5.0
python -m pip install --upgrade setuptools==57.5.0
# - numpy in order to compile OTB python bindinds for Python 3.8.4
pip install numpy
# - gdal python bindinds shall be compatible with libgdal.so shipped w/ OTB binaries
pip --no-cache-dir install "gdal==$(gdal-config --version)" --no-binary :all:
# - load OTB binaries
source OTB-7.4.2-Linux64/otbenv.profile
# load cmake and gcc to compile the binding
ml cmake gcc
# And update the bindings
(cd OTB-7.4.2-Linux64/ && ctest -S share/otb/swig/build_wrapping.cmake -VV)
ml unload cmake gcc
# Finally, install S1Tiling from sources
mkdir /work/scratch/${USER}/tmp
TMPDIR=/work/scratch/${USER}/tmp/ python -m pip install s1tiling
conda deactivate
ml purge
To use it
ml purge
ml conda
conda activate "${TST_DIR}/conda_install_with_otb_distrib"
source "${TST_DIR}/OTB-7.4.2-Linux64/otbenv.profile"
S1Processor requestfile.cfg
conda deactivate
ml purge
…and with pip¶
Given the OTB binaries installed, we still need to update the Python bindings for the chosen version of Python.
# Create a pip virtual environment
ml python
python -m venv install_with_otb_binaries
# Configure the environment with:
source install_with_otb_binaries/bin/activate
# - an up-to-date pip
python -m pip install --upgrade pip
# - an up-to-date setuptools==57.5.0
python -m pip install --upgrade setuptools==57.5.0
# - numpy in order to compile OTB python bindinds for Python
pip install numpy
# - gdal python bindinds shall be compatible with libgdal.so shipped w/ OTB binaries
pip --no-cache-dir install "gdal==$(gdal-config --version)" --no-binary :all:
# - load OTB binaries
source OTB-7.4.2-Linux64/otbenv.profile
# load cmake and gcc to compile the binding
ml cmake gcc
# And update the bindings
(cd OTB-7.4.2-Linux64/ && ctest -S share/otb/swig/build_wrapping.cmake -VV)
ml unload cmake gcc
# Finally, install S1Tiling from sources
mkdir /work/scratch/${USER}/tmp
TMPDIR=/work/scratch/${USER}/tmp/ python -m pip install s1tiling
deactivate
ml purge
To use it
ml purge
source install_with_otb_binaries/bin/activate
source "${TST_DIR}/OTB-7.4.2-Linux64/otbenv.profile"
S1Processor requestfile.cfg
deactivate
ml purge
Executing S1 Tiling as a job¶
The theory¶
A few options deserve our attention when running S1 Tiling as a job on a cluster like HAL or TREX.
Option | Need to know |
---|---|
[PATHS].tmp | Temporary files shall not be generated on the GPFS, instead, they are
best generated locally in [PATHS]
tmp : %(TMPDIR)s/s1tiling
Warning Of course, we shall not use |
[PATHS].srtm | Original SRTM files are stored in
[PATHS]
srtm : /work/datalake/static_aux/MNT/SRTM_30_hgt
|
[Processing].cache_srtm_by | SRTM files should be copied locally on [PATHS].tmp instead of being symlinked over the GPFS. [Processing]
cache_srtm_by : copy
|
[Processing].nb_otb_threads | This is the number of threads that will be used by each OTB application pipeline. |
[Processing].nb_parallel_processes | This is the number of OTB application pipelines that will be executed in parallel. |
[Processing].ram_per_process | RAM allowed per OTB application pipeline, in MB. |
PBS resources |
This means, that for # The request file
[Processing]
nb_parallel_processes: 10
nb_otb_threads: 2
ram_per_process: 4096
Then the job request shall contain at least #PBS -l select=1:ncpus=20:mem=40gb
# always 1 for select
# cpu = 2 * 10 => 20
# mem = 10 * 4096 => 40gb
|
TL;DR: here is an example¶
SLRUM job file (TREX)¶
#!/bin/bash
#SBATCH --account=...
#SBATCH --partition=cpu2022 # jobs < 72h
#SBATCH --qos=...
#SBATCH -N 1 # number of nodes (or --nodes=1)
#SBATCH -n 1 # number of tasks (or --ntasks=1)
#SBATCH --cpus-per-task=20 # number of cpus par task
#SBATCH --mem=160G # memory per core
#SBATCH --time=00:59:00 # Wall Time 59mn
#SBATCH -J job-s1tiling
# The number of allocated CPUs
NCPUS=${SLURM_CPUS_PER_TASK}
# Let's use 2 threads in each OTB application pipeline
export NB_OTB_THREADS=2
# Let's deduce the number of OTB application pipelines to run in parallel
export NB_OTB_PIPELINES=$(($NCPUS / $NB_OTB_THREADS))
# These two variables have been exported to be automatically used from the
# S1tiling request file.
# Let's use an existing S1Tiling module
s1tiling/1.0.0rc2-otb7.4.2
# Expecting S1Processor.cfg in ${SLURM_SUBMIT_DIR}, the logs will be
# produced in a subdirectory named after the the JOB ID.
WORK_DIR="${SLURM_SUBMIT_DIR}/${SLURM_JOB_ID}"
mkdir -p "${WORK_DIR}"
cd "${WORK_DIR}"
S1Processor --cache-before-ortho ../S1Processor.cfg || {
code=$?
echo "Echec de l'exécution de programme" >&2
exit ${code}
}
PBS job file (HAL)¶
#!/bin/bash
#PBS -N job-s1tiling
#PBS -l select=1:ncpus=20:mem=40gb
#PBS -l walltime=1:00:00
# NB: Using 5Gb per cpu
# The number of allocated CPUs is in the select parameter let's extract it
# automatically
NCPUS=$(qstat -f "${PBS_JOBID}" | awk '/resources_used.ncpus/{print $3}')
# Let's use 2 threads in each OTB application pipeline
export NB_OTB_THREADS=2
# Let's deduce the number of OTB application pipelines to run in parallel
export NB_OTB_PIPELINES=$(($NCPUS / $NB_OTB_THREADS))
# These two variables have been exported to be automatically used from the
# S1tiling request file.
# Let's use an existing S1Tiling module
s1tiling/1.0.0rc2-otb7.4.2
# Expecting S1Processor.cfg in ${PBS_O_WORKDIR}, the logs will be
# produced in a subdirectory named after the the JOB ID.
WORK_DIR="${PBS_O_WORKDIR}/${PBS_JOBID}"
mkdir -p "${WORK_DIR}"
cd "${WORK_DIR}"
S1Processor --cache-before-ortho ../S1Processor.cfg || {
code=$?
echo "Echec de l'exécution de programme" >&2
exit ${code}
}
S1 Tiling request file: S1Processor.cfg
¶
[PATHS]
tmp : %(TMPDIR)s/s1tiling
srtm : /work/datalake/static_aux/MNT/SRTM_30_hgt
...
[Processing]
cache_srtm_by: copy
# Let's use the exported environment variables thanks to "%()s" syntax
nb_parallel_processes: %(NB_OTB_PIPELINES)s
nb_otb_threads: %(NB_OTB_THREADS)s
ram_per_process: 4096
...