File transfer using sftp
¶
Learning outcomes
- Practice using the documentation of your favorite HPC cluster
- Can transfer files using
sftp
For teachers
Teaching goals are:
- Learners have practiced using the documentation of their favorite HPC cluster
- Learners have transferred files using
sftp
Lesson plan:
gantt
title File transfer using sftp
dateFormat X
axisFormat %s
section First hour
Prior : prior, 0, 2s
Present: present, after prior, 3s
Challenge: crit, challenge, after present, 10s
Feedback: feedback, after challenge, 2s
Prior questions:
- Who has heard of
sftp
? - Who has already used
sftp
?
Introduction¶
SFTP is an abbreviation of 'SSH File Transfer Protocol' and 'SSH' is an abbreviation of 'Secure Shell protocol'.
- The difference from
rsync
andscp
is that you login to a SFTP session- just like a log in to a server with SSH.
- You use the terminal to transfer the files.
- You need the credentials only when establishing the connection.
Pros
- Secure
- Use cases:
- copying large or many files
- asks if you want to overwrite target files
- possible to resume file transfer after a broken connection.
- handle bulk file transfers
- modify directories
- and encrypt or decrypt specific files
When not to use
- Rsync is better for "syncing"
Procedure¶
In the terminal, run sftp
to connect to Rackham by doing:
In sftp
session, upload/download files to/from the server.
put <filename>
- Copy a file from the local computer to the remote hostget <filename>
- Copy a file from the remote host to the local computer
Example for Rackham
Connect to SFTP session
In the terminal, run sftp
to connect to Rackham by doing:
where [username]
is your UPPMAX username, for example:
If asked, give your center's password. You can get rid of this prompt if you have setup SSH keys
Upload/download files to/from Rackham
In sftp
session, upload/download files to/from the server.
put <filename>
- Copy a file from the local computer to the remote hostget <filename>
- Copy a file from the remote host to the local computer
Cheat sheet in the SFTP session
?
/help
- List of all commands or explain a command, eg ? get
Remote
put
- Copy a file from the local computer to the remote hostcd
- Change the active directory path on the remote hostdir
/ls
- List the contents of the current directory on the remote hostpwd
- Show the present working directory path on the remote hostmkdir
- Create a directory on the remote hostrename
- Rename a file on the remote hostrm
- Delete a file on the remote hostrmdir
- Remove an empty directory on the remote hostexit
/quit
- Close the connection and leave SFTP
Locally
get
- Copy a file from the remote host to the local computerlcd
- Change the active directory on the local systemlls
- List the contents of the current directory on the local computerlmkdir
- Create a directory on the local computerlpwd
- Show the present working directory on the local computer
Transferring many files?
- As always, it is better to transfer few large files than many small files.
- Therefore, make archive(s) and while doing that, compress as well, see last section.
Server to server¶
- You can transfer files in an SSH session on one cluster to another cluster.
rsync
,scp
, orsftp
will all work!- "Local" will now be the server you connect from, that is, where you run the commands.
Large or many files (copy from scp
session)¶
- Shorten download/upload time by reducing the size of the file!
- A common tool in Linux environments is
gzip
. - Usage:
gzip <filename>
. You'll get agz
file ending
- A common tool in Linux environments is
- Transferring many files will create so called overhead
- each file has to be addressed individually.
- Solution is to gather the files in an archive,
like tar.
- A folder with content then behaves like ONE file.
- Usage:
tar -cf archive.tar /path/files
ortar -cf archive.tar /path/folder
- While TARing you may compress the data as well!
tar -czf archive.tar.gz /path/files
Extract/inflate
gunzip compressed_file.gz
tar -xf archive.tar
tar -xzf compressed_archive.tar.gz
gzip
cheat sheet
- [``gzip`` manual](https://www.gnu.org/software/gzip/manual/gzip.html#Sample)
tar
cheat sheet
- [``tar`` manual](https://devhints.io/tar)
Can I use archiving and compressing in all transfer methods?
- Yes!
Exercises¶
Use the documentation of your HPC cluster
- Search for how to transfer files to/from your HPC cluster using
scp
. Which URL is it described?- Tip: not all HPC centers have documented this, so you should give up searching after a while.
Where is that documentation?
HPC Cluster | Link to documentation |
---|---|
Alvis | FIX? None. |
Berzelius | here |
Bianca | here, through transit |
COSMOS | here |
Dardel | The similar psftp is described here |
Kebnekaise | Here |
LUMI | To do |
Rackham | here |
Tetralith | here |
Vera | FIX? None. |
Dardel documentation uses psftp
, what is that?
- It is just a SFTP component of the client PuTTY.
- Same commands as for SFTP.
Exercise 1: Establish the sftp connection and find the relevant folders locally and remotely
Tips
- Connect from your local terminal
- Orient yourself remotely and locally with
ls
,cd
,pwd
,lls
etc... - Where are you remotely?
- Go to the course folder where you have files
- Where are you locally?
- Go to your local folder used for the course
Answer (Tetralith example)
Locally
- Establish connection
$ sftp sm_bcarl@tetralith.nsc.liu.se
(sm_bcarl@tetralith.nsc.liu.se) Password:
(sm_bcarl@tetralith.nsc.liu.se) Verification code:
Connected to tetralith.nsc.liu.se.
- Use
pwd
to find where you are remotely - Use
cd
to reach relevant remote folder - Use
lpwd
to find where you are locally -
Use
lcd
to reach relevant local folder
Exercise 2: Upload
Tips
- (If you want to create a file in local terminal:
$ touch local_file_sftp
) - Send it to an existing folder (e.g.
transfer
) on Tetralith - Check on server (in sftp session) that it is there
Answer (Tetralith example)
Locally
- (If you want to create a file in local terminal:
$ touch local_file_sftp
) - use the put command:
> put local_file_sftp
Check on server (in sftp session) that it is there
-
> ls ~/transfer
Exercise 3: Download
Tips
- (If you want to create a file in remote terminal:
$ touch remote_file_sftp
) - Send it to an existing local folder
- Check locally (in sftp session) that it is there
Answer (Tetralith example)
On Server
- use the get command:
> get remote_file_sftp
Check locally (in sftp session) that it is there
-
> lls ~/transfer
-
(or in the File explorer)
(Optional if you have multiple cluster/server accounts) Exercise 4: Transfer with favorite terminal tool between servers
Tips
- Have a SSH session running on your favorite cluster
- Use a terminal tool (
rsync
,scp
, orsftp
) to establish connection between your account on Tetralith or Alvis and your present server.- address to Tetralith: use address from Getting started
- address to Alvis: use address from Login using a Terminal
- Transfer a file between the clusters
- Check that it is there!
Answer (Example with Tetralith to Rackham , using sftp)
Same principles as we learned before!
Connection from Tetralith:
[sm_bcarl@tetralith3 ~]$ sftp bjornc@rackham.uppmax.uu.se
The authenticity of host 'rackham.uppmax.uu.se (89.44.250.84)' can't be established.
ECDSA key fingerprint is SHA256:W/MazH3WrH0wKrHBOJpPbDaU4qeYGqiv3FRPsdXIsb4.
This key is not known by any other names
Are you sure you want to continue connecting (yes/no/[fingerprint])? yes
Warning: Permanently added 'rackham.uppmax.uu.se' (ECDSA) to the list of known hosts.
bjornc@rackham.uppmax.uu.se's password:
Connected to rackham.uppmax.uu.se.
-
Proceed with
put
if you want to transfer from Tetralith (now local) to Rackham (remote)
(Optional): Exercise 5: Test the difference between transferring one or several files
Tips
In an SSH session (not SFTP) with REMOTE/server
- Create 1000 files REMOTELY in a directory with name
many_files
$ mkdir many_files_sftp
$ cd many_files_sftp
$ touch file_{1..1000}.txt
- Check content:
$ ls
for checking - Leave directory to be able to perform next step:
$ cd ..
- Also archive and zip the
many_files_sftp
folder tomany_files_sftp.tar.qz
Establish the SFTP session (Exercise 1)
- Download (to local) the directory and note the time needed (not shown in numbers so count the seconds!)
- Download (to local) the
.tar.gz
file and note the time needed - Was there a significant difference?
Answer (Example with Tetralith)
Archiving and compressions step REMOTELY
tar -cvzf many_files_sftp.tar.gz many_files
Establish SFTP connection
$ sftp sm_bcarl@tetralith.nsc.liu.se
Download
> get -r many_files_sftp
(we need the recursive command-r
)-
> get many_files_sftp.tar.gz
Some other tools¶
Terminal tools
- Rclone: Rclone is a command-line program to manage and sync your files on cloud storage.