Skip to content

Reflection

  • Author: Richel
  • Course: Intermediate Bianca
  • Date: 2024-11-11

Course schedule

When Who What
9:00 R Introduction
9:10 R Prevaluation
9:20 P Transferring files p1
10:00 . Break
10:15 P Transferring files p2
10:45 R Efficient jobs
11:00 . Break
11:15 R Efficient jobs
11:20 R Complex jobs
11:40 R Replicate jobs
12:00 . Lunch
13:00 B Software and packages installation p1
13:50 . Break
14:00 B Software and packages installation p2
14:40 BR [IDEs on Bianca p1
14:50 . Break
15:05 L IDEs on Bianca p2
15:40 R Summary
15:45 R Evaluation
15:45 All Optional Q/A
16:00 . END
  • Who: B: Björn, P: Pavlin, R: Richèl

Reflection during the day

  • [x] [ADDED TO COURSE MATERIAL] Add more arguments for rsync, that differ from FileZilla:
    • can transfer data from everywhere to everywhere, especially transfer sensitive data between HPC center
    • (ignore) can transfer (soft) symbolic links
Method Can transfer big files? Can transfer between any HPC center? Can transfer soft links? Keeps file attributes? Can do incremental file transfer? Can send deletions?
rsync Yes Yes Yes Yes Yes Yes
FileZilla ?Yes No ?No (P) ? Yes ?
scp ?Yes ?Yes ?No (P) ? ? ?
  • [x] There is something in the documentation with rsync where it is missing that a command should be run on a local computer
    • [x] Ask Pavlin where exactly
    • Was already correct :-)

I imagined having all learners in the same main Zoom room. Then I saw Pavlin using breakout rooms. He was right! Thanks to him, I started using breakout rooms too. Thanks Pavlin!

Changes to schedule: - Prevaluation takes 3 minutes to fill in - Due to the prevaluation, 10 minutes is enough for the full introduction, so next session can start 10 minutes earlier - File transfer can be shortened by 15 mins - There was too little time for 'Efficient workflow', needs 15 more minutes - Installation part 1 can be from 13:00-13:45 (instead of to 13:50).

RStudio took only 15 minutes, as most learners had already done some

  • [ ] Suggest move RStudio to Basic

  • 13:57: Hmm, I forgot to properly prepare the summary :-/ , let's do so now :-)

  • 15:19 Evaluation started
  • 15:20 A learner thanked me and Björn

None of teachers used a shared document.

  • [ ] Remove a link to a shared document from the course material

The learners were very enjoyable to work with, as they were very social. I guess some credit goes to us teachers for that, but they made it work on the other end of the conversation.

When Who What Suggest
9:00 R Introduction 9:00
9:10 R Prevaluation 9:03
9:20 P Transferring files p1 9:10
10:00 . Break 10:00
10:15 P Transferring files p2 10:15
10:45 R Efficient jobs 10:30
11:00 . Break 11:00
11:15 R Efficient jobs .
11:20 R Complex jobs 11:30 (parallel)
11:40 R Replicate jobs 11:30 (parallel)
12:00 . Lunch
13:00 B Software and packages installation p1 13:00
13:50 . Break 13:45
14:00 B Software and packages installation p2 14:00
14:40 BR [IDEs on Bianca p1 14:30
14:50 . Break 14:45
15:05 L IDEs on Bianca p2 -> something else 15:00
15:40 R Summary 15:30
15:45 R Evaluation 15:35
15:45 All Optional Q/A 15:45
16:00 . END 16:00
  • Complex and replicate jobs are in parallel sessions, suggest to replace IDEs by one of these

Evaluation results

Evaluation can be found here.

I think the most interesting is:

question mean_pre mean_post p_value different teacher
I can transfer files to/from Bianca using rsync 2.2 4.6 0.2857143 FALSE P
I can see the CPU and memory usage of jobs 2.4 3.8 0.2857143 FALSE R
I understand how to set up jobs efficiently 2.0 3.6 0.0476190 TRUE R
I can schedule a simple workflow of jobs that depend on each other using Slurm 1.2 3.4 0.0793651 FALSE R
I can schedule a simple workflow of jobs that depend on each other using Nextflow 0.2 1.2 0.2063492 FALSE R
I can run replicate jobs using Slurm job arrays 1.4 2.2 0.6825397 FALSE R
I understand how to install software myself 1.2 3.6 0.0476190 TRUE B
I understand how to use packages and libraries for scripts 3.0 3.8 0.5238095 FALSE B
I understand what containers are 2.0 3.0 0.2857143 FALSE B
I understand how to build from source 0.8 1.8 0.1904762 FALSE B
I can can run the voted-for IDE on Bianca 1.6 4.6 0.0476190 TRUE BR
I understand how to deal with sensitive data 3.6 4.4 0.2857143 FALSE None

Let's go trough mine:

question mean_pre mean_post p_value different teacher
I can see the CPU and memory usage of jobs 2.4 3.8 0.2857143 FALSE R

Nice improvement, keep it as it is.

question mean_pre mean_post p_value different teacher
I understand how to set up jobs efficiently 2.0 3.6 0.0476190 TRUE R

A significant positive effect! Keep it as it is!

question mean_pre mean_post p_value different teacher
I can schedule a simple workflow of jobs that depend on each other using Slurm 1.2 3.4 0.0793651 FALSE R

These was done by 3 out of the 5 learners, which has had a great impact on the confidences. Keep it as it is!

question mean_pre mean_post p_value different teacher
I can schedule a simple workflow of jobs that depend on each other using Nextflow 0.2 1.2 0.2063492 FALSE R

Great, I did not do that at all :-)

  • [ ] Remove this learning objective for next time
question mean_pre mean_post p_value different teacher
I can run replicate jobs using Slurm job arrays 1.4 2.2 0.6825397 FALSE R

These was done by 2 out of the 5 learners. There has been a 3 and 4 for confidence added, so it may have been a good session.

question mean_pre mean_post p_value different teacher
I can can run the voted-for IDE on Bianca 1.6 4.6 0.0476190 TRUE BR

A significant positive effect! Keep it as it is.

Judging all this, my findings are:

  • I don't care about the prevaluation much: let's not bother the learners with it
  • Remove the 'I can schedule a simple workflow of jobs that depend on each other using Nextflow' learning objective
  • I suggest to move RStudio to the Basic Bianca course
  • I suggest to remove a link to a shared document from the course material
  • If both complex and replicate jobs are important, I suggest to add 30 mins to that session, over going parallel
  • I suggest a simpler schedule