Running TensorBoard on Snowy¶
Info
- TensorBoard can be used to visualise, debug, and profile your model.
- TensorBoard is accessed from a web browser. You may start it in an interactive session and access it via an ssh-tunnel or ThinLinc.
1. Connect to Rackham using ThinLinc.¶
You may download the ThinLinc client from here: https://www.cendio.com/thinlinc/download
You may also use the web browser to connect: https://rackham-gui.uppmax.uu.se You need to set up 2FA for the ThinLinc web.
2. Open a terminal in ThinLinc and ask for an interactive session to Snowy.¶
Note: We have a magnetic 4-node reservation for today: naiss2023-22-247_1
.
Is the GPU visible?
3. Load the needed modules¶
ORTensorBoard can be started from the command line as tensorboard
. You need to specify a directory for the logs using the --logdir
option.
Example:
tensorboard --bind_all --logdir=./tensorboard-log-dir
...
TensorBoard 2.5.0 at http://s???.uppmax.uu.se:6006/
./tensorboard-log-dir
as you wish. Each running job should have its own directory for TensorBoard to parse the logs correctly.
By default, TensorBoard will attach the port 6006
to localhost. If it's taken, it will chose another one. You may see the chosen port in the output when you start TensorBoard from the command line.
4. Accesss TensoBoard¶
Solution 1: Using ssh-tunnelling¶
Update the port if needed.Next, on your computer open a web browser and visit http://localhost:8080. You should now see the Tensorboard UI.
Solution 2: Using ThinLic¶
- Launch a web browser on ThinLinc.
- Visit
http://s???.uppmax.uu.se:6006
orlocalhost:8080
. Update the local port, in this example8080
, and the snowy nodes???
as needed.
Where to go from here?¶
Have a look at this getting started tutorial from TensorFlow: https://www.tensorflow.org/tensorboard/get_started.
Warning
Limit the memory needs of your application. What is a suitable TF_FORCE_GPU_ALLOW_GROWTH
? Read more on this and other memory growth pointers at https://www.tensorflow.org/guide/gpu.