sinfo¶
sinfo
is a tool to view information about Slurm nodes and partitions.
How does that look like on Bianca?
[richel@sens2016001-bianca ~]$ sinfo
PARTITION AVAIL TIMELIMIT NODES STATE NODELIST
all down 10-00:00:0 204 drain* sens2016001-b[1-8,10-204,1178]
all down 10-00:00:0 89 unk* sens2016001-b[205-210,301-312,1073-1084,1119-1177]
all down 10-00:00:0 1 idle sens2016001-b9
node up 10-00:00:0 204 drain* sens2016001-b[1-8,10-204,1178]
node up 10-00:00:0 89 unk* sens2016001-b[205-210,301-312,1073-1084,1119-1177]
node up 10-00:00:0 1 idle sens2016001-b9
core* up 10-00:00:0 204 drain* sens2016001-b[1-8,10-204,1178]
core* up 10-00:00:0 89 unk* sens2016001-b[205-210,301-312,1073-1084,1119-1177]
core* up 10-00:00:0 1 idle sens2016001-b9
devel up 1:00:00 192 drain* sens2016001-b[10-200,1178]
devel up 1:00:00 71 unk* sens2016001-b[1073-1084,1119-1177]
devel up 1:00:00 1 idle sens2016001-b9
devcore up 1:00:00 192 drain* sens2016001-b[10-200,1178]
devcore up 1:00:00 71 unk* sens2016001-b[1073-1084,1119-1177]
devcore up 1:00:00 1 idle sens2016001-b9
Although it may seem unexpected that only 1 node is idle, this is the expected behavior from a virtual cluster: most physical nodes are not allocated to this project and hence unavailable.
How does that look like on Rackham?
[richel@rackham3 ~]$ sinfo
PARTITION AVAIL TIMELIMIT NODES STATE NODELIST
all down 10-00:00:0 22 comp r[2,36,66,68,94,110,112,132,139,163,185,200,206,216,247,281,288,293,319,326,418,481]
all down 10-00:00:0 10 plnd r[49-50,58-60,63,283-285,287]
all down 10-00:00:0 72 drain$ r[1001-1072]
all down 10-00:00:0 18 drain* r[167,175,186,252,258,318,431,437-438,440,455-462]
all down 10-00:00:0 45 down* r[13,23,57,99,108-109,122,165,177-184,187,218,254,331,423,432-436,439,441,452,463-470,479,483-484,1189-1190,1199,1212,1240]
all down 10-00:00:0 8 drain r[29,35,78,154,212,226,335,485]
all down 10-00:00:0 115 mix r[37-41,43,45-46,65,70-72,76-77,79,85,98,102,106,116,120,127-128,135-136,142,146,152-153,161,169,171-172,174,189,210-211,222,227,230-231,234,237,243,250,260,264,266,273,275-276,280,289,292,302,311,313-314,316-317,332-333,344,360-361,363-365,368,373,376,382,386-388,391,393-395,398,402-403,410,417,422,425,430,449,453,472-473,475-477,480,482,486,1180-1181,1203,1208,1210-1211,1217,1223,1227,1231,1235,1237,1239,1242-1246]
all down 10-00:00:0 317 alloc r[1,3,6,9,19,25-28,30,32-34,42,44,47-48,51-56,62,64,67,69,73-75,80-84,86-93,95-97,100-101,103-105,107,111,113-115,117,119,121,123-126,129-131,133-134,137-138,140-141,143,147-151,155-160,162,164,166,168,170,173,176,188,190-199,201-205,207-209,213-215,217,220-221,223-225,228-229,232-233,235-236,238-242,244-246,248-249,251,253,255-257,259,261-263,265,267-272,274,277,279,282,286,290-291,294-301,303-310,312,315,320-325,327-330,334,336-343,345-359,362,366-367,369-372,374-375,377-381,383-385,389-390,392,396-397,399-401,404-409,411-416,419-421,424,426-429,442-448,450-451,454,471,474,478,1179,1182-1188,1191-1198,1200-1202,1204-1207,1209,1213-1216,1218-1222,1224-1226,1228-1230,1232-1234,1236,1238,1241,1247-1250]
all down 10-00:00:0 13 idle r[8,10-12,14-18,20-22,24]
all down 10-00:00:0 10 down r[4-5,7,31,61,118,144-145,219,278]
core* up 10-00:00:0 21 comp r[36,66,68,94,110,112,132,139,163,185,200,206,216,247,281,288,293,319,326,418,481]
core* up 10-00:00:0 10 plnd r[49-50,58-60,63,283-285,287]
core* up 10-00:00:0 72 drain$ r[1001-1072]
core* up 10-00:00:0 18 drain* r[167,175,186,252,258,318,431,437-438,440,455-462]
core* up 10-00:00:0 41 down* r[57,99,108-109,122,165,177-184,187,218,254,331,423,432-436,439,441,452,463-470,479,1189-1190,1199,1212,1240]
core* up 10-00:00:0 5 drain r[35,78,154,212,226]
core* up 10-00:00:0 114 mix r[37-41,43,45-46,65,70-72,76-77,79,85,98,102,106,116,120,127-128,135-136,142,146,152-153,161,169,171-172,174,189,210-211,222,227,230-231,234,237,243,250,260,264,266,273,275-276,280,289,292,302,311,313-314,316-317,332-333,344,360-361,363-365,368,373,376,382,386-388,391,393-395,398,402-403,410,417,422,425,430,449,453,472-473,475-477,480,482,1180-1181,1203,1208,1210-1211,1217,1223,1227,1231,1235,1237,1239,1242-1246]
core* up 10-00:00:0 301 alloc r[33-34,42,44,47-48,51-56,62,64,67,69,73-75,80-84,86-93,95-97,100-101,103-105,107,111,113-115,117,119,121,123-126,129-131,133-134,137-138,140-141,143,147-151,155-160,162,164,166,168,170,173,176,188,190-199,201-205,207-209,213-215,217,220-221,223-225,228-229,232-233,235-236,238-242,244-246,248-249,251,253,255-257,259,261-263,265,267-272,274,277,279,282,286,290-291,294-301,303-310,312,315,320-325,327-330,334,340,342-343,345-359,362,366-367,369-372,374-375,377-381,383-385,389-390,392,396-397,399-401,404-409,411-416,419-421,424,426-429,442-448,450-451,454,471,474,478,1179,1182-1188,1191-1198,1200-1202,1204-1207,1209,1213-1216,1218-1222,1224-1226,1228-1230,1232-1234,1236,1238,1241,1247-1250]
core* up 10-00:00:0 6 down r[61,118,144-145,219,278]
node up 10-00:00:0 22 comp r[2,36,66,68,94,110,112,132,139,163,185,200,206,216,247,281,288,293,319,326,418,481]
node up 10-00:00:0 10 plnd r[49-50,58-60,63,283-285,287]
node up 10-00:00:0 18 drain* r[167,175,186,252,258,318,431,437-438,440,455-462]
node up 10-00:00:0 38 down* r[13,23,57,99,108-109,122,165,177-184,187,218,254,331,423,432-436,439,441,452,463-470,479]
node up 10-00:00:0 7 drain r[29,35,78,154,212,226,335]
node up 10-00:00:0 96 mix r[37-41,43,45-46,65,70-72,76-77,79,85,98,102,106,116,120,127-128,135-136,142,146,152-153,161,169,171-172,174,189,210-211,222,227,230-231,234,237,243,250,260,264,266,273,275-276,280,289,292,302,311,313-314,316-317,332-333,344,360-361,363-365,368,373,376,382,386-388,391,393-395,398,402-403,410,417,422,425,430,449,453,472-473,475-477,480,482]
node up 10-00:00:0 268 alloc r[1,3,6,9,19,25-28,30,32-34,42,44,47-48,51-56,62,64,67,69,73-75,80-84,86-93,95-97,100-101,103-105,107,111,113-115,117,119,121,123-126,129-131,133-134,137-138,140-141,143,147-151,155-160,162,164,166,168,170,173,176,188,190-199,201-205,207-209,213-215,217,220-221,223-225,228-229,232-233,235-236,238-242,244-246,248-249,251,253,255-257,259,261-263,265,267-272,274,277,279,282,286,290-291,294-301,303-310,312,315,320-325,327-330,334,336-343,345-359,362,366-367,369-372,374-375,377-381,383-385,389-390,392,396-397,399-401,404-409,411-416,419-421,424,426-429,442-448,450-451,454,471,474,478]
node up 10-00:00:0 13 idle r[8,10-12,14-18,20-22,24]
node up 10-00:00:0 10 down r[4-5,7,31,61,118,144-145,219,278]
devel up 1:00:00 2 down* r[483-484]
devel up 1:00:00 1 drain r485
devel up 1:00:00 1 mix r486
devcore up 1:00:00 2 down* r[483-484]
devcore up 1:00:00 1 drain r485
devcore up 1:00:00 1 mix r486