sungridengine - Purging Dead Nodes from SGE -
my qstat -g c
indicates have dead nodes (formally 'cdsue
'):
cluster queue cqload used res avail total aoacds cdsue -------------------------------------------------------------------------------- all.q 0.11 18 0 9 37 0 10
is there easy way purge or remove these nodes queue?
sge smart enough not allocate work them clutter various displays.
i hardway.
- kill jobs "running" or stuck on dead nodes.
- run qconf remove node pipeline
-
qconf -dattr hostgroup hostlist <nodealias> @allhosts' qconf -purge queue slots all.q@<nodealias> qconf -dconf <nodealias> qconf -de <nodealias>
Comments
Post a Comment