Node usage graph: Difference between revisions
Jump to navigation
Jump to search
Created page with "There is a graphing tool that uses elements directly from sacct to display information about the current cluster usage, node_usage_graph (located at /cm/shared/apps/accounting..." |
you first need to load legacy before loading anunna |
||
| (5 intermediate revisions by 4 users not shown) | |||
| Line 1: | Line 1: | ||
There is a graphing tool that uses elements directly from sacct to display information about the current cluster usage, node_usage_graph (located at | There is a graphing tool that uses elements directly from sacct to display information about the current cluster usage, node_usage_graph (located at in the anunna module ). | ||
Example: | Example: | ||
< | <pre> | ||
[user@ | [user@login0 ~]# module load legacy | ||
node: |0% | [user@login0 ~]# module load anunna | ||
fat001: | [user@login0 ~]# usage_graph | ||
fat002: | node: |0% 100%| | ||
node001: | fat001: DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD | ||
node002: | DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD | ||
node003: | fat002: CCCCCCCCC | ||
node004: | MMMMMmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm | ||
node005: | node001: | ||
node006: | |||
node007: | node002:cccccccccc | ||
node008: | MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMmmmmmmmmmm | ||
node009: | node003:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC | ||
node010: | MM | ||
node011: | node004:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC | ||
node012: | M | ||
node013: | node005:CCCCCCCCCC | ||
node014: | |||
node015: | node006:CCCCCCCCCC | ||
node016: | |||
node017: | node007:CCCCCCCCCC | ||
node018: | |||
node019: | node008:CCCCCCCCCCccccc | ||
node020: | MMMMMMMMMMMMMMMMMMMMM | ||
node021: | node009:cccccccccc | ||
node022: | MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMM | ||
node023: | node010: | ||
node024: | |||
node025: | node011: | ||
node026: | |||
node027: | node012:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC | ||
node028: | M | ||
node029: | node013: | ||
node030: | |||
node031: | node014: | ||
node032: | |||
node033: | node015:CCCCC | ||
node034: | MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm | ||
node035: | node016:CCCCCCCCCCCCCCCCCCCCC | ||
node036: | MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm | ||
node037: | node017: | ||
node038: | |||
node039: | node018: | ||
node040: | |||
node041: | node019:CCCCC | ||
node042: | MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm | ||
node049: | node020: | ||
node050: | |||
node051: | node021: | ||
node052: | |||
node053: | node022: | ||
node054: | |||
</ | node023: | ||
node024:CCCCCCCCCCCCCCC | |||
node025:CCCCCCCCCCCCCCCCCCCCC | |||
node026:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC | |||
node027:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC | |||
node028:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC | |||
MMM | |||
node029: | |||
node030: | |||
node031: | |||
node032: | |||
node033: | |||
node034: | |||
node035: | |||
node036: | |||
node037: | |||
node038: | |||
node039: | |||
node040:DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD | |||
DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD | |||
node041:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCcccccc | |||
MMMMMMMMMMMMMMMMMMMMMM | |||
node042:RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR | |||
RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR | |||
node049:DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD | |||
DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD | |||
node050:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC | |||
M | |||
node051: | |||
node052:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC | |||
MMMMMMmmmmmmmmmmmmmmm | |||
node053:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC | |||
M | |||
node054:DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD | |||
DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD | |||
</pre> | |||
This gives an overview of the current per-node resource usage. It cannot however give you an indication of how much the queue is right now for any node. | This gives an overview of the current per-node resource usage. There are four types of letter: | ||
* M: Memory reserved and in use | |||
* m: Memory reserved and not in use | |||
* C: CPU reserved and in use | |||
* c: CPU reserved and not in use | |||
* D: Drained node (not available for job submission) | |||
* R: Reserved node | |||
* P: Node is powered off (for energy-saving) | |||
It cannot however give you an indication of how much the queue is right now for any node. for that, squeue is a better resource. | |||
Latest revision as of 09:21, 8 April 2025
There is a graphing tool that uses elements directly from sacct to display information about the current cluster usage, node_usage_graph (located at in the anunna module ).
Example:
[user@login0 ~]# module load legacy
[user@login0 ~]# module load anunna
[user@login0 ~]# usage_graph
node: |0% 100%|
fat001: DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
fat002: CCCCCCCCC
MMMMMmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm
node001:
node002:cccccccccc
MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMmmmmmmmmmm
node003:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
MM
node004:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
M
node005:CCCCCCCCCC
node006:CCCCCCCCCC
node007:CCCCCCCCCC
node008:CCCCCCCCCCccccc
MMMMMMMMMMMMMMMMMMMMM
node009:cccccccccc
MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMM
node010:
node011:
node012:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
M
node013:
node014:
node015:CCCCC
MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm
node016:CCCCCCCCCCCCCCCCCCCCC
MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm
node017:
node018:
node019:CCCCC
MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm
node020:
node021:
node022:
node023:
node024:CCCCCCCCCCCCCCC
node025:CCCCCCCCCCCCCCCCCCCCC
node026:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
node027:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
node028:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
MMM
node029:
node030:
node031:
node032:
node033:
node034:
node035:
node036:
node037:
node038:
node039:
node040:DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
node041:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCcccccc
MMMMMMMMMMMMMMMMMMMMMM
node042:RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
node049:DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
node050:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
M
node051:
node052:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
MMMMMMmmmmmmmmmmmmmmm
node053:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
M
node054:DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
This gives an overview of the current per-node resource usage. There are four types of letter:
- M: Memory reserved and in use
- m: Memory reserved and not in use
- C: CPU reserved and in use
- c: CPU reserved and not in use
- D: Drained node (not available for job submission)
- R: Reserved node
- P: Node is powered off (for energy-saving)
It cannot however give you an indication of how much the queue is right now for any node. for that, squeue is a better resource.