Node usage graph: Difference between revisions
No edit summary |
No edit summary |
||
Line 3: | Line 3: | ||
Example: | Example: | ||
<source lang="text"> | <source lang="text"> | ||
[user@nfs01 ~]# /cm/shared/apps/accounting/ | [user@nfs01 ~]# /cm/shared/apps/accounting/node_reserved_usage_graph | ||
node: |0% | node: |0% 100%| | ||
fat001: | fat001: DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD | ||
fat002: | DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD | ||
node001: | fat002: CCCCCCCCC | ||
node002: | MMMMMmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm | ||
node003: | node001: | ||
node004: | |||
node005: | node002:cccccccccc | ||
node006: | MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMmmmmmmmmmm | ||
node007: | node003:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC | ||
node008: | MM | ||
node009: | node004:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC | ||
node010: | M | ||
node011: | node005:CCCCCCCCCC | ||
node012: | |||
node013: | node006:CCCCCCCCCC | ||
node014: | |||
node015: | node007:CCCCCCCCCC | ||
node016: | |||
node017: | node008:CCCCCCCCCCccccc | ||
node018: | MMMMMMMMMMMMMMMMMMMMM | ||
node019: | node009:cccccccccc | ||
node020: | MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMM | ||
node021: | node010: | ||
node022: | |||
node023: | node011: | ||
node024: | |||
node025: | node012:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC | ||
node026: | M | ||
node027: | node013: | ||
node028: | |||
node029: | node014: | ||
node030: | |||
node031: | node015:CCCCC | ||
node032: | MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm | ||
node033: | node016:CCCCCCCCCCCCCCCCCCCCC | ||
node034: | MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm | ||
node035: | node017: | ||
node036: | |||
node037: | node018: | ||
node038: | |||
node039: | node019:CCCCC | ||
node040: | MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm | ||
node041: | node020: | ||
node042: | |||
node049: | node021: | ||
node050: | |||
node051: | node022: | ||
node052: | |||
node053: | node023: | ||
node054: | |||
node024:CCCCCCCCCCCCCCC | |||
node025:CCCCCCCCCCCCCCCCCCCCC | |||
node026:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC | |||
node027:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC | |||
node028:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC | |||
MMM | |||
node029: | |||
node030: | |||
node031: | |||
node032: | |||
node033: | |||
node034: | |||
node035: | |||
node036: | |||
node037: | |||
node038: | |||
node039: | |||
node040:DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD | |||
DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD | |||
node041:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCcccccc | |||
MMMMMMMMMMMMMMMMMMMMMM | |||
node042:RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR | |||
RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR | |||
node049:DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD | |||
DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD | |||
node050:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC | |||
M | |||
node051: | |||
node052:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC | |||
MMMMMMmmmmmmmmmmmmmmm | |||
node053:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC | |||
M | |||
node054:DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD | |||
DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD | |||
</source> | </source> | ||
This gives an overview of the current per-node resource usage. It cannot however give you an indication of how much the queue is right now for any node. | This gives an overview of the current per-node resource usage. There are four types of letter: | ||
* M: Memory reserved and in use | |||
* m: Memory reserved and not in use | |||
* C: CPU reserved and in use | |||
* c: CPU reserved and not in use | |||
* D: Drained node (not available for submission for some adminstrative reason | |||
* R: Reserved node | |||
It cannot however give you an indication of how much the queue is right now for any node. for that, squeue is a better resource. |
Revision as of 09:51, 4 September 2017
There is a graphing tool that uses elements directly from sacct to display information about the current cluster usage, node_usage_graph (located at /cm/shared/apps/accounting/node_usage_graph ).
Example: <source lang="text"> [user@nfs01 ~]# /cm/shared/apps/accounting/node_reserved_usage_graph node: |0% 100%| fat001: DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
fat002: CCCCCCCCC
MMMMMmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm
node001:
node002:cccccccccc
MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMmmmmmmmmmm
node003:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
MM
node004:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
M
node005:CCCCCCCCCC
node006:CCCCCCCCCC
node007:CCCCCCCCCC
node008:CCCCCCCCCCccccc
MMMMMMMMMMMMMMMMMMMMM
node009:cccccccccc
MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMM
node010:
node011:
node012:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
M
node013:
node014:
node015:CCCCC
MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm
node016:CCCCCCCCCCCCCCCCCCCCC
MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm
node017:
node018:
node019:CCCCC
MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMMmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm
node020:
node021:
node022:
node023:
node024:CCCCCCCCCCCCCCC
node025:CCCCCCCCCCCCCCCCCCCCC
node026:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
node027:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
node028:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
MMM
node029:
node030:
node031:
node032:
node033:
node034:
node035:
node036:
node037:
node038:
node039:
node040:DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
node041:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCcccccc
MMMMMMMMMMMMMMMMMMMMMM
node042:RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR
node049:DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
node050:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
M
node051:
node052:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
MMMMMMmmmmmmmmmmmmmmm
node053:CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
M
node054:DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD
</source>
This gives an overview of the current per-node resource usage. There are four types of letter:
- M: Memory reserved and in use
- m: Memory reserved and not in use
- C: CPU reserved and in use
- c: CPU reserved and not in use
- D: Drained node (not available for submission for some adminstrative reason
- R: Reserved node
It cannot however give you an indication of how much the queue is right now for any node. for that, squeue is a better resource.