Hi! I am aware of tools like top, htop, atop, and sar that can be used to monitor usage. The *top programs seem to only do any reporting in real time, while the latter sar tool can provide historical usage data only (as percentage by CPU).

The problem that I am trying to get information on is what processes are running, and their stats, at times when the system is unresponsive (making the *top programs impossible to use).

What is the best way to log process stats in real time so when the system becomes unresponsive and requires a reboot, we can go and look to see what state the system was in to hopefully troubleshoot what causes the system to become unresponsive?

Thank you!

  • frongt@lemmy.zip
    link
    fedilink
    arrow-up
    1
    ·
    9 hours ago

    Kernel dumps? I doubt that any monitoring agent would be any more responsive than what you’ve already listed.