Author Topic: Linux watchdog handler  (Read 1268 times)

0 Members and 1 Guest are viewing this topic.

Offline andreea9322

  • Jr. Member
  • **
  • Posts: 2
  • Karma: 0
  • I've just joined!
    • View Profile
    • Awards
Linux watchdog handler
« on: January 21, 2015, 08:59:36 am »
Hi all,

I’m interested in monitoring the processes running in a Linux system and determining when they are stuck/running endlessly  very quickly.
Once I determine this, I also want to take on some actions (like dumping some debug info, restarting the process, etc..).

I know I can detect stuck processes using systemd, but unfortunately I wasn’t able to take action (where can I specify a script that I want to run when some process heartbeats are missed ?)

Are you aware about other tools that act like watchdog monitors ?
(processes can register to them, start sending heartbeats, and in case some heartbeats are missed, the tools takes some actions.

I am aware I can write my own tool – I just want to know if there’s anything else offering this functionality.

Thank you,
Andreea

Offline Mark Greaves (PCNetSpec)

  • Administrator
  • Hero Member
  • *****
  • Posts: 18231
  • Karma: 476
  • Gender: Male
  • "-rw-rw-rw-" .. The Number Of The Beast
    • View Profile
    • PCNetSpec
    • Awards
Re: Linux watchdog handler
« Reply #1 on: January 21, 2015, 09:16:41 am »
monit

Quote
monit is a utility for monitoring and managing daemons or similar
programs running on a Unix system. It will start specified programs
if they are not running and restart programs not responding.

monit supports:
 * Daemon mode - poll programs at a specified interval
 * Monitoring modes - active, passive or manual
 * Start, stop and restart of programs
 * Group and manage groups of programs
 * Process dependency definition
 * Logging to syslog or own logfile
 * Configuration - comprehensive controlfile
 * Runtime and TCP/IP port checking (tcp and udp)
 * SSL support for port checking
 * Unix domain socket checking
 * Process status and process timeout
 * Process cpu usage
 * Process memory usage
 * Process zombie check
 * Check the systems load average
 * Check a file or directory timestamp
 * Alert, stop or restart a process based on its characteristics
 * MD5 checksum for programs started and stopped by monit
 * Alert notification for program timeout, restart, checksum, stop
   resource and timestamp error
 * Flexible and customizable email alert messages
 * Protocol verification. HTTP, FTP, SMTP, POP, IMAP, NNTP, SSH, DWP,
   LDAPv2 and LDAPv3
 * An http interface with optional SSL support to make monit
   accessible from a webbrowser

or maybe m/monit if you want a pretty UI
https://mmonit.com/
WARNING: You are logged into reality as 'root'
logging in as 'insane' is the only safe option.
pcnetspec.co.uk

 


SimplePortal 2.3.3 © 2008-2010, SimplePortal