0

I have one application which runs 24x7 in my system. But some how it was killed abruptly. I observe it 2 or 3 times from last 10 days.

Now I want to find from how much time my application is stopped. So I can notify it and able to find bug from application. And also it will help me to create cronjob.

ravibhuva9955
  • 247
  • 1
  • 11

1 Answers1

1

I'd recommend atop with it's service atopsar. It monitors start and stop time of processes, besides disk usage and (via an extra service) network activity.

atopsar monitors your processes on a regular interval (e.g. 5 minutes) and logs that to a file. You can open that file afterwards and step through the history, showing all process details values like CPU and memory usage. Maybe this will provide you hints why your program crashed.

Also make sure that your /etc/security/limits.conf is propperly configured so that you get a core dump. This gives you something to debug and a timestamp.

trapicki
  • 619