[jira] [Created] (IGNITE-6587) Ignite watchdog service

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (IGNITE-6587) Ignite watchdog service

Anton Vinogradov (Jira)
Alexey Goncharuk created IGNITE-6587:
----------------------------------------

             Summary: Ignite watchdog service
                 Key: IGNITE-6587
                 URL: https://issues.apache.org/jira/browse/IGNITE-6587
             Project: Ignite
          Issue Type: Improvement
          Components: general
    Affects Versions: 2.2
            Reporter: Alexey Goncharuk
             Fix For: 2.4


We need to come up with a 'watchdog service' to monitor for Ignite node local health and kill the process under some critical conditions.
For example, if one of the mission-critical Ignite threads die, the Ignite node must be stopped.
At the first glance, the list of critical threads is:
All TCP discovery threads
All communication NIO threads (acceptor and workers)
Exchange worker
Striped pool threads
Timeout Worker
Checkpointer
WAL archiver

The mechanism should support pluggable components so that self-check can be extended via plugins.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)