Zookeeper mass timeouts

Dmitriy Pavlov
Hi,

First of all, I apologize for the mass emails that came from the ZooKeeper
timeout failures.

Both the failures and the bot should be investigated. I believe we can improve
the bot notification rules to avoid mass emails in case of flaky timeouts.

I believe it is a better solution to perfect the rules instead of creating
a separate channel.

Sincerely,
Dmitriy Pavlov

Re: Zookeeper mass timeouts

Alexey Goncharuk
Dmitriy,

The ZooKeeper timeouts were caused by my commit (I looked at the wrong
PR when I was checking tests); it has already been reverted from master.

As for the bot, does it already have logic to detect continuous
timeouts and send a notification only after a successful run? If not, I guess
we should put it on our helper roadmap, because this will not be the last
timeout change.
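
A minimal sketch of such a rule, for illustration only. It assumes that
"only after a successful run" means a timeout is reported only when the
previous run succeeded, so a streak of continuous timeouts produces a single
email. The names `RunStatus`, `should_notify` and `notifications` are
hypothetical and are not taken from the bot's code.

```python
from enum import Enum


class RunStatus(Enum):
    """Outcome of one test suite run (hypothetical model, not the bot's)."""
    SUCCESS = "success"
    TIMEOUT = "timeout"


def should_notify(previous, current):
    """Return True only for a timeout that directly follows a successful run."""
    return current is RunStatus.TIMEOUT and previous is RunStatus.SUCCESS


def notifications(history):
    """Yield the indexes of runs in `history` that would trigger an email.

    Assumption: the run before the start of `history` was successful.
    """
    previous = RunStatus.SUCCESS
    for index, current in enumerate(history):
        if should_notify(previous, current):
            yield index
        previous = current


# Three timeouts in a row produce one email; a new streak produces another.
history = [
    RunStatus.SUCCESS,
    RunStatus.TIMEOUT,
    RunStatus.TIMEOUT,
    RunStatus.TIMEOUT,
    RunStatus.SUCCESS,
    RunStatus.TIMEOUT,
]
notified = list(notifications(history))
```

With this rule the bot stays silent while a timeout keeps repeating and
speaks up again only once a green run has been seen in between.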

--AG

On Sun, Sep 2, 2018 at 10:11, Dmitriy Pavlov <[hidden email]> wrote:

> Hi,
>
> First of all, I apologize for the mass emails that came from the ZooKeeper
> timeout failures.
>
> Both the failures and the bot should be investigated. I believe we can improve
> the bot notification rules to avoid mass emails in case of flaky timeouts.
>
> I believe it is a better solution to perfect the rules instead of creating
> a separate channel.
>
> Sincerely,
> Dmitriy Pavlov
>