Yakov Zhdanov created IGNITE-5155:
-------------------------------------
Summary: Need to improve stats dump on exchange timeout
Key: IGNITE-5155
URL:
https://issues.apache.org/jira/browse/IGNITE-5155 Project: Ignite
Issue Type: Improvement
Reporter: Yakov Zhdanov
Assignee: Stanilovsky Evgeny
Fix For: 2.1
Currently, on large topologies info dumped on "Failed to wait for partition map exchange" (org/apache/ignite/internal/processors/cache/GridCachePartitionExchangeManager.java:1713) floods the log and we need to reduce information dumped.
1. Reduce output for exchange futures that are already done. Keep event, topology version, servers count, clients count (more?)
2. Do not dump the whole communication stats, but send message to exchange coordinator, ask for its status and for number of messages received and for acked messages from local node.
3. we can think of sending new message from cache node to coordinator that may be a sign of a problem on that node (e.g. unreleased tx locks or still renting partitions) and coordinator may include this info to a status thus every Ignite node may point to a problem node in the logs.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)