Semen Boikov created IGNITE-882:
-----------------------------------
Summary: Node can join twice with the same ID
Key: IGNITE-882
URL:
https://issues.apache.org/jira/browse/IGNITE-882 Project: Ignite
Issue Type: Bug
Components: general
Reporter: Semen Boikov
Assignee: Yakov Zhdanov
Priority: Critical
Fix For: sprint-5
Observed in the test 'GridCacheColocatedFailoverSelfTest.testOptimisticRepeatableReadTxConstantTopologyChange':
Node joined:
{noformat}
[15:53:24,163][INFO ][disco-event-worker-#121%dht.GridCacheColocatedFailoverSelfTest0%][GridDiscoveryManager] Added new node to topology: TcpDiscoveryNode [id=10cf7906-50af-4f46-9c31-baf419539001, addrs=[127.0.0.1], sockAddrs=[/127.0.0.1:47525], discPort=47525, order=400, intOrder=202, loc=false, ver=1.0.3#19700101-sha1:00000000, isClient=false]
{noformat}
Node failed:
{noformat}
[15:53:24,171][WARN ][disco-event-worker-#121%dht.GridCacheColocatedFailoverSelfTest0%][GridDiscoveryManager] Node FAILED: TcpDiscoveryNode [id=10cf7906-50af-4f46-9c31-baf419539001, addrs=[127.0.0.1], sockAddrs=[/127.0.0.1:47525], discPort=47525, order=400, intOrder=202, loc=false, ver=1.0.3#19700101-sha1:00000000, isClient=false]
{noformat}
This see this message from the thread starting new node:
{noformat}
[15:53:29,047][WARN ][topology-change-thread-1][TcpDiscoverySpi] Node has not been connected to topology and will repeat join process. Check remote nodes logs for possible error messages. Note that large topology may require significant time to start. Increase 'TcpDiscoverySpi.networkTimeout' configuration property if getting this message on the starting nodes [networkTimeout=5000]
{noformat}
Node joined again with the same ID:
{noformat}
[15:53:29,212][INFO ][disco-event-worker-#121%dht.GridCacheColocatedFailoverSelfTest0%][GridDiscoveryManager] Added new node to topology: TcpDiscoveryNode [id=10cf7906-50af-4f46-9c31-baf419539001, addrs=[127.0.0.1], sockAddrs=[/127.0.0.1:47525], discPort=47525, order=404, intOrder=205, loc=false, ver=1.0.3#19700101-sha1:00000000, isClient=false]
{noformat}
Then test hangs (in the log I see that future mapped on the node '10cf7906-50af-4f46-9c31-baf419539001' did not finish).
The same issue observed in tests extending GridCacheAbstractNodeRestartSelfTest.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)