Vladislav Pyatkov created IGNITE-8710:
-----------------------------------------
Summary: Applying WAL takes a long time or fails entirely when *.wal files have been removed
Key: IGNITE-8710
URL: https://issues.apache.org/jira/browse/IGNITE-8710
Project: Ignite
Issue Type: Bug
Reporter: Vladislav Pyatkov
In specific cases, when *.wal files have been removed or WAL directories have been unmounted, we get warning messages like the following on start:
{noformat}
2018-06-02 12:10:06.127[INFO ][Thread-100][o.a.i.i.p.c.p.GridCacheDatabaseSharedManager] Checking memory state [lastValidPos=FileWALPointer [idx=0, fileOff=0, len=0], lastMarked=FileWALPointer [idx=0, fileOff=0, len=0], lastCheckpointId=00000000-0000-0000-0000-000000000000]
2018-06-02 12:10:06.546[WARN ][Thread-100][o.a.i.i.p.c.p.GridCacheDatabaseSharedManager] Found unexpected checkpoint marker, skipping [cpId=94b5ce03-87b7-489e-b08b-b4c5dc522bd5, expCpId=00000000-0000-0000-0000-000000000000, pos=FileWALPointer [idx=0, fileOff=44266869, len=977]]
2018-06-02 12:10:57.860[WARN ][Thread-100][o.a.i.i.p.c.p.GridCacheDatabaseSharedManager] Found unexpected checkpoint marker, skipping [cpId=3f6ab238-23f7-4924-b4ef-0cb68d914a04, expCpId=00000000-0000-0000-0000-000000000000, pos=FileWALPointer [idx=7, fileOff=872888269, len=460112]]
2018-06-02 12:11:46.600[INFO ][Thread-100][o.a.i.i.p.c.p.w.FileWriteAheadLogManager] Stopping WAL iteration due to an exception: EOF at position [1073741824] expected to read [1] bytes, ptr=FileWALPointer [idx=15, fileOff=1073741824, len=0]
2018-06-02 12:12:21.181[WARN ][Thread-100][o.a.i.i.p.c.p.GridCacheDatabaseSharedManager] Found unexpected checkpoint marker, skipping [cpId=3fe33806-ee11-49b7-8c47-648cd1adacbc, expCpId=00000000-0000-0000-0000-000000000000, pos=FileWALPointer [idx=23, fileOff=693360866, len=460112]]
{noformat}
The attempt to recover from the WAL then hangs for a long time and does not succeed.
Instead, the node should stop and print a message saying that the necessary WAL files were not found.
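
A minimal sketch of the kind of fail-fast check proposed here, not Ignite's actual implementation. The class WalSegmentCheck, its methods and the exception message are hypothetical. The file naming scheme (a 16-digit zero-padded segment index followed by ".wal") is an assumption, as is the idea that the caller already knows the first and last segment index needed for recovery.

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical helper, not part of Ignite: detects gaps in a sequence of WAL segment files. */
final class WalSegmentCheck {
    /** Assumed naming scheme: 16-digit zero-padded segment index followed by ".wal". */
    private static final Pattern WAL_NAME = Pattern.compile("(\\d{16})\\.wal");

    private WalSegmentCheck() {
        // Static helpers only.
    }

    /** Returns the segment indexes in [firstIdx, lastIdx] for which no file name is present. */
    static List<Long> findMissingSegments(Collection<String> fileNames, long firstIdx, long lastIdx) {
        Set<Long> present = new HashSet<>();

        for (String name : fileNames) {
            Matcher m = WAL_NAME.matcher(name);

            // Names that do not look like WAL segments are ignored.
            if (m.matches())
                present.add(Long.parseLong(m.group(1)));
        }

        List<Long> missing = new ArrayList<>();

        for (long idx = firstIdx; idx <= lastIdx; idx++) {
            if (!present.contains(idx))
                missing.add(idx);
        }

        return missing;
    }

    /** Fails with a descriptive message instead of starting a recovery that cannot complete. */
    static void ensureSegmentsPresent(Collection<String> fileNames, long firstIdx, long lastIdx) {
        List<Long> missing = findMissingSegments(fileNames, firstIdx, lastIdx);

        if (!missing.isEmpty())
            throw new IllegalStateException(
                "Necessary WAL files not found, stopping node [missingIdx=" + missing + ']');
    }
}
```

A caller could pass in the names found in the WAL directory together with the index range between the last checkpoint and the newest segment. Reporting every missing index in one message, rather than failing at the first gap, makes it easier to tell a single deleted file from an unmounted directory.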
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)