[jira] [Created] (IGNITE-14078) Deadlock on GridCacheSharedTtlCleanupManager#mgrs if cache is created when ttl cleanup is running

Anton Vinogradov (Jira)
Mirza Aliev created IGNITE-14078:
------------------------------------

             Summary: Deadlock on GridCacheSharedTtlCleanupManager#mgrs if cache is created when ttl cleanup is running
                 Key: IGNITE-14078
                 URL: https://issues.apache.org/jira/browse/IGNITE-14078
             Project: Ignite
          Issue Type: Bug
    Affects Versions: 2.9.1
            Reporter: Mirza Aliev


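The ttl-cleanup-worker thread runs its expiration work inside the remapping function passed to ConcurrentHashMap.computeIfPresent() and, while still holding the lock on the map node, tries to acquire the checkpoint read lock: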


{code:java}
Thread [name="ttl-cleanup-worker-#120%1%", id=225, state=WAITING, blockCnt=0, waitCnt=81486]
    Lock [object=java.util.concurrent.locks.ReentrantReadWriteLock$NonfairSync@35608c45, ownerName=null, ownerId=-1]
        at sun.misc.Unsafe.park(Native Method)
        at java.util.concurrent.locks.LockSupport.park(LockSupport.java:175)
        at java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:836)
        at java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireShared(AbstractQueuedSynchronizer.java:967)
        at java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireShared(AbstractQueuedSynchronizer.java:1283)
        at java.util.concurrent.locks.ReentrantReadWriteLock$ReadLock.lock(ReentrantReadWriteLock.java:727)
        at o.a.i.i.processors.cache.persistence.GridCacheDatabaseSharedManager.checkpointReadLock(GridCacheDatabaseSharedManager.java:1730)
        at o.a.i.i.processors.cache.IgniteCacheOffheapManagerImpl.expireInternal(IgniteCacheOffheapManagerImpl.java:1346)
        at o.a.i.i.processors.cache.IgniteCacheOffheapManagerImpl.expire(IgniteCacheOffheapManagerImpl.java:1323)
        at o.a.i.i.processors.cache.GridCacheTtlManager.expire(GridCacheTtlManager.java:242)
        at o.a.i.i.processors.cache.GridCacheSharedTtlCleanupManager$CleanupWorker.lambda$body$0(GridCacheSharedTtlCleanupManager.java:178)
        at o.a.i.i.processors.cache.GridCacheSharedTtlCleanupManager$CleanupWorker$$Lambda$619/1960552474.apply(Unknown Source)
        at java.util.concurrent.ConcurrentHashMap.computeIfPresent(ConcurrentHashMap.java:1769)
        - locked java.util.concurrent.ConcurrentHashMap$Node@4f66c754
        at o.a.i.i.processors.cache.GridCacheSharedTtlCleanupManager$CleanupWorker.body(GridCacheSharedTtlCleanupManager.java:177)
        at o.a.i.i.util.worker.GridWorker.run(GridWorker.java:119)
        at java.lang.Thread.run(Thread.java:748)
{code}


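Meanwhile, the exchange thread, which is registering the TTL manager of a newly started cache via ConcurrentHashMap.put(), is blocked on the same ConcurrentHashMap node, which is owned by ttl-cleanup-worker: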


{code:java}
Thread [name="exchange-worker-#93%1%", id=193, state=BLOCKED, blockCnt=8, waitCnt=1669]
    Lock [object=java.util.concurrent.ConcurrentHashMap$Node@4f66c754, ownerName=ttl-cleanup-worker-#120%1%, ownerId=225]
        at java.util.concurrent.ConcurrentHashMap.transfer(ConcurrentHashMap.java:2426)
        at java.util.concurrent.ConcurrentHashMap.addCount(ConcurrentHashMap.java:2288)
        at java.util.concurrent.ConcurrentHashMap.putVal(ConcurrentHashMap.java:1070)
        at java.util.concurrent.ConcurrentHashMap.put(ConcurrentHashMap.java:1006)
        at o.a.i.i.processors.cache.GridCacheSharedTtlCleanupManager.register(GridCacheSharedTtlCleanupManager.java:68)
        at o.a.i.i.processors.cache.GridCacheTtlManager.start0(GridCacheTtlManager.java:107)
        at o.a.i.i.processors.cache.GridCacheManagerAdapter.start(GridCacheManagerAdapter.java:49)
        at o.a.i.i.processors.cache.GridCacheProcessor.initCacheContext(GridCacheProcessor.java:2176)
        at o.a.i.i.processors.cache.GridCacheProcessor.prepareCacheContext(GridCacheProcessor.java:1964)
        at o.a.i.i.processors.cache.GridCacheProcessor.prepareCacheStart(GridCacheProcessor.java:1883)
        at o.a.i.i.processors.cache.GridCacheProcessor.lambda$prepareStartCaches$55a0e703$1(GridCacheProcessor.java:1758)
        at o.a.i.i.processors.cache.GridCacheProcessor$$Lambda$527/649205444.apply(Unknown Source)
        at o.a.i.i.processors.cache.GridCacheProcessor.lambda$prepareStartCachesIfPossible$14(GridCacheProcessor.java:1728)
        at o.a.i.i.processors.cache.GridCacheProcessor$$Lambda$526/1117407359.handle(Unknown Source)
        at o.a.i.i.processors.cache.GridCacheProcessor.prepareStartCaches(GridCacheProcessor.java:1755)
        at o.a.i.i.processors.cache.GridCacheProcessor.prepareStartCachesIfPossible(GridCacheProcessor.java:1726)
        at o.a.i.i.processors.cache.CacheAffinitySharedManager.processCacheStartRequests(CacheAffinitySharedManager.java:1005)
        at o.a.i.i.processors.cache.CacheAffinitySharedManager.onCacheChangeRequest(CacheAffinitySharedManager.java:891)
        at o.a.i.i.processors.cache.distributed.dht.preloader.GridDhtPartitionsExchangeFuture.onCacheChangeRequest(GridDhtPartitionsExchangeFuture.java:1459)
        at o.a.i.i.processors.cache.distributed.dht.preloader.GridDhtPartitionsExchangeFuture.init(GridDhtPartitionsExchangeFuture.java:967)
        at o.a.i.i.processors.cache.GridCachePartitionExchangeManager$ExchangeWorker.body0(GridCachePartitionExchangeManager.java:3376)
        at o.a.i.i.processors.cache.GridCachePartitionExchangeManager$ExchangeWorker.body(GridCachePartitionExchangeManager.java:3195)
        at o.a.i.i.util.worker.GridWorker.run(GridWorker.java:119)
        at java.lang.Thread.run(Thread.java:748)
{code}
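
The locking shape seen in the two dumps above can be reproduced outside Ignite with a minimal sketch. This is not the Ignite code: the class name ComputeLockDemo, the thread names, and the CountDownLatch standing in for the checkpoint read lock are all assumptions made for illustration. For simplicity the writer puts the same key, whereas in the dump the put() reached the locked node through a table resize (transfer).

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

public class ComputeLockDemo {
    /**
     * Returns the state observed for a thread calling put() while another
     * thread is parked inside the computeIfPresent() remapping function.
     */
    static Thread.State observeWriterState() throws InterruptedException {
        ConcurrentHashMap<Integer, String> mgrs = new ConcurrentHashMap<>();
        mgrs.put(1, "ttl-manager");

        CountDownLatch insideCompute = new CountDownLatch(1);
        // Hypothetical stand-in for the checkpoint read lock the cleaner waits on.
        CountDownLatch externalLock = new CountDownLatch(1);

        // Plays the role of ttl-cleanup-worker: waits on an external lock
        // while the map node is locked by computeIfPresent().
        Thread cleaner = new Thread(() -> mgrs.computeIfPresent(1, (k, v) -> {
            insideCompute.countDown();
            try {
                externalLock.await();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
            return v;
        }), "cleaner");

        // Plays the role of exchange-worker: a plain put() that needs the same node.
        Thread writer = new Thread(() -> mgrs.put(1, "new-manager"), "writer");

        cleaner.start();
        insideCompute.await();
        writer.start();

        // Poll until the writer is seen waiting for the node monitor, or give up after 5 s.
        Thread.State seen = writer.getState();
        long deadline = System.nanoTime() + TimeUnit.SECONDS.toNanos(5);
        while (seen != Thread.State.BLOCKED && System.nanoTime() < deadline) {
            Thread.sleep(10);
            seen = writer.getState();
        }

        // Release the stand-in lock so both threads can finish.
        externalLock.countDown();
        cleaner.join();
        writer.join();
        return seen;
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println("writer state while compute is parked: " + observeWriterState());
    }
}
```

In the sketch the external lock is eventually released, so both threads complete. If releasing it depended, directly or indirectly, on the writer making progress, neither thread could proceed. The ConcurrentHashMap Javadoc notes that update operations by other threads may be blocked while a computation is in progress, so the remapping function should be short and simple.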






--
This message was sent by Atlassian Jira
(v8.3.4#803005)