Performance overhead of DebugProbesImpl for non-concurrent coroutines #3527

Closed
qwwdfsad opened this issue Nov 15, 2022 · 3 comments

@qwwdfsad (Member)

Snippet to reproduce:

// Imports assumed for the JVM test module; DebugTestBase comes from the
// kotlinx-coroutines-debug test sources.
import kotlinx.coroutines.*
import org.junit.Test

class DebugProbesTest : DebugTestBase() {
    // BFS-style traversal of a recursive node structure, exposed as a lazy Sequence.
    fun <Node> generateRecursiveSequence(initialSequence: Sequence<Node>, children: (Node) -> Sequence<Node>): Sequence<Node> {
        return sequence {
            val initialIterator = initialSequence.iterator()
            if (!initialIterator.hasNext()) {
                return@sequence
            }
            val visited = HashSet<Node>()
            val sequences = ArrayDeque<Sequence<Node>>()
            sequences.addLast(initialIterator.asSequence())
            while (sequences.isNotEmpty()) {
                val currentSequence = sequences.removeFirst()
                for (node in currentSequence) {
                    if (visited.add(node)) {
                        yield(node)
                        sequences.addLast(children(node))
                    }
                }
            }
        }
    }

    // Volatile sink so the generated values are consumed and the loop is not optimized away.
    @Volatile
    var a = 2
    @Test
    fun stressGenerateRecursive() {
        while (true) {
            runBlocking {
                repeat(8) {
                    launch(Dispatchers.Default) {
                        val seq = generateRecursiveSequence((1..100).asSequence()) {
                            (1..it).asSequence()
                        }

                        for (i in seq) {
                            a = i
                        }
                    }
                }
            }
        }
    }
}

Profile: [profiler screenshot omitted]

Up to 30% of the time is spent in updateState.
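For context: the updateState cost only appears once the debug probes are installed, which DebugTestBase presumably does in its setup. A minimal standalone setup that exercises the same code path could look roughly like the sketch below (assumed, not taken from the test base):

// Minimal standalone sketch (assumed; DebugTestBase normally handles this):
// installing the probes makes every coroutine creation/suspension go through
// DebugProbesImpl, which is where updateState shows up in the profile.
import kotlinx.coroutines.debug.DebugProbes

fun main() {
    DebugProbes.install()
    try {
        // run the coroutine-heavy workload from the snippet above
    } finally {
        DebugProbes.uninstall()
    }
}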

@qwwdfsad (Member, Author) commented Nov 15, 2022

It seems the ReentrantLock is a leftover from the pre-concurrent implementation.
It does incur non-trivial overhead while providing only a single benefit: a strongly consistent snapshot in read operations (dump/hierarchyToString, etc.). That guarantee is far too strong to justify such an overhead, and the proposed solution is simply to remove the lock.

The tricky part is ensuring properly ordered access to the shared memory locations everywhere.
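To illustrate the direction (a simplified sketch, not the actual DebugProbesImpl change): keep the per-coroutine registry in a concurrent collection so the hot path (coroutine creation and completion) never contends on a lock, while the cold read path settles for a weakly consistent snapshot.

// Simplified sketch of the lock-free approach (assumed, not DebugProbesImpl code):
// writers touch only a concurrent set; readers get a weakly consistent snapshot
// instead of a lock-protected, strongly consistent one.
import java.util.concurrent.ConcurrentHashMap

class CoroutineRegistrySketch<T : Any> {
    private val active = ConcurrentHashMap.newKeySet<T>()

    // Hot path: invoked on every coroutine creation/completion, lock-free.
    fun register(owner: T) { active.add(owner) }
    fun unregister(owner: T) { active.remove(owner) }

    // Cold path: dump/hierarchyToString-style reads; the snapshot is weakly
    // consistent -- concurrent updates may or may not be reflected.
    fun snapshot(): List<T> = active.toList()
}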

@maxmedvedev

Hey, when is this fix going to be released?

@qwwdfsad (Member, Author)

It's already been released as part of 1.7.0-Beta
