Poor performance on real-world codebase #30

bgamari · 2018-07-04T16:58:58Z

I was looking into using Ward to lint GHC's runtime system, starting with simple lock checking. Unfortunately, even with no privileges defined and enforcement enabled for only a single file, the check runs for more than 10 minutes before sending my laptop with 32GB of RAM into swap-death. This seems a bit high for a 50kLoC codebase.

Checking each source file individually typically takes around 30 seconds per file. Is this the recommended strategy for non-small projects?

bgamari · 2018-07-04T22:02:03Z

For the record, I am now following the example of check-mono.sh and first producing call-maps of the sources and then using the compiler mode to check these maps. This is still slow, but much more bearable. I did a bit of profiling of the compiler mode and noticed that the callmap parser appears to be responsible for most of the time and allocations.

evincarofautumn · 2018-07-05T23:40:16Z

Yeah, this is a known issue, and honestly one of the reasons my work on this fizzled out before I left Microsoft—I found it hard to iterate on the project when it was so slow on a nontrivial codebase, and not obvious to me how to fix that.

The performance issues were rooted primarily in language-c—the entire AST is lazy and includes a large amount of detail & indirections. @lambdageek wrote the call map code as a workaround for this, so yes, this is the recommended approach. He might have a better idea of what’s up with the perf there, but I’ll look into it.

lambdageek · 2018-07-06T00:06:21Z

Hey, I kind of got distracted for a while, but one of the last things that I pushed to language-c about six months ago was NFData instances for all the syntax datatypes. That means we can finally do something about

Ward/src/Graph.hs, lines 74 to 76 in 05b02cf:

    -- Why can't we just deepseq tus'? Because language-c doesn't provide
    -- NFData instances :-(
    whnfList tus' `seq` CTranslUnit tus' firstLocation

There are a few other places in Graph.hs that use Data.Generics that could use some deepseq.

That ought to help with the memory usage if I interpreted the profiler output correctly.
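To illustrate the difference between forcing to weak head normal form and a full deepseq, here is a minimal sketch. It does not use language-c: `Decl`, `whnfSpine`, and `forceDecls` are hypothetical stand-ins invented for this example, and only `Control.DeepSeq` (from the deepseq package) and `Control.Exception` are real APIs.

```haskell
{-# LANGUAGE DeriveAnyClass #-}
{-# LANGUAGE DeriveGeneric  #-}

import Control.DeepSeq (NFData, force)
import Control.Exception (ErrorCall, evaluate, try)
import GHC.Generics (Generic)

-- Hypothetical stand-in for a syntax tree node; not a language-c type.
data Decl = Decl
  { declName :: String
  , declBody :: [Int]
  } deriving (Show, Generic, NFData)

-- Forces the list spine and each element to WHNF only. Fields inside
-- each Decl stay as unevaluated thunks.
whnfSpine :: [a] -> ()
whnfSpine = foldr seq ()

-- Fully evaluates every declaration, so no thunks are left behind to
-- retain whatever they reference.
forceDecls :: [Decl] -> IO [Decl]
forceDecls = evaluate . force

main :: IO ()
main = do
  let decls = [Decl "f" [1, 2], Decl "g" (error "unevaluated field")]
  -- WHNF forcing never touches the bad field.
  print (whnfSpine decls)
  -- Deep forcing evaluates every field, so the hidden thunk surfaces.
  result <- try (forceDecls decls) :: IO (Either ErrorCall [Decl])
  putStrLn (either (const "deep force reached the thunk")
                   (const "fully evaluated")
                   result)
```

With NFData instances available for the real syntax types, the same `force` call could in principle be applied to the translation units directly, though whether that helps memory usage in Ward would need to be confirmed by profiling.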

In terms of time, I'm sure there's some low-hanging fruit in the data representation, but after that we'll probably need to get smarter about the order in which we recompute the permissions on each iteration. Unfortunately I couldn't get the tests to pass when I tried rewriting the algorithm as a classic dataflow analysis, so I don't have a good grasp on how to reason about possible transformations.
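For readers unfamiliar with the idea, one common way to be smarter about recomputation order is a worklist: only revisit a function when something it depends on has changed. The sketch below is a generic illustration, not Ward's algorithm. The types `Fn` and `Perm`, the functions `invert` and `propagate`, and the simplified rule "a function needs whatever its callees need" are all assumptions made for this example.

```haskell
import           Data.Map.Strict (Map)
import qualified Data.Map.Strict as Map
import           Data.Set        (Set)
import qualified Data.Set        as Set

type Fn   = String  -- hypothetical function name
type Perm = String  -- hypothetical permission name

-- Reverse the call graph: map each callee to the set of its callers.
invert :: Map Fn (Set Fn) -> Map Fn (Set Fn)
invert callees = Map.fromListWith Set.union
  [ (callee, Set.singleton caller)
  | (caller, cs) <- Map.toList callees
  , callee <- Set.toList cs
  ]

-- Propagate permissions from callees to callers until a fixed point.
-- A function is re-examined only when one of its callees changed.
propagate :: Map Fn (Set Fn) -> Map Fn (Set Perm) -> Map Fn (Set Perm)
propagate callees direct = go initialWork direct
  where
    callers     = invert callees
    initialWork = Set.fromList (Map.keys callees ++ Map.keys direct)
    lookupSet k = Map.findWithDefault Set.empty k
    go work facts = case Set.minView work of
      Nothing        -> facts
      Just (f, rest) ->
        let old = lookupSet f facts
            new = Set.unions
                    (old : [ lookupSet c facts
                           | c <- Set.toList (lookupSet f callees) ])
        in if new == old
             then go rest facts
             -- f changed, so only its callers need another look.
             else go (rest `Set.union` lookupSet f callers)
                     (Map.insert f new facts)

main :: IO ()
main = do
  let callees = Map.fromList
        [ ("main", Set.fromList ["a"])
        , ("a",    Set.fromList ["b"])
        , ("b",    Set.fromList ["a"])  -- cycle between a and b
        ]
      direct = Map.fromList [("b", Set.fromList ["lock"])]
  print (Map.toList (propagate callees direct))
```

Because the sets only grow and are bounded, the loop terminates even on cyclic call graphs. Whether Ward's permission rules fit this monotone shape is exactly the open question raised above.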

bgamari · 2018-07-06T03:14:32Z

Thanks for adding a note to the readme, @evincarofautumn!

lambdageek · 2018-07-06T04:35:27Z

@bgamari

> I did a bit of profiling of the compiler mode and noticed that the callmap parser appears to be responsible for most of the time and allocations.

That's interesting.

For analyzing Mono, I saw the C parser + callmap generator taking a lot of time and memory, but the analysis run (parsing all the callmap files and running the global analysis) was relatively speedy.

bgamari · 2018-07-06T13:53:51Z

Well, to be clear, I was profiling compiler mode with only callgraph inputs, so the C parser didn't have much of a chance to show up. That being said, the majority of the time was spent parsing call graphs.

evincarofautumn added the bug label Jul 5, 2018

evincarofautumn added this to the 1.0 milestone Oct 27, 2019
