Separate finding modules from generating mutants? #329

sourcefrog · 2024-04-13T16:46:17Z

In 24.3, we do one pass over source files, parsing them using syn and identifying (1) other source files that we need to recurse into and (2) mutants from this source file.

Possibly these should be separated into separate passes that each walk over the syn AST:

Parse a file into an AST and find just the mod statements that point to other files we need to read. Queue those other files and repeat until we have all the source files loaded into memory.
Walk each source file and generate mutants from it.

That is to say the two fields that are currently in Discovered would be generated separately:

cargo-mutants/src/visit.rs

Lines 33 to 36 in 4424126

    
           pub struct Discovered { 
        
               pub mutants: Vec<Mutant>, 
        
               pub files: Vec<SourceFile>, 
        
           }

Why?

It would make the AST walk code somewhat simpler by separating concerns: not only less code in the visitor but also simpler interactions with the code that calls it
The mutant-generation code would be a pure function of a source tree already in memory
Probably it would be easier to test each part, e.g. we could write unit tests for mutations that just work off a string of source code, without needing a whole tree
It would probably also make it simpler to generate mutants or parse files in parallel on multiple threads.
Maybe this makes ownership simpler: the per-file mutant generator and all the mutants generated from it could reference and not outlive the source file they point into.

Why not?

We would walk each AST twice which would have some CPU cost.
Maybe it's just not worth it as a refactor.

Generally all the time taken walking the tree and generating mutants should be pretty trivial compared to the time taken actually running tests, so perhaps it's not important to make changes to optimize speed.

Maybe we should benchmark just --list --json on some large tree.

The text was updated successfully, but these errors were encountered:

sourcefrog added performance Making cargo mutants faster maybe Uncertain if this is a good idea, discuss before implementing internal Internal refactors and infrastructure labels Apr 13, 2024

sourcefrog mentioned this issue Apr 15, 2024

Follow path attribute on inline mod blocks #335

Closed

sourcefrog added the modules Bugs about following Rust `mod` statements label Apr 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Separate finding modules from generating mutants? #329

Separate finding modules from generating mutants? #329

sourcefrog commented Apr 13, 2024

Separate finding modules from generating mutants? #329

Separate finding modules from generating mutants? #329

Comments

sourcefrog commented Apr 13, 2024