
Support incremental analysis #322

Closed
simondel opened this issue Jun 12, 2017 · 17 comments
Labels
🚀 Feature request New feature request help wanted Extra attention is needed

@simondel
Member

The mutation testing process could be sped up by only testing changed mutants.

We could argue that a mutant should not have to be tested if:

  • It was killed in the previous run
  • And the test(s) that killed it still exists
  • And the same mutant was generated (same mutator, same file, same columns, same original code, same mutated code)

I would not advise running incremental analysis during your build, because it will probably not be flawless. One flaw of my proposed setup is that the test(s) that killed the mutant in the previous run could have changed. Right now we have no good way of knowing whether they changed.
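The three criteria above amount to a lookup keyed on the mutant's identity. A minimal sketch (all names here are hypothetical, not part of Stryker's actual API):

```javascript
// Hypothetical sketch of the proposed check: a mutant can reuse its previous
// "killed" result only if the same mutant was generated again AND every test
// that killed it still exists. None of these names are Stryker's real API.
function mutantKey(m) {
  // "Same mutant" = same mutator, same file, same location,
  // same original code, same mutated code.
  return [m.mutatorName, m.fileName, m.range.join(':'), m.original, m.replacement].join('|');
}

function canSkip(mutant, previousRun, currentTestNames) {
  const prev = previousRun.get(mutantKey(mutant));
  return Boolean(
    prev &&
    prev.status === 'Killed' &&
    prev.killedBy.every(name => currentTestNames.has(name))
  );
}
```

As the comment above notes, this cannot detect that a killing test was *edited* while keeping its name, which is exactly the flaw described.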

@simondel simondel added 🚀 Feature request New feature request help wanted Extra attention is needed labels Jun 12, 2017
@j-truax

j-truax commented Jun 28, 2017

With Karma using Chrome, I am seeing scan times for ~3,500 mutants take ~5 hours. Using PhantomJS decreases scan times by ~50%. Incremental analysis, while not perfect, would go a long way toward improving performance while still killing a majority of the generated mutants.

@jmpresso
Contributor

jmpresso commented Sep 4, 2017

Yeah, this would be pretty crucial. Stryker works well in my very small apps. My main prod app has ~1k unit tests. I had to reduce runners to 6 to allow it to run, and I think after a few hours it was less than 50% done.

You could probably go two ways with this one: just running on changes from that commit, or using a history file like PiTest for Java.

@nicojs
Member

nicojs commented Sep 5, 2017

Yeah, this would be pretty crucial. Stryker works well in my very small apps. My main prod app has ~1k unit tests. I had to reduce runners to 6 to allow it to run, and I think after a few hours it was less than 50% done.

Wow, this is cool. Are you using coverage analysis for the running of the tests? If so, is it OK to share some statistics? How much faster is it with/without it? Also, #366 could make a big difference for these kinds of projects.

With Karma using Chrome, I am seeing scan times for ~3,500 mutants take ~5 hours. Using PhantomJS decreases scan times by ~50%

What exactly do you mean by "scan times"? Would that be the initial test run, by any chance?

Another way of solving this is with sharding. You could create a build per file to be mutated. You could share the same stryker.conf.js file, but choose to mutate different files using an environment variable and a parameterised build. We could make Stryker merge the reports together into one big report. It might also be a good idea to split your project into different modules altogether, but that's not up to me ;)
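The sharding idea above could look roughly like this (a sketch only; `MUTATE_GLOB` is a hypothetical environment variable, not an existing Stryker option, and configs of this era export a function that receives a config object):

```javascript
// stryker.conf.js: hypothetical sharded setup.
// Each parameterised build sets MUTATE_GLOB (an assumed variable, not a
// Stryker feature) to a different slice of the codebase, e.g.:
//   MUTATE_GLOB='src/moduleA/**/*.js' stryker run
module.exports = function (config) {
  config.set({
    testRunner: 'karma',
    reporter: ['progress', 'html'],
    mutate: [process.env.MUTATE_GLOB || 'src/**/*.js']
  });
};
```

The per-shard reports would then still have to be merged by a separate script, since Stryker itself does not merge reports yet.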

You could probably go two ways with this one: just running on changes from that commit, or using a history file like PiTest for Java.

@jmpresso How does it work exactly? Does every developer need to build the history themselves using a full run once? Or is there a centralized store somewhere?

@jmpresso
Contributor

jmpresso commented Sep 5, 2017

Wow, this is cool. Are you using coverage analysis for the running of the tests? If so, is it OK to share some statistics? How much faster is it with/without it? Also, #366 could make a big difference for these kinds of projects.

Not sure exactly what you mean there. It was taking so long to run that I gave up on letting it finish. So currently I only have the Istanbul-type line coverage. I think Karma runs in <2 minutes by itself, and the code coverage is pretty fast too.

I do agree #366 would help as well. The temporary folder was also very large during my run.

What exactly do you mean by "scan times"? Would that be the initial test run, by any chance?

I don't want to speak for him, but I think he means the total execution time of the mutation testing.

Another way of solving this is with sharding. You could create a build per file to be mutated. You could share the same stryker.conf.js file, but choose to mutate different files using an environment variable and a parameterised build. We could make Stryker merge the reports together into one big report. It might also be a good idea to split your project into different modules altogether, but that's not up to me ;)

I'm not totally sure how much that would help. Are you suggesting sharding it and running the shards in parallel? It might help, but it might bog down the build machine. Perhaps we could make it work with SauceLabs. We'd also have to figure out different ports for all the different Karma servers, so it might get pretty messy.

I agree... modularizing absolutely helps, it's just not always possible.

@jmpresso How does it work exactly? Does every developer need to build the history themselves using a full run once? Or is there a centralized store somewhere?

Only the first time using the plugin. From there, it records the results in the history file, which can then get versioned along with the code. I'm honestly not sure if/how it would handle merges and whatnot, though.

FWIW, it mutates in memory also.

@nicojs
Member

nicojs commented Sep 7, 2017

Not sure exactly what you mean there. It was taking so long to run that I gave up on letting it finish. So currently I only have the Istanbul-type line coverage. I think Karma runs in <2 minutes by itself, and the code coverage is pretty fast too.

Wow, 2 minutes. Not sure there is something we can do in that case. Are there slow tests in the mix? Or just, in general, a large application footprint?

If you have slow tests, you could try moving them to a separate test suite (for example: "integration tests"). If you have a big application footprint, it could help to bundle files together. For example: one bundle for all tests and one bundle for your production code. It might be that the majority of the time is spent loading files in the browser. HTTP/1.1 is badly optimized for serving a large number of files.

Are you suggesting sharding it and running the shards in parallel?

No, running the shards one by one, or on multiple build servers.

Only the first time using the plugin. From there, it records the results in the history file, which can then get versioned along with the code.

@jmpresso Just to be sure: you wouldn't check the history file (or files) into source control?

@j-truax

j-truax commented Sep 8, 2017

@nicojs Sorry for the confusion; by "scan times" I meant "execution time". Similar to @jmpresso, the project I was referring to executes 5,000+ tests in 4 minutes using just Karma. However, when executing Stryker with multiple runners, it takes 4+ hours.

I think #366 will help too. However, the biggest benefit seems like it would come from a history-file feature like the one PiTest uses. That way, previously generated mutants' results are used for files not modified since the last build. This history file IS committed to source control and referenced in the Maven config settings in the pom.xml, so the CI system can see it during builds, and mutants generated in unmodified code are not rechecked. Developers can keep this file up to date by running PiTest locally, generating a new pitestHistory.txt, and committing it to source control.

In the example below, if FooClass is not updated on a subsequent commit, PiTest will use its previous results from the pitestHistory.txt file instead of regenerating its mutants and running tests against them.

Example class...

package path.to.src.class;

import path.to.src.class.BarClass;
import path.to.src.class.BazClass;

public class FooClass {
	public BazClass methodFromClass(BarClass barClass) {
		return barClass.doStuff(); // assumes BarClass#doStuff() returns a BazClass
	}
}

Resulting entry in pitestHistory.txt after running PiTest...

<classHistory><id hierarchicalHash="f9bf32"><classId hash="16367410"><name><name>path/to/src/class/FooClass</name></name></classId></id><coverageId>0</coverageId></classHistory>
<result><id><location><clazz><name>path/to/src/class/FooClass</name></clazz><method><name>methodFromClass</name></method><methodDesc>(Lpath/to/src/class/BarClass;)Lpath/to/src/class/BazClass;</methodDesc></location><indexes><int>7</int></indexes><mutator>org.pitest.mutationtest.engine.gregor.mutators.ReturnValsMutator</mutator></id><status numberOfTestsRun="0" status="NO_COVERAGE"><killingTest class="org.pitest.functional.Option$None"/></status></result>

@avassem85
Contributor

It is also described at http://pitest.org/quickstart/incremental_analysis/

@CarlosRosario

I created a fork of Stryker in order to begin working on this feature. After cloning the Stryker project and running npm install, when I try to run npm test in an individual package folder, I see test failures due to no tests being found (I think this may be because the test runner is looking for .js files, but what exists in the different test directories are .ts files).

When I run npm test at the root level, I am also seeing some build failures. I've attached the debug log that is produced. Any ideas why I'm unable to run the tests out of the box, so to speak?

2018-04-01T15_32_51_572Z-debug.log

@simondel
Member Author

simondel commented Apr 1, 2018

@CarlosRosario Thanks for wanting to take a shot at this! Could you try running npm run build from the root of the project first? That should build (and link) every project in the repository. After that you should be able to run npm test for a specific package (or even run the tests in debug mode if you use VS Code).

@nicojs
Member

nicojs commented Apr 2, 2018

@simondel that is what @CarlosRosario did (npm test at the root does an npm run build first).

As far as I can tell, it should work. We run this command on Linux and Windows machines to develop and build Stryker. Maybe something was not installed correctly? Could you clear your node_modules and try again?

After running npm run build from the root directory you should be able to run npm test from any child package. Just remember to run npm run build in a child package if you change some code.

If you use VS Code, you can open a child package in it and use F1 + "Run Build Task". That will start the incremental tsc compiler (tsc -w). With the debug pane on the left you can run/debug unit tests or integration tests (located in "test/unit" and "test/integration"). Breakpoints in TS code should work out of the box.

Please contact us on Gitter if you have any problems or questions.

@simondel
Member Author

simondel commented Apr 2, 2018

My bad! I do notice that your fork could be using TypeScript 2.8, which we don't support yet. Although the error is different from the one we got with TypeScript 2.8. You could try cloning the core repo instead of your fork to see if you can build/test that.

@simondel simondel removed this from Features in Backlog Apr 18, 2019
@Djaler
Contributor

Djaler commented Jun 25, 2019

Hey, any news/plans about this?

@jmpresso
Contributor

I don't think Carlos is working on it any longer. I know one of us made a fork to work on it and made some progress, at least on creating the history file, but then it got put on hold. I think there is also still a lot of work needed to migrate it to the new version of Stryker and finish the implementation. It's possible it will get picked back up at some point, but I'm not sure. I know we have also been pushing to start using faster test runners like Jest over Karma, to try to reduce the need.

TL;DR: unless the Stryker team is working on it, probably not, as those of us here who have been discussing it haven't made significant progress on it.

@nicojs
Member

nicojs commented Jul 1, 2019

We're not actively working on it. We've also decided to start implementing #1514, since that will give us a lossless performance boost. This might be something we would consider doing after that.

@stale

stale bot commented Nov 20, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the ☠ stale Marked as stale by the stale bot, will be removed after a certain time. label Nov 20, 2020
@Djaler
Contributor

Djaler commented Nov 20, 2020

Dear bot, don't close this, please

@stale stale bot removed the ☠ stale Marked as stale by the stale bot, will be removed after a certain time. label Nov 20, 2020
@nicojs
Member

nicojs commented Feb 19, 2021

I'm closing this issue in favor of #2753

@nicojs nicojs closed this as completed Feb 19, 2021
Backlog automation moved this from Long term goals to Done Feb 19, 2021
nicojs added a commit that referenced this issue May 11, 2021
Remove `Mutant.range` from the plugin API. Plugin creators should rely on `location` instead (which has `start` and `end` positions). See changes in the TypeScript checker for an example.

The `range` property was a way to identify changed code. It was a tuple defined as `[start: number, end: number]`. As a plugin creator, you had the opportunity to use this instead of `location`. We've chosen to remove it because having two ways to identify a location is redundant, and this will allow us to implement #322 in the future.

BREAKING CHANGE: The `range` property is no longer present on a `mutant`. Note, this is a breaking change for plugin creators only.

Co-authored-by: Simon de Lang <simondelang@gmail.com>