
[MNT] remove coverage reporting and pytest-cov from PR CI #6363

Draft · wants to merge 5 commits into base: main

Conversation

@fkiraly (Collaborator) commented Apr 29, 2024:

This PR removes the generation of coverage reports and the installation and use of pytest-cov from standard CI. It also removes the (unreliable) coverage badge from the README.

Reasons:

@fkiraly added the labels maintenance (Continuous integration, unit testing & package distribution) and module:tests (test framework functionality - only framework, excl specific tests) on Apr 29, 2024
@fkiraly (Collaborator, Author) commented May 1, 2024:

Anecdotal, but looks like this leads to substantial runtime improvements.

Before: (screenshot of CI run times)

After: (screenshot of CI run times)

@yarnabrina (Collaborator) commented:

@fkiraly I want to be cautious about this. Can we test these two things:

  1. What happens if, instead of removing coverage completely, we only skip the xml and html reports? I believe a base report (.coverage) always gets generated, and these two run separately.
  2. If it is removed completely (assuming only for CI runs in PRs), how will the coverage report appear in the README (after merging to main)? If it shows missing or 0% or similar, that will be misleading. To test this, maybe you can edit the link in the README on this branch without actually merging.

My caution is mainly because it is highly counterintuitive to me that coverage would affect timing this much. It is more than a 3-4x difference in your screenshots, and if that were the general effect of pytest-cov, users would have detected it long ago. It is a very popular and standard tool, so I am really wondering whether we are missing something else (though I don't have any alternative ideas yet).

@fkiraly (Collaborator, Author) commented May 1, 2024:

> What happens if, instead of removing coverage completely, we only skip the xml and html reports? I believe a base report (.coverage) always gets generated, and these two run separately.

According to the profiler, these parts are indeed what creates the overhead.

How would you turn these off separately? Can you help? Would that mean removing the `--cov-report=html` etc. flags, but keeping `--cov`?
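
For illustration, the distinction would look roughly like this in a generic pytest-cov invocation (flags as documented by pytest-cov; the target path is just an example, not sktime's exact CI command):

```bash
# full coverage run: collects coverage data AND writes xml/html reports
pytest --cov=sktime --cov-report=xml --cov-report=html

# coverage collection only: the .coverage data file is still written,
# but the (potentially slow) report generation is skipped
pytest --cov=sktime --cov-report=

# no coverage at all (what this PR originally did)
pytest
```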

> how will the coverage report appear in the README

This PR also removes the badge from the README, because it is misleading anyway, with or without this PR.

We should find a way to display genuine coverage in the README - I would consider that a separate issue (namely, #5090), which would then include adding the correct coverage display to the README.

@fkiraly (Collaborator, Author) commented May 1, 2024:

> so I am really wondering whether we are missing something else (though I don't have any alternative ideas yet).

So am I.
A wild guess is that we have some runaway import chains in the style of #6355, which are causing the long runtimes.

Or perhaps cause and effect are hard to disentangle in general?

@yarnabrina (Collaborator) commented:

> How would you turn these off separately? Can you help? Would that mean removing the `--cov-report=html` etc. flags, but keeping `--cov`?

Yes, exactly that. Let's see what happens.

> This PR also removes the badge from the README, because it is misleading anyway, with or without this PR.

I think if the README shows 0% or similar, it may give potential users/contributors the negative impression that this framework is untested (e.g., I know I would feel the same about a new tool).

@fkiraly (Collaborator, Author) commented May 2, 2024:

OK - I've now added it back in the test-nosoftdeps-full job and in pyproject.toml, to see what happens.

@yarnabrina (Collaborator) commented:

Only 5 jobs got triggered, not a single testing job! How did it run everything earlier?

@fkiraly (Collaborator, Author) commented May 2, 2024:

> Only 5 jobs got triggered, not a single testing job! How did it run everything earlier?

I see - I think I understand why there is a difference.

Previously, pytest-cov was removed from pyproject.toml, and that change triggered "test all". Now that we have added it back, pyproject.toml is no longer modified, so there is no trigger for testing anything anymore.
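
As an aside, the general idea behind this kind of change-based triggering can be sketched as follows; this is a hypothetical illustration only, not sktime's actual run_test_for_class logic or CI configuration:

```bash
# hypothetical sketch: decide what to test based on which files a PR touches
changed=$(git diff --name-only origin/main...HEAD)

if echo "$changed" | grep -qx "pyproject.toml"; then
    # a dependency/config change could affect everything -> run the full suite
    pytest sktime
else
    # otherwise run only the test modules the PR touched, if any
    changed_tests=$(echo "$changed" | grep "tests/test_.*\.py$" || true)
    if [ -n "$changed_tests" ]; then
        pytest $changed_tests
    fi
fi
```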

@yarnabrina (Collaborator) commented:

https://github.com/sktime/sktime/actions/runs/8940323391

I triggered a manual test all workflow on this branch for debugging.

@fkiraly (Collaborator, Author) commented May 3, 2024:

Thanks.

Something is taking hours again - how do we find out which estimator it gets stuck on?

@yarnabrina (Collaborator) commented:

I am not aware of a better solution than going into verbose mode.
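
For reference, "verbose mode" and related diagnostics could look roughly like this (generic pytest flags; the target path and timeout value are illustrative, and --timeout requires the pytest-timeout plugin):

```bash
# -v prints one line per test as it completes, so a hang shows up
# right after the last completed test in the CI log
pytest -v sktime/forecasting

# if the pytest-timeout plugin is installed, a hard per-test limit
# turns a silent hang into a named failure
pytest --timeout=600 sktime/forecasting

# --durations=20 lists the slowest tests after the run, to spot tests
# that are slow rather than actually stuck
pytest --durations=20 sktime/forecasting
```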

By the way, have you seen the failures? It seems every single "other" run failed with this:

FAILED sktime/tests/tests/test_test_utils.py::test_run_test_for_class - AssertionError: assert 'True_run_always' in ['True_pyproject_change', 'True_changed_class', 'True_changed_tests']

@fkiraly (Collaborator, Author) commented May 3, 2024:

> By the way, have you seen the failures? It seems every single "other" run failed with this:

Thanks for pointing this out - this is a bug in a test I added to make sure we test the run_test_for_class utility.

The bug surfaces only in the test_all workflow, which hits a certain combination of conditions. Fix here: #6383

@yarnabrina (Collaborator) commented:

I checked the other jobs, and so far there are no timeout failures. Only one module job failed, and it's for forecasting:

FAILED sktime/forecasting/model_evaluation/tests/test_evaluate.py::test_evaluate_common_configs[backend8-scoring1-refit-1-10-fh5-ExpandingWindowSplitter] - OverflowError: Python int too large to convert to C long

Ref. https://github.com/sktime/sktime/actions/runs/8940323391/job/24558260007#step:3:6594

Any idea whether it's sporadic? We'll probably know from the random seed diagnostic.

(FYI @benHeid)

@fkiraly (Collaborator, Author) commented May 3, 2024:

> FAILED sktime/forecasting/model_evaluation/tests/test_evaluate.py::test_evaluate_common_configs[backend8-scoring1-refit-1-10-fh5-ExpandingWindowSplitter] - OverflowError: Python int too large to convert to C long

This is definitely a new one - I have not seen this before.

However, there have been failures in test_evaluate_common_configs in ancient times (#1194), but these seem unrelated?

> We'll probably know from the random seed diagnostic.

Probably not, as that diagnostic does not add random seeds except in TestAllForecasters, and test_evaluate_common_configs lives elsewhere.
