Possible improvements for sniff performance analysis #3784

jrfnl · 2023-03-29T14:56:13Z

As a sniff author, it is not uncommon to receive reports about sniffs/standards being "slow".

To analyze which sniffs could be the culprit/which sniffs to focus on to improve performance, the -vvv command can be used, which at the very end of the debug logs shows a "SNIFF PROCESSING REPORT" with the sniffs which have been triggered during a run and the time each sniff took, ordered from slowest to fastest sniff.

This information is super-useful, however to get a good impression of the performance of various sniffs, the best way to do so is to run the sniffs/standard against a large number of files, which makes the -vvv output completely unworkable as even for a simple run against 4 files, the output is easily over 60.000 lines, with a test run against 300 files, the output is well over 4 million lines...

The current available verbosity levels and their output are defined as such:

-v: file specific token and line counts with running times
-vv: the info from level 1 + verbose tokeniser output
-vvv: the info from level 1 + 2 + token processing output with sniff running times

Now there are a couple of ways I can think of to improve this situation and I'd like to hear if any of these could/would be acceptable.

Add an extra debugging level between the current 1 (-v) and 2 (-vv) levels, which would contain the information from level 1 + the sniff performance report.
This would mean that -vv would now show the performance report, while to get the "old" -vv report (+ performance info), one would now need to use -vvv, and the "old" -vvv would become -vvvv.
While this would be a BC-break, this type of verbosity is generally only ever used by sniff authors/PHPCS tokenizer fix contributors. So considering the very limited public this BC-break would hit, I hope this could be considered acceptable as an enhancement, which could go into a PHPCS 3.x minor.
Add a separate CLI option - outside of the verbosity settings - to allow for the performance information to be printed.
Maybe using the -r for "(sniff) run time information" ?
This would avoid the BC-break and would still make the information available in a more workable manner.
Create a Performance Report type, possibly externally hosted, like in PHPCSDevTools.
To allow for this, some changes would be needed to the File::process() method as it currently only records "time taken by each sniff" to the $this->listenerTimes property if PHP_CODESNIFFER_VERBOSITY > 2, i.e. when the -vvv option is enabled, which would run counter to the problem I'm trying to solve.
It might also make life easier if, similar to the File::getMetrics() method, there would also be a File::getListenerTimes() method.
It would still need some work-arounds as $this->listenerTimes is cumulative and only the Report::generateFileReport() method has access to the File object, while the report would need to be generated in the generate() method which doesn't have access to the File object, but that is doable.

In my opinion, option 1 or 2 would be the preferred options as it would also allow standard maintainers to ask performance issue reporters to run the report, even when they can't share access to the (private) code base on which the performance issue was detected.

Having said that, option 3 would still be an improvement over the current situation.

Opinions ?

/cc @gsherwood @kukulich

P.S.: I'd, of course, be happy to make the changes, but would like some direction before spending significant time on this.

The text was updated successfully, but these errors were encountered:

kukulich · 2023-03-30T06:32:34Z

I like it! And I think I like the option 2 most.

jrfnl · 2023-04-18T17:48:40Z

I discussed this off-GitHub with @gsherwood and the current line of thinking is option 3, but then a PHPCS native version of that report, which would be the best of both worlds.

jrfnl · 2023-04-19T20:04:50Z

Not completely finished yet, but this is what the output for the initial version currently looks like:

jrfnl · 2023-04-19T20:22:00Z

Open question: I'm currently reporting the time in microseconds. Should it be seconds instead ? (to lower cognitive load)

fredden · 2023-04-21T09:27:13Z

Should the report be seconds or milliseconds or microseconds?

I read ms as milliseconds and μs microseconds. If we're using ms then I think it should be milliseconds. Pushing the number through number_format() will help with legibility. I vote for seconds, but milliseconds would also work.

jrfnl · 2023-04-23T15:36:31Z

Pushing the number through number_format() will help with legibility.

Actually it won't as in most cases, we're talking very small numbers, i.e. 0.0000123 seconds type of numbers and number_format() won't help any with that.

jrfnl · 2023-04-24T13:53:34Z

Next iteration:

As part of this iteration, I've started to highlight sniffs which take more than 2x the average run time per sniff in colour (> 2x orange, >3x red). Average runtime per sniff is based on the sniffs being used in the scan.

What do you all think ?

jrfnl · 2023-04-24T20:12:55Z

PR #3810 is now open to address this.

gsherwood · 2023-04-30T22:39:04Z

Sorry I didn't comment sooner. This is excellent.

jrfnl · 2023-12-02T14:10:40Z

Closing as replaced by PHPCSStandards/PHP_CodeSniffer#28

jrfnl mentioned this issue Mar 30, 2023

PHPC 10 takes an extremely long time with certain PHP files PHPCompatibility/PHPCompatibility#1477

Closed

1 task

jrfnl mentioned this issue Apr 24, 2023

New Performance report to measure sniff run time performance #3810

Closed

jrfnl added Type: enhancement Core Component: Reports Focus: Performance labels Oct 25, 2023

This was referenced Nov 8, 2023

Possible improvements for sniff performance analysis PHPCSStandards/PHP_CodeSniffer#28

Closed

New Performance report to measure sniff run time performance PHPCSStandards/PHP_CodeSniffer#60

Merged

jrfnl closed this as not planned Won't fix, can't repro, duplicate, stale Dec 2, 2023

jrfnl closed this as completed in PHPCSStandards/PHP_CodeSniffer#60 Dec 5, 2023

jrfnl mentioned this issue Jan 4, 2024

Proposal: Informing external standards/integrators of important changes/tickets asking for community input PHPCSStandards/PHP_CodeSniffer#227

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Possible improvements for sniff performance analysis #3784

Possible improvements for sniff performance analysis #3784

jrfnl commented Mar 29, 2023 •

edited

kukulich commented Mar 30, 2023 •

edited

jrfnl commented Apr 18, 2023

jrfnl commented Apr 19, 2023

jrfnl commented Apr 19, 2023

fredden commented Apr 21, 2023

jrfnl commented Apr 23, 2023

jrfnl commented Apr 24, 2023 •

edited

jrfnl commented Apr 24, 2023

gsherwood commented Apr 30, 2023

jrfnl commented Dec 2, 2023

Possible improvements for sniff performance analysis #3784

Possible improvements for sniff performance analysis #3784

Comments

jrfnl commented Mar 29, 2023 • edited

kukulich commented Mar 30, 2023 • edited

jrfnl commented Apr 18, 2023

jrfnl commented Apr 19, 2023

jrfnl commented Apr 19, 2023

fredden commented Apr 21, 2023

jrfnl commented Apr 23, 2023

jrfnl commented Apr 24, 2023 • edited

jrfnl commented Apr 24, 2023

gsherwood commented Apr 30, 2023

jrfnl commented Dec 2, 2023

jrfnl commented Mar 29, 2023 •

edited

kukulich commented Mar 30, 2023 •

edited

jrfnl commented Apr 24, 2023 •

edited