Add workflow run times to ARC metrics #2359

kkaresz-tw · 2023-03-03T11:25:59Z

What would you like added?

In addition to the labels discussed under #2176 and implemented in #2218 and #2225, it would also be good to see workflow-related metrics reported by the metrics server. These could be collected from the workflow_run events.

In addition to:

github_workflow_job_run_duration_seconds_bucket
github_workflow_job_run_duration_seconds_count
github_workflow_job_run_duration_seconds_sum

github_workflow_jobs_started_total
github_workflow_jobs_completed_total

the following would also be useful to see:

github_workflow_run_duration_seconds_bucket
github_workflow_run_duration_seconds_count
github_workflow_run_duration_seconds_sum

github_workflows_started_total
github_workflows_completed_total

I might try to open a PR for this unless someone quicker beats me to it.
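
To make the request concrete, here is a minimal sketch of how the proposed counters and histogram could be derived from workflow_run events. It is plain Python rather than ARC's actual code, and it rests on several assumptions: the bucket bounds are made up, "in_progress" is treated as the "started" signal, and a run's duration is taken as `updated_at` minus `run_started_at` on the "completed" event. The `WorkflowRunMetrics` class and its method names are hypothetical.

```python
from bisect import bisect_left
from datetime import datetime

# Hypothetical bucket upper bounds in seconds; not taken from ARC.
BUCKETS = [60.0, 300.0, 900.0, 1800.0, 3600.0]


def parse_ts(value: str) -> datetime:
    """Parse an ISO 8601 timestamp such as 2023-03-03T11:25:59Z."""
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


class WorkflowRunMetrics:
    """Hypothetical in-memory stand-in for the proposed metrics."""

    def __init__(self, buckets=BUCKETS):
        self.buckets = list(buckets)
        # One non-cumulative slot per bucket, plus a final +Inf slot.
        self._slots = [0] * (len(self.buckets) + 1)
        self.duration_sum = 0.0    # github_workflow_run_duration_seconds_sum
        self.duration_count = 0    # github_workflow_run_duration_seconds_count
        self.started_total = 0     # github_workflows_started_total
        self.completed_total = 0   # github_workflows_completed_total

    def handle(self, event: dict) -> None:
        """Update the metrics from one workflow_run webhook payload."""
        run = event["workflow_run"]
        if event["action"] == "in_progress":  # assumption: counts as "started"
            self.started_total += 1
        elif event["action"] == "completed":
            self.completed_total += 1
            # Assumption: updated_at on the completed event marks the end of the run.
            delta = parse_ts(run["updated_at"]) - parse_ts(run["run_started_at"])
            seconds = delta.total_seconds()
            # bisect_left picks the first bucket whose upper bound is >= seconds.
            self._slots[bisect_left(self.buckets, seconds)] += 1
            self.duration_sum += seconds
            self.duration_count += 1

    def cumulative_buckets(self) -> dict:
        """Return cumulative counts per upper bound, as in a Prometheus histogram."""
        out, running = {}, 0
        for bound, n in zip(self.buckets + [float("inf")], self._slots):
            running += n
            out[bound] = running
        return out
```

Feeding it one "in_progress" and one "completed" event for the same run would bump both counters and record a single observation in the histogram. A real implementation would more likely register these with a Prometheus client library next to the existing job metrics.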

Why is this needed?

The sum of the job run times of a given workflow isn't equal to the actual time the workflow took to finish, because of parallel jobs, queuing of the jobs, etc.
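
A small worked example of that gap, using made-up numbers: four hypothetical jobs in one workflow, with start and end offsets in seconds measured from the moment the workflow run started.

```python
# Hypothetical jobs of one workflow run: (name, start, end),
# in seconds since the workflow run started.
JOBS = [
    ("lint", 0, 120),
    ("unit-tests", 0, 600),
    ("integration-tests", 30, 900),  # waited 30 s for a runner
    ("deploy", 930, 1050),           # runs after the others, plus 30 s of queuing
]


def sum_of_job_times(jobs):
    """Total of the individual job durations (what per-job metrics add up to)."""
    return sum(end - start for _, start, end in jobs)


def wall_clock_time(jobs):
    """Time from the first job starting to the last job finishing."""
    return max(end for _, _, end in jobs) - min(start for _, start, _ in jobs)


print(sum_of_job_times(JOBS))  # 1710
print(wall_clock_time(JOBS))   # 1050
```

Here the job durations add up to 1710 seconds, while the engineer only waited 1050 seconds: parallel jobs make the sum overstate the wait, and queuing time between jobs is not captured by job durations at all.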

If I wanted to measure how long engineers in our organisation need to wait for CI, workflow run times make more sense for us. If our CI team improved a shared or required workflow by replacing or rewriting a job or an action, or changed anything around the (self-hosted) infrastructure, e.g. using larger nodes or changing the autoscaling, we would like to measure what the impact of those changes was, if any.

Also, if we wanted to feed into our business metrics by measuring the time a PR took from start to finish, including how much of that time CI took, workflow run times would be better than individual job run times.

Additional context

In my organisation we're working on our own version of workflow and job metrics, but it has its own issues and adds toil for the team, which could be dropped if ARC provided these numbers out of the box.

github-actions · 2023-03-03T11:26:46Z

Hello! Thank you for filing an issue.

The maintainers will triage your issue shortly.

In the meantime, please take a look at the troubleshooting guide for bug reports.

If this is a feature request, please review our contribution guidelines.

mumoshu · 2023-03-06T23:09:10Z

Hey @kkaresz-tw!

> If I wanted to measure how long engineers in our organization need to wait for CI, workflow run times make more sense for us

Great. That does make sense to me!

kkaresz-tw added the "enhancement" (New feature or request) and "needs triage" (Requires review from the maintainers) labels on Mar 3, 2023

mumoshu mentioned this issue on Mar 6, 2023 in "Add more labels to metrics-server metrics" #2176 (Closed)
