Add support for total accumulated process CPU usage #1044

bruceg · 2023-08-14T19:27:56Z

Most, if not all, CPU usage accounting for processes provides values that count from the creation of the process. This total value is useful for a variety of accounting tasks beyond the snapshot value that is currently available in sysinfo. This change adds a fn total_cpu_usage to trait ProcessExt to provide that value.

Note that this has explicit breaking stubs for the FreeBSD and Windows implementations, as I was unclear how best to implement those functions. I would be happy to fill those in to complete this PR, but I would appreciate a little direction on which implementation path is best: storing a new field in struct Process, or computing the value on the fly in fn total_cpu_usage.

GuillaumeGomez · 2023-08-14T19:50:09Z

So for Windows, there is hope, depending on whether there are queries that allow getting this information. For FreeBSD, might be more complicated depending if https://docs.rs/libc/latest/x86_64-unknown-freebsd/libc/struct.kinfo_proc.html has the information you need or not.

Also one thing to check: are you sure that utime and stime never get reset in linux? (and mac too)

bruceg · 2023-08-15T15:18:26Z

> Also one thing to check: are you sure that utime and stime never get reset in linux? (and mac too)

I believe those values are 64-bit counters in the kernel, and so won't reset for a large number of millennia.
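As a rough sanity check on that estimate, the arithmetic can be sketched as follows. This assumes a `u64` tick counter advancing at a fixed number of ticks per second of CPU time; the function name and the tick rate passed to it are illustrative assumptions, not values read from any kernel.

```rust
/// Years of continuous CPU time on one core before a u64 tick counter wraps.
/// `ticks_per_second` is an assumed tick rate supplied by the caller.
fn years_until_wrap(ticks_per_second: u64) -> f64 {
    const SECONDS_PER_YEAR: f64 = 365.25 * 24.0 * 60.0 * 60.0;
    let seconds = u64::MAX as f64 / ticks_per_second as f64;
    seconds / SECONDS_PER_YEAR
}
```

At an assumed 100 ticks per second this comes out to several billion years, so counter wrap-around is not a practical concern.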

bruceg · 2023-08-15T16:41:10Z

> For FreeBSD, might be more complicated depending if https://docs.rs/libc/latest/x86_64-unknown-freebsd/libc/struct.kinfo_proc.html has the information you need or not.

~~The info is available in the (more) portable getrusage system call. What are your thoughts on calling that, either when the struct Process is created or when calling total_accumulated_cpu_usage?~~ Never mind, struct kinfo_proc includes the rusage data, so everything I need is there already.

bruceg · 2023-08-15T23:09:45Z

Would you like me to squash down some of the fixup commits?

GuillaumeGomez · 2023-08-28T11:53:06Z

Hi, just to say I didn't forget about this PR. Still wondering if I want this feature or not. I don't have the need for it myself so it'd be complicated to find out when it's broken or to know what's to be expected when adding support for new platforms.

bruceg · 2023-08-28T16:31:57Z

FWIW I'm not the only user looking for this data (though that user would want other bits as well).

GuillaumeGomez · 2023-08-28T16:52:24Z

I hear that, but it doesn't conflict with the reasons I listed for hesitating. ;)

cederberg · 2023-09-09T14:52:08Z

I saw this PR just now. Might I suggest that the API simply return the accumulated CPU use in seconds instead of a percentage? It should be easy enough to calculate a percentage from CPU-seconds, but the inverse would lose a lot of precision. If at all possible.

Also, traditionally people like to see utime and stime as two separate counters. Although personally, I'd just sum them right up. But for a library it might be better to more closely expose the available data as the OS provides?

Reading the file changes I also cannot help but to note that skipping the percentage calculus would also make the changes smaller... 😀
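To illustrate the point that a percentage is easy to derive from CPU-seconds, here is a minimal sketch. It assumes the caller samples the cumulative CPU time twice and measures the wall-clock time between the samples; the function and parameter names are hypothetical and not part of sysinfo.

```rust
/// Average CPU usage, as a percentage of one core, between two samples of
/// cumulative CPU time. Returns `None` if the inputs cannot yield a ratio.
fn average_cpu_percent(
    cpu_secs_before: f64,
    cpu_secs_after: f64,
    wall_secs_elapsed: f64,
) -> Option<f64> {
    // A non-positive interval or a counter that went backwards is rejected.
    if wall_secs_elapsed <= 0.0 || cpu_secs_after < cpu_secs_before {
        return None;
    }
    Some((cpu_secs_after - cpu_secs_before) / wall_secs_elapsed * 100.0)
}
```

Going the other way, from a stored percentage back to CPU-seconds, would need the exact sampling interval and would inherit any rounding already applied to the percentage.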

GuillaumeGomez · 2023-09-09T19:21:18Z

> I saw this PR just now. Might I suggest that the API simply return the accumulated CPU use in seconds instead of a percentage? It should be easy enough to calculate a percentage from CPU-seconds, but the inverse would lose a lot of precision. If at all possible.

That means we should likely provide an API to get CPU seconds as well separately in addition to getting total CPU percentage. Forcing both codes to exist would likely make it more clear, not sure.

> Also, traditionally people like to see utime and stime as two separate counters. Although personally, I'd just sum them right up. But for a library it might be better to more closely expose the available data as the OS provides?

I think it'd be better to not provide internals too much and just give the total time. Simpler to handle different OSes.

> Reading the file changes I also cannot help but to note that skipping the percentage calculus would also make the changes smaller... 😀

I think it's fine to keep it. Like I said above, keeping total CPU time on one side and total process time on the other will likely make things more clear (simpler I don't know).

Anyway, @bruceg, after thinking about it for quite some time, I think having this feature is ok. What do you think about what @cederberg and I talked about?

bruceg · 2023-09-11T09:03:50Z

Sorry, I'm not entirely clear what "the API" being discussed references. The proposed method returns cumulative CPU seconds, so I think it's already doing what @cederberg is requesting. Am I misunderstanding?

There is one way to reconcile the possible usages, though it is a breaking change. The method could return a new type that has methods to access both values. I don't believe there is a way to make the new type work like the f32 it currently returns, though. This would, however, make it simpler to extend support to separate system and user times.
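A minimal sketch of what such a return type could look like is shown below. The `CpuTime` name, its fields, and its methods are all hypothetical; nothing here is taken from the PR's diff or from sysinfo's API.

```rust
use std::time::Duration;

/// Hypothetical return type carrying user and system time separately.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct CpuTime {
    user: Duration,
    system: Duration,
}

impl CpuTime {
    pub fn new(user: Duration, system: Duration) -> Self {
        Self { user, system }
    }

    /// Time spent executing in user mode.
    pub fn user(&self) -> Duration {
        self.user
    }

    /// Time spent executing in kernel mode on behalf of the process.
    pub fn system(&self) -> Duration {
        self.system
    }

    /// Sum of user and system time.
    pub fn total(&self) -> Duration {
        self.user + self.system
    }

    /// Total as fractional seconds, for callers that want a float.
    pub fn total_secs_f32(&self) -> f32 {
        self.total().as_secs_f32()
    }
}
```

With a wrapper like this, the choice between integer and float units moves to the accessor methods rather than being fixed by the trait method's return type.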

bruceg · 2023-09-11T09:05:28Z

In any case, I'd be happy to do whatever is needed to move the code forward to make it acceptable to all.

GuillaumeGomez · 2023-09-11T13:37:02Z

I didn't double-check after reading:

> Reading the file changes I also cannot help but to note that skipping the percentage calculus would also make the changes smaller... 😀

but I should definitely have as you indeed don't return percentage but the total CPU seconds (in f32, that's where I think @cederberg got confused). So it seems good to me as is, sorry about that.

@cederberg: Does it look like what you want with this clarification?

cederberg · 2023-09-11T15:04:01Z

Right. I was confused by two things:

1. Using f32 instead of u64 as I would've expected.
2. The changes in windows/process.rs, in particular total_accumulated_cpu_usage that seems to perform a calculation of percentage of total system CPU time.

On another note, isn't the hard-coded 100 value for ticks per second in /linux/process.rs defined properly somewhere?

cederberg · 2023-09-11T15:33:59Z

src/freebsd/process.rs

+    // from FreeBSD source /bin/ps/print.c
+    let accum_cpu_usage = (kproc.ki_runtime as f64 / 1000000.0) as f32
+        + kproc.ki_childtime.tv_sec as f32
+        + kproc.ki_childtime.tv_usec as f32 / 1000000.0;


I don't think this is quite right. Here is the relevant code from FreeBSD:

    secs = k->ki_p->ki_runtime / 1000000;
    psecs = k->ki_p->ki_runtime % 1000000;
    if (sumrusage) {
        secs += k->ki_p->ki_childtime.tv_sec;
        psecs += k->ki_p->ki_childtime.tv_usec;
    }

It is clear that child time is only added if sumrusage is true. It is set by the -S switch:

> -S Change the way the process times, namely cputime, systime, and usertime, are calculated by summing all exited children to their parent process.

bruceg · 2023-09-12

Ah, you're right, I thought this needed to account for the child's time, but that is handled separately.
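The calculation discussed in this review thread can be sketched in Rust as below. Plain integers stand in for the `kinfo_proc` fields (`ki_runtime` in microseconds, `ki_childtime` as seconds plus microseconds), and the function name and the `Option` parameter for the child time are assumptions made for illustration only.

```rust
/// Microseconds per second, the divisor used when splitting the runtime.
const MICROS_PER_SEC: u64 = 1_000_000;

/// Accumulated CPU seconds for a process.
/// `runtime_us` stands in for `ki_runtime`; `child_time` is `(tv_sec, tv_usec)`
/// and is only passed when exited children should be summed in (like `ps -S`).
fn accumulated_cpu_secs(runtime_us: u64, child_time: Option<(u64, u64)>) -> f64 {
    let mut secs = runtime_us / MICROS_PER_SEC;
    let mut usecs = runtime_us % MICROS_PER_SEC;
    if let Some((child_sec, child_usec)) = child_time {
        secs += child_sec;
        usecs += child_usec;
    }
    secs as f64 + usecs as f64 / MICROS_PER_SEC as f64
}
```

Passing `None` gives the process's own time only, which is the behaviour the review asks for.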

bruceg · 2023-09-12T12:40:44Z

> 2. The changes in [windows/process.rs](https://github.com/GuillaumeGomez/sysinfo/pull/1044/files#diff-54c5b015ad8152587800fa45b984b4bce88054dda2afc331cf5b4bfc7c9fe2ae), in particular `total_accumulated_cpu_usage` that seems to perform a calculation of percentage of total system CPU time.

Right, that confused me too, but that is essentially a copy of the calculations in compute_cpu_usage without calculating a delta.

> On another note, isn't the hard-coded 100 value for ticks per second in /linux/process.rs defined properly somewhere?

The relevant fields in stat export their data in units of "jiffies" aka "ticks". That tick is defined by HZ which is fixed at 100 (for all arches except for alpha apparently, which effectively doesn't matter at this point).
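For readers unfamiliar with the stat format, here is a minimal sketch of reading the two tick counters and converting them to seconds. It assumes the field layout documented in proc(5), where utime is field 14 and stime is field 15 of `/proc/<pid>/stat`; the function names are hypothetical and the tick rate is passed in by the caller rather than hard-coded.

```rust
/// Extracts `(utime, stime)` in clock ticks from the text of /proc/<pid>/stat.
fn parse_utime_stime(stat: &str) -> Option<(u64, u64)> {
    // Field 2 (the command name) is parenthesised and may itself contain
    // spaces or ')', so parsing resumes after the last ')'.
    let after_comm = &stat[stat.rfind(')')? + 1..];
    let mut fields = after_comm.split_whitespace();
    // The first remaining token is field 3 (state), so field 14 is index 11.
    let utime: u64 = fields.nth(11)?.parse().ok()?;
    let stime: u64 = fields.next()?.parse().ok()?;
    Some((utime, stime))
}

/// Converts a tick count to seconds for a given tick rate.
fn ticks_to_secs(ticks: u64, ticks_per_second: u64) -> f64 {
    ticks as f64 / ticks_per_second as f64
}
```

A caller would sum the two counters and divide by the tick rate, for example `ticks_to_secs(utime + stime, 100)` under the rate assumed in this comment.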

cederberg · 2023-09-13T04:09:44Z

src/windows/process.rs

+                .old_system_user_cpu
+                .saturating_add(self.old_system_sys_cpu) as f32
+            * self.nb_cpus as f32
+    }


Again, I don't think this is right. It should only return old_process_user_cpu + old_process_sys_cpu. The other parts divide by the global CPU time and the number of CPUs, which just isn't right here.
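A minimal sketch of the suggested calculation follows. It assumes the two per-process counters hold raw Windows `FILETIME` values, which count 100-nanosecond intervals; whether the fields in this PR store the values in that form is an assumption, and the function name is hypothetical.

```rust
/// Number of 100-nanosecond FILETIME units in one second.
const FILETIME_UNITS_PER_SEC: u64 = 10_000_000;

/// Total CPU seconds from per-process user and kernel time,
/// both assumed to be in 100-nanosecond units.
fn process_cpu_secs(user_100ns: u64, kernel_100ns: u64) -> f64 {
    user_100ns.saturating_add(kernel_100ns) as f64 / FILETIME_UNITS_PER_SEC as f64
}
```

Neither the system-wide CPU time nor the CPU count appears in this version, because the result is an absolute duration rather than a share of total capacity.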

cederberg · 2023-09-13T05:15:09Z

Could I ask again why f32 is used to return the result instead of u32 or u64?

The source values are all integers. Floats are slow and prone to rounding errors. And if parts of a second is really of interest, we could just as well return milliseconds in a u64, right?

cederberg · 2023-09-13T05:23:17Z

> The relevant fields in stat export their data in units of "jiffies" aka "ticks". That tick is defined by HZ which is fixed at 100 (for all arches except for alpha apparently, which effectively doesn't matter at this point).

You are surely right, but what I'm after is a reference to SystemInfo that already contains this value (fetched via sysconf instead of hard-coded): clock_cycle: sysconf(_SC_CLK_TCK)

bruceg · 2023-09-13T10:41:58Z

> Could I ask again why f32 is used to return the result instead of u32 or u64?
>
> The source values are all integers. Floats are slow and prone to rounding errors. And if parts of a second is really of interest, we could just as well return milliseconds in a u64, right?

f32 was used to mirror the existing interfaces that use the same types and to allow returning a base unit of seconds independent of the underlying calculations. I would actually prefer to use f64 for the rounding/scaling issue, so I would be easily convinced to go that way. I strongly doubt any performance difference would be measurable here, if it even exists.

I will defer to the judgement of @GuillaumeGomez on the correct type/units to use if he has an opinion.
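The rounding concern can be made concrete with a short sketch. An `f32` has a 24-bit significand, so whole seconds stop being exactly representable above 2^24 = 16,777,216 seconds (about 194 days of CPU time), while `f64` stays exact far beyond that. The helper names below are hypothetical.

```rust
/// True if `secs` and `secs + 1` remain distinct after conversion to f32.
fn one_second_step_survives_f32(secs: u64) -> bool {
    (secs as f32) != ((secs + 1) as f32)
}

/// Same check for f64, whose 53-bit significand covers far larger counts.
fn one_second_step_survives_f64(secs: u64) -> bool {
    (secs as f64) != ((secs + 1) as f64)
}
```

Sub-second resolution degrades much earlier than whole seconds do, which is the scaling issue that makes `f64` or an integer count of milliseconds attractive.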

bruceg · 2023-09-13T10:55:19Z

> You are surely right, but what I'm after is a reference to SystemInfo that already contains this value (fetched via sysconf instead of hard-coded): clock_cycle: sysconf(_SC_CLK_TCK)

Ah, I had not seen that sysinfo was already fetching that, thanks for the pointer. I'll use that, even though it will always everywhere be 100 (thus saith Linus).

bruceg · 2023-09-13T11:26:01Z

I'm seeing a bunch of tests failing on CI that I can't explain. In particular, this test on Ubuntu is failing now, but I haven't changed either the test or the code underlying it AFAICT. Any idea what might be going on?

GuillaumeGomez · 2023-09-13T11:54:03Z

The check_processes_cpu_usage test is flaky so you can ignore it. I suppose check_processes_total_accumulated_cpu_usage needs to be fixed though. ;)

> f32 was used to mirror the existing interfaces that use the same types and to allow returning a base unit of seconds independent of the underlying calculations. I would actually prefer to use f64 for the rounding/scaling issue, so I would be easily convinced to go that way. I strongly doubt any performance difference would be measurable here, if it even exists.

I was also surprised you used f32 and not u64. If there is no big reason to use f32, I'd prefer if you used u64 for coherency. Also, what are you referring to when you talk about "the existing interfaces" btw?

> The source values are all integers. Floats are slow and prone to rounding errors. And if parts of a second is really of interest, we could just as well return milliseconds in a u64, right?

About "Floats are slow", they are in fact not that slow. At same size, a float is always slower than an integer, however a smaller sized float is faster than an integer, so f16 is faster than i32, f32 is faster than u64, etc. I should really try to find where I saw these benchmarks but it was very interesting. However in here, performance isn't really relevant I think considering that the impact should be close to non-existent.

bruceg · 2023-09-13T14:55:31Z

> The check_processes_cpu_usage test is flaky so you can ignore it. I suppose check_processes_total_accumulated_cpu_usage needs to be fixed though. ;)

Agreed, I'm looking at them.

> I was also surprised you used f32 and not u64. If there is no big reason to use f32, I'd prefer if you used u64 for coherency. Also, what are you referring to when you talk about "the existing interfaces" btw?

I used f32 mostly because cpu_usage used f32, and the natural base unit for the return value is seconds for which the fractional part is significant. Using u64 is possible but would require scaling it to some arbitrary amount to cover the existing OSes. AFAICT every other interface that returns time is in units of seconds too. Mind you, those others tend to have lower underlying granularity (i.e. uptime).

GuillaumeGomez · 2023-09-13T15:20:11Z

From what I can see, only FreeBSD seems to actually have a significant fractional part. We could keep using f32 (and stop having intermediate f64 too, unless precision is that important?) or we could instead return CPU total time in milliseconds and use u64. If you have an opinion @cederberg here too?
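The milliseconds-in-`u64` alternative could be sketched as follows, using integer arithmetic only. The function name is hypothetical, and the tick rate is a parameter so that the same conversion could serve platforms with different underlying units.

```rust
/// Converts clock ticks to whole milliseconds without using floats.
/// Returns `None` for a zero tick rate or a result that does not fit in u64.
fn ticks_to_millis(ticks: u64, ticks_per_second: u64) -> Option<u64> {
    if ticks_per_second == 0 {
        return None;
    }
    // Widen to u128 so that `ticks * 1000` cannot overflow.
    let millis = (ticks as u128 * 1000) / ticks_per_second as u128;
    if millis > u64::MAX as u128 {
        None
    } else {
        Some(millis as u64)
    }
}
```

The trade-off is that the millisecond scale is itself an arbitrary choice: sources with microsecond granularity lose the finer digits, while sources with coarser ticks gain nothing from it.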

bruceg · 2023-09-20T20:19:23Z

So, in the most recent commit, I added a OnceLock to store the global clock tick scaling data for MacOS, needed to turn the process timing values into seconds, since this is constant once it is generated. However, the std version of that type is not stabilized yet in the MSRV for this project. How would you prefer I move forward on this?

1. Pass down timing info into the Process::new functions and store it in the struct for each process (FWIW I started down that path, and the parameter ended up needing to be added to quite a few functions),
2. Re-calculate the scaling data in Process::new, causing repeated calls to mach_timebase_info,
3. Add a dependency on once_cell to get the stable version of this type, or
4. Implement the OnceLock manually using Once and an unsafe block?

Note that some of this is applicable to the Linux side as well, but there we are only fetching the clock tick value through sysconf, which is less onerous.

Edit: Technically there is a 5th option, which is to bump the MSRV to 1.70.0, but I figure that's a bridge too far given that it becomes a breaking change for some users.
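For context, the `OnceLock` approach described in this comment might look roughly like the sketch below, which needs Rust 1.70 or later. The `query_timebase` function is a hypothetical stand-in for the real platform query (such as `mach_timebase_info`), its return value is a placeholder, and the counter exists only to show that the query runs once.

```rust
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::OnceLock;

/// Counts how often the stand-in query runs; for demonstration only.
static QUERY_COUNT: AtomicUsize = AtomicUsize::new(0);

/// Hypothetical stand-in for an expensive query whose result never changes.
fn query_timebase() -> (u32, u32) {
    QUERY_COUNT.fetch_add(1, Ordering::SeqCst);
    (125, 3) // placeholder numerator and denominator
}

/// Returns the cached timebase, running the query on first use only.
fn timebase() -> (u32, u32) {
    static TIMEBASE: OnceLock<(u32, u32)> = OnceLock::new();
    *TIMEBASE.get_or_init(query_timebase)
}
```

Every call after the first returns the stored value, at the cost of process-wide static state.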

GuillaumeGomez · 2023-09-21T09:04:09Z

> Edit: Technically there is a 5th option, which is to bump the MSRV to 1.70.0, but I figure that's a bridge too far given that it becomes a breaking change for some users.

Next release will be a major one so it's definitely not a concern.

> So, in the most recent commit, I added a OnceLock to store the global clock tick scaling data for MacOS, needed to turn the process timing values into seconds, since this is constant once it is generated. However, the std version of that type is not stabilized yet in the MSRV for this project. How would you prefer I move forward on this?

I had equivalent "static" info. For example in freebsd. I don't think using a static is a good tactic here. Why not creating a struct which query this information on initialization and then keep it around? If the object is destroyed/recreated, then we re-query the information, it's fine. In the documentation it's explicitly written that the System object should be instantiated only once and then should be kept around. Did I miss something obvious maybe?

bruceg · 2023-09-21T17:04:48Z

> Why not creating a struct which query this information on initialization and then keep it around? If the object is destroyed/recreated, then we re-query the information, it's fine. In the documentation it's explicitly written that the System object should be instantiated only once and then should be kept around. Did I miss something obvious maybe?

I can set that up in the System object and pass it down. ~~It just ends up adding a parameter to quite a few functions in the chain, but it's all internal so not a big deal.~~ I'll make that change.

Edit: Handling the conversion in update_process instead of when retrieving the value ended up simplifying the code.
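The shape of that change can be sketched as below: the scaling data is queried once when the top-level object is built, kept in that object, and applied while updating each process. The `Timebase`, `Process`, and `System` types here are simplified, hypothetical stand-ins and not sysinfo's real definitions; the `query` closure replaces the actual platform call.

```rust
use std::collections::HashMap;

/// Ratio converting platform time units to nanoseconds (hypothetical type).
#[derive(Debug, Clone, Copy)]
struct Timebase {
    numer: u64,
    denom: u64,
}

/// Simplified process record holding the already converted value.
struct Process {
    total_cpu_secs: f64,
}

/// Simplified stand-in for the object that owns the timebase.
struct System {
    timebase: Timebase,
    processes: HashMap<u32, Process>,
}

impl System {
    /// Queries the timebase once; `query` stands in for the platform call.
    fn new(query: impl FnOnce() -> Timebase) -> Self {
        Self { timebase: query(), processes: HashMap::new() }
    }

    /// Converts raw ticks to seconds at update time and stores the result.
    fn update_process(&mut self, pid: u32, raw_ticks: u64) {
        // Guard against a zero denominator from a misbehaving query.
        let denom = self.timebase.denom.max(1) as u128;
        let nanos = raw_ticks as u128 * self.timebase.numer as u128 / denom;
        let entry = self
            .processes
            .entry(pid)
            .or_insert(Process { total_cpu_secs: 0.0 });
        entry.total_cpu_secs = nanos as f64 / 1e9;
    }

    /// Reads back the stored value; no conversion is needed here.
    fn total_cpu_secs(&self, pid: u32) -> Option<f64> {
        self.processes.get(&pid).map(|p| p.total_cpu_secs)
    }
}
```

Doing the conversion in the update step means the getter needs no access to the timebase, which is consistent with the simplification mentioned in the edit note.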

GuillaumeGomez · 2023-10-08T19:41:33Z

Hi, sorry forgot a bit about it. It seems like some test is failing on freebsd. Do you need help?

bruceg · 2023-10-17T19:58:22Z

> Hi, sorry forgot a bit about it. It seems like some test is failing on freebsd. Do you need help?

Yes, I think I will. We opted to take another path to provide this value since we only need to support Linux and MacOS at this point, and so the motivating factor necessitating this change is effectively moot. We may come back to this if we ever need to support Windows, but that's a long ways down the road.

GuillaumeGomez · 2023-10-17T20:00:31Z

Noted. I'll try to come back to this when I have enough time (hopefully in a few weeks...) then if you didn't before.

GuillaumeGomez reviewed Aug 14, 2023

src/linux/process.rs (outdated)

GuillaumeGomez reviewed Aug 14, 2023

src/traits.rs (outdated)

Change method name to total_accumulated_cpu_usage

cf86f66

GuillaumeGomez reviewed Aug 14, 2023

src/windows/process.rs (outdated)

GuillaumeGomez reviewed Aug 14, 2023

src/freebsd/process.rs (outdated)

Fix the Linux HZ constant mistake

2a888c9

Fix MacOS compilation typo

0819473

bruceg force-pushed the total-cpu-usage branch from f17aae6 to ae6505d Compare August 15, 2023 20:43

bruceg added 2 commits August 15, 2023 16:07

Implement Windows calculation

3586f8b

Fill in the FreeBSD implementation

46b9b74

bruceg force-pushed the total-cpu-usage branch from ae6505d to 46b9b74 Compare August 15, 2023 22:12

bruceg marked this pull request as ready for review August 15, 2023 23:08

bruceg requested a review from GuillaumeGomez August 15, 2023 23:10

Add test for new method

668c7f9

bruceg force-pushed the total-cpu-usage branch from 79c0176 to 668c7f9 Compare August 28, 2023 17:57

GuillaumeGomez mentioned this pull request Sep 9, 2023

Is it possible to get cumulative process execution time (utime + stime)? #1061

Open

cederberg reviewed Sep 11, 2023

bruceg added 3 commits September 12, 2023 14:49

Update the FreeBSD calculation to not include child usage

61835a1

Improve test asserts to pin down failures

2cd0834

Fix too-new format usage

1bc46bd

cederberg reviewed Sep 13, 2023

Use the Linux clock tick value retrieved from sysconf

9174a33

bruceg added 4 commits September 15, 2023 13:49

Use actual times when calculating the max delta to avoid false positives

0018374

Fixes for MacOS

c6b0456

Fix Windows computations

21ef386

Merge remote-tracking branch 'upstream/master' into total-cpu-usage

9af939f

bruceg added 3 commits September 21, 2023 12:26

Drop the static timebase info and use the copy in struct System

2eaa4dd

Clippy fix

61bdf99

Fix Windows usage and improve test

8e14b45
