Add better verbosity to the etcdctl endpoint status output that includes percentage of DB used (to illustrate Defrag rate) #17872

Scotchman0 · 2024-04-24T16:37:14Z

What would you like to be added?

I have opened a PR with the specific changes: #17871

Why is this needed?

Adding the output from OMC which grants visibility of the size of the DB, how much is in use, and then providing the difference as a percentage for quick visual on storage impact on the cluster. A minor change but one that makes it much easier to illustrate where storage is going.

Current release:

$ etcdctl endpoint status -w table

sh-4.4# etcdctl endpoint status -w table
+----------------------------+------------------+---------+---------+-----------+------------+-----------+------------+--------------------+--------+
|          ENDPOINT          |        ID        | VERSION | DB SIZE | IS LEADER | IS LEARNER | RAFT TERM | RAFT INDEX | RAFT APPLIED INDEX | ERRORS |
+----------------------------+------------------+---------+---------+-----------+------------+-----------+------------+--------------------+--------+
| https://xxx.xxx.xxx:2379 | 1fc63d8d0eb5fa7b |  3.5.9 |  6.7 GB |      false |      false |         5854 |    4073918328 |            4073918328 |        |
| https://xxx.xxx.xxx:2379 | 3bd2d35c028a6a2f |  3.5.9 |  6.8 GB |     true |      false |         5854 |    4073918329 |            4073918329 |        |
| https://xxx.xxx.xxx:2379 | 41cbbc084d60007d |  3.5.9 |  6.8 GB |     false |      false |         5854 |    4073918329 |            4073918329 |        |
+----------------------------+------------------+---------+---------+-----------+------------+-----------+------------+--------------------+--------+

proposed change:
$ etcdctl endpoint status -w table

+-----------------------------+------------------+---------+----------------+----------+-----------+------------+-----------+------------+--------------------+--------+
|          ENDPOINT           |        ID        | VERSION | DB SIZE/IN USE | NOT USED | IS LEADER | IS LEARNER | RAFT TERM | RAFT INDEX | RAFT APPLIED INDEX | ERRORS |
+-----------------------------+------------------+---------+----------------+----------+-----------+------------+-----------+------------+--------------------+--------+
| https://xxx.xxx.xxx:2379 | 1fc63d8d0eb5fa7b | 3.5.9   | 6.7 GB/5.6 GB  | 18%      | false     | false      |      5853 | 4073918326 |         4073918326 |        |
| https://xxx.xxx.xxx:2379 | 3bd2d35c028a6a2f | 3.5.9   | 6.8 GB/5.6 GB  | 19%      | true      | false      |      5853 | 4073918327 |         4073918327 |        |
| https://xxx.xxx.xxx:2379 | 41cbbc084d60007d | 3.5.9   | 6.8 GB/5.6 GB  | 18%      | false     | false      |      5853 | 4073918330 |         4073918330 |        |
+-----------------------------+------------------+---------+----------------+----------+-----------+------------+-----------+------------+--------------------+--------+

This output is available in OMC (used for analysis of existing etcd databases running on openshift presently) and visual representation is not available to customers as a default, when it could be.

The text was updated successfully, but these errors were encountered:

ivanvc · 2024-04-24T20:42:38Z

@Scotchman0, it may be helpful if you can add the current output to compare to your suggestion.

/assign @Scotchman0

ivanvc · 2024-04-24T20:52:08Z

I like the idea, but after seeing the implementation, I wonder if it would be better to add more columns rather than reusing/multi-purposing the current column. I defer to @ahrtr / @serathius with their point of view.

Scotchman0 · 2024-04-24T20:54:24Z

updated the initial with the comparative of what we've got now versus proposed change - I'm open to separating by column; fairly easy to make those a separate block instead of joining.

Scotchman0 · 2024-04-24T21:41:08Z

alternative implementation with separate columns for comparison might look like the following if it's desirable to segment instead of conjoin an output block:

+-----------------------------+------------------+---------+----------------+----------+-----------+------------+-----------+------------+--------------------+--------+
|          ENDPOINT           |        ID        | VERSION | DB SIZE | IN USE | NOT USED | IS LEADER | IS LEARNER | RAFT TERM | RAFT INDEX | RAFT APPLIED INDEX | ERRORS |
+-----------------------------+------------------+---------+----------------+----------+-----------+------------+-----------+------------+--------------------+--------+
| https://xxx.xxx.xxx:2379    | 1fc63d8d0eb5fa7b | 3.5.9   | 6.7 GB | 5.6 GB  | 18%      | false     | false      |      5853 | 4073918326 |         4073918326 |        |
| https://xxx.xxx.xxx:2379    | 3bd2d35c028a6a2f | 3.5.9   | 6.8 GB | 5.6 GB  | 19%      | true      | false      |      5853 | 4073918327 |         4073918327 |        |
| https://xxx.xxx.xxx:2379    | 41cbbc084d60007d | 3.5.9   | 6.8 GB | 5.6 GB  | 18%      | false     | false      |      5853 | 4073918330 |         4073918330 |        |
+-----------------------------+------------------+---------+----------------+----------+-----------+------------+-----------+------------+--------------------+--------+

serathius · 2024-04-25T08:18:54Z

Would it be easier to put the storage information as usage 5.6/6.7 GB (18%) ?

I would be also interested if we could quota somewhere, not sure it's accessible via API.

ahrtr · 2024-04-25T09:47:13Z

I agree it's a minor enhancement, but on other hand, it's minor breaking on the user experience. I like the idea of adding a separate new column.

I wonder if it would be better to add more columns rather than reusing/multi-purposing the current column

Makes sense to me.

I would be also interested if we could quota somewhere, not sure it's accessible via API.

Not sure I got the point. Do you mean that etcdserver also returns storage quota (e.g. 2GiB) to client when users run etcdctl endpoint status? If yes, I definitely support it. Once it's supported, then we can remove --etcd-storage-quota-bytes from the etcd-defrag tool :)

tjungblu · 2024-04-25T14:24:58Z

I got a few minutes today to just pull the config through:
#17877

will add more tests in a bit, you guys can have a look in the meantime 👍

Scotchman0 · 2024-04-25T22:15:17Z

I went ahead and modified the PR #17871 to include separated columns for each output + a new block for Quota pending #17877 approval/merge to reference the new quota block: might look like the below:

+-----------------------------+------------------+---------+----------------+----------+-----------+------------+-----------+------------+--------------------+--------+
|          ENDPOINT           |        ID        | VERSION | DB SIZE | IN USE | NOT USED | QUOTA | IS LEADER | IS LEARNER | RAFT TERM | RAFT INDEX | RAFT APPLIED INDEX | ERRORS |
+-----------------------------+------------------+---------+----------------+----------+-----------+------------+-----------+------------+--------------------+--------+
| https://xxx.xxx.xxx:2379    | 1fc63d8d0eb5fa7b | 3.5.9   | 6.7 GB | 5.6 GB  | 18%      | ??    | false     | false      |      5853 | 4073918326 |         4073918326 |        |
| https://xxx.xxx.xxx:2379    | 3bd2d35c028a6a2f | 3.5.9   | 6.8 GB | 5.6 GB  | 19%      | ??    | true      | false      |      5853 | 4073918327 |         4073918327 |        |
| https://xxx.xxx.xxx:2379    | 41cbbc084d60007d | 3.5.9   | 6.8 GB | 5.6 GB  | 18%      | ??    | false     | false      |      5853 | 4073918330 |         4073918330 |        |
+-----------------------------+------------------+---------+----------------+----------+-----------+------------+-----------+------------+--------------------+---------

serathius · 2024-04-26T06:53:19Z

I agree it's a minor enhancement, but on other hand, it's minor breaking on the user experience.

Human readable table is not an API. We don't guarantee column names or contents. It should be different to other formats like json, but table is not parsable and it should not be so.

ahrtr · 2024-04-26T09:04:57Z

Human readable table is not an API.

API != user experience. API is part of user experience. Since it doesn't change the json format which is usually parsed by program, so it's the reason why I call it a minor breaking.

Scotchman0 added the type/feature label Apr 24, 2024

k8s-ci-robot assigned Scotchman0 Apr 24, 2024

ivanvc added the area/etcdctl label Apr 24, 2024

ahrtr mentioned this issue Apr 25, 2024

add quota to endpoint status response #17877

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add better verbosity to the etcdctl endpoint status output that includes percentage of DB used (to illustrate Defrag rate) #17872

Add better verbosity to the etcdctl endpoint status output that includes percentage of DB used (to illustrate Defrag rate) #17872

Scotchman0 commented Apr 24, 2024 •

edited

ivanvc commented Apr 24, 2024

ivanvc commented Apr 24, 2024

Scotchman0 commented Apr 24, 2024

Scotchman0 commented Apr 24, 2024 •

edited

serathius commented Apr 25, 2024 •

edited

ahrtr commented Apr 25, 2024

tjungblu commented Apr 25, 2024

Scotchman0 commented Apr 25, 2024 •

edited

serathius commented Apr 26, 2024 •

edited

ahrtr commented Apr 26, 2024

Add better verbosity to the etcdctl endpoint status output that includes percentage of DB used (to illustrate Defrag rate) #17872

Add better verbosity to the etcdctl endpoint status output that includes percentage of DB used (to illustrate Defrag rate) #17872

Comments

Scotchman0 commented Apr 24, 2024 • edited

What would you like to be added?

Why is this needed?

ivanvc commented Apr 24, 2024

ivanvc commented Apr 24, 2024

Scotchman0 commented Apr 24, 2024

Scotchman0 commented Apr 24, 2024 • edited

serathius commented Apr 25, 2024 • edited

ahrtr commented Apr 25, 2024

tjungblu commented Apr 25, 2024

Scotchman0 commented Apr 25, 2024 • edited

serathius commented Apr 26, 2024 • edited

ahrtr commented Apr 26, 2024

Scotchman0 commented Apr 24, 2024 •

edited

Scotchman0 commented Apr 24, 2024 •

edited

serathius commented Apr 25, 2024 •

edited

Scotchman0 commented Apr 25, 2024 •

edited

serathius commented Apr 26, 2024 •

edited