Kubernetes readiness probe endpoint returning 404 #22562

chadlwilson · 2020-07-25T06:33:11Z

There appears to be some change in behaviour for the Kubernetes-oriented readiness group endpoint on 2.3.2 compared to 2.3.1.

For a service that has no external dependencies (and only readinessState in the health group), the /actuator/health/readiness endpoint is returning a 404.

Configuration we are using:

management.server.port=9083
management.health.probes.enabled=true
management.endpoints.enabled-by-default=false
management.endpoint.info.enabled=true
management.endpoint.health.enabled=true
management.endpoint.health.show-details=always
management.endpoint.health.group.liveness.include=livenessState,diskSpace,refreshScope
management.endpoint.health.group.readiness.include=readinessState
management.endpoint.health.group.liveness.show-details=always
management.endpoint.health.group.readiness.show-details=always
management.endpoints.web.exposure.include=health

Expected Behaviour
We expect this to just return 200 with { "status": "UP" }

Actual Behaviour

$ http http://localhost:9083/actuator/health/readiness
HTTP/1.1 404 Not Found

Full health call:

$ http http://localhost:9083/actuator/health
HTTP/1.1 200 OK
Connection: keep-alive
Content-Type: application/json
Date: Sat, 25 Jul 2020 06:27:55 GMT
Transfer-Encoding: chunked

{
    "components": {
        "discoveryComposite": {
            "components": {
                "discoveryClient": {
                    "description": "Discovery Client not initialized",
                    "status": "UNKNOWN"
                }
            },
            "description": "Discovery Client not initialized",
            "status": "UNKNOWN"
        },
        "diskSpace": {
            "details": {
                "exists": true,
                "free": 287311962112,
                "threshold": 10485760,
                "total": 499963174912
            },
            "status": "UP"
        },
        "livenessStateProbeIndicator": {
            "status": "UP"
        },
        "ping": {
            "status": "UP"
        },
        "reactiveDiscoveryClients": {
            "components": {
                "Simple Reactive Discovery Client": {
                    "description": "Discovery Client not initialized",
                    "status": "UNKNOWN"
                }
            },
            "description": "Discovery Client not initialized",
            "status": "UNKNOWN"
        },
        "readinessStateProbeIndicator": {
            "status": "UP"
        },
        "refreshScope": {
            "status": "UP"
        }
    },
    "groups": [
        "liveness",
        "readiness"
    ],
    "status": "UP"
}

This may relate to #22107.

chadlwilson · 2020-07-25T07:42:55Z

After a bit more digging: I'm not sure why, or whether it was intended, but the issue seems to be that readinessState has become readinessStateProbeIndicator (and likewise for livenessState). The old configuration therefore no longer included the indicator at all, leaving the readiness group empty.

The following configuration seems to work as expected:

management.endpoint.health.group.liveness.include=livenessStateProbeIndicator,diskSpace,refreshScope
management.endpoint.health.group.readiness.include=readinessStateProbeIndicator

bclozel · 2020-07-25T10:11:13Z

Yes this is an unintended side effect of #22107. The workaround you're mentioning is the right one in the meantime.

Thanks for raising this issue!

chadlwilson · 2020-07-25T10:20:15Z

No problem - feel free to re-title it as appropriate.

Unfortunately this is a silently breaking change for many people: they probably won't realise that the probe status is no longer included in the group status alongside, say, db, redis etc., because including a non-existent indicator in a group doesn't seem to fail startup :(

bclozel · 2020-07-25T11:12:03Z

I've tagged this issue as a regression.

I'm really sorry for letting in that one.

OrangeDog · 2020-07-27T10:24:15Z

Does this cover the fact that they are listed under groups at /health, but then don't actually exist?

agrappin · 2020-07-29T11:42:57Z

I have exactly the same issue as @OrangeDog. On my container with management.endpoint.health.probes.enabled=true:

* When executing GET `/actuator/health`:
  `{ "status": "UP", "groups": [ "liveness", "readiness" ] }`

* When executing GET `/actuator/health/liveness`:
  `404 Not Found`

chadlwilson · 2020-07-29T13:16:19Z

> * When executing GET `/actuator/health`:
>   `{ "status": "UP", "groups": [ "liveness", "readiness" ] }`
>
> * When executing GET `/actuator/health/liveness`:
>   `404 Not Found`

I agree this is potentially confusing, but doesn't seem to be the main problem here?

I wonder whether the /actuator/health endpoint behaved differently under 2.3.1 when a group had no configured components, i.e. did it filter them out of groups: []?

I guess this is a matter of design - the group exists but has no (valid) components, therefore its status is indeterminate, therefore the implementation returns a 404? It certainly can't return 200 OK....

Would we

* want to be aware the groups exist, so we know we can add components to them with include? or
* have them disappear from the top-level endpoint so we don't even know they are there?

ttddyy · 2020-07-29T14:54:18Z

Instead of referencing readinessStateProbeIndicator and livenessStateProbeIndicator, I think you need to set the management.health.livenessstate.enabled and management.health.readinessstate.enabled properties introduced by Spring Boot 2.3.2, so that you can reference readinessState and livenessState.

When the management.health.[readiness|liveness]state.enabled properties are set to false (the default), AvailabilityProbesAutoConfiguration creates readinessStateProbeIndicator and livenessStateProbeIndicator beans, which need to be referenced by their full bean names, [readiness|liveness]StateProbeIndicator.

On the other hand, when these properties are enabled, AvailabilityHealthContributorAutoConfiguration creates [readiness|liveness]StateHealthIndicator beans, which can be referenced as [readiness|liveness]State.

The problem is in AvailabilityProbesHealthEndpointGroups, created by AvailabilityProbesHealthEndpointGroupsPostProcessor: it creates the readiness/liveness groups with [readiness|liveness]State.
So, if [readiness|liveness]State are not available, the groups are created but the referenced HealthIndicator beans are not there.
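
The alternative described in this comment could be sketched as the following properties fragment. It assumes Spring Boot 2.3.2 and takes the property and indicator names from the comment itself; it has not been checked against other versions.

```properties
# Enable the availability health indicators
# (reported above as disabled by default in 2.3.2)
management.health.livenessstate.enabled=true
management.health.readinessstate.enabled=true

# With the indicators enabled, the short names can be used in the groups,
# as in the configuration from the original report
management.endpoint.health.group.liveness.include=livenessState,diskSpace,refreshScope
management.endpoint.health.group.readiness.include=readinessState
```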

OrangeDog · 2020-07-29T14:58:58Z

> want to be aware the groups exist, so we know we can add components to them with include?

The API response is supposed to be for consumers of the API, not documenting configuration options for the developer. Like the rest of the actuator system, only endpoints that are currently available should be listed as available.

agrappin · 2020-07-30T07:27:25Z

> When the management.health.[readiness|liveness]state.enabled properties are set to false (the default)

FYI surprisingly enough Spring Boot decided to name the readiness state property management.health.readynessstate.enabled with a y in the 2.3.2.RELEASE version (most recent release at this date).

See the reference: https://docs.spring.io/spring-boot/docs/2.3.2.RELEASE/reference/html/appendix-application-properties.html#actuator-properties

OrangeDog · 2020-07-30T08:19:16Z

@antoinegrappin no, that's just a documentation error. The property is readiness.

agrappin · 2020-07-30T08:53:55Z

@OrangeDog indeed, I confirm after tests.

bclozel · 2020-08-01T20:05:26Z

This issue is now fixed in the 2.3.3 and 2.4.0 SNAPSHOTs.

I've carefully read the comments on this issue regarding the following surprising behavior: getting a 404 status on a configured health group when no indicator is present. In this particular case it's arguably wrong, but here we're dealing with a regression. Some of you suggested that

* a missing indicator in a group should fail the application at startup, or
* an empty group should disappear from the list of groups on the main endpoint.

The first alternative sounds nice, especially for detecting bad configurations. But it's also likely to fail in perfectly valid cases. Your application could configure a group management.endpoint.health.group.custom.include=ping,redis and fail in a test environment where no redis instance is available. Because Spring Boot reacts to the environment, it's expected to behave differently and adapt to the situation.

The second alternative is debatable. Right now our health groups support is auto-configured with the configuration properties and does not look into the application context to check for the existence of health indicators. We seem to all agree that a 404 response status is right in this case. Removing the group information would, in my opinion, make things less consistent as we wouldn't know that a group has been configured. After all, a health group is just a way to wrap several indicators under the same name and customize its global health status - but health indicators are still dynamic.

After discussing this briefly with the team, we didn't think that this needs to be changed. Note that this behavior has existed since the introduction of the health groups feature. If you can make a stronger case for changing this, please create a dedicated issue and explain how this behavior is inconsistent or could lead to issues.

Thanks!

chadlwilson · 2020-08-16T14:58:30Z

Thanks @bclozel - fix is working fine in 2.3.3 after removing the workaround to the probe names I mentioned above :-)
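
For readers looking for a 2.3.3 configuration: removing the workaround presumably means going back to the group definitions from the original report. A minimal sketch, assuming the rest of that configuration stays unchanged:

```properties
# Short indicator names, as in the original report (reported to work again on 2.3.3)
management.endpoint.health.group.liveness.include=livenessState,diskSpace,refreshScope
management.endpoint.health.group.readiness.include=readinessState
```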

salaboy · 2020-08-30T11:15:37Z

@chadlwilson can you share your configurations in 2.3.3? I am finding the same issue there..

bclozel · 2020-08-30T11:42:46Z

@salaboy If your application runs on Kubernetes, you don't need any specific configuration.
If it doesn't, you need to enable the probes with the following:

management.endpoint.health.probes.enabled=true
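
The original report at the top of this thread spells this switch differently. A sketch of both forms, on the assumption (not confirmed in this thread) that they control the same feature and that the management.endpoint.health form is the newer name; check the reference documentation for your version before relying on either:

```properties
# Form used in the comment above
management.endpoint.health.probes.enabled=true

# Form used in the original report's configuration (assumed to be the older name)
# management.health.probes.enabled=true
```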

vishalmamidi · 2022-07-13T09:19:24Z

@bclozel what values am I supposed to give in my deployment manifests?

ahmetgeymen · 2022-07-13T12:43:50Z

> @bclozel what values am I supposed to give in my deployment manifests?

...
livenessProbe:
  httpGet:
    path: /actuator/health/liveness
    port: http
readinessProbe:
  httpGet:
    path: /actuator/health/readiness
    port: http
...

The issue has been resolved with version 2.3.3. You can expose separate probes with dedicated Health Indicators. You may want to look up here.

spring-projects-issues added the status: waiting-for-triage label Jul 25, 2020

bclozel self-assigned this Jul 25, 2020

bclozel added the type: bug label and removed the status: waiting-for-triage label Jul 25, 2020

bclozel added this to the 2.3.3 milestone Jul 25, 2020

bclozel added the type: regression label and removed the type: bug label Jul 25, 2020

joergjo added a commit to joergjo/springboot-samples that referenced this issue Jul 27, 2020

fix: roll back to Spring Boot 2.3.1

9222c5b

spring-projects/spring-boot#22562

wilkinsona changed the title ~~Kuberenetes readiness probe endpoint returning 404 on Spring Boot 2.3.2~~ Kubernetes readiness probe endpoint returning 404 on Spring Boot 2.3.2 Jul 27, 2020

sabahirfan mentioned this issue Jul 27, 2020

build(deps): bump org.springframework.boot from 2.3.1.RELEASE to 2.3.2.RELEASE hmcts/unspec-service#79

Closed

bclozel mentioned this issue Aug 1, 2020

Kubernetes readiness probe endpoint returning 404 #22698

Closed

bclozel changed the title ~~Kubernetes readiness probe endpoint returning 404 on Spring Boot 2.3.2~~ Kubernetes readiness probe endpoint returning 404 Aug 1, 2020

bclozel closed this as completed in 8dedeb4 Aug 1, 2020

janolaveide added a commit to navikt/foreldrepengesoknad-api that referenced this issue Aug 5, 2020

https://github.com/spring-projects/spring-boot/issues/22562

0638b01

janolaveide added a commit to navikt/foreldrepengesoknad-api that referenced this issue Aug 5, 2020

https://github.com/spring-projects/spring-boot/issues/22562

33f869b

janolaveide added a commit to navikt/foreldrepengesoknad-api that referenced this issue Aug 5, 2020

https://github.com/spring-projects/spring-boot/issues/22562

26fdea9

tomaszpowroznik mentioned this issue Aug 7, 2020

chore(deps): bump org.springframework.boot from 2.3.1.RELEASE to 2.3.2.RELEASE hmcts/fpl-ccd-configuration#1450

Closed

mbhave added a commit to spring-io/start.spring.io that referenced this issue Aug 11, 2020

Revert to Spring Boot 2.3.1

7de6dea

It seems that this regression in 2.3.2 causes the liveness endpoint to 404: spring-projects/spring-boot#22562 and the app goes in a crash loop

Davio mentioned this issue Aug 21, 2020

Endpoints for liveness and readiness are changed in Spring Boot 2.3.3 #23035

Closed
