Skip to content

Releases: cloudprober/cloudprober

Pre-release binaries

16 May 08:08
Compare
Choose a tag to compare
Pre-release binaries Pre-release
Pre-release
  • This pre-release is built from the latest source code. Whenever a new change is pushed to the master, this release gets updated.
  • Docker equivalent of this release will be the "main" or "master" tag.
    docker run ghcr.io/cloudprober/cloudprober:main or
    docker run cloudprober/cloudprober:main
    

v0.13.5 Couple of bug fixes and support for reading files from cloud

22 May 17:18
9a9736f
Compare
Choose a tag to compare

What's Changed

  • [cloudprober] Add support for reading files from S3 and GCS. by @manugarg in #396
  • [surfacer.postgres] Switch to newer pgx postgres driver by @manugarg in #305
  • [validators] Public validators package by @manugarg in #747
  • [probes.grpc] Run validators only if probe succeeded. by @manugarg in #737

Bug fixes

  • [probes.external] Fix a bug in serverutils (#748). If you run an external probe in server mode, and your external probe binary uses serverutils Go library, you may be impacted by this bug. This was bug was introduced in the last version (v0.13.4) during refactoring for more tests coverage.
  • [probes.external] Fix logical race condition in external server probe handling by @apolcyn in #744

Other changes

  • [targets.example] Add targets extensions example. by @manugarg in #742
  • [probes.external] Fix sporadic pipe read error logs. by @manugarg in #739

Full Changelog: v0.13.4...v0.13.5

v0.13.4 - Improved gRPC and external probes, HTTP latency breakdown and more

23 Apr 16:23
afd7fa7
Compare
Choose a tag to compare

New Features / Enhancements

  • Latency Breakdown for HTTP Probes
    Provide a way to report latency breakdown for HTTP probes (#699).

  • gRPC Probe Enhancements

    • Wait for successful connection before sending the request (#726, #729). By default gRPC's DialContext returns immediately even for the non-retryable errors. This mechanism doesn't work very well for probing -- there is no error message to show problems with the connection and connection is in the TRANSIENT_FAILURE state. We now use grpcurl's BlockingDial which takes care of these issues.
    • Use client TLS by default for encryption. (#727)
    • Capture request error messages better. (#731)
  • Streaming Metrics for external probes
    Parse and export external probes' metrics as soon as they are available. (#708, #712, #713, #715, #716, #722)
    This feature enables use cases where external probe runs less frequently (say 1 every 60s) but runs many tasks (for performance measurement, e.g.) and export results many times (say every 5s) within that interval. See discussion in #691 and #689 for more background.

  • Bulk writes in postgres surfacer
    Batch postgres surfacer writes to improve peformance (#717).

  • Allow DNS Overrides
    Allow overriding DNS server (#707).

Other Changes

  • Add a command line flag to control prometheus metrics prefix: --prometheus_metrics_prefix (#732).
  • Fix ostgres surfacer metric filter (#711).
  • [prober.saveconfig] Write probe configs to disk in order (#721).
  • [tls] Reload server certificates as well (#719).
  • [build.tools] Fix where python proto stub is copied to (#700).
  • [build] Update net dependency for sec alert (#728).
  • [build] Update protobuf package to fix security alert (#714).

Contributors

Full Changelog: v0.13.3...v0.13.4

v0.13.3 - Probe scheduling, DNS probing improvements, jsonnet configs, dynamic config persistence and more

19 Mar 20:08
b0f610d
Compare
Choose a tag to compare

What's Changed

  • Probe scheduling capability:
    • You can now run probes only on the weekdays, only business hours, or turn them off during certain time periods. (#652, #662, #683)
  • External probe server:
    • Provide a python package for external probe serverutils. You can now write external server probes in python. (#694, see #696)
  • DNS Probe Improvements:
    • Allow running multiple probes in parallel. (#670)
    • Add support for TCP DNS probes. (#681)
  • Dynamic config: Provide a mechanism to save config to disk on dynamic change. If you program probes dynamically using gRPC interface for example, you can configure cloudprober such that it will reload the config on restart (see #645 for more background). (#671)
  • Jsonnet configs support. You can now write Cloudprober configs in textpb (protobuf text), YAML, JSON, and Jsonnet (#687)
  • [surfacers.otel] Additional resource attribute support. (#664)
  • [tls] Fix client cert handling when cert reloading is enabled. (#697)
  • [logs] Redirect container logs to journald. (#682)
  • [website.homepage] Add a diagram to Cloudprober homepage. (#674)
  • [servers.http] Fix /healthcheck when lameduck lister is not initialized. (#684)

Contributors: @manugarg, @cbroglie, @aitorpazos, @ls692, @AdamEAnderson

New Contributors

Full Changelog: v0.13.2...v0.13.3

v0.13.2 OpenTelemetry surfacer and composable config files

20 Dec 22:37
ae2ac84
Compare
Choose a tag to compare

What's Changed

  • [config] Make cloudprober configs composable. You can now include other configs within a cloudprober config. This will make multi-team config management easier (yay!! 🎉) . #643 (Note this doesn't work for YAML configs yet)
    • Cloudprober helm chart also supports specifying additional configs now.
  • [surfacers] Add OpenTelemetry surfacer. OpenTelemetry is becoming very popular and almost all metrics systems support it. Adding this surfacer increases Cloudprober's integration capabilities multifold. There may still be some rough edges but give it try. #642
  • [surfacers] "failure" metric for all. Now all surfacers, except FILE and PUBSUB, export "failure" metric by default. You can still disable it if you want. #648
  • [surfacers] Filtering metrics by name works for all surfacers now except FILE and PUBSUB. #648
  • [probes.ping] Take a small pause between pushing packets to avoid overwhelming the network buffers by @jumpojoy in #634
  • [docs] Documentation enhancements. #627 #650
  • [build] Fix Dockerfile.dev by @jumpojoy in #632

Breaking Change

  • [targets] Rename targets.endpoints to targets.endpoint (#646). With this change targets { endpoints {} } config fields will result an error. Sorry for the breaking change, but since these fields were introduced only in the last release (v0.13.1), impact should be minimal.

Security Update

  • Upgrade "crypto" package to fix the security alert by @manugarg in #653

New Contributors

Full Changelog: v0.13.1...v0.13.2

v0.13.1 - Opsgenie integration, alerting enhancements and other changes

06 Nov 19:53
1e70553
Compare
Choose a tag to compare

Alerting Enhancements

  • Opsgenie integration 🎉 #570
  • Add a generic HTTP notifier. This can be used for pretty much anything. #599
  • Resolve alerts automatically wherever possible. #556, #558, #580, #561.
  • Use alert name and target to deduplicate alerts instead of condition start timestamp: #583
  • Add severity to alerts. #569
  • Add target IP to available alert fields. #548
  • Improve documentation. #573, #549, #552, #585

Other Changes (consistency, logs verbosity, documentation, etc)

  • [probes] Make DNS resolve errors behavior more consistent: #616, #619, #620
  • Provide a way to specify detailed targets configuration in the config directly. This will simplify configuration quite significantly. #606
  • [targets] Make targets optional for certain probes. #614
  • [probes] Return an error if interval is smaller than timeout. #560
  • Reduce logs verbosity: #555, #562, #563
  • [docs.targets] Improve targets documentation. #617
  • [config] Streamline config usage and loading: #622
  • [probes.http] Support for new Cloudprober internal scheme, host and path labels. #607, #608
  • [website] Fix company list fonts on the homepage. #612

Build and testing

  • [build] Move a bunch of packages to internal #589, #590, #591, #592
  • [build] Test example configs during build. #543
  • [probes.test] Add tests to verify that empty configs work. #544
  • [cleanup] Cleanup usage of deprecated packages: #603, #604, #605
  • [probes.grpc] Disable connect failures test for macos. #579
  • [examples] Fix myprober example and simplify it. #568
  • [build] Don't fail fast. Run as many tests as possible. #574
  • [build] Run certain actions only in the main repository. #621

Bug fixes

  • [config] Fix bug in envSecret handling. #546
  • [config] Fix a bug in the /config-running disabling functiionality. #554

Security updates

  • [security] Upgrade gRPC package to fix the security issue. #602
  • [build] Update some depdendencies to fix security alerts. #577

Full Changelog: v0.13.0...v0.13.1

v0.13.0 - Alerts dashboard, alerting support for all probes, more config template functions, config documentation navigation

02 Oct 16:46
68833d7
Compare
Choose a tag to compare

Enhancements

  • [config] Much expanded config template functions (functions documentation). Use sprig's functions for config template by @manugarg in #524
  • [alerting] Add alerts dashboard at /alerts. It will show currently firing alerts and some historical alerts. by @manugarg in #530
  • [alerting] Add alerting to all probe types. by @manugarg in #533 & #532
  • [config] Provide a way to hide secret environment variables. You can use 'envSecret' instead of 'env' macro to make sure environment variable doesn't show on the expanded config (/config-running) page. by @manugarg in #536
  • [validators.http] Add last-modified header validation by @robpickerill in #493
  • [gcp] Add an option to override GCP API endpoint for resource discovery and logging. by @shellyvilenko in #525 & #537

Bug fixes

  • [bugfix] Fix a bug wherein HTTP probe will not consider a target as new if only its IP address changes. To fix that behavior, include target's IP in the target key. (#535) by @ls692 in #538

Documentation

  • [docs] Rename /config root file to _index.md by @manugarg in #497
  • [docs] Update protobuf documentation. by @manugarg in #499
  • [docs.alerting] Add first draft alerting documentation. by @manugarg in #501
  • [docs.config] Add ToC to the config documentation pages. by @manugarg in #503
  • [docs] Add config documentation for the postgres surfacer. by @manugarg in #512
  • [docs] Change default config language to textpb. by @manugarg in #516
  • [docs] Fix anchor links handling. by @manugarg in #518
  • [docs] Fix one more quirk of Firefox. by @manugarg in #521
  • [surfacers.doc] Fix surfacers documentation links. by @manugarg in #526

Full Changelog: v0.12.9...v0.13.0

v0.12.9 - YAML configs, structured logging, improved gRPC probe, more alerting targets, and much more

05 Sep 15:56
a07e5f7
Compare
Choose a tag to compare

Enhancements

Bug fixes

  • [metrics] Clone latency and other similar metrics. (by @manugarg in #437)
  • [metrics] In prometheus metrics metadata string, add a space after #. (by @aymeric-ledizes in #471)

Internal

  • [metrics.Map] Use generics to implement Map. #444
  • [metrics.latency] Add a new value type LatencyValue. #448
  • [metrics] Remove (now) unnecessary metric size check. #447
  • [config] Refactor config.go to kill unnecessary functions. #461
  • [alerting.email] Change config and add more tests. #472
  • [logger] Trim base paths from file and function names #474
  • [probestatus] Show debug info only if 'debug' parameter is set. #478

Build and Testing

  • Add flags to generate test reports. #432
  • [build] Run Sonar scanner explicitly. #433
  • [build] Run sonar scanner only for local PRs. #436
  • [surfacer.cw] Add tests for map values. #446
  • [surfacers.sd] Add more tests for stackdriver surfacer. #449
  • [docs] Add K8s targets page. #452
  • [website] Some style changes and small fixes. #454, #456
  • [build] Prevent import of surfacers code outside of surfacers package. #477
  • [probestatus] Add a minimal test for probestatus template rendering. #479
  • Exclude cmd files from coverage analysis. #480
  • [docs] Make sidebar menu better. #483

New Contributors

Full Changelog: v0.12.8...v0.12.9

v0.12.8 Bug fixes

25 Jul 19:17
8056812
Compare
Choose a tag to compare

Bug fixes

  • [probes.http] Fix HTTP request body handling and add a lot of tests. We were using a custom reader for HTTP request body to optimize some aspects of it, but some parts of the Go's HTTP implementation expect the request body to be a buffered reader. #408, #407, #409, #411
  • [oauth2] Fix a bug in HTTP token refresh, introduced by the above change. #422
  • [probes.http] Fix how we determine whether to change the TLS config servername. #420
  • [probes.udp] Fix additional labels handling for UDP probe. #421

Other changes

  • [alerting] Add support for email notifications. #403
  • [alerting] Fix config protobuf package name. #402
  • [build] Exclude "tip" from releases. #400
  • [probes.http] Include headers configured with header option in requests. #418

New Contributors

Full Changelog: v0.12.7...v0.12.8

v0.12.7 Alerting (new), enhancements, and bug fixes

01 Jun 20:33
e1aba46
Compare
Choose a tag to compare

Alerting

It's official. Cloudprober is getting its own alerting functionality 🎉. This release marks the beginning of the alerting implementation. We'll add more ways to notify an alert, but you can already trigger a command on alert. User documentation coming soon, but in the meantime, you can take a look at the alerting config proto to get an idea.

PRs: #359, #361, #364, #365, #380, #384, #387

Enhancements

  • [probes.http] Provide a way to set custom user-agent. (#317)
  • [probes.external] Add support for environment variables. (#379, #386).
  • [surfacers.datadog] Add compression, and use batching, when posting metrics. (#357, #358)
  • [surfacers] Adding a bigquery surfacer. (#274)
  • [cmd] Exit on non-flag arguments. (#372)
  • [prober] Start probing more quickly. (#382)
  • [validators] Log validation failures. (#377)

Bug fixes

  • [probes.http] Set request's GetBody explicitly to make redirect work for POST requests with data. To make HTTP requests work same across Cloudprober, move HTTP request creation to a common package. This also allows for better testing. (#373, #375, #376, #381, #383)
  • [probes.http] Fix a bug in connect_event metric update. (#388)
  • [rds.gcp] Delete a zone from cache if it's not discovered again. (#318)
  • [probestatus] Fix JS variable declarations. (#321, #322)
  • [probestatus] Fix the graph issue for probes with "." in name. (#363)
  • [oauth] Fix a bug in http_token. (#371)
  • [probes.external] Synchronize access to cmdRunning (#395). Add tests to discover the same issue (#397).

Build

  • [oauth] Improve token source test reliability. (#325)
  • [build] Update protoc-gen-go version to the latest. (#374)
  • [metrics.testutils] Refactor testutils to make it more convenient. (#378)

Docs

Complete website (cloudprober.org) redesign: #327. Other PRs: #335, #344, #349, #351.

New Contributors

Full Changelog: v0.12.6...v0.12.7