Azure Kubernetes Service: 2020-06 updates #7233

tombuildsstuff · 2020-06-05T14:36:24Z

There's a number of Pull Requests open for Azure Kubernetes Service - so that we can merge these without conflicting in every Pull Request - this PR consolidates these changes into a single branch.

To achieve this these branches have been pulled locally, squashed, had any fixes applied and then cherry-picked onto this "combined" branch - so the original authors credit should still apply.

At the same time this PR adds support for a number of Feature Requests - specific details are below:

dependencies: updating to use version 2020-03-01 of the containerservice API
Data Source: azurerm_kubernetes_cluster
- bug fix: fixing an issue where some read-only fields were unintentionally marked as user-configurable.
- exposing the orchestrator_version being used for each Node Pool
- exposing the field disk_encryption_set_id
New Data Source: azurerm_kubernetes_cluster_node_pool
azurerm_kubernetes_cluster
- Azure China:
  - the Azure Policy add-on is not available and no longer sent
- Azure US Government:
  - the Azure Policy add-on is not available and no longer sent
  - the Kubernetes Dashboard add-on is not available and no longer sent
- Support for configuring/updating the version of Kubernetes used in the Default Node Pool (Add Orchestrator version for Kubernetes Cluster and Node Pool #6047 by @titilambert)
- Support for Azure Active Directory (Managed) Integration v2 (incorporating Enable AKS AAD integration v2 + SDK version bump #6530 by @jlpedrosa)
- Support for using a Disk Encryption Set (incorporating Add support for encrypted disks in aks #7110 by @jlpedrosa)
- Support for configuring the Auto Scale Profile (incorporating kubernetes_cluster - Add support for auto_scale_profile #6620 by @aristosvo)
- Support for configuring outbound_ports_allocated and idle_timeout_in_minutes within the load_balancer_profile block (incorporating Support more properties for azurerm_kubernetes_cluster #5824 by @neil-yechenwei)
- Support for the Uptime SLA / Paid SKU (tracked in Support for AKS uptime SLA #6912)
- Exposing the private_fqdn of the cluster
- Checking for a System Node Pool when importing
azurerm_kubernetes_cluster_node_pool
- Support for configuring/updating the version of Kubernetes being used (incorporating Add Orchestrator version for Kubernetes Cluster and Node Pool #6047 by @titilambert)
- Support for Spot Node Pools (superseding the work done in Spot Priority support for AKS #7145 by @pbrit / WIP : Support for spot node pool #6231 by @rajalokan)
- Support for configuring System/User Node Pools (tracked in Support for mode:system pools in AKS #6058)
- Changes to the node_taints now force a new resource, matching the updated API behaviour
azurerm_linux_virtual_machine_scale_set
- adding validation for the max_bid_price field
azurerm_windows_virtual_machine_scale_set
- adding validation for the max_bid_price field

Whilst pulling this many changes into a single Pull Request isn't ideal (and we'd generally avoid it) - this allows us to merge the changes above whilst avoiding merge conflicts in all the open PR's and so appeared to be the most pragmatic approach to move forward.

One thing in particular to call out is updating Node Pools - where a Kubernetes Cluster/Control Plane must be updated before the Node Pools can be updated; as such we've added a check during the Apply to ensure that the version of Kubernetes used for the Node Pool is supported by the Control Plane (or if we need to upgrade that first). Example of that error:

Error:
The Kubernetes/Orchestrator Version "1.16.9" is not available for Node Pool "default".

Please confirm that this version is supported by the Kubernetes Cluster "tom-dev-aks"
(Resource Group "tom-dev-rg") - which may need to be upgraded first.

The Kubernetes Cluster is running version "1.16.8".

The supported Orchestrator Versions for this Node Pool/supported by this Kubernetes Cluster are:
 * 1.14.7
 * 1.16.8
 * 1.15.10
 * 1.14.8
 * 1.15.11

Node Pools cannot use a version of Kubernetes that is not supported on the Control Plane. More
details can be found at https://aka.ms/version-skew-policy.

Fixes #1915
Fixes #4327
Fixes #5134
Fixes #5487
Fixes #5541
Fixes #5646
Fixes #6058
Fixes #6086
Fixes #6462
Fixes #6464
Fixes #6612
Fixes #6702
Fixes #6912
Fixes #6994
Fixes #7092
Fixes #7136
Fixes #7198

mbfrahry

LGTM with some minor additional checks

azurerm/internal/services/containers/kubernetes_cluster_data_source.go

azurerm/internal/services/containers/kubernetes_cluster_node_pool_resource.go

azurerm/internal/services/containers/kubernetes_cluster_node_pool_data_source.go

azurerm/internal/services/containers/kubernetes_cluster_node_pool_resource.go

pbrit · 2020-06-05T21:22:05Z

@tombuildsstuff Thank you for your work.

However, I'm concerned about your approach. Yes, it's an effective way to add more functionality, yet, it's equally effective for introducing more bugs.

Even after a short review session, I see many issues with the way SpotPriority was re-implemented. It's just one chunk, there are 10+ more which I'm not familiar with.

I'm going to take this branch for a ride and will provide the feedback.
Also, I'm wondering what the rest of community think of about it.

Porting over the changes from #5824 X-Committed-With: neil-yechenwei

…nflicts going forward

… prior to deployment This raises an error with more information about which Kubernetes Versions are supported by the Kubernetes Cluster - and prompts a user to upgrade the Kubernetes Cluster first if necessary. This commit also adds acceptance tests to confirm the upgrade scenarios

…constraints for AKS

…ixes #6702.

Fixes #6912

…d now it's available

This allows for setting `mode` to `System` (or `User`) for definining secondary System node pools. Fixes #6058

tombuildsstuff · 2020-06-08T09:36:00Z

@pbrit

Due to the nature of AKS unfortunately we're frequently faced with breaking changes (either behaviourally, or code changes) within AKS every few months. As such over time we've built up an extensive test suite for AKS which covers all kinds of configurations, functionality and bugs - which we're continually adding to as new functionality gets added/bugs get fixed.

We'd had a heads-up of some potentially breaking changes coming to the AKS API's in the near-future - as such we've been trying to find out more information about the specific details. In the interim, there's been a number of PR's opened (including yours) which have looked to add new functionality/fix bugs in this resource.

We've recently got more details about these changes and it appears that they won't break us in the way we'd been expecting - as such we're good to proceed with these PR's, however having a number of open PR's puts us in an unfortunate position where merging one causes merge conflicts in the other.

Having spent some time reviewing these PR's and determining how's best to proceed, we came to the conclusion that consolidating these (7 PR's) together was the most pragmatic way forward. Since a number of these PR's also required (and vendored) a newer API version - the best way to do this was to squash the commits down, remove any vendoring changes and then fix comments from PR review.

Whilst generally speaking I'd agree with you regarding a single large PR's vs multiple small PR's - in this instance we're fortunate to have sufficient test coverage of the AKS Resources that we're able to detect bugs both in our code and in the AKS API's themselves. To give one specific example here: the test suite had detected a breaking behavioural change in the AKS API when updating a Load Balancer in a given configuration (which I'd also spotted during development) - but will be pushing a fix for shortly.

Whilst we appreciate there's some downsides to this approach - and we'd have preferred to have gone through and merged those PR's individually - after digging into it consolidating these appeared to be the more pragmatic solution. That said, generally speaking we'd tend to prefer/merge more, smaller PR's where possible - this is only possible due to the large test coverage we've got for AKS.

Even after a short review session, I see many issues with the way SpotPriority was re-implemented. It's just one chunk, there are 10+ more which I'm not familiar with.

To answer your specific questions around Spot Max Price - unfortunately different Azure Teams can refer to the same functionality using different terminology. As such, whilst we could seek to use a consistent name here - in this instance we're following the name used by the AKS Service, rather than the Compute Service (since this is exposed as "Spot Max Price" in the AKS API rather than "Spot Bid Price" in the VMSS API).

If you've noticed specific issues/bugs in the PR then please feel free to comment on the PR Review and we can take a look/work through those however :)

…_profile` block within the `network_profile` block

aristosvo · 2020-06-08T13:11:17Z

@tombuildsstuff Thanks for refactoring 'my' code regarding the delta-updates for the load_balancer_profile. I'm still a bit afraid #6534 might resurface, as it's not clear from the code and the tests how this works out.

I probably won't have time available to write the test for it, but it comes down to the following scenario:

if idle_timeout_in_minutes is changed
outbound_ip_address_ids was configured, but not changed
Is the outbound_ip_adress picked up or will it trigger an error?

tombuildsstuff · 2020-06-08T14:13:00Z

@aristosvo thanks for looking - for context there's a breaking change in the new API version where the field idle_timeout_in_minutes now always needs to be specified (which caused the tests TestAccAzureRMKubernetesCluster_addAgent and TestAccAzureRMKubernetesCluster_removeAgent to fail) - which is why this is being updated.

From what I can tell, the scenario for #6534 is captured in the Test TestAccAzureRMKubernetesCluster_changingLoadBalancerProfile (which is currently failing / I'm working through fixing at the moment) - but we can add an additional one if that's not sufficient, we'll see shortly :)

jackofallops

LGTM 👍 Thanks Tom!

…count`, `outbound_ip_address_ids` and `outbound_ip_prefix_ids` fields"

…etesClusterNodePoolDataSource_basic`

…e_policy` In a change from last week - the API now defaults to v2. Since v1 is deprecated, this commit removes support for it.

tombuildsstuff · 2020-06-10T13:31:41Z

Running the test suite for this, the tests look good:

Of the failures:

1 was a permissions error (since resolved)
3 are previously known/open Azure API bugs
2 require re-working - for one AKS changed it's implementation a while back and these need updating, the other will be fixed by fixing Changing "azurerm_kubernetes_cluster" default_node_pool.vm_size forces replacement of the whole kubernetes cluster #7093

So this should be good 👍

…licy` block

…both the Data Source & Resource

tombuildsstuff · 2020-06-10T14:46:29Z

Data Source test passes after updating the CheckDestroy helper:

$ TF_PROVIDER_SPLIT_COMBINED_TESTS=1 TF_ACC=1 go test -v ./azurerm/internal/services/containers/tests/ -timeout=60m -run=TestAccAzureRMKubernetesClusterNodePoolDataSource_basic
=== RUN   TestAccAzureRMKubernetesClusterNodePoolDataSource_basic
=== PAUSE TestAccAzureRMKubernetesClusterNodePoolDataSource_basic
=== CONT  TestAccAzureRMKubernetesClusterNodePoolDataSource_basic
--- PASS: TestAccAzureRMKubernetesClusterNodePoolDataSource_basic (1428.48s)
PASS
ok  	github.com/terraform-providers/terraform-provider-azurerm/azurerm/internal/services/containers/tests	1428.528s

ghost · 2020-06-11T13:37:49Z

This has been released in version 2.14.0 of the provider. Please see the Terraform documentation on provider versioning or reach out if you need any assistance upgrading. As an example:

provider "azurerm" {
    version = "~> 2.14.0"
}
# ... other configuration ...

ghost · 2020-07-11T14:52:26Z

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues.

If you feel this issue should be reopened, we encourage creating a new issue linking back to this one for added context. If you feel I made an error 🤖 🙉 , please reach out to my human friends 👉 hashibot-feedback@hashicorp.com. Thanks!

tombuildsstuff added enhancement dependencies new-data-source service/vmss Virtual Machine Scale Sets service/kubernetes-cluster labels Jun 5, 2020

tombuildsstuff added this to the v2.14.0 milestone Jun 5, 2020

tombuildsstuff requested a review from a team June 5, 2020 14:36

ghost added the size/XXL label Jun 5, 2020

tombuildsstuff mentioned this pull request Jun 5, 2020

Support more properties for azurerm_kubernetes_cluster #5824

Closed

mbfrahry approved these changes Jun 5, 2020

View reviewed changes

azurerm/internal/services/containers/kubernetes_cluster_data_source.go Outdated Show resolved Hide resolved

azurerm/internal/services/containers/kubernetes_cluster_node_pool_resource.go Outdated Show resolved Hide resolved

tombuildsstuff self-assigned this Jun 5, 2020

pbrit suggested changes Jun 5, 2020

View reviewed changes

azurerm/internal/services/containers/kubernetes_cluster_node_pool_data_source.go Show resolved Hide resolved

azurerm/internal/services/containers/kubernetes_cluster_node_pool_resource.go Show resolved Hide resolved

tombuildsstuff and others added 17 commits June 8, 2020 10:27

r/kubernetes_cluster: support for load balancer settings

f25d744

Porting over the changes from #5824 X-Committed-With: neil-yechenwei

d/kubernetes_cluster: making all read-only properties read-only

5c0da25

Add Orchestrator version for Kubernetes Cluster and Node Pool

5f01359

tests: splitting the test dictionaries amongst the files to reduce co…

2c7d831

…nflicts going forward

r/kubernetes_cluster: comments from PR review

a5299f3

r/kubernetes_cluster: mapping spot_max_price for the default node pool

3ef2bdc

r/kubernetes_cluster_(node_pool): adding a note covering the version …

84cd429

…constraints for AKS

r/kubernetes_cluster: Azure Policy is not supported in US Government. F…

fb3ec28

…ixes #6702.

init kubernetes_cluster auto_scale_profile

6d2256c

r/kubernetes_cluster: fixing pr comments

0188f09

r/kubernetes_cluster: grouping optional fields

e222692

dependencies: updating to use 2020-03 of the containerservice api

0af18ac

r/kubernetes_cluster: support for the Paid SKU (a.k.a. Uptime SLA)

5348f2a

Fixes #6912

fixing broken tests from merge conflicts

7b59502

r/kubernetes_cluster: exposing the balance_similar_node_groups fiel…

2101cdf

…d now it's available

r/kubernetes_cluster_node_pool: support for configuring mode

ccd95cd

This allows for setting `mode` to `System` (or `User`) for definining secondary System node pools. Fixes #6058

tombuildsstuff added 3 commits June 8, 2020 10:27

terrafmt

6dee2b2

r/kubewrnetes_cluster: keeping the linter happy

53bbabd

r/kubernetes_cluster: fixing pr comments

b9e698f

tombuildsstuff force-pushed the f/aks-updates-2020-06 branch from 88ccfd8 to e1e63cc Compare June 8, 2020 08:28

r/kubernetes_cluster: fixing the key vault test

2d5198b

tombuildsstuff force-pushed the f/aks-updates-2020-06 branch from e1e63cc to 2d5198b Compare June 8, 2020 09:38

r/kubernetes_cluster: supporting delta-updates for the `load_balancer…

8c22d58

…_profile` block within the `network_profile` block

linting

fe08fce

jackofallops approved these changes Jun 8, 2020

View reviewed changes

tombuildsstuff added 4 commits June 8, 2020 16:48

r/kubernetes_cluster: conditionally setting the `managed_outbound_ip_…

ed3c9ce

…count`, `outbound_ip_address_ids` and `outbound_ip_prefix_ids` fields"

r/kubernetes_cluster_node_pool: fixing the test `TestAccAzureRMKubern…

e3b88ce

…etesClusterNodePoolDataSource_basic`

(d|r)/kubernetes_cluster: version is no longer applicable for `azur…

6cd6681

…e_policy` In a change from last week - the API now defaults to v2. Since v1 is deprecated, this commit removes support for it.

r/kubernetes_cluster: conditionally nil-ing the load balancer profile

d4e1e0a

tombuildsstuff force-pushed the f/aks-updates-2020-06 branch from f3e934d to d4e1e0a Compare June 9, 2020 08:13

gosimple

05378e6

tombuildsstuff added 2 commits June 10, 2020 15:39

d/kubernetes-cluster: removing the version field from the `azure_po…

c4cee44

…licy` block

r/kubernetes_cluster_node_pool: making the CheckDestroy test support …

87d0519

…both the Data Source & Resource

tombuildsstuff merged commit 277f16c into master Jun 10, 2020

tombuildsstuff deleted the f/aks-updates-2020-06 branch June 10, 2020 14:51

tombuildsstuff added a commit that referenced this pull request Jun 10, 2020

updating to include #7233

e15bb5d

thomas-riccardi mentioned this pull request Jun 10, 2020

Changing "azurerm_kubernetes_cluster" default_node_pool.vm_size forces replacement of the whole kubernetes cluster #7093

Closed

etiennetremel mentioned this pull request Jun 16, 2020

Load balancer profile allocated ports is not in an allowable range for azurerm_kubernetes_cluster #7340

Closed

hashicorp locked and limited conversation to collaborators Jul 11, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Azure Kubernetes Service: 2020-06 updates #7233

Azure Kubernetes Service: 2020-06 updates #7233

tombuildsstuff commented Jun 5, 2020 •

edited

mbfrahry left a comment

pbrit commented Jun 5, 2020

tombuildsstuff commented Jun 8, 2020

aristosvo commented Jun 8, 2020 •

edited

tombuildsstuff commented Jun 8, 2020

jackofallops left a comment

tombuildsstuff commented Jun 10, 2020

tombuildsstuff commented Jun 10, 2020

ghost commented Jun 11, 2020

ghost commented Jul 11, 2020

Azure Kubernetes Service: 2020-06 updates #7233

Azure Kubernetes Service: 2020-06 updates #7233

Conversation

tombuildsstuff commented Jun 5, 2020 • edited

mbfrahry left a comment

Choose a reason for hiding this comment

pbrit commented Jun 5, 2020

tombuildsstuff commented Jun 8, 2020

aristosvo commented Jun 8, 2020 • edited

tombuildsstuff commented Jun 8, 2020

jackofallops left a comment

Choose a reason for hiding this comment

tombuildsstuff commented Jun 10, 2020

tombuildsstuff commented Jun 10, 2020

ghost commented Jun 11, 2020

ghost commented Jul 11, 2020

tombuildsstuff commented Jun 5, 2020 •

edited

aristosvo commented Jun 8, 2020 •

edited