You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -783,6 +787,169 @@ new tasks.EmrModifyInstanceGroupByName(this, 'Task', {
783
787
});
784
788
```
785
789
790
+
## EMR on EKS
791
+
792
+
Step Functions supports Amazon EMR on EKS through the service integration pattern.
793
+
The service integration APIs correspond to Amazon EMR on EKS APIs, but differ in the parameters that are used.
794
+
795
+
[Read more](https://docs.aws.amazon.com/step-functions/latest/dg/connect-emr-eks.html) about the differences when using these service integrations.
796
+
797
+
[Setting up](https://docs.aws.amazon.com/emr/latest/EMR-on-EKS-DevelopmentGuide/setting-up.html) the EKS cluster is required.
798
+
799
+
### Create Virtual Cluster
800
+
801
+
The [CreateVirtualCluster](https://docs.aws.amazon.com/emr-on-eks/latest/APIReference/API_CreateVirtualCluster.html) API creates a single virtual cluster that's mapped to a single Kubernetes namespace.
802
+
803
+
The EKS cluster containing the Kubernetes namespace where the virtual cluster will be mapped can be passed in from the task input.
804
+
805
+
```ts
806
+
newtasks.EmrContainersCreateVirtualCluster(this, 'Create a Virtual Cluster', {
The [StartJobRun](https://docs.aws.amazon.com/emr-on-eks/latest/APIReference/API_StartJobRun.html) API starts a job run. A job is a unit of work that you submit to Amazon EMR on EKS for execution. The work performed by the job can be defined by a Spark jar, PySpark script, or SparkSQL query. A job run is an execution of the job on the virtual cluster.
845
+
846
+
Required setup:
847
+
848
+
- If not done already, follow the [steps](https://docs.aws.amazon.com/emr/latest/EMR-on-EKS-DevelopmentGuide/setting-up.html) to setup EMR on EKS and [create an EKS Cluster](https://docs.aws.amazon.com/cdk/api/latest/docs/aws-eks-readme.html#quick-start).
- Enable [IAM Role access](https://docs.aws.amazon.com/emr/latest/EMR-on-EKS-DevelopmentGuide/setting-up-enable-IAM.html)
851
+
852
+
The following actions must be performed if the virtual cluster ID is supplied from the task input. Otherwise, if it is supplied statically in the state machine definition, these actions will be done automatically.
853
+
854
+
- Create an [IAM role](https://docs.aws.amazon.com/cdk/api/latest/docs/@aws-cdk_aws-iam.Role.html)
855
+
- Update the [Role Trust Policy](https://docs.aws.amazon.com/emr/latest/EMR-on-EKS-DevelopmentGuide/setting-up-trust-policy.html) of the Job Execution Role.
856
+
857
+
The job can be configured with spark submit parameters:
resources: ['*'],// We need * permissions for creating a virtual cluster https://docs.aws.amazon.com/emr/latest/EMR-on-EKS-DevelopmentGuide/setting-up-iam.html
0 commit comments