Snippets Groups Projects

2 months ago
08a078dc

Updated HPA parameters · 08a078dc
Ihor Anikeiev [EPAM / GCP] authored 2 months ago and Oleksandr Kosse (EPAM) committed 2 months ago

08a078dc

History

Updated HPA parameters
Ihor Anikeiev [EPAM / GCP] authored 2 months ago and Oleksandr Kosse (EPAM) committed 2 months ago

Code owners

Assign users and groups as approvers for specific file changes. Learn more.

To learn more about this project, read the wiki.

README.md 9.90 KiB

GC Policy service

Introduction

This chart deploys policy service on a Kubernetes cluster using Helm package manager.

Prerequisites

The code was tested on Kubernetes cluster (v1.23.12) with Istio (1.15)

It is possible to use other versions, but it hasn't been tested

Operation system

The code works in Debian-based Linux (Debian 10 and Ubuntu 20.04) and Windows WSL 2. Also, it works but is not guaranteed in Google Cloud Shell. All other operating systems, including macOS, are not verified and supported.

Packages

Packages are only needed for installation from a local computer.

HELM (version: v3.7.1 or higher) helm
Kubectl (version: v1.23.12 or higher) kubectl

Installation

First you need to set variables in values.yaml file using any code editor. Some of the values are prefilled, but you need to specify some values as well. You can find more information about them below.

Global variables

Name	Description	Type	Default	Required
global.domain	your domain for the external endpoint, ex `example.com`	string	-	yes
global.onPremEnabled	whether baremetal is enabled	boolean	`false`	yes
global.limitsEnabled	whether CPU and memory limits are enabled	boolean	`true`	yes
global.dataPartitionId	data partition id	string	-	yes
global.logLevel	severity of logging level	string	`ERROR`	yes
global.tier	Only PROD must be used to enable autoscaling	string	""	no
global.autoscaling	enables horizontal pod autoscaling, when tier=PROD	boolean	true	yes

Common variables

Name	Description	Type	Default	Required
data.logLevel	logging severity level for this service only	string	-	yes, only if differs from the `global.logLevel`
data.image	policy image name	string	-	yes
data.requestsCpu	amount of requests CPU	string	`10m`	yes
data.requestsMemory	amount of requests memory	string	`200Mi`	yes
data.limitsCpu	CPU limit	string	`1`	only if `global.limitsEnabled` is true
data.limitsMemory	memory limit	string	`1G`	only if `global.limitsEnabled` is true
data.serviceAccountName	name of your service account	string	-	yes
data.imagePullPolicy	when to pull image	string	`IfNotPresent`	yes
data.bucketName	bucket name	string	-	yes
data.scopes	scope of OPA	string	`https://www.googleapis.com/auth/devstorage.read_only`	yes
data.entitlementsHost	Entitlements host	string	`http://entitlements`	yes
data.entitlementsBasePath	Entitlements path	string	`/api/entitlements/v2/groups`	yes
data.useBundles	use bundle or not	string	`yes`	yes
data.legalHost	Legal host	string	`http://legal`	yes
data.partitionHost	Partition host	string	`http://partition`	yes

Baremetal variables

Name	Description	Type	Default	Required
data.minioHost	minio host	string	`http://minio:9000`	yes
conf.minioSecretName	secret name for the app	string	`policy-minio-secret`	yes

Config variables

Name	Description	Type	Default	Required
conf.appName	name of the app	string	`policy`	yes
conf.configmap	configmap to be used	string	`policy-config`	yes
conf.bootstrapSecretName	secret name for the bootstrap	string	`minio-bootstrap-secret`	yes
conf.minDelaySeconds	min delay for bundle download	num	`6`	yes
conf.maxDelaySeconds	max delay for bundle download	num	`12`	yes

Bootstrap variables

Name	Description	Type	Default	Required
data.bootstrapImage	image for bootstrap deployment	string	-	yes
data.bootstrapServiceAccountName	service account for bootstrap deployment	string	-	yes

OPA variables

Name	Description	Type	Default	Required
opa.conf.configmap	configmap to be used	string	`opa-config`	yes
opa.conf.envConfig	configmap with env vars	string	`opa-env-config`	yes
opa.conf.appName	name of the app	string	`opa`	yes
opa.data.serviceAccountName	name of your service account	string	`opa-k8s`	yes
opa.data.image	image name	string	-	yes

ISTIO variables

Name	Description	Type	Default	Required
istio.proxyCPU	CPU request for Envoy sidecars	string	`10m`	yes
istio.proxyCPULimit	CPU limit for Envoy sidecars	string	`500m`	yes
istio.proxyMemory	memory request for Envoy sidecars	string	`100Mi`	yes
istio.proxyMemoryLimit	memory limit for Envoy sidecars	string	`512Mi`	yes
istio.bootstrapProxyCPU	CPU request for Envoy sidecars	string	`10m`	yes
istio.bootstrapProxyCPULimit	CPU limit for Envoy sidecars	string	`100m`	yes

Horizontal Pod Autoscaling (HPA) variables (works only if tier=PROD and autoscaling=true)

Name	Description	Type	Default	Required
hpa.minReplicas	minimum number of replicas	integer	`3`	only if `global.autoscaling` is true and `global.tier` is PROD
hpa.maxReplicas	maximum number of replicas	integer	`15`	only if `global.autoscaling` is true and `global.tier` is PROD
hpa.targetType	type of measurements: AverageValue or Value	string	`"AverageValue"`	only if `global.autoscaling` is true and `global.tier` is PROD
hpa.targetValue	threshold value to trigger the scaling up	integer	`90`	only if `global.autoscaling` is true and `global.tier` is PROD
hpa.behaviorScaleUpStabilizationWindowSeconds	time to start implementing the scale up when it is triggered	integer	`10`	only if `global.autoscaling` is true and `global.tier` is PROD
hpa.behaviorScaleUpPoliciesValue	the maximum number of new replicas to create (in percents from current state)	integer	`50`	only if `global.autoscaling` is true and `global.tier` is PROD
hpa.behaviorScaleUpPoliciesPeriodSeconds	pause for every new scale up decision	integer	`15`	only if `global.autoscaling` is true and `global.tier` is PROD
hpa.behaviorScaleDownStabilizationWindowSeconds	time to start implementing the scale down when it is triggered	integer	`60`	only if `global.autoscaling` is true and `global.tier` is PROD
hpa.behaviorScaleDownPoliciesValue	the maximum number of replicas to destroy (in percents from current state)	integer	`25`	only if `global.autoscaling` is true and `global.tier` is PROD
hpa.behaviorScaleDownPoliciesPeriodSeconds	pause for every new scale down decision	integer	`60`	only if `global.autoscaling` is true and `global.tier` is PROD

Limits variables

Name	Description	Type	Default	Required
limits.maxTokens	maximum number of requests per fillInterval	integer	`70`	only if `global.autoscaling` is true and `global.tier` is PROD
limits.tokensPerFill	number of new tokens allowed every fillInterval	integer	`70`	only if `global.autoscaling` is true and `global.tier` is PROD
limits.fillInterval	time interval	string	`"1s"`	only if `global.autoscaling` is true and `global.tier` is PROD

Methodology for Parameter Calculation variables: hpa.targetValue, limits.maxTokens and limits.tokensPerFill

The parameters hpa.targetValue, limits.maxTokens and limits.tokensPerFill were determined through empirical testing during load testing. These tests were conducted using the N2D machine series, which can run on either AMD EPYC Milan or AMD EPYC Rome processors. The values were fine-tuned to ensure optimal performance under typical workloads.

Recommendations for New Instance Types

When changing the instance type to a newer generation, such as the C3D series, it is essential to conduct new load testing. This ensures the parameters are recalibrated to match the performance characteristics of the new processor architecture, optimizing resource utilization and maintaining application stability.

Install the helm chart

Run this command from within this directory:

helm install gc-policy-deploy .

Uninstalling the Chart

To uninstall the helm deployment:

helm uninstall gc-policy-deploy