Performance Optimization in Kubernetes

June 16, 2025 · 3 min read

Aytan Jalilova

Developer Advocate @ IOMETE

Kubernetes gives you incredible flexibility — but with flexibility comes complexity. Without the right configurations, even powerful workloads like Spark and Flink can underperform or waste resources.

This post covers practical strategies for optimizing performance in Kubernetes-native data pipelines, using patterns supported by platforms like IOMETE.

1. Define Resource Requests and Limits

Kubernetes scheduling depends on well-defined resource boundaries.

Best practices:

Set requests for CPU/memory to guarantee baseline availability
Set limits to prevent a Pod from over-consuming
Avoid running without requests — it leads to eviction or starvation

IOMETE dynamically sets executor resources and includes autoscaling policies out of the box.

resources:
  requests:
    cpu: "1"
    memory: "2Gi"
  limits:
    cpu: "2"
    memory: "4Gi"

2. Use Node Affinity and Taints

For heavy workloads (e.g., large joins, shuffle-heavy Spark jobs), bind compute Pods to high-performance nodes.

affinity:
  nodeAffinity:
    requiredDuringSchedulingIgnoredDuringExecution:
      nodeSelectorTerms:
        - matchExpressions:
            - key: node-type
              operator: In
              values:
                - high-memory

IOMETE allows custom node pools for Spark and ML jobs — including GPU-backed nodes or NVMe-optimized instances.

3. Leverage Horizontal and Vertical Autoscaling

HPA (Horizontal Pod Autoscaler) adds/removes Pods based on CPU or Prometheus metrics.
VPA (Vertical Pod Autoscaler) adjusts Pod resource requests automatically over time.

IOMETE supports both — scaling Spark clusters based on workload demand and terminating idle resources to cut costs.

4. Tune Spark for Execution Efficiency

Fine-tune Spark’s runtime behavior for better parallelism and memory usage.

Key settings:

spark.executor.memoryOverhead: Buffer for off-heap memory
spark.sql.shuffle.partitions: Controls parallelism during wide transformations
spark.dynamicAllocation.enabled: Automatically add/remove executors

IOMETE offers preconfigured Spark profiles optimized for batch, streaming, and ad-hoc SQL workloads.

5. Batch vs. Real-Time: Optimize by Pipeline Type

Batch Workloads

Typical tools: Spark (batch), dbt, Airflow

Recommendations:

Use Kubernetes CronJobs or Airflow DAGs with KubernetesExecutor
Run jobs on spot/preemptible nodes to reduce cost
Use Iceberg or Delta Lake for table storage

Real-Time Workloads

Typical tools: Flink, Kafka consumers, Spark Structured Streaming

Recommendations:

Use StatefulSets for checkpointing and reliability
Persist state to HDFS, S3, or RocksDB
Monitor latency and backpressure using Prometheus and Grafana

IOMETE supports both modes — using Iceberg as a bridge between batch and streaming layers.

Case Study: ETL Modernization with IOMETE + Kubernetes

Challenge: A global enterprise running on Hadoop needed faster, cheaper ETL with better governance.

Solution:

Deployed IOMETE on Kubernetes (via Rancher)
Used MinIO as an object storage backend
Rebuilt pipelines using Spark + dbt on Iceberg tables
Airflow + ArgoCD handled orchestration via GitOps

Results:

Job times improved 10x
Infrastructure costs dropped 40%
CI/CD reduced deployment from days to minutes

Summary

Performance optimization in Kubernetes-native environments is a continuous process — but it pays dividends in speed, cost, and stability.

With platforms like IOMETE, many of these best practices are automated:

Autoscaling Spark clusters
Node affinity for compute jobs
GitOps-managed configurations
Real-time monitoring via Prometheus

1. Define Resource Requests and Limits​

2. Use Node Affinity and Taints​

3. Leverage Horizontal and Vertical Autoscaling​

4. Tune Spark for Execution Efficiency​

5. Batch vs. Real-Time: Optimize by Pipeline Type​

Batch Workloads​

Real-Time Workloads​

Case Study: ETL Modernization with IOMETE + Kubernetes​

Summary​

ON THIS PAGE