job.svoxx
Job
Site Reliability Engineer — Kubernetes & Observability

SEED

SRE for multi-region Kubernetes—SLOs, game days, and incident response that make Mondays survivable.

Industry

Other

Employment Type

Internship

General

Description

Role at a glance

Run production like you mean to sleep sometimes.

Copperline Logistics (demo) tracks shipments across carriers. You will own reliability runbooks, capacity planning, and the automation that makes incidents rare and short—not heroic.

What you will do

  • Define SLOs and error budgets with product; negotiate scope cuts when budgets burn, before customers notice
  • Run quarterly game days: inject failures, measure detection vs. response time, and file concrete fixes for blind spots
  • Automate the boring pages: runbooks, diagnostics bundles, and safe one-click mitigations for known failure classes

What we need

  • 3+ years in SRE or platform engineering, with ownership of on-call in Kubernetes environments
  • You are calm in incidents: communicate clearly, delegate, and stop the bleeding before root cause theater

How we will interview you

An on-call experience deep-dive, a simulated sev-2 in our staging environment, and a discussion of toil and how you measured its removal in prior roles.

Note: this posting is demo data for the portal. The compensation band exists only to test filters. Apply via the example.com addresses in this record.

Classification

Work Mode

On-site

Requirements

Experience Level

Executive

Required Skills

Kubernetes, Prometheus, Grafana, Istio, PagerDuty, SLOs, Bash/Go, Chaos eng

Compensation

Min. salary (per year)

107,000

Max. salary (per year)

156,000

Application

Application Email

hiring-seed10@example.com

Application Deadline

2026-02-15

Location

Singapore

Demo seed

job-portal-seed-v1

May 1, 2026 — sample only, not a real person