Site Reliability Engineer — Kubernetes & Observability

SEED

SRE for multi-region Kubernetes—SLOs, game days, and incident response that make Mondays survivable.

Industry

Other

Employment Type

Internship

General

Description

Role at a glance

Run production like you mean to sleep sometimes.

Copperline Logistics (demo) tracks shipments across carriers. You will own reliability runbooks, capacity planning, and the automation that makes incidents rare and short—not heroic.

What you will do

Define SLOs and error budgets with product; negotiate scope cuts when budgets burn, before customers notice
Run quarterly game days: inject failures, measure detection vs. response time, and file concrete fixes for blind spots
Automate the boring pages: runbooks, diagnostics bundles, and safe one-click mitigations for known failure classes

What we need

3+ years in SRE or platform engineering, with on-call ownership in Kubernetes environments
You are calm in incidents: communicate clearly, delegate, and stop the bleeding before root cause theater

How we will interview you

An on-call experience deep-dive, a simulated sev-2 in our staging environment, and a discussion of toil and how you measured its reduction in prior roles.

Note: this posting is demo data for the portal. The compensation band exists only to test filters. Apply via the example.com addresses in this record.

Classification

Work Mode

On-site

Requirements

Experience Level

Executive

Required Skills

Kubernetes, Prometheus, Grafana, Istio, PagerDuty, SLOs, Bash/Go, Chaos eng

Compensation

Min. salary (per year)

107,000

Max. salary (per year)

156,000

Application

Application URL

example.com/apply/seed-job-10

Application Email

hiring-seed10@example.com

Application Deadline

2026-02-15

Location

Singapore

Demo seed

job-portal-seed-v1

May 1, 2026 — sample only, not a real person
