Kubernetes Engineer
Not Disclosed
Austin, TX
12 Months
Not Disclosed
Not Disclosed
Manufacturing
$79.24/hour - $84.24/hour
Job Posted on (Apr 28, 2026)
Reference Number:
GDTXKE28
Job Description
Job Description:
Kubernetes Engineer
We are seeking a Kubernetes Engineer to help manage, support, and improve 6+ on-premises Kubernetes clusters across multiple global sites. This role is responsible for maintaining platform reliability, enhancing observability, driving infrastructure and operational improvements, and supporting cluster lifecycle management across a distributed on-prem environment. The engineer will work within a platform ecosystem that includes Charmed Kubernetes, Juju, MAAS, ArgoCD, Harbor, Prometheus, Grafana, and Ceph to ensure the environment remains stable, scalable, and efficient.
This role requires strong hands-on experience in production Kubernetes operations, with a focus on on-prem infrastructure, multi-site reliability, and continuous platform improvement. The engineer will partner with infrastructure, networking, storage, and application teams to troubleshoot issues, improve platform standards, and support the growth and maturity of Kubernetes services across all supported locations.
Key Responsibilities
Manage and support 6+ on-prem Kubernetes clusters across multiple sites
Maintain platform reliability, availability, and operational consistency across all environments
Monitor and troubleshoot issues involving cluster health, workloads, nodes, networking, ingress, storage, and supporting platform services
Support cluster lifecycle activities including provisioning, upgrades, patching, scaling, and general maintenance
Improve observability, monitoring, and alerting using tools such as Prometheus and Grafana
Support GitOps and deployment workflows using ArgoCD
Help manage and support container image workflows and registry integrations through Harbor
Work with MAAS, Juju, and Charmed Kubernetes to support cluster and infrastructure lifecycle management in an on-prem environment
Support and improve persistent storage capabilities leveraging Ceph and related storage integrations
Drive platform enhancements that improve resiliency, scalability, security, automation, and supportability across all sites
Standardize operational processes, cluster configurations, and support models where possible
Participate in incident response, root cause analysis, and long-term reliability improvements
Create and maintain documentation for architecture, operational procedures, and best practices
Required / Preferred Experience
Strong experience administering Kubernetes in production environments
Experience supporting multiple Kubernetes clusters across distributed or multi-site environments
Strong Linux systems administration and infrastructure troubleshooting skills
Experience with Charmed Kubernetes, Juju, and MAAS in an on-prem environment
Experience with ArgoCD or similar GitOps deployment tools
Experience with Prometheus and Grafana for monitoring, alerting, and platform observability
Experience with Harbor or similar container registries
Experience with Ceph or similar distributed storage platforms supporting Kubernetes workloads
Solid understanding of Kubernetes networking, ingress, storage, and cluster operations
Experience improving reliability, automation, and operational maturity in production platforms
Strong collaboration skills across infrastructure, platform, and application teams
VIVA is an equal opportunity employer. All qualified applicants have an equal opportunity for placement, and all employees have an equal opportunity to develop on the job. This means that VIVA will not discriminate against any employee or qualified applicant on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status