Site Reliability Engineer
TensorWave
Location
Remote
Employment Type
Full time
Location Type
Remote
Department
Infrastructure
At TensorWave, we’re leading the charge in AI compute, building a versatile cloud platform that’s driving the next generation of AI innovation. We’re focused on creating a foundation that empowers cutting-edge advancements in intelligent computing, pushing the boundaries of what’s possible in the AI landscape.
About the Role:
We're looking for a Senior SRE Engineer with a strong software engineering background to build and maintain highly scalable, secure, and resilient infrastructure. You’ll play a critical role in designing low-level systems, automating infrastructure with modern tooling, and ensuring platform reliability. This role is ideal for someone who’s comfortable working at the intersection of systems programming and DevOps—writing code in Go, Javascript, Rust, C, or Zig while also managing infrastructure with NixOS, Kubernetes, and Terraform.
Responsibilities:
Design, build, and maintain infrastructure systems using Linux and NixOS.
Manage infrastructure-as-code with Terraform to provision and scale resources.
Architect and operate Kubernetes clusters with a focus on performance, security, and automation.
Write high-performance tooling and internal utilities in Go, Javascript, Rust.
Develop and maintain CI/CD pipelines for infrastructure and code deployments.
Monitor system performance, resolve issues, and improve reliability through observability tooling.
Collaborate closely with engineering teams to support deployment strategies and development workflows.
Essential Skills & Qualifications:
5+ years in DevOps, Site Reliability, or Infrastructure Engineering roles.
Deep experience with Linux systems and configuration management (preferably NixOS).
Hands-on experience with Terraform, Kubernetes, and containerized environments.
Proficiency in one or more low-level languages: Rust, C, Zig, Javascript, and Go.
Strong understanding of systems programming, performance tuning, and operating system internals.
Familiarity with CI/CD practices and infrastructure monitoring/alerting tools.
We’re looking for resilient, adaptable people to join our team—folks who enjoy collaborating and tackling tough challenges. We’re all about offering real opportunities for growth, letting you dive into complex problems and make a meaningful impact through creative solutions. If you're a driven contributor, we encourage you to explore opportunities to make an impact at TensorWave. Join us as we redefine the possibilities of intelligent computing.
What We Bring:
Stock Options
100% paid Medical, Dental, and Vision insurance
Life and Voluntary Supplemental Insurance
Short Term Disability Insurance
Flexible Spending Account
401(k)
Flexible PTO
Paid Holidays
Parental Leave
Mental Health Benefits through Spring Health