# On-Prem Infrastructure Engineer

**Company:** [C-Serv](http://jobs.workable.com/companies/1y4ScqFzivikXwRMBgxhit.md)
**Location:** Mumbai, India
**Workplace:** hybrid

[Apply for this job](http://jobs.workable.com/view/f8c7c2dd-c790-4d25-8f23-cc1ac986ec73)

## Description

We are looking for an experienced On-Prem Infrastructure Engineer to design, deploy, and operate enterprise-grade platform services across on-premises and hybrid cloud environments. This is a hands-on engineering role for someone who has done it at scale, not in theory.

The successful candidate will bring real production experience translating cloud-native capabilities (such as AWS EMR, S3, Lambda, and SQS) into robust on-prem solutions. You will be a trusted technical specialist embedded in a high-performing delivery team.

What you will own:

1.  Design and operate enterprise-grade on-prem and hybrid platform services
2.  Stand up and manage Big Data infrastructure (EMR on Prem / Hadoop) in production environments
3.  Implement and maintain distributed storage solutions (MinIO / S3-compatible)
4.  Build and manage distributed cache infrastructure (Redis, Memcached)
5.  Drive infrastructure-as-code practices using Terraform, ArgoCD, and GitHub Actions
6.  Own GitOps pipelines and full pipeline automation workflows
7.  Administer and optimise Kubernetes clusters at advanced usage levels
8.  Implement end-to-end observability using OpenTelemetry, Prometheus, Grafana, and Loki

## Requirements

What we need from you:

**Infrastructure & IaC**

1.  Terraform (infrastructure provisioning)
2.  ArgoCD (GitOps-driven deployments)
3.  GitHub Actions (CI/CD pipeline automation)
4.  GitOps methodology and full pipeline automation

**Kubernetes**

1.  Advanced cluster usage and administration
2.  Production-grade workload orchestration

**Big Data & Distributed Storage** (production experience required)

1.  **EMR on Prem / Hadoop — production setup and management**
2.  **MinIO / S3-compatible distributed storage — at production scale**
3.  **Redis and Memcached — distributed cache infrastructure**

**Observability**

1.  OpenTelemetry (OTel) instrumentation
2.  Prometheus + Grafana monitoring stacks
3.  Loki for log aggregation

**Preferred (Advantageous)**

1.  Experience with local artifact registries such as Nexus or similar
2.  Hypervisor and OS-level troubleshooting and performance analysis
3.  Backup and recovery strategy and implementation (data layer)
4.  Capacity analysis and scalability forecasting

**The Candidate We’re Looking For**

You have been in the engine room, not just advising on it. You understand the operational realities of running cloud-like infrastructure without the cloud: the edge cases, the failure modes, the performance tuning. You bring both the technical depth and the ownership mindset to match.

1.  Production-first mindset — you’ve operated at scale
2.  Strong ownership and accountability
3.  Clear communicator across technical and delivery teams
4.  Comfortable with ambiguity and fast-paced environments
5.  Detail-oriented with high engineering standards
6.  Collaborative — you build solutions with others, not around them

## Benefits

What we offer:

1.  Flexible work environment
2.  Direct access to leadership
3.  A clear path to quickly progress in your role
4.  Salary with performance bonus
