This role is for one of Weekday's clients.
Min Experience: 3 years
Location: Bengaluru
Job Type: Full-time
We are seeking a Senior Platform Engineer to enhance and scale our infrastructure, which powers mission-critical logistics solutions. In this role, you will play a key part in improving reliability, security, and developer experience, driving the evolution of our platform.
This is a high-impact, hands-on role with ownership over infrastructure automation, observability, security, and multi-tenant compute environments.
Key Responsibilities
- Infrastructure as Code (IaC): Manage and enhance our AWS-based infrastructure (ECS, RDS, Redshift, S3/MinIO, Elasticsearch) using Terraform.
- Platform Reliability: Ensure platform stability, security, and scalability while managing ECS orchestration, network policies, and IAM configurations.
- Observability: Establish observability standards and enhance logging, tracing, and monitoring using New Relic and other tools.
- CI/CD & Developer Experience: Optimize CI/CD pipelines, improve test reliability, and introduce best practices for efficient deployments.
- Multi-Tenancy & Sandbox Environments: Design and implement sandbox environments for isolated tenant workloads and testing.
- Security & Access Control: Enforce least-privilege security principles, manage privilege escalation, and ensure compliance with best practices.
- Collaboration: Partner with Backend, Security, and QA teams to align infrastructure with business needs.
- Mentorship: Guide and mentor mid-level engineers on platform engineering best practices and infrastructure reliability.
Required Skills & Experience
- 5+ years of experience in infrastructure, platform, or backend engineering.
- Strong expertise in AWS services (ECS with Fargate, RDS - PostgreSQL, S3, IAM, CloudWatch).
- Proficiency in Infrastructure as Code (Terraform or similar tools).
- Experience with observability solutions (logs, metrics, traces, alerting) using tools like New Relic or the ELK stack.
- Strong knowledge of Docker and container orchestration.
- Proficiency in scripting languages such as Python, Bash, or Go.
- Experience working in secure, production-grade environments with restricted access and compliance boundaries.
- Proven ability to maintain uptime, optimize performance, and improve deployment reliability.
Nice to Have
- Experience with MinIO, Redshift performance tuning, and Elasticsearch scaling.
- Familiarity with CI/CD tools (GitHub Actions, Jenkins).
- Exposure to secure transport systems or logistics-heavy platforms.
- Knowledge of flaky test analysis, sandbox isolation strategies, and test reliability tooling.
“The difference between ordinary and extraordinary is that little extra.”
Jimmy Johnson