Role Overview
We are seeking a Lead DevOps Engineer to serve as the foundational in-house lead for our infrastructure and delivery systems. After three years of outsourced management, we are bringing this function internal to drive a proactive, integrated engineering culture.
You will be the bridge between our current "ticket-based" outsource legacy and a future-proof, automated platform. This role requires a blend of hands-on technical mastery (IaC, CI/CD) and strategic decision-making regarding cloud providers, cost-efficiency, and regulatory compliance.
Key Responsibilities
Strategic Infrastructure & Cloud Governance
- Infrastructure Roadmap: Define and execute a long-term strategy for infrastructure evolution, moving away from reactive fixes to proactive improvements.
- Cloud Strategy & Cost Analysis: Conduct a comprehensive comparison between AWS and other cloud providers. You will advise leadership on the best path forward regarding cost-efficiency, feature sets, and long-term scalability.
- Resource Optimization: Own the AWS/Cloud budget. Implement proactive monitoring and "Shift-Left" cost estimation so that new features are built with resource efficiency in mind.
- DORA Compliance & Exit Strategy: In light of DORA legislation, you will lead the "Multi-Cloud Readiness" initiative, ensuring we have a viable migration or redundancy plan to avoid total vendor lock-in.
Engineering Integration & Support
- Embed with Engineering: Move away from "external vendor" silos. You will participate in development sprints and feature design from day one.
- Shift-Left Operations: Participate in the feature development process from the design phase to ensure infrastructure requirements are baked in from day one.
- On-Call & Vendor Coordination: While you will be the internal point of contact, you will manage our relationship with the outsource provider, who will continue to provide 24/7 coverage and "firefighting" support for urgent weekend issues.
- Knowledge Sharing: Create the company’s first Infrastructure Knowledge Base to eliminate tribal knowledge and document the architecture of our release processes.
- Mentorship: Act as a bridge between teams, teaching QE and Developers how the pipelines work and how to leverage infrastructure tools effectively.
CI/CD & Performance Engineering
- Pipeline Modernization: Finalize the transition from GitHub Actions to Jenkins while fundamentally resolving the root causes of daily pipeline instability.
- Proactive Monitoring: Implement advanced alerting and observability to catch resource constraints and pipeline warnings before they stall development.
- Developer Experience (DX): Build and maintain a seamless feature-branch deployment workflow, empowering QE and Dev teams to manage their own releases and E2E test environments.
- Stability: Identify and resolve root causes of daily pipeline failures rather than applying "band-aid" fixes.