<- All Jobs
Lead Platform Development Engineer - USDS
We鈥檙e looking for a hands-on technical Lead to design, build, and operate internal platforms and systems that power our core technology. You will focus on creating highly available, reliable, scalable, and efficient infrastructure, container based architecture and tools used across the organization. This role is ideal for someone with a strong background in systems engineering, distributed infrastructure, containerized environment and backend development. You should enjoy solving complex technical problems, writing high-quality code, and improving internal platforms that support various use cases. A key part of this role is to help build out a greenfield, AI-native development initiative focused on solving complex internal infrastructure and productivity challenges at scale. You will work across cloud environments like OCI (Primary), AWS, Azure and GCP, using modern technologies such as containers, infrastructure-as-code, and automation to build and evolve our internal platform ecosystem. This will be a highly cross functional role to foster a culture of innovation, collaboration, and continuous improvement.
In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager/department. We regularly review our hybrid work model, and the specific requirements may change at any time.
Responsibilities
- Lead and perform hands-on technical work, including architecture design, code development and reviews, and internal tooling development to support scalable infrastructure platforms.
- Oversee end-to-end project planning, requirements gathering, execution, and delivery鈥攅nsuring alignment with business goals while designing and developing internal platforms and automation tools to support infrastructure and IT operations.
- Develop and maintain infrastructure-as-code (IaC) using tools like Terraform, Ansible, or cloud native solutions to automate deployment and management processes across environments.
- Architect, implement, and manage Virtual Desktop Infrastructure (VDI) solutions across Oracle Cloud Infrastructure (OCI), ensuring high availability, scalability, performance, and security.
- Collaborate with cloud service providers and internal stakeholders to optimize resource utilization, reduce costs, and improve platform efficiency.
- Implement security best practices and ensure compliance with regulatory frameworks (e.g., NIST, FedRAMP); conduct regular security assessments and audits to identify and mitigate vulnerabilities.
- Establish monitoring and alerting systems, analyze system metrics, and proactively resolve performance issues to enhance platform reliability, user experience, and operational support for internal teams.
In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager/department. We regularly review our hybrid work model, and the specific requirements may change at any time.
Responsibilities
- Lead and perform hands-on technical work, including architecture design, code development and reviews, and internal tooling development to support scalable infrastructure platforms.
- Oversee end-to-end project planning, requirements gathering, execution, and delivery鈥攅nsuring alignment with business goals while designing and developing internal platforms and automation tools to support infrastructure and IT operations.
- Develop and maintain infrastructure-as-code (IaC) using tools like Terraform, Ansible, or cloud native solutions to automate deployment and management processes across environments.
- Architect, implement, and manage Virtual Desktop Infrastructure (VDI) solutions across Oracle Cloud Infrastructure (OCI), ensuring high availability, scalability, performance, and security.
- Collaborate with cloud service providers and internal stakeholders to optimize resource utilization, reduce costs, and improve platform efficiency.
- Implement security best practices and ensure compliance with regulatory frameworks (e.g., NIST, FedRAMP); conduct regular security assessments and audits to identify and mitigate vulnerabilities.
- Establish monitoring and alerting systems, analyze system metrics, and proactively resolve performance issues to enhance platform reliability, user experience, and operational support for internal teams.