Production Systems Engineer, Foundation Labs
The RTP team is responsible for the end-to-end Hardware Lifecycle of all Meta servers, from exploration and development to production health. We work closely with various teams to ensure the smooth operation of systems across the planet, troubleshooting complex issues at various scales, from microscopic to fleet-wide, and driving projects to successful business outcomes.
Equal Employment Opportunity
Meta is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics. You may view our Equal Employment Opportunity notice here. Meta is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, fill out the Accommodations request form.
Production Systems Engineer, Foundation Labs Responsibilities
- Set up preproduction hardware for testing
- Perform tests on server hardware systems and modules
- Perform issue duplication and isolation
- Support system debug work
- Maintain test servers and modules with the latest operating system, firmware, and software
- Assist with validation and verification of new hardware platforms
- Create documentation and training materials, and ensure they remain up to date
- Clone and deploy OS images onto clients
- Analyze failure logs and provide a quick summary to test and firmware developers
- Track and manage test resources including rack, system, module, and bench
- Monitor test runs and recover unresponsive clients
- Develop and automate tests, and perform failure analysis
- Gather, process, and summarize test data and results by scripting
Qualifications
- BS in Electrical Engineering, Computer Engineering, or a related engineering field, or equivalent experience
- 2+ years of industry experience in data center equipment testing or similar experience
- Experience working in Linux system environments and with server hardware testing
- Experience with setting up test beds for signal integrity (SI) testing (high-bandwidth scopes and BERT (bit error rate test) scopes)
- Experience with system function, stability, and power consumption testing
- Experience with stress test tools for CPU, memory, storage, and I/O subsystems
- Experience with server BMC (Baseboard Management Controller)
- Successful candidates must remain in role in the same team in India for a minimum period of 24 months before being eligible for transfer to another role, team or location
- Experience and understanding of using PCIe test fixtures
- Experience with test automation scripting in Python