
Data Center Engineer II
Experience Level: Mid Level
Full Job Description
Sauce Labs is seeking an experienced and forward-thinking Data Center Engineer II to join our dynamic global team. This crucial role is responsible for ensuring the stability, scalability, and efficiency of our data center infrastructure and device farm environments. You will be hands-on in deploying, maintaining, troubleshooting, and managing the lifecycle of physical infrastructure, while also contributing to automation and performance improvements in high-demand settings.
Key Responsibilities:
- Infrastructure Deployment & Maintenance: Rack, stack, and cable servers, switches, storage devices, and real devices. Assist with hardware installations, upgrades, and replacements. Monitor power usage and network connectivity at the rack level. Support device farm setups involving real mobile devices.
- Troubleshooting & Support: Identify and resolve basic hardware, cabling, or network issues. Collaborate with remote teams for diagnostics and provide 'smart hands' support. Execute hardware diagnostics and maintain asset inventory systems. Utilize monitoring systems for performance tracking.
- Security & Compliance: Enforce strict physical access controls and security protocols within the data center. Escort third-party vendors and monitor access logs as required.
- Inventory & Logistics: Track all incoming and outgoing equipment using internal tools. Conduct regular audits of hardware and parts inventory. Support the Return Merchandise Authorization (RMA) process and coordinate with vendors for defective hardware.
- Documentation & Reporting: Maintain accurate records of work orders, hardware changes, and incidents. Contribute to the development of operational runbooks, standard operating procedures (SOPs), and knowledge base articles.
- Data Center Operations & Infrastructure Management: Support the deployment, configuration, and lifecycle management of servers, storage, and networking equipment within enterprise-grade data centers. Participate in scheduled maintenance and infrastructure upgrades with a focus on minimizing downtime.
- Device Farm Management: Assist in managing a distributed device farm that supports mobile testing across local and global regions. Deploy device infrastructure using both manual methods and CI/CD pipelines (Jenkins, GitLab, GitHub, etc.). Support mobile and lab teams in debugging device connectivity, performance, and compatibility issues across Android, iOS, Linux, and other platforms.
- Networking & Systems Administration: Work extensively with Linux, macOS, Android, and iOS systems utilized across server and device platforms. Employ tools such as iDRAC, iLO, and other system management interfaces for server hardware resource management.
- Monitoring, Security & Compliance: Assist in configuring and using off-the-shelf monitoring tools (e.g., Prometheus, Zabbix, Nagios) for full-stack observability. Adhere to documented procedures for deploying and maintaining hardware, following best practices for security, access control, data protection, and maintenance. Support incident response activities.
- Collaboration & Cross-Functional Support: Partner with Site Reliability Engineering (SRE), Security, Operations, and Development teams to ensure seamless cross-functional support for testing and deployment initiatives. Respond to tickets and support requests promptly, ensuring accurate follow-up and resolution.
Required Skills:
- 2-4 years of experience in technical support and data center operations, including proven system and infrastructure support.
- Familiarity with cable management standards, rack layouts and design, and system/device/cable labeling standards.
- Basic troubleshooting skills for hardware, networking, and device issues.
- Understanding of TCP/IP networking, DHCP, DNS, and various cabling types.
- Willingness to travel domestically up to 20% of the time to support in-person collaboration and provide hands-on support for remote data center locations.
Nice to Haves:
- A Bachelor's degree in a technical field such as Computer Science, Information Systems, or Electrical Engineering, or equivalent hands-on experience in data center, infrastructure, or hardware support roles.
- Experience with containerization technologies (Docker, K3s, etc.) and virtualization platforms (KVM, VMware, Hyper-V, etc.).
- Exposure to mobile operating systems (Android, iOS) and mobile debugging tools.
- Hands-on experience with Linux systems and command-line tools.
- Familiarity with monitoring tools (e.g., Prometheus, Zabbix, Nagios).
Soft Skills:
- Strong attention to detail and organizational habits.
- Effective communication skills, able to interact with both technical and non-technical teams.
- A proactive willingness to learn and grow into a more advanced role.
- Ability to work effectively both independently and as part of a larger team.
This is an in-person role requiring work from our Data Center. Please review our privacy terms when applying.
Sauce Labs is committed to fostering a diverse and inclusive workplace and is an Equal Opportunity employer. We do not discriminate based on race, religion, color, national origin, gender identity/expression/status, sexual orientation, age, marital status, veteran status, or disability status.
Security responsibilities are integral to this role at Sauce. You will be expected to support the health and safety of employees and properties by actively learning and implementing evolving security protocols and procedures. You must comply fully with all departmental and organizational security policies and procedures and take a security-first approach in all aspects of designing, building, and running our products and services.
Company
Sauce Labs
Sauce Labs is a leading provider of continuous quality solutions, empowering global enterprises such as Walmart, Bank of America, and Indeed to deliver high-quality web and mobile applications at spee...