
Senior Site Reliability Engineer
Experience Level: Senior Level
Full Job Description
Procore is seeking a highly skilled Senior Site Reliability Engineer with strong backend software engineering expertise to join our team in Pune, Maharashtra, India. This role is responsible for designing, implementing, and maintaining our cloud infrastructure to ensure the seamless operation of our applications and services. You will also contribute to the development of our backend software systems, collaborating with the engineering team to create scalable and reliable software solutions. This position reports to the Senior Engineering Manager, Finance Engineering.
What You'll Do:
- Collaborate with peers to design and develop reusable, toil-reducing, and resilient solutions.
- Identify and address systemic issues across the organization.
- Make technical decisions aligned with Procore's architectural vision through a collaborative approach.
- Partner with internal customers, peers, and leadership on planning, prioritization, and roadmap development.
- Mentor and develop teammates through code reviews, pairing, and training.
- Act as a subject matter expert on tools, processes, and procedures to guide the creation and maintenance of a healthy codebase.
- Foster an open-source mindset and culture, both internally and externally, through community participation and contributions.
- Design, develop, and deploy scalable and reliable backend software systems using languages such as Java, Python, or Go.
- Work with engineering teams to design and implement microservices architecture.
- Develop and maintain APIs using REST, GraphQL, or gRPC.
- Ensure high-quality code through rigorous code reviews, testing, and continuous integration.
- Serve as a subject matter expert in a specific domain, guiding others in creating and maintaining a healthy codebase.
What We're Looking For:
- Proficiency in container orchestration, including Kubernetes (preferably EKS).
- Experience with ArgoCD.
- Experience with Terraform or similar Infrastructure as Code (IaC) tools.
- Understanding of observability (o11y), with OpenTelemetry being ideal.
- Experience with public cloud platforms such as AWS, GCP, or Azure.
- Familiarity with cloud automation tooling (e.g., CloudFormation, Terraform, Ansible).
- Experience with Kafka and Kafka connectors.
- Strong knowledge of Linux systems.
- Understanding of compliance with security and regulatory requirements, such as HIPAA, SOX, and FedRAMP.
Preferred Experience:
- Continuous Integration Tooling (e.g., CircleCI, Jenkins, Travis CI).
- Continuous Deployment Tooling (e.g., ArgoCD, Spinnaker).
- Service Mesh / Discovery Tooling (e.g., Consul, Envoy, Istio, Linkerd).
- Networking concepts, including WAF and Cloudflare.
- Experience with event-driven architecture (Event Sourcing, CQRS).
- Experience with Flink or other stream processing technologies.
- Experience with RDBMS and NoSQL databases.
- Proven experience in working with and developing APIs through REST, gRPC, or GraphQL.
- Professional experience in Java, Go, or Python.
Company
Procore
Procore Technologies is a leading provider of cloud-based construction management software, dedicated to building the software that builds the world. Our solutions empower clients to efficiently manag...