Back to all jobs

Senior Site Reliability Engineer at Coupa

Senior Posted about 2 hours ago RemoteFirstJobs Product
Engineer

AI summary: Site Reliability Engineer owns end-to-end availability and performance of critical cloud services, automates infrastructure solutions, and manages Linux/Windows systems across web and application servers.

Description

Coupa makes margins multiply through its community-generated AI and industry-leading total spend management platform for businesses large and small. Coupa AI is informed by trillions of dollars of direct and indirect spend data across a global network of 10M+ buyers and suppliers. We empower you with the ability to predict, prescribe, and automate smarter, more profitable business decisions to improve operating margins.

Why join Coupa?

🔹 Pioneering Technology: At Coupa, we’re at the forefront of innovation, leveraging the latest technology to empower our customers with greater efficiency and visibility in their spend.

🔹 Collaborative Culture: We value collaboration and teamwork, and our culture is driven by transparency, openness, and a shared commitment to excellence.

🔹 Global Impact: Join a company where your work has a global, measurable impact on our clients, the business, and each other.

Learn more on Life at Coupa blog and hear from our employees about their experiences working at Coupa.

The Impact of a Sr. Site Reliability Engineer at Coupa:

Coupa’s Site Reliability Engineers are part of the Cloud Operations team, owning end-to-end availability and performance of mission critical service and building automation to prevent problem recurrence.  SREs provide administration of Linux machines, web servers, application servers and infrastructure support for customer environments.

What You’ll Do:

  • Own end-to-end availability and performance of critical services, including building automation to prevent recurring issues
  • Administer Linux and Windows systems across web, application, and database servers
  • Develop and automate solutions using various programming languages
  • Provide application and infrastructure support, including participating in on-call rotations for emergencies
  • Enhance monitoring, alerting, and observability to ensure reliability and performance
  • Collaborate with cross-functional teams on releases, infrastructure, troubleshooting, and maintain documentation such as RCAs

What You Will Bring to Coupa:

  • Bachelor’s degree in Computer Science, Information Systems, or related field, with 5+ years of experience in system administration and large-scale web operations
  • Strong programming skills (PowerShell, Python, Bash, or OOP languages) and experience with automation and configuration management tools (Chef, Puppet, Ansible, etc.)
  • Hands-on experience managing cloud infrastructure (AWS, GCP) and container platforms (EKS, GKE), plus Infrastructure as Code tools like Terraform
  • Proficiency in CI/CD pipelines, source control (Git with complex branching), and deployment/automation tools (Jenkins, Octopus, Rundeck)
  • Solid understanding of networking and operations concepts (DNS, load balancing), monitoring tools (Datadog, Splunk, New Relic), and database administration (MS SQL Server)
  • Strong Agile/Scrum experience (JIRA), ITIL practices (incident/change management, RCA), and excellent communication, problem-solving, and ownership skills

#LI-TC1

#LI-Remote

Coupa complies with relevant laws and regulations regarding equal opportunity and offers a welcoming and inclusive work environment. Decisions related to hiring, compensation, training, or evaluating performance are made fairly, and we provide equal employment opportunities to all qualified candidates and employees.

Please be advised that inquiries or resumes from recruiters will not be accepted.

By submitting your application, you acknowledge that you have read Coupa’s Privacy Policy and understand that Coupa receives/collects your application, including your personal data, for the purposes of managing Coupa’s ongoing recruitment and placement activities, including for employment purposes in the event of a successful application and for notification of future job opportunities if you did not succeed the first time. You will find more details about how your application is processed, the purposes of processing, and how long we retain your application in our Privacy Policy.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.