Post a job
Site Reliability Engineer

Company:
Salesforce
Location:
Dublin, Country: Ireland
Job Type:
Job Description

Job Category

Products and Technology

Job Details

Job Title: Site Reliability Engineer (Public Cloud SRE)

Location: Dublin, Ireland

About the job

Imagine being part of a vibrant team where your ideas have the potential to shape up the direction of a new organization. Picture yourself working on new transformational technologies. Envision yourself in a team solving thought-provoking technical problems and driving our customers’ success. Please come join us as we look to begin a new Site Reliability Engineering journey for Salesforce!

The Public Cloud Site Reliability Engineering (Cloud SRE) team is a brand new Organization within Production Engineering, with an exciting mission to bootstrap adoption of the industry’s leading-edge SRE principles and best practices at Salesforce. We are looking for experienced Software Engineers/System Engineers to join this new team. Working closely with counterparts in the Infrastructure and Engineering organizations, this Cloud SRE group owns the reliable delivery of service to Salesforce engineering teams and customers running on public cloud infrastructure. This organization provides round-the-clock, follow-the-sun situational awareness and leadership in the swift repair of any service-impacting issues, driving customer success.

As a member of the Cloud SRE team, you will be responsible for detecting and resolving system failures and complex outages, including creation of the observability tooling necessary for your success. This objective is met by monitoring the services, reacting to problems, proactively addressing issues before they affect performance or availability, and working with Engineering teams to define service level objectives and improving service design and implementation to increase reliability through closed-loop feedback.

Cloud SRE balances proactive automation with reactive operations, and targets 50%+ time spent on improving service design for reliability, extending monitoring and operational automation, driving self-healing and resiliency initiatives and game day exercises. The incumbent in this role would demonstrate a strong focus on tactical operations, as well as large-scale production engineering and orchestration.

Minimum qualifications:

  • BS or MS in Computer Science or a related technical field involving systems engineering
  • 2+ years infrastructure and applications systems engineering experience in enterprise-scale Internet services. Experience in analyzing and troubleshooting systems using logging, distributed tracing, stack traces, and debuggers
  • 2+ years experience configuring and managing any of the Public Clouds using CLI/SDKs and automation (AWS or GCP preferred)
  • 2+ years experience in at least one of the following languages: C, C++, Java, Python, Go. Ability to pick up new languages
  • Experience in Unix/Linux environments with good understanding of operating systems internals (e.g., filesystems, system calls)
  • Working knowledge of the TCP/IP stack, routing and load balancing technologies
  • Working knowledge of design principles of monitoring and alerting systems
  • Ability to operate in a high-pressure environment, troubleshoot complex issues quickly, and successfully handle multiple priorities
  • Systematic problem-solving approach, coupled with a strong sense of ownership and drive
  • Strong communication skills (written and oral)

Preferred qualifications:

  • A good understanding and practice in large-scale distributed systems
  • Experience in designing and deploying high performance production services with extensive monitoring and logging practices
  • CI/CD automation experience, including understanding of key open source technologies like Jenkins, Spinnaker, and Docker
  • Experience defining immutable infrastructure via Terraform/CloudFormation or other approaches across large footprints and distributed teams
  • Experience with on-call rotation, leading incident response and no-blame postmortem analysis
  • Ability to debug, optimize code, and automate routine tasks

Accommodations - If you require assistance due to a disability applying for open positions please contact the Salesforce.com Recruiting Department .

Posting Statement

Salesforce.com and Salesforce.org are Equal Employment Opportunity and Affirmative Action Employers. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, or disability status. Headhunters and recruitment agencies may not submit resumes/CVs through this Web site or directly to managers. Salesforce.com and Salesforce.org do not accept unsolicited headhunter and agency resumes. Salesforce.com and Salesforce.org will not pay fees to any third-party agency or company that does not have a signed agreement with Salesforce.com or Salesforce.org.

Job Settings
Number of jobs: 1 hires
Information about the advertiser
Company: Salesforce
Company size: 1-49
Contact: NA