JOB DETAILS

Sr. SRE (Site Reliability Engineering)

Why should you find your next job with Revelo?

Revelo's developers are professionals with advanced technical ability who are prepared for all the challenges of the tech industry. We are looking for talented people who are motivated to help build our clients' businesses and want to grow in their careers. If you want to work at the best tech companies in the United States and be paid in US dollars, with unique growth opportunities, then Revelo is the right place for you.

About Revelo:

Revelo is a platform that streamlines and simplifies the process of hiring tech talent by connecting candidates to the best companies in the US. Unlike traditional processes, you apply only once for the desired position. Once your profile is approved, you can start receiving interview invitations to get your new job as a Sr. SRE (Site Reliability Engineering) and be paid in US dollars.

About the role:

We are looking for a Sr. SRE (Site Reliability Engineering) who will have the following main responsibilities:

  • Design and implement orchestration and tooling solutions to ensure that repetitive administration tasks are performed at a high level of efficiency and free of defects
  • Design and implement monitoring and recovery tools to provide for site high availability (HA) and disaster recovery (DR)
  • Design and develop highly available infrastructure and platform components to meet the needs of our growing and evolving product lines
  • Design and implement security engineering best practices in all our deployed platforms and environments
  • Triage alerts, diagnose and resolve critical issues, and manage the implementation of changes
  • Manage the coordination, documentation and tracking of critical incidents, ensuring rapid and complete issue resolution and closing the loop appropriately with customers and other key stakeholders
  • Develop a continuous integration/continuous deployment orchestration system to reduce friction for software delivery to production
  • Evangelize SRE mindset and mentor others about reliability and best practices of SRE
  • Identify opportunities for automation, signal noise reduction, elimination of recurring issues and other actions, and work with engineering to implement them, to reduce the time to mitigate service-impacting events and increase the productivity of cloud operations and development resources
  • Maintain a strong understanding of IaaS, PaaS and SaaS offerings for building and maintaining a state-of-the-art, cloud-based environment for massive-scale data processing
  • Ensure that implementations and solutions are fully documented, and that solutions are deployed with fully operationalized processes to support the solution lifecycle


Requirements:

  • Advanced or Fluent English
  • 10+ years of experience in infrastructure, system engineering, and QA/test automation
  • Demonstrable subject matter expertise in testing methodology and test automation frameworks
  • Advanced level of Linux/Unix experience
  • Full-stack software test engineer with experience in at least 3-4 of the following technologies: React, Go, Git, Bitbucket, Python, NoSQL databases, Docker, Kubernetes, and monitoring tools like New Relic and Stackdriver
  • A systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive
  • Experience in designing, analyzing, scaling and troubleshooting large-scale distributed systems
  • Well versed in SRE methodologies and passionate about solving operational problems through automation and software engineering
  • Ability to communicate effectively vertically and horizontally within the organization, with demonstrated written and verbal communication skills
  • Strong understanding of cloud-native architecture and microservices design and deployment patterns


Desired skills:

  • Experience working with Google Cloud preferred; experience with other public cloud providers will also be considered
  • API and front-end test automation
  • Microservices lifecycle management (integration, testing, deployment)
  • Strong experience with at least 3 of the following logging and monitoring tools: ELK stack, Prometheus, Stackdriver, New Relic, Datadog, Dynatrace
  • Advanced knowledge of software release tooling, including but not limited to Bitbucket, Jenkins, Cloud Build, Spinnaker
  • Advanced knowledge of Docker technologies, including experience in optimizing Docker images and managing the Docker image lifecycle
  • Experience with algorithms, data structures, complexity analysis and software design


Do you have these skills? Apply now and reveal your potential.

    Salary

    Up to US$10,000 monthly

    Location

    US (Remote)

    Employment Type

    Full-time

    Enterprises and startups use Revelo's talent to scale
    their engineering teams

    We simplify the selection process so you can be found faster

    Totally free

    No cost from registration
    to hiring.

    Professional support

    Get expert tips and even interview training.

    Zero bureaucracy

    We handle everything for you:
    career support, HR management and legal issues.

    How to become a Revelo developer?

    Work with top software companies in just 4 easy steps

    Create your profile

    It's extremely quick and easy! Just tell us a little bit about yourself and your career path.

    Tell us what you like

    Take a look at all of our open roles and let us know which ones interest you the most!

    Get matched to the best opportunities

    Based on the interests you selected, our matchers will reach out when your profile matches an open role and we will immediately start your recruitment process!

    Find your dream job

    Work where you want, the way you want, and let us take care of the red tape for you.

    Revelo
    Ⓒ Revelo. All rights reserved. Privacy Policy and Terms of Use.