Imagen institucional
Imagen institucional

Site Reliability Engineer

São Paulo, Brasil

Tecnología, Sistemas y Telecomunicaciones/Infraestructura

No especificado
Híbrido

Hace 8 días

Postularse

Hace 8 días

São Paulo, Brasil

Tecnología, Sistemas y Telecomunicaciones/Infraestructura

No especificado
Híbrido

Hace 8 días

Postularse
Descripción del puesto

As a Site Reliability Engineer, you will play a key role in supporting existing customers with their managed or private cloud deployments, as well as in launching new deployments on major cloud platforms such as Azure, AWS, and GCP. Your mission will include
ensuring the smooth operation, scalability, and security of cloud services, as well as automating processes to increase both efficiency and reliability.

Responsibilities:

1. Deployment Setup and Management:

Lead the design and implementation of new cloud deployments, tailoring solutions to meet stakeholder requirements on platforms like Azure, AWS, GCP, and Kubernetes.

Optimize cloud architectures for scalability and cost-effectiveness, adhering to best practices for networking, security, and access controls.

Gain and maintain deep knowledge of cloud infrastructure providers to create robust solutions.

2. Automation and CI/CD::

Craft and manage automation scripts and infrastructure as code (IaC) with Terraform, Ansible, or CloudFormation.

Deploy CI/CD pipelines to streamline software delivery, testing, and deployment processes, ensuring efficient version control and configuration management.

3. Managed Cloud Support:

Ensure the availability of the services by configuring system monitors and alerts and attending to critical alerts in a timely manner.

Offer continuous support and maintenance for existing deployments, monitoring system performance and swiftly resolving issues to maintain high availability and reliability.

Implement strategies for performance optimization and failure prevention, conducting thorough root cause analyses to avoid future issues.

4. Monitoring and Security:

Establish comprehensive monitoring and alerting systems to oversee customer deployments, setting thresholds for incident response.

Conduct regular security assessments and stay abreast of the latest threats and trends to fortify cloud environments against risks.

5. Collaboration and Knowledge Sharing:

Foster a collaborative environment with product developers, operations, and QA teams to enhance workflows and product quality.

Share knowledge and best practices, contributing to the team’s collective expertise through documentation, training, and mentorship.

  • Location São Paulo
  • Working hours: from 12PM to 9PM BRT
  • On duty: availability needed for On duty work over the weekend (frequency is about 1 weekend every 6 weeks).

Requisitos

Fluent in English.

Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent experience.

Expertise in cloud platforms such as Azure, AWS and GCP.

Expertise in Linux, virtualization and containerization technologies such as Docker and Kubernetes.

A solid understanding of networking, security principles, and compliance frameworks.

Proficiency in IaC tools (Terraform, CloudFormation), configuration management (Puppet, Chef, Helm), and scripting languages (Python, Bash, PowerShell).

Experience with CI/CD tools (Github Actions, Jenkins) and monitoring/logging tools (Prometheus, ELK stack, Splunk).

Exceptional problem-solving, analytical, and troubleshooting skills, coupled with a proactive, customer-centric mindset.

Strong communication skills and the ability to collaborate effectively in a team environment.

Beneficios

Benefits: Meal tkt / health insurance (Porto Seguro Diamante R2+)/ Life & Dental insurance/ stock options/ annual bonus

Detalles

Nivel mínimo de educación: Universitario (Indistinto)

Nosotros

Founded in 2005, our client is the largest independent software provider offering open source API management, integration, and identity and access management (IAM) to thousands of companies in more than 90 countries. The company's products and platforms enable organizations to unlock the full potential of artificial intelligence and APIs to securely deliver the next generation of AI-powered digital services and applications.
Our open source, AI-powered, API-centric approach frees developers and architects from single-vendor lock-in and enables rapid digital product creation.
Recognized as a leader by industry analysts, the company has more than 800 employees worldwide and offices in Australia, Brazil, Germany, India, Sri Lanka, the United Arab Emirates, the United Kingdom, and the United States, with more than $100 million in annual recurring revenue.

Powered by Logo