São Paulo, Brazil · on-site in São Paulo and remote for Brazil and Latin America

Cloud infrastructure, Kubernetes and DevOps/SRE without operational chaos

DevOps, SRE and Platform Engineering consulting for CTOs, founders and tech leads. On-site in São Paulo and remote for Brazil and Latin America. I help companies stabilize CI/CD and GitOps, run Kubernetes/EKS with Terraform and IaC, improve observability with Prometheus, Grafana and Datadog, increase system reliability and reduce cloud costs with FinOps.

When I can help

Real cloud infrastructure, automation, reliability and operations challenges that impact engineering teams in production environments across Brazil and Latin America.

Unstable, manual deployments or no clear rollback path

Limited visibility into errors, logs, metrics and alerts

Cloud infrastructure growing without standardization or governance

Kubernetes/EKS that is hard to operate, scale or sustain

Disorganized Terraform with no modular structure or best practices

Recurring incidents and low operational predictability

Slow, fragile pipelines with no security or cost validation

Growing cloud costs without visibility or control

Development teams overloaded with operational work

Infrastructure, cluster, region or critical service migrations

Services

One-off projects or recurring support for companies that need to improve infrastructure, automation, observability, reliability and cloud costs.

DevOps/SRE Assessment

Technical review of the environment, identification of risks, bottlenecks, operational issues and quick wins, with a prioritized action plan.

Cloud and Kubernetes Health Check

Assessment of AWS, GCP, Azure, Kubernetes, EKS, GKE, AKS, workloads, costs, security, scalability, deployments and operations.

Terraform & Infrastructure as Code

Module organization, structure review, standardization, automation, validation, governance and IaC best practices.

CI/CD and GitOps

Design or improvement of pipelines, deployment automation, rollback paths, quality gates, approvals, Terraform validation, security and cost controls.

Observability and SRE

Dashboards, metrics, logs, alerts, basic SLOs, runbooks, log retention, incident response improvements and reliability practices.

FinOps and Cost Optimization

Waste analysis, use of Spot, Karpenter, Kubecost, ephemeral workloads, right-sizing and governance to reduce costs without compromising critical workloads.

Migrations and Platform Modernization

Support for production migrations involving clusters, applications, databases, messaging, DNS, load balancers, WAF, cloud regions and post-migration validation.

Technical support for teams

Mentoring, technical support and direction for teams that need to mature DevOps, SRE, Platform Engineering, GitOps and cloud operations.

How it works

A practical DevOps and SRE consulting approach to understand the current environment, prioritize risks and deliver sustainable improvements in cloud operations.

01

Initial conversation

Understand the context, current pain points, technical goals, constraints and the maturity level of the team.

02

Technical assessment

Assess infrastructure, cloud, Kubernetes, pipelines, observability, security, costs and operational processes.

03

Action plan

Prioritize risks, improvements, quick wins and deliverables based on impact, urgency and complexity.

04

Execution or ongoing support

Implement improvements or support the team through execution with safety, documentation and governance.

Engagement examples

Hands-on experience with Kubernetes, AWS, Terraform, CI/CD, observability, migrations and FinOps in production environments.

Production Kubernetes platforms

Architecture, evolution and ongoing operation of EKS, GKE and AKS clusters supporting critical workloads, autoscaling, security and operational governance.

Critical infrastructure migrations

Production migrations involving clusters, cloud regions, databases, messaging, DNS, load balancers and post-migration validation without business disruption.

CI/CD, GitOps and Infrastructure as Code

Reliable pipelines with Azure DevOps, ArgoCD, Jenkins and GitLab CI, combined with organized Terraform, security and cost validation, and predictable rollouts.

Observability, SRE and FinOps

Useful metrics, logs and alerts with Prometheus, Grafana, Loki and Datadog, along with SRE practices, incident management and cost reduction using Karpenter, Spot and Kubecost.

Experience built across technology, retail, SaaS and international environments — including work with Track.co, Cherokee Nation Businesses and Centauro.

About

About Roberta

I am a Tech Lead in Platform Engineering and founder of RSD Lab, working across DevOps, SRE, Cloud, Kubernetes and FinOps consulting.

I have experience designing, evolving and sustaining critical platforms in AWS, GCP and Azure environments, with focus on Kubernetes, Terraform, CI/CD, observability, automation, security, complex migrations, operational governance and platform modernization.

My work connects technical strategy, hands-on execution and production reliability, helping companies reduce operational risk, improve deployments, organize cloud infrastructure, evolve DevOps/SRE practices and optimize costs.

My focus is to deliver practical, sustainable solutions aligned with each company's current stage, without unnecessary complexity.

Education

  • Postgraduate degree in Software Architecture — FIAP
  • MBA in Cloud Computing Projects and Architectures — FIAP

Certifications and courses

  • DevOps & SRE Speedy — XP Educação
  • Microsoft Azure — Microsoft
  • Cloud Computing & Data Science — FIAP
  • DevOps & Agile Culture — FIAP

Languages

  • Portuguese: native
  • Technical English for reading, documentation and asynchronous communication

Technical stack

Prometheus, Grafana, Datadog, Terraform, Kubernetes EKS, CI/CD, GitOps and infrastructure automation across projects.

Cloud

AWSGCPAzureMagalu Cloud

Kubernetes & Containers

KubernetesEKSGKEAKSDockerDocker SwarmHelmKustomizeKEDAKarpenter

IaC & Automation

TerraformCrossplaneAnsiblePuppetVagrantBashPowerShell

CI/CD & GitOps

Azure DevOpsArgoCDJenkinsGitLab CIGitHub ActionsNexus

Observability & SRE

PrometheusGrafanaLokiDatadogCloudWatchAlertmanagerPagerDutyElastic APM

Databases

PostgreSQLMySQLMongoDBRedisInfluxDBSQL ServerOracle

Networking & Security

KongAWS WAFALB/NLBAkamaiIstioCalicoVaultKeycloakFalcoAWS Secrets Manager

Languages

PythonGoBashGroovyNode.jsPowerShell

Practices

SREDevOpsGitOpsFinOpsPlatform EngineeringIncident ManagementKanbanScrum

Project formats

Flexible Platform Engineering and DevOps consulting formats for companies in Brazil and Latin America, with on-site support in São Paulo or remote engagement.

Focused infrastructure, cloud and operations assessment
Kubernetes, EKS, GKE or AKS health check
Scoped improvement project in cloud, Terraform, CI/CD or observability
Pipeline and GitOps setup or review
Monitoring, logs, metrics and alerts setup or review
Support for infrastructure, cluster, region or critical service migrations
Cost reduction and FinOps initiatives
Ongoing support for a few hours each week
Technical mentoring for development or infrastructure teams
Environment preparation for growth, audits, launches or stabilization

Community and initiatives

Talks, community initiatives and knowledge-sharing efforts.

Talks and community

Participation in events and initiatives such as DevOpsBootcampLive, AWS Members Day and BrazilClouds Talks, sharing hands-on experience in infrastructure, Kubernetes and service mesh.

Volunteer work

Support for public-school students from Capão Redondo through the admissions process to international universities.

Open source and technical projects

Alongside consulting work, RSD Lab also organizes technical projects and open source initiatives focused on automation, infrastructure, Kubernetes, DevOps/SRE and knowledge sharing.

Link soon

Library Jenkins for Windows

A support library for Jenkins pipelines in Windows/PowerShell environments, making automation, standardization and CI/CD step reuse easier.

JenkinsPowerShellCI/CDAutomation
View project
Link soon

N8N Workflow Automation

An installation and setup guide for n8n with Docker, PostgreSQL and Slack integration, focused on workflow automation and integrations.

n8nDockerPostgreSQLSlackAutomation
View project
Link soon

RSD Lab Projects

Open source projects, technical experiments, automations and documentation related to DevOps, Kubernetes, infrastructure as code, observability and SRE.

DevOpsKubernetesSREIaCObservability
View GitHub

DevOps/SRE consulting for companies in Brazil and Latin America

RSD Lab supports companies that need to evolve cloud infrastructure, Kubernetes, Terraform, CI/CD, observability, operational reliability and cloud cost management. Work can start with a technical assessment and continue through focused projects or ongoing support in DevOps, SRE, Platform Engineering and FinOps.

Technical assessment

Start with a technical assessment

In an initial review, I assess cloud infrastructure, Kubernetes, Terraform, CI/CD, observability, security and cost structure to identify risks, quick wins and priorities.

Deliverables

  • Technical risk map
  • Prioritized recommendations
  • Quick wins
  • Execution action plan

Does your infrastructure need more predictability?

If your company is dealing with unstable deployments, difficult-to-run Kubernetes, disorganized Terraform, limited operational visibility or rising cloud costs without clarity, we can talk about a technical assessment.

A first conversation to understand the context and assess whether it makes sense to work together.