Production Kubernetes platforms
Real cloud infrastructure, automation, reliability and operations challenges that impact engineering teams in production environments across Brazil and Latin America.
Unstable or manual deployments with no clear rollback path
Limited visibility into errors, logs, metrics and alerts
Cloud infrastructure growing without standardization or governance
Kubernetes/EKS that is hard to operate, scale or sustain
Disorganized Terraform with no modular structure or best practices
Recurring incidents and low operational predictability
Slow, fragile pipelines with no security or cost validation
Growing cloud costs without visibility or control
Development teams overloaded with operational work
Infrastructure, cluster, region or critical service migrations
One-off projects or recurring support for companies that need to improve infrastructure, automation, observability, reliability and cloud costs.
Technical review of the environment, identification of risks, bottlenecks, operational issues and quick wins, with a prioritized action plan.
Assessment of AWS, GCP and Azure environments, Kubernetes clusters (EKS, GKE, AKS), workloads, costs, security, scalability, deployments and operations.
Module organization, structure review, standardization, automation, validation, governance and IaC best practices.
Design or improvement of pipelines, deployment automation, rollback paths, quality gates, approvals, Terraform validation, security and cost controls.
Dashboards, metrics, logs, alerts, basic SLOs, runbooks, log retention, incident response improvements and reliability practices.
Waste analysis, right-sizing, Spot instances, Karpenter, Kubecost, ephemeral workloads and governance to reduce costs without compromising critical workloads.
Support for production migrations involving clusters, applications, databases, messaging, DNS, load balancers, WAF, cloud regions and post-migration validation.
Mentoring, technical support and direction for teams that need to mature DevOps, SRE, Platform Engineering, GitOps and cloud operations.
A practical DevOps and SRE consulting approach to understand the current environment, prioritize risks and deliver sustainable improvements in cloud operations.
Understand the context, current pain points, technical goals, constraints and the maturity level of the team.
Assess infrastructure, cloud, Kubernetes, pipelines, observability, security, costs and operational processes.
Prioritize risks, improvements, quick wins and deliverables based on impact, urgency and complexity.
Implement improvements or support the team through execution with safety, documentation and governance.
Hands-on experience with Kubernetes, AWS, Terraform, CI/CD, observability, migrations and FinOps in production environments.
Architecture, evolution and ongoing operation of EKS, GKE and AKS clusters supporting critical workloads, autoscaling, security and operational governance.
Production migrations involving clusters, cloud regions, databases, messaging, DNS and load balancers, carried out without business disruption and followed by post-migration validation.
Reliable pipelines with Azure DevOps, ArgoCD, Jenkins and GitLab CI, combined with organized Terraform, security and cost validation, and predictable rollouts.
Useful metrics, logs and alerts with Prometheus, Grafana, Loki and Datadog, along with SRE practices, incident management and cost reduction using Karpenter, Spot and Kubecost.
Experience built across technology, retail, SaaS and international environments, including work with Track.co, Cherokee Nation Businesses and Centauro.
I am a Tech Lead in Platform Engineering and founder of RSD Lab, working across DevOps, SRE, Cloud, Kubernetes and FinOps consulting.
I have experience designing, evolving and sustaining critical platforms in AWS, GCP and Azure environments, with a focus on Kubernetes, Terraform, CI/CD, observability, automation, security, complex migrations, operational governance and platform modernization.
My work connects technical strategy, hands-on execution and production reliability, helping companies reduce operational risk, improve deployments, organize cloud infrastructure, evolve DevOps/SRE practices and optimize costs.
My focus is to deliver practical, sustainable solutions aligned with each company's current stage, without unnecessary complexity.
Prometheus, Grafana, Datadog, Terraform, Kubernetes on EKS, CI/CD, GitOps and infrastructure automation across projects.
Flexible Platform Engineering and DevOps consulting formats for companies in Brazil and Latin America, with on-site support in São Paulo or remote engagement.
Talks, community initiatives and knowledge-sharing efforts.
Participation in events and initiatives such as DevOpsBootcampLive, AWS Members Day and BrazilClouds Talks, sharing hands-on experience in infrastructure, Kubernetes and service mesh.
Support for public-school students from Capão Redondo as they go through the admissions process for international universities.
Alongside consulting work, RSD Lab also organizes technical projects and open source initiatives focused on automation, infrastructure, Kubernetes, DevOps/SRE and knowledge sharing.
A support library for Jenkins pipelines in Windows/PowerShell environments, making automation, standardization and CI/CD step reuse easier.
An installation and setup guide for n8n with Docker, PostgreSQL and Slack integration, focused on workflow automation and integrations.
Open source projects, technical experiments, automations and documentation related to DevOps, Kubernetes, infrastructure as code, observability and SRE.
RSD Lab supports companies that need to evolve cloud infrastructure, Kubernetes, Terraform, CI/CD, observability, operational reliability and cloud cost management. Work can start with a technical assessment and continue through focused projects or ongoing support in DevOps, SRE, Platform Engineering and FinOps.
In an initial review, I assess cloud infrastructure, Kubernetes, Terraform, CI/CD, observability, security and cost structure to identify risks, quick wins and priorities.
If your company is dealing with unstable deployments, Kubernetes that is hard to operate, disorganized Terraform, limited operational visibility or rising cloud costs with no clear cause, we can talk about a technical assessment.
A first conversation to understand the context and assess whether it makes sense to work together.