Skip to content

About

Thais Vaz

My name is Thais Vaz. I’ve been a data engineer for 8+ years.

I started at Itaú Unibanco, where I received the MÉRITO Prize for data quality work. Then moved to EBANX while it was still a fast-growing fintech, and built ETL pipelines processing 100M+ daily transactions. After that, I led an international team at HCL Technologies on Apple projects in Silicon Valley, managing 500M+ daily events. Today I’m a senior data engineer at Bradesco, one of Brazil’s largest banks.

My core stack is Databricks. Not because I read the docs. Because it’s what runs in production where I’ve worked.

Why this blog exists

In 2024 I started a Master’s in Numerical Methods in Engineering at UFPR (Federal University of Paraná). My research is on AI-driven predictive monitoring using LLMs for operational systems.

Along the way I noticed something that was bothering me. Almost nobody was writing about production data engineering in Portuguese. Not the way I wanted to read it: with depth, from someone who actually shipped it, at a real bank, with real LGPD compliance constraints, SLA requirements, and regulatory oversight.

So I started writing.

What you’ll find here

The first track is production data engineering. Databricks, Delta Lake, Spark, dbt, Airflow. Real architecture decisions, mistakes I made and what I learned. Brazilian context where it’s relevant.

The second is the crypto AI agent, built in public. Architecture, code, backtesting, on-chain analysis. Every step documented. If something breaks, you’ll know why.

The third is the master’s research translated to practice. What academic research has to say about the problems you face every day, no filter.

Published in Portuguese and English, every week.

Where to find me

  • Newsletter on Substack: vazdeng.substack.com, a summary of what’s published here straight to your inbox
  • GitHub: @thaiscvaz
  • LinkedIn: thacvaz
  • Contact: reply to any post or ping me on LinkedIn