1
2 Comments

I built an end-to-end data pipeline tool in Go called Bruin

Hi all, I have been pretty frustrated with how I had to bring together bunch of different tools together, so I built a CLI tool that brings together data ingestion, data transformation using SQL and Python and data quality in a single tool called Bruin:

https://github.com/bruin-data/bruin

Bruin is written in Golang, and has quite a few features that makes it a daily driver:

it can ingest data from many different sources using ingestr
it can run SQL & Python transformations with built-in materialization & Jinja templating
it runs Python fully locally using the amazing uv, setting up isolated environments locally, mix and match Python versions even within the same pipeline
it can run data quality checks against the data assets
it has an open-source VS Code extension that can do things like syntax highlighting, lineage, and more.
We had a small pool of beta testers for quite some time and I am really excited to launch Bruin CLI to the rest of the world and get feedback from you all. I know it is not often to build data tooling in Go but I believe we found ourselves in a nice spot in terms of features, speed, and stability.

Looking forward to hearing your feedback!

https://github.com/bruin-data/bruin

posted to Icon for group Developers
Developers
on December 17, 2024
  1. 1

    This is really impressive! Love the local-first design, especially the ability to mix Python versions per pipeline.
    Go isn’t a common pick for data tools, but I bet the speed and stability benefits are huge here. Can’t wait to give it a spin—congrats on the launch!

  2. 1

    hey @denizhd, my first concern would be scale of data it can run if i use bruin. Most of the daily ingestion tasks of companies can finish on single small ec2 machine but it is hard to understand how much resource it will need. So they use spark which can dynamically allocate resource. I have worked on project to estimate resource need of the tasks before migrating to new platform and it is very hard to be precise.

Trending on Indie Hackers
The most underrated distribution channel in SaaS is hiding in your browser toolbar User Avatar 194 comments How are you handling memory and context across AI tools? User Avatar 109 comments I gave 7 AI agents $100 each to build a startup. Here's what happened on Day 1. User Avatar 105 comments Do you actually own what you build? User Avatar 66 comments Code is Cheap, but Scaling AI MVPs is Hard. Let’s Fix Yours. User Avatar 34 comments How to see your entire business on one page User Avatar 29 comments