Home - Confessions of a Data Guy

March 23, 2023

AWS Lambdas. Useful for Data Engineering?

Are lambdas one of those tools that everyone uses and no one talks about? I guess I’ve taken them for granted over the years, even though they are incredibly useful. For a lot of my Data Engineering career I didn’t really think about or use AWS lambdas; I just saw them as little annoying flies […]

March 20, 2023

Uncategorized

5 git Commands your Grandma uses.

March 11, 2023

Data, Data Engineering

Contributing to Open-Source.

March 7, 2023

Big Data, Data, Data Engineering

What is a Data Mesh?

March 2, 2023

Uncategorized

GitHub’s CoPilot Writes Data Pipelines

March 1, 2023

Data, Data Engineering, Python, Rust

AWS Lambdas – Python vs Rust. Performance and Cost Savings.

Save money, save money!! Hear, hear! Someone on LinkedIn recently brought up the point that companies could save gobs of money by swapping out AWS Python lambdas for Rust ones. While it raised the ire of many a Python Data Engineer, I thought it sounded like a great idea. At least it’s an excuse to […]

February 26, 2023

Big Data, Data, Data Engineering, Data Warehousing

Data Types in Delta Lake + Spark. Join and Storage Performance.

Hmm … data types. We all know they are important, but we don’t take them very seriously. I mean, we know the difference between booleans, strings, and integers; those are easy to get right. But we all get sloppy; sometimes we go the string and varchar route because we don’t spend enough time on the […]

February 11, 2023

Data, Data Engineering, Rust

Ownership and Borrowing in Rust – Data Engineering Gold Mine.

As I started to use Rust on and off, more out of curiosity than anything, I discovered some specks of gold buried down in the depths. Some of the things I’m going to talk about, well … all of it, is probably fairly obvious to most Rust folk, but it’s enjoyable to learn what new […]

February 7, 2023

Big Data, Data, Data Engineering, Python

Replacing Pandas with Polars. A Practical Guide.

I remember those days, oh so long ago, it seems like another lifetime. I haven’t used Pandas in many a year, decades, or whatever. We’ve all been there, done that. Pandas I mean. I would dare say it’s a rite of passage for most data folk. For those using Python, it’s probably one of the […]

January 19, 2023

Future Proof Yourself Against AI.
