The best insights come from real conversations, with real leaders making tough calls, solving messy problems, and figuring out how to scale their data teams in the real world. That’s why I started The Data Leaders Digest.
Each month, we share what’s actually happening inside data teams, pulled from our podcasts, live events, meetups, and honest behind-the-scenes conversations.
No theory. Just practical takeaways you can use right now to make better decisions, avoid common pitfalls, and keep moving forward.
July has been a month of unexpected headlines in the data world. From Astronomer’s PR crisis to the ripple effects of Andrej Karpathy’s June keynote still reshaping how teams think about AI-first infrastructure, it’s a good time to step back and ask: what kind of companies do we want to build our data infrastructure on?
In this edition:
When Kiss Cams Go Wrong: What Astronomer’s PR Crisis Says About Vendor Culture
You’ve probably seen the headlines. Astronomer’s CEO got caught in an awkward kiss cam moment at a Coldplay concert, leading to a social media storm and the company hiring Gwyneth Paltrow as “chief brand officer” to manage the fallout.
It’s corporate theatre that’ll blow over. But for data leaders choosing vendors for critical infrastructure, it raises a question: what does this say about judgment and company culture?
The Pattern Problem The same judgment that led to that kiss cam situation, and then hiring Gwyneth Paltrow as a PR fix, makes decisions about your data platform. When pipelines fail at 2am, do you want it handled by a team that thinks celebrity endorsements solve technical problems?
The Airflow Reality While Astronomer managed their PR crisis, something more interesting happened with their core product. Apache Airflow, the open-source orchestrator Astronomer builds on is increasingly questioned by data teams.
As one client put it: “Airflow should only ever orchestrate, no compute should be done whatsoever.” Yet you see teams push Airflow beyond scheduling tasks into heavy data loading.
What’s Actually Happening Teams are moving away from force-fitting Airflow into every workflow. Two of our recent clients switched off their Airflow setups entirely, no more fragile DAGs failing at 2am.
The pattern: use the right tool for the right job. Teams are combining purpose-built tools, Meltano for extraction, dbt for transformation, Spring for scheduling, rather than forcing one tool to do everything.
The Ethics Question When a vendor’s response to controversy is hiring a celebrity spokesperson rather than addressing underlying issues, what does that tell you about their priorities?
The Bottom Line While Astronomer plays damage control with movie stars, the real question is: do you want critical infrastructure built by a company that thinks celebrity endorsements solve technical problems?
Most of us would rather focus on solving problems that actually cause sleepless nights in data ops.
The Astronomer situation highlights three critical questions every data leader should ask when choosing vendors:
1. Do their values align with yours?
Look beyond the marketing. How does the company handle controversy? Do they address root causes or just manage PR?
2. Are they using the right technical approach?
Don’t get trapped by vendor marketing about their “best-in-class” solution. Ask whether their tool is actually designed for your use case.
3. What happens when things go wrong?
Every platform has issues. The question is: how does the vendor respond? With genuine solutions or celebrity distractions?
If you’re currently using Airflow for data loading (rather than just orchestration), or if you’re questioning whether your current vendors align with your values, it might be time for a review.
Questions to ask your current setup:
Consider alternatives that:
“Software is Changing (Again)” What AI Revolution Means for ETL
Andrej Karpathy (ex-Tesla AI lead) outlined software’s third major evolution at AI Startup School: from explicit code (1.0) to neural networks (2.0) to LLM-driven programming (3.0). But here’s what he didn’t mention is this shift exposes the fundamental problems with how we’ve been buying data tools.
The ETL pricing trap in an AI-first world:
Karpathy’s “partial autonomy” insight matters for data teams: The future isn’t fully automated pipelines, it’s human-supervised systems that can adapt quickly. But current ETL vendors force you to choose between expensive flexibility or cheap rigidity.
The Matatika take: If everyone can now “vibe code” with LLMs, data infrastructure should be just as flexible. That’s why we built around performance-based pricing (pay for what you actually use) and open standards (no proprietary lock-in). When the next AI breakthrough arrives, you can adapt without renegotiating contracts or rewriting integrations.
Bottom line: Software 3.0 demands infrastructure that evolves with you, not against you. The teams that embrace this shift will build better AI applications – whilst those stuck in legacy pricing models will spend their budgets on vendor margins instead of innovation.
What a night! Our 10 July meetup at Google London HQ was absolutely brilliant – possibly the best yet.
Joseph Lane from IAG Loyalty delivered my favourite talk of the evening. It was frank, hilarious, and packed with hard-earned insights from inside the real data project trenches. His migration lessons were golden: don’t rewrite legacy systems (even when they’re “correct but no one knows how”), and remember – as soon as your numbers are wrong, you’ve lost all credibility.
On team structures, Joseph’s take was spot on: centralised approaches create silos and slow everything down, whilst full autonomy with solid “town planning” works brilliantly at scale (think 40+ engineers, 1000+ models).
Holly Foster and Varun S Gangoor followed up with fascinating insights into data challenges in cyber security – a brilliant reminder that one team’s solution isn’t necessarily yours. Most of us aren’t dealing with 50TB of data per day!
Brilliant turnout, fantastic venue, and exactly the kind of honest, practical insights that make these meetups worth your evening.
Maximising Data ROI? Stack It With the Best
On 27th June, we hosted a session with Jack Colsy (Incident.io), Harry Gollop (Cognify Search), and Aaron Phethean (Matatika) exploring how to get the most out of your team, your tools, and your structure. The conversation highlighted a key insight: people, not platforms, are often your biggest cost—so how do you maximise what you’ve got?
If you’re dealing with budget pressure or wondering whether your current stack is holding your team back, our detailed recap breaks down the specific strategies each speaker shared for building high-performing data teams without breaking the bank.
Key takeaways from the session
Missed the event?
🔗 Prefer to read instead? Get the full recap article here
We’re officially on our summer break until September, but before you switch off completely, make sure you’ve caught up on Season 2 of the Data Matas podcast – or if you missed it entirely, now’s the perfect time to dive in.
This season delivered exactly what we’re about: proper conversations with data leaders who’ve actually been there and done that.
The common thread? The most successful data teams have stopped firefighting and started fundamentally rethinking how they work – simplifying stacks, ditching vendor lock-in, and making data genuinely reliable.
This is why Matatika is passionate about delivering real value to the data community through honest insights from people who’ve solved the problems you’re wrestling with right now.
🎧Listen to the full episode 📺 Watch on You Tube
See you in September with fresh voices and more conversations that actually matter
Make the Right Call on Architecture and Cost
This month we’ve published four new guides to help you cut through architectural trade-offs, clarify storage decisions, and navigate ETL pricing with confidence:
This month we’ve published four new guides to help you cut through architectural trade-offs, clarify storage decisions, and navigate ETL pricing with confidence:
Help us reach more data leaders who need these insights:
Follow Matatika on LinkedIn | Subscribe for Insights | Visit Our Website
Stay up to date with the latest news and insights for data leaders.