Latest Update:
Running Code AI Locally: An Engineering Reality Check
Over the last couple of days, my LinkedIn feed has been flooded with euphoric posts about “Code AI” and “local coding assistants”. Screenshots of terminals, bold claims about productivity exploding,…
Most recent updates
Teaching a Machine to Recognize Traveling Bears
This project did not start as an attempt to build a generic image recognition system or to benchmark computer vision frameworks. It started with three teddy bears that have been traveling with me since 2017. Over the years, they have accompanied me on flights, through airports, into hotel rooms, conference venues, cafés, and occasionally onto beaches. Airline by airline, the family slowly grew. Each bear developed its own character, its own role, and eventually its own name. They are not famous on Social Media, and they were never meant to…
Vibe Coding: Why It Feels Productive and Why It Fails Engineering
There is a growing belief that software engineering has become an optional skill and a 20-dollar subscription with the right prompts can build complex systems without understanding architecture, versioning, security, or operational reality. Engineers, according to this narrative, are a bottleneck that can be removed. I am skeptical of claims like these, but I do not consider dismissal without evidence a serious position. Instead of arguing in the abstract, I tested the approach myself under real constraints, using a realistic technology stack and enough complexity to move beyond toy examples….
Teaching a Machine to Clean Up My Document Chaos
This „project“ did not start with the ambition to build a generic document classifier or to compete with existing document management systems. It started with a much more personal and probably familiar situation. I wanted to explore whether machine learning could help me to organize my PDFs better. Not reminding me of deadlines or summarizing documents, but assisting with a very concrete task: taking a new PDF and proposing the folder where it belongs, based on how similar documents were filed in the past. And for safety reasons, the system…
Part II: From Models to Systems: Building Real AI Infrastructure
How streaming, feedback, and governance turn algorithms into intelligence. In Part I, we established that a Large Language Model is not Artificial Intelligence. LLMs generate text but AI systems generate outcomes. Now we’ll look at what makes those systems real: data flow, feedback, and accountability. The Lifecycle of Real Intelligence A genuine AI implementation is a closed-loop system. It doesn’t stop when the model outputs a prediction. It begins there. Every iteration passes through five continuous phases: Sense – collect signals from the world (transactions, sensors, logs, or user actions).Infer…
The „LLM = AI“ Myth
Why equating generative models with intelligence is technically wrong — and dangerous. At some point in the last two years, the term Artificial Intelligence stopped meaning what engineers and scientists meant by it.It became shorthand for anything that calls an OpenAI API or produces text that sounds clever. But a system that completes your sentences isn’t the same as a system that learns from experience. Calling a Large Language Model „AI“ is like calling a typewriter a journalist. LLMs Are Statistical Text Engines, Not Cognitive Systems A Large Language Model…
Why I’m the BigData Pilot
People often ask me why I call myself the BigData Pilot.It started as a metaphor but over time it became my way of working, thinking and leading projects in the world of data and AI. Checklists over ego In aviation, even a captain with 30,000 flight hours still uses a checklist. Not because they don’t know what to do, but because precision and repeatability matter more than ego and a complex system don’t reward improvisation. They reward discipline, awareness, and a clear order of operations. It’s the same in data….
Older updates
AI Ethics 101: The 3 Big Questions
Artificial Intelligence isn’t just a technical breakthrough—it’s a societal one.From hiring to healthcare to content creation, AI systems are making…
Do You Need to Be a Programmer to Work in AI?
Let’s settle something upfront:No—you don’t need to be a programmer to work in AI.But you do need to understand what…
AI in 2030: 5 Predictions
If the last decade was about proving that AI works, the next will be about proving it can work responsibly,…
5 Real AI Applications Already Improving Your Life
AI often gets sold like a movie trailer: robots, revolutions, and the promise of sentient assistants that’ll someday file your…
How Machine Learning Works
You’ve heard the buzz. AI is changing everything. Machine learning is everywhere. And yet, behind all the jargon and hype,…
Vibe Coding Isn’t Engineering—It’s Playing With Matches in a Server Room
You’ve seen them.The self-declared AI builders who’ve discovered ChatGPT’s code generation and now think they’ve unlocked developer superpowers. They copy,…
5 AI Myths That Need to Die (And the Truth Behind Them)
AI has become the default headline material for everything—from the end of jobs to the rise of sentient machines to…
Stop Mixing Up AI, ML, and Data Science
If you’ve ever heard someone say “We’re doing AI” and then describe a dashboard… you’re not alone. Too many conversations…
If Your AI Use Case Needs Perfect Data, It’s Not a Use Case—It’s a Wishlist
Let’s get something out of the way:Your data isn’t perfect. It never was. It never will be. It’s late. It’s…
Most AI Fails in Deployment
Ask any AI leader how their last project went and you’ll likely hear about accuracy. F1 scores. AUC. The model…
The Hardest Part of Machine Learning Isn’t the Machine Learning
Spend enough time in the AI space and you start to notice a pattern. There’s a lot of talk about…
Real-Time Data Is Real—Your Enterprise Roadmap Isn’t
There’s a disconnect we don’t talk about enough.Data systems have gone real-time.Enterprise planning hasn’t. While Kafka pipes millions of events…
Real-Time AI Isn’t Built in Slides—It’s Built Like a Cockpit
You don’t fly a plane with a keynote. You fly it with systems that work under pressure. There’s something strangely…
Kafka Isn’t Just a Queue. And Flink Isn’t Just a Buzzword.
Why real-time systems aren’t luxury infrastructure—they’re how smart businesses stay ahead. Let’s get one thing out of the way:Batch is…
GenAI Is the Loudest Kid in Class—Not the Smartest
Why the future of AI depends on more than fancy prompts and flashy demos? A Useful Tool—But Not a Mastermind…
Batch Is Fine—for Laundry. Not for Business Decisions
Expectations Have Changed—Permanently We live in a world that expects everything now. People track their parcels obsessively. They refresh flight…
PhD Diaries: Research Isn’t What You Think
When people hear I’m pursuing a PhD in Artificial Intelligence, the reactions are nearly predictable: “You must be incredibly smart,”…
Tech Is Only as Smart as the People Behind It
Artificial Intelligence continues to redefine industries, promising automation, efficiency, and unprecedented insights. From self-driving cars to generative language models, AI…
AI and Data Streaming Trends in 2025
As we move into the second quarter of 2025, it is clear that AI and data streaming are evolving at…
Viral AI Trends and the Shadow We Cast: Why That Action Figure Selfie Might Cost More Than You Think
If you’ve spent any time on social media recently, you’ve probably seen the trend: people uploading selfies to AI services…
Not All AI is ChatGPT!
Artificial Intelligence has become synonymous with Generative AI in recent years. Whenever AI is discussed in public forums, mainstream media,…
How AI Powers Smart Investing
The world of investing has always been a numbers game, but in the age of AI, those numbers are being…
Data is the New Oil, but Not All Oil is Refined
The phrase “data is the new oil” has become a common metaphor in the digital economy, emphasizing data’s immense value…
Real-Time Data Streaming with Kafka & Flink: The Foundation for AI and Modern Applications
In today’s digital world, real-time data streaming is no longer a luxury—it’s a necessity. Businesses and AI-driven applications thrive on…
Implementing Real-Time Data Products with Apache Kafka and Apache Flink (Part 3)
As we have explored in the previous parts of this series, high-quality and real-time data are essential for AI and…
Challenges in Building and Maintaining Data Products for AI and ML (Part 2)
Building and maintaining data products for AI and ML is not just about collecting data—it is about ensuring data quality,…
What Are Data Products and Why Do They Matter for AI and ML? (Part 1)
I still remember the days in the early 2010s, when the term „big data“ was widely discussed in the tech…
The $6M AI Startup or how to erase $2 Trillion worth of Market Cap
The AI landscape is undergoing a seismic shift, and at the epicenter is DeepSeek—a $6M Chinese AI startup that has…
Why Generative AI Tools Are Not Suitable for Learning or why we still need good or even better human teacher
Generative AI (GenAI) tools, such as ChatGPT or other large language models (LLMs), are hailed as groundbreaking innovations for generating…
Why Generative AI Tools Are Not Suitable for Literature Research
Generative AI tools like ChatGPT are revolutionizing how we interact with information. They excel at producing natural, human-like responses, summarizing…
Trade Monitoring and Pattern Matching with Flink and Kafka
Financial markets generate one of the densest streams of real-time data we can observe today. Price ticks, order submissions, cancellations,…
Reflections on Gitex 2024: A Glimpse into the Future of Technology
Attending Gitex 2024 in Dubai felt like a step back into the golden days of CeBIT, Germany’s once world-renowned tech…
Apache Flink: Unleashing the Power of Stream Processing
In the evolving landscape of real-time data processing, Apache Flink has emerged as one of the leading stream processing frameworks,…
Apache Kafka 101: The Backbone of Real-Time Data Streaming
In today’s data-driven world, businesses need to process and analyze vast amounts of information in real time. This demand has…
Navigating AWS SageMaker and Bedrock: Understanding Their Differences and Use Cases
In the landscape of AI and machine learning, Amazon Web Services (AWS) has introduced two major services—SageMaker and Bedrock—that cater…
GDPR Compliance with Apache Kafka: Handling Data Deletion Requests Effectively
Introduction The General Data Protection Regulation (GDPR), or Datenschutz-Grundverordnung (DSGVO) in German, mandates that organizations must be able to delete…
