Marc Matt Archives - Page 2 of 5 - DATA DO

Generative AI

AI Agent Workflows: Pydantic AI

by Marc Matt, Saidah Kafka January 16, 2026

Building Intelligent Multi-Agent Systems with Pydantic AI: In the rapidly evolving landscape of artificial intelligence, multi-agent systems have emerged as a powerful...

Generative AI

The Ultimate Vector Database Showdown: A Performance and Cost Deep Dive on AWS

by Marc Matt, Saidah Kafka January 8, 2026

In the age of AI, Retrieval-Augmented Generation (RAG) is king. The engine powering this revolution? The vector database. Choosing the right one is critical for...

Analytics Platform, Big Data, Data Lake, Data Warehouse, Tools

Apache NiFi on Google Cloud Kubernetes Engine (GKE)

by Marc Matt December 6, 2022

Apache NiFi on GKE can be a good choice if you want a low-code solution for processing streaming data. If you set it up on GKE, a managed version of Kubernetes,...

Analytics Platform, Big Data, Data Lake, Data Warehouse, Tools

Data Infrastructure in the Cloud

by Marc Matt January 30, 2021

Having your data infrastructure in the cloud has become a real option for a lot of companies, especially since the big cloud providers have a lot of managed services...

Analytics Platform, Data Science, Visualization

Bringing machine learning models into production

by Marc Matt May 29, 2020

Developing and bringing machine learning models into production is a task with a lot of challenges. These include model and attribute selection, dealing with missing...

Big Data, Data Warehouse, Machine Learning, Tools

Google Cloud Data Engineer Exam Preparation

by Marc Matt August 19, 2019

This is a short write-up of everything that helped me prepare for the Google Cloud Data Engineer Exam. There are a lot of courses and resources that help you...

Analytics Platform, Big Data, Data Lake

AVRO schema generation with reusable fields

by Marc Matt October 7, 2018

Why use AVRO and AVRO Schema? There are several serialized file formats out there, so choosing the one best suited to your needs is crucial. This blog entry will...

Analytics Platform, Data Science, Machine Learning, Tools

Plumber: Getting R ready for production environments?

by Marc Matt June 13, 2018

R Project and Production: Running R in production is a hotly debated topic, as is everything concerning R vs. Python. Lately there have been some...

Analytics Platform, Big Data, Data Lake, Data Warehouse

Analytics Platform: An Evolution from Data Lake

by Marc Matt October 29, 2017

Analytics Platform: Once you have built a Data Lake for your company’s analytical needs, new use cases will soon arise that cannot easily be covered with the...

Data Lake, Data Warehouse, Tools

Building a Productive Data Lake: How to keep three systems in sync

by Marc Matt February 26, 2017

Three Systems for Safe Development: When you are building a productive Data Lake, it is important to have at least three environments: Development: for development,...

Author: Marc Matt