Skip to content

DATA DO – データ道

About
Data Science: What is it?
Data Scientist: Hype or Sexy?
Code Repository
Imprint
Terms u0026#038; Conditions

DataScientists: a blog about everything data related.

The Future of Automation is Local: Why German Firms are Trading the Cloud for On-Premise AI

In early 2026, the AI landscape reached a crossroads. On one side, we have the “reasoning giants”: GPT-5.4 and Gemini 3.1 Pro. These models offer unprecedented cognitive abilities, but they come with a “Data Tax” that many German firms are no longer willing to pay. On the other side, a revolution in Small Language Models…

April 15, 2026
From Generalist to Specialist: Benchmarking the 25x Speedup of Fine-Tuned “Tiny Compilers”

We measured a 96.7% reduction in inference latency by migrating our EDI logic from Llama 4 (70B) to a fine-tuned Llama 3.2 (1B) “Tiny Compiler.” In high-volume logistics testing, the generalist model averaged 2,800ms per transaction, while the specialized 1B model, quantized to 4-bit, stabilized at $92ms$ on consumer-grade hardware. We accept the 0.4% decay…

April 8, 2026
The LLM-as-a-Compiler Pattern for High-Precision EDI Pipelines

As we look toward the next phase of industrial AI, the German Mittelstand is poised to move beyond “AI as a Chatbot” and toward the LLM-as-a-Compiler pattern. This represents a fundamental shift from “AI as a Librarian” to a “Deterministic Data Engineer.” The following architecture serves as a primary example of how this compiler pattern…

March 31, 2026
Part 4: The Human Interface — Enterprise RAG Deployment for 100+ Users

1. Introduction: From Prototype to Enterprise Building a Retrieval-Augmented Generation (RAG) system that works on a laptop is a common starting point, but it is rarely enough for a corporate environment. Consequently, deploying it to handle 100+ concurrent employees each with unique access levels, real-time streaming requirements, and finite GPU resources represents an entirely different…

March 23, 2026
Part 3: The Validation Layer — Reranking, Cross-Encoders, and Automated Evaluation

1. Introduction: Why Vector Search Alone Isn’t Enough In Part 2, we optimized our system for Recall—using expansion and routing to ensure the “needle” is somewhere in our top 50 results. However, in production, being “somewhere in the top 50” is a liability, not a feature. Vector search is fast—it takes milliseconds to retrieve candidates.…

March 13, 2026
Part 2: The Multi-Step Retriever — Implementing Agentic Query Expansion

1. Introduction: The Death of the “Simple Search” In Part 1, we defined the blueprint for a production-grade Agentic RAG system. We moved away from passive retrieval toward a “reasoning-first” architecture. But even the best reasoning engine fails if the data fed into it is garbage. When a business user asks, “What’s our policy on…

March 2, 2026
Building Production-Grade Agentic RAG: A Technical Deep Dive – Part 1

Beyond Fixed Windows — Agentic & ML-Based Chunking Introduction: The RAG Gap The promise of Retrieval-Augmented Generation (RAG) is compelling: ground large language models in enterprise data, reduce hallucinations, enable real-time knowledge updates. But in practice, most RAG systems fail silently. They fail not because embedding models are weak or vector databases are slow, but…

February 18, 2026
Modernizing Data Warehouses for AI: A 4-Step Roadmap

It’s the same conversation in every boardroom and Slack channel: “How are we using LLMs? Where are our AI agents? When do we get our Copilot?” But for the teams in the trenches, the hype is hitting a wall of legacy infrastructure. The truth is that Modernizing Data Warehouses for AI is the invisible hurdle…

February 12, 2026
How Poor Data Engineering Corrodes GenAI Pipelines

Generative AI (GenAI) has captivated the world with its ability to create, synthesize, and reason. From crafting compelling marketing copy to assisting in scientific discovery, its potential seems boundless. However, the dazzling outputs often mask a critical vulnerability: the quality of the data underpinning these systems. When data engineering falters, issues of data quality, governance,…

February 5, 2026
Designing Production-Grade GenAI Automation

A dbt Ops Agent Case Study A small, well-instrumented workflow can turn dbt failures into reviewable Git changes by combining deterministic parsing, constrained LLM tooling, and VCS-native delivery — while preserving governance through traces, guardrails, and CI. This is a blueprint to build a first Production-Grade GenAI Agent. You can find the complete implementation and…

January 28, 2026

←Newer Posts Older Posts→

Got any book recommendations?

DATA DO – データ道

Proudly powered by WordPress

Manage Consent

To provide the best experiences, we use technologies like cookies to store and/or access device information. Consenting to these technologies will allow us to process data such as browsing behavior or unique IDs on this site. Not consenting or withdrawing consent, may adversely affect certain features and functions.

Functional Functional Always active

The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.

Preferences Preferences

The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.

Statistics Statistics

The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.

Marketing Marketing

The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.

Manage options
Manage services
Manage {vendor_count} vendors
Read more about these purposes

View preferences

{title}
{title}
{title}