saidah Archives - DATA DO

Analytics Platform, Data Engineering, Data Warehouse, Generative AI

Start Fresh, Don’t Lift and Shift: Scaling Analytics Platforms with dbt-core and PostgreSQL

by Marc Matt, saidah July 9, 2026 0

We observed that executing a “lift and shift” of legacy, sprawling SQL scripts onto an enterprise cloud data warehouse fails to resolve core structural... Read more.

Analytics Platform, Data Engineering, Data Lake, Data Mesh, Data Warehouse

PostgreSQL Data Mesh: A Technical Guide to Schema Segmentation, Boundaries, and Governance

by Marc Matt, saidah July 3, 2026 0

We deploy PostgreSQL natively to execute a decentralized data mesh architecture, proving that multi-million dollar cloud platforms and proprietary vendor ecosystems... Read more.

Data Engineering, Generative AI

Deterministic RAG Auditing: Implementing Verifiable Grounding & Lineage on Unified PostgreSQL

by Marc Matt, saidah June 26, 2026 0

The pervasive “lost in the middle” phenomenon is a failure of semantic retrieval, not just context window capacity. While increasing token limits is... Read more.

Data Warehouse, Generative AI

Beating “Lost in the Middle”: Unified Graph RAG on PostgreSQL

by Marc Matt, saidah June 19, 2026 0

Our evaluation shows that by substituting naive chunk-based vector lookups with relationally injected context, the model’s $F_1$ verification score increased from... Read more.

Data Engineering, Generative AI

RAG Context Pruning for Efficiency and Cost Optimization

by Marc Matt, saidah June 3, 2026 0

After baseline production runs across our clients’ financial discovery pipelines, we observed an increase in Time-to-First-Token (TTFT) when retrieved context... Read more.

Data Engineering, Generative AI

Production-Grade Compliance: Engineering the EU AI Act into Sovereign Agentic Pipelines

by Marc Matt, saidah May 21, 2026 0

We measured a 42% increase in inference latency when we shifted from standard RAG to a cryptographically-verifiable audit chain. We accept this overhead. After 2,000... Read more.

Data Engineering, Generative AI

Unified Graph-RAG in a Single Postgres Engine

by Marc Matt, saidah May 13, 2026 0

Our production benchmarks confirm that consolidating Hybrid Graph-RAG into a single PostgreSQL instance via pgvector and Apache AGE reduced cross-service network... Read more.

Data Engineering, Data Warehouse, Generative AI

Production Metric: 14.2% Semantic Decay

by Marc Matt, saidah May 6, 2026 0

After processing 2.8 million unstructured retail fragments, we observed that 14.2% of records passing traditional NOT NULL and regex constraints contained semantic... Read more.

Generative AI

Cost-Aware Agentic Workflows with PydanticAI

by Marc Matt, saidah April 29, 2026 0

Introduction: The Hidden Price of Autonomy The Architecture of a Cost Guardrail Implementing Usage Limits with PydanticAI PydanticAI provides the primary library-level... Read more.

Data Engineering, Generative AI, Machine Learning

Specialized Judges: Scaling RAG Evaluation with Prometheus-2 and PydanticAI

by Marc Matt, saidah April 22, 2026 0

Our production benchmarks utilize the Feedback Collection and Preference Collection datasets to establish the performance delta between generalist and specialized... Read more.

Author: saidah

Author

Start Fresh, Don’t Lift and Shift: Scaling Analytics Platforms with dbt-core and PostgreSQL

PostgreSQL Data Mesh: A Technical Guide to Schema Segmentation, Boundaries, and Governance

Deterministic RAG Auditing: Implementing Verifiable Grounding & Lineage on Unified PostgreSQL

Beating “Lost in the Middle”: Unified Graph RAG on PostgreSQL

RAG Context Pruning for Efficiency and Cost Optimization

Production-Grade Compliance: Engineering the EU AI Act into Sovereign Agentic Pipelines

Unified Graph-RAG in a Single Postgres Engine

Production Metric: 14.2% Semantic Decay

Cost-Aware Agentic Workflows with PydanticAI

Specialized Judges: Scaling RAG Evaluation with Prometheus-2 and PydanticAI