Data ArchitectureExplainerJun 16, 2026, 8:46 PM· 3 min read· #4 of 4 in technology

Databricks Unveils Unified Architecture to Eliminate Data Pipelines for AI Agents

Databricks has introduced a new database architecture that collapses the decades-old separation between transactional and analytical systems, removing the latency bottlenecks that hinder autonomous AI agents.

By Factlen Editorial Team

Share this story

Enterprise Data Leaders 40%AI Application Developers 35%Data Engineering Practitioners 25%

Enterprise Data Leaders: Focused on reducing cloud compute costs, simplifying governance, and eliminating redundant infrastructure.
AI Application Developers: Focused on securing real-time, low-latency data access to empower autonomous AI agents.
Data Engineering Practitioners: Focused on the operational relief of no longer maintaining brittle ETL pipelines and side-car serving layers.

What's not represented

· Legacy Database Vendors defending the necessity of specialized, separated database architectures.
· FinOps Analysts evaluating the true compute cost of running continuous real-time analytical queries directly on object storage at scale.

Why this matters

As AI agents move from answering questions to executing complex business workflows, they require real-time access to live operational data. Eliminating the pipelines between databases removes a critical bottleneck, enabling faster, more autonomous enterprise AI.

Key points

Databricks unveiled a new architecture at its Data + AI Summit designed to eliminate the 40-year separation between transactional and analytical databases.
The new LTAP system allows AI agents to access live operational data instantly, removing the need for slow, brittle ETL pipelines.
A new real-time engine, Lakehouse//RT, delivers millisecond query latency directly on the data lake, eliminating separate serving databases.
The overhaul is driven by the rise of AI agents, which now create 80% of the databases on the Databricks platform and require real-time context to function.

10ms

Lakehouse//RT latency on small datasets

16x

Performance gain over existing serving stacks

80%

Databricks databases now created by AI agents

The enterprise data stack has relied on a fundamental compromise for 40 years: separating the databases that run applications from the databases that analyze them.[3]

At the Data + AI Summit in San Francisco on Tuesday, Databricks announced a sweeping overhaul of this paradigm, introducing a unified architecture designed to eliminate the brittle pipelines that connect these disparate systems.[1][2]

The core issue stems from competing optimizations. Operational databases (OLTP) are built for fast, individual transactions—like recording a retail sale or updating a user profile. Analytical databases (OLAP), conversely, are built to scan massive historical datasets to find trends and generate reports.[2]

Because these systems are fundamentally incompatible, data engineers have spent decades building Extract, Transform, and Load (ETL) pipelines. These pipelines periodically copy data from the operational side, reformat it, and paste it into the analytical data warehouse.[1][2]

How LTAP collapses the traditional separation between transactional and analytical databases.

While human analysts could tolerate the hours-long latency of ETL pipelines, artificial intelligence cannot. Databricks CEO Ali Ghodsi noted that 80% of the databases on their platform are now being spun up by AI agents rather than human engineers.[3]

A system that reasons continuously and acts on live data cannot wait for a nightly batch update. If an AI agent is tasked with dynamically rerouting supply chain shipments based on live inventory, it needs millisecond access to the exact operational state of the business.[1][2]

To solve this, Databricks unveiled Lake Transactional/Analytical Processing (LTAP). LTAP allows both transactional and analytical engines to operate directly on a single, open-format data copy stored in a data lake.[2][3]

Leveraging "Lakebase"—a serverless PostgreSQL database stemming from Databricks' $1 billion acquisition of Neon—LTAP writes transactional data directly into open formats like Delta and Iceberg from the moment of creation.[1][3]

By writing operational data directly to the lakehouse format, LTAP theoretically removes the need for change data capture (CDC) pipelines and replicated databases entirely. The data is instantly available for analytical reasoning without ever being moved.[2][4]

The data is instantly available for analytical reasoning without ever being moved.

However, even with unified storage, enterprises historically had to stand up separate "serving layers"—like Redis or ClickHouse side-cars—to deliver that analytical data to user-facing dashboards and applications at high speeds.[4]

Alongside LTAP, Databricks introduced Lakehouse//RT, a real-time analytics engine powered by a new compute engine dubbed "Reyden" to address this final hurdle.[2][4]

Databricks claims Reyden can deliver query response times as low as 10 milliseconds for smaller workloads and under 100 milliseconds for larger datasets, supporting tens of thousands of concurrent users and agents.[2][4]

Lakehouse//RT aims to deliver millisecond latency without the need for separate serving databases.

Early preview customers reported up to 16 times better performance compared to their existing specialized real-time serving stacks, all while querying data natively within the lakehouse.[4]

The shift is already rippling through the data ecosystem. Business intelligence platforms like Sigma announced immediate support as launch partners, allowing customers to build operational dashboards directly on Lakehouse//RT without moving data.[6]

Beyond speed, collapsing the stack solves a massive security headache. With all data residing in one place, Databricks' Unity Catalog can apply a single set of governance rules and access permissions, rather than forcing security teams to audit a fragmented web of copied databases.[4][5]

AI agents require real-time access to operational data to execute complex business workflows autonomously.

This architectural pivot reflects a broader industry race. Microsoft, Snowflake, and Databricks are all aggressively retooling their infrastructure to support "agentic data engineering," where autonomous systems handle ingestion, transformation, and quality management.[5]

As enterprises move from experimental AI chatbots to autonomous agents that execute core business processes, the underlying data infrastructure has become the primary bottleneck.[1][2]

By declaring the end of the data pipeline, Databricks is betting that the future of enterprise AI requires a radically simpler foundation—one where data is written once, governed centrally, and queried instantly by the agents running the business.[3][4]

How we got here

1980s–2010s
Enterprises establish the standard practice of separating fast transactional databases from large analytical data warehouses, connecting them with ETL pipelines.
2014
Analysts coin the term HTAP (Hybrid Transactional/Analytical Processing) as early vendors attempt to unify these workloads within single database engines.
2020
Databricks pioneers the 'Lakehouse' architecture, successfully unifying data lakes and data warehouses for analytical workloads.
February 2026
Databricks launches Lakebase, a serverless PostgreSQL database designed to bring transactional workloads directly onto lakehouse storage.
June 16, 2026
At the Data + AI Summit, Databricks announces LTAP and Lakehouse//RT, officially aiming to eliminate data pipelines and unify transactional and analytical data for AI agents.

Viewpoints in depth

Enterprise AI Developers

Focused on the speed and autonomy of AI agents.

For developers building agentic workflows, the elimination of data pipelines is a massive unlock. Agents require 'ground truth' operational data to make decisions—whether that is approving a loan, rerouting a delivery, or flagging fraud. When forced to rely on stale data from a replicated analytical warehouse, agents hallucinate or make poor operational choices. This camp views unified architectures like LTAP as the mandatory foundation for moving AI from read-only chatbots to autonomous actors.

Data Engineering Teams

Focused on infrastructure maintenance and reliability.

Data engineers have historically spent the majority of their time building, monitoring, and fixing brittle ETL (Extract, Transform, Load) pipelines. Every time an upstream application changes its schema, downstream pipelines break, causing late-night pager alerts. For this group, the promise of 'zero copies' and 'zero pipelines' means shifting their focus from plumbing maintenance to data quality, governance, and building high-value data products.

Database Architecture Purists

Skeptical of 'one-size-fits-all' database claims.

Veterans of the database industry note that the separation of OLTP and OLAP exists for fundamental computer science reasons. Transactional systems are optimized for rapid, row-based writes, while analytical systems are optimized for massive, columnar reads. While Databricks claims to have solved this 40-year-old computer science problem with Lakebase and LTAP, skeptics argue that unifying these workloads on object storage will inevitably introduce performance trade-offs or hidden compute costs under extreme enterprise scale.

What we don't know

How LTAP will perform under the extreme transactional loads of the world's largest e-commerce or financial platforms compared to legacy specialized databases.
The exact compute costs associated with running continuous, real-time analytical queries directly on object storage at scale.
How quickly legacy enterprises, heavily invested in traditional ETL pipelines and data warehouses, will be willing to migrate to this unified architecture.

Key terms

ETL Pipeline: Extract, Transform, Load—a set of processes that copy data from an operational system, reformat it, and move it into a data warehouse for analysis.
OLTP (Online Transaction Processing): Database systems optimized for executing a large number of short, fast, and reliable transactions, like processing e-commerce orders.
OLAP (Online Analytical Processing): Database systems optimized for complex queries and analyzing large volumes of historical data to find business trends.
AI Agent: An artificial intelligence system designed to continuously perceive its environment, reason through complex problems, and autonomously take actions to achieve a goal.
Data Lakehouse: A modern data architecture that combines the flexibility and cheap storage of a data lake with the management and querying capabilities of a traditional data warehouse.
Change Data Capture (CDC): A software process that identifies and tracks changes to data in a database, often used to trigger updates in downstream analytical systems.

Frequently asked

What is the difference between LTAP and traditional databases?

Traditional architectures separate databases into transactional (for apps) and analytical (for reporting). LTAP (Lake Transactional/Analytical Processing) unifies both workloads on a single copy of data, eliminating the need to copy data between systems.

Why do AI agents need real-time data?

Unlike human analysts who look at historical trends, AI agents are designed to execute live business processes. If they act on stale data delayed by pipelines, they risk making incorrect operational decisions.

What is Lakehouse//RT?

It is a new real-time analytics engine from Databricks that allows users and AI agents to query data directly on the lakehouse with millisecond latency, removing the need for separate, specialized serving databases.

Does this replace existing data warehouses?

Databricks positions Lakehouse//RT as a drop-in replacement for current warehouse deployments, aiming to consolidate the fragmented stacks enterprises currently use for real-time serving.

Sources

[1]VentureBeatAI Application Developers
Databricks says it solved the decades-old data pipeline problem that's been slowing AI agents
Read on VentureBeat →
[2]SiliconANGLEEnterprise Data Leaders
Databricks declares the end of pipelines with a unified platform for operational and analytical data
Read on SiliconANGLE →
[3]ForbesEnterprise Data Leaders
Databricks CEO Says He's Cracked A 40-Year-Old Database Problem With LTAP
Read on Forbes →
[4]DatabricksAI Application Developers
Introducing Lakehouse//RT: Real-Time Performance on a Unified Lakehouse
Read on Databricks →
[5]Nitor InfotechData Engineering Practitioners
Why Data Pipelines Break and AI Agents Fix It
Read on Nitor Infotech →
[6]Business WireData Engineering Practitioners
Sigma Named 2026 Databricks ISV Business Intelligence Partner of the Year and Joins as Launch Partner for Lakehouse//RT
Read on Business Wire →

Up next

Open-Weights AI

Z.ai Releases Open-Weights GLM-5.2, Outperforming Proprietary Models in Autonomous Coding

Chinese AI startup Z.ai has launched GLM-5.2, a 753-billion parameter open-weights model that beats industry leaders on complex engineering benchmarks. Available at a fraction of the cost of proprietary alternatives, the model marks a significant milestone in democratizing access to advanced autonomous coding tools.

Every angle. Every day.

Get technology stories with full source coverage and perspective breakdowns delivered to your inbox.

Get the briefing →Browse technology