Databases Are Not Dead; the Paradigm Has Changed

From the evolution of file systems to relational databases, and then to Markdown's next stop in the AI era

Recently, discussions about whether the database industry is disappearing have clearly increased.

The reasons look convincing:

Agents can directly use Markdown as memory
Vector databases are replacing relational databases
More and more applications no longer explicitly design schemas
Many systems no longer even "choose a database"

If you only look at these phenomena, it is easy to reach one conclusion:

Databases are exiting the stage.

But if we extend the time scale, we will find that what is really changing is not the importance of databases, but the migration of the data abstraction layer.

The question we face today is not whether databases are still important. It is whether we are entering a new era of data abstraction layers, just as relational databases emerged when file systems could no longer support application complexity.

Databases have not disappeared. The paradigm has changed.

1. Relational Databases Were Not a Technical Upgrade, but a New Layer of Abstraction

The emergence of relational databases was not a simple technical improvement, but a leap in abstraction layers.

Early computer systems organized data in a very direct way:

Applications managed files
Programs defined format structures
Programs determined access paths

This approach worked perfectly in the era of single programs. But as system scale grew, structural problems quickly appeared:

Data could not be shared
Structural changes affected the entire system
Concurrent access was uncontrollable
Complex queries were difficult to implement
Data structures were hard to evolve

The real problem solved by the relational model was not "how to store data," but:

How to let multiple programs share state.

For the first time, it achieved logical data independence, allowing applications to stop depending on underlying storage paths. Around this capability, it built a unified data model, query language, transaction mechanism, consistency guarantees, and indexing system.

From the beginning, databases were a shared state abstraction layer.

Understanding this is very important, because the problem we face today is actually very similar to the file-system era.

2. Markdown Is Repeating the Role File Systems Once Played

After entering the AI era, more and more systems have begun using:

Markdown
JSON
Vector indexes
Object storage

to build the memory layer.

In many scenarios, this really works. For example:

Personal knowledge bases
Lightweight Agent memory
RAG demo systems
Single-user context management

These systems usually share common characteristics:

Low concurrency
Weak consistency
Append-only writes
Simple queries

Under these conditions:

Markdown + embedding index

can indeed run.

But this does not mean databases are no longer needed.

It means the system is still in a typical pre-relational stage.

Today's Markdown is much like the flat file of the past.

It solves the expression problem, not the state problem.

It provides:

Readability
Exchangeability
Weak structural expression
Knowledge recording

But it does not solve:

Concurrency control
Transactional consistency
Permission isolation
Complex queries
Schema evolution
State recovery

Recent Agent memory research has already begun to describe these limitations systematically. For example, ICLR 2025 work on conversational memory points out that the key problem of long-term memory is no longer "how much context to store," but how to construct compressible, retrievable, and evolvable memory units.

This shows that memory is moving from a text cache toward a structured state system.

3. New Constraints in the Agent Era Have Appeared and Are Rapidly Amplifying

Relational databases were born because file systems could not bear application scale.

Today, similar pressure is emerging again.

But this time, it does not come from programs. It comes from Agents.

If future software systems need to run tens of thousands, hundreds of millions, or even trillions of Agents at the same time, then the objects facing data infrastructure will no longer be programs, but cognitive entities.

Databases will no longer manage data records, but cognitive trajectories:

context
reasoning
tool calls
knowledge
observations
tasks

Databases begin to manage:

state trajectories

rather than:

data rows

At the same time, the concurrency model is changing.

Traditional databases face multi-program concurrency, while future databases will face multi-Agent concurrency.

Programs are usually deterministic, while Agents are usually non-deterministic.

This means:

State merging
Version control
Conflict detection
Reasoning-path coordination

will all become database-level problems.

MemoryAgentBench has already decomposed Agent memory capabilities into:

retrieval
test-time learning
long-horizon reasoning
selective forgetting

These capabilities essentially correspond to indexing, updating, compression, and recovery mechanisms in databases.

At the same time, token cost is becoming a new access-path constraint.

ICLR 2026 work MEM1 further proves that simply expanding the context window cannot solve the long-term memory problem. The system must maintain a compact internal state.

Another class of research, such as ACON, directly defines context compression as a core system problem for long-horizon Agents.

This shows that:

context engineering is becoming the new schema design

harness engineering is becoming the new transaction system

A new data abstraction layer is taking shape.

4. Training Is Becoming a New Data Infrastructure Problem

Another change that is happening but often overlooked is that training itself is changing from a one-time offline process into a continuous online process.

In traditional machine learning systems, model training happens before deployment, while databases manage business state after deployment.

But in Agent systems:

retrieval
reflection
tool feedback
test-time learning

are continuously changing model behavior itself.

This means memory is no longer merely an auxiliary component during inference. It is becoming part of the training process.

When learning becomes a continuous state-evolution process, the objects managed by databases expand from business data to cognitive state, and even further to learning state.

In other words:

In the past, databases managed business state
Today, databases are beginning to manage cognitive state
In the future, databases may manage learning state

This is another important signal that the database paradigm shift is happening.

5. The Next Generation of Databases Points Toward a Unified Cognitive State Space

If relational databases solved:

multi-program shared state

then the next generation of databases needs to solve:

multi-Agent shared cognition state

This state is not a single type of data, but a multidimensional state space:

Current state (OLTP)
Historical state (OLAP)
Semantic state (Vector)
Evolutionary state (Time-series)

The database community is already moving in this direction.

PVLDB 2025 research on vector databases explicitly defines vector stores as the long-term memory substrate of LLM systems.

GraphRAG research is beginning to extend retrieval into structured reasoning paths, rather than limiting it to semantic matching.

When data infrastructure needs to support tens of thousands, hundreds of millions, or even trillions of Agents, databases will for the first time become cognitive infrastructure.

Systems must support:

decision trace
reasoning history
tool invocation lineage
state evolution timeline

Permission models will also change:

from

user -> table

agent -> memory region
agent -> tool state
agent -> world model fragment

The role of databases is changing from:

data storage layer

to:

cognition state infrastructure layer

Markdown will not disappear, but it will not be the destination.

It is more likely to become the entry format before the next-generation data abstraction layer appears, and evolve along this path:

Markdown

-> metadata

-> chunking

-> embedding

-> vector index

-> relational state

-> agent memory substrate

In this process:

Markdown changes from storage format to interface format.

The real data layer still requires a new database abstraction.

6. Databases Have Not Disappeared; They Have Entered a New Position

File systems did not disappear. They simply moved underneath databases.

Relational databases will not disappear either. They may be moving underneath Agent memory.

Markdown is not the endpoint of databases. It is only the entry format before the next layer of databases is born.

Databases are not dead.

The paradigm has changed.

Databases Are Not Dead; the Paradigm Has Changed

1. Relational Databases Were Not a Technical Upgrade, but a New Layer of Abstraction

2. Markdown Is Repeating the Role File Systems Once Played

3. New Constraints in the Agent Era Have Appeared and Are Rapidly Amplifying

4. Training Is Becoming a New Data Infrastructure Problem

5. The Next Generation of Databases Points Toward a Unified Cognitive State Space

6. Databases Have Not Disappeared; They Have Entered a New Position

Previous / Next

Did Your Agent Crash? Did the Context Window Overflow? Did It Forget Everything After Switching Models?

Memoria Backup and Restore Is Now Available: Your Agent Finally Has Save Points

Related Posts

MatrixOne Git4Data Deep Dive (Part 4): The Data-Versioning Landscape — How MatrixOne, lakeFS, DVC, Neon, and Dolt Actually Differ

MatrixOne Git4Data Deep Dive (Part 4): The Data-Versioning Landscape — How MatrixOne, lakeFS, DVC, Neon, and Dolt Actually Differ

MatrixOne Git4Data Deep Dive (Part 3): Under the Hood — Why Snapshot, Diff, and Merge Are This Fast

MatrixOne Git4Data Deep Dive (Part 3): Under the Hood — Why Snapshot, Diff, and Merge Are This Fast

MatrixOne Git4Data Deep Dive (Part 2): From Zero, Through Every Git Primitive

MatrixOne Git4Data Deep Dive (Part 2): From Zero, Through Every Git Primitive