NeorunBase

NeorunBase: OLTP, Vector, Full-Text & Graph in One SQL Engine

NeorunBase Key Features

A PostgreSQL-compatible multi-modal Lakebase that handles relational, JSONB, vector, full-text, graph, and spatial data in a single distributed ACID engine.

PostgreSQL Wire Protocol

psql, JDBC, pgAdmin, LangChain, and pgvector clients connect with zero code changes. Move from PostgreSQL to a distributed engine without rewriting applications.

Distributed ACID + Self-Healing

Hash-based sharding with online rebalancing, ZooKeeper sticky leader election, automatic shard / disk recovery, and 2PC distributed transactions — all on a RocksDB-backed ACID core.
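
As a rough illustration of hash-based routing, the sketch below maps a row key to a shard by hashing it and taking the remainder. The function name `shard_for_key`, the choice of SHA-256, and the modulo placement are assumptions made for illustration only; NeorunBase's actual placement and rebalancing scheme is not described on this page.

```python
import hashlib


def shard_for_key(key: str, num_shards: int) -> int:
    """Map a key to a shard id in [0, num_shards) using a stable hash.

    Hypothetical helper: illustrates hash sharding in general,
    not NeorunBase's internal routing.
    """
    if num_shards <= 0:
        raise ValueError("num_shards must be positive")
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    # Take the first 8 bytes as an unsigned integer so the result is
    # identical across processes and machines (unlike Python's hash()).
    return int.from_bytes(digest[:8], "big") % num_shards


if __name__ == "__main__":
    for key in ("order:1001", "order:1002", "user:42"):
        print(key, "-> shard", shard_for_key(key, 8))
```

Plain modulo placement moves most keys when the shard count changes, which is one reason production systems typically layer a rebalancing scheme on top of the hash.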

Multi-Modal in One SQL

Combine relational, JSONB, vector, full-text, graph, and spatial data inside a single ACID SQL statement. WHERE filters, JOINs, and weighted score blending run in one query, with no separate stores and no glue ETL.

Vector Search (pgvector compatible)

pgvector-compatible VECTOR type with distance operators (<->, <#>, <=>) and distributed HNSW ANN indexes. Rows, metadata, and embeddings live in the same ACID transaction — ready as a RAG retrieval backend.
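
For readers new to the three operators, the sketch below shows what each one computes under pgvector's definitions: `<->` is Euclidean (L2) distance, `<#>` is the negative inner product, and `<=>` is cosine distance. The Python function names are illustrative, and the code is a plain reference computation, not how an HNSW index evaluates distances.

```python
import math


def _check(a, b):
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")


def l2_distance(a, b):
    """Equivalent of a <-> b: Euclidean distance."""
    _check(a, b)
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def negative_inner_product(a, b):
    """Equivalent of a <#> b: inner product, negated so smaller means closer."""
    _check(a, b)
    return -sum(x * y for x, y in zip(a, b))


def cosine_distance(a, b):
    """Equivalent of a <=> b: 1 minus cosine similarity."""
    _check(a, b)
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine distance is undefined for a zero vector")
    return 1.0 - dot / (norm_a * norm_b)


if __name__ == "__main__":
    q, d = [1.0, 0.0], [0.0, 1.0]
    print(l2_distance(q, d), negative_inner_product(q, d), cosine_distance(q, d))
```

All three are "smaller is closer", which is why an ascending ORDER BY on the operator returns nearest neighbours first.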

Full-Text (Lucene BM25)

Lucene-backed BM25 inverted index with PostgreSQL-compatible FTS syntax (@@, ts_rank) and multi-language tokenization. Hybrid weighted retrieval (BM25 + vector ANN) is expressible as a single SQL statement.
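
One common way to blend a BM25 score with a vector similarity is to normalise each score set to [0, 1] and take a weighted sum. The sketch below shows that idea outside the database. The names `min_max` and `hybrid_rank`, the min-max normalisation, and the single `text_weight` parameter are assumptions for illustration; the page does not specify the blending formula NeorunBase applies.

```python
def min_max(scores):
    """Rescale a {doc_id: score} mapping to the range [0, 1]."""
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        # All scores equal: treat every document as equally relevant.
        return {doc: 1.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def hybrid_rank(bm25, vector_sim, text_weight=0.5):
    """Blend text and vector scores; higher blended score ranks first.

    bm25 and vector_sim map doc_id -> score, where higher is better.
    A document missing from one side contributes 0 for that side.
    """
    if not 0.0 <= text_weight <= 1.0:
        raise ValueError("text_weight must be between 0 and 1")
    text = min_max(bm25) if bm25 else {}
    vec = min_max(vector_sim) if vector_sim else {}
    blended = {
        doc: text_weight * text.get(doc, 0.0)
        + (1.0 - text_weight) * vec.get(doc, 0.0)
        for doc in set(text) | set(vec)
    }
    # Sort by score descending, then by id so ties are deterministic.
    return sorted(blended.items(), key=lambda kv: (-kv[1], kv[0]))


if __name__ == "__main__":
    bm25 = {"a": 2.0, "b": 1.0, "c": 0.0}
    vec = {"a": 0.0, "b": 1.0, "c": 0.5}
    for doc, score in hybrid_rank(bm25, vec, text_weight=0.5):
        print(doc, round(score, 3))
```

Note that vector distances are "smaller is closer", so a distance would need to be converted to a similarity (for example `1 - cosine_distance`) before it is passed in.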

Graph Traversal

Run BFS over edge tables with the GRAPH_NEIGHBORS(...) table-valued function, with depth, relationship type, and result cap controlled by function arguments. It composes with JOINs, WHERE, and vector / FTS operators, and serves as the retrieval backend for ontologies, knowledge graphs, and agent workflows.
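
To make the traversal semantics concrete, here is a small in-memory sketch of breadth-first search over edge rows with a depth limit, an optional relationship-type filter, and a result cap. The function `bfs_neighbors` and its parameter names are hypothetical; the actual argument list of GRAPH_NEIGHBORS is elided on this page and is not reproduced here.

```python
from collections import deque


def bfs_neighbors(edges, start, max_depth, rel_type=None, limit=None):
    """Return [(node, depth)] reachable from start, in BFS order.

    edges: iterable of (src, dst, rel_type) rows, like an edge table.
    rel_type: if given, only edges of this type are followed.
    limit: if given, stop after this many result rows.
    """
    if limit is not None and limit <= 0:
        return []
    # Build an adjacency list from the (filtered) edge rows.
    adjacency = {}
    for src, dst, rel in edges:
        if rel_type is None or rel == rel_type:
            adjacency.setdefault(src, []).append(dst)

    visited = {start}
    queue = deque([(start, 0)])
    result = []
    while queue:
        node, depth = queue.popleft()
        if depth >= max_depth:
            continue  # do not expand beyond the depth limit
        for nxt in adjacency.get(node, []):
            if nxt in visited:
                continue  # each node is reported once, at its shortest depth
            visited.add(nxt)
            result.append((nxt, depth + 1))
            if limit is not None and len(result) >= limit:
                return result
            queue.append((nxt, depth + 1))
    return result


if __name__ == "__main__":
    rows = [
        ("a", "b", "knows"),
        ("b", "c", "knows"),
        ("a", "d", "works_with"),
        ("c", "a", "knows"),
    ]
    print(bfs_neighbors(rows, "a", max_depth=2, rel_type="knows"))
```

The visited set is what keeps cycles such as a, b, c, a from looping, and it also guarantees that each node appears at its minimum hop count.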

Geospatial (PostGIS compatible)

PostGIS-compatible spatial functions (ST_Distance, ST_Contains, …) with Z-order spatial indexing. Location-based services, GIS, and geo-aware RAG, all from one engine.
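
Z-order indexing generally works by interleaving the bits of quantised coordinates into one integer (a Morton code), so points that are close in two dimensions tend to be close in the one-dimensional key space. The sketch below shows that encoding. The function names, the 16-bit resolution, and the longitude/latitude bounds are assumptions for illustration, not NeorunBase's actual index layout.

```python
def quantize(value, lo, hi, bits=16):
    """Map value in [lo, hi] to an integer cell in [0, 2**bits - 1]."""
    if not lo <= value <= hi:
        raise ValueError("value out of range")
    cells = (1 << bits) - 1
    return int((value - lo) / (hi - lo) * cells)


def interleave(x, y, bits=16):
    """Interleave the low bits of x and y: x in even positions, y in odd."""
    code = 0
    for i in range(bits):
        code |= ((x >> i) & 1) << (2 * i)
        code |= ((y >> i) & 1) << (2 * i + 1)
    return code


def z_order_key(lon, lat, bits=16):
    """Morton code for a longitude/latitude pair (hypothetical helper)."""
    x = quantize(lon, -180.0, 180.0, bits)
    y = quantize(lat, -90.0, 90.0, bits)
    return interleave(x, y, bits)


if __name__ == "__main__":
    print(z_order_key(126.978, 37.5665))  # a point in Seoul
    print(z_order_key(126.979, 37.5666))  # a nearby point
```

A range scan over such keys returns candidates that still need an exact check such as ST_Distance or ST_Contains, because the curve occasionally places distant cells next to each other.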

Iceberg CDC + Kafka Ingest

OLTP table changes auto-sync to Iceberg/Parquet, so Ontul reads the same data via the Iceberg catalog with no separate ETL. Ingest options include direct Kafka consumer integration, REST bulk insert, and MERGE / upsert.

Enterprise Security & Admin Console

AES-256-GCM envelope encryption (data, WAL, metadata), a built-in KMS with key rotation, IAM/RBAC + STS, and TLS for the pgwire endpoint. A React-based console and Prometheus metrics cover cluster, shards, IAM, Iceberg, and Kafka in one place.

Use Cases

AI Agent Retrieval Backend

Combine relational filters + vector ANN + BM25 + graph traversal in a single SQL statement to power RAG and agent retrieval from one database. LangChain and pgvector clients work unchanged.

Knowledge Graph & Ontology Store

Model entities and relationships as ordinary tables, then traverse them with GRAPH_NEIGHBORS BFS. RBAC, transactions, and backup come for free — no separate graph DB to operate.

Unified OLTP & Analytics

OLTP transactions auto-sync to Iceberg via CDC, and Ontul / Trino / Spark analyse the same data. No bespoke ETL or CDC pipeline to maintain.

Large-Scale Distributed DB & Real-Time Ingest

Horizontal sharding with automatic rebalancing, combined with direct Kafka ingestion, handles petabyte-scale transactional workloads.

Location-Aware & Geo RAG

Combine PostGIS-compatible spatial functions with vector embeddings for location-based recommendation, GIS, and geo-aware retrieval — all in one engine.

Considering NeorunBase for your data platform?

OLTP · Vector · FTS · Graph · Spatial — One ACID SQL Engine.

Multi-modal data from relational to graph, in one PostgreSQL-compatible engine.