
NeorunBase Key Features
A PostgreSQL-compatible multi-modal Lakebase that handles relational, JSONB, vector, full-text, graph, and spatial data in a single distributed ACID engine.
PostgreSQL Wire Protocol
psql, JDBC, pgAdmin, LangChain, and pgvector clients connect with zero code changes. Move from PostgreSQL to a distributed engine without rewriting applications.
Distributed ACID + Self-Healing
Hash-based sharding with online rebalancing, ZooKeeper sticky leader election, automatic shard / disk recovery, and 2PC distributed transactions — all on a RocksDB-backed ACID core.
Multi-Modal in One SQL
Combine relational, JSONB, vector, full-text, graph, and spatial data inside a single ACID SQL statement. WHERE · JOIN · weighted blending in one query — no separate stores, no glue ETL.
Vector Search (pgvector compatible)
pgvector-compatible VECTOR type with distance operators (<->, <#>, <=>) and distributed HNSW ANN indexes. Rows, metadata, and embeddings live in the same ACID transaction — ready as a RAG retrieval backend.
Full-Text (Lucene BM25)
Lucene-backed BM25 inverted index with PostgreSQL-compatible FTS syntax (@@, ts_rank) and multi-language tokenization. Hybrid weighted retrieval (BM25 + vector ANN) is expressible as a single SQL statement.
Graph Traversal
Run BFS over edge tables with the GRAPH_NEIGHBORS(...) table-valued function — depth, relationship-type, and result-cap controlled as function arguments. Composes with JOINs, WHERE, vector / FTS operators, and serves as the retrieval backend for ontologies, knowledge graphs, and agent workflows.
Geospatial (PostGIS compatible)
PostGIS-compatible spatial functions (ST_Distance, ST_Contains, …) with Z-order spatial indexing. Location-based services, GIS, and geo-aware RAG, all from one engine.
Iceberg CDC + Kafka Ingest
OLTP table changes auto-sync to Iceberg/Parquet — Ontul reads the same data via the Iceberg catalog, no separate ETL. Direct Kafka consumer integration plus REST bulk insert and MERGE / upsert.
Enterprise Security & Admin Console
AES-256-GCM envelope encryption (data, WAL, metadata), built-in KMS with key rotation, IAM/RBAC + STS, TLS for the pgwire endpoint. A React-based console + Prometheus metrics covers cluster, shards, IAM, Iceberg, and Kafka in one place.
Use Cases
AI Agent Retrieval Backend
Combine relational filters + vector ANN + BM25 + graph traversal in a single SQL statement to power RAG and agent retrieval from one database. LangChain and pgvector clients work unchanged.
Knowledge Graph & Ontology Store
Model entities and relationships as ordinary tables, then traverse them with GRAPH_NEIGHBORS BFS. RBAC, transactions, and backup come for free — no separate graph DB to operate.
Unified OLTP & Analytics
OLTP transactions auto-sync to Iceberg via CDC, and Ontul / Trino / Spark analyse the same data. No bespoke ETL or CDC pipeline to maintain.
Large-Scale Distributed DB & Real-Time Ingest
Horizontal sharding with automatic rebalancing plus direct Kafka ingestion handles petabyte-scale transactional workloads.
Location-Aware & Geo RAG
Combine PostGIS-compatible spatial functions with vector embeddings for location-based recommendation, GIS, and geo-aware retrieval — all in one engine.
Considering NeorunBase for your data platform?
OLTP · Vector · FTS · Graph · Spatial — One ACID SQL Engine.
Multi-modal data from relational to graph, in one PostgreSQL-compatible engine.
