@sigma-db/core

v0.14.12

Published

3 years ago

A relational database engine that incorporates some of the latest findings in database theory.

Downloads

0High
0Medium
0Low

dlw93

database theory relational databases conjunctive queries DBMS

sigmaDB

node version

sigmaDB is a relational database engine that aims to incorporate some of the latest findings in database theory. While many of the proposed approaches are provably optimal in some theoretical sense, it usually remains an open question how the performance would be in practice. One such approach is the Tetris join algorithm introduced in Joins via Geometric Resolutions: Worst Case and Beyond by Khamis et al, which we implement to evaluate joins.

We developed sigmaDB in a manner that facilitates easy modification of almost any aspect of the database engine, be it query parsing, transaction logging or join evaluation. Still, please keep in mind that this is a research project and as such lacks many features of a full-fledged RDBMS (cf. Limitations).

Build and Installation

Prerequisites

In order to run sigmaDB, you need Node.js 14 or newer to be present on your system.

Installation

To install the package from npm, run npm install -g @sigma-db/core. Thereafter, you can run sigmaDB from the command line with sigma --database=</path/to/database>.

Custom Build

If you want to clone the repository and build sigmaDB from source by yourself, follow these teps:

Clone the project with git clone https://dev.azure.com/dlw/sigmaDB/_git/core.
From within the project directory, run npm install to download build dependencies and build the project.
To make the package accessible from other projects and the command line, run npm link.

You can now use sigmaDB as described in Installation.

Query Language

We discern four types of queries, whose syntax we outline by example:

To create a new relation Employee with attributes id, name, salary and boss of types int, string, int and int, respectively, write Employee: (id: int, name: string, salary: int, boss: int). Supported data types are int, string, char and bool.
To insert a tuple into the (existing) relation Employee, write Employee(id=1, name="Sam", salary=4200, boss=0). Alternatively, the less verbose syntax Employee(1, "Sam", 4200, 0) can be used. Please note that in the latter case, the order of the attributes matters, while it does not in the former.
To select all employees' IDs whose salary is 4,200, write either (name=x) <- Employee(name=x, salary=4200) or (name=y) <- Employee(x, y, 4200, z). Again, the order the attributes appear in only matters for the second form. In addition, attributes that are not required to formulate the query may be omitted in the named syntax, while they have to be explicitly mentioned in the unnamed syntax.
To print the schema of the database or a specific relation, write ? or <rel>?, respectively, where <rel> is the name of the relation to get the schema of.

In addition, we allow <rel>! as a shortcut for a select-query that simply yields <rel> itself.

Usage as a Library

The following script creates a database with two relations Employee and Division, inserts some tuples and selects all employees and their respective division head.

% create tables
Employee: (
    id:       int,
    name:     string(32),
    salary:   int,
    division: int
)
Division: (
    id:   int,
    name: string(64),
    head: int
)

% insert into tables
Division(0, "Research and Development", 1)
Division(1, "Marketing", 0)
Employee(0, "Jack Walls", 8360, 1)
Employee(1, "Nicole Smith", 7120, 0)
Employee(2, "Joan Waters", 2700, 0)
Employee(3, "David Brown", 4200, 1)
Employee(4, "Marc Wilson", 4200, 1)

% ask a query
Order(master=x, servant=y) <-
    Employee(name=x, division=z, id=u),
    Employee(name=y, division=z),
    Division(id=z, head=u)

Assuming the script is stored in a file named employees.cqs, we can evaluate it using sigmaDB as follows:

import { pipeline } from "stream";
import { Engine, Instance, Parser } from "@sigma-db/core";

const database = Instance.create();
const parser = Parser.create({ schema: database.schema });
const engine = Engine.create({ instance: database });

pipeline(
    process.stdin,
    parser,
    engine,
    process.stdout,
    err => err && socket.write(err.message),
);

database.close();

Now, run the above server and feed in some input by running cat employees.cqs | node test.js assuming test.js to be the result of the compiled TypeScript script above.

Limitations

There are quite a few functionalities essential to a real-world DBMS we do not support as of now, including:

aggregation queries
concurrent access
parallel/distributed query evaluation
...and much more

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme