dwebfs

v2.0.0

Published

3 years ago

dwebfs is a secure, real time distributed file system

Downloads

0High
0Medium
0Low

shikhars3711

dwebfs dwtrie

DWebFs

DWebFs is a secure, real-time distributed file system designed for easy P2P file sharing.

It has a handful of cool features:

Version Controlled: Files are versioned by default, making it easy to see historical changes and prevent data loss.
Composable: Using our mount system, Hyperdrives can be nested within other Hyperdrives, enabling powerful multi-user collaboration tools.
Shareable with One Link: You can share an entire DWebFs with others by sending them a single 32-byte key. If you'd like more granularity, our mount system enables the fine-grained sharing of specific directories.
Sparse Downloading By default, readers only download the portions of files they need, on demand. You can stream media from friends without jumping through hoops! Seeking is snappy and there's no buffering.
Fast Lookups: File metadata is stored in a distributed trie structure, meaning files can be located with minimal network lookups.
Version Tagging: You can assign string names to DWebFs versions and store these within the drive, making it straightforward to switch between semantically-meaningful versions.

DWebFs can also be used in a variety of ways:

The Daemon: The DWebFs daemon provides both a gRPC API for managing remote Hyperdrives, and a FUSE API that turns Hyperdrives into normal folders on your computer.
The Client: A Node.js client for the daemon. With this you can build services targeting remote drives.
DBrowser: An experimental browser that has first-class support for DWebFs.
Standalone: DWebFs has flexible storage/networking interfaces, making it easy to embed within larger projects.

Installation

If you're looking for a "batteries included" experience, check out the DWebFs daemon.

For standalone use in your modules, you can install through NPM:

npm install dwebfs

Usage

DWebFs aims to implement the same API as Node.js' core fs module, and mirrors many POSIX APIs.

var dwebfs = require('dwebfs')
var drive = dwebfs('./my-first-dwebfs') // content will be stored in this folder

drive.writeFile('/hello.txt', 'world', function (err) {
  if (err) throw err
  drive.readdir('/', function (err, list) {
    if (err) throw err
    console.log(list) // prints ['hello.txt']
    drive.readFile('/hello.txt', 'utf-8', function (err, data) {
      if (err) throw err
      console.log(data) // prints 'world'
    })
  })
})

Hyperdrives can easily be replicated to other machines over any stream-based transport layer!

var net = require('net')

// ... on one machine

var server = net.createServer(function (socket) {
  socket.pipe(drive.replicate()).pipe(socket)
})

server.listen(10000)

// ... on another

var clonedDrive = dwebfs('./my-cloned-dwebfs', origKey)
var socket = net.connect(10000)

socket.pipe(clonedDrive.replicate()).pipe(socket)

It also comes with build in versioning, live replication (where the replication streams remain open, syncing new changes), and nested DWebFs mounting. See more below.

API

`var drive = dwebfs(storage, [key], [options])`

Create a new DWebFs.

The storage parameter defines how the contents of the drive will be stored. It can be one of the following, depending on how much control you require over how the drive is stored.

If you pass in a string, the drive content will be stored in a folder at the given path.
You can also pass in a function. This function will be called with the name of each of the required files for the drive, and needs to return a random-access-storage instance.
If you require complete control, you can also pass in a dwebx instance (or an API-compatible replacement).
- name: the name of the file to be stored
- opts
  - key: the feed key of the underlying DDatabase instance
  - discoveryKey: the discovery key of the underlying DDatabase instance
- drive: the current DWebFs instance

Options include:

{
  sparse: true, // only download data on content feed when it is specifically requested
  sparseMetadata: true // only download data on metadata feed when requested
  extensions: [], // The list of extension message types to use
}

For more storage configuration, you can also provide any dwebx option.

Note that a cloned dwebfs drive is fully "sparse" by default, meaning that the sparse and sparseMetadata options are both true. This is usually the best way to use DWebFs, but you can also set these options to false to enable eager downloading of both the content and the metadata. If you'd like more control over download strategies, you can use the download method directly.

Replication

DWebFs replication occurs through streams, meaning you can pipe a drive's replication stream into any stream-based transport system you'd like. If you have many nested Hyperdrives mounted within a parent drive, replicate will sync all children as well.

`var stream = drive.replicate([options])`

Replicate this drive. Options include

{
  live: false, // keep replicating,
  encrypt: true // Enable NOISE encryption.
}

Public Fields

`drive.version`

Get the current version of the drive (incrementing number).

`drive.key`

The public key identifying the drive.

`drive.discoveryKey`

A key derived from the public key that can be used to discovery other peers sharing this drive.

`drive.writable`

A boolean indicating whether the drive is writable.

`drive.peers`

A list of peers currently replicating with this drive

Lifecycle Events

`drive.on('ready')`

Emitted when the drive is fully ready and all properties has been populated.

`drive.on('error', err)`

Emitted when a critical error during load happened.

`drive.on('update')`

Emitted when there is a new update to the drive.

`drive.on('peer-add', peer)`

Emitted when a new peer has been added.

const drive = DWebFs()

drive.on('peer-add', (peer) => {
  console.log('Connected peer', peer.remotePublicKey)
})

`drive.on('peer-open', peer)`

Emitted when a peer has been added and has finished handshaking.

`drive.on('peer-remove', peer)`

Emitted when a peer has been removed.

`drive.on('close')`

Emitted when the drive has been closed.

Extension Management

DWebFs supports ddatabase extensions, letting you plug custom logic into a drive's replication streams.

`ext = drive.registerExtension(name, handlers)`

{
  encoding: 'json' | 'binary' | 'utf-8' | anyAbstractEncoding,
  onmessage (message, peer) {
    // called when a message is received from a peer
    // will be decoded using the encoding you provide
  },
  onerror (err) {
    // called in case of an decoding error
  }
}

`ext.send(message, peer)`

Send an extension message to a specific peer.

`ext.broadcast(message)`

Send a message to every peer you are connected to.

Version Control

Since DWebFs is built on top of append-only logs, old versions of files are preserved by default. You can get a read-only snapshot of a drive at any point in time with the checkout function, which takes a version number. Additionally, you can tag versions with string names, making them more parseable.

`var oldDrive = drive.checkout(version, [opts])`

Checkout a readonly copy of the drive at an old version. Options for the checkout are duplicated from the parent by default, but you can also pass in additional DWebFs options.

`drive.createTag(name, [version], cb)`

Create a tag that maps to a given version. If a version is not provided, the current version will be used.

Tags are stored inside the drive's "hidden trie," meaning they're not enumerable using DWebFs's standard filesystem methods. They will replicate with all the other data in the drive, though.

`drive.getTaggedVersion(name, cb)`

Return the version corresponding to a tag.

Combined with checkout, this lets you navigate between tagged versions.

`drive.deleteTag(name, cb)`

Delete a tag. If the tag doesn't exist, this will be a no-op.

`drive.getAllTags(cb)`

Return a Map of all tags. The Map will be of the form:

{
  name => version
}

Downloading

In sparse mode (which is the default), data will be downloaded from peers on-demand. If you'd like more control over this, you can use the download function to explicitly mark certain files/directory for immediate downloading.

`drive.download([path], [callback])`

Download all files in path of current version. If no path is specified this will download all files.

You can use this with .checkout(version) to download a specific version of the drive.

drive.checkout(version).download()

Reading and Writing

`var stream = drive.createReadStream(name, [options])`

Read a file out as a stream. Similar to fs.createReadStream.

Options include:

{
  start: optionalByteOffset, // similar to fs
  end: optionalInclusiveByteEndOffset, // similar to fs
  length: optionalByteLength
}

`drive.readFile(name, [options], callback)`

Read an entire file into memory. Similar to fs.readFile.

Options can either be an object or a string

Options include:

{
  encoding: string
}

or a string can be passed as options to simply set the encoding - similar to fs.

`var stream = drive.createWriteStream(name, [options])`

Write a file as a stream. Similar to fs.createWriteStream. If options.cached is set to true, this function returns results only if they have already been downloaded. options.metadata is optionally an object with string keys and buffer objects to set metadata on the file entry.

`drive.writeFile(name, buffer, [options], [callback])`

Write a file from a single buffer. Similar to fs.writeFile.

`drive.unlink(name, [callback])`

Unlinks (deletes) a file. Similar to fs.unlink.

`drive.mkdir(name, [options], [callback])`

Explictly create an directory. Similar to fs.mkdir

`drive.rmdir(name, [callback])`

Delete an empty directory. Similar to fs.rmdir.

`drive.readdir(name, [options], [callback])`

Lists a directory. Similar to fs.readdir.

Options include:

{
    recursive: false, // Recurse into subdirectories and mounts
    noMount: false // Do not recurse into mounts when recursive: true
}

`drive.stat(name, [options], callback)`

Stat an entry. Similar to fs.stat. Sample output:

Stat {
  dev: 0,
  nlink: 1,
  rdev: 0,
  blksize: 0,
  ino: 0,
  mode: 16877,
  uid: 0,
  gid: 0,
  size: 0,
  offset: 0,
  blocks: 0,
  atime: 2017-04-10T18:59:00.147Z,
  mtime: 2017-04-10T18:59:00.147Z,
  ctime: 2017-04-10T18:59:00.147Z,
  linkname: undefined 
}

The stat may include a metadata object (string keys, buffer values) with metadata that was passed into writeFile or createWriteStream.

The output object includes methods similar to fs.stat:

var stat = drive.stat('/hello.txt')
stat.isDirectory()
stat.isFile()
stat.isSymlink()

Options include:

{
  wait: true|false // default: true
}

If wait is set to true, this function will wait for data to be downloaded. If false, will return an error.

`drive.lstat(name, [options], callback)`

Stat an entry but do not follow symlinks. Similar to fs.lstat.

Options include:

{
  wait: true|false // default: true
}

If wait is set to true, this function will wait for data to be downloaded. If false, will return an error.

`drive.info(name, callback)`

Gets mount information about an entry.

The mount information takes the form:

{
  feed, // The metadata feed for the mountpoint.
  mountPath, // The absolute path of the entry's parent mount.
  mountInfo  // The mount metadata record
}

`drive.access(name, [options], callback)`

Similar to fs.access.

Options include:

{
  wait: true|false // default: true
}

If wait is set to true, this function will wait for data to be downloaded. If false, will return an error.

File Descriptors

If you want more control over your reads and writes, you can open file descriptors. The file descriptor API mirrors Node's descriptors. Importantly, DWebFs does not currently handle random-access writes. Similarly, appends require the previous contents of the file to be duplicated, though this all happens internally. Random-access reads, on the other hand, are fully supported and very fast.

We're still investigating more performant solutions to random-access write and appends, and it's high on our priority list!

`drive.open(name, flags, callback)`

Open a file and get a file descriptor back. Similar to fs.open.

Note that currently only read mode is supported in this API.

`drive.read(fd, buf, offset, len, position, callback)`

Read from a file descriptor into a buffer. Similar to fs.read.

`drive.write(fd, buf, offset, len, pos, cb)`

Write from a buffer into a file descriptor. Similar to fs.write.

`drive.symlink(target, linkname, cb)`

Create a symlink from linkname to target.

DWebFs Mounting

DWebFs supports "mounting" other Hyperdrives at paths within a parent drive. This means that if your friend has a photo album drive, you can nest their drive within your own by calling myDrive.mount('photos/my-friends-album', <my-friends-album-key>).

This feature is useful for composing larger collections out of smaller shareable units, or for aggregating content from many users into one aggregate drive. One pattern you might want to try is a "group" where each user has a structured drive with standard directory names within a parent (i.e. my-group/userA/docs, my-group/userB/docs). Using this pattern, it's easy to aggregate all "docs" with a recursive readdir over the group.

`drive.mount(name, key, opts, cb)`

Mounts another DWebFs at the specified mountpoint.

If a version is specified in the options, then the mountpoint will reference a static checkout (it will never update).

Options include:

{
  version: (drive version) // The drive version to checkout.
}

`drive.unmount(name, cb)`

Unmount a previously-mounted DWebFs.

`drive.createMountStream(opts)`

Create a stream containing content/metadata feeds for all mounted Hyperdrives. Each entry in the stream has the form:

{
  path: '/',                // The mountpoint
  metadata: DDatabase(...), // The mounted metadata feed
  content: DDatabase(...)   // The mounted content feed
}

`drive.getAllMounts(opts, cb)`

Returns a Map of the content/metadata feeds for all mounted Hyperdrives, keyed by their mountpoints. The results will always include the top-level feeds (with key '/').

Options include:

{
  memory: true|false // Only list drives currently cached in memory (default: false).
}

Closing

`drive.close(fd, [callback])`

Close a file. Similar to fs.close.

`drive.close([callback])`

Closes all open resources used by the drive. The drive should no longer be used after calling this.

`archive.destroyStorage([callback])`

Destroys the data stored in the archive and closes it. Does not affect mounted archives. The archive should no longer be used after calling this.

License

MIT

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

DWebFs

Installation

Usage

API

var drive = dwebfs(storage, [key], [options])

Replication

var stream = drive.replicate([options])

Public Fields

drive.version

drive.key

drive.discoveryKey

drive.writable

drive.peers

Lifecycle Events

drive.on('ready')

drive.on('error', err)

drive.on('update')

drive.on('peer-add', peer)

drive.on('peer-open', peer)

drive.on('peer-remove', peer)

drive.on('close')

Extension Management

ext = drive.registerExtension(name, handlers)

ext.send(message, peer)

ext.broadcast(message)

Version Control

var oldDrive = drive.checkout(version, [opts])

drive.createTag(name, [version], cb)

drive.getTaggedVersion(name, cb)

drive.deleteTag(name, cb)

drive.getAllTags(cb)

Downloading

drive.download([path], [callback])

Reading and Writing

var stream = drive.createReadStream(name, [options])

drive.readFile(name, [options], callback)

var stream = drive.createWriteStream(name, [options])

drive.writeFile(name, buffer, [options], [callback])

drive.unlink(name, [callback])

drive.mkdir(name, [options], [callback])

drive.rmdir(name, [callback])

drive.readdir(name, [options], [callback])

drive.stat(name, [options], callback)

drive.lstat(name, [options], callback)

drive.info(name, callback)

drive.access(name, [options], callback)

File Descriptors

drive.open(name, flags, callback)

drive.read(fd, buf, offset, len, position, callback)

drive.write(fd, buf, offset, len, pos, cb)

drive.symlink(target, linkname, cb)

DWebFs Mounting

drive.mount(name, key, opts, cb)

drive.unmount(name, cb)

drive.createMountStream(opts)

drive.getAllMounts(opts, cb)

Closing

drive.close(fd, [callback])

drive.close([callback])

archive.destroyStorage([callback])

License

`var drive = dwebfs(storage, [key], [options])`

`var stream = drive.replicate([options])`

`drive.version`

`drive.key`

`drive.discoveryKey`

`drive.writable`

`drive.peers`

`drive.on('ready')`

`drive.on('error', err)`

`drive.on('update')`

`drive.on('peer-add', peer)`

`drive.on('peer-open', peer)`

`drive.on('peer-remove', peer)`

`drive.on('close')`

`ext = drive.registerExtension(name, handlers)`

`ext.send(message, peer)`

`ext.broadcast(message)`

`var oldDrive = drive.checkout(version, [opts])`

`drive.createTag(name, [version], cb)`

`drive.getTaggedVersion(name, cb)`

`drive.deleteTag(name, cb)`

`drive.getAllTags(cb)`

`drive.download([path], [callback])`

`var stream = drive.createReadStream(name, [options])`

`drive.readFile(name, [options], callback)`

`var stream = drive.createWriteStream(name, [options])`

`drive.writeFile(name, buffer, [options], [callback])`

`drive.unlink(name, [callback])`

`drive.mkdir(name, [options], [callback])`

`drive.rmdir(name, [callback])`

`drive.readdir(name, [options], [callback])`

`drive.stat(name, [options], callback)`

`drive.lstat(name, [options], callback)`

`drive.info(name, callback)`

`drive.access(name, [options], callback)`

`drive.open(name, flags, callback)`

`drive.read(fd, buf, offset, len, position, callback)`

`drive.write(fd, buf, offset, len, pos, cb)`

`drive.symlink(target, linkname, cb)`

`drive.mount(name, key, opts, cb)`

`drive.unmount(name, cb)`

`drive.createMountStream(opts)`

`drive.getAllMounts(opts, cb)`

`drive.close(fd, [callback])`

`drive.close([callback])`

`archive.destroyStorage([callback])`