jude-seaweedfs
v1.1.4
Node.js client for seaweed-fs, a distributed file store
node-seaweedfs (weed-fs)
This project is a Node.js client library for the SeaweedFS REST interface. It is a rewrite of cruzrr's implementation that uses Promises for better error handling. The tests have also been rewritten to use mocha and to check more error cases. The module accepts readable streams as input when writing to SeaweedFS and can pipe fetched files into writable streams.
This module requires at least node 0.12 to enable native Promises.
What is SeaweedFS?
SeaweedFS is a simple and highly scalable distributed file system. It focuses on two objectives:
- storing billions of files!
- and serving them fast!
SeaweedFS chose to implement only a key~file mapping instead of supporting full POSIX file system semantics. This can be called "NoFS". (Similar to "NoSQL")
Instead of managing all file metadata in a central master, SeaweedFS manages file volumes in the central master, and allows volume servers to manage files and the metadata. This relieves concurrency pressure from the central master and spreads file metadata into memory on the volume servers allowing faster file access with just one disk read operation!
SeaweedFS is modeled after Facebook's Haystack design paper and needs only 40 bytes of disk storage for each file's metadata. It is so simple, with O(1) disk reads, that anyone is more than welcome to challenge the performance with actual use cases.
Install
npm install node-seaweedfs
Basic Usage
var weedClient = require("node-seaweedfs");
var seaweedfs = new weedClient({
server: "localhost",
port: 9333
});
seaweedfs.write("./file.png").then(function(fileInfo) {
return seaweedfs.read(fileInfo.fid);
}).then(function(buffer) {
//do something with the buffer
}).catch(function(err) {
//error handling
});
Test
Adjust test/testconf.js to your needs and run:
gulp test
If you want to create new tests, this watch task might be handy:
gulp
API
write(file(s), [{opts}])
Use the write() function to store files. The returned promise resolves with the parsed JSON response.
Anything passed to the {opts} is made into a query string and is used with the /dir/assign HTTP request. You can use this to define the replication strategy.
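// pass the replication setting as a string; a bare 000 is the number 0 and loses its leading zeros
client.write("./file.png", {replication: "000"}).then(function(fileInfo) {
console.log(fileInfo);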
}).catch(function(err) {
//error handling
});
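To illustrate how the options end up in the request, here is a minimal sketch that turns an opts object into a query string for /dir/assign. The helper name buildAssignUrl is hypothetical and not part of this module; the sketch only assumes that the options are serialized the way Node's built-in querystring module does it.

```javascript
var querystring = require("querystring");

// Hypothetical helper (not part of this module): builds the /dir/assign URL
// that a write() call with the given opts would roughly correspond to.
function buildAssignUrl(server, port, opts) {
    var query = querystring.stringify(opts || {});
    var url = "http://" + server + ":" + port + "/dir/assign";
    // only append "?" when there is something to send
    return query ? url + "?" + query : url;
}

console.log(buildAssignUrl("localhost", 9333, {replication: "001"}));
// prints "http://localhost:9333/dir/assign?replication=001"
```

Because every value is converted to its string form, a replication setting such as "001" should be passed as a string so that its leading zeros survive.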
Instead of a path you can also pass a buffer or a stream:
//using a Buffer
// Buffer.from needs Node.js 4.5 or newer; on older versions use new Buffer("atroo")
client.write(Buffer.from("atroo")).then(function(fileInfo) {
// a single write stores a single file, addressed by fileInfo.fid
console.log(fileInfo.fid);
}).catch(function(err) {
//error handling
});
//using a Stream
client.write(getReadableStreamSomeHow()).then(function(fileInfo) {
// a single write stores a single file, addressed by fileInfo.fid
console.log(fileInfo.fid);
}).catch(function(err) {
//error handling
});
You can also write multiple files:
client.write(["./fileA.jpg", "./fileB.jpg"]).then(function(fileInfo) {
// All files share the same fid. To access each variation, just
// add _ARRAYINDEX to the end of the fid. In this case fileB
// would be: fid + "_1"
var fidA = fileInfo.fid;
var fidB = fileInfo.fid + "_1";
console.log(fidA, fidB);
}).catch(function(err) {
//error handling
});
For multiple files, any combination of paths, Buffers or Streams is allowed.
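The _ARRAYINDEX rule described above can be sketched as a small helper that lists the fid of every file in a batch. The function name fidsForBatch and the sample fid are made up for illustration; the only thing taken from this README is that the first file uses the plain fid and later files append "_" plus their array index.

```javascript
// Hypothetical helper (not part of this module): derives the fid of each
// file in a multi-file write from the shared base fid.
function fidsForBatch(fid, count) {
    var fids = [];
    for (var i = 0; i < count; i++) {
        // index 0 keeps the plain fid, later files get "_1", "_2", ...
        fids.push(i === 0 ? fid : fid + "_" + i);
    }
    return fids;
}

// "3,01637037d6" is only a placeholder fid
console.log(fidsForBatch("3,01637037d6", 3));
```

Each resulting fid can then be passed to read() on its own.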
read(fileId, [stream])
The read function supports streaming. To use it, pass a writable stream as the second argument:
client.read(fileId, fs.createWriteStream("read.png"));
If you prefer not to use streams, omit the second argument and the promise resolves with a Buffer:
client.read(fileId).then(function(buffer) {
//do something with the buffer
}).catch(function(err) {
//error handling
});
find(file)
This function can be used to find the location(s) of a file in the cluster.
client.find(fileId).then(function(json) {
console.log(json.locations);
});
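As a sketch of what the locations can be used for, the helper below turns a lookup result into direct download URLs. It assumes that each entry in json.locations carries a publicUrl field of the form host:port, as in SeaweedFS's volume lookup response; the helper name fileUrls and the sample data are hypothetical.

```javascript
// Hypothetical helper (not part of this module): maps a find() result
// to one download URL per volume server that holds the file.
// Assumption: each location looks like { publicUrl: "host:port", ... }.
function fileUrls(lookup, fileId) {
    return (lookup.locations || []).map(function(location) {
        return "http://" + location.publicUrl + "/" + fileId;
    });
}

// made-up sample data for illustration
var sampleLookup = {
    locations: [{ publicUrl: "localhost:8080", url: "localhost:8080" }]
};
console.log(fileUrls(sampleLookup, "3,01637037d6"));
```

If the response of your SeaweedFS version uses different field names, adjust the property access accordingly.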
remove(file)
This function will delete a file from all locations.
client.remove(fileId).then(function() {
console.log("removed file");
}).catch(function(err) {
console.log("could not remove " + fileId);
});
masterStatus()
This function will query the master server for status information. The promise resolves with an object that describes which master server is the leader and which master servers are available.
client.masterStatus().then(function(status) {
console.log(status);
});
systemStatus()
This function will query the master server for information about the current topology and available storage layouts.
client.systemStatus().then(function(status) {
console.log(status);
});
volumeStatus(host)
This function will query an individual volume server for information about the volumes on this server.
client.volumeStatus("127.0.0.1:8080").then(function(status) {
console.log(status);
});
vacuum(opts)
This function will force the master server to perform garbage collection on volume servers.
Force Garbage Collection
If your system has many deletions, the disk space of deleted files is not reclaimed synchronously. A background job checks volume disk usage. If the share of empty space exceeds the threshold, which defaults to 0.3, the vacuum job makes the volume read-only, creates a new volume containing only the existing files, and switches over to the new volume. If you are impatient or doing some testing, you can vacuum the unused space this way:
client.vacuum({garbageThreshold: 0.4}).then(function(status) {
console.log(status);
});
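The threshold rule described above can be sketched as a simple ratio check. This is only an illustration of the idea, not SeaweedFS's actual implementation; the function name shouldVacuum and its byte-count parameters are assumptions.

```javascript
// Hypothetical illustration of the threshold rule: a volume qualifies for
// vacuuming once the share of space held by deleted files exceeds the threshold.
function shouldVacuum(deletedBytes, totalBytes, threshold) {
    if (threshold === undefined) {
        threshold = 0.3; // default mentioned in this README
    }
    if (totalBytes <= 0) {
        return false; // an empty volume has nothing to reclaim
    }
    return deletedBytes / totalBytes > threshold;
}

console.log(shouldVacuum(40, 100));      // 40% empty, default threshold 0.3
console.log(shouldVacuum(35, 100, 0.4)); // 35% empty, threshold 0.4
```

With garbageThreshold: 0.4, as in the call above, a volume that is 35% empty would be left alone, while the default of 0.3 would compact it.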
License
Copyright (c) 2015, atroo GbR
Permission to use, copy, modify, and/or distribute this software for any purpose with or without fee is hereby granted, provided that the above copyright notice and this permission notice appear in all copies.
THE SOFTWARE IS PROVIDED "AS IS" AND THE AUTHOR DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE INCLUDING ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS. IN NO EVENT SHALL THE AUTHOR BE LIABLE FOR ANY SPECIAL, DIRECT, INDIRECT, OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE.