npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2025 – Pkg Stats / Ryan Hefner

cluster-task-sharder

v0.0.10

Published

A task distribution and coordination library for use in clusters

Downloads

3

Readme

Cluster Task Sharder

Overview

This module implements a simple cluster task distribution mechanism. The task orchestration is done via native locks on the file system, which greatly simplifies deployment - you only need a shared data directory (e.g. an NFS share), no centralized databased or messaging infrastructure needed. The attribute mtime to check for stale locks, which is almost universally supported across filesytems.

There are two main features availabe: exclusive task attribution and sharding range attribution.

Exclusive task attribution

These are tasks that, at any given time, should have only one instance running. Simply call isTaskMine("myTaskName") with an arbitrary task name, and the task will be immediately assigned to a single instance. There's no other setup needed.

For example, imagine your system needs to periodically fetch weather information from an external source. It would be sufficient for one instance to fetch the information and update a database:

const cluster = require('cluster-task-sharder');

function tryWeatherUpdate(){
    if (cluster.isTaskMine(`weatherUpdate`)){
        // check and persist the information
    }
}

cluster.join({sharedDirectory: '/tmp/cluster'});
setTimeout(tryWeatherUpdate, 1000);

Sharding range attribution

These are tasks that run concurrently across instances, but with each instance attending to a specific shard range. This allows you to share the load across the computing resources instead of limiting to a single instance.

For example, imagine a simple REST API that accepts new emails, and later tries do send them asynchronously. Each new email received via the REST endpoint would be persisted with a random shard index (available by calling getNewShardIndex()). A background task would periodically sweep the pending emails within the shard range attributed for that instance (available by calling getMyShardRange()). This ensures the load can be evenly distributed across instances without competing for emails:

const cluster = require('cluster-task-sharder');

function createEmail(email){
    email.shardIndex = cluster.getNewShardIndex();
    // persist email
}
const cluster = require('cluster-task-sharder');

function sendPendingEmails(){
    let myShardRange = cluster.getMyShardRange();
    // send emails that are within this shard range (e.g. 0 to 1000)
}

cluster.join({sharedDirectory: '/tmp/cluster'});
setTimeout(sendPendingEmails, 1000);

Configuration

The module behaviour can be customized by passing options to the join({}) function.

  • sharedDirectory: the directory shared across instances (e.g. mounted NFS share). Default: /tmp/cluster.
  • heartbeatInterval: the interval between heartbeats in ms. Default: 500.
  • heartbeatTimeout: timeout in ms after which tasks are reassigned to other instances. Default: 2000.
  • checkInterval: interval in ms between checks for stale instances. Default: 1000.
  • pauseAfterWorkersetChange: interval in ms between a change in available instances and redistribution of tasks/sharding ranges. Allow for enough time so that any ongoing tasks can be finished before being redistributed. Default: 5000.

Important

From a reliability standpoing, it is important to assure your tasks are idempotent, and that eventual concurrent execution won't cause corruption of data. This is important in the event that one instance misbehaves (e.g. stops heartbeating and keeps executing a given task).

Credits

icon by alvianwijaya from the Noun Project