non-overlapping-periodic-job-scheduler
v3.0.2
Periodic job scheduler for Node.js projects, ensuring non-overlapping executions and deterministic termination. Features a dynamically chosen interval between executions, determined by a user-injected calculator function, which can take into account the d
The NonOverlappingPeriodicJobScheduler class implements a slim yet highly flexible periodic-job scheduler for Node.js projects, ensuring non-overlapping job executions.
The delay between executions is determined by a user-defined calculator function, allowing scheduling to be either interval-based or time-based, while also considering runtime factors known to the user.
Table of Contents
- Key Features
- Non-Overlapping Executions
- Graceful and Deterministic Termination
- Dynamic Execution Interval
- Zero Over-Engineering, No External Dependencies
- Non-Persistent Scheduling
- API
- Getter Methods
- Error Handling
- Use-case Example
- Time-Based Scheduling Policies
- Interval-Based Scheduling Policies
- Breaking Change in Version 2.0.0
- Breaking Change in Version 3.0.0
- Naming Convention
- License
Key Features :sparkles:
- Non-Overlapping Executions.
- Deterministic Termination: If the stop method is called during a job execution, it resolves only once that execution completes.
- Dynamic Interval between Executions: The delay is computed by a user-provided calculator function, so users can consider various runtime factors if required, while the scheduler remains agnostic to the user-defined scheduling policy.
- Comprehensive documentation :books:: The class is thoroughly documented, enabling IDEs to provide helpful tooltips that enhance the coding experience.
- Fully Tested :test_tube:: Extensively covered by unit tests.
- No External Runtime Dependencies: Lightweight component, only development dependencies are used.
- Non-durable scheduling: If the app crashes or goes down, scheduling stops.
- ES2020 Compatibility.
- TypeScript support.
Non-Overlapping Executions
Executions do not overlap because the (i+1)th execution is scheduled immediately after the ith execution completes. This is suitable for scenarios where overlapping executions may cause race conditions, or negatively impact performance.
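The chaining described above can be sketched as follows. This is a minimal illustration under simplifying assumptions (a fixed delay and a fixed number of rounds), not the package's actual implementation; `Job` and `runChained` are hypothetical names introduced only for this sketch.

```typescript
// Hypothetical sketch: NOT the package's implementation.
type Job = () => Promise<void>;

// Runs `job` `rounds` times. The timer for the next run is armed only after
// the previous run has settled, so two runs can never be in flight together.
async function runChained(job: Job, delayMs: number, rounds: number): Promise<void> {
  for (let i = 0; i < rounds; ++i) {
    await new Promise<void>((resolve) => setTimeout(resolve, delayMs));
    try {
      await job(); // The loop does not advance until this settles.
    } catch {
      // In this sketch, a failed run does not break the chain.
    }
  }
}
```

By contrast, a plain setInterval keeps firing on schedule regardless of whether the previous run has finished.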
Graceful and Deterministic Termination :hourglass:
This topic is often overlooked in the context of schedulers.
When stopping periodic executions, it is crucial to ensure that any potentially ongoing execution is completed before termination. This deterministic termination approach ensures that no unfinished executions leave objects in memory, which could otherwise lead to unexpected behavior.
Without deterministic termination, leftover references from incomplete executions can cause issues, such as unexpected behavior during unit tests. A clean state is necessary for each test, and ongoing jobs from a previous test can interfere with subsequent tests.
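One possible way to obtain this guarantee is to keep a reference to the in-flight execution and await it when stopping. The class below is a hypothetical, simplified sketch (`MiniScheduler` and all of its members are assumptions made for illustration); it does not reproduce the package's internals.

```typescript
// Hypothetical sketch: NOT the package's implementation.
class MiniScheduler {
  private readonly job: () => Promise<void>;
  private readonly delayMs: number;
  private timer: ReturnType<typeof setTimeout> | undefined = undefined;
  private currentExecution: Promise<void> | undefined = undefined;
  private stopped = true;

  constructor(job: () => Promise<void>, delayMs: number) {
    this.job = job;
    this.delayMs = delayMs;
  }

  public start(): void {
    this.stopped = false;
    this.armTimer();
  }

  public async stop(): Promise<void> {
    this.stopped = true;
    if (this.timer !== undefined) {
      clearTimeout(this.timer); // Cancel a pending (not yet started) execution.
      this.timer = undefined;
    }
    // If an execution is in flight, resolve only once it has settled.
    // Awaiting `undefined` resolves immediately.
    await this.currentExecution;
  }

  private armTimer(): void {
    this.timer = setTimeout(() => {
      this.timer = undefined;
      this.currentExecution = this.runOnce();
    }, this.delayMs);
  }

  private async runOnce(): Promise<void> {
    try {
      await this.job();
    } catch {
      // Errors are swallowed in this sketch.
    } finally {
      this.currentExecution = undefined;
      if (!this.stopped) {
        this.armTimer(); // Schedule the next run only after this one settled.
      }
    }
  }
}
```

In a unit test, awaiting `stop()` in the teardown hook means no job from the previous test is still running when the next test begins.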
Dynamic Execution Interval
The user provides a custom calculator function that determines the delay until the next execution, based on the runtime metadata of the just-finished execution (its duration, and the error if one was thrown).
This calculator is invoked at the end of each execution, enabling flexible interval policies based on user-defined criteria. This approach ensures that the scheduler remains agnostic of scheduling-policy preferences, focusing solely on the scheduling process. In this way, we adhere to the following principles:
- Information Expert Principle: The interval policy is defined by the user.
- Single Responsibility Principle: The scheduler's sole responsibility is to manage the scheduling process.
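As an illustration of a policy that depends on runtime factors, the sketch below doubles the delay after each consecutive failure, up to a cap. `DelayCalculator` is a local stand-in that mirrors the calculator shape described above (duration, plus an optional error); `createBackoffCalculator` and its parameters are hypothetical names, and the policy itself is only one of many possible choices.

```typescript
// Local stand-in for the calculator shape described above (an assumption,
// declared here so the sketch is self-contained).
type DelayCalculator = (durationMs: number, error?: Error) => number;

// Hypothetical policy: exponential backoff on consecutive failures.
function createBackoffCalculator(baseMs: number, maxMs: number): DelayCalculator {
  let consecutiveFailures = 0; // State kept by the user, not by the scheduler.
  return (_durationMs: number, error?: Error): number => {
    consecutiveFailures = error ? consecutiveFailures + 1 : 0;
    return Math.min(baseMs * 2 ** consecutiveFailures, maxMs);
  };
}
```

The scheduler never needs to know about the failure counter; it only consumes the returned delay, which is the separation of responsibilities the two principles describe.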
Zero Over-Engineering, No External Dependencies
This component offers a lightweight, dependency-free solution. It can also serve as a building block for more advanced implementations, if necessary.
Non-Persistent Scheduling
This component features non-durable scheduling, which means that if the app crashes or goes down, scheduling stops.
If you need durability across a multi-node deployment, consider using this scheduler as a building block, or use a solution designed for that purpose. Maintaining a timestamp of the last successful execution in a persistent database is usually sufficient to introduce durability.
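A minimal sketch of the persisted-timestamp idea: after a restart, the delay before the first execution is derived from the stored timestamp instead of starting a fresh interval. How the timestamp is stored and loaded is outside the scope of this sketch; `msUntilNextRunAfterRestart` and its parameters are hypothetical names.

```typescript
// Hypothetical helper: computes the delay before the first execution after a
// restart, given the persisted timestamp of the last successful execution.
function msUntilNextRunAfterRestart(
  lastSuccessTimestampMs: number | undefined,
  desiredIntervalMs: number,
  nowMs: number
): number {
  if (lastSuccessTimestampMs === undefined) {
    return 0; // No recorded success: run as soon as possible.
  }
  const elapsedMs = nowMs - lastSuccessTimestampMs;
  // If the interval has already elapsed while the app was down, run immediately.
  return Math.max(0, desiredIntervalMs - elapsedMs);
}
```

A calculator could return this value for the first execution and fall back to the regular policy afterwards.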
API :globe_with_meridians:
The NonOverlappingPeriodicJobScheduler class provides the following methods:
- start: Initiates the scheduling of periodic jobs.
- stop: Stops the scheduling of periodic jobs. If this method is invoked during an ongoing execution, it resolves once the current execution is complete. This guarantee provides determinism and allows for graceful termination.
- waitUntilCurrentExecutionCompletes: If called during an ongoing execution, resolves once that execution completes, whether the job resolves or rejects. If no execution is in progress, it resolves immediately.
If needed, refer to the code documentation for a more comprehensive description of each method.
Getter Methods :mag:
The NonOverlappingPeriodicJobScheduler class provides the following getter methods to reflect the scheduler's current state:
- isCurrentlyExecuting: Indicates whether the periodic job is actively running, as opposed to being between executions.
- isStopped: Indicates whether the instance is currently not managing periodic executions.
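The two getters above could, for instance, feed a status report. The sketch below uses a structural stand-in, `SchedulerState`, and assumes both getters are boolean property accessors; `describeSchedulerState` and the status labels are hypothetical.

```typescript
// Structural stand-in for the two getters described above. Assumption: both
// are exposed as boolean property accessors.
interface SchedulerState {
  readonly isCurrentlyExecuting: boolean;
  readonly isStopped: boolean;
}

type SchedulerStatus = 'executing' | 'stopped' | 'waiting-for-next-execution';

// Hypothetical helper that maps the getters to a human-readable status.
function describeSchedulerState(scheduler: SchedulerState): SchedulerStatus {
  if (scheduler.isCurrentlyExecuting) {
    return 'executing';
  }
  return scheduler.isStopped ? 'stopped' : 'waiting-for-next-execution';
}
```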
Error Handling :warning:
If a periodic job throws an error, the error will be passed to the calculator function. The scheduler does not perform any logging, as it is designed to be agnostic of user preferences, such as specific loggers or logging styles.
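Since logging is left to the user, one option is to wrap the calculator so that errors are reported through a logger of your choice. The sketch below is an assumption-laden illustration: `DelayCalculator` is a local stand-in for the calculator shape described above, while `LogFn` and `withErrorLogging` are hypothetical names.

```typescript
// Local stand-ins, declared so the sketch is self-contained (assumptions).
type DelayCalculator = (durationMs: number, error?: Error) => number;
type LogFn = (message: string) => void;

// Wraps any calculator so that job errors are reported via the given logger,
// while the delay decision is still delegated to the wrapped calculator.
function withErrorLogging(calculate: DelayCalculator, log: LogFn): DelayCalculator {
  return (durationMs: number, error?: Error): number => {
    if (error) {
      log(`Periodic job failed after ${durationMs}ms: ${error.message}`);
    }
    return calculate(durationMs, error);
  };
}
```

For example, `withErrorLogging(myCalculator, console.error)` keeps the scheduling policy untouched while reporting failures.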
Use-case Example :man_technologist:
import {
  NonOverlappingPeriodicJobScheduler,
  PeriodicJob,
  CalculateDelayTillNextExecution,
  NO_PREVIOUS_EXECUTION
} from 'non-overlapping-periodic-job-scheduler';

const MS_DELAY_AFTER_COMPLETION = 5000;

const calculateDelayTillNextFetch: CalculateDelayTillNextExecution =
  (_: number): number => {
    // Simplest possible implementation:
    // After each execution, the scheduler waits a fixed duration (5000 ms),
    // before triggering the next one.
    // First execution starts 5000ms after `start()` is called.
    return MS_DELAY_AFTER_COMPLETION;
  };

class ThreatIntelligenceAggregator {
  private readonly _threatFeedsScheduler =
    new NonOverlappingPeriodicJobScheduler(
      this.fetchLatestThreatFeeds.bind(this),
      calculateDelayTillNextFetch
    );

  public start(): void {
    this._threatFeedsScheduler.start();
    // Additional start operations.
  }

  public async stop(): Promise<void> {
    // Stopping may not be immediate: if a job execution is currently ongoing,
    // `stop` resolves only once that execution completes.
    await this._threatFeedsScheduler.stop();
    // Additional stop operations.
  }

  private async fetchLatestThreatFeeds(): Promise<void> {
    // Do your magic here.
  }
}
Time-Based Scheduling Policies :clock3:
Time-based scheduling disregards the execution's metadata (such as duration or thrown errors) and is measured against absolute timestamps on the clock.
Every 20 minutes on the clock :man_technologist:
Consider a scenario where executions should occur at fixed times of the day, for example, three times per hour at XX:00:00, XX:20:00, and XX:40:00. In other words, every 20 minutes on the clock. This scheduling policy can be implemented using the following calculator:
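import { CalculateDelayTillNextExecution } from 'non-overlapping-periodic-job-scheduler';

const MS_DELAY_BETWEEN_STARTS = 20 * 60 * 1000; // 20 minutes in milliseconds.

const calculateDelayTillNextExecution: CalculateDelayTillNextExecution =
  (_: number): number => {
    // Time remaining until the next multiple of 20 minutes since the epoch.
    return MS_DELAY_BETWEEN_STARTS - (Date.now() % MS_DELAY_BETWEEN_STARTS);
  };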
Please note that due to the non-overlapping nature of this scheduler, if an execution exceeds 20 minutes, its subsequent scheduled start time (e.g., 00:40:00) will be skipped.
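To make the arithmetic of the calculator above concrete, the sketch below applies the same formula with the current time passed in as a parameter. `msUntilNextBoundary` is a hypothetical helper introduced for illustration. Note that `Date.now()` counts milliseconds from the UTC epoch, so the boundaries are aligned to UTC; in time zones whose UTC offset is not a multiple of the period (for example UTC+05:30 with a 20-minute period), the local wall-clock minutes will be shifted accordingly.

```typescript
const MS_PER_MINUTE = 60 * 1000;

// Same arithmetic as the calculator above, with the current time injected.
function msUntilNextBoundary(nowMs: number, periodMs: number): number {
  return periodMs - (nowMs % periodMs);
}

const TWENTY_MINUTES_MS = 20 * MS_PER_MINUTE;

// At 10:05:00 UTC, the next boundary (10:20:00 UTC) is 15 minutes away.
const fromFivePast = msUntilNextBoundary(Date.UTC(2024, 0, 1, 10, 5, 0), TWENTY_MINUTES_MS);

// Exactly on a boundary, the formula yields a full period rather than zero.
const fromBoundary = msUntilNextBoundary(Date.UTC(2024, 0, 1, 10, 20, 0), TWENTY_MINUTES_MS);
```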
Daily execution at a Fixed Hour :man_technologist:
Consider a scenario where the execution should occur once a day at 16:00 (4 PM). A suitable calculator function might be:
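import { CalculateDelayTillNextExecution } from 'non-overlapping-periodic-job-scheduler';

const calculateDelayTillNextExecution: CalculateDelayTillNextExecution =
  (_: number): number => {
    const now = Date.now();
    const nextAt16 = new Date(now);
    nextAt16.setHours(16, 0, 0, 0); // Today at 16:00, local time.
    if (nextAt16.getTime() < now) {
      // 16:00 has already passed today. Advancing the calendar date, rather than
      // adding a fixed 24 hours, keeps the target at 16:00 across DST changes.
      nextAt16.setDate(nextAt16.getDate() + 1);
    }
    return nextAt16.getTime() - now;
  };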
Interval-Based Scheduling Policies :repeat:
Interval-based scheduling ignores absolute timestamps on the clock. It is applicable when the interval between executions matters more than the exact timing of each execution. Unlike most schedulers, this variant allows the gap to be determined during runtime, enabling consideration of runtime factors.
Basic example :man_technologist:
Let's start with the simplest example, which involves having a fixed interval. Formally, the determined interval is the delay between the end of the i-th execution and the start of the (i+1)-th execution.
import { CalculateDelayTillNextExecution } from 'non-overlapping-periodic-job-scheduler';

const FIXED_MS_DELAY_BETWEEN_EXECUTIONS = 5000;
const calculateDelayTillNextExecution: CalculateDelayTillNextExecution =
  (justFinishedExecutionDurationMs: number) => FIXED_MS_DELAY_BETWEEN_EXECUTIONS;
Considering the Error Argument :man_technologist:
A more advanced example might consider the error argument. For example, the user may prefer a shorter delay after a failed execution, until an execution succeeds.
The CalculateDelayTillNextExecution type alias accepts a generic JobError type, defaulting to Error if not explicitly specified. Refer to the full documentation of the CalculateDelayTillNextExecution alias for more information if needed.
import {
  CalculateDelayTillNextExecution,
  NO_PREVIOUS_EXECUTION
} from 'non-overlapping-periodic-job-scheduler';

// CustomError stands for any user-defined error class; it is declared here
// only so that the snippet is self-contained.
class CustomError extends Error {}

const FIRST_EXECUTION_MS_DELAY = 10 * 1000;
const MS_DELAY_AFTER_SUCCESS = 20 * 1000;
const MS_DELAY_AFTER_FAILURE = 4000;

// The JobError type argument is set explicitly to CustomError here;
// it defaults to Error when omitted.
const calculateDelayTillNextExecution: CalculateDelayTillNextExecution<CustomError> = (
  justFinishedExecutionDurationMs: number,
  justFinishedExecutionError?: CustomError
): number => {
  if (justFinishedExecutionDurationMs === NO_PREVIOUS_EXECUTION) {
    return FIRST_EXECUTION_MS_DELAY;
  }

  if (justFinishedExecutionError) {
    const { message } = justFinishedExecutionError;
    console.error(`Last execution failed. Reason: ${message}`);
    return MS_DELAY_AFTER_FAILURE;
  }

  return MS_DELAY_AFTER_SUCCESS;
};
Mimicking 'setInterval' :man_technologist:
If you want to mimic the behavior of setInterval, which maintains a fixed interval between start times, you should be aware that the duration of a job execution might exceed the interval. A simple scheduling policy might decide that, under such circumstances, the next execution should occur immediately.
import {
  CalculateDelayTillNextExecution,
  NO_PREVIOUS_EXECUTION
} from 'non-overlapping-periodic-job-scheduler';

const FIXED_MS_DELAY = 5000;

const calculateDelayTillNextExecution: CalculateDelayTillNextExecution = (
  justFinishedExecutionDurationMs: number
): number => {
  if (justFinishedExecutionDurationMs === NO_PREVIOUS_EXECUTION) {
    return FIXED_MS_DELAY;
  }

  if (justFinishedExecutionDurationMs > FIXED_MS_DELAY) {
    // The execution overran the interval: start the next one immediately.
    return 0;
  }

  // For example, if a just-finished execution took 1000ms, and the desired
  // interval-between-starts is 5000ms, the next execution should start in 4000ms.
  return FIXED_MS_DELAY - justFinishedExecutionDurationMs;
};
Alternative 'setInterval' mimicking :man_technologist:
Another approach to mimicking the setInterval policy, while dealing with potentially overlapping executions, is to schedule only according to the originally planned start times. Overlapped start times are skipped.
Formally, start times will correspond to the formula START_TIMESTAMP + N * FIXED_MS_DELAY, where N is a natural number. For example, the following ascending sequence of start times implies that the first execution took at least FIXED_MS_DELAY, but less than 2 * FIXED_MS_DELAY. This can be deduced from the missing start timestamp, START_TIMESTAMP + 2 * FIXED_MS_DELAY:
- START_TIMESTAMP + FIXED_MS_DELAY
- START_TIMESTAMP + 3 * FIXED_MS_DELAY
- START_TIMESTAMP + 4 * FIXED_MS_DELAY
- START_TIMESTAMP + 5 * FIXED_MS_DELAY
Such a scheduling policy can be useful for aggregation jobs, where a recently executed job implies that the data is still fresh.
import {
  CalculateDelayTillNextExecution,
  NO_PREVIOUS_EXECUTION
} from 'non-overlapping-periodic-job-scheduler';

const FIXED_MS_DELAY = 5000;

const calculateDelayTillNextExecution: CalculateDelayTillNextExecution = (
  justFinishedExecutionDurationMs: number
): number => {
  if (justFinishedExecutionDurationMs === NO_PREVIOUS_EXECUTION) {
    return FIXED_MS_DELAY;
  }

  // For example, if a just-finished execution took 6000ms, and the desired
  // interval-between-starts is 5000ms, the next execution should start in
  // 4000ms (6000 % 5000 = 1000, and 5000 - 1000 = 4000).
  return FIXED_MS_DELAY - (justFinishedExecutionDurationMs % FIXED_MS_DELAY);
};
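The skipped-slot behavior can be checked with a small simulation that applies the same arithmetic to a list of execution durations. This is an idealized sketch: it ignores timer drift and the time spent inside the calculator, and `delayAfter` and `simulateStartOffsets` are hypothetical helpers introduced for illustration.

```typescript
const FIXED_MS_DELAY = 5000;

// Same arithmetic as the calculator above, minus the first-execution branch.
function delayAfter(durationMs: number): number {
  return FIXED_MS_DELAY - (durationMs % FIXED_MS_DELAY);
}

// Returns the start offsets (ms since START_TIMESTAMP) of consecutive
// executions, given how long each one took.
function simulateStartOffsets(durationsMs: number[]): number[] {
  const startOffsets: number[] = [];
  let nextStart = FIXED_MS_DELAY; // First execution: one interval after start.
  for (const durationMs of durationsMs) {
    startOffsets.push(nextStart);
    nextStart += durationMs + delayAfter(durationMs);
  }
  return startOffsets;
}

// The first execution takes 6000ms, so the slot at 2 * FIXED_MS_DELAY is skipped.
const offsets = simulateStartOffsets([6000, 1000, 1000, 1000]);
// offsets is [5000, 15000, 20000, 25000], i.e. 1x, 3x, 4x and 5x FIXED_MS_DELAY.
```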
Breaking Change in Version 2.0.0
In version 2.0.0, the target compatibility was upgraded from ES6 to ES2020. This change leverages the widespread adoption of ES2020 and, in particular, the native support for async/await available in such runtimes (async/await itself was standardized in ES2017).
Breaking Change in Version 3.0.0
In version 3.0.0, the method waitTillCurrentExecutionSettles was renamed to waitUntilCurrentExecutionCompletes for improved clarity.
Naming Convention
It is highly recommended to assign a use-case-specific name to your scheduler instances. This practice helps in clearly identifying the purpose of each scheduler in the codebase. Examples include:
- deleteExpiredDataScheduler
- syncAccessPermissionsScheduler
- updateFirewallRulesScheduler
- healthDiagnosticsScheduler
- archiveOldLogsScheduler