@elliottcable/giraphe

v1.0.0

Published

2 years ago

A exceedingly-generic graph walking API.

Downloads

0High
0Medium
0Low

elliottcable

Giraphe

This is an API stub for a generic graph-climbing library in JavaScript, extracted from (and developed for) Paws.js. It's intended to present the the same API for different use-cases, while simultaneously optimizing for each use-case individually. (It does not currently do this.)

The primary purpose of Giraphe's walk() function is, given an arbitrary graph structure expressed in JavaScript objects, to allow you to explore that graph in a static fashion — given a set of callbacks, to walk through the graph, both 1. ‘discovering’ new subgraphs that need to be recursed into, and 2. ‘collecting’ nodes according to some critereon.

Interface

The `Walker()` constructor

The default export, this must be called with an options argument, and it returns a walk() function as described by those options.

import Walker from 'giraphe'
const walk = new Walker( ... )

The passed options-object must have exactly one of the keyer: or key: properties:

key: — the most basic mode of operation, this expresses that nodes beloning to your graph have unique String-ish identifiers, stored on the node-Object at the property with the given name.
```
const walk = new Walker({ key: 'id', ... })
    , root = { id: 'foobar' }

walk(root, ...)
```
keyer: — for more flexibility, you may instead provide a function to generate a unique key for a given node. The function you provide as the keyer will be invoked with the object to be uniquely identified, and must return a String unique to that Object.
```
const identify = node => /* generate key! */
    , walk = new Walker({ keyer: identify, ... })
    , root = new MyNodeType

walk(root, ...)
```

The options must also include exactly one of the class: or predicate: properties:

class: — again, this causes the simplest mode of operation; when passed a JavaScript Function (i.e. a “class”), this will cause the walking-function to use a simple instanceof check to determine whether a touched JavaScript object is a node in your graph or not.
```
const walk = new Walker({ key: ..., class: MyNodeType })
    , root = new MyNodeType

walk(root, ...)
```
predicate: — when functioning across JavaScript contexts, or otherwise operating on a graph of noes of non-homogenous JavaScript type, you may instead provide a predicate function which will indicate to Giraphe whether a given JavaScript Object is a node in your graph or not.
```
const isNode = node => /* verify nodey-ness! */
    , walk = new Walker({ key: ..., predicate: isNode })
    , root = new MyNodeType

walk(root, ...)
```

The produced `walk()` function

Once configured, Giraphe will return to you a function-object, which is to be called with a root node and a collection of various sorts of callbacks. The returned walk() function may be called either function-style, or method-style:

const walk = new Walker( ... )
walk(root, ...)

MyNodeClass.prototype.walk = walk
root.walk(...)

The first argument to walk() (or alternatively the object upon which it is invoked, if it is invoked method-style) must be an object of the class passed to the Walker constructor (or one satisfying the predicate, if such was passed instead — henceforth, I'll just call such an object “a node.”); that node will be the first walked, that is, the root of the subgraph that gets walked.

Other arguments must be functions; these behave as callbacks manipulating the behaviour of the recursive walk() process. Such callbacks must behave in one of two ways: as a so-called ‘supplier’, or as a ‘filter.’

All callbacks are invoked with the same arguments (none of which may be safely modified):

The current node being visited,
the parent node that was being visited when a supplier-callback yielded that current node,
a set-map (key-to-node-object) of nodes supplied by prior suppliers during this visit,
a set-map of all nodes seen during the walk thus far (including rejected nodes),
and the complete list of callbacks with which the current walk() is operating.

When all discovered nodes in the graph have been exhausted, walk() produces a final object-mapping of nodes; the properties of which are the unique-keys of all collected (and non-rejected) nodes, each with the corresponding node itself as the value.

‘Supplier’ callbacks

Suppliers are how walk() recurses into your graph-structure. At each node, additional nodes may be ‘discovered’ by your supplier-callback; these'll be added to the set of nodes pending visitation.

Suppliers may indicate further nodes for visitation by,

returning a node directly:

root.walk(node => { return node.child })

returning an Array of nodes:

root.walk(node => { return [node.left, node.right] })

returning an object-mapping of nodes (such as that returned from another walk() process):
```
root.walk(node => { return node.walk_children() })
```

(Note that, although returning undefined technically makes a callback a ‘filter’, it counts as a pass; so it's equivalent to a supplier that adds nothing. However, filter-vs-supplier behaviour may change in future releases, especially w.r.t. caching!)

‘Filter’ callbacks

If suppliers are how you find new nodes to visit, filters are how you select which of those nodes contribute to the overall result of the walk(). Where suppliers operate on descendants of the current node being visited, filters operate on that current node itself.

Any callback returning a boolean value is treated as a filter.

The default behaviour of a filter (i.e. if it returns undefined), is to pass the current node — i.e. as a noop, simply let it, and any nodes supplied by other callbacks, through into the final output of walk().

root.walk(node => { return node.is_blue || node.is_green })

However, if a callback explicitly returns boolean false, then it's considered to reject the current node — despite having been ‘supplied’ by a previous callback, this node will not be included in the results of the walk(); and any newly-supplied nodes from the current walk-step will be discarded¹ as well.

root.walk(node => { return false if node.age >= 12 })

Such rejection is short-circuiting — the current walk-step is aborted, no further callbacks (of either type) are evaluated, any nodes collected via the rejected current node during this step will be discarded, so on and so forth.

(Of note, this simply invalidates their discovery by way of the current, rejected node — they are not, themselves, ‘rejected’; and may be re-supplied by a different walk-step / be re-discovered by a different path through your graph.)

Examples

const MyNode = function(){
   this._children = new Array
   this.id = MyNode.max = (MyNode.max || 0) + 1 }

MyNode.prototype.walk = new Walker({ key: 'id', class: MyNode })
MyNode.prototype.descendants = MyNode.prototype.walk(node => node.children)

// ... NYI

Why?

This exists because I got tired of writing almost the same graph-walking function, over and over, for Paws.js; while each function was slightly different from the previous one, and yet occured in a ‘hot’ enough location to preclude using a truly generic walk() function for each situation.

The intention, unfulfilled, was to pre-‘compile’ optimized graph-walking functions for each situation, based on the options passed to the Walker constructor; this would allow me to centralize testing efforts and interface design (the API) to this single library. My first attempt at that task was laughable (it involved mustache templates. Seriously.); and thus, for now, this library is simply going to be an external API, with a generic (slow) walk() implementation satisfying all of my needs; however, I still hope to eventually extract the necessary optimizations into this library.

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme