object-graph-as-json
v2.0.0
Library for representing arbitrary javascript objects in JSON.
Given an arbitrary javascript object, this library encodes it using only JSON-safe data types. Tools for decoding the data are provided too.
This enables serializing/deserializing objects that `JSON.stringify` cannot handle.
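To make that gap concrete, here is a small sketch of three cases where plain `JSON.stringify` rejects or loses data. It relies only on built-in JavaScript behavior and does not call this library.

```javascript
// 1. Circular references make JSON.stringify throw a TypeError.
const node = { name: 'a' };
node.self = node;
let circularError = null;
try {
  JSON.stringify(node);
} catch (err) {
  circularError = err;
}
console.log(circularError instanceof TypeError); // true

// 2. Function-valued properties are silently dropped.
const withFunction = JSON.stringify({ current: 1, increment() {} });
console.log(withFunction); // {"current":1}

// 3. BigInt values also make JSON.stringify throw a TypeError.
let bigintError = null;
try {
  JSON.stringify({ big: 10n });
} catch (err) {
  bigintError = err;
}
console.log(bigintError instanceof TypeError); // true
```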
Goals
- Support encoding circular / cyclic references among objects.
- Preserve as much detail as feasible about every type of javascript object.
- Even if decoding to an equivalent object will be impossible, consumers should still be able to access as much information as possible about what the original object was like.
- Be safe for automated tooling to inject at arbitrary points in a program without changing the program's behavior.
  - Therefore, property getters will not be called unless they are built-in parts of javascript that do not have side effects. (But the existence of the getter, and its function object, will be encoded.)
  - As noted in the limitations section, this goal is not necessarily achieved for `Proxy` objects.
- If desired, encode information about an object's identity.
  - Enables determining whether two encoded objects (whether they are otherwise identical or not) originated from the same in-memory object or not.
- Represent common object types in a way that minimizes the amount of visual noise for a human inspecting the encoded graph.
- Keep the encoder small.
Usage
Install:
npm i --save object-graph-as-json
Import the encoder:
import { Encoder } from 'object-graph-as-json';
Make whatever objects you want to encode; we'll demonstrate by creating a counter:
function increment() {
  this.current++;
}
const counter = {
  current: 1,
  increment,
};
counter.increment();
console.log(counter.current); // 2
Encode the objects:
const encoder = new Encoder();
const encoded = encoder.encode(counter);
const json = JSON.stringify(encoded, null, 2);
console.log(json);
Output:
{
  "id": "1",
  "type": "object",
  ".current": 2,
  ".increment": {
    "id": "2",
    "type": "function",
    "source": "function increment() {\n this.current++;\n}",
    ".length": {
      "type": "property",
      "value": 0,
      "configurable": true
    },
    ".name": {
      "type": "property",
      "value": "increment",
      "configurable": true
    },
    ".arguments": {
      "type": "property",
      "value": null
    },
    ".caller": {
      "type": "property",
      "value": null
    },
    ".prototype": {
      "type": "property",
      "value": {
        "id": "3",
        "type": "object",
        ".constructor": {
          "type": "property",
          "value": {
            "id": "2",
            "type": "ref"
          },
          "writable": true,
          "configurable": true
        }
      },
      "writable": true
    }
  }
}
See the docs in src/Encoder.js for more info.
If you like to live on the edge, you can also decode the output back to real javascript objects:
import { UnsafeDecoder } from 'object-graph-as-json';
const parsed = JSON.parse(json);
const decoder = new UnsafeDecoder();
// DO NOT DO THIS WITH UNTRUSTED INPUT!
const decoded = decoder.decode(parsed);
console.log(decoded.current); // 2
decoded.increment();
console.log(decoded.current); // 3
UnsafeDecoder allows the input to run arbitrary code at decode time, as demonstrated in src/index.test.js, so it is not suitable for use in most situations without modification. See the jsdoc in src/UnsafeDecoder.js for more info.
Spec
Numbers (excluding `NaN` and `Infinity`), strings, booleans, and `null` are unchanged by encoding.
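The pass-through rule can be sketched as a small predicate. `passesThroughUnchanged` is a hypothetical helper name, not part of the library, and treating `-Infinity` the same as `Infinity` is an assumption on top of the wording above.

```javascript
// Returns true for values that the spec says are unchanged by encoding.
// Hypothetical helper for illustration only.
function passesThroughUnchanged(value) {
  if (value === null) return true;
  if (typeof value === 'string' || typeof value === 'boolean') return true;
  // Number.isFinite does not coerce its argument, so it is false for
  // non-numbers as well as for NaN, Infinity, and -Infinity (assumption:
  // -Infinity is excluded along with Infinity).
  return Number.isFinite(value);
}

console.log(passesThroughUnchanged(42));        // true
console.log(passesThroughUnchanged('text'));    // true
console.log(passesThroughUnchanged(null));      // true
console.log(passesThroughUnchanged(NaN));       // false
console.log(passesThroughUnchanged(undefined)); // false
```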
Everything else is encoded to an object which contains, at minimum, a string field named `type`.
Allowed values for `type` are:
- `builtin`
- `bigint`
- `symbol`
- `array`
- `function`
- `object`
- `property` (only allowed for array/function/object property values)
- `ref`
- `unknown`
`symbol`, `array`, `function`, `object`, and `ref` all contain fields named `id` which are used for indicating when the same original in-memory object appears at multiple points in the object graph.
Within a single encoded graph:
- All symbols with the same id are the same symbol, and all symbols with different ids are different symbols.
- The same id cannot appear on more than one array, function, or object; after the first occurrence, a `ref` will be used instead.
- Objects/arrays/functions with different ids originate from different in-memory objects.
- All refs to the same id originated from the same in-memory object as the object/array/function on which that id appeared.
- No objects, arrays, functions, or refs should have the same id as a symbol.
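The id rules above can be checked mechanically. The following is a rough sketch of such a check; `checkIds` is a hypothetical helper rather than something the library provides, and it ignores symbol ids that appear only inside property keys.

```javascript
// Walks an encoded graph and returns a list of id-rule violations.
// Hypothetical helper for illustration only.
function checkIds(encoded) {
  const containerTypes = new Set(['array', 'function', 'object']);
  const containerIds = new Set(); // ids seen on array/function/object
  const refIds = new Set();       // ids seen on refs
  const symbolIds = new Set();    // ids seen on symbols
  const errors = [];

  function visit(value) {
    if (value === null || typeof value !== 'object') return;
    if (containerTypes.has(value.type)) {
      if (containerIds.has(value.id)) errors.push(`duplicate id ${value.id}`);
      containerIds.add(value.id);
    } else if (value.type === 'ref') {
      refIds.add(value.id);
    } else if (value.type === 'symbol') {
      symbolIds.add(value.id);
    }
    // Encoded graphs are trees of plain JSON objects, so this terminates.
    Object.values(value).forEach(visit);
  }
  visit(encoded);

  for (const id of refIds) {
    if (!containerIds.has(id)) errors.push(`ref to unknown id ${id}`);
  }
  for (const id of symbolIds) {
    if (containerIds.has(id) || refIds.has(id)) errors.push(`symbol id ${id} reused`);
  }
  return errors;
}

const selfReferencing = { id: '1', type: 'object', '.self': { id: '1', type: 'ref' } };
const duplicated = { id: '1', type: 'object', '.child': { id: '1', type: 'object' } };
console.log(checkIds(selfReferencing)); // []
console.log(checkIds(duplicated));      // [ 'duplicate id 1' ]
```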
builtin
Certain values are recognized and simply referred to by name using an object with two fields:
- `type` = `"builtin"`
- `name`: See src/builtins.js for a current list of supported names.
bigint
BigInts are encoded to an object with two fields:
- `type` = `"bigint"`
- `string`: The result of calling `.toString()` on the BigInt.
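As a minimal sketch of this shape, the pair of functions below round-trips a BigInt through the encoded form. The function names are hypothetical and the library's own implementation may differ.

```javascript
// Hypothetical helpers illustrating the bigint shape described above.
function encodeBigInt(value) {
  return { type: 'bigint', string: value.toString() };
}

function decodeBigInt(encoded) {
  return BigInt(encoded.string);
}

const encodedBig = encodeBigInt(12345678901234567890n);
console.log(JSON.stringify(encodedBig));
// {"type":"bigint","string":"12345678901234567890"}
console.log(decodeBigInt(encodedBig) === 12345678901234567890n); // true
```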
symbol
Symbols are encoded to an object with three fields:
- `type` = `"symbol"`
- `id`
- `description`: The result of retrieving `.description` from the Symbol.
When the same id occurs on multiple symbols within a graph, the description must be the same on all of them.
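One way to satisfy these rules is to keep a map from each symbol to the id it was first given. The class below is a sketch under that assumption; `SymbolIds` and its sequential string ids are illustrative, and the library may assign ids differently.

```javascript
// Hypothetical sketch: hand out one stable id per distinct symbol.
class SymbolIds {
  constructor() {
    this.ids = new Map(); // symbol -> id string
  }

  encode(symbol) {
    if (!this.ids.has(symbol)) {
      this.ids.set(symbol, String(this.ids.size + 1));
    }
    // description is undefined for Symbol(), and JSON.stringify omits
    // undefined fields, so such symbols serialize without a description.
    return { type: 'symbol', id: this.ids.get(symbol), description: symbol.description };
  }
}

const symbolIds = new SymbolIds();
const tag = Symbol('tag');
const firstTag = symbolIds.encode(tag);
const secondTag = symbolIds.encode(tag);
const otherTag = symbolIds.encode(Symbol('tag'));
console.log(firstTag.id === secondTag.id); // true: same symbol, same id
console.log(firstTag.id === otherTag.id);  // false: different symbol
console.log(JSON.stringify(symbolIds.encode(Symbol())));
// {"type":"symbol","id":"3"}
```

Because the map holds on to every symbol it has seen, a long-lived instance of a structure like this grows without bound.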
array
If an array meets all the following conditions:
- Its `typeof` is `object`.
- Its prototype is `Array.prototype`.
- Its `length` field is writable but not enumerable or configurable, and not an accessor.
- Its highest index is its `length` field minus 1.
- Its `length` field is >= 0.

Then it may be encoded to an object with `type` = `"array"`.
These should be interpreted in the same way as `type` = `"object"` except:
- The original object's `typeof` is implied to be `object`.
- The `prototype` field will not be included, since it is implied to be `Array.prototype`.
- The `length` field will not be included, since it is assumed to equal the highest index plus 1 and to be writable but not enumerable or configurable and not an accessor.
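The eligibility conditions for the `array` type can be sketched as a predicate. `qualifiesForArrayType` is a hypothetical name, and treating an empty array as having a highest index of -1 is an interpretation of the conditions rather than something the spec states.

```javascript
// Hypothetical check for whether a value may use the compact "array" type.
function qualifiesForArrayType(value) {
  if (typeof value !== 'object' || value === null) return false;
  if (Object.getPrototypeOf(value) !== Array.prototype) return false;

  const length = Object.getOwnPropertyDescriptor(value, 'length');
  // A missing descriptor or one without a `value` key (an accessor) fails.
  if (!length || !('value' in length)) return false;
  if (!length.writable || length.enumerable || length.configurable) return false;
  if (!(length.value >= 0)) return false;

  // Find the highest own index-like key; -1 when there are none (assumption).
  let highestIndex = -1;
  for (const name of Object.getOwnPropertyNames(value)) {
    if (/^(0|[1-9]\d*)$/.test(name)) {
      highestIndex = Math.max(highestIndex, Number(name));
    }
  }
  return highestIndex === length.value - 1;
}

console.log(qualifiesForArrayType([1, 2, 3]));          // true
console.log(qualifiesForArrayType([]));                 // true
console.log(qualifiesForArrayType(new Array(3)));       // false: no index 2
console.log(qualifiesForArrayType(Object.freeze([1]))); // false: length not writable
```

Values that fail the check would presumably fall back to the more verbose `object` form, which records the prototype and `length` explicitly.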
function
Anything whose `typeof` is `'function'` will be encoded to an object with `type` = `"function"`.
This is equivalent to `type` = `"object"` except:
- The original object's `typeof` is implied to be `function`.
- A `source` field is added containing the source code of the function as a string, to the best of the encoder's ability to determine it.
  - This is not guaranteed to be valid javascript code. Native functions, for example, may include the string `[native code]` in place of valid javascript code.
- `prototype` is assumed to be `Function.prototype` if it is absent.
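The difference between user-defined and native function source can be seen with the standard `Function.prototype.toString`. This sketch only shows what that built-in returns; it does not call the encoder.

```javascript
function increment() {
  this.current++;
}

// User-defined functions report their own source text.
const userSource = Function.prototype.toString.call(increment);
console.log(userSource.startsWith('function increment()')); // true

// Native functions report a placeholder body instead of real code.
const nativeSource = Function.prototype.toString.call(Math.max);
console.log(nativeSource.includes('[native code]')); // true
```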
object
Objects are encoded to objects with the following fields:
- `type` = `"object"`
- `id`
- `prototype`: The encoded result of calling `Object.getPrototypeOf` on the object. May be omitted if it is `Object.prototype`.
- All of the original object's properties are encoded as properties.
  - If the key is a string, it is encoded as that string, but prefixed with `"."`.
  - If the key is a symbol and is a builtin, it is encoded as `"@name"`.
  - If the key is a symbol that is not a builtin and does not have a description, it is encoded as `"~id"`, where `id` is the unique identifier assigned to the symbol for the purposes of encoding.
  - If the key is a symbol that is not a builtin and has a description, it is encoded as `"~id|description"`. Note that the description may be an empty string, which is different from having no description (`symbol.description === ''` vs `symbol.description === undefined`).
  - The value is encoded as described in the section on properties below.
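The key-encoding rules can be sketched as a single function. Everything named here is hypothetical: `encodeKey`, the `symbolIdFor` callback, and the one-entry builtin table (the real builtin names are listed in src/builtins.js and may be spelled differently).

```javascript
// Hypothetical builtin table; the library's actual names may differ.
const BUILTIN_SYMBOL_NAMES = new Map([[Symbol.iterator, 'Symbol.iterator']]);

// Encodes a property key following the rules listed above.
function encodeKey(key, symbolIdFor) {
  if (typeof key === 'string') return `.${key}`;
  const builtinName = BUILTIN_SYMBOL_NAMES.get(key);
  if (builtinName !== undefined) return `@${builtinName}`;
  const id = symbolIdFor(key);
  // An empty description still produces the "|" separator.
  return key.description === undefined ? `~${id}` : `~${id}|${key.description}`;
}

// Simple id allocator for the demonstration (assumed id format).
const seenSymbols = new Map();
const symbolIdFor = (symbol) => {
  if (!seenSymbols.has(symbol)) seenSymbols.set(symbol, String(seenSymbols.size + 1));
  return seenSymbols.get(symbol);
};

const encodedKeys = [
  encodeKey('current', symbolIdFor),
  encodeKey(Symbol.iterator, symbolIdFor),
  encodeKey(Symbol(), symbolIdFor),
  encodeKey(Symbol(''), symbolIdFor),
  encodeKey(Symbol('tag'), symbolIdFor),
];
console.log(encodedKeys);
// [ '.current', '@Symbol.iterator', '~1', '~2|', '~3|tag' ]
```

The distinct prefixes mean an encoded property key can never collide with the fixed fields `type`, `id`, and `prototype`.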
property
If a property meets all the following conditions:
- It is not an accessor.
- It is writable, enumerable, and configurable.
Then the encoded value will be stored directly on the object. Otherwise, a property object will be created, which has the following structure:
- `type` = `"property"`
- `value`: The encoded value of the property; omitted for accessor properties.
- `get`: The encoded value of the getter, if any, for accessor properties.
- `set`: The encoded value of the setter, if any, for accessor properties.
- `writable`: Boolean, may be omitted if false.
- `enumerable`: Boolean, may be omitted if false.
- `configurable`: Boolean, may be omitted if false.
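A sketch of how a property descriptor might map onto this structure follows. `encodeProperty` is a hypothetical helper, and the `encodeValue` callback stands in for whatever encodes nested values; the demonstration passes an identity function, which is only adequate for JSON-safe primitives.

```javascript
// Hypothetical mapping from a property descriptor to its encoded form.
function encodeProperty(descriptor, encodeValue) {
  const isAccessor = 'get' in descriptor || 'set' in descriptor;
  if (!isAccessor && descriptor.writable && descriptor.enumerable && descriptor.configurable) {
    // Ordinary data property: store the encoded value directly.
    return encodeValue(descriptor.value);
  }
  const encoded = { type: 'property' };
  if (isAccessor) {
    // The getter and setter are encoded as values; they are never called.
    if (descriptor.get) encoded.get = encodeValue(descriptor.get);
    if (descriptor.set) encoded.set = encodeValue(descriptor.set);
  } else {
    encoded.value = encodeValue(descriptor.value);
    if (descriptor.writable) encoded.writable = true;
  }
  if (descriptor.enumerable) encoded.enumerable = true;
  if (descriptor.configurable) encoded.configurable = true;
  return encoded;
}

function increment() {
  this.current++;
}
const identity = (value) => value;
const lengthProperty = encodeProperty(Object.getOwnPropertyDescriptor(increment, 'length'), identity);
const plainProperty = encodeProperty(Object.getOwnPropertyDescriptor({ current: 2 }, 'current'), identity);
console.log(JSON.stringify(lengthProperty)); // {"type":"property","value":0,"configurable":true}
console.log(plainProperty);                  // 2
```

The first result matches the `.length` entry in the usage example's output, since a function's `length` is configurable but neither writable nor enumerable.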
ref
While encoding an object graph, if any object/array/function appears more than once, the content of the object will only be encoded the first time. On subsequent encounters, it's encoded as a wrapper object with two fields:
- `type` = `"ref"`
- `id`: The `id` that was used the first time the object was encoded.
Note that refs are not used for symbols, because symbols can also occur in property keys (see section on objects above), and it is simpler to just duplicate the information about the symbol than to account for the possibility of a ref to a property key. But all symbols with the same id within the graph should be decoded to the same Symbol object.
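The ref mechanism can be illustrated with a deliberately tiny encoder. `encodeWithRefs` is a hypothetical sketch, not the library's `Encoder`: it handles only plain objects whose properties are ordinary enumerable string-keyed data holding JSON-safe primitives or other plain objects, and its sequential string ids are an assumption.

```javascript
// Hypothetical mini-encoder showing how refs break cycles.
function encodeWithRefs(root) {
  const ids = new Map(); // original object -> assigned id

  function encode(value) {
    if (value === null || typeof value !== 'object') return value;
    if (ids.has(value)) {
      // Already encoded once: emit a ref instead of repeating the content.
      return { id: ids.get(value), type: 'ref' };
    }
    const id = String(ids.size + 1);
    ids.set(value, id);
    const encoded = { id, type: 'object' };
    for (const [key, child] of Object.entries(value)) {
      encoded[`.${key}`] = encode(child);
    }
    return encoded;
  }

  return encode(root);
}

const parent = { name: 'parent' };
const child = { name: 'child', parent };
parent.child = child; // creates a cycle

const encodedCycle = JSON.stringify(encodeWithRefs(parent));
console.log(encodedCycle);
// {"id":"1","type":"object",".name":"parent",".child":{"id":"2","type":"object",".name":"child",".parent":{"id":"1","type":"ref"}}}
```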
unknown
If the encoder encounters a value whose `typeof` it does not recognize, it simply returns a wrapper object with two fields:
- `type` = `"unknown"`
- `typeof`: The result of `typeof`.
Status & Known Limitations
- More types will likely be added in order to compactly represent things such as regexes.
- The encoder makes no attempt to detect whether an object is a `Proxy`; proxied objects will be encoded as the proxy handler presents them, and the proxy handler's traps may be invoked.
- There are more objects that should be added to the list of recognized builtins.
- Recreating functions in general is janky.
  - Bound functions (i.e. the result of calling `.bind(...)` on a function) cannot be recreated from their encoded form. As far as I'm aware, javascript does not provide a way to programmatically determine that a function object is a bound function or to retrieve the bindings or the original function from a bound function. The source code string for a bound function generally refers to native code.
  - Variable bindings from enclosing lexical scopes are not preserved.
  - The current UnsafeDecoder implementation assumes that the source code of the function is a valid function declaration, expression, or arrow function expression. But depending on your execution environment, calling toString() on a method (which is what the encoder does to determine the source code) may result in a string that starts with the method name instead.
- An instance of `Encoder` or `UnsafeDecoder` will never forget a symbol that it has seen. If you use long-lived instances of these, and for some reason your program produces large numbers of new symbols which are included in objects that it encodes, this is a memory leak.
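The three function-related limitations can be reproduced with built-in JavaScript alone. The sketch below does not use the library; the `new Function` call is only a stand-in for "rebuild a function from its source string" and is not necessarily how `UnsafeDecoder` works.

```javascript
// 1. Bound functions: the source string refers to native code.
function greet() {
  return `hello ${this.name}`;
}
const boundGreet = greet.bind({ name: 'world' });
console.log(boundGreet.toString().includes('[native code]')); // true

// 2. Lexical bindings are lost when a function is rebuilt from its source.
function makeCounter() {
  let count = 0;
  return function next() {
    return ++count;
  };
}
const nextSource = makeCounter().toString();
const rebuiltNext = new Function(`return (${nextSource});`)();
let rebuiltError = null;
try {
  rebuiltNext(); // `count` no longer exists in the rebuilt function's scope
} catch (err) {
  rebuiltError = err;
}
console.log(rebuiltError instanceof ReferenceError); // true

// 3. Method shorthand: in engines that return the exact source text, the
// string starts with the method name, which is not a valid standalone
// function expression.
const speaker = { shout() { return 'hi'; } };
const methodSource = speaker.shout.toString();
console.log(methodSource);
```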
Development
Clone this repo and run `npm install` to install development dependencies.
Run `npm test` or `npm run test:watch` to run the unit tests.
Run `npm run build` to produce builds in the `target` folder targeting CommonJS, ECMAScript Modules, and UMD.
Contributing
Bug reports and pull requests are welcome on GitHub at https://github.com/brokensandals/object-graph-as-json.
License
This is available as open source under the terms of the MIT License.