charwise
v3.0.1
Published
encode/decode with same encoded sort order as bytewise
Downloads
64,729
Readme
charwise
like bytewise, except as strings.
codec for js values (including arrays) that preserves lexiographic sort order when encoded. (the order is compatible with bytewise and thus indexeddb and couchdb, but the encoded format is different)
The api provided follows the level codec standard so this encoding can easily be used with level and flumedb
motivation
for building indexes on top of leveldb, bytewise is great! it lets you structure you keys and reason about how they will be ordered in a very simple and reliable way.
But bytewise is too slow! it's slow enough to have quite visible effects on a bulk load on a reasonable sized database with a couple of indexes. (i.e. 100k secure-scuttlebutt messages with indexes, measured by bench-ssb)
stability: experimental
Expect breaking changes to encoded format. We are still making breaking changes if necessary to improve performance.
(although, codec api is fully stable and will not change)
simple benchmark
run a simple benchmark for one second, encoding & decoding ops in one second.
# name, ops, multiplier
bytewise encode 35661
charwise encode 131366 x3.6
bytewise decode 107571
charwise decode 144557 x1.3
It was easy to make charwise faster than bytewise when it was only a partial implementation, but once correct escaping and nested arrays where added it got slow.
But then @PaulBlanche had the genious idea of encoding items in an array with their depth inside the array. This supports deeply nested arrays or shallowly nested arrays with only one pass escaping the items. This made encoding much faster again!
License
MIT