token-types
v6.0.0
Published
Common token types for decoding and encoding numeric and string values
Downloads
22,068,771
Maintainers
Readme
token-types
A primitive token library used to read and write from a Uint8Array
.
Although it is possible to use this module directly, it is primarily designed to be used with strtok3 tokenizer.
Compatibility
Module: version 5 migrated from CommonJS to pure ECMAScript Module (ESM). JavaScript is compliant with ECMAScript 2019 (ES10).
Installation
npm install --save token-types
Usually in combination with strtok3:
npm install --save strtok3
Using TypeScript you should also install @tokenizer/token as a development dependency:
npm install --save-dev @tokenizer/token
Example
import * as strtok3 from 'strtok3';
import * as token from 'token-types';
(async () => {
const tokenizer = await strtok3.fromFile("somefile.bin");
try {
const myNumber = await tokenizer.readToken(token.Float32_BE);
console.log(`My number: ${myNumber}`);
} finally {
tokenizer.close(); // Close the file
}
})();
Tokens
Numeric tokens
token-types
supports a wide variety of numeric tokens out of the box:
| Token | Number | Bits | Endianness |
|---------------|------------------|------|----------------|
| UINT8
| Unsigned integer | 8 | n/a |
| UINT16_BE
| Unsigned integer | 16 | big endian |
| UINT16_LE
| Unsigned integer | 16 | little endian |
| UINT24_BE
| Unsigned integer | 24 | big endian |
| UINT24_LE
| Unsigned integer | 24 | little endian |
| UINT32_BE
| Unsigned integer | 32 | big endian |
| UINT32_LE
| Unsigned integer | 32 | little endian |
| UINT64_BE
| Unsigned integer | 64 | big endian |
| UINT64_LE
* | Unsigned integer | 64 | little endian |
| INT8
| Signed integer | 8 | n/a |
| INT16_BE
| Signed integer | 16 | big endian |
| INT16_LE
| Signed integer | 16 | little endian |
| INT24_BE
| Signed integer | 24 | big endian |
| INT24_LE
| Signed integer | 24 | little endian |
| INT32_BE
| Signed integer | 32 | big endian |
| INT32_LE
| Signed integer | 32 | little endian |
| INT64_BE
| Signed integer | 64 | big endian |
| INT64_LE
* | Signed integer | 64 | little endian |
| Float16_BE
| IEEE 754 float | 16 | big endian |
| Float16_LE
| IEEE 754 float | 16 | little endian |
| Float32_BE
| IEEE 754 float | 32 | big endian |
| Float32_LE
| IEEE 754 float | 32 | little endian |
| Float64_BE
| IEEE 754 float | 64 | big endian |
| Float64_LE
| IEEE 754 float | 64 | little endian |
| Float80_BE
* | IEEE 754 float | 80 | big endian |
| Float80_LE
* | IEEE 754 float | 80 | little endian |
(*) The tokens exceed the JavaScript IEEE 754 64-bit Floating Point precision, decoding and encoding is best effort based.
String tokens
StringType decoding is implemented using TextDecoder which supports a large number of encodings including but not limited to:
- UTF-8 (the default)
- Windows-1252
- ISO-8859-1
Check out the MDN web docs for the TextDecoder for a complete list
Custom tokens
Custom tokens can be added, suitable for reading binary files or network messages:
ExtendedHeader = {
len: 10,
get: (buf, off) => {
return {
// Extended header size
size: Token.UINT32_BE.get(buf, off),
// Extended Flags
extendedFlags: Token.UINT16_BE.get(buf, off + 4),
// Size of padding
sizeOfPadding: Token.UINT32_BE.get(buf, off + 6),
// CRC data present
crcDataPresent: common.strtokBITSET.get(buf, off + 4, 31)
};
}
};