pegase
v0.6.3
Published
An inline, fast, powerful and lightweight PEG parser generator for JavaScript and TypeScript, with semantic actions, parametrized rules, support for native regexps, error recovery, warnings, integrated AST generation and visitors, cut operator, back refer
Downloads
36
Maintainers
Readme
Pegase
⚠️ This library is still under development. This is a pre-release but some functionalities might still change.
Pegase is a PEG parser generator for JavaScript and TypeScript. It's:
- Inline, meaning grammars are directly expressed as tagged template literals. No generation step, no CLI. Pegase works in symbiosis with JS.
- Fast. Pegase is heavily optimized to be extremely fast while providing an extensive range of features.
- Complete. Pegase has everything you will ever need: an elegant grammar syntax with lots of flexibility, semantic actions, parametrized rules, support for native regexps, error recovery, warnings, integrated AST generation and visitors, cut operator, back references, grammar merging, and a lot more.
- Lightweight. Pegase is a zero-dependency package, and weights around 9kB gzipped.
- Intuitive, in that it lets you express complex processes in very simple ways. You will never feel lost.
- Extensible: You can define your own
Parser
subclasses, add plugins, write custom directives, etc.
🔗 Go to the official website
What Pegase has to offer:
Concise, readable yet powerful parsers in symbiosis with JS
The following parses math expressions and calculates the result on the fly:
import peg from "pegase";
function calc(left, op, right) {
switch (op) {
case "+": return left + right;
case "-": return left - right;
case "*": return left * right;
case "/": return left / right;
}
}
const expr = peg`
expr: term % ("+" | "-") @infix(${calc})
term: fact % ("*" | "/") @infix(${calc})
fact: $integer | '(' expr ')'
$integer @number: '-'? [0-9]+
`;
expr.value("2 + (17-2*30) *(-5)+2")
219
expr.test("2* (4 + )/32")
false
expr.parse("2* (4 + )/32").log()
(1:9) Failure: Expected integer or "("
> 1 | 2* (4 + )/32
| ^
Read more in Building parsers and Semantic action and dataflow.
Automatic whitespace skipping
const bitArray = peg`'[' (0 | 1) % ',' ']'`;
bitArray.test(" [ 0,1 ,0 , 1, 1] "); // true
With, obviously, the possibility to opt-out and do it yourself:
const g = peg`
array: '[' _ '1' % ',' _ ']'
_: \s+
`;
g.test("[1,1,1]", { skip: false }); // Opt-out as a one-off
// Or for all executions by default:
g.defaultOptions.skip = false;
g.test("[1,1,1]"); // false
g.test("[ 1,1,1 ]"); // true
g.test("[ 1, 1,1 ]"); // false
Read more in Handling whitespaces.
Great warning and failure reports
import peg, { $raw, $warn } from "pegase";
function isCap(str) {
return /^[A-Z]/.test(str)
}
const g = peg`
classDef:
'class'
($identifier ${() => {
if (!isCap($raw())) $warn("Class names should be capitalized");
}})
'{' '}'
$identifier: [a-zA-Z]+
`;
g.parse("class test {").log()
(1:7) Warning: Class names should be capitalized
> 1 | class test {
| ^
(1:13) Failure: Expected "}"
> 1 | class test {
| ^
Read more in Failures and warnings.
Parametrized rules
With the possibility of omitted parameters and default parameter values.
const g = peg`
root: array | array('a') | array('b' | 'c')
array(item = \d): '[' commaList(item) ']'
commaList(item): item % ','
`;
g.test("[ a, a, a, a]"); // true
g.test("[ a, 5, a, a]"); // false
g.test("[b, c]"); // true
g.test("[b, a]"); // false
g.test("[4, 5, 3, 9, 0]"); // true
Report multiple failures with recovery
const g = peg`
bitArray: '[' (bit | sync) % ',' ']'
bit: 0 | 1
sync: @@commit ...&(',' | ']')
`;
g.parse("[0, 4, 1, 2, 0, 1]").log()
(1:5) Failure: Expected "0" or "1"
> 1 | [0, 4, 1, 2, 0, 1]
| ^
(1:11) Failure: Expected "0" or "1"
> 1 | [0, 4, 1, 2, 0, 1]
| ^
Read more in Error recovery.
Support for native RegExp
const time = /(\d+):(\d+)/;
const minutes = peg`
${time} ${() => {
const [hr, min] = $children();
return 60 * Number(hr) + Number(min);
}}
`;
minutes.value("2:43"); // 163
Pegase also supports regex literals:
const minutes = peg`
/(\d+):(\d+)/ ${() => {
const [hr, min] = $children();
return 60 * Number(hr) + Number(min);
}}
`;
Named capturing groups are converted to Pegase captures:
const date = /(?<year>\d{4})-(?<month>\d{2})-(?<day>\d{2})/;
const yearIs = peg`
${date} ${({ year }) => "The year is " + year}
`;
yearIs.value("2021-08-19"); // "The year is 2021"
Read more in Working with RegExp.
Painless AST and visitors
const prefix = peg`
expr:
| <>$integer => 'INT'
| '+' <a>expr <b>expr => 'PLUS'
$integer @raw: \d+
`;
const sumVisitor = {
INT: node => Number(node.$integer),
PLUS: node => $visit(node.a) + $visit(node.b)
};
prefix.value("182", { visit: sumVisitor }); // 182
prefix.value("+ 12 + 42 3", { visit: sumVisitor }); // 57
Read more in AST and visitors.
Extensible functionalities
Here we define a directive @max
import peg, { $children } from "pegase";
peg.plugins.push({
directives: {
max: parser => peg`${parser} ${() => Math.max(...$children())}`
}
});
const max = peg`
list: $int+ @max
$int: \d+ @number
`;
max.value("36 12 42 3"); // 42
Read more in Writing a plugin.
And a lot more
There is so much more to see. To learn more about Pegase, please go to the official website.