annotatedtext

v1.2.1

A lightweight JavaScript library for converting various markup documents into an annotated text format consumable by LanguageTool.

Downloads

4,346

annotatedtext

A lightweight JavaScript library for converting markup documents into an annotated text format consumable by LanguageTool as AnnotatedText.

Usage

See API below for details.

Motivation

Provide an easier way of running LanguageTool on documents by separating the text from the markup.

Installation

npm:

npm install annotatedtext

API

compose(text, node, options = defaults)

Returns Annotated Text as described by LanguageTool's API:

{
  "annotation": [
    { "text": "A " },
    { "markup": "<b>" },
    { "text": "test" },
    { "markup": "</b>" }
  ]
}

Run the object through JSON.stringify() to get a string suitable for passing to LanguageTool's data parameter.

This is the main function you'll use when implementing support for different parsers.

import { compose } from "annotatedtext";
import remarkFrontmatter from "remark-frontmatter";
import remarkParse from "remark-parse";
import { unified } from "unified";

const text = "This is a sentence.";

// Parse the document into an AST; compose expects its root node.
const ast = unified()
  .use(remarkParse)
  .use(remarkFrontmatter, ["yaml", "toml"])
  .parse(text);

const annotatedtext = compose(text, ast);
JSON.stringify(annotatedtext);

  • text: The text from the markup document in its original form.
  • node: The root AST node that conforms to INode.
  • options: (optional) See defaults.
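
As a rough sketch of the last step, the serialized object could be placed into a form-encoded request body for LanguageTool's HTTP check endpoint. The buildCheckParams helper and the "en-US" default are assumptions made for illustration, not part of annotatedtext; language and data are parameter names from LanguageTool's API, and no request is sent here.

```javascript
// Hypothetical helper (not part of annotatedtext): builds the form-encoded
// body for a LanguageTool check request from an annotated text object.
function buildCheckParams(annotatedtext, language = "en-US") {
  const params = new URLSearchParams();
  params.set("language", language);
  // LanguageTool expects the annotated text as a JSON string in "data".
  params.set("data", JSON.stringify(annotatedtext));
  return params;
}

// Sample object in the shape returned by compose.
const annotated = {
  annotation: [
    { text: "A " },
    { markup: "<b>" },
    { text: "test" },
    { markup: "</b>" },
  ],
};

const params = buildCheckParams(annotated);
console.log(params.toString());
```

The resulting params object could then be used as the body of a POST request to whichever LanguageTool server you run or have access to.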

~~build(text, parse, options = defaults)~~

Deprecated

Use compose instead.

Returns Annotated Text as described by LanguageTool's API:

{
  "annotation": [
    { "text": "A " },
    { "markup": "<b>" },
    { "text": "test" },
    { "markup": "</b>" }
  ]
}

Run the object through JSON.stringify() to get a string suitable for passing to LanguageTool's data parameter.

This is the main function you'll use when implementing support for different parsers.

import * as builder from "annotatedtext";
import remarkParse from "remark-parse";
import { unified } from "unified";

const text = "This is a sentence.";
const processor = unified().use(remarkParse);
const annotatedtext = builder.build(text, (doc) => processor.parse(doc));
JSON.stringify(annotatedtext);

  • text: The text from the markup document in its original form.
  • parse: A function that parses a markup document and returns an abstract syntax tree.
  • options: (optional) See defaults.

collecttextnodes(ast, options = defaults)

Returns an array of annotated text nodes used in the final annotated text object.

  • ast: An abstract syntax tree.
  • options: (optional) See defaults.
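
To make the idea concrete, here is a minimal sketch of how text nodes might be collected from a tree using the children and annotatetextnode options. It is an illustration only and not the library's actual implementation; collectTextNodesSketch, sketchOptions, and the hand-built AST are all hypothetical.

```javascript
// Illustrative traversal only; the library's own implementation may differ.
function collectTextNodesSketch(ast, options) {
  const nodes = [];
  function visit(node) {
    // Ask the option function whether this node carries natural language text.
    const annotated = options.annotatetextnode(node);
    if (annotated !== null) {
      nodes.push(annotated);
    }
    // Recurse into children, tolerating leaf nodes without a children array.
    for (const child of options.children(node) || []) {
      visit(child);
    }
  }
  visit(ast);
  return nodes;
}

// Minimal stand-ins for the default functions described in this README.
const sketchOptions = {
  children: (node) => node.children,
  annotatetextnode: (node) =>
    node.type === "text"
      ? {
          text: node.value,
          offset: {
            start: node.position.start.offset,
            end: node.position.end.offset,
          },
        }
      : null,
};

// Hand-built AST for the markdown source "A **test**".
const ast = {
  type: "root",
  children: [
    {
      type: "paragraph",
      children: [
        { type: "text", value: "A ", position: { start: { offset: 0 }, end: { offset: 2 } } },
        {
          type: "strong",
          children: [
            { type: "text", value: "test", position: { start: { offset: 4 }, end: { offset: 8 } } },
          ],
        },
      ],
    },
  ],
};

const textNodes = collectTextNodesSketch(ast, sketchOptions);
console.log(textNodes);
```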

composeannotation(text, annotatedtextnodes, options = defaults)

defaults

annotatedtext comes with the following default functions used throughout.

const defaults = {
  children: function (node) {
    return node.children;
  },
  annotatetextnode: function (node) {
    if (node.type === "text") {
      return {
        text: node.value,
        offset: {
          start: node.position.start.offset,
          end: node.position.end.offset,
        },
      };
    } else {
      return null;
    }
  },
  interpretmarkup: function (text = "") {
    return "";
  },
};

Functions can be overridden by making a copy of the defaults and assigning a new function. For example, the tests use markdown and need to interpret new lines in the markup as new lines. The interpretmarkup function is overridden as:

// Copy the defaults so the shared object is not mutated.
const options = { ...builder.defaults };
options.interpretmarkup = function (text) {
  const count = (text.match(/\n/g) || []).length;
  return "\n".repeat(count);
};

children(node)

Expected to return an array of child nodes.
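
For a tree whose nodes keep their children under a different property, the function could be swapped out along these lines. This is a sketch under assumptions: defaultsSketch stands in for the library defaults, and the childNodes property and node shapes are hypothetical examples.

```javascript
// Stand-in for the library defaults (only the part relevant here).
const defaultsSketch = {
  children(node) {
    return node.children;
  },
};

// Copy the defaults, then override children for a tree that uses childNodes.
const htmlOptions = {
  ...defaultsSketch,
  children(node) {
    // Leaf nodes have no childNodes, so fall back to an empty array.
    return node.childNodes || [];
  },
};

// Hypothetical node shapes for illustration.
const element = { nodeName: "b", childNodes: [{ nodeName: "#text", value: "test" }] };
const leaf = { nodeName: "#text", value: "test" };

console.log(htmlOptions.children(element).length);
console.log(htmlOptions.children(leaf).length);
```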

annotatetextnode(node)

Expected to return a structure for a text AST node with at least the following:

  • text is the natural language text from the node, devoid of all markup.
  • offset contains offsets used to extract markup text from the original document.
    • start is the start offset of the text
    • end is the end offset of the text

{
  "text": "A snippet of the natural language text from the document.",
  "offset": {
    "start": 1,
    "end": 57
  }
}

If the node is not a text node, the function must return null.
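
The role of the offsets can be sketched as follows: everything in the original document that falls between two text nodes is treated as markup. This is a simplified illustration, not the library's implementation; composeSketch and the sample input are hypothetical.

```javascript
// Illustrative only: interleaves markup and text using each text node's offsets.
function composeSketch(source, textNodes) {
  const annotation = [];
  let cursor = 0;
  for (const node of textNodes) {
    // Anything between the previous text node and this one is markup.
    if (node.offset.start > cursor) {
      annotation.push({ markup: source.slice(cursor, node.offset.start) });
    }
    annotation.push({ text: node.text });
    cursor = node.offset.end;
  }
  // Trailing markup after the last text node.
  if (cursor < source.length) {
    annotation.push({ markup: source.slice(cursor) });
  }
  return { annotation };
}

const source = "A <b>test</b>";
const textNodes = [
  { text: "A ", offset: { start: 0, end: 2 } },
  { text: "test", offset: { start: 5, end: 9 } },
];

const result = composeSketch(source, textNodes);
console.log(JSON.stringify(result, null, 2));
```

With this input, the sketch yields the text entries "A " and "test" with the markup entries "<b>" and "</b>" between and after them.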

interpretmarkup(text)

Used to make sure LanguageTool knows when markup represents some form of whitespace. As mentioned above, the tests override this function to ensure new lines captured as markup are also visible to LanguageTool.

// Copy the defaults so the shared object is not mutated.
const options = { ...builder.defaults };
options.interpretmarkup = function (text) {
  const count = (text.match(/\n/g) || []).length;
  return "\n".repeat(count);
};
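
To see what this override produces, the same function can be run on a couple of sample markup strings. LanguageTool's annotation format accepts an interpretAs field next to markup; the markupEntry helper below, and the assumption that the interpreted value ends up in that field, are for illustration only.

```javascript
// The newline-counting override, written as a standalone function.
function interpretNewlines(text) {
  const count = (text.match(/\n/g) || []).length;
  return "\n".repeat(count);
}

// Hypothetical helper: builds a markup entry with its interpretation attached.
function markupEntry(markup, interpret) {
  return { markup, interpretAs: interpret(markup) };
}

// Markup between a paragraph and a heading: two newlines survive.
const blockBreak = markupEntry("\n\n## ", interpretNewlines);

// Inline markup without newlines is interpreted as an empty string.
const inline = markupEntry("**", interpretNewlines);

console.log(JSON.stringify(blockBreak));
console.log(JSON.stringify(inline));
```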

Tests

Unit tests are run via npm:

npm test

Implemented Parsers

The following packages wrap annotatedtext for specific parsers:

License

MIT © David L. Day