npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2024 – Pkg Stats / Ryan Hefner

streamsearch-web

v1.0.0

Published

Streaming Boyer-Moore-Horspool searching for the browser

Downloads

253

Readme

streamsearch-web

A web stream that allows searching for a needle in the stream data using the Boyer-Moore-Horspool algorithm.

This module is forked from streamsearch, which is based heavily on the Streaming Boyer-Moore-Horspool C++ implementation by Hongli Lai here.

Install

npm install streamsearch-web

Usage

Example

import { StreamSearch } from 'streamsearch-web'

const needle = new TextEncoder().encode('\r\n')
const ss = new StreamSearch(needle)

const chunks = [
  'foo',
  ' bar',
  '\r',
  '\n',
  'baz, hello\r',
  '\n world.',
  '\r\n Browser rules!!\r\n\r\n',
]
const src = new ReadableStream({
  pull (controller) {
    const chunk = chunks.shift()
    if (!chunk) return controller.close()
    const buf = new TextEncoder().encode(chunk)
    controller.enqueue(buf)
  }
})

for await (const { isMatch, data, begin, end } of src.pipeThrough(ss)) {
  if (data)
    console.log(`data: "${new TextDecoder().decode(data.subarray(begin, end))}"`)
  if (isMatch)
    console.log('match!')
}

// output:
//
// data: "foo"
// data: " bar"
// match!
// data: "baz, hello"
// match!
// data: " world."
// match!
// data: " Browser rules!!"
// match!
// data: ""
// match!

Advanced

The transform stream allows for easy streaming usage, but due to the async nature of streams involves extra copy operations to ensure all data is safe to use further down the pipeline, even after the next chunk of data has been read and processed by the stream. To potentially avoid these additional copies, the underlying SBMH class can be used directly, which operates similarly to the original streamsearch module, except that it uses generator functions instead of a callback (and uses Uint8Arrays instead of the Node.js Buffer).

import { SBMH } from 'streamsearch-web'

const needle = new TextEncoder().encode('\r\n')
const sbmh = new SBMH(needle)

const chunks = [
  'foo',
  ' bar',
  '\r',
  '\n',
  'baz, hello\r',
  '\n world.',
  '\r\n Browser rules!!\r\n\r\n',
]

for (const chunk of chunks) {
  const buf = new TextEncoder().encode(chunk)
  for (const { isMatch, data, begin, end, isSafe } of sbmh.push(buf)) {
    if (data) {
      console.log(`data: "${new TextDecoder().decode(data.subarray(begin, end))}"`)
      // `isSafe` indicates if it is safe to store a reference to `data` (e.g.
      // as-is or via `data.slice()`) or not, as in some cases `data` may point
      // to a `Uint8Array` whose contents change over time.
      if (!isSafe)
        // Take a copy if data is needed after this loop iteration.
        data.slice()
    }
    if (isMatch)
      console.log('match!')
  }
}

API

class StreamSearch extends TransformStream<Uint8Array, MaybeMatch>

StreamSearch is a TransformStream that searches for a needle within the data piped to the writable side. The readable side yields MaybeMatch objects, which is either a indication that a match was made or a chunk of data that didn't match.

constructor (needle: Uint8Array, options?: { maxMatches?: number })

Creates and returns a new instance for searching for a Uint8Array needle.

The maxMatches option indicates the maximum number of matches that can be made. Defaults to Infinity.

matches: number

The current match count.

reset (): void

Resets internal state. Useful for when you wish to start searching a new/different stream for example.

type MaybeMatch

This object is yielded from the readable side of the stream any time there is non-matching data or there is a needle match.

  1. isMatch: boolean - Indicates whether a match has been found.
  2. hasNonMatchData: boolean - Indicates whether non matched data is also present.
  3. data?: Uint8Array - If set, this contains data that did not match the needle.
  4. begin?: number - The index in data where the non-matching data begins (inclusive).
  5. end?: number - The index in data where the non-matching data ends (exclusive).

Contributing

Feel free to join in. All welcome. Open an issue!

License

Licensed under MIT