bm25-se

v0.0.3

A lightweight search engine based on the Okapi BM25 algorithm.

bm25-se is a simple search engine that indexes documents and searches them with a free-text query. Results are scored with the BM25 ranking function.

Installation

To use this search engine, you need Node.js installed on your machine. Install the package from npm:

npm i bm25-se

Usage

const { SearchEnginee, TokenizerEn } = require('bm25-se')

// create a new search engine instance
const searchEngine = new SearchEnginee({ tokenizer: new TokenizerEn() })

// add documents to the index
searchEngine.addDocument("this is an example document")
searchEngine.addDocument("this is another document")
searchEngine.addDocument("yet document")

// search for documents that match the query "example document"
const searchResults = searchEngine.search("example document")

// should output a list of documents that match the query, ordered by relevance
console.log(searchResults)

Demo

You can try out the search engine in your browser by cloning the repository and running the following commands:

git clone https://github.com/Selenium39/bm25-se.git 
cd bm25-se
npm install
npm run web

Then open http://localhost:3000 in your browser.

API

SearchEnginee(options)

Creates a new instance of the search engine.

  • options (optional): An object with the following properties:
    • tokenizer (optional): A tokenizer object to use for tokenizing the documents. Defaults to new TokenizerEn().
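    • k1 (optional): The BM25 term-frequency saturation parameter. Higher values let repeated occurrences of a term keep raising the score. Defaults to 1.2.
    • b (optional): The BM25 length-normalization parameter, between 0 (document length is ignored) and 1 (full normalization). Defaults to 0.75.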
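
To make the roles of k1 and b concrete, here is a minimal sketch of the standard Okapi BM25 score for a single query term in a single document. It is not taken from the bm25-se source: the function names (idf, bm25TermScore) and the field names are hypothetical, and the library may use a different IDF variant.

```javascript
// Sketch of the Okapi BM25 score for one query term in one document.
// All names here are illustrative assumptions, not bm25-se internals.
function idf(totalDocs, docsWithTerm) {
  // The "1 +" inside the log keeps the value non-negative for very common terms.
  return Math.log(1 + (totalDocs - docsWithTerm + 0.5) / (docsWithTerm + 0.5));
}

function bm25TermScore({ tf, docLength, avgDocLength, totalDocs, docsWithTerm, k1 = 1.2, b = 0.75 }) {
  // Length normalization: b = 0 ignores document length, b = 1 applies it fully.
  const norm = 1 - b + b * (docLength / avgDocLength);
  // Term-frequency saturation: each extra occurrence adds less than the previous one.
  const saturatedTf = (tf * (k1 + 1)) / (tf + k1 * norm);
  return idf(totalDocs, docsWithTerm) * saturatedTf;
}

const base = { docLength: 5, avgDocLength: 5, totalDocs: 3, docsWithTerm: 1 };
const once = bm25TermScore({ ...base, tf: 1 });
const twice = bm25TermScore({ ...base, tf: 2 });
console.log(once.toFixed(3), twice.toFixed(3));
```

A document's score for a whole query is the sum of this value over the query terms. Doubling the term frequency raises the score by less than a factor of two, which is the saturation effect that k1 controls.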

reset()

Resets the search engine, removing all indexed documents and terms.

addDocument(document)

Indexes a new document in the search engine.

  • document: A string representing the content of the document to index.
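
As an illustration of what indexing a document can involve, the sketch below builds a small inverted index that records term frequencies and document lengths, which are the statistics BM25 needs. The TinyIndex class and its fields are hypothetical; the internal data structures of bm25-se may differ.

```javascript
// Hypothetical minimal index; not the bm25-se implementation.
class TinyIndex {
  constructor() {
    this.docs = [];            // original document strings, by id
    this.docLengths = [];      // token count per document
    this.postings = new Map(); // term -> Map(docId -> term frequency)
  }

  addDocument(text) {
    const docId = this.docs.length;
    // Naive whitespace tokenization, standing in for a real tokenizer.
    const tokens = text.toLowerCase().split(/\s+/).filter(Boolean);
    this.docs.push(text);
    this.docLengths.push(tokens.length);
    for (const token of tokens) {
      if (!this.postings.has(token)) this.postings.set(token, new Map());
      const perDoc = this.postings.get(token);
      perDoc.set(docId, (perDoc.get(docId) || 0) + 1);
    }
    return docId;
  }
}

const index = new TinyIndex();
index.addDocument("this is an example document");
index.addDocument("this is another document");
index.addDocument("yet document");
console.log(index.postings.get("document").size); // number of documents containing "document"
console.log(index.docLengths);
```

Storing postings per term means a search only has to visit documents that contain at least one query term, rather than scanning every document.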

search(query)

Searches the indexed documents for the given query string and returns a list of matching documents, ordered by relevance.

  • query: A string representing the search query.

TokenizerEn

A tokenizer object for tokenizing English text.
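
The tokenizer interface is not documented here, so the following is only a sketch of what an English tokenizer might do: lowercase the text and split it on anything that is not a letter or digit. The class name SimpleEnglishTokenizer and the tokenize method name are assumptions, not the TokenizerEn API.

```javascript
// Hypothetical tokenizer; the `tokenize` method name is an assumption.
class SimpleEnglishTokenizer {
  tokenize(text) {
    return text
      .toLowerCase()
      .split(/[^a-z0-9]+/) // split on anything that is not an ASCII letter or digit
      .filter((token) => token.length > 0);
  }
}

const englishTokens = new SimpleEnglishTokenizer().tokenize("Yet another Document, indexed!");
console.log(englishTokens);
```

Lowercasing at both index time and query time is what lets a query such as "document" match "Document" in the indexed text.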

TokenizerZh

A tokenizer object for tokenizing Chinese text.
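
Chinese text has no spaces between words, so it needs a different tokenization strategy. How TokenizerZh segments text is not stated here; the sketch below shows one common dictionary-free approach, overlapping character bigrams, purely as an illustration. The function name hanBigrams is hypothetical.

```javascript
// Illustrative bigram tokenizer for Han text; not the TokenizerZh implementation.
function hanBigrams(text) {
  const tokens = [];
  // Work on each unbroken run of Han characters so bigrams never span punctuation.
  const runs = text.match(/\p{Script=Han}+/gu) || [];
  for (const run of runs) {
    const chars = Array.from(run); // Array.from keeps surrogate pairs intact
    if (chars.length === 1) {
      tokens.push(chars[0]);
      continue;
    }
    for (let i = 0; i < chars.length - 1; i++) {
      tokens.push(chars[i] + chars[i + 1]);
    }
  }
  return tokens;
}

const chineseTokens = hanBigrams("搜索引擎");
console.log(chineseTokens);
```

Bigrams trade precision for simplicity: they need no dictionary, but they also produce tokens that are not real words, which a dictionary-based segmenter would avoid.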

Contributing

If you find any bugs or have any suggestions, feel free to open an issue or a pull request.

License

MIT License