@sephir/easy-search
v0.9.6
Published
easy search system
Downloads
1
Readme
A Painfully Simple Search System.
Make files in folder/s searchable. (NODE JS)
Install
npm i @sephir/easy-search
Usage
var easysearch = require('@sephir/easy-search')
// feed an array of folders containing the documents. It doesn't do subfolders.
var searchIndex = await easysearch.indexFolder( ["some/folder/here", "another/folder/here"], html=false, contextWindowSize=10 )
//If your documents are html, then flip the html parameter to true, so that tags can be ignored.
// To search feed the index object and the query
var results = easysearch.search( searchIndex, "your query here")
// results contains an array of objects, each object contains a document number and a tf-idf score assigned to it , given your query.
Example
For a working example check out test.js. It's a simple file showing how to index a folder, store the created index and how to retrieve results.
Serialise/Deserialise the Index
// Store your index in a folder
await easysearch.storeIndex( searchIndex, "/some/folder/name" ) // path is a folder
// Reload your index from a file
var searchIndex = await easysearch.reloadIndex("/some/folder/name") // path is a folder
// If you want to create a single file with the index
await easysearch.storeIndexAsJSONFile(searchIndex, "some/folder/theindex.json") // path is a file
UPDATES! v0.9.6
- Saving index to a single JSON file. This facilitates for a search index to be used statically on a UI interface.
UPDATES! v0.9
- Improved storage which allows now a LOT more documents to be indexed. Limit being what you can load into RAM.
- Performance note: Indexing 128k html documents takes ~300 seconds on my hardware.
- Added context of query words. In search results you can find chunks of text where the query terms have been found.
- Size of chunks can be configured within the indexFolder parameter ``contextWindowSize''
Get in touch!
If you like the project and want to tell me or propose features let me know!: mailto
Disclaimer
There are many search solutions much more powerful than this one, however they all require substantial learning before using them. Easy-search has been built to provide a very simple search functionality that should be more than enough for most cases, when your dataset consists of a few thousand documents.