@groupdocs/groupdocs.search
v24.7.0
Powerful full-text search engine for PDF, Word, Excel, PowerPoint and image files.
GroupDocs.Search for Node.js via Java helps you build reliable, smart, feature-rich search applications for your end users, with support for all popular document formats. It extracts text and metadata from files and searches across all of them. To keep searching fast and accurate, you first create an index and add documents to it. All search queries, simple or advanced, are then run against the index rather than against the original files.
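To illustrate why searching an index is faster than rescanning files, here is a minimal conceptual sketch of an in-memory inverted index. It is not the GroupDocs.Search API or its implementation: `TinyIndex`, `add`, `search`, and the sample file names are hypothetical names chosen for this illustration.

```javascript
// Conceptual sketch only: a tiny in-memory inverted index.
// TinyIndex, add, and search are hypothetical names, not the GroupDocs.Search API.
class TinyIndex {
  constructor() {
    // Maps each term to the set of document ids that contain it.
    this.postings = new Map();
  }

  // Split text into lowercase word tokens (letters and digits only).
  static tokenize(text) {
    return text.toLowerCase().match(/[\p{L}\p{N}]+/gu) ?? [];
  }

  // Index one document: record its id under every term it contains.
  add(docId, text) {
    for (const term of TinyIndex.tokenize(text)) {
      if (!this.postings.has(term)) this.postings.set(term, new Set());
      this.postings.get(term).add(docId);
    }
  }

  // Look a single word up in the index; no document text is rescanned.
  search(word) {
    const ids = this.postings.get(word.toLowerCase());
    return ids ? [...ids].sort() : [];
  }
}

const tinyIndex = new TinyIndex();
tinyIndex.add('report.pdf', 'Quarterly sales report');
tinyIndex.add('notes.docx', 'Meeting notes about sales targets');

console.log(tinyIndex.search('sales'));  // [ 'notes.docx', 'report.pdf' ]
console.log(tinyIndex.search('budget')); // []
```

Each lookup costs one map access regardless of how many documents were added, which is the general reason full-text engines build an index before searching.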
Document Indexing Features
- 80+ supported file formats.
- Create index in memory or on disk.
- Update index to take into account changed, deleted and added documents.
- Merge several indexes into one.
- Optimize index to improve search performance.
- Indexing of password-protected documents.
- Indexing with stop words.
- Support for indexing additional fields.
- Support for blended characters.
- Support for characters indexed as a whole word.
- Support for character replacement during indexing.
- Support for custom text extractors.
- Options for compact and metadata indexes.
- Ability to save extracted text in the index with different levels of compression.
- Document filtering during indexing.
- Deleting indexed paths from index.
- Ability to separately extract data from documents and index them.
- Support for optical character recognition (OCR) on images.
- Calculation and indexing of image hashes for reverse image search.
- The ability to create a distributed search network that automatically balances the load across nodes.
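Two of the indexing features listed above, stop words and character replacement, can be pictured as preprocessing steps applied to text before terms reach the index. The sketch below is a rough illustration under that assumption, not the library's API: `prepareTerms`, `stopWords`, and `replacements` are hypothetical names, and the word list and character map are made-up sample data.

```javascript
// Conceptual sketch only; not the GroupDocs.Search API.
// prepareTerms, stopWords, and replacements are hypothetical names.
const stopWords = new Set(['a', 'an', 'the', 'of', 'and']);
const replacements = new Map([['é', 'e'], ['ü', 'u']]);

function prepareTerms(text) {
  // Normalize to composed form, lowercase, then replace characters one by one.
  // Characters without a mapping pass through unchanged.
  const replaced = [...text.normalize('NFC').toLowerCase()]
    .map((ch) => replacements.get(ch) ?? ch)
    .join('');
  // Split into words, then drop stop words so they never reach the index.
  const words = replaced.match(/[\p{L}\p{N}]+/gu) ?? [];
  return words.filter((word) => !stopWords.has(word));
}

console.log(prepareTerms('The résumé of an engineer'));
// [ 'resume', 'engineer' ]
```

Dropping stop words keeps the index smaller, and replacing characters lets a query for "resume" match a document that contains "résumé".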
Document Search Features
- Simple word search.
- Boolean search.
- Regular expression search.
- Faceted search.
- Case sensitive search.
- Flexible fuzzy search.
- Synonym search.
- Homophone search.
- Wildcard search.
- Phrase search with wildcards.
- Search for different word forms.
- Date range search.
- Numeric range search.
- Search by chunks (pages).
- Document filtering in search results.
- Search for different object types: text, numbers, dates, file names, document types, metadata fields, document creation/modification dates.
- Combine different types of search into one search query.
- Alias substitution in search queries.
- Perform spell check during search.
- Perform keyboard layout correction during search.
- Search queries in text or flexible object form.
- Highlight search results in the text of the entire document or in text segments.
- Multiple simultaneous thread-safe searches.
- Thread-safe search during indexing, updating, or merging operations.
- Search over several indexes simultaneously.
- Built-in support for reverse image search.
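As one example of how the search types listed above can work over an index, Boolean search can be viewed as set operations on the document ids stored for each term. The following is a minimal sketch under that assumption, not the GroupDocs.Search query syntax or API: `postings`, `allDocs`, `lookup`, `and`, `or`, and `not` are hypothetical names, and the document ids are sample data.

```javascript
// Conceptual sketch only; not the GroupDocs.Search API or query syntax.
// postings, allDocs, lookup, and, or, and not are hypothetical names.
const postings = new Map([
  ['search', new Set([1, 2, 3])],
  ['index', new Set([2, 3])],
  ['image', new Set([3, 4])],
]);
const allDocs = new Set([1, 2, 3, 4]);

// Terms missing from the index match no documents.
const lookup = (term) => postings.get(term) ?? new Set();
// AND keeps ids present in both sets; OR merges them; NOT complements against all documents.
const and = (a, b) => new Set([...a].filter((id) => b.has(id)));
const or = (a, b) => new Set([...a, ...b]);
const not = (a) => new Set([...allDocs].filter((id) => !a.has(id)));

// Query: search AND index AND NOT image
const hits = and(and(lookup('search'), lookup('index')), not(lookup('image')));
console.log([...hits]); // [ 2 ]
```

Because every operator takes and returns a set, operators nest freely, which is one way different search conditions can be combined into a single query.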
Getting Started with GroupDocs.Search for Node.js via Java
Installation
From the command line:
npm i @groupdocs/groupdocs.search