figolia
v0.4.6
Published
⚡️ Keep your Algolia search indexes in sync with your Firebase datasets!
Downloads
22
Maintainers
Readme
Figolia
⚡️ Keep your Algolia search indexes in sync with your Firebase datasets!
When using Firebase as a web or mobile backend, Algolia search-as-a-service is a really nice addition, considering the poor search and querying capabilities of Firebase. Figolia is the perfect companion to make their synchronization happen in no time.
(Codebase inspired by Scott Smith's work in this blog post)
2017 note: as of this year, Google released Firebase Functions, that you can use as triggers to update Algolia indices from your Firebase database. The functions are still in Beta and might get a bit pricey to use if you have a lot of mutations in your indexed search data. I am still using figolia
for the moment (in production) and keep maintaining this package.
Features
- Sync multiple Firebase references
- All CRUD Firebase operations reflected in Algolia
- Restarts from last indexing timestamp the next time it's launched
- Throttle indexing to limit Algolia API calls
- Make middle-of-word (infix) search possible
Install
$ [sudo] npm install -g figolia
Or, clone the github repository :
$ git clone https://github.com/webstylestory/figolia.git
$ cd figolia
$ npm install
Usage
$ figolia --help
Usage: figolia [options]
Options:
-h, --help output usage information
-V, --version output the version number
-c, --config [path] Specify configuration (default ~/.figolia.conf.js)
-l, --live-index Keep server running to live index Firebase operations (otherwise exit after indexing)
-r, --reset Force index reset (clear & full reindex)
-t, --timestamp-field [name] Object field name containing last modification timestamp (default 'updatedAt')
-d, --throttle-delay [n] Minimum throttle delay between Algolia API calls (in seconds, default 10)
Note: between each throttle delay, a maximum of
{ 3 * number of datasets } API calls can be made (add, update & delete)
Note: if downloaded from github, try using npm link
first, or, ./bin/figolia
Important: for the moment, there is no way to run the server without a config file, at least to provide the schema you wish to index. See the configuration.
Configuration
Copy the defaults.conf.js
and modify it according to your needs, before running the server.
(type figolia --help
to see the default configuration file location)
var CONFIG = {
// Firebase credentials
firebase: {
// Firebase project name, as seen in your web config:
// `databaseUrl: 'https://<XXXXX>.firebaseio.com'`
instance: '<XXXXX>',
// The bellow file can be downloaded from the Firebase Console in the
// last tabs of the settings of your project. NEVER SHARE THAT FILE.
// Note: you can also import `path` and use `__dirname` to refer to this file directory
// eg. serviceAccountFile: path.join(__dirname, 'serviceAccountFile.json'),
serviceAccountFile: '/path/to/serviceAccountFile.json',
// Where to store server metadata
path: 'algolia',
// Firebase token will be generated with this uid (to write above path)
uid: 'algolia'
},
// Algolia credentials
algolia: {
// Algolia application ID
applicationId: '<XXXXX>',
// *Admin* API Key
apiKey: '<XXXXX>'
},
// Fully reindex all datasets (ERASE PREVIOUS INDEX DATA)
reset: false,
// Stay running and live-index all firebase operations.
liveIndex: false,
// Minimum throttle delay between Algolia API calls (in seconds)
// Note: between each throttle delay, a maximum of
// [3 * dataset number] calls can be made (add, update & delete)
throttleDelay: 10,
// Optional, this field will be checked against last
// run date to see if reindexing is necessary.
// Field type must be UNIX timestamp (example Javascript Date.now()).
// WARNING: Without this field being corectly configured,
// everything is re-indexed at each rerun.
timestampField: 'updatedAt',
// Firebase datasets to index in Algolia (examples)
schema: {
todoLists: {
// Firebase path
path: 'app/todo',
// Algolia index (must exist already)
index: 'dev_todo_lists',
// Optional, name of ID field (otherwise,
// the Firebase object key will be used)
key: 'id',
// Optional, dataset-specific update time field
// (default is use global setting above)
timestampField: 'createdAt',
// Optional, list of fields to index
// (otherwise, every field will be indexed)
includeFields: [
'name',
'updatedAt'
],
// Optional, list of fields to exclude from index
// Note: if both are specified, `excludeFields`
// is applied *after* `includeFields`
// Nested properties can be accessed using dot notation
excludeFields: [
'passwdHash',
'nested.property'
],
// Optional, list of fields for which you need N-Gram tokens
// ex. "username: 'hermione'", will also create an additional
// field "usernameNGrams: ['ermione', 'rmione', 'mione', 'ione']"
// for each word (down to 4 chars), making *infix* search possible
// in addition to default prefix search available in Algolia out of the box
// letting users search with keyword "mione"
// Note: this can be storage-consumming for long fields, use with
// caution ! (preferably on fields with enforced size)
// Nested properties can be accessed using dot notation
ngrams: ['username', 'profile.fullName']
},
todoItems: {
// Second example dataset to index, minimal config
path: 'app/todoItems',
index: 'dev_todo_items'
}
}
};
Update from v0.3.x to v0.4.x
firebase
package is now deprecated on the server, hence the update to use firebase-admin
. You have to update your figolia configuration to remove the firebase.secret
entry, and replace it with a firebase.serviceAccountFile
pointing to your firebase key json file. It can be downloaded from the Firebase Console in the last tabs of the settings of your project. NEVER SHARE THAT FILE.
Basic steps (for a globval install):
npm i -g figolia@^0.4.x
- Download the service account json file from your Firebase console
- In your figolia configuration file, replace
secret
byserviceAccountFile
which value is the path to the file you just downloaded - If you store the service account file in a project with versioned source code, do not commit it (for example by adding a line in you
.gitignore
file) - Restart your
figolia
services, and you're done! - Report any issues ;-)
Firebase configuration
In order for figolia
to work properly, it must store
the last known indexing date in firebase. You can specify the path where you
want this information stored in the config (see above).
Default is to use the path named figolia
at the root of your Firebase reference.
Partial word matching (infix search)
Out of the box, Algolia only suports prefix search by design. However, this can be mitigated by generating N-Grams of the words up to 4 characters (otherwise relevance falls too much). Figolia does this for you!
Example: activating the ngrams
on username
field will generate, for the
value hermione
, the following tokens: ['ermione', 'rmione', 'mione', 'ione']
.
As a consequence, users can find Hermione with 'mione' keyword, which would sadly return no results otherwise.
Reindexing, incremental indexing
This daemon supports a simple mode where every indexed object in Algolia is dropped at runtime, and then re-indexed from the current Firebase connection. Any previously indexed data will be lost.
This is not reccomended, as it can lead to many useless operations, and bandwith waste.
I really suggest to specify path
and uid
in firebase
config field, and give
write access to that uid in the Firebase Rules. Also mandatory, the last update
time of each items must be set in the schema definition (for example,
in a updatedAt
field).
This is in your app, if you did not implement such feature to track the last update time of each of your objects, you'll have to do so before using this tool efficiently. You can also chose to leave this field unchanged for certain minor operations that does not need reindexation.
To allow algolia
user to write in your Firebase algolia
path,
add the following in your Firebase instance security rules:
// Let figolia daemon keep track of what is in sync
"algolia": {
".read": "auth.uid == 'algolia'",
".write": "auth.uid == 'algolia'",
}
// Optional, to avoid Firebase warnings when running the tests
"tests": {
"testData": {
".indexOn": "updatedAt"
}
}
Production setup
For production setup, I strongly encourage the use of a good process manager like PM2 or foreverjs.
To make them work with figolia, which is developped with ES2015 and deployed without pre-compilation, you need to specify the full path of the executable :
$ which figolia
/usr/local/bin/figolia
$ pm2 start /usr/local/bin/figolia
Release notes
- 0.4.6 - Display version when starting
- 0.4.5 - Small fixes and compliance with third party tools like TravisCI
- 0.4.0 - Update deps, firebase to firebase-admin, auth by secret to serviceAccount file
- 0.3.5 - Optimize ngrams storage by grouping multiple fields and deduplicating
- 0.3.4 - throw is key does not exists. excludedFields can be nested prop
- 0.3.3 - Object key can be nested prop:
'prop.id
' - 0.3.2 - NGrams can be nested prop:
'prop.nested'
- 0.3.1 - Fix - do not clear index when relaunching
- 0.3.0 - Add NGrams generation for middle-of-word (infix) search
- 0.2.8 - Change default timestamp field name to 'updatedAt', bug fixes
- 0.2.4 - Fix babel ignore option and config loading
- 0.2.0 - Add throttle option to limit API calls
- 0.1.5 - Add ignore/only in babel-register options as .babelrc switches ignored
- 0.1.1 - Fix commandline issue with missing npm package
- 0.1.0 - Initial release
Known issues
- #15 When figolia server stops running, your Firebase can continue to
change. Next time you run figolia, it will pickup the changes (well, if you
have the
timestampField
option correctly set up), however, it will not see if items have been deleted. Workaround is to fully reindex the datasets. The fix (TODO) will be to list all object ID and remove the extraneous ones.
Developers
Logging & debugging
By default, basic info is output in the console. Should you need more debug information, you can use the following command line:
DEBUG=figolia* figolia
...or go full throttle including Babel, Algolia and Firebase debug info :
DEBUG=* figolia
Testing
Because Firebase and Algolia accounts are needed for this app, you have to provide
all the necessary credentials as environment variables while running npm test
:
FIREBASE_INSTANCE=CHANGE_ME FIREBASE_ACCOUNT=CHANGE_ME ALGOLIA_APP_ID=CHANGE_ME \
ALGOLIA_API_KEY=CHANGE_ME DEBUG=quiet npm test
Note 1: The tests are pretty slow sometimes, because they wait for all Algolia write and indexing operations to finish in order to validate the results. You can speed up your testing of a specific file by appending its name to the command line:
FIREBASE_INSTANCE=CHANGE_ME FIREBASE_ACCOUNT=CHANGE_ME ALGOLIA_APP_ID=CHANGE_ME \
ALGOLIA_API_KEY=CHANGE_ME DEBUG=quiet npm test ./test/testfile.spec.js
Note 2: although the server can work with a read-only access to Firebase, the tests
cannot, because they have to write fixture data, hence the need for a full Firebase
configuration, including serviceAccountFile
and uid
(see configuration)
Contribute
PRs are more than welcome! Your PR should not break current usage and pass all tests. Even better if you write the tests for the added code, and even better if the new features are documented in this README ;-)
I will have a look at anything you will have the time to propose.
License
MIT © 2016 Aurélien Chivot [email protected]
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the “Software”), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.