neptune-gremlin
v0.0.6
Published
An SDK for querying an Amazon Neptune graph database using gremlin
Downloads
10
Maintainers
Readme
neptune-gremlin
The neptune-gremlin
package is an SDK for querying an Amazon Neptune graph
database using gremlin. Amazon Neptune is is a fast, reliable, fully managed
graph database service that makes it easy to build and run applications. It
allows you to build connections between identities, build knowledge graphs,
detect fraud patterns, and make predictions. It can also simply be used as a
general purpose database, which is made easier by this package.
The source for this package includes an AWS CDK application that creates a Neptune cluster and a Lambda function in the same VPC to facilitate integration testing.
NOTICE This is an experimental package that is not supported in any way by AWS. Please do not use it for mission critical production workloads!
Installation
npm install neptune-gremlin
Usage
Connect to Neptune:
const gremlin = require("neptune-gremlin")
// Get configuration values from the environment
const host = process.env.NEPTUNE_ENDPOINT
const port = process.env.NEPTUNE_PORT
const useIam = process.env.USE_IAM === "true"
// Create a new connection to the Neptune database
const connection = new gremlin.Connection(host, port, {useIam})
await connection.connect()
Save a node (vertex):
const node1 = {
"unique-id-1",
properties: {
name: "Test Node",
a: "A",
b: "B",
},
labels: ["label1", "label2"],
}
await connection.saveNode(node1)
Save an edge
const edge1 = {
id: uuid.v4(),
label: "points_to",
to: node2.id,
from: node1.id,
properties: {
"a": "b",
},
}
await connection.saveEdge(edge1)
Get all nodes and edges in the graph
const searchResult = await connection.search({})
Run a custom traversal:
const f = (g) => {
return await g.V()
.has("person", "name", "Eric")
.bothE().bothV().dedup()
.valueMap(true).toList()
}
const result = await connection.query(f)
Partition the graph
A Neptune cluster does not have a native paritioning concept. All nodes are in the same database. Gremlin has a feature called a partition strategy that adds a property to each node and edge automatically to segment your graph into different sub graphs.
All you have to do with this library is set the partition on the connection:
connection.setPartition("test_partition")
All subsequent calls using that connection will have the _partition
property added by default.
Development
Building the project
This package is all Javascript, so the build script just runs eslint, copies neptune-gremlin.js
into the cdk app's lambda folder, and then synthesizes the cdk app.
npm run build
Sample application
There is a sample app at https://github.com/aws-samples/cdk-neptune-knowledge-graph where you can see how to incorporate this library into a REST API.
Making changes
If you want to make a change or an addition to the package (contributions welcome!), please
add a test to cdk-test-app/lambda/integration-test.js
. Deploy the stack to your AWS account
and invoke the function to make sure everything works as expected.
Deploying the cdk test app
Make sure you run the top level build script, since it copies the latest neptune-gremlin.js
file into the cdk app's lambda folder.
Also keep in mind that the very first call to a new Neptune cluster tends to fail with a 500, so if that happens, just try again.
Note that this app will result in charges in your AWS account! Be sure to destroy the stack when you are done!
npm run build
cd cdk-test-app
npm install
cd lambda
npm install
cd ..
npx cdk bootstrap
npx cdk synth
npx cdk diff
npx cdk deploy
Cleaning up the CDK stack to avoid charges to your AWS account
cdk destroy