objlearning
v1.1.9
Published
Train lightweight machine learning models directly on JavaScript objects
Downloads
23
Maintainers
Readme
objlearning
Note: updates for objlearning are being pushed to the object-learning package
objlearning is module that lets you train lightweight machine learning regression and clustering models directly from JavaScript objects on the fly.
Usage
To install ObjLearning, simply run:
npm install objlearning
Linear Regression
You can run a normalized linear regression by running ObjLearning#runLinearReg
:
const ObjLearning = require('objlearning');
const cats = [
{
height: 1,
weight: 2,
fluffiness: 5
},
{
height: 2,
weight: 3,
fluffiness: 8
},
{
height: 5,
weight: 6,
fluffiness: 17
},
];
const regressionModel =
ObjLearning.runLinearReg(
cats,
['height', 'weight'],
'fluffiness',
{iter: 1000, alpha: .01}
);
runLinearReg
accepts the following parameters:
- An array of objects or a JavaScript objects
- An array attributes that specify the parameters the model should be trained on. Each object in the objects array must have have all of the specified attributes.
- A target parameter the model should try to predict. All objects in the array must the specified target attribute.
- An optional options array that can specify:
iter
: the number of iterations of gradient descent that should be performed when training the model.alpha
: The constant alpha that should be used for each regression gradient descent step. Use this to tune the model. Values between .1 and .001 are recommended in most scenarios.
Running the regression returns an object with the following attributes:
theta
: an array with the normalized parameters for the modelsevalObject
: a function that predicts the target attribute value for an object with the attributes the model was trained ontestObjects
: a function that accepts a test set of objects with the required parameter and target attributes and returns the square error per test object of the modelcost
: The square error per object for the model
Continuing from the example above:
const regressionModel = ObjLearning.runLinearReg(cats, ['height', 'weight'], 'fluffiness', {iter: 1000, alpha: .01});
// model is approximately fluffiness = height + 2 * weight
regressionModel.evalObject({height: 4, weight: 5});
// => ~14
regressionModel.test([
{
height: 1,
weight: 2,
fluffiness: 6
},
{
height: 2,
weight: 3,
fluffiness: 7
},
]);
// => ~1
Logistic Regression
You can run a normalized logistic regression by running ObjLearning#runLogisticReg
:
const ObjLearning = require('objlearning');
const students = [
{SAT: 1500, GPA: 3.8, acceptedToCollege: true},
{SAT: 1200, GPA: 3.5, acceptedToCollege: true},
{SAT: 400, GPA: 1.2, acceptedToCollege: false},
{SAT: 900, GPA: 2.4, acceptedToCollege: false},
{SAT: 850, GPA: 2.6, acceptedToCollege: true},
{SAT: 950, GPA: 2.7, acceptedToCollege: true},
{SAT: 200, GPA: 1.0, acceptedToCollege: false},
{SAT: 1000, GPA: 3.0, acceptedToCollege: true},
{SAT: 1040, GPA: 2.6, acceptedToCollege: true},
{SAT: 760, GPA: 2.3, acceptedToCollege: false},
{SAT: 660, GPA: 2.5, acceptedToCollege: false},
{SAT: 720, GPA: 2.8, acceptedToCollege: true}
];
const regressionModel =
ObjLearning.runLogisticReg(
students,
['SAT', 'GPA'],
'acceptedToCollege'
);
The parameters for the runLogisticReg
are identical to the parameters for runLinearReg
with one caveat:
- For each object, the value of the target attribute must be
true
,false
,1
, or0
.
The model object returned by the regression has the same attributes as the model returned by the runLinearReg
. Note the following differences:
- The
evalObject
function will now return a number between 0 and 1 that represents the probability that the target attribute for the test object will be 1 or true. - The
testObjects
andcost
properties use a cost function specific to logistic regression.
Continuing the example of above:
const regressionModel = ObjLearning.runLogisticReg(students, ['SAT', 'GPA'], 'acceptedToCollege');
regressionModel.evalObject({SAT: 430, GPA: 1.9});
// => .00145...
regressionModel.evalObject({SAT: 860, GPA: 2.5});
// => .55366...
regressionModel.evalObject({SAT: 1370, GPA: 3.8});
// => .9968...
k-means clustering
You can run normalized k-means clustering on a set of objects by running ObjLearning#runKClustering
:
const students = [
{SAT: 1500, GPA: 3.8},
{SAT: 1200, GPA: 3.5},
{SAT: 1300, GPA: 3.4},
{SAT: 400, GPA: 1.2},
{SAT: 900, GPA: 2.4},
{SAT: 850, GPA: 2.6},
{SAT: 950, GPA: 2.7},
{SAT: 200, GPA: 1.0},
{SAT: 1000, GPA: 3.0},
{SAT: 1040, GPA: 2.6},
{SAT: 760, GPA: 2.3},
{SAT: 660, GPA: 2.5},
{SAT: 720, GPA: 2.8}
];
const clusteringModel =
ObjLearning.runKClustering(
students,
['SAT', 'GPA'],
{
maxIter: 100,
groups: 3,
groupNames: ['low', 'med', 'high']
}
);
runKClustering
accepts the following parameters:
- An array of objects or class objects
- An array of attributes containing the parameters for the clustering
- An optional options hash with the following arguments:
maxIter
: The max number of times to run through the clustering analysisgroups
: The number groups to cluster the objects intogroupNames
: Names for the groups. Groups are sorted low to high according to total of the group average for the normalized model parameters. If no names are specified, the group names will default to 0, 1, 2...
runKClustering
returns an object with the following attributes:
groups
: An object of objects where each sub-object has the parameter averages and objects for each group.findGroup
: A function that accepts an object and returns the name of the cluster that the object would be part of
Continuing the example above:
const clusteringModel = ObjLearning.runKClustering(students, ['SAT', 'GPA'], {maxIter: 100, groups: 3, groupNames: ['low', 'med', 'high']});
clusteringModel.findGroup({SAT: 1600, GPA: 3.9});
// => 'high'
clusteringModel.groups;
// =>
{
'low': {
groupAvgs: { SAT: 300, GPA: 1.1 },
objects: [
{ SAT: 400, GPA: 1.2 },
{ SAT: 200, GPA: 1.0 }
]
},
'med': {
groupAvgs: { SAT: 860, GPA: 2.6125 },
objects: [
{ SAT: 900, GPA: 2.4 },
{ SAT: 850, GPA: 2.6},
{ SAT: 950, GPA: 2.7},
{ SAT: 1000, GPA: 3.0},
{ SAT: 1040, GPA: 2.6},
{ SAT: 760, GPA: 2.3 },
{ SAT: 660, GPA: 2.5 },
{ SAT: 720, GPA: 2.8}
]
},
'high': {
groupAvgs: { SAT: 1333.3333333333333, GPA: 3.5666666666666664 },
objects: [
{ SAT: 1500, GPA: 3.8 },
{ SAT: 1200, GPA: 3.5 },
{ SAT: 1300, GPA: 3.4 }
]
}
}
Future directions
ObjLearning is still relatively new, and there's a lot I'd like to do expand it going forward:
- More learning model types: I'd like to expand the available models to include other common machine learning techniques like PCA, SVM, etc...
- More powerful models: I'd also like to expand existing model functionality to make them more useful. E.g. For the regression model, I'd like it to be possible to apply additional techniques like feature mapping.