@automattic/big-sky-agents

v1.2.2

Published

3 months ago

The Big Sky Agents SDK

Downloads

142

Big Sky Agents 🤖

Overview

Big Sky Agents is a front-end library for building AI-enhanced applications. It is designed to work with Gutenberg UI components and WordPress.com APIs and authentication. It exists to provide a consistent AI UX across Automattic's products and to distribute innovations across projects and platforms.

Links:

Package Homepage
Complete React Example - an example of using this library in a standalone React app

Getting Started

To use this package in your project, you can install it via npm:

npm install

To view the components live, you can run Storybook.

Storybook

You can interact with the components in Storybook. For convenience, you can set the API key in .env like this. It is important to include the STORYBOOK_ prefix for the variable to be picked up:

STORYBOOK_OPENAI_API_KEY="sk-..."

Then run Storybook:

npm run storybook

Using Storybook with WPCOM APIs

In order to authenticate to WordPress.com, the storybook client uses "implicit oauth", and in the case of Jetpack services it also needs a blog ID in order to fetch a JWT for the Jetpack AI query endpoint.

STORYBOOK_WPCOM_CLIENT_ID=104816 # A WPCOM client ID
STORYBOOK_WPCOM_BLOG_ID=280895 # use a blog that you are an author on

If you wish to use your own WPCOM client ID, you will need to create the OAuth client with the following settings:

Type: Native (not web)

Redirect URLs:

http://localhost:6006/iframe.html
http://localhost:6006/

Javascript Origins:

http://localhost:6006

Each client must be approved by Automattic staff before it can be used.

Developing against this package locally

These instructions show you how to incorporate this package into another project for development, for example a WordPress plugin or React app.

Note that Electron doesn't currently work in this mode due to strangeness with node gyp linking.

First, make sure you're using the same Node version for both apps, e.g.

% nvm current
v20.14.0

Now let's link in this package.

In this directory:

npm link

In the other project:

npm link @automattic/big-sky-agents

Publishing the package

[!NOTE] This section applies only to members of the Automattic Github organization.

You will need to be authenticated to NPM.

npm login --scope=@automattic --registry=https://registry.npmjs.org/

Enter your npmjs.com credentials when prompted.

Now you should be able to publish the package. Don't forget to bump the version in package.json first!

npm publish --access public

You can view the updated package here to verify.

Running evaluations using LangSmith

You'll also need an OpenAI API key for generating responses and running some evaluations.

Add the following to your .env file:

OPENAI_API_KEY="sk-..."
LANGCHAIN_TRACING_V2=true
LANGCHAIN_ENDPOINT="https://api.smith.langchain.com"
LANGCHAIN_API_KEY="your-api-key"
LANGCHAIN_PROJECT="your-project"

Then run:

# some examples
node src/eval-agents.js --name test --agent ./src/ai/agents/weather-agent/v1.js --agent ./src/ai/agents/weather-agent/v2.js --dataset data/tool-examples.json
node src/eval-agents.js --name test --agent ./src/ai/agents/weather-agent/v1.js --dataset data/name-examples.js

Right now it runs a trivial example, which you can find in name-example.json.

For more information on the eval-agents.js command, read the docs.

For more information on the example formats, see Eval Dataset and Eval Output documentation.

For more information on evaluator functions, see the Big Sky Evaluators documentation.

LocalAI

Using LocalAI, you can run the Chat Completion inference locally on your machine. This is useful for debugging, experimentation, testing, and just privacy in general.

Make sure your .env has an entry for HUGGINGFACEHUB_API_TOKEN.

I've only tried this on a Mac. Kinda works, might need a better model, but they're only going to improve. This is the worst they'll ever be ;)

Install deps: brew install abseil grpc llama.cpp
Update your .env with the correct address and settings:

LOCALAI_ADDRESS=127.0.0.1:1234
LOCALAI_CONTEXT_SIZE=8192
LOCALAI_CORS=true

Download local-ai binary:

curl -Lo local-ai "https://github.com/mudler/LocalAI/releases/download/v2.16.0/local-ai-$(uname -s)-$(uname -m)" && chmod +x local-ai

Run the binary:

If you have the .env vars, just run:

./local-ai

This will download the models, hermes-2-pro-mistral and mistral-0.3.

Alternatively, without the .env vars:

 ./local-ai --address=":1234" --cors --context-size=8192

Features

The Big Sky Agent Framework is a set of core libraries and UI adapters to Gutenberg components which enables simple, pluggable, isolated agents and tools that can collaborate on shared data structures and tasks with the user. It's similar in some ways to LangChainJS/LangGraph but has no dependencies other than WordPress' core libraries (@wordpress/components, etc). It is intended to be accessible to developers at all levels of experience - but we're not there yet.

The Big Sky Agent Framework currently has the following features:

Goals. At the top level, there is a current Goal (and, in the future, a current Plan). This, along with the prompt, data and tools, gives the Agent a clear idea of what to do next.
Chat: The Chat object exposes message history and functions to add tool calls, various kinds of messages, and invoke a Chat Completion.
Agents. Agents expose tools and prompt templates, as well as lifecycle callbacks like "onStart" and "onConfirm".
- Wapuu: Here to understand your goal and choose the best agent to help you.
- Site Settings and Pages Assistant: Set site title, description, type, topic, location, and pages.
- Page Content Assistant: Add and edit pages, set page title, description, category, and sections.
- Design Assistant: Helps you design your site, set fonts and colors, etc. Can also analyze any URL to extract style and layout information.
- WordPress Tutor: Helps you learn about WordPress, joining the community, and more.
- Site Stats Assistant: Helps you understand your site's traffic and SEO.
- WooCommerce Store Assistant: Helps you manage your online store
Toolkits. A toolkit exposes tools, callbacks and state.
- Agent Toolkit
  - Values: goal, agent, thought, etc.
  - Tools: setGoal, setAgent, setThought, etc.
- Site Toolkit
  - Values: title, description, pages, page sections, fonts, styles, content, etc
  - Tools: setSiteTitle, setSiteDescription, etc.
- URL Analyzer Toolkit
  - Values: which can extract colors, fonts, layouts, images, business type, metadata and content from any web page.
Chat Models. Integrates with a wide range of proprietary and open-weights conversational LLMs, including:
- OpenAI 4o (with multimodal support)
- Our own self-hosted Llama3 70b
- Groq (LLama 70b)
- LocalAI
- LMStudio
Prompt Templating. We support two very lightweight templating languages: f-string and an optimized rewrite of DoT.js. Could be extended to others. Prompt templating allows for very flexible composition and rendering of content into Markdown or other formats. This also allows us to evaluate templates more easily (as if they're models).
Zero-dependency. Most core Agent classes can be used outside the browser and WordPress, though UI components use @wordpress/components.
Core WP Integrations. We have default integrations with WordPress Core Components and Data. Works in React Standalone, Redux, and Node.js (CLI) modes (latter coming soon).
Pluggable. Pluggable by any WP plugin -- add your own agent, tools, or toolkit, wrapping your own core functionalit, from eCommerce to Advertising and SEO.
Multimodal. Natively multi-modal. Chat using text and images, and soon audio and video.

Roadmap

The roadmap is roughly the following (in no particular order):

Needs work to get CLI working again.
MLFlow and/or LangSmith integration for prompt monitoring and evaluation.
Gutenberg integrations?
Lots of documentation and examples needed.
Add "back-end" agents which have a model limited set of tools but can operate in the background, e.g. via wp-cron, as distinguished from front-end agents which can manipulate blocks etc.
Add more formal hooks/filters and other integration points
Add "Plans", which have multiple steps potentially driven by separate agents.