AI Maestro Edge
A Node service controlled by ai-maestro-api.
```sh
npm install ai-maestro-edge
npx ai-maestro-edge
```
Description
This application is a simple server built with Express that manages the lifecycle and model loading of Ollama or Stable Diffusion containers running in Docker. It provides endpoints for creating, destroying, and managing instances on specific GPUs and ports, and it supports loading models into those containers.
Endpoints
POST /up-container
: Creates a new instance by spinning up a Docker container with the specified name, GPU IDs, and port. If you are creating a Stable Diffusion container, you must also pass in diffusionModel with either 'sdxl-turbo' or 'sd-turbo', depending on which model you want the container to run. Returns a 200 status code upon success.
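A hedged curl sketch of what a request might look like; the JSON field names name, gpuIds, and port are assumptions (this README does not document the exact schema), and localhost:3000 is a stand-in for wherever the edge server listens:

```sh
# Spin up an Ollama container. Field names `name`, `gpuIds`, and `port`
# are assumed, not confirmed by this README.
curl -X POST http://localhost:3000/up-container \
  -H "Content-Type: application/json" \
  -d '{"name": "ollama-0", "gpuIds": ["0"], "port": 11434}'

# Spin up a Stable Diffusion container; per the docs, `diffusionModel`
# must be 'sdxl-turbo' or 'sd-turbo'.
curl -X POST http://localhost:3000/up-container \
  -H "Content-Type: application/json" \
  -d '{"name": "sd-0", "gpuIds": ["1"], "port": 7860, "diffusionModel": "sd-turbo"}'
```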
POST /down-container
: Stops and removes an existing instance (Docker container) based on its name. If the container you are downing runs a diffusion model, also pass in mode with the value "diffusion". Returns a 200 status code upon success.
POST /down-all-containers
: Stops and removes all instances (Docker containers), both diffusion models and LLMs. Returns a 200 status code upon success.
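No payload is documented for this endpoint, so a bare POST should suffice (the port is again a stand-in):

```sh
curl -X POST http://localhost:3000/down-all-containers
```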
POST /load-model
: Loads a specified model into a given container by running the 'ollama run MODEL' command for LLMs, or by issuing a request to a diffusion container to load the model into VRAM. If the model you are loading is a diffusion model, also pass in mode with the value "diffusion". Returns a 200 status code upon success.
GET /health
: A simple health check endpoint that returns a 200 status code if the server is running.
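A quick check (stand-in port):

```sh
curl -i http://localhost:3000/health   # expect HTTP/1.1 200 OK
```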
Requirements
- Node.js (>=18)
- Docker
- The ollama/ollama image available in Docker
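If the image is not present locally, it can be pulled from Docker Hub:

```sh
docker pull ollama/ollama
```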
Installation and Usage
- Clone this repository:
```sh
git clone <repository_url>
```
- Navigate to the project directory:
```sh
cd <project_directory>
```
- Install dependencies:
```sh
npm install
```
- Start the server:
```sh
npm start
```
or:
```sh
node main.ts
```
- Use an API client such as Postman, curl, or Insomnia to send requests to the provided endpoints (see the curl examples under Endpoints above).