qllm-lib

v3.14.16

Published

a month ago

Core library providing robust AI engineering functionalities tailored for Large Language Model (LLM) applications, enabling developers to build, deploy, and optimize AI solutions with ease.

Downloads

389

0High
0Medium
0Low

raphael.mansuy

ai llm qllm library typescript aws-sdk

🚀 qllm-lib

npm version Stars Forks

📚 Table of Contents

🌟 Introduction

qllm-lib is a powerful TypeScript library that provides a unified interface for interacting with various Large Language Model (LLM) providers. It simplifies the process of working with different AI models and offers advanced features like templating, streaming, and conversation management.

💻 Installation

To install qllm-lib, use npm:

npm install qllm-lib

🔧 Usage

Basic Usage

🚀 Initializing a Provider

To start using the API, first import the necessary functions and initialize a provider:

import { createLLMProvider, LLMProvider } from 'qllm-lib';

// Make sure to set the appropriate environment variables for your chosen provider
// e.g., OPENAI_API_KEY for OpenAI
const provider: LLMProvider = createLLMProvider({ name: 'openai' });

📋 Listing Models

To get a list of available models:

const models = await provider.listModels();
console.log(models);

Intermediate Usage

💬 Generating Chat Completions

To generate a chat completion:

const result = await provider.generateChatCompletion({
  messages: [
    {
      role: 'user',
      content: { type: 'text', text: 'What is the capital of France?' },
    },
  ],
  options: {
    model: 'gpt-4',
    maxTokens: 1024,
    temperature: 0.7,
  },
});

console.log(result.text);

🌊 Streaming Chat Completions

To stream a chat completion:

const stream = await provider.streamChatCompletion({
  messages: [
    {
      role: 'user',
      content: { type: 'text', text: 'Write a short story about Paris.' },
    },
  ],
  options: {
    model: 'gpt-4',
    maxTokens: 1024,
    temperature: 0.7,
    topProbability: 1,
    frequencyPenalty: 0,
    presencePenalty: 0,
  },
});

for await (const chunk of stream) {
  if (chunk.text) {
    process.stdout.write(chunk.text);
  }
}

Advanced Usage

🧠 Generating Embeddings

To generate embeddings for text:

const embedding = await provider.generateEmbedding({
  content: 'Hello, world!',
  model: 'text-embedding-3-small',
});

console.log(embedding);

🖼️ Using Images in Chat Completions

You can include images in your chat completions:

const result = await provider.generateChatCompletion({
  messages: [
    {
      role: 'user',
      content: [
        { type: 'text', text: 'What do you see in this image?' },
        { type: 'image_url', url: 'https://example.com/eiffel-tower.jpg' },
      ],
    },
  ],
  options: {
    model: 'gpt-4-vision-preview',
    maxTokens: 1024,
    temperature: 0.7,
  },
});

console.log(result.text);

🛠️ Using Function Calling

You can use function calling to enable the AI to interact with external tools or APIs:

import { z } from 'zod';
import { createFunctionToolFromZod } from 'qllm-lib';

const weatherSchema = z.object({
  location: z.string().describe('The city and state, e.g. San Francisco, CA'),
  unit: z.enum(['celsius', 'fahrenheit']).describe('The temperature unit'),
});

const weatherTool = createFunctionToolFromZod({
  name: 'get_current_weather',
  description: 'Get the current weather in a given location',
  schema: weatherSchema,
});

const result = await provider.generateChatCompletion({
  messages: [
    { role: 'user', content: { type: 'text', text: 'What's the weather like in Paris?' } },
  ],
  tools: [weatherTool],
  toolChoice: 'auto',
  options: {
    model: 'gpt-4',
    maxTokens: 1024,
    temperature: 0.7,
  },
});

console.log(result.text);
console.log(result.toolCalls);

📝 Using Templates

Templates in qllm-lib allow you to define reusable structures for generating complex text outputs. Here's an example of how to use a template:

import { TemplateManager } from 'qllm-lib';

const templateManager = new TemplateManager();
const template = await templateManager.getTemplate('create_story');

const result = await templateManager.executeTemplate({
  template,
  variables: {
    subject: 'A day in Paris',
    genre: 'Adventure',
    role: 'Narrator',
    lang: 'English',
    max_length: 200,
  },
  provider,
  providerOptions: {
    model: 'gpt-4',
    maxTokens: 1024,
    temperature: 0.7,
  },
});

console.log(result.response);

Example Template

Here's an example of a template structure:

name: create_story
description: Generate a story based on given parameters
provider: openai
model: gpt-4
parameters:
  temperature: 0.7
  top_p: 1.0
  frequency_penalty: 0.0
  presence_penalty: 0.0
  max_tokens: 1024
  stop_sequences: []
  system_message: You are a creative storyteller.

input_variables:
  - name: subject
    description: The main subject or theme of the story
    type: string
    required: true
  - name: genre
    description: The genre of the story
    type: string
    required: true
  - name: role
    description: The narrative perspective
    type: string
    required: true
  - name: lang
    description: The language to generate the story in
    type: string
    required: true
  - name: max_length
    description: Maximum length of the story in words
    type: number
    required: true

content: |
  Write a {{genre}} story about {{subject}} from the perspective of a {{role}}.
  The story should be in {{lang}} and should not exceed {{max_length}} words.
  Make it engaging and descriptive.

🗨️ Managing Conversations

qllm-lib provides a ConversationManager to help you manage multi-turn conversations:

import { createConversationManager, createLLMProvider } from 'qllm-lib';

const provider = createLLMProvider({ name: 'openai' });
const conversationManager = createConversationManager();

// Create a new conversation
const conversation = await conversationManager.createConversation();

// Add a user message
await conversationManager.addMessage(conversation.id, {
  role: 'user',
  content: { type: 'text', text: 'Tell me about Paris.' },
});

// Get conversation history
const history = await conversationManager.getHistory(conversation.id);
const messages = history.map((msg) => ({ role: msg.role, content: msg.content }));

const result = await provider.generateChatCompletion({
  messages,
  options: { model: 'gpt-4', maxTokens: 1024, temperature: 0.7 },
});

await conversationManager.addMessage(conversation.id, {
  role: 'assistant',
  content: { type: 'text', text: result.text || 'No response' },
  providerId: provider.name,
});

📄 Templates

QLLM Templates

1. What is a Template?

A template in QLLM is a reusable prompt structure with variables that can be filled dynamically. It allows for creating flexible and customizable prompts for Large Language Models (LLMs).

Template Structure

A typical template consists of:

Metadata (name, version, description, author)
Input variables
Content (the actual prompt text with placeholders)
Output variables (optional)

Examples

Example 1: Simple Template (YAML)

name: greeting
version: '1.0'
description: A simple greeting template
author: QLLM Team
input_variables:
  name:
    type: string
    description: The name of the person to greet
content: >
  Hello {{name}}! How are you today?

Example 2: Story Creation Template (YAML)

name: create_story
version: '1.0'
description: Create a short story
author: QLLM Team
input_variables:
  genre:
    type: string
    description: The genre of the story
    default: 'Science Fiction'
  protagonist:
    type: string
    description: The main character of the story
  setting:
    type: string
    description: The setting of the story
content: >
  Write a {{genre}} story featuring a protagonist named {{protagonist}} set in {{setting}}. 
  The story should be approximately 200 words long.
output_variables:
  story:
    type: string
    description: The generated story

Example 3: Template with File Inclusion (YAML)

name: improve_prompt
version: '1.0'
description: Improve a given prompt
author: QLLM Team
input_variables:
  prompt:
    type: string
    description: The prompt to improve
content: >
  {{file:./improve_prompt.md}}

Where improve_prompt.md contains:

You are an expert prompt engineer. Your task is to improve the following prompt:

{{prompt}}

Please provide an improved version of this prompt, making it clearer, more specific, and more effective.

By using template includes, you can create more maintainable and modular prompt structures, making it easier to update and reuse common elements across different templates.

📚 API Reference

LLMProvider Interface

version: string
name: string
defaultOptions: LLMOptions
generateEmbedding(input: EmbeddingRequestParams): Promise<EmbeddingResponse>
listModels(): Promise<Model[]>
generateChatCompletion(params: ChatCompletionParams): Promise<ChatCompletionResponse>
streamChatCompletion(params: ChatCompletionParams): AsyncIterableIterator<ChatStreamCompletionResponse>

Types

ChatMessage: Represents a chat message with role and content.
LLMOptions: Options for LLM generation, including model, max tokens, temperature, etc.
EmbeddingRequestParams: Input for embedding generation, supporting text and models.
Model: Represents an LLM model with id, description, and creation date.
ChatCompletionParams: Parameters for chat completion, including messages and options.
ChatCompletionResponse: Response from chat completion, including generated text and usage statistics.

🚨 Error Handling

The API uses custom error classes for different types of errors:

LLMProviderError: Base class for all provider errors.
AuthenticationError: Thrown when authentication fails.
RateLimitError: Thrown when rate limits are exceeded.
InvalidRequestError: Thrown for invalid requests or unexpected errors.

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.