GitHub

Sage Project Description

What is the project about?

Sage is an open-source project that enables users to interact with large language models (LLMs) using voice commands.

What problem does it solve?

It provides a hands-free, natural language interface for interacting with LLMs, making them more accessible and user-friendly. It removes the need for typing, allowing for a more conversational interaction.

What are the features of the project?

  • Open: Utilizes open-source speech processing models.
  • Efficient: Designed to run on consumer-grade hardware with low latency.
  • Self-hosted: The entire pipeline can operate offline, ensuring privacy and control.
  • Modular: Easily switch between different LLM providers by changing environment variables.

What are the technologies used in the project?

  • Speech processing models (Whisper, Kokoro).
  • Large Language Models (LLMs) via Ollama or third-party providers (Deepseek, OpenAI, Anthropic, Together.ai).
  • Onnx for model execution.
  • Bun (JavaScript runtime).
  • Rust.
  • Docker (optional, for easier deployment).
  • OpenSSL, LLVM, Clang, CMake (for native builds).

What are the benefits of the project?

  • Accessibility: Enables voice-based interaction with LLMs.
  • Privacy: Can be run entirely offline.
  • Flexibility: Supports multiple LLM providers and self-hosting.
  • Efficiency: Optimized for low latency on consumer hardware.
  • Open Source: Allows for community contributions and customization.

What are the use cases of the project?

  • Hands-free interaction with AI assistants.
  • Voice-controlled applications powered by LLMs.
  • Voice-based querying of information.
  • Natural language interfaces for various tasks.
  • Accessibility tool for users who prefer voice input.
sage screenshot