Amina | Private Intelligence

📊

Local LLM Leaderboard

A leaderboard of open-weight models tested locally in under 30 GB of VRAM. It tracks performance across general knowledge, tool use, and agentic coding.

Capabilities

Everything runs on your hardware. Nothing leaves your machine.

🎙️

Voice Intelligence

Wake-word detection, Silero VAD, and faster-whisper for speech-to-text. Zero-shot voice cloning via OmniVoice TTS produces a natural voice, with under 200 ms to the first audio byte.

STT + TTS

👁️

Vision Mode

Continuous screenshot and camera analysis using local vision models (Gemma 4, Qwen3-VL). Proactive commentary and context-aware responses based on what's on screen.

Multi-modal

⏰

Autonomous Scheduling

Amina schedules her own wake-ups and sends reminders via Telegram. Full task management with /tasks, /snooze, /reschedule, /done, and /history — no server required.
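
The page does not describe how the scheduler is built. As a rough illustration only, the sketch below assumes an in-memory min-heap of due times with lazy deletion of outdated entries. The `TaskQueue` class and its methods are hypothetical names that loosely mirror the /snooze, /reschedule, /done, and /history commands; Telegram delivery and persistence are left out.

```python
import heapq
import itertools


class TaskQueue:
    """Hypothetical reminder queue: a min-heap ordered by due time."""

    def __init__(self):
        self._heap = []                     # (due, entry_no, task_id)
        self._ids = itertools.count(1)      # task ids
        self._entries = itertools.count()   # unique number per heap entry
        self._live = {}                     # task_id -> (entry_no, due, text)
        self.history = []                   # texts of completed tasks

    def _push(self, task_id, due, text):
        entry_no = next(self._entries)
        self._live[task_id] = (entry_no, due, text)
        heapq.heappush(self._heap, (due, entry_no, task_id))

    def add(self, text, due):
        task_id = next(self._ids)
        self._push(task_id, due, text)
        return task_id

    def reschedule(self, task_id, new_due):
        # The old heap entry stays behind and is skipped later as stale.
        _, _, text = self._live[task_id]
        self._push(task_id, new_due, text)

    def snooze(self, task_id, delay):
        _, due, _ = self._live[task_id]
        self.reschedule(task_id, due + delay)

    def done(self, task_id):
        _, _, text = self._live.pop(task_id)
        self.history.append(text)

    def pop_due(self, now):
        """Return (task_id, text) for every task due at or before `now`."""
        fired = []
        while self._heap and self._heap[0][0] <= now:
            _, entry_no, task_id = heapq.heappop(self._heap)
            live = self._live.get(task_id)
            if live is None or live[0] != entry_no:
                continue  # task was completed or moved; ignore stale entry
            fired.append((task_id, live[2]))
        return fired


if __name__ == "__main__":
    q = TaskQueue()
    stretch = q.add("stretch", due=100)
    q.add("call mom", due=50)
    q.snooze(stretch, 30)
    print(q.pop_due(now=100))
    print(q.pop_due(now=130))
```

Lazy deletion keeps snooze and reschedule cheap: nothing is removed from the middle of the heap, and outdated entries are simply skipped when they reach the top.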

🎨

Creative Studio

Generates stories chapter-by-chapter with matching images. Produces music via ComfyUI, renders video with LTX, and composes multi-track playlists with DJ intros.

Image · Video · Music · Story

🧠

Deep Memory

Tri-store memory system: short-term ring buffer, episodic journal with semantic search (sentence-transformers), and a structured profile of long-term facts per user.

Episodic + Semantic
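
To make the three stores concrete, here is a minimal sketch, not Amina's actual implementation. The `Memory` class, its method names, and the buffer size are assumptions. The semantic search uses a word-count cosine similarity as a stand-in for the sentence-transformers embeddings named above, so the example runs with the standard library alone.

```python
import math
from collections import Counter, deque


def embed(text):
    """Stand-in embedding (word counts); a real system would call an embedding model."""
    return Counter(text.lower().split())


def cosine(a, b):
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class Memory:
    """Hypothetical per-user memory with three stores."""

    def __init__(self, short_term_size=4):
        self.short_term = deque(maxlen=short_term_size)  # ring buffer
        self.journal = []                                # (text, vector) episodes
        self.profile = {}                                # long-term facts

    def observe(self, text):
        self.short_term.append(text)          # oldest entry drops off when full
        self.journal.append((text, embed(text)))

    def remember_fact(self, key, value):
        self.profile[key] = value

    def recall(self, query, k=1):
        """Return the k journal entries most similar to the query."""
        q = embed(query)
        ranked = sorted(self.journal, key=lambda e: cosine(q, e[1]), reverse=True)
        return [text for text, _ in ranked[:k]]


if __name__ == "__main__":
    m = Memory(short_term_size=2)
    m.observe("we talked about hiking in the alps")
    m.observe("user asked for a pasta recipe")
    m.observe("reminder set for the dentist")
    print(list(m.short_term))
    print(m.recall("pasta recipe"))
```

The ring buffer bounds what goes into the prompt on every turn, while the journal keeps everything and is only consulted through search.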

🔐

Privacy First

All inference runs locally via llama.cpp. User data is isolated per account with optional 7-Zip AES-256 encryption at rest. No API keys, no telemetry, no cloud.

Local · Encrypted

🤖

Multi-User Support

Separate memory, personality, and media per user. GPU resource ownership with conflict resolution — competing requests queue intelligently without crashing services.

Session-Isolated
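
The conflict-resolution policy is not spelled out on this page. One simple way to get "queue instead of crash" behavior is a single-owner lock, sketched below with `asyncio.Lock`, whose waiters are served in arrival order. The `GpuOwner` class and the service names are hypothetical, and `asyncio.sleep` stands in for real GPU work.

```python
import asyncio
from contextlib import asynccontextmanager


class GpuOwner:
    """Hypothetical single-owner GPU guard: one service at a time, others wait."""

    def __init__(self):
        self._lock = asyncio.Lock()
        self.owner = None
        self.log = []

    @asynccontextmanager
    async def claim(self, service):
        async with self._lock:          # competing requests wait here in FIFO order
            self.owner = service
            self.log.append(f"{service} start")
            try:
                yield
            finally:
                self.log.append(f"{service} end")
                self.owner = None


async def job(gpu, name, seconds):
    async with gpu.claim(name):
        await asyncio.sleep(seconds)    # placeholder for model inference


async def main():
    gpu = GpuOwner()
    # Both jobs are requested at once; the second waits for the first to finish.
    await asyncio.gather(job(gpu, "image", 0.02), job(gpu, "tts", 0.01))
    return gpu.log


if __name__ == "__main__":
    print(asyncio.run(main()))
```

A real scheduler would likely add priorities or preemption (for example, letting speech output jump ahead of a long image render); this sketch shows only the queuing part.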

🖥️

Workspace Agent

A pi.dev terminal bridge with an agentic coding assistant. Plan, execute, and iterate on tasks in a sandboxed environment — with optional desktop control mode.

Agentic

See It In Action

Audio, images, and video in this demo — all generated by Amina.

Amina made it. Local inference, no APIs, no cloud.

100% Open Source

All code is publicly available. Self-host Amina on any machine — capabilities scale with the VRAM you throw at it.

Tech Stack

Built on open-source inference — no proprietary cloud services.

llama-cpp-python · faster-whisper · OmniVoice TTS · sentence-transformers · FastAPI + Uvicorn · ComfyUI · yt-dlp + ffmpeg · python-telegram-bot · asyncio · PyTorch nightly cu130

Architecture

Inference: llama.cpp (CUDA)
VRAM: More VRAM = bigger models
Context: 16k – 256k tokens
STT: Whisper + Silero VAD
TTS Latency: <200 ms first byte
Platform: Windows · Linux · Docker
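
The VRAM and context figures above trade off against each other. The back-of-the-envelope estimate below is a sketch under stated assumptions: it counts only the weights and a 16-bit key/value cache, and ignores activation buffers, cache quantization, and runtime overhead. The model dimensions in the example (32 layers, 8 KV heads, head size 128, 8B parameters at 4 bits) are hypothetical, and "16k" and "256k" are read as 16,384 and 262,144 tokens.

```python
GIB = 2 ** 30


def weights_gib(params_billion, bits_per_weight):
    """Approximate memory for the model weights alone."""
    return params_billion * 1e9 * bits_per_weight / 8 / GIB


def kv_cache_gib(n_layers, n_kv_heads, head_dim, context_tokens, bytes_per_value=2):
    """Approximate KV-cache size: one key and one value vector per layer per token."""
    return 2 * n_layers * n_kv_heads * head_dim * context_tokens * bytes_per_value / GIB


if __name__ == "__main__":
    print(f"weights, 8B @ 4-bit: {weights_gib(8, 4):.2f} GiB")
    for ctx in (16_384, 262_144):
        print(f"KV cache @ {ctx} tokens: {kv_cache_gib(32, 8, 128, ctx):.1f} GiB")
```

For this hypothetical model, the cache grows from 2 GiB at 16k tokens to 32 GiB at 256k, which is why the longest context settings are out of reach on smaller cards even when the weights themselves fit.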
