Skip to main content

Overview

Orca Memory is built on modern infrastructure that you can deploy to your own environment. This guide covers what you’ll need.

Stack

Orca Memory consists of:
ComponentTechnologyPurpose
DashboardReact + ViteWeb interface
BackendConvexDatabase, auth, real-time
EmbeddingsPython + FastAPIVector embeddings for search
AuthBetter AuthAuthentication

Prerequisites

Convex Account

Orca Memory uses Convex as its backend. You’ll need:
  1. A Convex account (free tier available)
  2. A new Convex project for Orca Memory
Convex handles:
  • Database storage
  • Real-time subscriptions
  • Serverless functions
  • File storage

Embedding Service

For semantic search, you need an embedding service. Options:
  • Self-hosted - Run our Python embeddings service
  • OpenAI - Use OpenAI’s embedding API
  • Other providers - Any service that returns vector embeddings

Domain & SSL

For production deployment:
  • A domain name
  • SSL certificate (use Let’s Encrypt or your provider)

Hardware Requirements

Minimum (Development)

  • 1 CPU core
  • 1 GB RAM
  • 10 GB storage
  • 2+ CPU cores
  • 4 GB RAM
  • 50 GB SSD storage
The embeddings service is the most resource-intensive component. Consider running it on a GPU-enabled instance for better performance.

Next Steps