What are rbee's 6 unique advantages?

1. Multi-machine orchestration (use all your GPUs). 2. Heterogeneous hardware (CUDA+Metal+CPU). 3. SSH-based deployment (5 min, no Kubernetes). 4. User-scriptable routing (Rhai). 5. GDPR compliance built-in. 6. Lifetime pricing (€499 one-time).

Why is rbee different from competitors?

rbee combines all 6 advantages in one platform. Ollama lacks multi-machine. vLLM requires Kubernetes. Together.ai is cloud-based. Ray+KServe takes months to set up. rbee provides multi-machine orchestration, heterogeneous hardware, SSH deployment, Rhai scripting, GDPR compliance, and lifetime pricing.

What features does rbee have that others don't?

Unique to rbee: Multi-machine orchestration via SSH, heterogeneous hardware support (mix NVIDIA+Apple+AMD), user-scriptable routing with Rhai (change routing in seconds), and built-in GDPR compliance. Plus lifetime pricing (€499 one-time).

Everything rbee can do
as a platform.

Complete capabilities reference for rbee distributed AI orchestration platform. Multi-machine GPU coordination, OpenAI-compatible API, SSH-based deployment, and programmable routing policies.

Runs on your GPUsOpenAI-compatibleGDPR-readyCUDA · Metal · CPU

See all features How it works

rbee features overview showing multi-machine orchestration, SSH deployment, OpenAI API, Rhai routing, GDPR audit, and heterogeneous hardware support

42/62 BDD scenarios passing

Zero cloud dependencies

Multi-backend: CUDA · Metal · CPU

Core Capabilities

OpenAI-compatible API. Multi-machine GPU orchestration. User-scriptable routing. Real-time job streaming.

OpenAI-Compatible

Drop-in

Drop-in API

Swap endpoints, keep code. Works with Zed, Cursor, Continue.

bash

# Before: OpenAI
export OPENAI_API_KEY=sk-...

# After: rbee (same code)
export OPENAI_API_BASE=http://localhost:8080/v1

Drop-in replacement. Point to localhost.

Why it matters

No vendor lock-in
Use your models + GPUs
Keep existing tooling

Learn about OpenAI-compatible API

Advantage 1

Multi-Machine GPU Orchestration

Orchestrate GPUs across multiple machines as one unified pool. Connect gaming PCs, workstations, and servers via SSH into a single distributed AI cluster.

Pool Registry Management

Configure remote machines once. rbee-keeper handles SSH, validates connectivity, and keeps your pool registry synced.

# Add a remote machine to your pool $ rbee-keeper setup add-node \ --name workstation \ --ssh-host workstation.home.arpa \ --ssh-user vince \ --ssh-key ~/.ssh/id_ed25519 # Run inference on that machine $ rbee-keeper infer --node workstation \ --model hf:meta-llama/Llama-3.1-8B \ --prompt "write a short story"

SSH tunneling

Secure connections over SSH.

Auto shutdown

Workers exit cleanly after tasks.

Minimal footprint

No persistent daemons on nodes.

Automatic Worker Provisioning

Spawns workers over SSH on demand. Cleans up automatically. No daemons.

queen-rbee

Orchestrator

SSH

rbee-hive

Pool manager

Spawns

worker-rbee

Inference worker

On-demand start

Clean shutdown

No daemon drift

Advantage 2

Heterogeneous Hardware Support

Mix NVIDIA CUDA, Apple Metal, and CPU workers in one pool. Use RTX 4090, M2 Ultra, and AMD GPUs together seamlessly for distributed AI workloads.

GPU FAIL FAST policy

No silent fallbacks. You choose the backend.

Prohibited:

No GPU→CPU fallbackNo graceful degradationNo implicit CPU reroute

What happens:

Fail fast (exit 1)Helpful error messageExplicit backend selection

Insufficient VRAM: need 4000 MB, have 2000 MB

Use smaller quantized model (Q4_K_M instead of Q8_0)
Try CPU backend explicitly (--backend cpu)
Free VRAM by closing other applications

rbee-hive detect — workstation.home.arpa

rbee-hive detect

Available backends:

cuda × 2cpu × 1metal × 0

Total devices: 3

Cached in the registry for fast lookups and policy routing.

Detection

Scans CUDA, Metal, CPU and counts devices.

Explicit selection

Choose backend & device—no surprises.

Helpful suggestions

Actionable fixes on error.

Advantage 3

SSH-Based Deployment

Deploy distributed AI models via SSH. No Kubernetes. No Docker. No agents or daemons. Simple SSH deployment across your infrastructure in 5 minutes.

Pool Registry Management

Configure remote machines once. rbee-keeper handles SSH, validates connectivity, and keeps your pool registry synced.

# Deploy model via SSH - no agents required $ rbee-keeper deploy --model llama3.2:3b --pool homelab → Connecting to workers via SSH... ✓ gaming-pc (192.168.1.100:22) - RTX 4090 ✓ mac-studio (192.168.1.101:22) - M2 Ultra ✓ old-server (192.168.1.102:22) - CPU → Deploying model shards... ✓ Shard 1/2 → gaming-pc (12GB VRAM) ✓ Shard 2/2 → mac-studio (24GB VRAM) ✓ Model deployed successfully! No background processes. No agents. Just SSH.

SSH-Only Communication

Uses only SSH (port 22). No additional ports. Works with existing SSH setup.

No Background Processes

No agents. No daemons. Deploy, run, clean up. Zero orphaned processes.

VPN Compatible

Works seamlessly with WireGuard, Tailscale, or any VPN. SSH just works.

Zero-Agent Deployment

rbee-keeper orchestrates deployment via SSH. No background processes.

queen-rbee

Queen (Orchestrator)

SSH: Deploy model shard 1/2

worker-1

Worker 1 (gaming-pc)

SSH: Deploy model shard 2/2

worker-2

Worker 2 (mac-studio)

All communication via SSH (port 22)

No agents or daemons required

Works with existing SSH keys

Advantage 4

User-Scriptable Routing with Rhai

Programmable routing policies via embedded Rhai scripts. Custom load balancing, cost optimization, and compliance routing without recompilation.

Rust

// rate_limit.rhai
fn route_request(req) {
  let user = req.user_id;
  let count = get_request_count(user);
  
  if count > 100 {
    return reject("Rate limit exceeded");
  }
  
  return route_to_pool("default");
}

Write Rhai scripts to control request routing, load balancing, and data governance.

Advantage 5

Built-in GDPR compliance for EU organizations. 7-year audit retention, data sovereignty controls, immutable logs. Self-hosted AI with complete data privacy.

Six Specialized Security Crates

Each concern ships as its own Rust crate—focused responsibility, no monolith.

auth-min

Timing-safe tokens, zero-trust auth.

audit-logging

Append-only logs, 7-year retention.

input-validation

Injection prevention, schema validation.

secrets-management

Encrypted storage, rotation, KMS-friendly.

jwt-guardian

RS256 validation, revocation lists, short-lived tokens.

deadline-propagation

Timeouts, cleanup, cascading shutdown.

Process Isolation

Workers run in isolated processes with clean shutdown.

Sandboxed execution
Cascading shutdown
VRAM cleanup

Zero-Trust Architecture

Defense-in-depth with timing-safe auth and audit logs.

Timing-safe authentication
Immutable audit logs
Input validation

Lifetime Pricing

Self-hosted alternative to cloud AI. Pay once, own forever. No subscriptions.

Advantage 6: Lifetime Pricing

Self-Hosted Alternative to Cloud AI

Pay once, own forever. Core rbee is GPL-3.0 and MIT—completely free. Premium features (€129-499 lifetime) for businesses needing advanced scheduling, telemetry, and GDPR compliance.

View Pricing Compare Costs

No subscriptions. No recurring fees. No vendor lock-in.

Platform capabilities

Complete AI Orchestration Platform

Core Platform

Cascading Shutdown

Ctrl+C tears down keeper → queen → hive → workers. No orphans, no VRAM leaks.

Learn more

Model Catalog

Auto-provision models from Hugging Face with checksum verify and local cache.

Learn more

Network Orchestration

Run jobs across gaming PCs, workstations, and Macs as one homelab cluster.

Learn more

Developer Tools

CLI & Web UI

Automate with a fast CLI or manage visually in the web UI—your call.

Learn more

TypeScript SDK

Type-safe utilities for building agents; async/await with full IDE help.

Learn more

Security First

Six Rust crates: auth, audit logs, input validation, secrets, JWT guardian, and deadlines.

Learn more

Intelligent Model Management

Automatic model provisioning, caching, and validation. Download once, use everywhere.

Learn more

Comprehensive Error Handling

Network, resource, model, and process errors handled gracefully. Automatic retries, fallbacks, and recovery.

Learn more

Real-Time Progress Tracking

SSE-based progress narration. Watch jobs stream in real-time with cancellation support.

Learn more

Stay updated on rbee development

Get notified about new features, performance improvements, and community highlights.

Follow progress & contribute on GitHub

View Repository

Weekly dev notes. Roadmap issues tagged M0–M2.

// rate_limit.rhai fn route_request(req) { let user = req.user_id; let count = get_request_count(user); if count > 100 { return reject("Rate limit exceeded"); } return route_to_pool("default"); }

Everything rbee can doas a platform.

Core Capabilities

OpenAI-Compatible

Why it matters

Multi-Machine GPU Orchestration

Pool Registry Management

SSH tunneling

Auto shutdown

Minimal footprint

Automatic Worker Provisioning

Heterogeneous Hardware Support

GPU FAIL FAST policy

SSH-Based Deployment

Pool Registry Management

SSH-Only Communication

No Background Processes

VPN Compatible

Zero-Agent Deployment

User-Scriptable Routing with Rhai

Rate Limiting by User

Load Balancing by Model Size

Data Governance by Region

GDPR-Compliant AI Infrastructure

Six Specialized Security Crates

Process Isolation

Zero-Trust Architecture

Lifetime Pricing

Self-Hosted Alternative to Cloud AI

Complete AI Orchestration Platform

Cascading Shutdown

Model Catalog

Network Orchestration

CLI & Web UI

TypeScript SDK

Security First

Intelligent Model Management

Comprehensive Error Handling

Real-Time Progress Tracking

Stay updated on rbee development

Everything rbee can doas a platform.

Core Capabilities

OpenAI-Compatible

Why it matters

Multi-Machine GPU Orchestration

Pool Registry Management

SSH tunneling

Auto shutdown

Minimal footprint

Automatic Worker Provisioning

Heterogeneous Hardware Support

GPU FAIL FAST policy

SSH-Based Deployment

Pool Registry Management

SSH-Only Communication

No Background Processes

VPN Compatible

Zero-Agent Deployment

User-Scriptable Routing with Rhai

Rate Limiting by User

Load Balancing by Model Size

Data Governance by Region

GDPR-Compliant AI Infrastructure

Six Specialized Security Crates

Process Isolation

Zero-Trust Architecture

Lifetime Pricing

Self-Hosted Alternative to Cloud AI

Complete AI Orchestration Platform

Cascading Shutdown

Model Catalog

Network Orchestration

CLI & Web UI

TypeScript SDK

Security First

Intelligent Model Management

Comprehensive Error Handling

Real-Time Progress Tracking

Stay updated on rbee development

Everything rbee can do
as a platform.

Everything rbee can do
as a platform.