Deployment
Astra supports four deployment targets. All use the same codebase; backend selection (Metal, CUDA, CPU) is explicit via environment config.
Environments
| Environment |
Typical target |
Secrets |
| Local dev |
Docker Compose + native services |
.env / local only — not for prod secrets |
| Staging |
Smaller k8s or single node |
Vault or sealed secrets |
| Production |
GKE / EKS / bare metal + managed data |
Vault at runtime (PRD §18) |
Targets
Kubernetes namespaces
| Namespace |
Services |
control-plane |
api-gateway, identity, access-control |
kernel |
scheduler-service, task-service, agent-service, goal-service, planner-service, memory-service |
workers |
execution-worker, browser-worker, tool-runtime, worker-manager, llm-router, prompt-manager, evaluation-service |
infrastructure |
Postgres, Redis, Memcached, MinIO (local) |
observability |
Prometheus, Grafana, OpenTelemetry collector, Loki |
Scaling model
| Component |
Strategy |
| Stateless services |
HPA based on CPU / request rate |
| Workers |
HPA based on Redis queue depth + scheduler hints |
| Redis |
Cluster mode, shard count based on throughput |
| Postgres |
Primary for writes, read replicas for heavy reads |
| Memcached |
Memory-optimised nodes, horizontal scaling |