Grimoire Cortex
Grimoire Cortex: Edge AI Orchestration
Status: Active Development (Internal Alpha)
Primary Discipline: Agentic Systems / Systems Engineering
Key Constraints: 8GB RAM Budget, Passive Cooling, Offline-First Execution
1. The Constraints (Physical Boundaries)
Grimoire Cortex is architected to thrive within the strict physical and thermal limits of edge hardware, specifically the Nvidia Jetson Orin Nano.
- Memory: Hard cap of 8GB (unified memory). The system is tuned to keep the orchestration overhead under 1GB, maximizing the KV cache and parameter space for 4-bit and 8-bit GGUF models.
- Thermal: Optimized for passive cooling. High-intensity loops are mitigated through asynchronous ritual dispatch to prevent thermal throttling in compact cyberdeck enclosures.
- Connectivity: Zero-trust cloud reliance. All inference and tool execution occurs locally to ensure operational autonomy in field environments.
2. Architecture Decision Records (ADR)
ADR-001: Local GGUF over Cloud APIs
- Context: Most modern agentic frameworks assume infinite cloud compute.
- Decision: Standardize on native
llama.cppintegration for C++ model execution. - Rationale: To achieve true agentic autonomy, the “Weight of the Node” must be physical. Local execution eliminates API latency, cost volatility, and data leakage.
- Trade-off: Limits model selection to the 3B-8B parameter range to maintain interactive token velocity.
ADR-002: Asynchronous Ritual Execution
- Context: Synchronous tool-calling blocks the inference engine, causing UI stutter and engine stalls.
- Decision: Implement a decoupled “Ritual” pool for all I/O and hardware interaction.
- Rationale: By treating tool execution as an asynchronous side effect (a “Ritual”), the engine can continue managing state and context while the hardware layer performs physical operations.
3. Hardware Bill of Materials (BOM)
| Component | Specification | Purpose |
|---|---|---|
| Compute | Nvidia Jetson Orin Nano (8GB) | Primary AI Orchestration & Inference |
| Storage | 1TB NVMe Gen4 | Vector Store, Model Weights, & Event Logs |
| Power | Omnicharge 20+ | Portable DC Power & UPS |
| I/O | 7” Waveshare DSI Display | Real-time System Telemetry |
4. Current Progress
The system currently bridges high-level intent parsing with low-level hardware triggers. Current work is focused on optimizing the context window preservation during multi-turn agentic loops on the Jetson’s unified memory architecture.
Development Logs
(Technical deep-dives and update logs will populate here as development progresses.)