Open questions
These are the unresolved decisions that should be settled by prototypes, measurement, or focused review.
- llama-server vs Ollama:
llama-serverfor portability, but offer Ollama’s comfort as an optional mode? - Confidence routing: which concrete dissent measure should (later) trigger an online escalation without undermining privacy?
- VLM engine: llama.cpp multimodal vs a dedicated vision binary; weigh the integration cost.
- Eval sets: which tasks most convincingly prove “Council > Single” (code review, decision memo, document triage)? See evals.
- Browser inference: is in-browser WebGPU a useful fallback seat for hosts that can’t run the native engine, or a distraction? See explorations.
- Phone transport: should USB-gadget Ethernet become the recommended Phone Access transport, ahead of LAN-IP binding?
- Appliance form factor: does a self-hosting board belong on the roadmap, or does it become a second product that dilutes “software you copy onto a stick”?