Roadmap¶
Complete¶
Storage & Data Model¶
-
Entrydata model with structured metadata and type-specific extensions -
DistilleryStoreprotocol — async storage abstraction enabling backend migration - DuckDB backend with VSS extension and HNSW index (cosine similarity)
- Configurable embedding providers (Jina v3 default, OpenAI adapter)
- Embedding model lock via
_metatable — prevents mixed-model corruption - MCP server with 22 tools over stdio and streamable-HTTP
-
distillery.yamlconfig system with validation
Core Skills¶
-
/distill— session knowledge capture with duplicate detection -
/recall— semantic search with provenance display -
/pour— multi-pass retrieval + structured synthesis with citations -
/bookmark— URL fetch, auto-summarize, store with dedup check -
/minutes— meeting notes with--update(append) and--listmodes - Shared
CONVENTIONS.md— author/project identification, error handling patterns
Classification Pipeline¶
-
ClassificationEngine— LLM prompt-based type assignment with confidence scoring -
DeduplicationChecker— skip/merge/link/create at configurable thresholds -
/classifyskill — classify by ID, batch inbox, review queue triage
Quality & Observability¶
- Implicit retrieval feedback +
distillery_qualitytool - Stale entry detection —
distillery_staletool - Conflict detection —
distillery_check_conflictstool - Usage metrics dashboard —
distillery_metricstool
Infrastructure¶
- FastMCP 2.x/3.x with
@server.tooldecorators - Hierarchical tag namespace with validation and
distillery_tag_treetool - 12 entry types including
person,project,digest,github,feed -
distillery_type_schemasMCP tool for schema discovery
Team Access¶
- HTTP transport —
distillery-mcp --transport http - GitHub OAuth — team authentication via FastMCP
GitHubProvider - Prefect Horizon deployment (MotherDuck)
- Fly.io deployment with persistent DuckDB on volume
- Namespace taxonomy — hierarchical, validated tag system
Ambient Intelligence¶
-
/radar— ambient feed digest with AI source suggestions -
/watch— add/remove/list monitored feed sources -
/tune— adjust relevance thresholds and trust weights - Feed polling architecture —
FeedPollerwith configurable intervals - Source adapters — GitHub events (REST API) and RSS/Atom
- Relevance scoring pipeline — embedding-based cosine similarity
- Interest extractor — mines entries for tags, domains, repos, expertise
Onboarding¶
-
/setupskill — MCP connectivity wizard, auto-poll configuration
Planned¶
New Skills¶
-
/whois— evidence-backed expertise map -
/investigate— deep domain context builder -
/digest— team activity summaries -
/briefing— team knowledge dashboard -
/process— batch classify + digest + stale detection pipeline -
/gh-sync— GitHub issue/PR knowledge tracking
Infrastructure¶
- Elasticsearch migration — hybrid search (BM25 + kNN + RRF)
- Access control — team/private visibility flag
Deferred¶
- LangGraph evaluation for complex skill orchestration
- CODE pipeline formalization for team workflows
- Web UI or REST API
- Multi-team support and cross-team knowledge sharing
- Re-embedding migration tooling
Technology Stack¶
| Layer | Current | Planned |
|---|---|---|
| Interface | Claude Code skills | Same |
| Transport | stdio + streamable-HTTP | Same |
| Auth | GitHub OAuth (FastMCP) | + multi-team RBAC |
| Storage | DuckDB + VSS / MotherDuck | + Elasticsearch |
| Embeddings | Jina v3 / OpenAI | + ES native |
| Language | Python 3.11+ | Same |
| Hosting | Local / Fly.io / Prefect Horizon | Same |