Skip to content

Distillery

Roadmap

norrietaylor/distillery

Roadmap¶

Complete¶

Storage & Data Model¶

Entry data model with structured metadata and type-specific extensions
DistilleryStore protocol — async storage abstraction enabling backend migration
DuckDB backend with VSS extension and HNSW index (cosine similarity)
Configurable embedding providers (Jina v3 default, OpenAI adapter)
Embedding model lock via _meta table — prevents mixed-model corruption
MCP server with 22 tools over stdio and streamable-HTTP
distillery.yaml config system with validation

Core Skills¶

/distill — session knowledge capture with duplicate detection
/recall — semantic search with provenance display
/pour — multi-pass retrieval + structured synthesis with citations
/bookmark — URL fetch, auto-summarize, store with dedup check
/minutes — meeting notes with --update (append) and --list modes
Shared CONVENTIONS.md — author/project identification, error handling patterns

Classification Pipeline¶

ClassificationEngine — LLM prompt-based type assignment with confidence scoring
DeduplicationChecker — skip/merge/link/create at configurable thresholds
/classify skill — classify by ID, batch inbox, review queue triage

Quality & Observability¶

Implicit retrieval feedback + distillery_quality tool
Stale entry detection — distillery_stale tool
Conflict detection — distillery_check_conflicts tool
Usage metrics dashboard — distillery_metrics tool

Infrastructure¶

FastMCP 2.x/3.x with @server.tool decorators
Hierarchical tag namespace with validation and distillery_tag_tree tool
12 entry types including person, project, digest, github, feed
distillery_type_schemas MCP tool for schema discovery

Team Access¶

HTTP transport — distillery-mcp --transport http
GitHub OAuth — team authentication via FastMCP GitHubProvider
Prefect Horizon deployment (MotherDuck)
Fly.io deployment with persistent DuckDB on volume
Namespace taxonomy — hierarchical, validated tag system

Ambient Intelligence¶

/radar — ambient feed digest with AI source suggestions
/watch — add/remove/list monitored feed sources
/tune — adjust relevance thresholds and trust weights
Feed polling architecture — FeedPoller with configurable intervals
Source adapters — GitHub events (REST API) and RSS/Atom
Relevance scoring pipeline — embedding-based cosine similarity
Interest extractor — mines entries for tags, domains, repos, expertise

Onboarding¶

/setup skill — MCP connectivity wizard, auto-poll configuration

Planned¶

New Skills¶

/whois — evidence-backed expertise map
/investigate — deep domain context builder
/digest — team activity summaries
/briefing — team knowledge dashboard
/process — batch classify + digest + stale detection pipeline
/gh-sync — GitHub issue/PR knowledge tracking

Infrastructure¶

Elasticsearch migration — hybrid search (BM25 + kNN + RRF)
Access control — team/private visibility flag

Deferred¶

LangGraph evaluation for complex skill orchestration
CODE pipeline formalization for team workflows
Web UI or REST API
Multi-team support and cross-team knowledge sharing
Re-embedding migration tooling

Technology Stack¶

Layer	Current	Planned
Interface	Claude Code skills	Same
Transport	stdio + streamable-HTTP	Same
Auth	GitHub OAuth (FastMCP)	+ multi-team RBAC
Storage	DuckDB + VSS / MotherDuck	+ Elasticsearch
Embeddings	Jina v3 / OpenAI	+ ES native
Language	Python 3.11+	Same
Hosting	Local / Fly.io / Prefect Horizon	Same