An AI advisor for designing Azure Kubernetes clusters. It blends official Well-Architected best practices and Microsoft docs with guidance personalized to your requirements. I aimed to digitize the version of me that only Microsoft's enterprise clients had access to.

Microsoft's official guidance is thorough, technically correct, and comprehensive by design. It has to be: it's written for enterprise clients in regulated, compliance-heavy industries. But comprehensive isn't the same as applicable. Many teams work with resource constraints and limited budgets, and need the parts that fit their situation, not the full catalog.
Closing that gap was my job. As a customer engineer, I tailored those best practices to where each customer actually was, against the constraints they were working with. That kind of help has usually come with an enterprise relationship. Plenty of teams never get it.
A chatbot over those same docs doesn't close the gap so much as widen it, answering faster but with the same generic, unpersonalized guidance. I wanted to build the tailoring in, not hand people a quicker version of advice that was never written for their reality.
The app is a digital version of the way I worked with my customers. It takes your requirements and decisions, grounds its guidance in curated Microsoft docs, and scores your design against the Well-Architected Framework (WAF), whether you answer in a form or in conversation.
RAG pipeline, retrieval API, and chat UI. Full data-flow diagrams are in the repo.
[n] references, so nothing is taken on faith.The piece I cared most about is how the knowledge is modeled. Every requirement and decision is structured markdown with YAML frontmatter, and that one source of truth drives the form UI, the retrieval tags, and the framework the LLM reasons over. As an architect I think in relationships, not database rows, so keeping domain knowledge as structured content, separate from the application code, let me work the way I always have.
A small monorepo: a Nuxt 4 frontend and a Python FastAPI retrieval service over Postgres with pgvector.
This is a three-week prototype: the WAF scoring weights were set by judgment and tuned until they read as directionally right, not derived from research.
