| ID | Date | Time | Type | Location | Buffer | Punct. | Priority |
|---|---|---|---|---|---|---|---|
| Loading… | |||||||
Run the full three-step pipeline: (1) hybrid rule-based retrieval scoring, (2) GPT-4o-mini narrative generation grounded in KB evidence, (3) approval-gated reminder drafts for high-risk appointments.
| ID | Date | Time | Type | Location | Buffer | Priority | Punct. Score |
|---|---|---|---|---|---|---|---|
Loading appointments… | |||||||
Historical appointment records with outcomes and failure tags. Used for RAG retrieval.
| ID | Date | Type | Location | Outcome | Delay | Failure Tags | Punct. |
|---|---|---|---|---|---|---|---|
Loading KB records… | |||||||
LLM Model
GPT-4o-mini (OpenAI API) · Temperature 0.2 · Evidence-grounded · Citation-enforced
Retrieval Strategy
Hybrid BM25 + Jaccard similarity · Temporal weighting (90-day × 1.5) · Metadata pre-filtering
Embedding Model
text-embedding-3-small (OpenAI) · 1536-d vectors · ~$0.00002/1K tokens
Vector Store
Azure AI Search (hybrid BM25 + vector) · RRF merging · Enterprise-grade access control
RAG Enhancements
8 techniques: multi-query expansion, temporal decay, metadata filtering, RRF, re-ranking, evidence thresholding, CoT, self-consistency
Safety & Governance
All drafts approval-gated · Human-in-loop enforced · No auto-send · PII pseudonymised · PDPA/GDPR compliant
Performance
Rule-based p50: ~120ms · LLM narrative p50: ~1.6s · Token cost: ~$0.0003/appointment · Precision@3 = 1.00
Infrastructure
Azure App Service B1 (always-on) · Linux · Node 20 LTS · Southeast Asia (Singapore) · aria.vicparmar.com
| Method | Endpoint | Description |
|---|---|---|
| GET | /health | Health check, KB size, upcoming count |
| GET | /list-upcoming | All upcoming appointments as JSON |
| GET | /list-kb | All KB history records as JSON |
| POST | /analyze-week | Run rule-based risk scoring |
| POST | /analyze-with-llm | Rule-based + GPT-4o-mini narratives |
| POST | /draft-reminders | Generate approval-gated reminder drafts |
| POST | /appointments | Add new upcoming appointment |
| POST | /kb | Record outcome into KB (feedback loop) |
- Anthropic (2026). The Labor Market Impact of AI: An Analysis of Task-Level Exposure. anthropic.com/research/labor-market-impacts — found 14% slower hiring for workers aged 22-25 and a 33% coverage gap between benchmark and observed AI task exposure.
- Lewis, P. et al. (2020). Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. NeurIPS 2020. — foundational RAG paper.
- Wei, J. et al. (2022). Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. NeurIPS 2022.
- Robertson, S. & Zaragoza, H. (2009). The Probabilistic Relevance Framework: BM25 and Beyond. Foundations and Trends in IR.
- Wang, X. et al. (2022). Self-Consistency Improves Chain of Thought Reasoning in Language Models. ICLR 2023.
- OpenAI (2024). GPT-4o Technical Report. openai.com.