← All projects

Agentic chat

Agentic chat is an AI workspace that combines document search, memory, tools, and research flows so it can answer with better context and lower repeat cost.

Overview

Agentic chat routes requests through different execution paths based on intent: simple queries use cached responses, document questions trigger RAG, complex tasks use LangGraph research flows, and tool calls go through Google Workspace integration.

How It Was Built

The main technical choices behind the product, from system design to the parts that make it work day to day.

  • Implemented intent classification to route requests: cached responses for repeated queries, RAG for document-specific questions, LangGraph agents for multi-step research, and direct tool calls for Google Workspace actions.
  • Built document ingestion pipeline with PDF/Word parsing, semantic chunking (512 tokens with overlap), OpenAI embeddings, PostgreSQL pgvector storage, and cross-encoder reranking for relevance.

Impact

  • Semantic caching reduced API costs by 40% because similar requests stopped repeating the same expensive work.
  • Answers improved because the product chooses the right context for each request instead of pushing everything through one path.

Highlights

  • Semantic caching reduced API costs by 40% by avoiding redundant embeddings and model calls.
  • Routing layer achieves sub-100ms intent classification with 92% accuracy on path selection.

Tech Stack

Next.jsTypeScriptOpenAIMem0PostgreSQLLangGraphRAGAgentic ResearchAgents

More Projects

Additional work across AI products, developer tooling, and full-stack systems.

Browse all →