OllamaChat

Self-Hosted AI Chat Platform

A self-hosted ChatGPT alternative that runs entirely on your own machine using Ollama. It can search your documents to answer questions, remember things across conversations, automatically switch to the right model for coding or vision tasks, use tools to search the web, and optionally speak and listen, all without sending anything to the cloud.

OllamaChat interface showing a conversation with RAG citations, memory badge, and dark-themed sidebar

Project Overview

Role: Full-Stack Engineer

Duration: Ongoing

Team: Solo Project

Year: 2026

GitHub: View Code

Blog Post: Read Article


Technologies Used

Next.js 16 · React 19 · TypeScript · Tailwind CSS v4 · Prisma v7 · libSQL · Ollama · Server-Sent Events · Chokidar · pdf-parse · Cheerio · Docker

Project Details

I built this project to truly understand how AI tools work under the hood, not just use them as a black box. What started as a simple Ollama playground grew into a fully featured local AI platform. It searches your own documents to give grounded answers with confidence scoring (RAG), remembers useful things you've told it across sessions, and detects when you're asking a coding question or sending an image so it can route to the right model. It also runs an agentic tool-use loop for web search and URL fetching, supports extended reasoning with think blocks, and offers optional voice input and output via a locally hosted speech service. Everything runs on your own hardware: no subscriptions, no data leaving your machine.
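
The streaming think-block handling mentioned above can be sketched as a small stateful parser that splits a live token stream into reasoning text and answer text. The `<think>`/`</think>` tag names are an assumption (the convention used by DeepSeek-R1-style models served through Ollama), not necessarily this project's exact format:

```typescript
// Minimal sketch of streaming think-block separation. The <think> tag
// convention is assumed, not confirmed from the project's source.
type ThinkChunk = { kind: "think" | "answer"; text: string };

class ThinkBlockParser {
  private buffer = "";
  private inThink = false;

  // Feed one streamed token; returns chunks that are safe to display.
  push(token: string): ThinkChunk[] {
    this.buffer += token;
    const out: ThinkChunk[] = [];
    for (;;) {
      const tag = this.inThink ? "</think>" : "<think>";
      const idx = this.buffer.indexOf(tag);
      if (idx === -1) {
        // Hold back enough characters to cover a partially streamed tag.
        const safe = this.buffer.length - (tag.length - 1);
        if (safe > 0) {
          out.push({
            kind: this.inThink ? "think" : "answer",
            text: this.buffer.slice(0, safe),
          });
          this.buffer = this.buffer.slice(safe);
        }
        return out;
      }
      if (idx > 0) {
        out.push({
          kind: this.inThink ? "think" : "answer",
          text: this.buffer.slice(0, idx),
        });
      }
      this.buffer = this.buffer.slice(idx + tag.length);
      this.inThink = !this.inThink;
    }
  }

  // Emit whatever remains once the stream ends.
  flush(): ThinkChunk[] {
    if (!this.buffer) return [];
    const text = this.buffer;
    this.buffer = "";
    return [{ kind: this.inThink ? "think" : "answer", text }];
  }
}
```

Buffering a partial tag at the end of each chunk is what lets the UI render reasoning and answer into separate panes without ever showing a half-received `</thi` to the user.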

Challenge

Build a self-hosted AI chat app that matches cloud tools in capability while running completely locally. The hard parts: implementing document search without a third-party vector database, building a memory system that captures useful context without flooding every message with noise, detecting coding and vision intent to route to the right model, adding an agentic tool-use loop, supporting extended model reasoning, and adding voice I/O, all in a clean, fast UI.

Solution

Document search is powered by SQLite with native vector support (libSQL), so there's no need for an external service like Pinecone. Every message goes through a multi-stage pipeline: system instructions, then memories ranked by relevance and recency, then matching document excerpts with grounding confidence scores, then conversation history, all streamed live. An agentic loop lets the model call tools like web search and URL fetching across up to five rounds before composing a final answer. Model routing auto-detects coding patterns, image attachments, and vision capability to transparently switch models mid-conversation. The memory system auto-extracts facts per turn, scores them by relevance, recency, and frequency, and supports superseding outdated memories. Voice runs through a Docker sidecar using Whisper for speech-to-text and Kokoro for text-to-speech, with intelligent sentence splitting for natural speech pacing.
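
The multi-stage pipeline described here can be sketched roughly as follows. The types, section headers, and field names are illustrative assumptions, not the project's actual schema:

```typescript
// Hypothetical sketch of the system → memory → document context → history
// prompt assembly. All names here are illustrative.
type Msg = { role: "system" | "user" | "assistant"; content: string };

interface PipelineInput {
  systemPrompt: string;
  memories: string[]; // already ranked by relevance and recency
  excerpts: {
    source: string;
    text: string;
    confidence: "high" | "medium" | "low";
  }[];
  history: Msg[]; // prior turns in this conversation
  userMessage: string;
}

function buildMessages(input: PipelineInput): Msg[] {
  const sections: string[] = [input.systemPrompt];

  if (input.memories.length > 0) {
    sections.push(
      "Relevant memories:\n" + input.memories.map(m => `- ${m}`).join("\n")
    );
  }
  if (input.excerpts.length > 0) {
    sections.push(
      "Document context (cite sources):\n" +
        input.excerpts
          .map(e => `[${e.source}] (${e.confidence} confidence)\n${e.text}`)
          .join("\n\n")
    );
  }

  return [
    { role: "system", content: sections.join("\n\n") },
    ...input.history,
    { role: "user", content: input.userMessage },
  ];
}
```

Folding memories and excerpts into a single system message keeps retrieved context out of the visible transcript, so the streamed reply can still cite sources without the chat log filling up with raw chunks.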

Chat interface showing a live conversation with streaming response and sidebar
Real-time streaming chat with conversation history, model selector, and RAG toggle
Memory page showing auto-captured facts and preferences across conversations
Memory manager with auto-captured items, search, filter, and usage tracking
Settings page showing model configuration, voice options, pipeline parameters, and watched folders
Configurable model settings, voice I/O, pipeline parameters, and file watcher paths
Clean overview of the OllamaChat interface without an active conversation
Full application overview with sidebar, model selector, and empty chat state
Knowledge base page showing uploaded documents with indexing status
Document management with drag-and-drop upload, URL ingestion, and chunk browsing

Key Features

  • Chat with any locally installed Ollama model with real-time streaming responses
  • Automatically switches to a dedicated coding model when it detects coding questions
  • Auto-routes image attachments to vision-capable models with drag-and-drop and clipboard paste support
  • Upload documents, PDFs, code files, or web URLs to a searchable knowledge base
  • Searches your documents and injects relevant excerpts into every answer, with confidence scoring and source citations
  • Remembers preferences and facts across conversations with automatic extraction, relevance ranking, and memory superseding
  • Agentic tool-use loop that can search the web and fetch URLs across multiple rounds before answering
  • Supports extended reasoning with think-block streaming for compatible models
  • Push-to-talk voice input and spoken responses via locally-hosted speech models with natural sentence pacing
  • Per-conversation toggles for RAG, memory, agent mode, and custom system prompts
  • Persistent conversation history with automatically generated titles
  • Watch a folder and automatically index new files as they are added
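
As a rough illustration of the agentic tool-use feature above, here is a minimal sketch of a bounded loop that lets the model request tools (web search, URL fetch) for up to five rounds before it must answer. The `chat` and tool interfaces are hypothetical stand-ins, not Ollama's real API shapes:

```typescript
// Sketch of a bounded agentic tool-use loop. Interfaces are assumptions.
type ToolCall = { name: string; args: Record<string, string> };
type ModelTurn = { toolCalls: ToolCall[]; content: string };

const MAX_ROUNDS = 5; // matches the five-round cap described above

async function agentLoop(
  chat: (transcript: string[]) => Promise<ModelTurn>,
  tools: Record<string, (args: Record<string, string>) => Promise<string>>,
  userMessage: string
): Promise<string> {
  const transcript = [userMessage];

  for (let round = 0; round < MAX_ROUNDS; round++) {
    const turn = await chat(transcript);
    if (turn.toolCalls.length === 0) return turn.content; // final answer

    // Run each requested tool and feed its result back to the model.
    for (const call of turn.toolCalls) {
      const tool = tools[call.name];
      const result = tool
        ? await tool(call.args)
        : `unknown tool: ${call.name}`;
      transcript.push(`[${call.name}] ${result}`);
    }
  }

  // Budget exhausted: force a final answer with no further tool use.
  const final = await chat([...transcript, "(answer now without tools)"]);
  return final.content;
}
```

The hard cap matters with local models: without it, a small model that keeps hallucinating tool calls can loop forever on your own GPU.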

Results & Impact

  • Built document search directly into SQLite using native vector embeddings with no external vector database needed

  • Implemented a multi-stage message pipeline (system prompt → memory → document context → history) with live streaming

  • Created a memory system that automatically captures, ranks, and recalls preferences and facts across conversations

  • Built smart model routing that detects coding intent, image attachments, and vision capability to switch models transparently

  • Implemented an agentic tool-use loop with web search and URL fetching across up to five rounds per turn

  • Added grounding confidence scoring (high/medium/low) on RAG answers with source citations

  • Supported extended reasoning with streaming think-block detection and buffering

  • Added optional voice I/O using locally-hosted Whisper (STT) and Kokoro (TTS) with intelligent sentence splitting

  • Supported document ingestion for Markdown, PDFs, code files, and live web URLs with language-aware chunking

  • Built per-conversation toggles for RAG, memory, agent mode, and custom system prompts
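
The relevance/recency/frequency ranking behind the memory system could look something like the sketch below. The weights, half-life, and saturation point are illustrative guesses, not the project's tuned values:

```typescript
// Hypothetical memory scoring: blend similarity to the current message
// with exponential recency decay and a saturating frequency bonus.
interface Memory {
  text: string;
  relevance: number;  // 0..1 similarity to the current message
  lastUsedAt: number; // epoch milliseconds
  useCount: number;
}

const HALF_LIFE_MS = 7 * 24 * 60 * 60 * 1000; // recency halves each week (guess)

function scoreMemory(m: Memory, now: number): number {
  const recency = Math.pow(0.5, (now - m.lastUsedAt) / HALF_LIFE_MS);
  // log1p saturates the frequency bonus around 50 uses.
  const frequency = Math.min(Math.log1p(m.useCount) / Math.log1p(50), 1);
  return 0.6 * m.relevance + 0.25 * recency + 0.15 * frequency;
}

function topMemories(memories: Memory[], now: number, k = 5): Memory[] {
  return [...memories]
    .sort((a, b) => scoreMemory(b, now) - scoreMemory(a, now))
    .slice(0, k);
}
```

Capping the injection at the top few memories is what keeps the feature from "flooding every message with noise": a barely relevant fact from a month ago scores low on all three axes and never makes the cut.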