LocalGPT is an open-source, MIT-licensed project that lets users query and converse with their own documents using local LLMs, with no data transmitted to external servers. It uses a hybrid search engine combining semantic similarity, keyword matching, and late-chunking for precision. A smart router automatically selects between RAG and direct LLM answering per query. It supports GPU, CPU, HPU, and Apple MPS platforms via Ollama-backed models. Key features: - 100% on-device processing: no data leaves the machine - Hybrid search combining semantic similarity and keyword matching - Smart router that selects RAG vs. direct LLM answering per query - Supports GPU, CPU, HPU, and Apple MPS (M-series) hardware - Multiple open-source model support via Ollama - Pure-Python core with minimal dependencies
Free and open-source (MIT license)