Blog

Category: Explainers

Build a Hotel Voice Agent with Trieve + Vapi

Mar 4, 2025

A step-by-step guide to creating a knowledge base from a hotel website and connecting it to a Vapi voice agent.

How we Built 300μs Typo Detection for 1.3M Words in Rust

Sep 9, 2024 · Dens Sumesh

We explain how we built blazingly fast spellcheck in Rust using BK-trees, Redis queues, and ClickHouse.

Cheating at Search with LLMs

May 21, 2025 · Nicholas Khami

Learn how we use massively parallel LLM inference to cheat at search. Don't leave results to chance.

Build Search and RAG for Any Website with Firecrawl and Trieve

Aug 22, 2024 · Daniel S. Griffin

Use Firecrawl and Trieve to build search and RAG (in both Python and JS).

History of HackerNews Search: From 2007 to 2024

Aug 12, 2024 · Nick Khami

The history of HackerNews (HN) search spans three generations: it started in 2007 with Disqus founder Jason Yan, followed by a series of other sites; moved to Octopart/ThriftDB-powered HNSearch in 2011; and has run on Algolia-powered search from 2014 to today.

Streaming LLM assistant completions with the OpenAI API and Rust Actix-Web

Aug 8, 2024 · Denzell Ford

A guide to how we stream LLM assistant completions in real time using Actix-Web.

Guide for Self-Hosting Trieve on a VPS

Sep 12, 2024 · Marcin Stankiewicz

Instructions for self-hosting Trieve on a VPS using docker-compose. You'll be able to set up Trieve on a Hetzner server, complete with semantic and hybrid search, SPLADE full-text search, reranker models, RAG AI chat, recommendations, and analytics.

Introducing TVI: Embedding and Reranking Infra Built for Kube

Nov 13, 2024

Unmetered In-VPC Embeddings and Rerankers at Ridiculously Low Latency