Setting up efficient LLM applications requires more than just calling an API. In this session, Martin Visser explains how semantic caching can significantly reduce the cost and latency of Large Language Model (LLM) applications by reusing responses to semantically similar prompts rather than requiring exact matches.
Using Valkey and Redis as vector databases, Martin walks through how embeddings, similarity thresholds, and TTLs work together to cache LLM responses efficiently. Participants will learn about practical architecture decisions and configuration trade-offs, and see a real-world demo showing how semantic caching can cut LLM usage by up to 60% while improving response times from seconds to milliseconds.
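The mechanics described here — embed the incoming prompt, look for a cached entry whose vector falls within a similarity threshold, and fall back to the LLM on a miss — can be sketched in a few lines. The snippet below is a minimal illustration, not the demo code from the talk: `embed()` and `call_llm()` are hypothetical stubs, the cache is an in-memory list standing in for a Valkey/Redis vector index, and the 0.85 threshold and one-hour TTL are example values.

```python
import time
import numpy as np

SIMILARITY_THRESHOLD = 0.85  # example value; tune per workload
TTL_SECONDS = 3600           # example TTL; expired entries are ignored

# Hypothetical stubs: swap in a real embedding model and LLM client.
def embed(text: str) -> np.ndarray:
    rng = np.random.default_rng(abs(hash(text)) % (2**32))
    v = rng.standard_normal(16)
    return v / np.linalg.norm(v)  # unit-normalize for cosine similarity

def call_llm(prompt: str) -> str:
    return f"LLM answer for: {prompt}"

# In-memory stand-in for a Valkey/Redis vector index:
# each entry holds (embedding, cached response, insertion time).
cache: list[tuple[np.ndarray, str, float]] = []

def cached_completion(prompt: str) -> str:
    query_vec = embed(prompt)
    now = time.time()
    # Find the closest non-expired entry (vectors are unit-normalized,
    # so cosine similarity reduces to a dot product).
    best_score, best_response = -1.0, None
    for vec, response, created in cache:
        if now - created > TTL_SECONDS:
            continue
        score = float(np.dot(query_vec, vec))
        if score > best_score:
            best_score, best_response = score, response
    if best_response is not None and best_score >= SIMILARITY_THRESHOLD:
        return best_response  # semantic hit: skip the LLM call entirely
    # Miss: call the LLM and cache the result under the prompt's vector.
    response = call_llm(prompt)
    cache.append((query_vec, response, now))
    return response
```

In a real deployment the linear scan would be replaced by the server-side vector search that Valkey and Redis provide, which returns the nearest neighbors and their distances in a single round trip, and expiry would be handled by the server's native per-key TTL rather than client-side timestamp checks.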
- 📅 Date: January 23, 2026
- 🎤 Speaker: Martin Visser - Valkey and Redis Tech Lead @ Percona
- 🎥 Watch the Recording: https://youtu.be/7vdFUJgGOSs