Java Cup
Inside Java

News and views from members of the Java team at Oracle

Caching for Agentic Java Systems: Internal, Distributed, and Semantic

Caching is a first-class architectural concern in agentic systems. This talk breaks down how Java applications can layer internal, distributed, and semantic caches. We'll explore in-process caching with Caffeine for ultra-low-latency access, distributed caching with Redisson and Valkey for shared cache and semantic caching using Vector Similarity Search to reduce latency and cost while scaling LLM access.

Make sure to check the JavaOne 2026 playlist.