@MatrixVariable in Spring Boot: When and How to Use It
Explains how to use Spring Boot's @MatrixVariable annotation for embedding key-value parameters in URL path segments, with practical examples.
Rajesh P writes about building scalable, secure, and high-performance backend systems. His articles cover Spring Boot, API design and versioning, system design fundamentals, and modern GenAI concepts like rerankers, LLM limits, and latency optimization.
10 articles from this blog
Explains how to use Spring Boot's @MatrixVariable annotation for embedding key-value parameters in URL path segments, with practical examples.
Explains API versioning concepts and details the new first-class versioning support introduced in Spring Boot 4 (Spring Framework 7).
A practical guide to implementing essential API security best practices in Spring Boot, including HTTPS, JWT authentication, authorization, and rate limiting.
A technical deep dive into how AI rerankers work, explaining their scoring mechanisms, model architectures, and implementation trade-offs.
Explains how rerankers improve search and AI results by reordering retrieved documents for better precision and relevance.
Explains LLM API token limits (TPM) and strategies for managing concurrent requests to avoid rate limiting in production applications.
Explains why P95 and P99 latency metrics are crucial for understanding real user experience, not just average response times.
Explains Little's Law from queuing theory and how it applies to system performance, showing why latency increases cause concurrency to balloon under load.
Part 2 of a guide on using Docker Compose to enhance the reliability and portability of AI agents, focusing on Dockerfile and compose.yaml.
A tutorial on using Docker Compose to create reproducible, containerized runtime environments for AI agents, focusing on a weather query example.