About event

Single-vector RAG works well for basic retrieval scenarios but quickly reaches its limits when high-precision retrieval or search across complex documents, tables, charts, and visual content is required.

Join us for a practical SPD Talk live stream where we will explore modern Information Retrieval approaches based on Multi-Vector models and the Late Interaction architecture. Through real-world examples, we will demonstrate how ColBERT and ColPali work, how they differ from traditional embedding-based solutions, and how they can be used to build more accurate AI systems.

What we’ll cover:

  • Compare Single-Vector and Multi-Vector retrieval approaches;
  • Explore the Late Interaction mechanism and its advantages;
  • Build token-level retrieval using ColBERT;
  • Learn how to search documents as images with ColPali, without relying on OCR;
  • Discuss production-ready optimization techniques, including MUVERA, quantization, token pooling, and multi-stage retrieval;
  • Examine the trade-offs between retrieval quality, latency, and infrastructure costs.

    At the end of the session, participants will receive a hands-on assignment and a Google Colab notebook containing preconfigured code to experiment with multi-vector retrieval on a benchmark dataset. You’ll be able to build your own retrieval pipeline and explore how different configurations impact retrieval quality.

    For whom:

    This session is tailored for AI/ML & MLOps Engineers, Tech Leads, and Applied Scientists building advanced RAG or Information Retrieval systems. Join us if you want to move beyond traditional single-vector embeddings and explore production-ready multi-vector architectures (ColBERT, ColPali) at scale.

Speaker

Alexander Sokolov:Senior Machine Learning Operations Engineer at SPD Technology

Alexander Sokolov

Senior Machine Learning Operations Engineer at SPD Technology

Alexander specializes in the design and implementation of AI/ML solutions, Information Retrieval systems, LLM platforms, and Retrieval-Augmented Generation (RAG). His work focuses on building production-ready AI systems, optimizing retrieval pipelines, and implementing modern approaches for knowledge retrieval and document understanding in AI applications.

How to join?

  1. Register
  2. Make a charitable donation of 300 UAH or more to the KOLO Foundation.
  3. Check your email after registration — we will send you a confirmation email along with a link to add the event to your calendar.

    Register, donate, and let’s connect!

Partners