1st Workshop on
Hybrid Transactional/Analytical Processing Systems (HTAPSys)
Co-located with ICDE 2026, Montréal, Canada
May 8, 2026 (Friday)

Photo by DAVID ILIFF. License: https://creativecommons.org/licenses/by-sa/3.0/

Invited Talk - Building Bitmap Indexes for Hybrid Workloads: A tale of UPBIT, CUBIT, and RABIT

Speaker: Manos Athanassoulis (Boston University)

Abstract: In this talk, I will describe our journey to elevate bitmap indexes from read-only indexes to update-friendly, highly concurrent, range-query friendly, secondary indexes that can be used as general-purpose secondary indexes for hybrid workloads. Our designs rely on the following principles: First, we employ a horizontal bitwise representation of updated bits, which enables efficient atomic updates without locking entire bitvectors. Second, we propose a lightweight snapshotting mechanism that allows queries to run on separate snapshots and provides a wait-free progress guarantee. Third, we consolidate updates in a latch-free manner, providing a strong progress guarantee. Fourth, we propose a new encoding scheme that extends the traditional equality encoding with group bitvectors that facilitate both short- and long-range queries without sacrificing update or space efficiency. Our evaluation when integrating with DuckDB query engine shows that our bitmap index variants (CUBIT and RABIT) can outperform state-of-the-art analytical engines by several orders of magnitude. Overall, our work breaks the assumption that bitmap indexes target only read-only queries with low cardinality, making them general-purpose contenders for hybrid workloads. For HTAP workloads with real-time updates, our designs achieve 2–11x performance improvement over the state-of-the-art approaches. We will conclude by discussing future directions about bitmap indexes and query processing with bitvectors.

Speaker Bio: Manos Athanassoulis is an Associate Professor of Computer Science at Boston University, Director and Founder of the BU Data-intensive Systems and Computing Laboratory, and co-director of the BU Massive Data Algorithms and Systems Group. He also spent a summer as a Visiting Faculty at Meta. His research is in the area of data management, focusing on building data systems that efficiently exploit modern hardware (computing units, storage, and memories), are deployed in the cloud, and can adapt to the workload both at setup time and dynamically, at runtime. Before joining Boston University, Manos was a postdoc at Harvard University. Earlier, he obtained his PhD from EPFL, Switzerland, and spent one summer at IBM Research, Watson. Manos’ work has been recognized by awards like "IEEE Computer Society Distinguished Contributor", Class of 2025, “Best of SIGMOD” in 2016, “Best of VLDB” in 2010 and 2017, “Most Reproducible Paper” at SIGMOD in 2017, and "Best Demo" for VLDB 2023. His research has been supported by multiple NSF grants including an NSF CRII and an NSF CAREER award, and industry funds including a Facebook Faculty Research Award, multiple Red Hat Research Incubation Awards and gifts from Cisco, Red Hat, and Meta. He is currently serving as ACM SIGMOD Secretary/Treasurer 2025-2029, Associate Editor for the VLDB Journal, Associate Editor for ACM SIGMOD Record, Local Arrangements Chair for VLDB 2026, and Area Chair for VLDB 2027.


Invited Talk - Hybrid Workloads Redefined by a Single DDL

Speaker: Nikos R. Katsipoulakis (Snowflake)

Abstract: Snowflake's hybrid tables represent a paradigm shift in hybrid workload management by architecturally unifying siloed transactional and analytical data within a singular platform. Hybrid tables facilitate both workload types via a declarative CREATE TABLE statement, prioritizing simplicity, operational ease (zero-configuration approach), and self-optimizing performance. This presentation will detail the fundamental modifications implemented across the Snowflake stack to seamlessly integrate and support hybrid workloads.

Speaker Bio: Nikos R. Katsipoulakis is a Principal Software Engineer at Snowflake, where he builds distributed database internals — most recently on transactions, indexing, and constraints for cloud-scale HTAP workloads. He holds a Ph.D. in Computer Science from the University of Pittsburgh, where his research on stream processing and real-time analytics was published at PVLDB, ICDE, and SIGMOD. Before Snowflake, Nikos worked on AWS Redshift at Amazon, with earlier research stints at Microsoft Research and IBM Almaden. He is a co-inventor on several patents in distributed query processing and transactional processing.


Invited Talk - Enabling Fast and Correct Approximate Query Processing in HTAP Systems

Speaker: Zhuoyue Zhao (University at Buffalo)

Abstract: In HTAP workloads, replacing exact query evaluation with sampling-based Approximate Query Processing (AQP) can improve freshness of results, lower latencies, and reduce resource consumption when approximation is appropriate. However, most existing AQP systems are built as standalone systems with no transaction support, and thus cannot readily support HTAP workloads. In this talk, I will discuss the challenges and our solutions for enabling AQP in HTAP systems, including concurrency-safe sampling index design and query cost reduction techniques for data with unknown statistics. We built these ideas into PostgreSQL, a traditional transactional system, which can support fast and correct concurrent transactional updates and approximate queries under snapshot isolation. In addition, I will discuss future challenges to extend our work to in-memory systems.

Speaker Bio: Zhuoyue Zhao is currently an assistant professor at University at Buffalo. He holds a PhD degree from University of Utah, where he was advised by Prof. Feifei Li and Prof. Jeff Phillips. His research interest is in database systems, specifically query processing and optimization, transaction processing, and storage and indexing. He received an NSF CAREER award in 2024, and two SIGMOD best paper awards in 2016 and 2025.


Invited Talk - The Paths to HTAP

Speaker: Ronen Grosman (Huawei Canada)

Abstract: The desire to combine multiple workloads together on the same database has been a longstanding desire for database designers. Naive systems usually start as a single database, but due to operational reasons typically grow into a set of OLTP systems feeding into OLAP systems. However if we could without performance or operational issues query a single copy of the data it would obviously be superior. However techniques such as columnar storage, scale out analytics, and de-normalization provide such significant performance benefits they cannot be ignored. Separating systems also enable cost savings through compression, and runtime efficiency, and disaggregation. They also provide significant availability benefits by removing unpredictable analytical workloads from the transactional system, and reducing the failure blast radius. In this talk we will discuss the various approaches that have historically been taken to solve these issues and build HTAP system, from automate “Zero-ETL” to more complex solutions such as columnar indexes and their advantages and disadvantages. We will then discuss emerging approaches and opportunities presented by emerging hardware and algorithms.

Speaker Bio: Ronen Grosman, Distinguished Engineer, Chief Architect Gauss Canada Database Research Lab (Huawei)
Ronen is the chief architect of the Gauss Canada research lab focusing on next generation database systems in particular around the areas of cloud native OLTP, HTAP, storage engine design and system scale out. Throughout his career Ronen has also been the architect for numerous other OLTP and HTAP focused database systems and has had over 25 filed patents.