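Spark DataFusion Comet Vectorization: Rust Native ScanExec and Selection Vectors

Background

Apache DataFusion Comet is a vectorized accelerator for Spark that was originally open-sourced by Apple. The project combines four building blocks: the Spark plugin mechanism, Protobuf, Arrow, and DataFusion.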

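Spark plugin: built on the SparkPlugin interface, which splits into a DriverPlugin and an ExecutorPlugin that are invoked when the driver and each executor start up.
Protobuf: serializes Spark expressions and plans so they can be handed to the native engine for execution; the encoding is compact and fast.
Arrow: efficient data exchange between Spark and the native engine across JNI, based on Arrow's columnar layout and zero-copy transfer (the scan path analyzed in this article goes through the Arrow C Data Interface).
DataFusion: a vectorized execution engine written in Rust on top of the Arrow in-memory format; Spark offloads operators to this engine.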

This article focuses on the Rust native ScanExec implementation and the Selection Vectors mechanism.

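Selection Vectors

What Are Selection Vectors

A selection vector is one way for a vectorized query execution engine to represent the result of a filter. The other common representation is a bitmap:

Bitmap: one bit per row marks whether that row was selected by the filter.
Selection Vectors: a vector stores the indices of the rows that matched.

The difference is that a bitmap records every row (0 or 1), while a selection vector records only the matching rows. For details, see the paper Filter Representation in Vectorized Query Execution.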
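To make the two representations concrete, here is a minimal Rust sketch. It is not taken from Comet: the function names are hypothetical, and a Vec<bool> stands in for a packed bitmap to keep the example short.

```rust
/// Convert a bitmap (one flag per row, true = row passed the filter)
/// into a selection vector holding only the indices of the selected rows.
/// Assumption: a real engine would pack the flags into bits; Vec<bool> is used for brevity.
fn bitmap_to_selection_vector(bitmap: &[bool]) -> Vec<u32> {
    bitmap
        .iter()
        .enumerate()
        .filter_map(|(i, &selected)| if selected { Some(i as u32) } else { None })
        .collect()
}

/// Inverse conversion: expand a selection vector back into a bitmap covering `len` rows.
/// Panics if an index is out of range for `len`.
fn selection_vector_to_bitmap(selection: &[u32], len: usize) -> Vec<bool> {
    let mut bitmap = vec![false; len];
    for &i in selection {
        bitmap[i as usize] = true;
    }
    bitmap
}
```

For a five-row batch in which rows 0, 3, and 4 pass the filter, the bitmap form carries five flags while the selection vector carries only the three indices, so the selection vector gets cheaper as the filter gets more selective.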

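From these two filter representations, three execution strategies can be derived:

BMFull: always processes every row; the results computed for unselected rows are undefined and must be ignored. Its strength is that the loop has no branches, so it can take full advantage of vectorization (SIMD).
BMPartial: processes only the selected rows. It cannot exploit vectorization and still has to walk every position in the bitmap.
SVPartial: walks only the selected indices, but also cannot exploit vectorization.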
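The three strategies can be sketched in Rust as follows. This is an illustration under stated assumptions rather than code from Comet or from the paper: the operation (adding 1 to each value), the function names, and the Vec<bool> bitmap are all made up for the example.

```rust
/// BMFull: compute every row and ignore the filter entirely.
/// Slots for unselected rows receive values that the consumer must disregard.
/// The loop body has no branch, which is what makes it friendly to SIMD.
fn bm_full(values: &[i64], out: &mut [i64]) {
    for (o, &v) in out.iter_mut().zip(values) {
        *o = v + 1;
    }
}

/// BMPartial: visit every position, but compute only where the bitmap flag is set.
fn bm_partial(values: &[i64], bitmap: &[bool], out: &mut [i64]) {
    for ((o, &v), &selected) in out.iter_mut().zip(values).zip(bitmap) {
        if selected {
            *o = v + 1;
        }
    }
}

/// SVPartial: visit only the positions listed in the selection vector.
fn sv_partial(values: &[i64], selection: &[u32], out: &mut [i64]) {
    for &i in selection {
        let i = i as usize;
        out[i] = values[i] + 1;
    }
}
```

All three produce the same results at the selected positions. They differ only in how many positions they touch and in whether the loop body contains a branch or an indirect access.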

ScanExec Reads and the Selection Vectors Involved

values: all the raw values in a column.
selectionIndices: the indices of the selected rows.
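The relationship between values and selectionIndices can be shown with a small gather ("take") sketch. This assumes the indices are applied with gather semantics; the function name and the u32 index type are hypothetical choices for the example and are not taken from the Comet source.

```rust
/// Materialize the selected rows of one column:
/// output[k] = values[selection_indices[k]].
/// Panics if an index is out of bounds for `values`.
fn apply_selection<T: Copy>(values: &[T], selection_indices: &[u32]) -> Vec<T> {
    selection_indices
        .iter()
        .map(|&i| values[i as usize])
        .collect()
}
```

The output has one entry per selected row, in the order given by the indices, so the raw values never need to be rewritten in place.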
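- get_selection_indices: if the batch has selection vectors, fetch the selection vector of every column; otherwise return None.
  - For each column, create an empty FFI_ArrowArray and an empty FFI_ArrowSchema, and push their addresses (as i64) into two vectors, indices_array_addrs and indices_schema_addrs.
  - Call JNIEnv.new_long_array to create two Java long arrays.
  - Call JNIEnv.set_long_array_region to copy the addresses from the two vectors into those Java long arrays.
  - Call export_selection_indices on the JVM side through JNI, passing both long arrays, so that the JVM exports the selection indices into the FFI structs at those addresses.
  - Call ArrayData::from_spark to convert the memory addresses that the Spark side passed through the Arrow C Data Interface into an Arrow ArrayData object on the Rust side. This mainly relies on from_ffi, which rebuilds the Rust ArrayData structure from the raw pointers. The process is zero-copy and directly reuses the memory allocated by Spark; align_buffers() is then called so that the data can be accessed correctly and safely on the Rust side.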

```rust
fn get_selection_indices(
    env: &mut jni::JNIEnv,
    iter: &JObject,
    num_cols: usize,
) -> Result<Option<Vec<ArrayRef>>, CometError> {
    // Check if all columns have selection vectors
    let has_selection_vectors_result: jni::sys::jboolean = unsafe {
        jni_call!(env,
            comet_batch_iterator(iter).has_selection_vectors() -> jni::sys::jboolean)?
    };
    let has_selection_vectors = has_selection_vectors_result != 0;

    let selection_indices_arrays = if has_selection_vectors {
        // Allocate arrays for selection indices export (one per column)
        let mut indices_array_addrs = Vec::with_capacity(num_cols);
        let mut indices_schema_addrs = Vec::with_capacity(num_cols);

        for _ in 0..num_cols {
            let arrow_array = Rc::new(FFI_ArrowArray::empty());
            let arrow_schema = Rc::new(FFI_ArrowSchema::empty());
            indices_array_addrs.push(Rc::into_raw(arrow_array) as i64);
            indices_schema_addrs.push(Rc::into_raw(arrow_schema) as i64);
        }

        // Prepare JNI arrays for the export call
        let indices_array_obj = env.new_long_array(num_cols as jsize)?;
        let indices_schema_obj = env.new_long_array(num_cols as jsize)?;
        env.set_long_array_region(&indices_array_obj, 0, &indices_array_addrs)?;
        env.set_long_array_region(&indices_schema_obj, 0, &indices_schema_addrs)?;

        // Export selection indices from JVM
        let _exported_count: i32 = unsafe {
            jni_call!(env,
                comet_batch_iterator(iter).export_selection_indices(
                    JValueGen::Object(JObject::from(indices_array_obj).as_ref()),
                    JValueGen::Object(JObject::from(indices_schema_obj).as_ref())
                ) -> i32)?
        };

        // Convert to ArrayRef for easier handling
        let mut selection_arrays = Vec::with_capacity(num_cols);
        for i in 0..num_cols {
            let array_data =
                ArrayData::from_spark((indices_array_addrs[i], indices_schema_addrs[i]))?;
            selection_arrays.push(make_array(array_data));

            // Drop the references to the FFI arrays
            unsafe {
                Rc::from_raw(indices_array_addrs[i] as *const FFI_ArrowArray);
                Rc::from_raw(indices_schema_addrs[i] as *const FFI_ArrowSchema);
            }
        }
        Some(selection_arrays)
    } else {
        None
    };

    Ok(selection_indices_arrays)
}
```
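The address hand-off in get_selection_indices (Rc::into_raw to obtain an address, passing it across the boundary as an i64, then Rc::from_raw to reclaim it) can be isolated in std-only Rust. The sketch below is an illustration, not Comet code: FfiSlot is a hypothetical stand-in for an FFI struct such as FFI_ArrowArray, and fill_slot plays the role of the JVM side that writes into the struct.

```rust
use std::cell::Cell;
use std::rc::Rc;

/// Hypothetical stand-in for an FFI struct that the other side fills in.
#[derive(Default)]
struct FfiSlot {
    payload: Cell<i64>,
}

/// Allocate one slot per column and leak each Rc into a raw address.
/// The i64 form is what a Java long[] can carry across JNI.
fn allocate_slots(num_cols: usize) -> Vec<i64> {
    (0..num_cols)
        .map(|_| Rc::into_raw(Rc::new(FfiSlot::default())) as i64)
        .collect()
}

/// Simulates the exporting side writing through the address it was given.
/// Safety: `addr` must come from `allocate_slots` and must not have been reclaimed yet.
unsafe fn fill_slot(addr: i64, value: i64) {
    let slot = unsafe { &*(addr as *const FfiSlot) };
    slot.payload.set(value);
}

/// Rebuild the Rc from the address, read the payload, and let the Rc drop,
/// which frees the allocation.
/// Safety: each address must be reclaimed exactly once.
unsafe fn reclaim_slot(addr: i64) -> i64 {
    let slot = unsafe { Rc::from_raw(addr as *const FfiSlot) };
    slot.payload.get()
}
```

The design point is ownership: after Rc::into_raw the allocation is no longer tracked by Rust, so skipping the Rc::from_raw step would leak it and running it twice would be a double free. That is why the original loop reclaims each address exactly once after the data has been converted.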
