Standard
Specifications, protocols, and file formats that define interoperability.
8 nodesS3 API
StandardThe HTTP-based API for object storage operations — PUT, GET, DELETE, LIST, multipart upload. The de-facto standard for object storage interoperability...
Apache Parquet
StandardA columnar file format specification designed for efficient analytical queries. Stores data by column, enabling predicate pushdown, projection pruning...
Apache Arrow
StandardA cross-language in-memory columnar data format specification with libraries for zero-copy reads, IPC, and efficient analytics.
Iceberg Table Spec
StandardThe specification defining how a logical table is represented as metadata files, manifest lists, manifests, and data files on object storage. Provides...
Delta Lake Protocol
StandardThe specification for ACID transaction logs over Parquet files on object storage. Defines how writes, deletes, and schema changes are recorded in a JS...
Apache Hudi Spec
StandardThe specification for managing incremental data processing on object storage — record-level upserts, deletes, change logs, and timeline-based metadata...
ORC
StandardOptimized Row Columnar file format specification — a columnar format with built-in indexing, compression, and predicate pushdown support, originally d...
Apache Avro
StandardA row-based data serialization format with rich schema definition and built-in schema evolution support. Schemas are stored with the data.