Technology

Technology

Concrete tools, systems, or platforms with version histories and maintainers.

14 nodes

AWS S3

Technology

Amazon's fully managed object storage service — the origin and reference implementation of the S3 API.

10 connections 4 resources

MinIO

Technology

An open-source, S3-compatible object storage server designed for high performance and self-hosted deployment.

7 connections 4 resources

Ceph

Technology

A distributed storage system providing object, block, and file storage in a unified platform. S3 compatibility via its RADOS Gateway (RGW).

4 connections 3 resources

Apache Ozone

Technology

A scalable, distributed object storage system in the Hadoop ecosystem with an S3-compatible interface.

4 connections 3 resources

Apache Iceberg

Technology

An open table format for large analytic datasets. Manages metadata, snapshots, and schema evolution for collections of data files (typically Parquet) ...

12 connections 4 resources

Delta Lake

Technology

An open table format and storage layer providing ACID transactions, scalable metadata, and schema enforcement on data stored in object storage. Origin...

8 connections 4 resources

Apache Hudi

Technology

A table format and data management framework optimized for incremental data processing — upserts, deletes, and change data capture — on object storage...

7 connections 4 resources

DuckDB

Technology

An in-process analytical database engine (like SQLite for analytics) that reads Parquet, Iceberg, and other formats directly from S3 without requiring...

9 connections 3 resources

Trino

Technology

A distributed SQL query engine for federated analytics across heterogeneous data sources, with deep support for S3-backed data lakes and lakehouses.

9 connections 4 resources

ClickHouse

Technology

A column-oriented DBMS designed for real-time analytical queries, with native support for reading from and writing to S3.

5 connections 4 resources

Apache Spark

Technology

A distributed compute engine for large-scale data processing — batch ETL, streaming, SQL, and machine learning — over S3-stored data.

9 connections 4 resources

LanceDB

Technology

A vector database that stores data in the Lance columnar format directly on object storage. Designed for serverless vector search without a separate i...

5 connections 4 resources

StarRocks

Technology

An MPP analytical database with native lakehouse capabilities, able to directly query S3 data in Parquet, ORC, and Iceberg formats.

5 connections 3 resources

Apache Flink

Technology

A distributed stream processing framework that processes data in real-time, with S3 as checkpoint store, state backend, and output sink.

5 connections 3 resources