Architecture

Architecture

Repeatable design patterns that combine multiple technologies to solve structural problems.

8 nodes

Lakehouse Architecture

Architecture

A unified architecture combining data lake storage (files on S3) with warehouse capabilities (ACID, schema enforcement, SQL access) by using a table f...

23 connections 3 resources

Medallion Architecture

Architecture

A layered data quality pattern — Bronze (raw), Silver (cleansed), Gold (business-ready) — with each layer stored on object storage.

8 connections 3 resources

Separation of Storage and Compute

Architecture

The design pattern of keeping data in S3 while running independent, elastically scaled compute engines against it.

9 connections 3 resources

Hybrid S3 + Vector Index

Architecture

A pattern that stores raw data on S3 and maintains a vector index over embeddings that points back to S3 objects.

8 connections 3 resources

Offline Embedding Pipeline

Architecture

A batch pattern where embeddings are generated from S3-stored data on a schedule, with resulting vectors written back to object storage or a vector in...

4 connections 3 resources

Local Inference Stack

Architecture

A pattern of running ML/LLM models on local hardware against data stored in or pulled from S3, avoiding cloud-based inference APIs.

4 connections 3 resources

Write-Audit-Publish

Architecture

A data quality pattern where data lands in a raw S3 zone, undergoes validation, and is promoted to a curated zone only after passing audits.

4 connections 3 resources

Tiered Storage

Architecture

Moving data between hot, warm, and cold storage tiers based on access frequency. S3 itself offers tiering (Standard, Infrequent Access, Glacier).

4 connections 3 resources