Selecting the right Snowflake warehouse size is an ongoing process. There are three key factors to consider:
- Query Complexity & Data Volume
- Workload Concurrency
- Data Latency SLAs and Business Requirements
Dig in to each to build a consistent, data-driven sizing strategy.
Factor 1: Query Complexity and Data Volume
Workloads vary dramatically in compute intensity. Complex analytical queries with multiple joins or large aggregations on hundreds of gigabytes require more warehouse resources. Simpler BI reporting and low-volume scans perform efficiently even on XS or S sizes.
- ETL Jobs: Batch ETL often benefits from sizing up to M, L, or XL, especially with large datasets or transformation steps that risk disk spill.
- Reporting & Dashboards: Routine BI workloads with low concurrency and light queries frequently run best on XS or S warehouses, minimizing spend.
- Data Science & Ad Hoc Analytics: Unpredictable, heavy workloads may warrant temporary scaling up or use of multi-cluster virtual warehouses for short, intensive bursts.
For more on practical application, see our Snowflake Usage Optimization case studies.
Factor 2: Workload Concurrency and Multi-Cluster Warehouses
Concurrency determines how many queries or jobs are processed in parallel. Insufficient concurrency leads to queuing and missed deadlines.
- Low Concurrency: Single-cluster warehouses are sufficient where parallel queries rarely exceed 8.
- High Concurrency: Dynamic workloads, especially in BI or integrated apps, require multi-cluster warehouses. Snowflake automatically adds compute clusters as needed, maintaining performance during peaks and scaling down when idle—one of the most effective Snowflake performance tuning methods.
- Cost vs. Performance: Multi-cluster warehouses increase cost but can actually lower total spend by eliminating inefficiency from persistent queuing and leveraging Snowflake’s per-second billing.
Factor 3: Data Latency SLAs and Business Requirements
Every business has unique timing needs—for example, daily sales must be closed and reported by 6 AM, or marketing must access live campaign analytics.
- Business SLAs: Define maximum runtime or delivery time for key workflows. Set these SLAs in advance for each major workload.
- Transparent Rationale: Documented SLAs give you a clear reason and justification for warehouse sizing in both technical and financial terms.
Selecting a warehouse size Snowflake can consistently meet SLAs without overspending is the core of right-sizing.