Better Programming

Advice for programmers.

Follow publication

Member-only story

Building a Data Warehouse on Amazon Redshift

Ben Rogojan
Better Programming
Published in
6 min readDec 16, 2019

--

Photo by Jezael Melgoza on Unsplash

As an organization grows, its data storage, monitoring and analysis requirements also exponentially increase. Traditional data warehouse don’t always easily handle massive amounts of growth. This caused a need for alternative solutions, starting from the mid 2000s. One such solution is Amazon Redshift from Amazon Web Services.

What is Amazon Redshift?

Amazon Redshift is a cloud-based data warehouse designed for large scale data sets. It was created to take on a lot of the issues previous data warehouses had and drastically improve on them. It does this with specific design decisions such as being columnar focused, having the ability to scale horizontally across multiple nodes, and massively parallel processing. Each of these specific design decisions has allowed AWS to develop a data storage system that operates very differently from your traditional RDBMS.

High-Level Architecture

Larger organizations bring in high-level architecture— which is obviously complex. How does Redshift handle it?

Column-oriented database

As opposed to traditional database management systems, Redshift stores data by column. By partitioning…

--

--

Ben Rogojan
Ben Rogojan

Written by Ben Rogojan

#Data #Engineer, Strategy Development Consultant and All Around Data Guy #deeplearning #dataengineering #datascience #tech https://linktr.ee/SeattleDataGuy

Write a response