Data lake / Data warehouse (store the data) – This is the third step of the data pipeline

Data warehouse

A data warehouse is a centralized repository that stores large amounts of data accumulated from a wide range of sources. It is used to produce the key insights that guide management decisions.
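To make the definition concrete, here is a minimal sketch of the idea: rows from several sources are loaded into one central table, and a single query then answers a management-style question across all of them. Everything in it is an assumption for illustration: the in-memory SQLite database only stands in for a real warehouse, and the names WEB_ORDERS, STORE_ORDERS, build_warehouse, revenue_by_region and the sales table are hypothetical.

```python
import sqlite3

# Hypothetical source systems (assumed sample data). In practice these would
# be separate databases, files, or APIs feeding the warehouse.
WEB_ORDERS = [("2024-01-05", "north", 120.0), ("2024-01-06", "south", 80.0)]
STORE_ORDERS = [("2024-01-05", "north", 200.0), ("2024-01-07", "south", 50.0)]


def build_warehouse(sources):
    """Load rows from every source into one central table.

    SQLite in memory is only a stand-in for a real data warehouse.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE sales (source TEXT, order_date TEXT, region TEXT, amount REAL)"
    )
    for source_name, rows in sources.items():
        # Tag each row with the source it came from before storing it.
        conn.executemany(
            "INSERT INTO sales VALUES (?, ?, ?, ?)",
            [(source_name, *row) for row in rows],
        )
    conn.commit()
    return conn


def revenue_by_region(conn):
    """Aggregate across all sources, as a management report might."""
    cursor = conn.execute(
        "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
    )
    return cursor.fetchall()


warehouse = build_warehouse({"web": WEB_ORDERS, "store": STORE_ORDERS})
print(revenue_by_region(warehouse))  # [('north', 320.0), ('south', 130.0)]
```

The point of the sketch is that the report query does not need to know how many sources exist. Once the data sits in one place, a new source only changes the loading step.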

 

Top Data Warehouse providers:

  • Amazon Redshift
  • Google BigQuery
  • IBM Db2 Warehouse
  • Azure Synapse Analytics
  • Oracle Autonomous Data Warehouse
  • SAP Data Warehouse Cloud
  • Snowflake