Data Store

Data Lake

A data lake is a storage repository that holds a large amount of data in its native, raw format. Data lake stores are optimized for scaling to terabytes and petabytes of data. The data typically comes from multiple heterogeneous sources, and may be structured, semi-structured, or unstructured. The idea with a data lake is to store everything in its original, untransformed state. This approach differs from a traditional data warehouse, which transforms and processes the data at the time of ingestion.

Typical uses for a data lake include data exploration, data analytics, and machine learning.

Data lake stores are often used in event streaming or IoT scenarios, because they can persist large amounts of relational and nonrelational data without transformation or schema definition.

Data warehouse | ODS | Data mart

OLAP

RDBMS:

Oracle
Sql Server
Mysql
Postgre sql

Data Governance:
Data governance is a data management concept concerning the capability that enables an organization to ensure that high data quality exists throughout the complete lifecycle of the data. The key focus areas of data governance include availability, usability, consistency[1], data integrity and data security and includes establishing processes to ensure effective data management throughout the enterprise

Comments