5 February 2019
The Data Lake is a data-centered architecture featuring a repository capable of storing vast quantities of data in various formats. Data from enterprise systems, data bases, web server logs, social media, and third-party data is ingested into the Data Lake in a secure and governed manner. Data is cleansed, conformed, integrated and modeled into “refined and for purpose zones” for exploratory and analytical consumption. Metadata consisting of business and technical metadata is captured including lineage in the data catalog for search and discovery. Security policies, including entitlements, are also applied.