ITWissen.info - Tech know how online

data lake

A data lake is used to store large and very large amounts of data, with the data first being stored in raw format in a repository. Data lakes are suitable for Big Data analysis. Only when the data is used in an application is it converted into the appropriate data format

. Just like data warehouses, large amounts of data can be stored in data lakes. While data warehouses store data hierarchically in files and folders

, data lakes work with an extremely flat storage architecture. Structured and unstructured, coded and uncoded, unformatted and non-validated data can be stored in a data lake. Thedata structures are undefined until they are needed.

In the data lake, the data is stored unprocessed, uniquely labeled and tagged with metadata. Storage is therefore less expensive than in data warehouses, where data is stored in files and folders. During data analysis and compilation, only the tags with metadata need to be searched for relationships. Users can access the data lake via various user interfaces and compile their data.

Informationen zum Artikel
Englisch: data lake
Updated at: 18.04.2019
#Words: 152
Links:
Translations: DE