Have you ever ever been confused by these phrases — Database, Information Warehouse, Information Lake or Delta Lake? Are they the identical factor or if not how are they totally different? On this quick piece of article I’ll attempt to dive into this subject and discover the variations and similarities amongst these essential ideas in information engineering.
Database:
A database is a scientific assortment of knowledge or data that’s saved electronically. So long as your utility must retailer information, you’ll finally want a database. Relying on the character of your information and the way they relate to one another, chances are you’ll select between a large number of the next frequent database households.
- Relational databases: Oracle, MySQL, Microsoft SQL Server, and PostgreSQL
- Doc databases: MongoDB and CouchDB
- Key-value databases: Redis and DynamoDB
- Huge-column shops: Cassandra and HBase
- Graph databases: Neo4j and Amazon Neptune
DataWarehouse:
Think about a financial institution which has a number of department places throughout the nation, every department runs a separate database system to retailer information of their clients and every day transactions. How ought to the HQ department analyze the info from these a number of branches?