Fundamental data system concepts
- Cap theorem - Core theoretical limits of distributed data systems
- ¶ Local-first software - Paradigm enabling collaboration without centralized servers
- Introduction to CRDT
- Bloom filter
- Data pipeline design framework
Data storage technologies
- Hadoop distributed file system (HDFS)
- Vector database fundamentals
- Relational database Concepts
Big data processing frameworks
- ¶ MapReduce - Programming model for distributed computing
- Apache Hive ecosystem
- Introduction to Apache Pig
- Google cloud data solutions
Data transformation & analysis
- DBT - the good solution to accelerate data transformation
- Statistics in data analysis
- Data vault modelling
- Overview of BI tools
- Data analysis techniques