General Data Engineering
- Google Dataproc
- Google Data Fusion
- Data Pipeline Design Framework
- Statistics in Data Analysis
- Partitions on Apache Hive
- Overview of BI Tools
- Order By vs. Sort By vs. Distribute By vs. Cluster By
- MapReduce
- MapReduce Components
- Managed Table vs External Table
- Introduction to Apache Pig
- Introduction to Apache Hive
- Hive Window and Analytic Functions
- Data Vault Modelling
- DBT - The Good Solution to Accelerate Data Transformation
- Buckets on Apache Hive
- Behind a Hive Table
- Bloom Filter
- Cap Theorem
- Creating a Fully Local Search Engine on Memo
- Data Analyst In Retail Trading
- Database Design Circular
- Database Locking
- DuckDB Demo and Showcase
- Evolutionary Database Design
- Full-Text Search with PostgreSQL
- Google Data Fusion
- Google Dataproc
- Hadoop Distributed File System (HDFS)
- Hive Window and Analytic Functions
- How Discord Stores Messages - Part 1: From MongoDB to Cassandra
- How I Came Up With Our Security Standard
- Introduction to Apache Hive
- Introduction to Apache Pig
- Introduction to CRDT
- Local-First Software
- Managed Table vs External Table
- MapReduce
- MapReduce Components
- Multi-Column Index in DB
- Quick Learning Vector Database
- Redis Leaderboard
- Self-Balanced BSTs - AVL Trees
- SQL and How It Relates to Disk Reads and Writes
- SQL Practices ORM vs Plain SQL
- SQL Sargable Queries and Their Impact on Database Performance
- Statistics in Data Analysis
- Utilizing Cached Table for Binance Kline API Data Processing