PinnedChengzhi ZhaoinTowards Data Science10 Fantastic Classic Books For Data EngineeringThe Books Make You Success For Data EngineersDec 6, 20222Dec 6, 20222
PinnedChengzhi ZhaoinTowards Data ScienceHere Is What I Learned Using Apache Airflow over 6 YearsA journey with Apache Airflow from experiment to production hassle-freeJan 9, 20232Jan 9, 20232
PinnedChengzhi ZhaoinTowards Data ScienceDeep Dive into Handling Apache Spark Data SkewThe Ultimate Guide To Handle Data Skew In Distributed ComputeJan 3, 2023Jan 3, 2023
PinnedChengzhi ZhaoinTowards Data ScienceAirflow Schedule Interval 101The airflow schedule interval could be a challenging concept to comprehend, even for developers work on Airflow for a while find difficult…Apr 15, 20207Apr 15, 20207
Chengzhi ZhaoinLevel Up CodingHow to build a web crawler with MWAA (AWS Airflow) with CDKA comprehensive guide to MWAA with CDKOct 8Oct 8
Chengzhi ZhaoinTowards Data ScienceThe Foundation of Data ValidationDiscussing the basic principles and methodology of data validationApr 30Apr 30
Chengzhi ZhaoinILLUMINATION5 Lessons I Learned From a Totaled Car AccidentA life-changing accident changed my lifeNov 15, 2023Nov 15, 2023
Chengzhi ZhaoinData Engineering SpaceBidding War on Housing? Let’s Use R For Exploratory Data AnalysisExploratory Data Analysis 101 in RAug 25, 2023Aug 25, 2023
Chengzhi ZhaoinData Engineering SpaceVisualizing Data with ggridges: Techniques to Eliminate Density Plot Overlaps in ggplot2Data Exploratory: Avoiding Overlapping Density Plots in R Using ggridgesAug 7, 2023Aug 7, 2023
Chengzhi ZhaoinTowards Data ScienceUnlocking the Secrets of Slowly Changing Dimension (SCD): A Comprehensive View of 8 TypesDeep Dive Guide for When and How to Use 8 Types of SCDJul 17, 2023Jul 17, 2023