PinnedChengzhi ZhaoinTowards Data Science10 Fantastic Classic Books For Data EngineeringThe Books Make You Success For Data EngineersDec 6, 20222Dec 6, 20222
PinnedChengzhi ZhaoinTowards Data ScienceHere Is What I Learned Using Apache Airflow over 6 YearsA journey with Apache Airflow from experiment to production hassle-freeJan 9, 20232Jan 9, 20232
PinnedChengzhi ZhaoinTowards Data ScienceDeep Dive into Handling Apache Spark Data SkewThe Ultimate Guide To Handle Data Skew In Distributed ComputeJan 3, 2023Jan 3, 2023
PinnedChengzhi ZhaoinTowards Data ScienceAirflow Schedule Interval 101The airflow schedule interval could be a challenging concept to comprehend, even for developers work on Airflow for a while find difficult…Apr 15, 20207Apr 15, 20207
Chengzhi ZhaoinTowards Data ScienceThe Foundation of Data ValidationDiscussing the basic principles and methodology of data validationApr 30Apr 30
Chengzhi ZhaoinILLUMINATION5 Lessons I Learned From a Totaled Car AccidentA life-changing accident changed my lifeNov 15, 2023Nov 15, 2023
Chengzhi ZhaoinData Engineering SpaceBidding War on Housing? Let’s Use R For Exploratory Data AnalysisExploratory Data Analysis 101 in RAug 25, 2023Aug 25, 2023
Chengzhi ZhaoinData Engineering SpaceVisualizing Data with ggridges: Techniques to Eliminate Density Plot Overlaps in ggplot2Data Exploratory: Avoiding Overlapping Density Plots in R Using ggridgesAug 7, 2023Aug 7, 2023
Chengzhi ZhaoinTowards Data ScienceUnlocking the Secrets of Slowly Changing Dimension (SCD): A Comprehensive View of 8 TypesDeep Dive Guide for When and How to Use 8 Types of SCDJul 17, 2023Jul 17, 2023
Chengzhi ZhaoinData Engineer ThingsHow I Built a Tool to Visualize Expense In Sankey DiagramHow to Create a Sankey Diagram to Track Your Personal Finances in RJun 23, 2023Jun 23, 2023