International development organizations, researchers, and policymakers require a clear, comprehensive view of global development trends to make informed decisions and allocate resources effectively. Raw data from sources like the UN (HDI.csv
) and World Bank (WorldBank.xlsx
) is often complex, spread across multiple files with different structures, and contains significant data quality issues, making it difficult to perform reliable time-series analysis or derive clear insights.
This project’s objective was to execute a full data analysis workflow: ingesting and cleaning these complex source files, building a unified and validated dataset, and creating a suite of interactive dashboards to explore global development patterns.
The project followed a rigorous data processing and visualization workflow, pivoting between tools (from spreadsheets to a programmatic Python environment) to overcome significant data quality challenges and produce a reliable analytical tool.
HDI.csv
file, including incorrectly scaled numbers (e.g., an HDI value of 825
instead of 0.825
) and mixed data types within columns.HDI.csv
file, which was in a wide format with hundreds of columns, was transformed into a long, tidy format using the pandas.melt
function. This made the data suitable for time-series analysis.pd.to_numeric
with error coercion to handle missing values gracefully.WorldBank.xlsx
data using a left join
on country code and year, creating a single, unified master dataset.master_hdi_data_clean.csv
. This materialized dataset serves as a stable and efficient data source for the visualization phase, decoupling the data preparation from the analysis.This project successfully transformed multiple complex and error-prone raw data files into a single, validated, analysis-ready dataset. The resulting three-part Looker Studio dashboard provides a powerful tool for policymakers, researchers, and analysts to explore global development trends, compare national performance, and investigate the complex interplay between economic, health, and education indicators. The dashboard empowers users to move from a high-level global understanding to a granular, country-specific analysis seamlessly.
➡️ Open Full Report in Looker Studio
🔗 Home | GitHub |