The mortality data is available from WHO website. There are four Mortality document representing four revisions of the International Classification of Diseases (ICD). ICD10 is the latest revision, which was the main data document. Total records are 2,692,114. Last updated was on November 3, 2014.
The following features were included for this final project:
Country code is in 'country_code' document. Country name was merged with mortality data. Any sub-divisions from each country were grouped together.
The ‘availability’ document shows what’s available as for each country, year and ICD. Year 2000-2010 will be chosen since the data from this time period has better integrity (see link).
From this project, only data with 'cause' of cancer were selected. According to the ICD10 online browser, cancer is categorized in Chapter II Neoplasms, Malignant neoplasms, Blocks C00–C97.
The mortality rate was calculated based on two data sources.
From ‘morticd10’, ‘Deaths1’ column contains the total death.
From the World Bank total population data, the file was filtered and reshaped so that year 2000-2010 were included as a variable.
The crude mortality rate will be calculated by dividing the number of cancer deaths observed during a given time period by the corresponding number of person years in the population.
The data were obtained from 'availability' dataset. This dataset was filtered to contain only the code icd10 data. As seen in the chart, the most available data were reported between year 2000-2010. Therefore, the year range is set to be 2000-2010 for the final project.
Note: The mortality rate is expressed in units of deaths per 100,000 individuals per year.
The graph shows the mean mortality rate from 2000-2010. Most of countries from Europe, South and North America, and Australia reported the mortality of cancer to the WHO. The rest of the world (Asia, Africa) is grey (no data present).