Replicating and running the Compound Risk Monitor

This repository compiles the data sources, ingestion scripts and aggregation processes used to build the Compound Risk Monitor (CRM).

Main scripts and data architecture

Compound_Risk_database.R compiles datasets for each of the indicators that feed into the Compound Risk Monitor. The script saves a separate .csv file for each source indicator in the Indicator_dataset folder (closely related indicators are saved together in a single file). It also generates eight summary datasets, one per risk component, each compiling the source indicators that feed into that component; these are saved in the Risk_sheets folder. Note: a number of errors are thrown when the Natural Hazards sheet is created, though these should not affect the results.

Global database and summary sheet.R takes inputs from each of the risk component sheets and calculates overall risk scores (one for each of the eight risk components, plus a total compound risk score for each country). It also generates a series of alternative risk calculations and reliability scores for comparison. Lastly, the script generates a summary Excel file designed to mimic the original CRM database.

Compound plots.R generates a series of comparative plots and visuals that are saved to the Plots folder.

Alongside the above, separate scripts are included in the main folder and can be used to scrape and generate source datasets (e.g. 'GDACS scrape.R', 'FAO scrape.R', 'COVID scrape data.R', 'Debt scrape.R'). Note: many of these are already included in the Compound_Risk_database.R script and do not need to be run in advance.

Outcome databases

The main databases can be found in the data/published/ folder. These include:

Global_compound_risk_database.csv is a compilation of all raw source indicators that feed into the Compound Risk Monitor. These are labelled according to each of the eight risk components (see labelling codes below).

Compound_Risk_Flags_Sheet.csv is a dataset of all summarised country-level compound risk scores. The database also includes all normalised indicators that are used to generate the total risk scores.

Compound_Risk_Monitor.xlsx is an Excel file that presents all summarised country-level compound risk scores. It is designed to mimic the style of the original Excel CRM, and features separate tabs for each of the eight risk components, alongside reliability scores and alternative risk calculations.

reliabilitysheet.csv presents, for each country, the proportion of missing values among the indicators used to calculate its risk scores. It should be read as a measure of how reliable the overall scores are.
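
The idea behind a missing-value proportion can be illustrated with a minimal sketch. This is not the repository's own calculation, which is not shown here; the data frame, its column names and the missing_share variable are all hypothetical, and the sketch assumes one row per country and one column per indicator.

```r
# Toy data (hypothetical): three countries and three indicator columns.
indicators <- data.frame(
  Country   = c("AAA", "BBB", "CCC"),
  F_example = c(1, NA, 3),
  C_example = c(NA, NA, 2),
  D_example = c(5, 6, 7)
)

# Every column except the country identifier is treated as an indicator.
indicator_cols <- setdiff(names(indicators), "Country")

# Share of indicator values that are missing in each row (country).
missing_share <- rowMeans(is.na(indicators[indicator_cols]))

reliability <- data.frame(
  Country       = indicators$Country,
  missing_share = unname(missing_share)
)
print(reliability)
```

A higher share means that more of a country's score rests on absent data, so the score should be treated with more caution.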

In addition, separate csv files are generated for each of the risk components. Each file includes the source indicators, as well as the normalised scores used to calculate the various risk scores.

Indicator labelling and indexing

All source indicators used in the Compound Risk Monitor are labelled according to the eight separate risk categories. A short tag is placed at the start of the variable label, with the indicator description immediately after it. For example, F_ is assigned to all food security indicators, so the FEWSNET score is labelled F_Fewsnet_score.

Labels are classified as follows:

C_ -> Conflict.
D_ -> Debt.
FR_ -> Fragility and Institutions.
H_ -> Health / COVID Response Capacity.
F_ -> Food Security.
M_ -> Macro-economic vulnerability.
NH_ -> Natural Hazards.
S_ -> Socioeconomic vulnerability.
RELIABILITY_ -> Reliability scores.
AV_ -> Average scores.
SQ_ -> Geometric average scores.
TOTAL_ -> Total risk scores.
EMERGING_ -> Emerging risk scores.
EXISTING_ -> Existing risk scores.
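
Because every label starts with one of these tags, a column name can be mapped back to its category by reading the text before the first underscore. The sketch below shows one way to do this; the label_prefixes table and the classify_indicator function are hypothetical helpers written for illustration and are not part of the repository's scripts.

```r
# Hypothetical lookup table built from the label list above.
label_prefixes <- c(
  C           = "Conflict",
  D           = "Debt",
  FR          = "Fragility and Institutions",
  H           = "Health / COVID Response Capacity",
  F           = "Food Security",
  M           = "Macro-economic vulnerability",
  NH          = "Natural Hazards",
  S           = "Socioeconomic vulnerability",
  RELIABILITY = "Reliability scores",
  AV          = "Average scores",
  SQ          = "Geometric average scores",
  TOTAL       = "Total risk scores",
  EMERGING    = "Emerging risk scores",
  EXISTING    = "Existing risk scores"
)

# Return the category for each variable name, or NA if the tag is unknown.
classify_indicator <- function(variable_names) {
  # The tag is everything before the first underscore.
  prefix <- sub("_.*$", "", variable_names)
  unname(label_prefixes[prefix])
}

print(classify_indicator(c("F_Fewsnet_score", "FR_example", "Country")))
```

Splitting on the first underscore keeps F_ and FR_ distinct, which a simple starts-with check on "F" would not.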

Sequence in generating CRM outputs

To replicate the databases and plots used in the Compound Risk Monitor, start by running R/Compound_Risk_database.R. This generates all necessary indicator datasets and risk sheets. Then run R/Global database and summary sheet.R to calculate component and overall risk scores for each country. If you would like to read intermediate indicator datasets and risk sheets locally, use the --local flag (i.e. Rscript "R/Global database and summary sheet.R" --local; the quotes are needed because the file name contains spaces). Lastly, run R/Compound plots.R to generate summary plots and comparison graphs.
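
The run order above can be sketched as a small driver script. This is an illustrative dry run rather than a tool shipped with the repository: build_command is a hypothetical helper, the script paths are assumptions that should be adjusted to match the spelling and case of the files in your checkout, and the working directory is assumed to be the repository root.

```r
# Script paths in run order (assumed; adjust to match your checkout).
scripts <- c(
  database = "R/Compound_Risk_database.R",
  summary  = "R/Global database and summary sheet.R",
  plots    = "R/Compound plots.R"
)

# Build one Rscript command line. shQuote() protects spaces in file names.
build_command <- function(script, local = FALSE) {
  args <- shQuote(script)
  if (local) args <- c(args, "--local")
  paste(c("Rscript", args), collapse = " ")
}

commands <- c(
  build_command(scripts[["database"]]),
  build_command(scripts[["summary"]], local = TRUE),
  build_command(scripts[["plots"]])
)

# Dry run: print the commands in order instead of executing them.
cat(commands, sep = "\n")

# To execute for real, stopping at the first failure (uncomment to use):
# for (cmd in commands) stopifnot(system(cmd) == 0)
```

Printing the commands first makes it easy to check the quoting before anything is run, since two of the three file names contain spaces.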

Details on risk calculation and aggregation steps

Full details on the normalisation procedures, the steps taken to calculate component risk scores and the generation of total risk scores can be found in the Indicator aggregation file in the Risk_sheets folder.
