Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dataset candidate: world innovation data #9

Open
mtennekes opened this issue Sep 9, 2020 · 4 comments
Open

Dataset candidate: world innovation data #9

mtennekes opened this issue Sep 9, 2020 · 4 comments

Comments

@mtennekes
Copy link
Member

mtennekes commented Sep 9, 2020

I came across another potentially interesting dataset, the world innovation index. It is an index based 7 pillars, from which the first 5 are related to input (what does the government in order to increase innovation), and the latter 2 to output (what innovation does the society create).

There is a yearly time series from 2013 to 2019:

Rplot

Rplot02

Also interesting is the difference between 2019 and 2013:

Rplot01

As you can see, there are quite some countries without data. Let me know if this could be useful for the book. If so, I will clean up the script and make it reproducible.

@mtennekes mtennekes changed the title Dataset candidate Dataset candidate: world innovation data Sep 9, 2020
@Nowosad
Copy link
Member

Nowosad commented Sep 11, 2020

I like this dataset. I agree that having this number of missing values could be seen as an issue, but, on the other hand, it is often the case when working with real data...

@mtennekes
Copy link
Member Author

mtennekes commented Nov 3, 2020

Done! See https://github.com/r-tmap/tmap-data/blob/master/R/02-prepare-world-innovation.R

This data is a 3d array: country iso3 code x pillar x year. Which can be cast into a vector data stars object.

I've used wm (used to create world_all) as shape to join the data with, with a better coverage than tmap's World; still a few missing values, but less. Just discovered the countrycode package. Highly recommended!

Currently, tmap only supports visualization of a slice of vector data cubes (see https://github.com/mtennekes/tmap/issues/458).
Ideally, it should be possible to do tm_facets(by = c("year', "pillar")).

Where shall we store these datasets? In the book repo, via a data-package, or ...?

@Nowosad
Copy link
Member

Nowosad commented Nov 7, 2020

Awesome @mtennekes !
Regarding the question... I have some ideas - we can discuss them during our next meeting.

@Nowosad
Copy link
Member

Nowosad commented Nov 26, 2020

We can decide it later.
For now - we should keep the data in the book repo. In the future, the data can either go to the tmap repo or some new one (e.g., tmapdata.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants