Pull from CrossRef
Pull from OpenAlex, but also map to Carnegie classification (will be fuzzy match)
Using Krantz data, relate the software, and complexity/size of the repository, to the characteristics collected earlier.
docker run -it --rm -v $(pwd):/project -w /project rocker/verse /bin/bash