How to split files from the command line and integrate bash and R scripts (CC252)
The split function is a useful command line tool for splitting a file by size, number of lines, or into a specified number of files. In this episode, Pat uses split to subdivide a big problem that can’t easily be processed in R. He then shows how to integrate an executable R script into a bash script to make easy work of a very large file. The overall goal of this project is to highlight reproducible research practices using a number of tools. The specific output from this project will be a map-based visual that shows the level of drought across the globe.
Code
You can browse the state of the repository at the