The tutorial you need to maximize your use of data frames in R (CC277)

April 25, 2024 • PD Schloss • 1 min read

What is your preferred method for building a data frame in R? Do you know how its performance compares with other methods, and how that changes with the size of the data frame? In this tutorial, Pat compares 18 methods of building data frames, including similar structures from the tibble, data.table, and Matrix packages. He uses the microbenchmark package to evaluate their speed for vectors of different sizes. You’ll likely be surprised by the results! This episode is part of an ongoing effort to develop an R package that implements the naive Bayesian classifier.
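To give a feel for this kind of comparison, here is a minimal sketch that times three base R ways of combining two vectors at a few sizes. It is not the code from the episode: the helper name `bench_build`, the three methods, the vector sizes, and the number of repetitions are all assumptions chosen for illustration, and the episode covers many more methods.

```r
library(microbenchmark)

# Hypothetical helper (not from the episode): time three ways of
# combining two length-n vectors into a rectangular structure.
bench_build <- function(n, times = 50) {
  x <- seq_len(n)
  y <- rnorm(n)

  microbenchmark(
    data_frame    = data.frame(x = x, y = y),
    as_data_frame = as.data.frame(list(x = x, y = y)),
    cbind_matrix  = cbind(x = x, y = y),  # a matrix, not a data frame
    times = times
  )
}

# Illustrative sizes; integer literals keep the names readable.
sizes <- c(10L, 1000L, 100000L)
results <- lapply(sizes, bench_build)
names(results) <- sizes

# Show the median time per method at each size.
for (n in names(results)) {
  cat("n =", n, "\n")
  print(summary(results[[n]])[, c("expr", "median")])
}
```

Timings depend on the machine and R version, so the useful thing to look at is how the ranking of the methods shifts as `n` grows rather than the absolute numbers.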

Code

You can browse the state of the repository at the