Performance: Avoid Coercing Indices To Doubles

April 2, 2018 in R

x[idxs + 1] or x[idxs + 1L]? That is the question. Assume that we have a vector

random values, e.g. > n <- 100000 > x <- rnorm(n) and that we wish to calculate the

first-order differences

where

. In R, we can calculate this using the following vectorized form: > idxs <- seq_len(n - 1) > y <- x[idxs + 1] - x[idxs] We can certainly do better if we turn to native code, but is there a more efficient way to implement this using plain R code?

Pitfall: Did You Really Mean to Use matrix(nrow, ncol)?

June 17, 2014 in R

Are you a good R citizen and preallocates your matrices? If you are allocating a numeric matrix in one of the following two ways, then you are doing it the wrong way! x <- matrix(nrow = 500, ncol = 100) or x <- matrix(NA, nrow = 500, ncol = 100) Why? Because it is counter productive. And why is that? In the above, x becomes a logical matrix, and not a numeric matrix as intended.

Performance: Avoid Coercing Indices To Doubles

Pitfall: Did You Really Mean to Use matrix(nrow, ncol)?

Henrik Bengtsson